🛳️

Titanic

Description

The Titanic dataset is a classical public dataset, which contains 1309 records about the Titanic's passengers who were victims of the most infamous shipwrecks in history on April 15, 1912.
The dataset provides the passenger data (i.e. name, age, gender, socio-economic class, etc.) in accordance with the outcome for each passenger (survived or deceased).
It is often used for practicing exploratory data analysis and building classification models to predict the survival of a particular passenger based on the features provided.

Data Location

Storage → Samples → titanic.csv
 
Dataset features
Variable
Definition
Survival (0 = No, 1 = Yes)
Passenger's name
Ticket class (1 = 1st, 2 = 2nd, 3 = 3rd)
Sex (male, female)
Age in years
Number of siblings/spouses aboard the Titanic
Number of parents/children aboard the Titanic
Ticket number
Passenger fare
Cabin number
Port of embarkation (C = Cherbourg, Q = Queenstown, S = Southampton)

Examples of usage

 
titanic