titanic: Titanic

titanicR Documentation

Titanic

Description

Classification data to predict the fate of passengers on the ocean liner "Titanic". Contains 10 features and 1309 observations. Target column is "Survived".

Pre-processing

  • All column names have been changed to snake_case.

  • training and test set have been joined. Observations of the test set have a missing value in the target column "survived".

  • Column '"survived"' has been re-encoded to a factor with levels '"yes"' and '"no"'.

  • Id column has been removed.

  • Passenger class "pclass" has been converted to an ordered factor.

  • Features "sex" and "embarked" have been converted to factors.

  • Empty strings in "cabin" and "embarked" have been encoded as missing values.

Source

titanic and https://www.kaggle.com/c/titanic/data

Examples

data("titanic", package = "mlr3data")
str(titanic)

mlr3data documentation built on July 9, 2023, 7:29 p.m.