R/data.R

#' Income dataset from Kaggle.
#'
#' A dataset containing incomes (<=50K vs. >50K) and predictors.
#' A common becnhmark dataset for tabular models.
#'
#' @format A data frame with 32,561 rows and 15 variables:
#' \describe{
#'   \item{income}{income as a binary variable}
#'   ...
#' }
#' @source \url{https://www.kaggle.com/lodetomasi1995/income-classification}
"income"

#' Customer churn dataset from Kaggle.
#'
#' Each row represents a customer, each column contains customer’s attributes.
#'
#' @format A data frame with 7,043 rows and 21 variables:
#' \describe{
#'   \item{income}{income as a binary variable}
#'   ...
#' }
#' @source \url{https://www.kaggle.com/blastchar/telco-customer-churn}
"blastchar"
cmcmaster1/torchtabular documentation built on Dec. 19, 2021, 5:17 p.m.