Headlines_train | R Documentation |
This data comes from Chakraborty et. al., which combines headlines from
a variety of news and clickbait sources. Some headlines contain
subject matter inappropriate for classroom use. Given the volume of headlines
containing such language (especially for clickbait == TRUE
), this filtering
might not catch all problematic headlines. User discretion is advised.
The training dataset is a random sample of approximately 80% of the observations
from the original dataset.
The testing dataset is a random sample of the remaining 20% of the observations not found in the training set.
Headlines_train
Headlines_test
A data frame with 18,360 rows and 3 variables:
a character vector
a logical vector
an integer vector
A data frame with 4,589 rows and 3 variables:
https://github.com/bhargaviparanjape/clickbait/
nrow(Headlines_train)
nrow(Headlines_test)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.