bbc_articles_full: Full BBC Articles data

Description Usage Format Details Source

Description

Full BBC Articles data

Usage

1

Format

A tibble, with 927 observations of separate documents and their contents. This results in two columns.

words

The words from a given article

document

The 'document' (article) ID

Details

A collection of business and politics BBC news articles. Each row represents each article (document), with a document ID and a string of the text content with stop words removed. This is a 'dirty' version of the bbc_articles dataset, where we now have a string of text for each observation, as opposed to a single word.

Source


mangoTraining documentation built on April 28, 2021, 9:07 a.m.