get_text: Get BBC News article
In unimi-dse/1ed5ff0d: BBC News article text analysis

Description Usage Arguments Value Note Examples

View source: R/GetText.R

Scrap article headline and body text from BBC News website (https://www.bbc.com/news/), merge them together and create a Corpus for text mining.

1	get_text(url_end)

url_end

character string, an ending part of BBC News particular atricle URL (everything after https://www.bbc.com/news/). For example, article URL is "https://www.bbc.com/news/world-us-canada-51381625". Only "world-us-canada-51381625" should be pasted

art_c - Corpus representing a collection of text documents with an article text (each article paragraph as a single document in Corpus)

Please, check that URL (url_end) exists before running the function, otherwise you will get an "Error in open.connection(x, "rb") : HTTP error 404". Please, insert URLs of the articles in English only. Only for BBC News, not BBC Sports , Travel, Worklife, etc.