tm.plugin.europresse: Import Articles from 'Europresse' Using the 'tm' Text Mining Framework

Share:

Provides a 'tm' Source to create corpora from articles exported from the 'Europresse' content provider as HTML files. It is able to read both text content and meta-data information (including source, date, title, author and pages).

Author
Milan Bouchet-Valat [aut, cre]
Date of publication
2016-08-23 17:22:18
Maintainer
Milan Bouchet-Valat <nalimilan@club.fr>
License
GPL (>= 2)
Version
1.4
URLs

View on CRAN

Man pages

EuropresseSource
Europresse Source
readEuropresse
Read in a Europresse article in the HTML format
tm.plugin.europresse-package
A plug-in for the tm text mining framework to import articles...

Files in this package

tm.plugin.europresse
tm.plugin.europresse/inst
tm.plugin.europresse/inst/texts
tm.plugin.europresse/inst/texts/europresse_test1.html
tm.plugin.europresse/inst/texts/europresse_test2.html
tm.plugin.europresse/NAMESPACE
tm.plugin.europresse/NEWS
tm.plugin.europresse/R
tm.plugin.europresse/R/EuropresseSource.R
tm.plugin.europresse/R/readEuropresseHTML.R
tm.plugin.europresse/MD5
tm.plugin.europresse/DESCRIPTION
tm.plugin.europresse/man
tm.plugin.europresse/man/EuropresseSource.Rd
tm.plugin.europresse/man/readEuropresse.Rd
tm.plugin.europresse/man/tm.plugin.europresse-package.Rd