tm.plugin.europresse: Import Articles from 'Europresse' Using the 'tm' Text Mining Framework

Provides a 'tm' Source to create corpora from articles exported from the 'Europresse' content provider as HTML files. It is able to read both text content and meta-data information (including source, date, title, author and pages).

AuthorMilan Bouchet-Valat [aut, cre]
Date of publication2016-08-23 17:22:18
MaintainerMilan Bouchet-Valat <nalimilan@club.fr>
LicenseGPL (>= 2)
Version1.4
https://r-forge.r-project.org/projects/r-temis/

View on CRAN

Functions

eoi.EuropresseSource Man page
EuropresseSource Man page
getElem.EuropresseSource Man page
readEuropresseHTML1 Man page
readEuropresseHTML2 Man page
tm.plugin.europresse Man page
tm.plugin.europresse-package Man page

Files

tm.plugin.europresse
tm.plugin.europresse/inst
tm.plugin.europresse/inst/texts
tm.plugin.europresse/inst/texts/europresse_test1.html
tm.plugin.europresse/inst/texts/europresse_test2.html
tm.plugin.europresse/NAMESPACE
tm.plugin.europresse/NEWS
tm.plugin.europresse/R
tm.plugin.europresse/R/EuropresseSource.R tm.plugin.europresse/R/readEuropresseHTML.R
tm.plugin.europresse/MD5
tm.plugin.europresse/DESCRIPTION
tm.plugin.europresse/man
tm.plugin.europresse/man/EuropresseSource.Rd tm.plugin.europresse/man/readEuropresse.Rd tm.plugin.europresse/man/tm.plugin.europresse-package.Rd

Questions? Problems? Suggestions? or email at ian@mutexlabs.com.

Please suggest features or report bugs with the GitHub issue tracker.

All documentation is copyright its authors; we didn't write any of that.