tm.plugin.lexisnexis: Import Articles from 'LexisNexis' Using the 'tm' Text Mining Framework

Provides a 'tm' Source to create corpora from articles exported from the 'LexisNexis' content provider as HTML files. It reads both the text content and the metadata of each article (including source, date, title, author and pages). Note that the file format is highly unstable: there is no warranty that this package will work for your corpus, and you may have to adjust the code to adapt it to your particular format.
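
As a rough illustration of the workflow described above, the following sketch builds a corpus from the English sample export that ships with the package (inst/texts/lexisnexis_test_en.html in the file list). It assumes that both 'tm' and 'tm.plugin.lexisnexis' are installed and that the sample file parses with the installed version; given the warning about the unstable file format, your own exports may need adjustments.

```r
# Sketch only: assumes 'tm' and 'tm.plugin.lexisnexis' are installed.
library(tm)
library(tm.plugin.lexisnexis)

# Locate the sample HTML export bundled with the package
file <- system.file("texts", "lexisnexis_test_en.html",
                    package = "tm.plugin.lexisnexis")

# Wrap the file in a LexisNexisSource and build a corpus from it
corpus <- Corpus(LexisNexisSource(file))

# Look at the metadata extracted for the first article
meta(corpus[[1]])
```

To read your own export, replace `file` with the path to an HTML file downloaded from LexisNexis.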

Author: Milan Bouchet-Valat [aut, cre]
Date of publication: 2016-06-29 22:36:53
Maintainer: Milan Bouchet-Valat <nalimilan@club.fr>
License: GPL (>= 2)
Version: 1.3
URL: https://r-forge.r-project.org/projects/r-temis/

Files

tm.plugin.lexisnexis
tm.plugin.lexisnexis/inst
tm.plugin.lexisnexis/inst/texts
tm.plugin.lexisnexis/inst/texts/lexisnexis_test_fr.html
tm.plugin.lexisnexis/inst/texts/lexisnexis_test_en.html
tm.plugin.lexisnexis/tests
tm.plugin.lexisnexis/tests/import.R
tm.plugin.lexisnexis/NAMESPACE
tm.plugin.lexisnexis/NEWS
tm.plugin.lexisnexis/R
tm.plugin.lexisnexis/R/LexisNexisSource.R
tm.plugin.lexisnexis/R/startup.R
tm.plugin.lexisnexis/R/readLexisNexisHTML.R
tm.plugin.lexisnexis/MD5
tm.plugin.lexisnexis/DESCRIPTION
tm.plugin.lexisnexis/man
tm.plugin.lexisnexis/man/tm.plugin.lexisnexis-package.Rd
tm.plugin.lexisnexis/man/LexisNexisSource.Rd
tm.plugin.lexisnexis/man/readLexisNexis.Rd
