tm.plugin.lexisnexis: Import Articles from 'LexisNexis' Using the 'tm' Text Mining Framework

Share:

Provides a 'tm' Source to create corpora from articles exported from the 'LexisNexis' content provider as HTML files. It is able to read both text content and meta-data information (including source, date, title, author and pages). Note that the file format is highly unstable: there is no warranty that this package will work for your corpus, and you may have to adjust the code to adapt it to your particular format.

Author
Milan Bouchet-Valat [aut, cre]
Date of publication
2016-06-29 22:36:53
Maintainer
Milan Bouchet-Valat <nalimilan@club.fr>
License
GPL (>= 2)
Version
1.3
URLs

View on CRAN

Man pages

LexisNexisSource
LexisNexis Source
readLexisNexis
Read in a LexisNexis article in the HTML format
tm.plugin.lexisnexis-package
A plug-in for the tm text mining framework to import articles...

Files in this package

tm.plugin.lexisnexis
tm.plugin.lexisnexis/inst
tm.plugin.lexisnexis/inst/texts
tm.plugin.lexisnexis/inst/texts/lexisnexis_test_fr.html
tm.plugin.lexisnexis/inst/texts/lexisnexis_test_en.html
tm.plugin.lexisnexis/tests
tm.plugin.lexisnexis/tests/import.R
tm.plugin.lexisnexis/NAMESPACE
tm.plugin.lexisnexis/NEWS
tm.plugin.lexisnexis/R
tm.plugin.lexisnexis/R/LexisNexisSource.R
tm.plugin.lexisnexis/R/startup.R
tm.plugin.lexisnexis/R/readLexisNexisHTML.R
tm.plugin.lexisnexis/MD5
tm.plugin.lexisnexis/DESCRIPTION
tm.plugin.lexisnexis/man
tm.plugin.lexisnexis/man/tm.plugin.lexisnexis-package.Rd
tm.plugin.lexisnexis/man/LexisNexisSource.Rd
tm.plugin.lexisnexis/man/readLexisNexis.Rd