tm.plugin.lexisnexis-package | R Documentation |
This package provides a tm Source to create corpora from articles exported from the LexisNexis content provider as HTML files.
Typical usage is to create a corpus from HTML files
exported from LexisNexis (here called myLexisNexisArticles.html
).
Setting language=NA
allows the language to be set automatically
from the information provided by Factiva:
# Import corpus source <- LexisNexisSource("myLexisNexisArticles.html") corpus <- Corpus(source, readerControl = list(language = NA)) # See how many articles were imported corpus # See the contents of the first article and its meta-data inspect(corpus[1]) meta(corpus[[1]])
Currently, only HTML files saved in English and French are supported. Please send the maintainer examples of LexisNexis files in your language if you want it to be supported.
See link{LexisNexisSource}
for more details and real examples.
Milan Bouchet-Valat <nalimilan@club.fr>
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.