tm.plugin.factiva: Import articles from Factiva using the tm text mining framework

This package provides a tm Source to create corpora from articles exported from the Dow Jones Factiva content provider as XML or HTML files. It reads both the text content and the metadata of each article (including source, date, title, author, subject, geographical coverage, company, industry, and various provider-specific fields).
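
As a minimal sketch of the workflow described above, the following builds a corpus from the sample export listed under "Files in this package". It assumes that both tm and tm.plugin.factiva are installed and that inst/texts/reut21578-factiva.xml is available as texts/reut21578-factiva.xml in the installed package. The variable names and the language = NA reader setting are illustrative choices, not requirements.

```r
# Sketch only: assumes tm and tm.plugin.factiva are installed.
library(tm)
library(tm.plugin.factiva)

# Locate the sample Factiva export shipped with the package
# (assumed to be installed under texts/).
file <- system.file("texts", "reut21578-factiva.xml",
                    package = "tm.plugin.factiva")

# Wrap the exported file in a Factiva source and build a corpus from it.
# language = NA leaves the document language unset (an illustrative choice).
source <- FactivaSource(file)
corpus <- Corpus(source, readerControl = list(language = NA))

# Inspect the metadata read for the first article.
meta(corpus[[1]])
```

For your own data, replace `file` with the path to an XML or HTML file exported from Factiva.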

Author
Milan Bouchet-Valat [aut, cre], Grigorij Ljubownikow [ctb]
Date of publication
2014-07-05 16:46:20
Maintainer
Milan Bouchet-Valat <nalimilan@club.fr>
License
GPL (>= 2)
Version
1.5

Man pages

FactivaSource
Factiva Source
readFactiva
Read in a Factiva article in XML or HTML formats
tm.plugin.factiva-package
A plug-in for the tm text mining framework to import articles...

Files in this package

tm.plugin.factiva
tm.plugin.factiva/inst
tm.plugin.factiva/inst/texts
tm.plugin.factiva/inst/texts/factiva_test.html
tm.plugin.factiva/inst/texts/factiva_test.xml
tm.plugin.factiva/inst/texts/reut21578-factiva.xml
tm.plugin.factiva/NAMESPACE
tm.plugin.factiva/NEWS
tm.plugin.factiva/R
tm.plugin.factiva/R/readFactivaXML.R
tm.plugin.factiva/R/readFactivaHTML.R
tm.plugin.factiva/R/FactivaSource.R
tm.plugin.factiva/MD5
tm.plugin.factiva/DESCRIPTION
tm.plugin.factiva/man
tm.plugin.factiva/man/tm.plugin.factiva-package.Rd
tm.plugin.factiva/man/readFactiva.Rd
tm.plugin.factiva/man/FactivaSource.Rd