Read in a Reuters-21578 XML document.
readReut21578XML(elem, language, id)
readReut21578XMLasPlain(elem, language, id)
elem | a named list with the component content, which must hold the document to be read in.
language | a string giving the language.
id | Not used.
An XMLTextDocument for readReut21578XML, or a PlainTextDocument for readReut21578XMLasPlain, representing the text and metadata extracted from elem$content.
Emms, Martin and Luz, Saturnino (2007). Machine Learning for Natural Language Processing. European Summer School of Logic, Language and Information, course reader. http://www.homepages.ed.ac.uk/sluzfil/esslli07/mlfornlp.pdf
Lewis, David (1997). Reuters-21578 Text Categorization Collection Distribution 1.0. http://kdd.ics.uci.edu/databases/reuters21578/reuters21578.html
Luz, Saturnino. XML-encoded version of Reuters-21578. http://www.homepages.ed.ac.uk/sluzfil/esslli07/data/reuters21578-xml.tar.bz2
Reader for basic information on the reader infrastructure employed by package tm.
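These functions are normally not called directly but passed as the reader when a corpus is built. The sketch below illustrates this with the sample Reuters-21578 files shipped with tm. It assumes a recent tm release with the xml2 package installed, where the source is read in binary mode; older releases that parse with the XML package may need the default mode instead. The object names reut21578, reuters and doc are chosen for illustration only.

```r
library(tm)

# Directory holding the sample Reuters-21578 XML files shipped with tm
reut21578 <- system.file("texts", "crude", package = "tm")

# Build a corpus, letting readReut21578XMLasPlain parse each file.
# Assumption: mode = "binary" suits recent tm versions; older versions
# may need the default mode instead.
reuters <- VCorpus(DirSource(reut21578, mode = "binary"),
                   readerControl = list(reader = readReut21578XMLasPlain))

# Each element is a PlainTextDocument carrying text and metadata
doc <- reuters[[1]]
meta(doc)      # metadata extracted from the XML
content(doc)   # plain text of the article
```

Passing readReut21578XML instead would keep each document as an XMLTextDocument rather than converting it to plain text.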