readReut21578XML: Read In a Reuters-21578 XML Document

readReut21578XML    R Documentation

Read In a Reuters-21578 XML Document

Description

Read in a Reuters-21578 XML document.

Usage

readReut21578XML(elem, language, id)
readReut21578XMLasPlain(elem, language, id)

Arguments

elem

a named list with a component named content, which must hold the document to be read in.

language

a string giving the language.

id

Not used.

Value

An XMLTextDocument for readReut21578XML, or a PlainTextDocument for readReut21578XMLasPlain, representing the text and metadata extracted from elem$content.

References

Lewis, David (1997). Reuters-21578 Text Categorization Collection Distribution. UCI Machine Learning Repository. doi:10.24432/C52G6M.

See Also

Reader for basic information on the reader infrastructure employed by package tm.
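
Examples

The readers described above are normally not called directly but handed to a corpus constructor through readerControl. The following is a minimal sketch of that pattern, assuming that packages tm and xml2 are installed and that the sample Reuters-21578 files shipped under texts/crude in package tm are available. The variable names reut_dir and reuters are illustrative only.

```r
library(tm)

# Assumption: the sample Reuters-21578 XML files bundled with tm
# live in the "texts/crude" directory of the installed package.
reut_dir <- system.file("texts", "crude", package = "tm")

# Build a corpus, letting readReut21578XMLasPlain convert each
# XML file into a PlainTextDocument.
reuters <- VCorpus(DirSource(reut_dir, mode = "binary"),
                   readerControl = list(reader = readReut21578XMLasPlain))

# Inspect the first document: its class and its extracted metadata.
doc <- reuters[[1]]
class(doc)
meta(doc)
```

Passing readReut21578XML instead would keep each document as an XMLTextDocument, as described under Value.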


tm documentation built on Sept. 11, 2024, 6:47 p.m.