readEuropresse: Read in a Europresse article in the HTML format

Description Usage Arguments Details Value Author(s) See Also

Description

Read in an article exported from Europresse in the HTML format.

Usage

1
2
  readEuropresseHTML1(elem, language, id)
  readEuropresseHTML2(elem, language, id)

Arguments

elem

A list with the named element content which must hold the document to be read in.

language

A character vector giving the text's language. If set to NA, the language will automatically be set to the value reported in the document (which is usually correct).

id

A character vector representing a unique identification string for the returned text document.

Details

readEuropresseHTML1 reads documents in the old format, while readEuropresseHTML2 reads documents in the new one. EuropresseSource automatically chooses the correct reader based on the structure of the file.

Value

A PlainTextDocument with the contents of the article and the available meta-data set.

Author(s)

Milan Bouchet-Valat

See Also

getReaders to list available reader functions.


tm.plugin.europresse documentation built on May 29, 2017, 11:01 a.m.