WebCorpus: WebCorpus constructor function.

View source: R/corpus.R

Description

WebCorpus adds further methods and metadata to Corpus and thereby constructs a derived class of Corpus. Most importantly, WebCorpus calls $postFUN on the generated WebCorpus, which retrieves the main content for most implemented WebSources. This in turn enables efficient retrieval of new feed items (see corpus.update). All additional WebCorpus fields, such as $source, $readerControl and $postFUN, are added to tm$meta.
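The pattern described above (build a corpus, keep the source and post-processing function as metadata, then apply the post-processing function to the result) can be sketched in base R. This is a toy illustration only: make_toy_source, toy_web_corpus and the ToyCorpus/ToyWebCorpus class names are hypothetical and are not part of tm or tm.plugin.webmining, and the placeholder string stands in for a real web retrieval.

```r
# Toy illustration only: every name below is hypothetical and none of it is
# the actual tm.plugin.webmining implementation.

# A stand-in "source": a list holding items, a reader and a postFUN field.
make_toy_source <- function(items) {
  list(
    items = items,
    reader = function(item) list(content = item, meta = list()),
    # postFUN fills in content that the first pass left empty
    postFUN = function(corpus) {
      for (i in seq_along(corpus$docs)) {
        if (!nzchar(corpus$docs[[i]]$content)) {
          corpus$docs[[i]]$content <- "<main content fetched later>"
        }
      }
      corpus
    }
  )
}

# Constructor whose defaults refer to fields of x, mirroring the shape of the
# WebCorpus signature (assumption: simplified, no retryEmpty handling).
toy_web_corpus <- function(x,
                           readerControl = list(reader = x$reader,
                                                language = "en"),
                           postFUN = x$postFUN) {
  docs <- lapply(x$items, readerControl$reader)
  corpus <- list(
    docs = docs,
    # extra fields are stored as metadata alongside the documents
    meta = list(source = x, readerControl = readerControl, postFUN = postFUN)
  )
  # derived class first, base class second
  class(corpus) <- c("ToyWebCorpus", "ToyCorpus")
  # post-processing step applied to the freshly built corpus
  if (is.function(postFUN)) corpus <- postFUN(corpus)
  corpus
}

src <- make_toy_source(c("first item", ""))
corpus <- toy_web_corpus(src)
```

Because the source and postFUN are kept in the metadata, a later update step could in principle re-read the source and run the same post-processing on new items only; how corpus.update actually does this is not shown here.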

Usage

WebCorpus(x, readerControl = list(reader = reader(x), language = "en"),
  postFUN = x$postFUN, retryEmpty = TRUE, ...)

Arguments

x

object of type Source, see also Corpus

readerControl

specifies the reader to be used for the Source; defaults to list(reader = reader(x), language = "en")

postFUN

function to be applied to the WebCorpus after web retrieval has completed; defaults to x$postFUN

retryEmpty

specifies whether retrieval should be repeated for content elements that are empty; defaults to TRUE

...

additional parameters passed to the Corpus function (in practice, to the Corpus reader)


mannau/tm.plugin.webmining documentation built on May 21, 2019, 11:24 a.m.