corpus: Corpus


View source: R/corpus.R

Description

Build a corpus from documents or a directory of text files.

Usage

corpus(document, ..., update_lexicon = TRUE,
  update_inverse_index = TRUE)

## S3 method for class 'document'
corpus(document, ..., update_lexicon = TRUE,
  update_inverse_index = TRUE)

## S3 method for class 'documents'
corpus(document, ..., update_lexicon = TRUE,
  update_inverse_index = TRUE)

directory_corpus(directory, update_lexicon = TRUE,
  update_inverse_index = TRUE)

Arguments

document

First document, a list, or a vector of documents.

...

Additional objects inheriting from class document, used to build the corpus.

update_lexicon

Whether to update the lexicon; see update_lexicon.

update_inverse_index

Whether to update the inverse index; see update_inverse_index.

directory

Path to a directory of text files.

Examples

## Not run: 
init_textanalysis()

# build two documents from character strings
doc1 <- string_document("First document.")
doc2 <- string_document("Second document.")

# combine the documents into a corpus
corpus <- corpus(doc1, doc2)

## End(Not run)
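
The example above covers corpus() only. A minimal sketch of directory_corpus() follows. It assumes the package is installed under the name textanalysis with a working backend. The directory and file names are made up for illustration. Setting both update flags to FALSE is shown only to illustrate the arguments, not as a recommended configuration.

```r
## Not run: 
library(textanalysis)  # assumption: package name taken from the repository name

init_textanalysis()

# Create a throwaway directory holding two small text files.
# The directory and file names are hypothetical.
dir <- file.path(tempdir(), "corpus_example")
dir.create(dir, showWarnings = FALSE)
writeLines("First document.", file.path(dir, "doc1.txt"))
writeLines("Second document.", file.path(dir, "doc2.txt"))

# Build the corpus from the directory without updating the
# lexicon or the inverse index at construction time.
crps <- directory_corpus(
  dir,
  update_lexicon = FALSE,
  update_inverse_index = FALSE
)

## End(Not run)
```

The result is stored in crps rather than corpus so that the variable does not share a name with the corpus() function.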

news-r/textanalysis documentation built on Nov. 4, 2019, 9:40 p.m.