Split documents in a corpus into documents of one or more paragraphs.
split_documents(corpus, chunksize, preserveMetadata = TRUE)
corpus | A
chunksize | The maximum number of paragraphs each new document should contain.
preserveMetadata | Whether to preserve the metadata of the original documents.
A Corpus object with split documents.
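To make the role of chunksize concrete, here is a minimal base R sketch of grouping paragraphs into chunks of at most chunksize paragraphs. It is not the package's implementation: the helper name chunk_paragraphs is hypothetical, and it assumes that paragraphs are separated by blank lines, which the documentation above does not state.

```r
# Hypothetical helper, not part of the package: splits one text into
# new documents of at most `chunksize` paragraphs each.
chunk_paragraphs <- function(text, chunksize) {
  # Assumption: paragraphs are separated by one or more blank lines.
  paragraphs <- strsplit(text, "\n[[:space:]]*\n")[[1]]
  # Drop empty fragments left over from leading or trailing blank lines.
  paragraphs <- paragraphs[nzchar(trimws(paragraphs))]
  # Assign consecutive paragraphs to chunk ids 1, 1, ..., 2, 2, ...
  chunk_id <- ceiling(seq_along(paragraphs) / chunksize)
  # Re-join the paragraphs of each chunk into one new document.
  unname(vapply(split(paragraphs, chunk_id), paste, character(1),
                collapse = "\n\n"))
}

doc <- paste(c("P1", "P2", "P3", "P4", "P5"), collapse = "\n\n")
chunks <- chunk_paragraphs(doc, 2)
```

With five paragraphs and a chunk size of 2, the last chunk holds the single remaining paragraph, which is what "at most" implies in the chunksize description.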
file <- system.file("texts", "reut21578-factiva.xml", package="tm.plugin.factiva")
corpus <- import_corpus(file, "factiva", language="en")
split_documents(corpus, 3)