extreme_docs: extreme_docs


View source: R/CA.R

Description

Print the documents which have the most extreme coordinates on an axis of a correspondence analysis.

Usage

extreme_docs(corpus, ca, axis, ndocs = 10, nterms = 25)

Arguments

corpus

A Corpus object.

ca

A CA object.

axis

The CA axis to consider.

ndocs

The number of (most extreme) documents to print.

nterms

The number of terms to highlight in documents.

Details

Occurrences of the nterms most extreme terms are highlighted. If stemming or other transformations have been applied to original words using combine_terms, all original words which have been transformed to the specified terms are highlighted.
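The selection and highlighting described above can be illustrated with a small base R sketch. It is not the package's implementation: the toy coordinates, the word-to-term mapping, the helper names (most_extreme, expand_terms, highlight), the use of the absolute coordinate as the measure of "extreme", and the asterisk markers are all assumptions made for illustration.

```r
# Toy term coordinates on two axes; in a real analysis these would come
# from a fitted correspondence analysis (assumed values, not real output).
term_coord <- matrix(c( 0.5, -0.2,
                       -0.1,  0.9,
                       -2.3,  0.4,
                        1.8, -1.1),
                     ncol = 2, byrow = TRUE,
                     dimnames = list(c("oil", "said", "opec", "price"),
                                     c("Dim 1", "Dim 2")))

# Hypothetical helper: names of the n rows farthest from the origin on one
# axis. Ranking by absolute coordinate is an assumption of this sketch.
most_extreme <- function(coord, axis, n) {
  ord <- order(abs(coord[, axis]), decreasing = TRUE)
  rownames(coord)[head(ord, n)]
}

# Hypothetical mapping from original words to the term they were combined
# into (for example by stemming).
word_to_term <- c(prices = "price", priced = "price", price = "price",
                  oil = "oil", opec = "opec")

# Expand the selected terms back to every original word that maps to them.
expand_terms <- function(terms, mapping) {
  names(mapping)[mapping %in% terms]
}

# Wrap each whole-word occurrence in asterisks, ignoring case.
highlight <- function(text, words) {
  pattern <- paste0("\\b(", paste(words, collapse = "|"), ")\\b")
  gsub(pattern, "*\\1*", text, ignore.case = TRUE, perl = TRUE)
}

top_terms <- most_extreme(term_coord, axis = 1, n = 2)
words <- expand_terms(top_terms, word_to_term)
marked <- highlight("Oil prices rose as OPEC met", words)
print(marked)
```

Because the term "price" is selected, the original word "prices" is marked even though it is not itself a term in the combined matrix.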

Value

Corpus object (invisibly).
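An invisible return value is not auto-printed at the console but can still be captured by assignment. The base R sketch below shows this behaviour with a hypothetical stand-in function, print_and_return, which is not part of the package.

```r
# Hypothetical stand-in for a function that prints something and then
# returns its argument invisibly.
print_and_return <- function(x) {
  cat("Printing", length(x), "items\n")
  invisible(x)
}

docs <- c("doc1", "doc2")

# Called on its own: the message is printed, the return value is not echoed.
print_and_return(docs)

# Assignment captures the returned object for later use.
kept <- print_and_return(docs)

# withVisible() reports whether a value would have been auto-printed.
res <- withVisible(print_and_return(docs))
```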

Examples

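# Import the sample Factiva corpus shipped with tm.plugin.factiva
file <- system.file("texts", "reut21578-factiva.xml", package="tm.plugin.factiva")
corpus <- import_corpus(file, "factiva", language="en")
# Build the document-term matrix and run the correspondence analysis
dtm <- build_dtm(corpus)
ca <- corpus_ca(corpus, dtm)
# Print the most extreme documents on the first axis
extreme_docs(corpus, ca, 1)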

# Also works when terms have been combined
dict <- dictionary(dtm)
dtm2 <- combine_terms(dtm, dict)
ca2 <- corpus_ca(corpus, dtm2)
extreme_docs(corpus, ca2, 1)

R.temis documentation built on May 13, 2021, 1:08 a.m.