distinct.corpus: Subset documents distinct/unique by document variables


View source: R/distinct.R

Description

Select only documents that are unique/distinct with respect to values of their document variables.

Usage

## S3 method for class 'corpus'
distinct(.data, ..., .keep_all = FALSE)

Arguments

.data

a corpus object with document variables

...

comma-separated list of unquoted document variables, or expressions involving document variables

.keep_all

If TRUE, keep all document variables in .data. If a combination of ... is not distinct, the values from the first document with that combination are kept.
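
The first-row rule described for .keep_all can be illustrated on a plain data frame. The sketch below uses base R only and is an analogy, not the package's implementation: `docvars_df` is a hypothetical stand-in for the document variables of a five-document corpus, and `duplicated()` is swapped in for `distinct()`.

```r
# Hypothetical stand-in for the document variables of a five-document corpus
docvars_df <- data.frame(
  President = c("Washington", "Washington", "Adams", "Jefferson", "Jefferson"),
  Year = c(1789, 1793, 1797, 1801, 1805),
  stringsAsFactors = FALSE
)

# TRUE for the first occurrence of each President value, FALSE for repeats
first_rows <- !duplicated(docvars_df$President)

# Analogue of .keep_all = FALSE: only the variable used for the comparison
distinct_only <- docvars_df[first_rows, "President", drop = FALSE]

# Analogue of .keep_all = TRUE: every variable, taken from the first row
# of each President value
distinct_keep_all <- docvars_df[first_rows, , drop = FALSE]

print(distinct_only)
print(distinct_keep_all)
```

In this sketch `distinct_keep_all` retains the Year of the first row for each President (1789, 1797, 1801), which mirrors the documented behaviour of keeping the first row of values when a combination is not distinct.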

Examples

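library("quanteda")       # provides data_corpus_inaugural
library("dplyr")          # provides %>%
library("quanteda.tidy")  # provides distinct() for corpus objects
distinct(data_corpus_inaugural[1:5], President) %>%
  summary()
# .keep_all = TRUE retains the other document variables
distinct(data_corpus_inaugural[1:5], President, .keep_all = TRUE) %>%
  summary()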

quanteda/quanteda.tidy documentation built on April 11, 2021, 3:44 p.m.