filterCount: Subcorpus With Count Filter

Description Usage Arguments Value Examples

View source: R/filterCount.R

Description

Generates a subcorpus by restricting it to texts containing a specific number of words.

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
filterCount(...)

## Default S3 method:
filterCount(text, count = 1L, out = c("text", "bin", "count"), ...)

## S3 method for class 'textmeta'
filterCount(
  object,
  count = 1L,
  out = c("text", "bin", "count"),
  filtermeta = TRUE,
  ...
)

Arguments

...

Not used.

text

Not necassary if object is specified, else should be object$text: list of article texts

count

An integer marking how many words must at least be found in the text.

out

Type of output: text filtered corpus, bin logical vector for all texts, count the counts.

object

A textmeta object

filtermeta

Logical: Should the meta component be filtered, too?

Value

textmeta object if object is specified, else only the filtered text. If a textmeta object is returned its meta data are filtered to those texts which appear in the corpus by default (filtermeta).

Examples

1
2
3
4
5
6
7
8
9
texts <- list(A="Give a Man a Fish, and You Feed Him for a Day.
Teach a Man To Fish, and You Feed Him for a Lifetime",
B="So Long, and Thanks for All the Fish",
C="A very able manipulative mathematician, Fisher enjoys a real mastery
in evaluating complicated multiple integrals.")

filterCount(text=texts, count=10L)
filterCount(text=texts, count=10L, out="bin")
filterCount(text=texts, count=10L, out="count")

tosca documentation built on Oct. 28, 2021, 5:07 p.m.