quanteda: Quantitative Analysis of Textual Data

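# In quanteda >= 3.0, textstat_collocations() is in the quanteda.textstats package
library(quanteda)

# Tokens object saved on the author's machine; adjust the path to run locally
load("/home/kohei/Documents/Brexit/Analysis/data_tokens_guardian.RData")
toks <- data_tokens_guardian[1:1000]  # benchmark on the first 1,000 documents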

# Remove stopwords and symbol tokens, leaving pads so that adjacency is preserved
toks <- tokens_remove(toks, stopwords("english"), padding = TRUE)
toks <- tokens_remove(toks, "\\p{S}", valuetype = "regex", padding = TRUE)

microbenchmark::microbenchmark(
    textstat_collocations(toks, size = 2:5, min_count = 10),
    times = 10
)

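# Compute the collocations once so the results can be inspected
out <- textstat_collocations(toks, size = 2:5, min_count = 10)

# Collocations whose nested count is under 10% of their total count
head(out[out$count_nested / out$count < 0.1, ], 20)
head(out, 20)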

quanteda/quanteda documentation built on Sept. 4, 2024, 7:56 p.m.

tests/benchmarks/benchmark_textstat_collocations.R
In quanteda/quanteda: Quantitative Analysis of Textual Data
