spacyr: Wrapper to the 'spaCy' 'NLP' Library

# spacy tokenizer bench
library(microbenchmark)
library(quanteda)
library(spacyr)
library(dplyr)

spacy_initialize()
text <- texts(data_corpus_irishbudget2010 %>% corpus_reshape("sentences"))
microbenchmark(just_tokenize = spacy_tokenize(text),
               remove_punct = spacy_tokenize(text, remove_punct = TRUE),
               quanteda = tokens(text),
               times = 3)

quanteda/spacyr documentation built on Feb. 5, 2025, 12:59 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

quanteda/spacyr
Wrapper to the 'spaCy' 'NLP' Library

tests/misc/tokenizer_bench.R
In quanteda/spacyr: Wrapper to the 'spaCy' 'NLP' Library

R Package Documentation

Browse R Packages

We want your feedback!

quanteda/spacyr Wrapper to the 'spaCy' 'NLP' Library

tests/misc/tokenizer_bench.R In quanteda/spacyr: Wrapper to the 'spaCy' 'NLP' Library

R Package Documentation

Browse R Packages

We want your feedback!

quanteda/spacyr
Wrapper to the 'spaCy' 'NLP' Library

tests/misc/tokenizer_bench.R
In quanteda/spacyr: Wrapper to the 'spaCy' 'NLP' Library