textprocessingDSI-package: Efficiently clean a corpus in parallel
In avkoehl/textprocessingDSI: Clean an arbitrarily large corpus for topic modelling over many cores

Description Details Author(s) Examples

This package was designed for corpuses too large for conventional R cleaning methods. The majority of the functionality of this package is written in cpp and linked with Rcpp. The R code is for the most part a wrapper around the cpp files that perform the operations on the corpus. Given an input list of text files, this package provides the tools to clean, tokenize, and reduce the number of terms of the corpus to prepare it for topic modelling and other nlp tasks.

This section should provide a more detailed overview of how to use the package, including the most important functions.

Arthur Koehl

Maintainer: Arthur Koehl <avkoehl@ucdavis.edu>

  ## Not run: 
     ## Optional simple examples of the most important functions
     ## These can be in \dontrun{} and \donttest{} blocks.   
  
## End(Not run)

avkoehl/textprocessingDSI documentation built on June 5, 2019, 7:41 p.m.

avkoehl/textprocessingDSI index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

avkoehl/textprocessingDSI
Clean an arbitrarily large corpus for topic modelling over many cores

textprocessingDSI-package: Efficiently clean a corpus in parallel
In avkoehl/textprocessingDSI: Clean an arbitrarily large corpus for topic modelling over many cores

Description

Details

Author(s)

Examples

Related to textprocessingDSI-package in avkoehl/textprocessingDSI...

R Package Documentation

Browse R Packages

We want your feedback!

avkoehl/textprocessingDSI Clean an arbitrarily large corpus for topic modelling over many cores

textprocessingDSI-package: Efficiently clean a corpus in parallel In avkoehl/textprocessingDSI: Clean an arbitrarily large corpus for topic modelling over many cores

Description

Details

Author(s)

Examples

Related to textprocessingDSI-package in avkoehl/textprocessingDSI...

R Package Documentation

Browse R Packages

We want your feedback!

avkoehl/textprocessingDSI
Clean an arbitrarily large corpus for topic modelling over many cores

textprocessingDSI-package: Efficiently clean a corpus in parallel
In avkoehl/textprocessingDSI: Clean an arbitrarily large corpus for topic modelling over many cores