dselivanov/text2vec: Modern Text Mining Framework for R

Fast and memory-friendly tools for text vectorization, topic modeling (LDA, LSA), word embeddings (GloVe), similarities. This package provides a source-agnostic streaming API, which allows researchers to perform analysis of collections of documents which are larger than available RAM. All core functions are parallelized to benefit from multicore machines.

Getting started

Package details

MaintainerDmitriy Selivanov <[email protected]>
LicenseGPL (>= 2) | file LICENSE
URL http://text2vec.org
Package repositoryView on GitHub
Installation Install the latest version of this package by entering the following in R:
dselivanov/text2vec documentation built on Sept. 23, 2018, 1:57 a.m.