teramonagi/scdv: SCDV : Sparse Composite Document Vectors using soft clustering over distributional representations

Sparse Composite Document Vector (SCDV) is a feature vector formation technique for documents to get distributional paragraph vector representations. In SCDV, word embedding's are clustered to capture multiple contexts in which words occur. They are then chained together to form document topic-vectors that can express complex, multi-topic documents. The approach is described in more detail in the article by Dheeraj et al. (2016) <arXiv:1612.06778>.

Getting started

Package details

Maintainer
LicenseMIT + file LICENSE
Version0.1.0
URL http://github.com/teramonagi/scdv
Package repositoryView on GitHub
Installation Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("teramonagi/scdv")
teramonagi/scdv documentation built on June 1, 2019, 3:58 a.m.