paolofantini/Supreme: Make it easier to apply LDA topic models to a corpus of Italian Supreme Court decisions

This package provides tools that make it easier to build a corpus of documents from the original XML files. It also provides a set of functions for reducing the dimensionality (number of columns) of the resulting document-term matrix, in both the supervised and unsupervised cases, and implements a new strategy for selecting the number of topics based on logistic classification. This strategy can be considered an alternative to the general perplexity criterion.
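As a rough illustration of what reducing the number of columns of a document-term matrix means in the unsupervised case, here is a minimal sketch in base R. It does not use or reproduce any function from Supreme: the toy documents, the `min_docs` threshold, and the rule of keeping only terms that occur in enough documents are all assumptions made for this example.

```r
# Toy corpus: three short made-up "documents" (hypothetical data)
docs <- c(
  d1 = "court appeal decision appeal",
  d2 = "court decision sentence",
  d3 = "court tax"
)

# Split each document into tokens and collect the vocabulary
tokens <- strsplit(docs, "\\s+")
vocab  <- sort(unique(unlist(tokens)))

# Document-term matrix: one row per document, one column per term,
# each cell holding the count of that term in that document
dtm <- t(vapply(
  tokens,
  function(tk) tabulate(match(tk, vocab), nbins = length(vocab)),
  integer(length(vocab))
))
colnames(dtm) <- vocab

# Unsupervised reduction (assumed rule): keep only the terms that
# appear in at least `min_docs` documents
min_docs    <- 2
doc_freq    <- colSums(dtm > 0)
dtm_reduced <- dtm[, doc_freq >= min_docs, drop = FALSE]

print(dim(dtm))          # dimensions before the reduction
print(dim(dtm_reduced))  # dimensions after the reduction
```

A supervised variant would additionally use the class label of each document when deciding which columns to keep. That is not shown here.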


Package details

Author: Paolo Fantini <paolo.fantini@uniroma1.com>
Maintainer: Paolo Fantini <paolo.fantini@uniroma1.com>
License: GPL-2
Version: 0.1.0
Package repository: GitHub (paolofantini/Supreme)
Installation

Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("paolofantini/Supreme")
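
Once the installation has finished, a quick check along the following lines can confirm that the package loads. This is only a sketch: it uses standard R calls (`requireNamespace`, `library`, `ls`, `help`) and assumes nothing about which functions Supreme exports.

```r
# Check whether Supreme is installed before trying to attach it
supreme_installed <- requireNamespace("Supreme", quietly = TRUE)

if (supreme_installed) {
  library(Supreme)
  # List the objects the package exports
  print(ls("package:Supreme"))
  # Show the package help index
  help(package = "Supreme")
} else {
  message("Supreme is not installed; run the installation commands first.")
}
```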
paolofantini/Supreme documentation built on May 24, 2019, 6:14 p.m.