lcm_dfm2: Document term matrix of LCM abstracts

lcm_dfm2R Documentation

Document term matrix of LCM abstracts

Description

Each term is a unigram, i.e. a word, except for common phrases. Stop words and rare words were removed.

Usage

lcm_dfm2

Format

A 'quanteda' 'dfm' object

Source

See the 'lcm_text_mining.Rmd' file in the 'data-raw' in the [GitHub repo of this package](https://www.github.com/pachterlab/museumst) for how this matrix was generated, including what the phrases and stopwords are and what counts as a rare term.


pachterlab/museumst documentation built on April 20, 2024, 11:26 p.m.