quanteda.dictionaries    R Documentation

An R package consisting of dictionaries for text analysis and associated utilities

Description

Provides text analysis dictionaries and additional functionality for their use in text analysis frameworks, especially quanteda.

Details

The package contains several dictionaries that capture positive and negative sentiment, as well as other types of emotion. It also contains lists of words that are spelled differently in British and American English. These lists can be used to normalize the spelling in a text corpus, so that the same word is not counted twice under two different spellings.
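As a rough sketch of the spelling adjustment described above, the following example maps British spellings to American ones with quanteda's tokens_lookup(). The object uk2us_demo and its three word pairs are hypothetical and only stand in for the package's own word lists; the example sentence is likewise made up.

```r
library(quanteda)

# Hypothetical mini word list: each key is the American spelling,
# each value the British spelling that should be mapped onto it.
uk2us_demo <- dictionary(list(
  analyze = "analyse",
  color   = "colour",
  harbor  = "harbour"
))

# Illustrative text with British spellings.
toks_uk <- tokens("We analyse the colour of the harbour.", remove_punct = TRUE)

# exclusive = FALSE keeps tokens that are not in the dictionary;
# capkeys = FALSE keeps the replacement keys in lower case.
toks_us <- tokens_lookup(toks_uk, uk2us_demo,
                         exclusive = FALSE, capkeys = FALSE)

print(toks_us)
```

After the lookup, both spellings of a word are represented by a single token type, so later counts are not split across the two variants.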

The second main purpose of quanteda.dictionaries is to provide the function liwcalike, which analyses text corpora in a manner similar to LIWC. LIWC (Linguistic Inquiry and Word Count) is standalone software distributed at https://www.liwc.app. liwcalike takes a quanteda corpus as input and makes it easy to apply dictionaries to that corpus. It returns a data.frame consisting of percentages and other quantities, as well as the count of all dictionary categories in each document.
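A minimal sketch of a liwcalike call is shown below. The two documents and the two-category dictionary demo_dict are assumptions made up for illustration; in practice one of the dictionaries shipped with the package would usually be supplied instead.

```r
library(quanteda)
library(quanteda.dictionaries)

# Two made-up documents, wrapped in a quanteda corpus.
corp <- corpus(c(
  d1 = "The weather was great and the food was wonderful.",
  d2 = "It was a terrible, awful day."
))

# Hypothetical two-category dictionary, for illustration only.
demo_dict <- dictionary(list(
  positive = c("great", "wonderful"),
  negative = c("terrible", "awful")
))

# Apply the dictionary to every document in the corpus.
out <- liwcalike(corp, dictionary = demo_dict)

# One row per document; the dictionary categories appear as columns
# next to the general summary quantities.
print(out[, c("docname", "positive", "negative")])
```

The remaining columns of the result hold the other summary quantities; their exact set may differ between package versions.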

Source code and additional information

https://github.com/kbenoit/quanteda.dictionaries


kbenoit/quanteda.dictionaries documentation built on Feb. 9, 2023, 3:28 p.m.