README.md
In koheiw/LSS: Semi-Supervised Algorithm for Document Scaling

LSS: Semi-supervised algorithm for document scaling

In quantitative text analysis, the cost of training supervised machine learning models tend to be very high when the corpus is large. Latent Semantic Scaling (LSS) is a semi-supervised document scaling technique that I developed to perform large scale analysis at low cost. Taking user-provided seed words as weak supervision, it estimates polarity of words in the corpus by latent semantic analysis and locates documents on a unidimensional scale (e.g. sentiment).

From CRAN:

install.packages("LSX")

From Github:

devtools::install_github("koheiw/LSX")

Please visit the package website to understand the usage of the functions:

Please read the following papers for the algorithm and methodology, and its application to non-English texts (Japanese and Hebrew):

Watanabe, Kohei. 2020. “Latent Semantic Scaling: A Semisupervised Text Analysis Technique for New Domains and Languages”, Communication Methods and Measures.
Watanabe, Kohei, Segev, Elad, & Tago, Atsushi. (2022). “Discursive diversion: Manipulation of nuclear threats by the conservative leaders in Japan and Israel”, International Communication Gazette.

LSS has been used for research in various fields of social science.

Nakamura, Kentaro. 2022 Balancing Opportunities and Incentives: How Rising China’s Mediated Public Diplomacy Changes Under Crisis, International Journal of Communication.
Zollinger, Delia. 2022 Cleavage Identities in Voters’ Own Words: Harnessing Open-Ended Survey Responses, American Journal of Political Science.
Brändle, Verena K., and Olga Eisele. 2022. “A Thin Line: Governmental Border Communication in Times of European Crises” Journal of Common Market Studies.
Umansky, Natalia. 2022. “Who gets a say in this? Speaking security on social media”. New Media & Society.
Rauh, Christian, 2022. “Supranational emergency politics? What executives’ public crisis communication may tell us”, Journal of European Public Policy.
Trubowitz, Peter and Watanabe, Kohei. 2021. “The Geopolitical Threat Index: A Text-Based Computational Approach to Identifying Foreign Threats”, International Studies Quarterly.
Vydra, Simon and Kantorowicz, Jaroslaw. 2020. “Tracing Policy-relevant Information in Social Media: The Case of Twitter before and during the COVID-19 Crisis”. Statistics, Politics and Policy.
Watanabe, Kohei. 2017. “Measuring News Bias: Russia’s Official News Agency ITAR-TASS’s Coverage of the Ukraine Crisis”, European Journal Communication.

More publications are available on Google Scholar.

koheiw/LSS documentation built on March 9, 2024, 4:41 a.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

Tweet to @rdrrHQ

GitHub issue tracker

ian@mutexlabs.com