lsa: Latent Semantic Analysis

The basic idea of latent semantic analysis (LSA) is, that text do have a higher order (=latent semantic) structure which, however, is obscured by word usage (e.g. through the use of synonyms or polysemy). By using conceptual indices that are derived statistically via a truncated singular value decomposition (a two-mode factor analysis) over a given document-term matrix, this variability problem can be overcome.

AuthorFridolin Wild
Date of publication2015-05-08 19:58:09
MaintainerFridolin Wild <f.wild@open.ac.uk>
LicenseGPL (>= 2)
Version0.73.1

View on CRAN

Functions

alnumx Man page
associate Man page
as.textmatrix Man page
corpus_essays Man page
corpus_scores Man page
corpus_training Man page
cosine Man page
delTriple Man page
dimcalc Man page
dimcalc_fraction Man page
dimcalc_kaiser Man page
dimcalc_ndocs Man page
dimcalc_raw Man page
dimcalc_share Man page
entropy Man page
fold_in Man page
getSubjectId Man page
getTriple Man page
gw_entropy Man page
gw_gfidf Man page
gw_idf Man page
gw_normalisation Man page
lsa Man page
lw_bintf Man page
lw_logtf Man page
lw_tf Man page
print.textmatrix Man page
query Man page
sample.textmatrix Man page
setTriple Man page
specialchars Man page
stopwords_ar Man page
stopwords_de Man page
stopwords_en Man page
stopwords_fr Man page
stopwords_nl Man page
stopwords_pl Man page
summary.textmatrix Man page
textmatrix Man page
textvector Man page

Files

lsa
lsa/tests
lsa/tests/lsa-tests.R
lsa/tests/polski.RData
lsa/NAMESPACE
lsa/demo
lsa/demo/lsa_landauer.R
lsa/demo/lsa_plot.R
lsa/demo/00Index
lsa/demo/lsa_essayscoring.R
lsa/data
lsa/data/corpus_scores.rda
lsa/data/specialchars.rda
lsa/data/stopwords_pl.rda
lsa/data/stopwords_ar.rda
lsa/data/stopwords_nl.rda
lsa/data/stopwords_en.rda
lsa/data/stopwords_de.rda
lsa/data/corpus_training.rda
lsa/data/corpus_essays.rda
lsa/data/alnumx.rda
lsa/data/stopwords_fr.rda
lsa/R
lsa/R/associate.R lsa/R/cosine.R lsa/R/query.R lsa/R/weightings.R lsa/R/triples.R lsa/R/textmatrix.R lsa/R/dimcalc.R lsa/R/sample.textmatrix.R lsa/R/lsa.R
lsa/MD5
lsa/DESCRIPTION
lsa/ChangeLog
lsa/man
lsa/man/textmatrix.Rd lsa/man/summary.textmatrix.Rd lsa/man/specialchars.Rd lsa/man/associate.Rd lsa/man/cosine.Rd lsa/man/corpora.Rd lsa/man/print.textmatrix.Rd lsa/man/triples.Rd lsa/man/weightings.Rd lsa/man/sample.textmatrix.Rd lsa/man/lsa.Rd lsa/man/as.textmatrix.Rd lsa/man/query.Rd lsa/man/dimcalc.Rd lsa/man/foldin.Rd lsa/man/alnumx.Rd lsa/man/stopwords.Rd

Questions? Problems? Suggestions? or email at ian@mutexlabs.com.

Please suggest features or report bugs with the GitHub issue tracker.

All documentation is copyright its authors; we didn't write any of that.