PRISMA: Protocol Inspection and State Machine Analysis

PRISMA loads and analyzes large text corpora that have been preprocessed with the sally toolbox (<http://www.mlsec.org/sally/>). sally acts as a very fast preprocessor that splits the text files into tokens or n-grams. The PRISMA package reads these output files, applies testing-based token selection, and provides replicate-aware, highly tuned implementations of non-negative matrix factorization and principal component analysis, which allow very big data sets to be processed even on desktop machines.

Install the latest version of this package by entering the following in R:
install.packages("PRISMA")
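
Once the package is installed, a first session might look like the sketch below. It assumes that the bundled asap data set (data/asap.rda) is already a PRISMA data object and that prismaNMF accepts the result of estimateDimension as its second argument. Both are assumptions based on the file list and should be checked against ?estimateDimension and ?prismaNMF. The whole block is skipped when PRISMA is not installed.

```r
# Hedged sketch of a first PRISMA session; not taken from the package vignette.
prisma_available <- requireNamespace("PRISMA", quietly = TRUE)

if (prisma_available) {
  library(PRISMA)

  # Example data shipped with the package (see data/asap.rda).
  data(asap)
  print(asap)

  # Estimate the number of components, then run the NMF.
  # Assumption: the dimension estimate can be passed on directly;
  # consult ?prismaNMF before relying on this argument order.
  dims <- estimateDimension(asap)
  asap_nmf <- prismaNMF(asap, dims)
  print(asap_nmf)
}
```

For your own corpora, the sally output would be read with loadPrismaData instead of data(asap); see the package vignette (inst/doc/PRISMA.pdf) for the expected file layout.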
Author: Tammo Krueger, Nicole Kraemer
Date of publication: 2017-02-28 00:34:16
Maintainer: Tammo Krueger <tammokrueger@googlemail.com>
License: GPL (>= 2.0)
Version: 0.2-6

Files

inst
inst/extdata
inst/extdata/README
inst/extdata/sallyPreprocessing.py
inst/extdata/asap.tar.gz
inst/doc
inst/doc/PRISMA.R
inst/doc/PRISMA.pdf
inst/doc/PRISMA.Rnw
NAMESPACE
data
data/thesis.rda
data/asap.rda
R
R/dimensionEstimation.R
R/prisma.R
R/matrixFactorization.R
vignettes
vignettes/PRISMA.Rnw
vignettes/PRISMA.bib
vignettes/asap.pdf
README.md
MD5
build
build/vignette.rds
DESCRIPTION
man
man/prismaNMF.Rd
man/prismaDuplicatePCA.Rd
man/estimateDimension.Rd
man/getMatrixFactorizationLabels.Rd
man/loadPrismaData.Rd
man/generics.Rd
man/generics_dimension.Rd
man/thesis.Rd
man/generics_mf.Rd
man/PRISMA-package.Rd
man/prismaHclust.Rd
man/asap.Rd
man/corpusToPrisma.Rd
man/getDuplicateData.Rd