PRISMA: Protocol Inspection and State Machine Analysis
Version 0.2-6

Loads and processes huge text corpora processed with the sally toolbox (). sally acts as a very fast preprocessor which splits the text files into tokens or n-grams. These output files can then be read with the PRISMA package which applies testing-based token selection and has some replicate-aware, highly tuned non-negative matrix factorization and principal component analysis implementation which allows the processing of very big data sets even on desktop machines.

Browse man pages Browse package API and functions Browse package files

AuthorTammo Krueger, Nicole Kraemer
Date of publication2017-02-28 00:34:16
MaintainerTammo Krueger <tammokrueger@googlemail.com>
LicenseGPL (>= 2.0)
Version0.2-6
Package repositoryView on CRAN
InstallationInstall the latest version of this package by entering the following in R:
install.packages("PRISMA")

Man pages

asap: The ASAP Data Set
corpusToPrisma: Convert tm copus to PRISMA
estimateDimension: Estimate Inner Dimension
generics: Generics For PRISMA Objects
generics_dimension: Generics For PRISMA Objects
generics_mf: Generics For PRISMA Objects
getDuplicateData: Restores Data with Duplicates
getMatrixFactorizationLabels: Convert Coordinates of Matrix Factorization to Labels
loadPrismaData: Load PRISMA Data Files
prismaDuplicatePCA: Matrix Factorization Based on Replicate-Aware PCA
prismaHclust: Matrix Factorization Based on Hierarchical Clustering
prismaNMF: Matrix Factorization Based on Replicate-Aware NMF
PRISMA-package: PRISMA - Protocol Inspection and State Machine Analysis
thesis: The Thesis Data Set

Functions

PRISMA Man page
PRISMA-package Man page
RRbyCV Source code
asap Man page
calcClassForSparseMatrix Source code
calcDatacluster Source code
calcLinesPerThreshold Source code
compressByGroup Source code
corpusToPrisma Man page Source code
count2bin Source code
count2freq Source code
duplicateRemover Source code
estimateDimension Man page Source code
filterDataByTestAndCor Source code
genBase Source code
getDuplicateData Man page Source code
getMatrixFactorizationLabels Man page Source code
groupCorrelatedNgrams Source code
loadPrismaData Man page Source code
normBase Source code
plot.prisma Man page Source code
plot.prismaDimension Man page Source code
plot.prismaMF Man page Source code
plotMatrixFactor Source code
pmf Source code
preprocessPrismaData Source code
print.prisma Man page Source code
print.prismaDimension Man page Source code
prismaDuplicatePCA Man page Source code
prismaHclust Man page Source code
prismaNMF Man page Source code
readFSally Source code
readHarry Source code
readPrismaInput Source code
readRaw Source code
readSally Source code
reconstructSparsePCA Source code
scrambleFeature Source code
scramblePCA Source code
sparse.cor Source code
sparseCov Source code
sparsePCA Source code
thesis Man page
ttestNgrams Source code

Files

inst
inst/extdata
inst/extdata/README
inst/extdata/sallyPreprocessing.py
inst/extdata/asap.tar.gz
inst/doc
inst/doc/PRISMA.R
inst/doc/PRISMA.pdf
inst/doc/PRISMA.Rnw
NAMESPACE
data
data/thesis.rda
data/asap.rda
R
R/dimensionEstimation.R
R/prisma.R
R/matrixFactorization.R
vignettes
vignettes/PRISMA.Rnw
vignettes/PRISMA.bib
vignettes/asap.pdf
README.md
MD5
build
build/vignette.rds
DESCRIPTION
man
man/prismaNMF.Rd
man/prismaDuplicatePCA.Rd
man/estimateDimension.Rd
man/getMatrixFactorizationLabels.Rd
man/loadPrismaData.Rd
man/generics.Rd
man/generics_dimension.Rd
man/thesis.Rd
man/generics_mf.Rd
man/PRISMA-package.Rd
man/prismaHclust.Rd
man/asap.Rd
man/corpusToPrisma.Rd
man/getDuplicateData.Rd
PRISMA documentation built on May 20, 2017, 1:11 a.m.