FeatureHashing: Creates a Model Matrix via Feature Hashing with a Formula Interface

Feature hashing, also called as the hashing trick, is a method to transform features of a instance to a vector. Thus, it is a method to transform a real dataset to a matrix. Without looking up the indices in an associative array, it applies a hash function to the features and uses their hash values as indices directly. The method of feature hashing in this package was proposed in Weinberger et al. (2009). The hashing algorithm is the murmurhash3 from the digest package. Please see the README in https://github.com/wush978/FeatureHashing for more information.

AuthorWush Wu [aut, cre], Michael Benesty [aut, ctb]
Date of publication2015-10-18 01:06:55
MaintainerWush Wu <wush978@gmail.com>
LicenseGPL (>= 3) | file LICENSE
Version0.9.1.1
https://github.com/wush978/FeatureHashing

View on CRAN

Functions

CSCMatrix-class Man page
[,CSCMatrix,missing,numeric,ANY-method Man page
\%*\%,CSCMatrix,numeric-method Man page
[,CSCMatrix,numeric,missing,ANY-method Man page
[,CSCMatrix,numeric,numeric,ANY-method Man page
dim<-,CSCMatrix-method Man page
dim,CSCMatrix-method Man page
hashed.interaction.value Man page
hashed.model.matrix Man page
hashed.value Man page
hash.mapping Man page
hash.sign Man page
hash.size Man page
intToRaw Man page
ipinyou Man page
ipinyou.test Man page
ipinyou.train Man page
\%*\%,numeric,CSCMatrix-method Man page
simulate.split Man page
test.tag Man page

Files

FeatureHashing
FeatureHashing/inst
FeatureHashing/inst/ftprl.R
FeatureHashing/inst/doc
FeatureHashing/inst/doc/FeatureHashing.R
FeatureHashing/inst/doc/SentimentAnalysis.html
FeatureHashing/inst/doc/FeatureHashing.html
FeatureHashing/inst/doc/SentimentAnalysis.Rmd
FeatureHashing/inst/doc/FeatureHashing.Rmd
FeatureHashing/inst/runTest.R
FeatureHashing/tests
FeatureHashing/tests/test-interpret.tag.R
FeatureHashing/tests/test-subsetting.R
FeatureHashing/tests/test-signed.hash.R
FeatureHashing/tests/test-product.R
FeatureHashing/tests/test-as-dgCMatrix.R
FeatureHashing/tests/test-hash.mapping.R
FeatureHashing/tests/test-progress_bar.R
FeatureHashing/tests/test-hashing.R
FeatureHashing/tests/test-transpose.R
FeatureHashing/tests/test-split.R
FeatureHashing/tests/test-memcheck.R
FeatureHashing/tests/test-existence_collision.R
FeatureHashing/tests/test-empty_array.R
FeatureHashing/src
FeatureHashing/src/as.cpp
FeatureHashing/src/Makevars
FeatureHashing/src/hashed_model_matrix.cpp
FeatureHashing/src/hash_function.h
FeatureHashing/src/bswap_32.cpp
FeatureHashing/src/split.cpp
FeatureHashing/src/hash_internal.cpp
FeatureHashing/src/bswap_32.h
FeatureHashing/src/subsetting.cpp
FeatureHashing/src/product.cpp
FeatureHashing/src/intToRaw.cpp
FeatureHashing/src/split.h
FeatureHashing/src/hashed_model_matrix.h
FeatureHashing/src/digest.c
FeatureHashing/src/Makevars.win
FeatureHashing/src/RcppExports.cpp
FeatureHashing/src/digestlocal.h
FeatureHashing/src/vector_converter.h
FeatureHashing/NAMESPACE
FeatureHashing/data
FeatureHashing/data/test.tag.rda
FeatureHashing/data/ipinyou.rda
FeatureHashing/Changelog
FeatureHashing/R
FeatureHashing/R/hashed.model.matrix.R FeatureHashing/R/simulate.split.R FeatureHashing/R/RcppExports.R FeatureHashing/R/hash.mapping.R FeatureHashing/R/hash.size.R FeatureHashing/R/zzz.R FeatureHashing/R/matrix.R
FeatureHashing/vignettes
FeatureHashing/vignettes/SentimentAnalysis.Rmd
FeatureHashing/vignettes/FeatureHashing.Rmd
FeatureHashing/vignettes/vignette.css
FeatureHashing/vignettes/FeatureHashing.bib
FeatureHashing/README.md
FeatureHashing/MD5
FeatureHashing/build
FeatureHashing/build/vignette.rds
FeatureHashing/DESCRIPTION
FeatureHashing/man
FeatureHashing/man/hash.mapping.Rd FeatureHashing/man/simulate.split.Rd FeatureHashing/man/hashed.model.matrix.Rd FeatureHashing/man/hash.size.Rd FeatureHashing/man/CSCMatrix-class.Rd FeatureHashing/man/test.tag.Rd FeatureHashing/man/intToRaw.Rd FeatureHashing/man/ipinyou.Rd
FeatureHashing/LICENSE

Questions? Problems? Suggestions? or email at ian@mutexlabs.com.

Please suggest features or report bugs with the GitHub issue tracker.

All documentation is copyright its authors; we didn't write any of that.