FeatureHashing: Creates a Model Matrix via Feature Hashing with a Formula Interface

Feature hashing, also called as the hashing trick, is a method to transform features of a instance to a vector. Thus, it is a method to transform a real dataset to a matrix. Without looking up the indices in an associative array, it applies a hash function to the features and uses their hash values as indices directly. The method of feature hashing in this package was proposed in Weinberger et al. (2009). The hashing algorithm is the murmurhash3 from the digest package. Please see the README in https://github.com/wush978/FeatureHashing for more information.

AuthorWush Wu [aut, cre], Michael Benesty [aut, ctb]
Date of publication2015-10-18 01:06:55
MaintainerWush Wu <wush978@gmail.com>
LicenseGPL (>= 3) | file LICENSE
Version0.9.1.1
https://github.com/wush978/FeatureHashing

View on CRAN

Files in this package

FeatureHashing
FeatureHashing/inst
FeatureHashing/inst/ftprl.R
FeatureHashing/inst/doc
FeatureHashing/inst/doc/FeatureHashing.R
FeatureHashing/inst/doc/SentimentAnalysis.html
FeatureHashing/inst/doc/FeatureHashing.html
FeatureHashing/inst/doc/SentimentAnalysis.Rmd
FeatureHashing/inst/doc/FeatureHashing.Rmd
FeatureHashing/inst/runTest.R
FeatureHashing/tests
FeatureHashing/tests/test-interpret.tag.R
FeatureHashing/tests/test-subsetting.R
FeatureHashing/tests/test-signed.hash.R
FeatureHashing/tests/test-product.R
FeatureHashing/tests/test-as-dgCMatrix.R
FeatureHashing/tests/test-hash.mapping.R
FeatureHashing/tests/test-progress_bar.R
FeatureHashing/tests/test-hashing.R
FeatureHashing/tests/test-transpose.R
FeatureHashing/tests/test-split.R
FeatureHashing/tests/test-memcheck.R
FeatureHashing/tests/test-existence_collision.R
FeatureHashing/tests/test-empty_array.R
FeatureHashing/src
FeatureHashing/src/as.cpp
FeatureHashing/src/Makevars
FeatureHashing/src/hashed_model_matrix.cpp
FeatureHashing/src/hash_function.h
FeatureHashing/src/bswap_32.cpp
FeatureHashing/src/split.cpp
FeatureHashing/src/hash_internal.cpp
FeatureHashing/src/bswap_32.h
FeatureHashing/src/subsetting.cpp
FeatureHashing/src/product.cpp
FeatureHashing/src/intToRaw.cpp
FeatureHashing/src/split.h
FeatureHashing/src/hashed_model_matrix.h
FeatureHashing/src/digest.c
FeatureHashing/src/Makevars.win
FeatureHashing/src/RcppExports.cpp
FeatureHashing/src/digestlocal.h
FeatureHashing/src/vector_converter.h
FeatureHashing/NAMESPACE
FeatureHashing/data
FeatureHashing/data/test.tag.rda
FeatureHashing/data/ipinyou.rda
FeatureHashing/Changelog
FeatureHashing/R
FeatureHashing/R/hashed.model.matrix.R FeatureHashing/R/simulate.split.R FeatureHashing/R/RcppExports.R FeatureHashing/R/hash.mapping.R FeatureHashing/R/hash.size.R FeatureHashing/R/zzz.R FeatureHashing/R/matrix.R
FeatureHashing/vignettes
FeatureHashing/vignettes/SentimentAnalysis.Rmd
FeatureHashing/vignettes/FeatureHashing.Rmd
FeatureHashing/vignettes/vignette.css
FeatureHashing/vignettes/FeatureHashing.bib
FeatureHashing/README.md
FeatureHashing/MD5
FeatureHashing/build
FeatureHashing/build/vignette.rds
FeatureHashing/DESCRIPTION
FeatureHashing/man
FeatureHashing/man/hash.mapping.Rd FeatureHashing/man/simulate.split.Rd FeatureHashing/man/hashed.model.matrix.Rd FeatureHashing/man/hash.size.Rd FeatureHashing/man/CSCMatrix-class.Rd FeatureHashing/man/test.tag.Rd FeatureHashing/man/intToRaw.Rd FeatureHashing/man/ipinyou.Rd
FeatureHashing/LICENSE

Questions? Problems? Suggestions? or email at ian@mutexlabs.com.

All documentation is copyright its authors; we didn't write any of that.