FeatureHashing: Creates a Model Matrix via Feature Hashing with a Formula Interface

Share:

Feature hashing, also called as the hashing trick, is a method to transform features of a instance to a vector. Thus, it is a method to transform a real dataset to a matrix. Without looking up the indices in an associative array, it applies a hash function to the features and uses their hash values as indices directly. The method of feature hashing in this package was proposed in Weinberger et al. (2009). The hashing algorithm is the murmurhash3 from the digest package. Please see the README in https://github.com/wush978/FeatureHashing for more information.

Author
Wush Wu [aut, cre], Michael Benesty [aut, ctb]
Date of publication
2015-10-18 01:06:55
Maintainer
Wush Wu <wush978@gmail.com>
License
GPL (>= 3) | file LICENSE
Version
0.9.1.1
URLs

View on CRAN

Man pages

CSCMatrix-class
CSCMatrix
hashed.model.matrix
Create a model matrix with feature hashing
hash.mapping
Extract mapping between hash and original values
hash.size
Compute minimum hash size to reduce collision rate
intToRaw
Convert the integer to raw vector with endian correction
ipinyou
iPinYou Real-Time Bidding Dataset for Computational...
simulate.split
Simulate how 'split' work in 'hashed.model.matrix' to split...
test.tag
test.tag

Files in this package

FeatureHashing
FeatureHashing/inst
FeatureHashing/inst/ftprl.R
FeatureHashing/inst/doc
FeatureHashing/inst/doc/FeatureHashing.R
FeatureHashing/inst/doc/SentimentAnalysis.html
FeatureHashing/inst/doc/FeatureHashing.html
FeatureHashing/inst/doc/SentimentAnalysis.Rmd
FeatureHashing/inst/doc/FeatureHashing.Rmd
FeatureHashing/inst/runTest.R
FeatureHashing/tests
FeatureHashing/tests/test-interpret.tag.R
FeatureHashing/tests/test-subsetting.R
FeatureHashing/tests/test-signed.hash.R
FeatureHashing/tests/test-product.R
FeatureHashing/tests/test-as-dgCMatrix.R
FeatureHashing/tests/test-hash.mapping.R
FeatureHashing/tests/test-progress_bar.R
FeatureHashing/tests/test-hashing.R
FeatureHashing/tests/test-transpose.R
FeatureHashing/tests/test-split.R
FeatureHashing/tests/test-memcheck.R
FeatureHashing/tests/test-existence_collision.R
FeatureHashing/tests/test-empty_array.R
FeatureHashing/src
FeatureHashing/src/as.cpp
FeatureHashing/src/Makevars
FeatureHashing/src/hashed_model_matrix.cpp
FeatureHashing/src/hash_function.h
FeatureHashing/src/bswap_32.cpp
FeatureHashing/src/split.cpp
FeatureHashing/src/hash_internal.cpp
FeatureHashing/src/bswap_32.h
FeatureHashing/src/subsetting.cpp
FeatureHashing/src/product.cpp
FeatureHashing/src/intToRaw.cpp
FeatureHashing/src/split.h
FeatureHashing/src/hashed_model_matrix.h
FeatureHashing/src/digest.c
FeatureHashing/src/Makevars.win
FeatureHashing/src/RcppExports.cpp
FeatureHashing/src/digestlocal.h
FeatureHashing/src/vector_converter.h
FeatureHashing/NAMESPACE
FeatureHashing/data
FeatureHashing/data/test.tag.rda
FeatureHashing/data/ipinyou.rda
FeatureHashing/Changelog
FeatureHashing/R
FeatureHashing/R/hashed.model.matrix.R
FeatureHashing/R/simulate.split.R
FeatureHashing/R/RcppExports.R
FeatureHashing/R/hash.mapping.R
FeatureHashing/R/hash.size.R
FeatureHashing/R/zzz.R
FeatureHashing/R/matrix.R
FeatureHashing/vignettes
FeatureHashing/vignettes/SentimentAnalysis.Rmd
FeatureHashing/vignettes/FeatureHashing.Rmd
FeatureHashing/vignettes/vignette.css
FeatureHashing/vignettes/FeatureHashing.bib
FeatureHashing/README.md
FeatureHashing/MD5
FeatureHashing/build
FeatureHashing/build/vignette.rds
FeatureHashing/DESCRIPTION
FeatureHashing/man
FeatureHashing/man/hash.mapping.Rd
FeatureHashing/man/simulate.split.Rd
FeatureHashing/man/hashed.model.matrix.Rd
FeatureHashing/man/hash.size.Rd
FeatureHashing/man/CSCMatrix-class.Rd
FeatureHashing/man/test.tag.Rd
FeatureHashing/man/intToRaw.Rd
FeatureHashing/man/ipinyou.Rd
FeatureHashing/LICENSE