FeatureHashing: Creates a Model Matrix via Feature Hashing with a Formula Interface
Version 0.9.1.1

Feature hashing, also called as the hashing trick, is a method to transform features of a instance to a vector. Thus, it is a method to transform a real dataset to a matrix. Without looking up the indices in an associative array, it applies a hash function to the features and uses their hash values as indices directly. The method of feature hashing in this package was proposed in Weinberger et al. (2009). The hashing algorithm is the murmurhash3 from the digest package. Please see the README in https://github.com/wush978/FeatureHashing for more information.

AuthorWush Wu [aut, cre], Michael Benesty [aut, ctb]
Date of publication2015-10-18 01:06:55
MaintainerWush Wu <wush978@gmail.com>
LicenseGPL (>= 3) | file LICENSE
Version0.9.1.1
URL https://github.com/wush978/FeatureHashing
Package repositoryView on CRAN
InstallationInstall the latest version of this package by entering the following in R:
install.packages("FeatureHashing")

Getting started

README.md
FeatureHashing
Sentiment Analysis via R, FeatureHashing and XGBoost

Popular man pages

CSCMatrix-class: CSCMatrix
hash.mapping: Extract mapping between hash and original values
hash.size: Compute minimum hash size to reduce collision rate
intToRaw: Convert the integer to raw vector with endian correction
ipinyou: iPinYou Real-Time Bidding Dataset for Computational...
simulate.split: Simulate how 'split' work in 'hashed.model.matrix' to split...
test.tag: test.tag
See all...

All man pages Function index File listing

Man pages

CSCMatrix-class: CSCMatrix
hashed.model.matrix: Create a model matrix with feature hashing
hash.mapping: Extract mapping between hash and original values
hash.size: Compute minimum hash size to reduce collision rate
intToRaw: Convert the integer to raw vector with endian correction
ipinyou: iPinYou Real-Time Bidding Dataset for Computational...
simulate.split: Simulate how 'split' work in 'hashed.model.matrix' to split...
test.tag: test.tag

Functions

Files

inst
inst/ftprl.R
inst/doc
inst/doc/FeatureHashing.R
inst/doc/SentimentAnalysis.html
inst/doc/FeatureHashing.html
inst/doc/SentimentAnalysis.Rmd
inst/doc/FeatureHashing.Rmd
inst/runTest.R
tests
tests/test-interpret.tag.R
tests/test-subsetting.R
tests/test-signed.hash.R
tests/test-product.R
tests/test-as-dgCMatrix.R
tests/test-hash.mapping.R
tests/test-progress_bar.R
tests/test-hashing.R
tests/test-transpose.R
tests/test-split.R
tests/test-memcheck.R
tests/test-existence_collision.R
tests/test-empty_array.R
src
src/as.cpp
src/Makevars
src/hashed_model_matrix.cpp
src/hash_function.h
src/bswap_32.cpp
src/split.cpp
src/hash_internal.cpp
src/bswap_32.h
src/subsetting.cpp
src/product.cpp
src/intToRaw.cpp
src/split.h
src/hashed_model_matrix.h
src/digest.c
src/Makevars.win
src/RcppExports.cpp
src/digestlocal.h
src/vector_converter.h
NAMESPACE
data
data/test.tag.rda
data/ipinyou.rda
Changelog
R
R/hashed.model.matrix.R
R/simulate.split.R
R/RcppExports.R
R/hash.mapping.R
R/hash.size.R
R/zzz.R
R/matrix.R
vignettes
vignettes/SentimentAnalysis.Rmd
vignettes/FeatureHashing.Rmd
vignettes/vignette.css
vignettes/FeatureHashing.bib
README.md
MD5
build
build/vignette.rds
DESCRIPTION
man
man/hash.mapping.Rd
man/simulate.split.Rd
man/hashed.model.matrix.Rd
man/hash.size.Rd
man/CSCMatrix-class.Rd
man/test.tag.Rd
man/intToRaw.Rd
man/ipinyou.Rd
LICENSE
FeatureHashing documentation built on May 19, 2017, 10:41 a.m.

Questions? Problems? Suggestions? Tweet to @rdrrHQ or email at ian@mutexlabs.com.

Please suggest features or report bugs in the GitHub issue tracker.

All documentation is copyright its authors; we didn't write any of that.