Man pages for mlampros/textTinyR
Text Processing for Small or Big Data Files

big_tokenize_transform: String tokenization and transformation for big data sets
bytes_converter: bytes converter of a text file ( KB, MB or GB )
cosine_distance: cosine distance of two character strings (each string...
dense_2sparse: convert a dense matrix to a sparse matrix
dice_distance: dice similarity of words using n-grams
levenshtein_distance: levenshtein distance of two words
load_sparse_binary: load a sparse matrix in binary format
matrix_sparsity: sparsity percentage of a sparse matrix
read_characters: read a specific number of characters from a text file
read_rows: read a specific number of rows from a text file
save_sparse_binary: save a sparse matrix in binary format
sparse_Means: RowMeans and colMeans for a sparse matrix
sparse_Sums: RowSums and colSums for a sparse matrix
sparse_term_matrix: Term matrices and statistics ( document-term-matrix,...
text_file_parser: text file parser
tokenize_transform_text: String tokenization and transformation ( character string or...
tokenize_transform_vec_docs: String tokenization and transformation ( vector of documents...
token_stats: token statistics
utf_locale: utf-locale for the available languages
vocabulary_parser: returns the vocabulary counts for small or medium ( xml and...
mlampros/textTinyR documentation built on Jan. 21, 2018, 10:58 a.m.