SpeedReader: High Performance Text Analysis

library(SpeedReader)
context("TFIDF")

test_that("That tfidf works", {
    # load data
    # data(document_term_vector_list)
    # data(document_term_count_list)
    files <- get_file_paths(source = "test sparse doc-term")

    sdtm <- generate_sparse_large_document_term_matrix(
        file_list = files,
        file_directory = NULL,
        vocabulary = NULL,
        maximum_vocabulary_size = -1,
        using_document_term_counts = TRUE)

    test_tfidf <- tfidf(sdtm,
                      colnames(sdtm),
                      remove_documents_with_no_terms = FALSE,
                      only_calculate_corpus_level_statistics = TRUE,
                      display_rankings = TRUE,
                      top_words_to_display = 40)



})

matthewjdenny/SpeedReader documentation built on March 25, 2020, 5:32 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

matthewjdenny/SpeedReader
High Performance Text Analysis

tests/testthat/test_tfidf.R
In matthewjdenny/SpeedReader: High Performance Text Analysis

R Package Documentation

Browse R Packages

We want your feedback!

matthewjdenny/SpeedReader High Performance Text Analysis

tests/testthat/test_tfidf.R In matthewjdenny/SpeedReader: High Performance Text Analysis

R Package Documentation

Browse R Packages

We want your feedback!

matthewjdenny/SpeedReader
High Performance Text Analysis

tests/testthat/test_tfidf.R
In matthewjdenny/SpeedReader: High Performance Text Analysis