cleanNLP: A Tidy Data Model for Natural Language Processing

library(cleanNLP)

context("Testing tools for working with textual data")

data(un)

test_that("testing utils_tfidf", {
  cnlp_init_stringi()
  anno <- cnlp_annotate(un, verbose=FALSE)

  tf_direct <- cnlp_utils_tfidf(anno$token)
  expect_equal(dim(tf_direct), c(30, 79))
  expect_equal(anno$document$doc_id, rownames(tf_direct))
})


test_that("testing tidy_pca", {

  cnlp_init_stringi()
  anno <- cnlp_annotate(un, verbose=FALSE)

  res <- cnlp_utils_pca(cnlp_utils_tfidf(anno$token))
  expect_equal(rownames(res), anno$document$doc_id)
  expect_equal(colnames(res), c("PC1", "PC2"))

  res <- cnlp_utils_pca(cnlp_utils_tfidf(anno$token), k=4)
  expect_equal(rownames(res), anno$document$doc_id)
  expect_equal(colnames(res), c("PC1", "PC2", "PC3", "PC4"))

})

statsmaths/cleanNLP documentation built on May 21, 2024, 6:47 a.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

statsmaths/cleanNLP
A Tidy Data Model for Natural Language Processing

tests/testthat/test-tools.R
In statsmaths/cleanNLP: A Tidy Data Model for Natural Language Processing

R Package Documentation

Browse R Packages

We want your feedback!

statsmaths/cleanNLP A Tidy Data Model for Natural Language Processing

tests/testthat/test-tools.R In statsmaths/cleanNLP: A Tidy Data Model for Natural Language Processing

R Package Documentation

Browse R Packages

We want your feedback!

statsmaths/cleanNLP
A Tidy Data Model for Natural Language Processing

tests/testthat/test-tools.R
In statsmaths/cleanNLP: A Tidy Data Model for Natural Language Processing