preText: Diagnostics to Assess the Effects of Text Preprocessing Decisions
Version 0.4.4

Functions to assess the effects of different text preprocessing decisions on the inferences drawn from the resulting document-term matrices they generate.

Author: Matthew J. Denny <mdenny@psu.edu>, Arthur Spirling <as9934@nyu.edu>
Date of publication: 2016-10-08 19:02:21
Maintainer: Matthew J. Denny <mdenny@psu.edu>
License: GPL-3
Package repository: CRAN
Installation: Install the latest version of this package by entering the following in R:
install.packages("preText")

Getting started

README.md
Getting Started With preText

Man pages

calculate_prediction_errors: Calculate mean prediction error for preprocessing decisions.
dfm_scaling_test: Comparison of dfms using N-dimensional scaling, with a test...
document_position_plots: Document Position Plots
factorial_preprocessing: A function to perform factorial preprocessing of a corpus of...
mantel_comparison: Ensemble Mantel Tests
mantel_comparison_to_base: Ensemble Mantel Tests
optimal_k_comparison: Optimal Topic Model k Comparison
preprocessing_choice_regression: Preprocessing Choice Regressions
preText: preText: Diagnostics to Assess The Effects of Text...
preText_score_plot: preText specification plot
preText_test: preText Test
regression_coefficient_plot: Regression Coefficient Plot
remove_infrequent_terms: Remove infrequently occurring terms from quanteda dfm.
scaling_comparison: Scaling Comparison.
topic_key_term_plot: Plot Prevalence of Topic Key Terms
topic_novelty_score: Topic Top-Terms Novelty Score
UK_Manifestos: Full text of 69 UK party manifestos from 1918-2001.
wordfish_comparison: Wordfish Comparison.
wordfish_rank_plot: Plot of Wordfish rankings of documents

Functions

UK_Manifestos Man page
calculate_prediction_errors Man page Source code
dfm_scaling_test Man page Source code
document_position_plots Man page Source code
factorial_preprocessing Man page Source code
find_optimal_number_of_topics Source code
get_perplexities Source code
mantel_comparison Man page Source code
mantel_comparison_to_base Man page Source code
multiplot Source code
onAttach Source code
optimal_k_comparison Man page Source code
parallel_preprocess Source code
parallel_rank_test Source code
preText Man page Source code
preText-package Man page
preText_score_plot Man page Source code
preText_test Man page Source code
preprocessing_choice_regression Man page Source code
regression_coefficient_plot Man page Source code
remove_infrequent_terms Man page Source code
scaling_comparison Man page Source code
temporal_filter Source code
topic_key_term_plot Man page Source code
topic_novelty_score Man page Source code
wordfish_comparison Man page Source code
wordfish_rank_plot Man page Source code

Files

inst
inst/doc
inst/doc/getting_started_with_preText.Rmd
inst/doc/getting_started_with_preText.html
inst/doc/getting_started_with_preText.R
tests
tests/testthat.R
tests/testthat
tests/testthat/test_factorial_preprocessing.R
NAMESPACE
data
data/UK_Manifestos.rda
R
R/scaling_comparison.R
R/parallel_preprocess.R
R/parallel_rank_test.R
R/preText_test.R
R/preText.R
R/mantel_comparison.R
R/document_position_plots.R
R/regression_coefficient_plot.R
R/wordfish_rank_plot.R
R/find_optimal_number_of_topics.R
R/get_perplexities.R
R/topic_novelty_score.R
R/remove_infrequent_terms.R
R/wordfish_comparison.R
R/topic_key_term_plot.R
R/Package_Documentation.R
R/temporal_filter.R
R/preText_score_plot.R
R/dfm_scaling_test.R
R/multiplot_ggplot2.R
R/preprocessing_choice_regression.R
R/factorial_preprocessing.R
R/Data_Documentation.R
R/calculate_prediction_errors.R
R/zzz.R
R/mantel_comparison_to_base.R
R/optimal_k_comparison.R
vignettes
vignettes/getting_started_with_preText.Rmd
README.md
MD5
build
build/vignette.rds
DESCRIPTION
man
man/preText.Rd
man/regression_coefficient_plot.Rd
man/dfm_scaling_test.Rd
man/preText_score_plot.Rd
man/scaling_comparison.Rd
man/mantel_comparison_to_base.Rd
man/calculate_prediction_errors.Rd
man/preText_test.Rd
man/optimal_k_comparison.Rd
man/preprocessing_choice_regression.Rd
man/factorial_preprocessing.Rd
man/mantel_comparison.Rd
man/wordfish_comparison.Rd
man/document_position_plots.Rd
man/topic_novelty_score.Rd
man/remove_infrequent_terms.Rd
man/topic_key_term_plot.Rd
man/UK_Manifestos.Rd
man/wordfish_rank_plot.Rd

preText documentation built on May 20, 2017, 2:56 a.m.