R/textstem-package.R

#' Tools for Stemming and Lemmatizing Text
#'
#' Tools that stem and lemmatize text.  Stemming is a process that removes
#' endings such as suffixes.  Lemmatization is the process of grouping 
#' inflected forms together as a single base form.
#' @docType package
#' @name textstem
#' @aliases textstem package-textstem
NULL


#' 2012 U.S. Presidential Debates
#'
#' A dataset containing a cleaned version of all three presidential debates for
#' the 2012 election.
#'
#' @details
#' \itemize{
#'   \item person. The speaker
#'   \item tot. Turn of talk
#'   \item dialogue. The words spoken
#'   \item time. Variable indicating which of the three debates the dialogue is from
#' }
#'
#' @docType data
#' @keywords datasets
#' @name presidential_debates_2012
#' @usage data(presidential_debates_2012)
#' @format A data frame with 2912 rows and 4 variables
NULL





#' Sam I Am Text
#'
#' A dataset containing a character vector of the text from Seuss's 'Sam I Am'.
#'
#' @docType data
#' @keywords datasets
#' @name sam_i_am
#' @usage data(sam_i_am)
#' @format A character vector with 169 elements
#' @references Seuss, Dr. (1960). Green Eggs and Ham.
NULL
trinker/textstem documentation built on June 1, 2019, 1:47 a.m.