R/textstem-package.R

#' Tools for Stemming and Lemmatizing Text
#'
#' Tools that stem and lemmatize text.  Stemming is a process that removes
#' endings such as suffixes.  Lemmatization is the process of grouping 
#' inflected forms together as a single base form.
#' @docType package
#' @name textstem
#' @aliases textstem package-textstem
NULL


#' 2012 U.S. Presidential Debates
#'
#' A dataset containing a cleaned version of all three presidential debates for
#' the 2012 election.
#'
#' @details
#' \itemize{
#'   \item person. The speaker
#'   \item tot. Turn of talk
#'   \item dialogue. The words spoken
#'   \item time. Variable indicating which of the three debates the dialogue is from
#' }
#'
#' @docType data
#' @keywords datasets
#' @name presidential_debates_2012
#' @usage data(presidential_debates_2012)
#' @format A data frame with 2912 rows and 4 variables
NULL





#' Sam I Am Text
#'
#' A dataset containing a character vector of the text from Seuss's 'Sam I Am'.
#'
#' @docType data
#' @keywords datasets
#' @name sam_i_am
#' @usage data(sam_i_am)
#' @format A character vector with 169 elements
#' @references Seuss, Dr. (1960). Green Eggs and Ham.
NULL

Try the textstem package in your browser

Any scripts or data that you put into this service are public.

textstem documentation built on May 2, 2019, 6:42 a.m.