Tools to clean and process text. The tools check for substrings that are not optimal for analysis and either normalize them (replace them with more analysis-friendly substrings or remove them; see Sproat, Black, Chen, Kumar, Ostendorf, & Richards (2001) <doi:10.1006/csla.2001.0169>) or extract them into new variables. For example, emoticons are common in text but are not always easily handled by analysis algorithms. The replace_emoticon() function replaces emoticons with word equivalents.
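As a rough illustration of the emoticon case, the sketch below passes two made-up example strings to replace_emoticon() with its default settings. The input strings and the variable names `x` and `cleaned` are assumptions made for this example, and the guard around the call is only there so the snippet still runs (returning the input unchanged) on a machine where textclean is not installed.

```r
# Hypothetical example input: two short strings containing common emoticons.
x <- c("I am happy :)", "That is sad :(")

# Guard so the sketch still runs without textclean installed;
# in that case the input is passed through unchanged.
if (requireNamespace("textclean", quietly = TRUE)) {
  # Replace each emoticon with a word equivalent (default lookup table).
  cleaned <- textclean::replace_emoticon(x)
} else {
  cleaned <- x
}

print(cleaned)
```

The exact replacement words depend on the emoticon lookup table in use, so no specific output is shown here.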
Package details | |
---|---|
Author | Tyler Rinker [aut, cre], ctwheels StackOverflow [ctb] |
Maintainer | Tyler Rinker <tyler.rinker@gmail.com> |
License | GPL-2 |
Version | 0.9.3 |
URL | https://github.com/trinker/textclean |
Package repository | CRAN |
Installation | Install the latest version of this package by entering the following in R: |
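The install command itself did not survive on this page. Assuming the CRAN release is the one wanted (CRAN is the package repository listed above, and the package name textclean is taken from the URL), the standard call would be:

```r
# Assumes installation from CRAN with the default repository settings.
install.packages("textclean")
```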