This sub-tab offers basic text preprocessing transformations. For stopword removal and more general word list exclusion, see the 'Filter' sub-tab of the 'Data' tab.
Converts all text to lowercase.
Removes all punctuation from the text.
Removes all numbers from the text. Only digits (0-9) count as numbers; Roman numerals, for example, are left in place.
Collapses runs of consecutive whitespace characters into a single blank space.
Stems the text. 'Stemming' is the process of reducing a derived word to its base 'stem'. For example, stemming words such as 'stems', 'stemmed', 'stemming', and so on, would reduce all of them to the word 'stem'.
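The options above can be illustrated with a minimal Python sketch. It is not the code behind this sub-tab: the function names, the order in which the steps run, and the use of ASCII-only punctuation are all assumptions made for illustration. In particular, `toy_stem` is a deliberately tiny suffix stripper written for this example, not the stemming algorithm the tool applies.

```python
import re
import string


def to_lower(text: str) -> str:
    """Make the text uniformly lowercase."""
    return text.lower()


def remove_punctuation(text: str) -> str:
    """Delete punctuation (assumption: ASCII punctuation only)."""
    return text.translate(str.maketrans("", "", string.punctuation))


def remove_digits(text: str) -> str:
    """Delete the digits 0-9; Roman numerals such as 'XIV' are untouched."""
    return re.sub(r"[0-9]", "", text)


def collapse_whitespace(text: str) -> str:
    """Turn each run of whitespace characters into one blank space."""
    return re.sub(r"\s+", " ", text)


def toy_stem(word: str) -> str:
    """Hypothetical toy stemmer: strip one common suffix, then undo
    consonant doubling. Real stemmers use far more rules."""
    for suffix in ("ing", "ed", "s"):
        # Only strip when at least three characters remain.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            word = word[: -len(suffix)]
            break
    # "stemm" -> "stem": drop a doubled final consonant.
    if len(word) >= 2 and word[-1] == word[-2] and word[-1] not in "aeioulsz":
        word = word[:-1]
    return word


def preprocess(text: str, stem: bool = False) -> str:
    """Apply the transformations in one assumed order."""
    text = to_lower(text)
    text = remove_punctuation(text)
    text = remove_digits(text)
    text = collapse_whitespace(text)
    if stem:
        text = " ".join(toy_stem(w) for w in text.split())
    return text


sample = "Chapter XIV:  Stemming   42 stems!"
print(preprocess(sample))             # chapter xiv stemming stems
print(preprocess(sample, stem=True))  # chapter xiv stem stem
```

Note that the Roman numeral survives digit removal (as lowercase 'xiv'), and that whitespace is collapsed after punctuation and digits are removed, so the gaps they leave behind do not produce double spaces.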