| tokenize_text_by_language | R Documentation |
Language-aware tokenizer used across embedders and keyword search
tokenize_text_by_language(text, language = "en", remove_stopwords = FALSE)
text |
Input text |
language |
"en" or "ml" |
remove_stopwords |
Remove English stopwords when language is "en" |
Character vector of tokens
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.