Extract and save ngrams from text to aid in the removal of boilerplate text from documents.
1 | get_ngrams(input_dir, ngram_dir, language = "en")
|
input_dir |
Directory containing text files to extract ngrams from. |
ngram_dir |
Directory to save ngrams to. |
language |
Language in which documents are written. |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.