Description Usage Format Source
The corpus is a random subset of 25,000 sentences from one of the Indonesian Leipzig Corpora files, i.e., the "ind_news_2008_300K-sentences.txt"
. This corpus file originally contains 300,000 sentences of Indonesian online newspapers.
1 |
A character vector of 25,000 elements of sentences.
http://wortschatz.uni-leipzig.de/en/download
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.