View source: R/ja-text8-reader.R
read_ja_text8 | R Documentation |
Download and read the ja.text8 corpus as a tibble.
read_ja_text8(
url =
"https://s3-ap-northeast-1.amazonaws.com/dev.tech-sketch.jp/chakki/public/ja.text8.zip",
size = NULL
)
url |
String. |
size |
Integer. If supplied, samples rows by this argument. |
By default, this function reads the ja.text8 corpus as a tibble by splitting it into sentences. The ja.text8 as whole corpus consists of over 582,000 sentences, 16,900,026 tokens, and 290,811 vocabularies.
A tibble.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.