Man pages for paithiov909/ldccr
Utilities for Various Japanese Corpora

AozoraBunkoSnapshot   Meta data of text files published on Aozora Bunko
jrte_rte_files        Data for Textual Entailment
ldccr-package         ldccr: Utilities for Various Japanese Corpora
ldnws_categories      List of categories of the Livedoor News Corpus
NekoText              Whole text of ‘Wagahai Wa Neko Dearu’ written by Natsume...
parse_jrte_judges     Parse judges column of 'rte.*.tsv'
parse_jrte_reasoning  Parse reasoning column of 'rte.*.tsv'
read_aozora           Download text file from Aozora Bunko
read_ja_text8         Read the ja.text8 corpus
read_jrte             Read the JRTE Corpus
read_ldnws            Read the Livedoor News Corpus
sqids                 Generate random-looking IDs from integer ranks
sqids_impl            Encode and decode 'Sqids'
unidic-downloader     Download and unzip 'UniDic'
utils                 Utility functions
paithiov909/ldccr documentation built on Feb. 3, 2025, 12:16 a.m.