Man pages for paithiov909/ldccr
Utilities for Various Japanese Corpora

AozoraBunkoSnapshot   Meta data of text files published on Aozora Bunko
clean_emoji           Remove emojis
clean_url             Remove URLs
download_unidic       Download and unzip 'UniDic'
is_within_era         Check if dates are within Japanese era
jrte_rte_files        Data for Textual Entailment
ldccr-package         ldccr: Utilities for Various Japanese Corpora
ldnws_categories      List of categories of the Livedoor News Corpus
NekoText              Whole text of ‘Wagahai Wa Neko Dearu’ written by Natsume...
parse_jrte_judges     Parse judges column of 'rte.*.tsv'
parse_jrte_reasoning  Parse reasoning column of 'rte.*.tsv'
parse_to_jdate        Parse dates to Japanese dates
read_aozora           Download text file from Aozora Bunko
read_ja_text8         Read the ja.text8 corpus
read_jrte             Read the JRTE Corpus
read_ldnws            Read the Livedoor News Corpus
unidic_availables     List of available 'UniDic'
paithiov909/ldccr documentation built on Oct. 14, 2024, 3:44 a.m.