audubon-package: audubon: Japanese Text Processing Tools

audubon-packageR Documentation

audubon: Japanese Text Processing Tools

Description

logo

A collection of Japanese text processing tools for filling Japanese iteration marks, Japanese character type conversions, segmentation by phrase, and text normalization which is based on rules for the 'Sudachi' morphological analyzer and the 'NEologd' (Neologism dictionary for 'MeCab'). These features are specific to Japanese and are not implemented in 'ICU' (International Components for Unicode).

Author(s)

Maintainer: Akiru Kato paithiov909@gmail.com

Other contributors:

  • Koki Takahashi (Author of japanese.js) [copyright holder]

  • Shuhei Iitsuka (Author of budoux) [copyright holder]

  • Taku Kudo (Author of TinySegmenter) [copyright holder]

See Also

Useful links:


paithiov909/audubon documentation built on Sept. 28, 2024, 8:47 a.m.