str_jnormalize: Converts characters following the rules of 'neologd'

View source: R/normalize-str.R

str_jnormalizeR Documentation

Converts characters following the rules of 'neologd'

Description

Converts characters following the rules of 'neologd'

Usage

str_jnormalize(str)

Arguments

str

Input vector.

Details

Converts the characters into normalized style basing on rules that is recommended by the Neologism dictionary for MeCab.

Value

a character

See Also

https://github.com/neologd/mecab-ipadic-neologd/wiki/Regexp.ja

Examples

str_jnormalize(
  paste0(
    "    \uff30",
    "\uff32\uff2d\uff2c\u300    \u526f    \u8aad    \u672c      "
  )
)
str_jnormalize(
  paste0(
    "\u5357\u30a2\u30eb\u30d7\u30b9\u306e\u3000\u5929\u7136\u6c34",
    "-\u3000\uff33\uff50\uff41\uff52\uff4b\uff49\uff4e\uff47\u3000",
    "\uff2c\uff45\uff4d\uff4f\uff4e\u3000\u30ec\u30e2\u30f3\u4e00\u7d5e\u308a"
 )
)

uribo/zipangu documentation built on Feb. 27, 2023, 11:37 p.m.