Description Usage Arguments Details Value Note Author(s) See Also Examples
Japanese characters in a string or character vector are romanized with
the their sounds for the English-speaking world. While
kakasi
in Nippon package works for romanization of
Japanese, alternative romanization of Japanese is limitedly available
with kana2roma
. Unlike the kakasi
function,
kana2roma
works without any help of an external library.
1 2 |
x |
A character vector including Japanese Hiragana or Katakana |
type |
A character string specifying the type of romanization. Default is "Hepburn" |
cap |
logical. Capital letters to be uppercased, Default is FALSE |
ascii.only |
logical. Transcribed with ASCII characters only. Default is TRUE |
Japanese strings are often made up a mixture of Chinese characters
(Kanji), Kana (Hiragana and Katakana) and Romaji (Latin phonetical
pronunciation). kana2roma
transcribes Kana to Romaji without
any help of external programs, such as kakasi. It should be useful
especially when users want to sanitize and make readable Japanese
strings in data set for the English-speaking world. The function
supports three main romanization systems. Although the Nihon-shiki
(ISO3602 Strict) is the official system in Japan, Hepburn is most
widely used especially for proper noun, and officially adopted in
naming systems for railway station and roads. A variant of Hepburn is
authorized by the Japanese Foreign Ministry for use in passports.
For place names or other proper nouns, set “cap = TRUE
” in
kana2roma
(default is FALSE) to capitalize the first letters
in Romaji strings.
Set “ascii.only = TRUE
” in kana2roma
(this is default)
if a user needs to suppress non-ASCII Romaji. Otherwise, a pure
romanization system may return values with non-ASCII codes, that is,
macron.
A character vector
kana2roma
supports only Kana (Hiragana and Katakana). All other
characters are just ignored and output as it is. If users need convert
from Kanji to Romaji, use kakasi
instead of
kana2roma
.
Rigidly, there are many variants of the three main romanization systems with small differences. Yet another romanization is used in an input methods engine of computers. Since the function strictly and simply follows the three romanization systems, some Kana characters may be failed due to lack of authorized conversion rules. Yet, some unsupported conversion rules will be implemented as optional in the future.
Susumu Tanimura aruminat@gmail.com
See Also as kakasi
.
1 2 3 |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.