Unicode Character Names
Find the names or labels of Unicode characters, or Unicode characters by their name.
1 2 3
an R object which can be coerced to a
arguments to be passed to
The Unicode Standard provides a convention for labeling code points
that do not have character names (control, reserved, noncharacter,
private-use and surrogate code points). These labels can be obtained
By default, exact matching is used for finding Unicode characters by
type = "grep",
grepl is used for
x against the Unicode character names; for now, Hangul
syllable and CJK Unified Ideograph names are ignored in this case.
u_char_label, a character vector
with the names or labels, respectively, of the corresponding Unicode
u_char object giving the
Unicode characters with name exactly matching the given names.
1 2 3 4 5 6 7 8 9 10 11 12 13
x <- as.u_char(utf8ToInt("Austria")) u_char_name(x) ## Derived Hangul syllable character names are also supported for ## finding characters by exact matching: x <- u_char_name("0xAC00") x u_char_from_name(x) ## Find all Unicode characters with name matching 'DIGIT ONE'. x <- u_char_from_name("\\bDIGIT ONE\\b", "g") ## And show their names. u_char_name(x)