stri_unescape_unicode: Un-escape All Escape Sequences

View source: R/escape.R

stri_unescape_unicodeR Documentation

Un-escape All Escape Sequences

Description

Un-escapes all known escape sequences.

Usage

stri_unescape_unicode(str)

Arguments

str

character vector

Details

Uses ICU's facilities to un-escape Unicode character sequences.

The following escape sequences are recognized: \a, \b, \t, \n, \v, \?, \e, \f, \r, \", \', \\, \uXXXX (4 hex digits), \UXXXXXXXX (8 hex digits), \xXX (1-2 hex digits), \ooo (1-3 octal digits), \cX (control-X; X is masked with 0x1F). For \xXX and \ooo, beware of non-valid UTF-8 byte sequences.

Note that some versions of R on Windows cannot handle characters defined with \UXXXXXXXX.

Value

Returns a character vector. If an escape sequence is ill-formed, the result will be NA and a warning will be given.

Author(s)

Marek Gagolewski and other contributors

See Also

The official online manual of stringi at https://stringi.gagolewski.com/

Gagolewski M., stringi: Fast and portable character string processing in R, Journal of Statistical Software 103(2), 2022, 1-59, \Sexpr[results=rd]{tools:::Rd_expr_doi("10.18637/jss.v103.i02")}

Other escape: stri_escape_unicode()

Examples

stri_unescape_unicode('a\\u0105!\\u0032\\n')


stringi documentation built on May 29, 2024, 8:16 a.m.