View source: R/encoding_conversion.R
| stri_enc_toutf32 | R Documentation |
UTF-32 is a 32-bit encoding where each Unicode code point corresponds to exactly one integer value. This function converts a character vector to a list of integer vectors so that, e.g., individual code points may be easily accessed, changed, etc.
stri_enc_toutf32(str)
str |
a character vector (or an object coercible to) to be converted |
See stri_enc_fromutf32 for a dual operation.
This function is roughly equivalent to a vectorized call
to utf8ToInt(enc2utf8(str)).
If you want a list of raw vectors on output,
use stri_encode.
Unlike utf8ToInt, if ill-formed UTF-8 byte sequences are detected,
a corresponding element is set to NULL and a warning is generated.
To deal with such issues, use, e.g., stri_enc_toutf8.
Returns a list of integer vectors.
Missing values are converted to NULLs.
Marek Gagolewski and other contributors
The official online manual of stringi at https://stringi.gagolewski.com/
Gagolewski M., stringi: Fast and portable character string processing in R, Journal of Statistical Software 103(2), 2022, 1-59, \Sexpr[results=rd]{tools:::Rd_expr_doi("10.18637/jss.v103.i02")}
Other encoding_conversion:
about_encoding,
stri_enc_fromutf32(),
stri_enc_toascii(),
stri_enc_tonative(),
stri_enc_toutf8(),
stri_encode()
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.