stri_numbytes | R Documentation |
Counts the number of bytes needed to store each string in the computer's memory.
stri_numbytes(str)
str |
character vector or an object coercible to |
Often, this is not the function you would normally use
in your string processing activities. See stri_length
instead.
For 8-bit encoded strings, this is the same as stri_length
.
For UTF-8 strings, the returned values may be greater
than the number of code points, as UTF-8 is not a fixed-byte encoding:
one code point may be encoded by 1-4 bytes
(according to the current Unicode standard).
Missing values are handled properly.
The strings do not need to be re-encoded to perform this operation.
The returned values do not include the trailing NUL bytes, which are used internally to mark the end of string data (in C).
Returns an integer vector of the same length as str
.
Marek Gagolewski and other contributors
The official online manual of stringi at https://stringi.gagolewski.com/
Gagolewski M., stringi: Fast and portable character string processing in R, Journal of Statistical Software 103(2), 2022, 1-59, \Sexpr[results=rd]{tools:::Rd_expr_doi("10.18637/jss.v103.i02")}
Other length:
%s$%()
,
stri_isempty()
,
stri_length()
,
stri_pad_both()
,
stri_sprintf()
,
stri_width()
stri_numbytes(letters)
stri_numbytes(c('abc', '123', '\u0105\u0104'))
## Not run:
# this used to fail on Windows, where there were no native support
# for 4-bytes Unicode characters; see, however, stri_unescape_unicode():
stri_numbytes('\U001F600') # compare stri_length('\U001F600')
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.