stri_enc_mark: Get Declared Encodings of Each String
In stringi: Fast and Portable Character String Processing Facilities

stri_enc_mark

R Documentation

Get Declared Encodings of Each String

Description

Reads declared encodings for each string in a character vector as seen by stringi.

Usage

stri_enc_mark(str)

Arguments

str

character vector or an object coercible to a character vector

Details

According to Encoding, R has a simple encoding marking mechanism: strings can be declared to be in latin1, UTF-8 or bytes.

Moreover, we may check (via the R/C API) whether a string is in ASCII (R assumes that this holds if and only if all bytes in a string are not greater than 127, so there is an implicit assumption that your platform uses an encoding that extends ASCII) or in the system's default (a.k.a. unknown in Encoding) encoding.

Intuitively, the default encoding should be equivalent to the one you use on stdin (e.g., your 'keyboard'). In stringi we assume that such an encoding is equivalent to the one returned by stri_enc_get. It is automatically detected by ICU to match – by default – the encoding part of the LC_CTYPE category as given by Sys.getlocale.

Value

Returns a character vector of the same length as str. Unlike in the Encoding function, here the possible encodings are: ASCII, latin1, bytes, native, and UTF-8. Additionally, missing values are handled properly.

This gives exactly the same data that is used by all the functions in stringi to re-encode their inputs.

Author(s)

Marek Gagolewski and other contributors

stringi
Fast and Portable Character String Processing Facilities

stri_enc_mark: Get Declared Encodings of Each String
In stringi: Fast and Portable Character String Processing Facilities

Get Declared Encodings of Each String

Description

Usage

Arguments

Details

Value

Author(s)

See Also

Related to stri_enc_mark in stringi...

R Package Documentation

Browse R Packages

We want your feedback!

stringi Fast and Portable Character String Processing Facilities

stri_enc_mark: Get Declared Encodings of Each String In stringi: Fast and Portable Character String Processing Facilities

Get Declared Encodings of Each String

Description

Usage

Arguments

Details

Value

Author(s)

See Also

Related to stri_enc_mark in stringi...

R Package Documentation

Browse R Packages

We want your feedback!

stringi
Fast and Portable Character String Processing Facilities

stri_enc_mark: Get Declared Encodings of Each String
In stringi: Fast and Portable Character String Processing Facilities