cwb_charsets: Character sets supported by CWB

View source: R/cwb.R

cwb_charsetsR Documentation

Character sets supported by CWB

Description

The function returns a character vector with characters sets (charsets) supported by the Corpus Workbench (CWB). The vector is derived from the the CorpusCharset object defined in the header file of the corpus library (CL).

Usage

cwb_charsets()

Details

Early versions of the CWB were developed for "latin1", "utf8" support has been introduced with CWB v3.2. Note that RcppCWB is tested only for "latin1" and "utf8" and that R uses "UTF-8" rather than utf8" (CWB) by convention.

Examples

cwb_charsets()

RcppCWB documentation built on Sept. 24, 2024, 1:08 a.m.