The package offers various functions to read, transcode and process data. There are many different function to read in data. Also a general framework to recode nominal data is included. Further, there is a general approach to describe orthographic systems through so-called Orthography Profiles. It offers functions to write such profiles based on some actual written text, and to test and correct profiles given concrete data. The main end-use is to produce tokenized texts in so-called tailored grapheme clusters.
Package: | qlcData |
Type: | Package |
Version: | 0.2.1 |
Date: | 2018-01-05 |
License: | GPL-3 |
Various functions to read specific data formats of QLC are documented in read_align
, read.profile
, read.recoding
.
The recode
function allows for an easy and transparent way to specify a recoding of an existing nominal dataset. The specification of the recoding-decisions is preferably saved in an easily accessible YAML-file. There are utility function write.profile
for writing and reading such files included.
For processing of strings using orthography profiles, the central function is tokenize
. A basic sceleton for an orthography profile can be produced with write.profile
Michael Cysouw <cysouw@mac.com>
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.