qlcData-package: Processing data for quantitative language comparison (QLC)

Description Details Author(s)

Description

The package offers various functions to read, transcode and process data. There are many different function to read in data. Also a general framework to recode nominal data is included. Further, there is a general approach to describe orthographic systems through so-called Orthography Profiles. It offers functions to write such profiles based on some actual written text, and to test and correct profiles given concrete data. The main end-use is to produce tokenized texts in so-called tailored grapheme clusters.

Details

Package: qlcData
Type: Package
Version: 0.2.1
Date: 2018-01-05
License: GPL-3

Various functions to read specific data formats of QLC are documented in read_align, read.profile, read.recoding.

The recode function allows for an easy and transparent way to specify a recoding of an existing nominal dataset. The specification of the recoding-decisions is preferably saved in an easily accessible YAML-file. There are utility function write.profile for writing and reading such files included.

For processing of strings using orthography profiles, the central function is tokenize. A basic sceleton for an orthography profile can be produced with write.profile

Author(s)

Michael Cysouw <cysouw@mac.com>


qlcData documentation built on May 2, 2019, 8:29 a.m.