huber: Comparative vocabulary for indigenous languages of Colombia...
In cysouw/qlcMatrix: Utility Sparse Matrix Functions for Quantitative Language Comparison

huber

R Documentation

Comparative vocabulary for indigenous languages of Colombia (Huber & Reed 1992)

Description

Data from Huber & Reed (1992), containing a comparative vocabulary (a ‘wordlist’) for 69 indigenous languages from Colombia.

Usage

data(huber)

Format

A data frame with 27521 observations on the following 4 variables.

CONCEPT: a factor with 366 levels, indicating the comparative concepts
COUNTERPART: a character vector listing the actual wordforms as described in Huber & Reed 1992
DOCULECT: a factor with 71 levels, indicating the languages from which the wordforms are taken (‘documented lects’, abbreviated as ‘doculect’). These are 69 indigenous languages from Colombia, and English and Spanish.
TOKENS: a tokenized version of the counterparts: spaces are added between graphemic units (i.e. groups of unicode characters that are functioning as a single unit in the orthography)

Details

The editors have attempted to use a harmonized orthography throughout all languages, approximately based on IPA, though there are still many language-specific idiosyncrasies included. However, the translations into English and Spanish are written in their regular orthography, and not in the IPA-dialect as used for the other languages. In general, the ‘translations’ into English and Spanish are simply lowercase versions of the concept-names, included here to more flexibly identify the meaning of words in the Colombian languages. In many cases these translations are somewhat clunky (e.g. ‘spring of water’), and are missing the proper orthography details (e.g. ‘Adams apple’).

The book was digitized in the QuantHistLing project and provided here as an example of dealing efficiently with reasonably large data. Care has been taken to faithfully represent the original transcription from the printed version.

Source

Huber, Randall Q. & Robert B. Reed. 1992. Vocabulario Comparativo: Palabras Selectas de Lenguas Indigenas de Colombia. Bogota: Instituto Linguistico de Verano. available online at https://colombia.sil.org/es/resources/archives/18886. Copyright 2014 SIL International.

Examples

data(huber)
head(huber)

cysouw/qlcMatrix documentation built on July 3, 2024, 8:44 p.m.

cysouw/qlcMatrix index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

cysouw/qlcMatrix
Utility Sparse Matrix Functions for Quantitative Language Comparison

huber: Comparative vocabulary for indigenous languages of Colombia...
In cysouw/qlcMatrix: Utility Sparse Matrix Functions for Quantitative Language Comparison

Comparative vocabulary for indigenous languages of Colombia (Huber & Reed 1992)

Description

Usage

Format

Details

Source

Examples

Related to huber in cysouw/qlcMatrix...

R Package Documentation

Browse R Packages

We want your feedback!

cysouw/qlcMatrix Utility Sparse Matrix Functions for Quantitative Language Comparison

huber: Comparative vocabulary for indigenous languages of Colombia... In cysouw/qlcMatrix: Utility Sparse Matrix Functions for Quantitative Language Comparison

Comparative vocabulary for indigenous languages of Colombia (Huber & Reed 1992)

Description

Usage

Format

Details

Source

Examples

Related to huber in cysouw/qlcMatrix...

R Package Documentation

Browse R Packages

We want your feedback!

cysouw/qlcMatrix
Utility Sparse Matrix Functions for Quantitative Language Comparison

huber: Comparative vocabulary for indigenous languages of Colombia...
In cysouw/qlcMatrix: Utility Sparse Matrix Functions for Quantitative Language Comparison