knitr::opts_chunk$set(collapse = TRUE, comment = "#>")
library(SfL)
library(DT)
library(lme4)
The SfL package was created to accompany the Statistics for Linguistics online workshop in August/September 2021. The workshop was a satellite event to the 3rd Forensic Linguistics Short Course.
This vignette gives an overview of the data sets included in the SfL package. Please refer to the functions vignette for an overview of the included functions.
This is a full list of all data sets currently contained in SfL:
Variables contained within the individual data sets are described in their pertinent documentation.
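The data sets shipped with an installed package can also be listed from the R console. The following is a minimal sketch using base R's data() function; the helper name list_datasets is hypothetical and not part of SfL, and the call returns an empty vector if the package is not installed.

```r
# List the data sets a package ships with, using base R's data().
# `list_datasets` is a hypothetical helper name, not part of SfL.
list_datasets <- function(pkg) {
  if (!requireNamespace(pkg, quietly = TRUE)) {
    return(character(0))  # package not installed: nothing to list
  }
  unname(data(package = pkg)$results[, "Item"])
}

list_datasets("SfL")       # names of the SfL data sets, if SfL is installed
list_datasets("datasets")  # works for any installed package, e.g. base R's 'datasets'

# The variable descriptions are on each data set's help page, e.g.:
# help("data_a", package = "SfL")
```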
The Age and Looks data set, data_a, contains data on age and appearance of fictional individuals.
A possible question to investigate with this data set is whether there is a dependency between different variables of the data set.
# load data
data("data_a")
# number of rows and variables
dim(data_a)
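One way to approach the dependency question is a correlation test. The sketch below runs on simulated data because the variable names of data_a are not listed in this vignette; the columns age and looks are assumptions and should be replaced with the names reported by str(data_a). If the variables of interest turn out to be categorical, chisq.test() on a contingency table would be the analogous base R tool.

```r
# Simulated stand-in for data_a; the column names `age` and `looks` are
# hypothetical and should be replaced with the names shown by str(data_a).
set.seed(1)
d <- data.frame(age = round(runif(50, min = 18, max = 80)))
d$looks <- 10 - 0.05 * d$age + rnorm(50)

str(d)      # variable names and types
summary(d)  # ranges and quartiles

# Pearson correlation test between two numeric variables
ct <- cor.test(d$age, d$looks)
ct$estimate  # correlation coefficient
ct$p.value   # p-value of the test
```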
The Incomplete Neutralization data set, data_c, contains data on the phonological phenomenon of incomplete neutralization. The values are fictional, but closely modeled on real data, namely the data by Roettger et al. (2014).
A possible question to investigate with this data set is whether vowels are shorter when preceding voiceless word-final consonants.
# load data
data("data_c")
# number of rows and variables
dim(data_c)
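Since this vignette loads lme4, a mixed-effects model is one plausible way to address the question. The sketch below is an illustration on simulated data, not the analysis of Roettger et al. (2014); the columns speaker, voicing, and duration are hypothetical, and the effect sizes are invented. It falls back to an ordinary linear model if lme4 is not installed.

```r
# Simulated stand-in for data_c; `speaker`, `voicing`, and `duration` are
# hypothetical column names, and the numbers are made up for illustration.
set.seed(2)
d <- expand.grid(speaker = factor(1:10),
                 voicing = factor(c("voiced", "voiceless")),
                 item = 1:8)
speaker_shift <- rnorm(10, sd = 10)  # per-speaker baseline difference
d$duration <- 150 +
  speaker_shift[as.integer(d$speaker)] +
  ifelse(d$voicing == "voiceless", -8, 0) +
  rnorm(nrow(d), sd = 15)

if (requireNamespace("lme4", quietly = TRUE)) {
  # fixed effect of voicing, random intercept per speaker
  m <- lme4::lmer(duration ~ voicing + (1 | speaker), data = d)
  print(lme4::fixef(m))
} else {
  # fallback without random effects if lme4 is not installed
  m <- lm(duration ~ voicing, data = d)
  print(coef(m))
}
```

The coefficient named voicingvoiceless estimates the duration difference for the voiceless condition relative to the voiced one.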
The Duration of word-final /s/ in English data set, data_s, contains data on three different types of word-final /s/ in English, namely non-morphemic, plural, and is-clitic /s/. The data is fictional, but closely modeled on the original data used in a study by Schmitz et al. (2020).
A possible question to investigate with this data set is whether the duration of /s/ depends on the type of /s/.
# load data
data("data_s")
# number of rows and variables
dim(data_s)
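A question about one numeric outcome across three groups can be approached with a one-way ANOVA. The following sketch uses simulated data; the columns type and duration are hypothetical names, and the group means are arbitrary values chosen for illustration, not findings from the cited study.

```r
# Simulated stand-in for data_s; `type` and `duration` are hypothetical
# column names, and the group means are invented for illustration.
set.seed(3)
types <- c("non-morphemic", "plural", "is-clitic")
d <- data.frame(type = factor(rep(types, each = 30), levels = types))
d$duration <- rnorm(90, mean = c(110, 100, 90)[as.integer(d$type)], sd = 15)

tapply(d$duration, d$type, mean)  # mean duration per /s/ type

# one-way ANOVA: does mean duration differ between the three types?
fit <- aov(duration ~ type, data = d)
summary(fit)
```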
The Tonal Alignment in German data set, data_t, contains data on tonal alignment in German. The values are fictional, but closely modeled on real data, namely the data by Schmitz (2018).
A possible question to investigate with this data set is whether the frequency of low points differs by syllable structure.
# load data
data("data_t")
# number of rows and variables
dim(data_t)
The Vowel Shortening in German data set, data_v, contains data on the duration of vowels in open, simplex, and complex syllables in German. The data is fictional, but closely modeled on the original data used by Schmitz et al. (2018).
A possible question to investigate with this data set is whether vowel duration differs by syllable structure.
# load data
data("data_v")
# number of rows and variables
dim(data_v)
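With three syllable structures, it is often useful to know which pairs of structures differ, not only whether any difference exists. The sketch below shows pairwise comparisons with Tukey's HSD in base R on simulated data; the columns structure and duration are hypothetical names, and the group means are arbitrary.

```r
# Simulated stand-in for data_v; `structure` and `duration` are hypothetical
# column names, and the group means are invented for illustration.
set.seed(4)
structures <- c("open", "simplex", "complex")
d <- data.frame(structure = factor(rep(structures, each = 25),
                                   levels = structures))
d$duration <- rnorm(75, mean = c(120, 105, 95)[as.integer(d$structure)],
                    sd = 12)

fit <- aov(duration ~ structure, data = d)

# Tukey's HSD: all pairwise differences with adjusted confidence intervals
tk <- TukeyHSD(fit)
tk$structure
```

Each row of the result is one pair of syllable structures, with the estimated difference, its confidence bounds, and an adjusted p-value.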
Roettger, T. B., Winter, B., Kirby, J., Grawunder, S., & Grice, M. (2014). Assessing incomplete neutralization of final devoicing in German. Journal of Phonetics, 43, 11-25.
Schmitz, D. (2018). Tonal Alignment and Segmental Composition in German. Master's Thesis. Universität zu Köln, Köln, Germany.
Schmitz, D., & Esser, J. (2021). SfL: Statistics for Linguistics. R package version 0.2. URL: https://github.com/dosc91/SfL
Schmitz, D., Baer-Henney, D., & Plag, I. (submitted). The duration of word-final /s/ differs across morphological categories in English: Evidence from pseudowords. Revision invited by Phonetica.
Schmitz, D., Cho, H.-E., & Niemann, H. (2018). Vowel shortening in German as a function of syllable structure. Proceedings of 13. Phonetik und Phonologie Tagung. Humboldt University Berlin, Berlin, 28-29 September, 2017.
Please message the author at contact@dominicschmitz.com in case of any questions, errors or ideas.