quallmer.app: Interactive Validation App for 'quallmer'

Companion package to 'quallmer' providing an interactive 'shiny' application for manual coding, reviewing large language model (LLM) generated annotations, and computing inter-rater reliability metrics. Supports three modes: blind manual coding, LLM output validation, and agreement calculation. Computes standard reliability metrics including Krippendorff's alpha (Krippendorff 2019 <doi:10.4135/9781071878781>), Cohen's kappa, Fleiss' kappa (Fleiss 1971 <doi:10.1037/h0031619>), intraclass correlation coefficient (ICC), and percent agreement for nominal, ordinal, interval, and ratio data. Also computes gold-standard validation metrics including accuracy, precision, recall, and F1 scores following Sokolova and Lapalme (2009 <doi:10.1016/j.ipm.2009.03.002>).

Getting started

Package details

AuthorSeraphine F. Maerz [aut, cre] (ORCID: <https://orcid.org/0000-0002-7173-9617>), Kenneth Benoit [aut] (ORCID: <https://orcid.org/0000-0002-0797-564X>)
MaintainerSeraphine F. Maerz <seraphine.maerz@unimelb.edu.au>
LicenseMIT + file LICENSE
Version0.1.0
URL https://github.com/quallmer/quallmer.app
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:
install.packages("quallmer.app")

Try the quallmer.app package in your browser

Any scripts or data that you put into this service are public.

quallmer.app documentation built on March 8, 2026, 5:06 p.m.