R/OBJECT_germeval2018_train.R

#' GermEval-2018 train data
#'
#'This file represents the training data from the shared task. 
#'It contains the annotation for both tasks of the shared task.
#'The data have been taken from this repo \url{https://github.com/uds-lsv/GermEval-2018-Data}.
#'
#' @docType data
#'
#' @usage data("germeval_train")
#'
#'
#' @format A data frame containing 5009 rows and 4 columns
#' \describe{
#'   \item{id}{id}
#'   \item{pretextis}{text, similar to a tweet}
#'   \item{c1}{classification of text as "OFFENSE" or "OTHER"}
#'   \item{c2}{classification of text as "ABUSE, "INSULT", "PROFANITY" or "OTHER"}
#'  }
#' @details More details can be found here: \url{https://github.com/uds-lsv/GermEval-2018-Data/blob/master/guidelines-iggsa-shared.pdf}.
#' licenced under CC-BY 4.0
#'
#' @source Please cite this dataset as: 
#' 'Michael Wiegand, Melanie Siegel, and Josef Ruppenhofer: 
#' "Overview of the GermEval 2018 Shared Task on the Identification of Offensive Language", 
#' in Proceedings of the GermEval, 2018, Vienna, Austria.'



"germeval_train"
sebastiansauer/pradadata documentation built on Nov. 6, 2023, 11:32 a.m.