conll_get_regions: Extract regions from NER annotations (CoNLL format).

View source: R/ner.R

conll_get_regions    R Documentation

Description

Extract regions from NER annotations (CoNLL format).

Usage

conll_get_regions(x)

Arguments

x

A data.frame, a data.table, or any other object that can be coerced to a data.table. The input table is expected to have the columns "token", "ner", and "cpos".

Examples

x <- data.frame(
  token = c(
    "Die",
    "Bundeskanzlerin",
    "Angela",
    "Merkel",
    "spricht",
    "im",
    "Bundestag",
    "zur",
    "Lage",
    "der",
    "Nation",
    "."
  ),
  ne = c("O", "O", "B-PERS", "I-PERS", "O", "O", "B-ORG", "O", "O", "O", "O", "O"),
  stringsAsFactors = FALSE
)
x[["cpos"]] <- 100L:(100L + nrow(x) - 1L)
tab <- conll_get_regions(x)
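
For readers unfamiliar with the tagging scheme, the following base R sketch illustrates how B-/I-/O tags can be collapsed into corpus-position regions. It is not the cwbtools implementation: the helper bio_to_regions() and the output column names cpos_left, cpos_right and type are hypothetical and chosen for illustration only, so the table returned by conll_get_regions() may be structured differently.

```r
# Hypothetical helper, not part of cwbtools: collapse BIO tags into regions.
bio_to_regions <- function(cpos, tags) {
  # Strip the "B-"/"I-" prefix to get the entity type; "O" becomes "".
  type <- sub("^[BI]-", "", tags)
  type[tags == "O"] <- ""
  # A region starts at every "B-" tag, and at an "I-" tag that does not
  # continue an entity of the same type.
  prev_type <- c("", head(type, -1L))
  starts <- startsWith(tags, "B-") |
    (startsWith(tags, "I-") & type != prev_type)
  inside <- tags != "O"
  # Number the regions consecutively; tokens outside any entity are dropped.
  id <- cumsum(starts)[inside]
  data.frame(
    cpos_left = as.integer(tapply(cpos[inside], id, min)),
    cpos_right = as.integer(tapply(cpos[inside], id, max)),
    type = as.character(tapply(type[inside], id, function(v) v[1L])),
    stringsAsFactors = FALSE
  )
}

# Same tags and corpus positions as in the example above.
tags <- c("O", "O", "B-PERS", "I-PERS", "O", "O", "B-ORG", "O", "O", "O", "O", "O")
regions <- bio_to_regions(cpos = 100L:111L, tags = tags)
print(regions)
```

With these inputs the sketch yields two regions: corpus positions 102 to 103 of type PERS ("Angela Merkel") and position 106 of type ORG ("Bundestag").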

cwbtools documentation built on Nov. 27, 2023, 5:11 p.m.