crf_evaluation: Basic classification evaluation metrics for multi-class...
In crfsuite: Conditional Random Fields for Labelling Sequential Data in Natural Language Processing

crf_evaluation

R Documentation

Basic classification evaluation metrics for multi-class labelling

Description

The accuracy, precision, recall, specificity, F1 measure and support metrics are provided for each label in a one-versus the rest setting.

Usage

crf_evaluation(
  pred,
  obs,
  labels = na.exclude(unique(c(as.character(pred), as.character(obs)))),
  labels_overall = setdiff(labels, "O")
)

Arguments

`pred`	a factor with predictions
`obs`	a factor with gold labels
`labels`	a character vector of possible values that `pred` and `obs` can take. Defaults to the values in the data
`labels_overall`	a character vector of either labels which is either the same as `labels` or a subset of `labels` in order to compute a weighted average of the by-label statistics

Value

a list with 2 elements:

bylabel: data.frame with the accuracy, precision, recall, specificity, F1 score and support (number of occurrences) for each label
overall: a vector containing
- the overall accuracy
- the metrics precision, recall, specificity and F1 score which are weighted averages of these metrics from list element bylabel, where the weight is the support
- the metrics precision, recall, specificity and F1 score which are averages of these metrics from list element bylabel giving equal weight to each label

Examples

pred <- sample(LETTERS, 1000, replace = TRUE)
gold <- sample(LETTERS, 1000, replace = TRUE)
crf_evaluation(pred = pred, obs = gold, labels = LETTERS) 


x <- ner_download_modeldata("conll2002-nl")
x <- crf_cbind_attributes(x, terms = c("token", "pos"), 
                          by = c("doc_id", "sentence_id"))
crf_train <- subset(x, data == "ned.train")
crf_test <- subset(x, data == "testa")
attributes <- grep("token|pos", colnames(x), value=TRUE)
model <- crf(y = crf_train$label, 
             x = crf_train[, attributes], 
             group = crf_train$doc_id, 
             method = "lbfgs") 
             
## Use the model to score on existing tokenised data
scores <- predict(model, 
                  newdata = crf_test[, attributes], 
                  group = crf_test$doc_id)
crf_evaluation(pred = scores$label, obs = crf_test$label)
crf_evaluation(pred = scores$label, obs = crf_test$label, 
  labels = c("O", 
             "B-ORG", "I-ORG", "B-PER", "I-PER", 
             "B-LOC", "I-LOC", "B-MISC", "I-MISC"))
             
         
library(udpipe)
pred <- txt_recode(scores$label, 
                   from = c("B-ORG", "I-ORG", "B-PER", "I-PER", 
                            "B-LOC", "I-LOC", "B-MISC", "I-MISC"),
                   to = c("ORG", "ORG", "PER", "PER", 
                          "LOC", "LOC", "MISC", "MISC"))
obs <- txt_recode(crf_test$label, 
                  from = c("B-ORG", "I-ORG", "B-PER", "I-PER", 
                           "B-LOC", "I-LOC", "B-MISC", "I-MISC"),
                  to = c("ORG", "ORG", "PER", "PER", 
                         "LOC", "LOC", "MISC", "MISC"))
crf_evaluation(pred = pred, obs = obs, 
               labels = c("ORG", "LOC", "PER", "MISC", "O"))

crfsuite documentation built on Sept. 17, 2023, 1:06 a.m.

crfsuite index

README.md Conditional Random Fields for NLP"

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

crfsuite
Conditional Random Fields for Labelling Sequential Data in Natural Language Processing

crf_evaluation: Basic classification evaluation metrics for multi-class...
In crfsuite: Conditional Random Fields for Labelling Sequential Data in Natural Language Processing

Basic classification evaluation metrics for multi-class labelling

Description

Usage

Arguments

Value

Examples

Related to crf_evaluation in crfsuite...

R Package Documentation

Browse R Packages

We want your feedback!

crfsuite Conditional Random Fields for Labelling Sequential Data in Natural Language Processing

crf_evaluation: Basic classification evaluation metrics for multi-class... In crfsuite: Conditional Random Fields for Labelling Sequential Data in Natural Language Processing

Basic classification evaluation metrics for multi-class labelling

Description

Usage

Arguments

Value

Examples

Related to crf_evaluation in crfsuite...

R Package Documentation

Browse R Packages

We want your feedback!

crfsuite
Conditional Random Fields for Labelling Sequential Data in Natural Language Processing

crf_evaluation: Basic classification evaluation metrics for multi-class...
In crfsuite: Conditional Random Fields for Labelling Sequential Data in Natural Language Processing