golden_rules: Sentence Boundary Disambiguation Edge Cases

Description Usage Format Details References

Description

A slightly filtered dataset containing Dias's sentence boundary disambiguation edge cases. This is a nested data set with the outcome column as a nested list of desired splits. The non-ASCII cases and spaced ellipsis examples have been removed.

Usage

1

Format

A data frame with 45 rows and 3 variables

Details

References

Dias, Kevin S. 2015. Golden Rules (English). Retrieved: https://s3.amazonaws.com/tm-town-nlp-resources/golden_rules.txt


textshape documentation built on May 29, 2021, 1:07 a.m.