chimera_detect: Detect multilingual texts

Description Usage Arguments Value

View source: R/deduplication_functions.R

Description

Detects texts that contain two languages in discrete blocks, such as a translation of a title

Usage

1
chimera_detect(text, overlap = 0.5)

Arguments

text

the text to check for chimeras

overlap

how large of a window to check for chimeras; an overlap of .5 will split the text into two chunks split at the middle, whereas higher levels of overlap will result in larger chunks to check

Value

if a chimera is detected, the language codes of contained languages


elizagrames/synthesisr documentation built on May 26, 2019, 10:34 a.m.