Compute total and cumulative corpus coverage fraction of a dictionary.
word_coverage(object, corpus, ...)

## S3 method for class 'sbo_dictionary'
word_coverage(object, corpus, ...)

## S3 method for class 'character'
word_coverage(object, corpus, .preprocess = identity, EOS = "", ...)

## S3 method for class 'sbo_kgram_freqs'
word_coverage(object, corpus, ...)

## S3 method for class 'sbo_predictions'
word_coverage(object, corpus, ...)
object: either a character vector, or an object inheriting from one of the classes with a method listed in the usage above.
corpus: a character vector.
...: further arguments passed to or from other methods.
.preprocess: preprocessing function for training corpus.
EOS: a length one character vector. String containing End-Of-Sentence characters.
This function computes the corpus coverage fraction of a dictionary, that is, the fraction of words appearing in corpus that are contained in the original dictionary.
This function is a generic, accepting as its object argument any object storing a dictionary, along with a preprocessing function and a list of End-Of-Sentence characters. This includes all sbo main classes. When object is a character vector, the preprocessing function and the End-Of-Sentence characters must be specified explicitly.
The coverage fraction is computed cumulatively, so the dependence of coverage on the maximal word rank can be explored (see examples below).
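To make the notion of cumulative coverage concrete, the following base R sketch computes, for each rank k, the fraction of corpus words found among the first k dictionary entries. It is only an illustration under stated assumptions: `coverage_sketch` is a hypothetical helper, not part of sbo, and it uses naive lower-casing and whitespace tokenization instead of the preprocessing function and End-Of-Sentence handling described above.

```r
# Hypothetical helper (not part of sbo): cumulative coverage of a ranked
# dictionary. Assumes `dict` is ordered by rank and has no duplicates.
coverage_sketch <- function(dict, corpus) {
  # Naive tokenization (assumption): lower-case, then split on whitespace.
  words <- unlist(strsplit(tolower(corpus), "\\s+"))
  words <- words[nzchar(words)]
  # Rank of each corpus word in the dictionary; NA if the word is absent.
  ranks <- match(words, dict)
  # Number of corpus words falling on each dictionary rank.
  counts <- tabulate(ranks[!is.na(ranks)], nbins = length(dict))
  # Fraction of corpus words covered by the first k entries, k = 1..length(dict).
  cumsum(counts) / length(words)
}

dict <- c("the", "cat", "sat")
corpus <- c("the cat sat on the mat", "the dog sat")
cov <- coverage_sketch(dict, corpus)
total <- cov[length(cov)]  # coverage of the full dictionary
```

The last element of the returned vector is the total coverage of the full dictionary; earlier elements show how coverage grows as lower-ranked words are added.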