Description Usage Format Usage Methods Arguments Examples
BM25 stands for Best Matching 25. It is widely using for ranking documents and a preferred method than TF*IDF scores. It is used to find the similar documents from a corpus, given a new document. It is popularly used in information retrieval systems. This implementation uses multiple cores for faster and parallel computation.
1 |
R6Class
object.
For usage details see Methods, Arguments and Examples sections.
1 2 3 |
$new()
Initialise the instance of the class. Here you pass the complete corpus of the documents
$most_similar()
it returns the topn most similar documents from the corpus
$compute()
it returns a similarity score for all the documents in the corpus, given a sentence
a list containing sentences
boolean value used to activate parallel computation, defaults to FALSE
1 2 3 4 5 6 |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.