stringdist: Approximate String Matching and String Distance Functions

Implements an approximate string matching version of R's native 'match' function. Can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal sting alignment), qgrams (q- gram, cosine, jaccard distance) or heuristic metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding or between integer vectors representing generic sequences.

Package details

AuthorMark van der Loo [aut, cre], Jan van der Laan [ctb], R Core Team [ctb], Nick Logan [ctb]
Date of publication2016-12-16 15:25:23
MaintainerMark van der Loo <>
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:

Try the stringdist package in your browser

Any scripts or data that you put into this service are public.

stringdist documentation built on May 29, 2017, 7:55 p.m.