uniqtag: Abbreviate Strings to Short, Unique Identifiers

For each string in a set of strings, determine a unique tag that is a substring of fixed size k unique to that string, if it has one. If no such unique substring exists, the least frequent substring is used. If multiple unique substrings exist, the lexicographically smallest substring is used. This lexicographically smallest substring of size k is called the "UniqTag" of that string.

Package details

AuthorShaun Jackman [cre]
MaintainerShaun Jackman <[email protected]>
LicenseMIT + file LICENSE
URL https://github.com/sjackman/uniqtag
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:

Try the uniqtag package in your browser

Any scripts or data that you put into this service are public.

uniqtag documentation built on May 1, 2019, 6:35 p.m.