csps-efpc/TokenLink: Joins two dataframes using tokens or like words

Joins two dataframes of records. It can look at multiple columns for comparison. TokenLink mainly looks at the words two columns have in common, taking into account word frequency but not word order. This technique is best used on datasets with multiple words in a single column, or on datasets with a single word in a column where one word is much more common than the others. An example might be City, with Toronto being very common and Blackburn Hamlet being uncommon.
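
The idea of matching on shared words, weighted by how rare each word is, can be sketched in base R. This is only an illustration under stated assumptions, not TokenLink's own API or algorithm: the toy data, the helper names (tokenize, score_pair), and the log-based rarity weight are all hypothetical choices made for this example.

```r
# Hypothetical toy data; not taken from the TokenLink package.
a <- data.frame(id_a = 1:3,
                city = c("Toronto", "Blackburn Hamlet Ottawa", "Toronto East"),
                stringsAsFactors = FALSE)
b <- data.frame(id_b = 1:3,
                city = c("hamlet blackburn", "toronto", "east york"),
                stringsAsFactors = FALSE)

# Split one string into lower-case word tokens, dropping empties and repeats.
tokenize <- function(x) {
  tokens <- unlist(strsplit(tolower(x), "[^a-z0-9]+"))
  unique(tokens[nzchar(tokens)])
}

tokens_a <- lapply(a$city, tokenize)
tokens_b <- lapply(b$city, tokenize)

# Count how many records (across both tables) contain each token.
counts <- table(unlist(c(tokens_a, tokens_b)))
n_records <- nrow(a) + nrow(b)

# Assumed weighting: rare tokens count for more than common ones.
weight <- log(n_records / as.vector(counts))
names(weight) <- names(counts)

# Score a pair of records by the summed weight of the tokens they share.
# Word order plays no part, because only set intersection is used.
score_pair <- function(ta, tb) sum(weight[intersect(ta, tb)])

scores <- outer(seq_along(tokens_a), seq_along(tokens_b),
                Vectorize(function(i, j) score_pair(tokens_a[[i]], tokens_b[[j]])))

# For each row of a, keep the highest-scoring row of b.
best_b <- apply(scores, 1, which.max)
matched <- data.frame(id_a = a$id_a, city_a = a$city,
                      id_b = b$id_b[best_b], city_b = b$city[best_b],
                      score = scores[cbind(seq_len(nrow(a)), best_b)])
print(matched)
```

In this toy data "toronto" appears in three records and "blackburn" in two, so a shared "blackburn" contributes more to a pair's score than a shared "toronto". The second row of a still matches "hamlet blackburn" even though the words appear in a different order.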

Package details

Maintainer
License: MIT + file LICENSE
Version: 0.0.0.9000
Package repository: GitHub (csps-efpc/TokenLink)
Installation

Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("csps-efpc/TokenLink")
csps-efpc/TokenLink documentation built on Feb. 10, 2023, 3:30 a.m.