Provides functions to aid the identification of probable/possible duplicates in Plant Genetic Resources (PGR) collections using 'passport databases' comprising of information records of each constituent sample. These include methods for cleaning the data, creation of a searchable Key Word in Context (KWIC) index of keywords associated with sample records and the identification of nearly identical records with similar information by fuzzy, phonetic and semantic matching of keywords.
|Author||J. Aravind [aut, cre] (0000-0002-4791-442X), J. Radhamani [aut], Kalyani Srinivasan [aut], B. Ananda Subhash [aut], R. K. Tyagi [aut], ICAR-NBGPR [cph], Maurice Aubrey [ctb] (Double Metaphone), Kevin Atkinson [ctb] (Double Metaphone), Lawrence Philips [ctb] (Double Metaphone)|
|Date of publication||2018-01-13 06:21:29 UTC|
|Maintainer||J. Aravind <[email protected]>|
|License||GPL-2 | GPL-3|
|URL||https://cran.r-project.org/package=PGRdup https://github.com/aravind-j/PGRdup https://doi.org/10.5281/zenodo.841963 https://aravind-j.github.io/PGRdup/ https://www.rdocumentation.org/packages/PGRdup|
|Package repository||View on CRAN|
Install the latest version of this package by entering the following in R:
Any scripts or data that you put into this service are public.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.