bdc: Biodiversity Data Cleaning

The bdc package brings together several aspects of biodiversity data cleaning in one place. It is organized into thematic modules, each addressing a different dimension of biodiversity data:

1) Merge datasets: standardization and integration of different datasets.
2) Pre-filter: flagging and removal of invalid or non-interpretable information, followed by data amendments.
3) Taxonomy: cleaning, parsing, and harmonization of scientific names from several taxonomic groups against locally stored taxonomic databases, using exact and partial matching algorithms.
4) Space: flagging of erroneous, suspect, and low-precision geographic coordinates.
5) Time: flagging and, whenever possible, correction of inconsistent collection dates.

In addition, the package contains features to visualize, document, and report data quality, which is essential for making data quality assessment transparent and reproducible. The methodology is described in Ribeiro et al. (2022) <doi:10.1111/2041-210X.13868>.

Package details

Author: Bruno Ribeiro [aut, cre] (<https://orcid.org/0000-0002-7755-6715>), Santiago Velazco [aut] (<https://orcid.org/0000-0002-7527-0967>), Karlo Guidoni-Martins [aut] (<https://orcid.org/0000-0002-8458-8467>), Geiziane Tessarolo [aut] (<https://orcid.org/0000-0003-1361-0062>), Lucas Jardim [aut] (<https://orcid.org/0000-0003-2602-5575>), Steven Bachman [ctb] (<https://orcid.org/0000-0003-1085-6075>), Rafael Loyola [ctb] (<https://orcid.org/0000-0001-5323-2735>)
Maintainer: Bruno Ribeiro <ribeiro.brr@gmail.com>
License: GPL (>= 3)
Version: 1.1.5
URL: https://brunobrr.github.io/bdc/ (website), https://github.com/brunobrr/bdc
Package repository: CRAN
Installation: Install the latest version of this package by entering the following in R:
install.packages("bdc")
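
Once the package is installed, the flagging approach of the Pre-filter module can be sketched as follows. This is a minimal illustration and not an official workflow. The toy records are made up, the Darwin Core column names (`scientificName`, `decimalLatitude`, `decimalLongitude`) are assumptions about the input data, and the function names and arguments are assumed to match the installed version of bdc, so check the package reference before relying on them.

```r
library(bdc)

# Toy occurrence records, invented for illustration only.
# Column names follow Darwin Core (an assumption about the input).
occurrences <- data.frame(
  scientificName   = c("Panthera onca", "", "Puma concolor", "Tapirus terrestris"),
  decimalLatitude  = c(-15.6, -10.2, 95.0, NA),
  decimalLongitude = c(-56.1, -48.3, -47.9, -60.0),
  stringsAsFactors = FALSE
)

# Each test appends one logical flag column; no rows are removed here.
flagged <- bdc_scientificName_empty(
  data = occurrences,
  sci_name = "scientificName"
)
flagged <- bdc_coordinates_empty(
  data = flagged,
  lat = "decimalLatitude",
  lon = "decimalLongitude"
)
flagged <- bdc_coordinates_outOfRange(
  data = flagged,
  lat = "decimalLatitude",
  lon = "decimalLongitude"
)

# Flag columns are the ones whose names start with a dot.
flag_cols <- grep("^\\.", names(flagged), value = TRUE)
print(flagged[, c("scientificName", flag_cols)])
```

Because the tests only add flag columns, the decision to drop or amend records stays with the user and can be documented separately.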

bdc documentation built on April 3, 2025, 10:53 p.m.