R packages for data quality

What’s already out there

packages
Author

Martin Westgate

Published

April 17, 2026

There are a bunch of R packages available for getting biodiversity data. It is worth saying, first of all, that the data aggregators themselves do a lot of work on data quality. GBIF and the living atlases have systems that tag (and sometimes, alter) records as having various problems. Therefore galah and rgbif are relevant resources here; but that’s a topic for a different time. This page is focussed on R packages that have cleaning biodiversity data as their central purpose, or include a cleaning component as part of some wider workflow.

Table 1: R packages for data quality listed on CRAN
Package Updated Documentation DOI
CoordinateCleaner 24/10/2023 ropensci.github.io/CoordinateCleaner/index.html 10.1111/2041-210X.13152
bdc 24/1/2026 brunobrr.github.io/bdc/index.html 10.1111/2041-210X.13868
BeeBDC 6/2/2026 jbdorey.github.io/BeeBDC/ 10.1038/s41597-023-02626-w
gatoRs 17/5/2024 nataliepatten.github.io/gatoRs/ NA
RuHere 17/2/2026 wevertonbio.github.io/RuHere/ 10.64898/2026.02.02.703373