In TIBHannover/BacDiveR: Programmatic Interface For The Bacterial Diversity Metadatabase by DSMZ

knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>"
)

Unfortunately, the BacDive Web Service does not allow SQL-like queries for the content of specific fields within the strain's datasets. If you find the functionality explained in BacDive-ing in too limited, please try the following, semi-automatic approach to using BacDiveR.

Visit BacDive.DSMZ.de/AdvSearch and prepare your search in that web interface. It enables SQL-like searches against the ca. 135 of BacDive's accessible data fields.

Run your advanced search (query). The example below searches for all strains that type a species pathogenic to both plant and human. Note the results list with the "hits" on the right, and the now much longer URL. It contains/encodes all the terms and parameters of your advanced search.

Copy the URL of the results page from your browser's address bar. Alternatively, copy it from the "Download list of BacDive Ids" link to the top right of the "hits" list.
Paste the copied URL into a call to the data <- bd_retrieve_by_search("…") function.
Enjoy the list of downloaded datasets, just as you would after using data <- bd_retrieve_data(searchTerm = ..., searchType = ...).

Mass-downloading datasets

bd_retrieve_data(searchTerm = …, searchType = "taxon") can be used to download all datasets for the genus or a specific species given in …. Broader searches are possible through the advanced search, for example for all Archaea:

Archaea_data <- bd_retrieve_by_search("https://bacdive.dsmz.de/advsearch?advsearch=search&site=advsearch&searchparams%5B70%5D%5Bcontenttype%5D=text&searchparams%5B70%5D%5Btypecontent%5D=contains&searchparams%5B70%5D%5Bsearchterm%5D=archaea")

Please note the messages about estimated download times for such large downloads.

Storing datasets offline

This is not a BacDiveR feature, but base R's saveRDS() is particularly useful for offline-storage of lots of search results, because re-downloading them would take rather long. Continuing the Archaea example, the following code writes the dataset to a file, loads it again, and verifies the data integrity:

saveRDS(Archaea_data, "Archaea.rds", version = 3)
Archaea_data_stored <- readRDS("Archaea.rds")
identical(Archaea_data, Archaea_data_stored)

TIBHannover/BacDiveR documentation built on June 2, 2022, 2:51 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

TIBHannover/BacDiveR
Programmatic Interface For The Bacterial Diversity Metadatabase by DSMZ

In TIBHannover/BacDiveR: Programmatic Interface For The Bacterial Diversity Metadatabase by DSMZ

Mass-downloading datasets

Storing datasets offline

R Package Documentation

Browse R Packages

We want your feedback!

TIBHannover/BacDiveR Programmatic Interface For The Bacterial Diversity Metadatabase by DSMZ

In TIBHannover/BacDiveR: Programmatic Interface For The Bacterial Diversity Metadatabase by DSMZ

Mass-downloading datasets

Storing datasets offline

R Package Documentation

Browse R Packages

We want your feedback!

TIBHannover/BacDiveR
Programmatic Interface For The Bacterial Diversity Metadatabase by DSMZ