searchDatasets: Search dataset metadata

searchDatasets    R Documentation

Description

Search for datasets of interest based on matching text in the associated metadata.

Usage

searchDatasets(
  query,
  cache = cacheDirectory(),
  overwrite = FALSE,
  latest = TRUE
)

Arguments

query

A string containing the text to search for, or a gypsum.search.object such as one created by defineTextQuery; see Examples.

cache, overwrite

Arguments to pass to fetchMetadataDatabase.

latest

Whether to only consider the latest version of each dataset.
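
As a minimal sketch of how latest affects the search, the snippet below runs the same query with and without the restriction to the latest version of each dataset. It assumes that the scRNAseq package is installed and that the metadata database can be downloaded; the variable names are illustrative only.

```r
library(scRNAseq)

# Default behaviour: only the latest version of each dataset is considered.
latest_only <- searchDatasets("brain")

# Consider every available version of each dataset instead.
all_versions <- searchDatasets("brain", latest=FALSE)

# The latest-only result should never have more rows than the all-versions one.
nrow(latest_only)
nrow(all_versions)
```

Setting latest=FALSE may be useful when an analysis needs to refer to an older version of a dataset.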

Details

The returned DataFrame contains standard metadata fields for each dataset, such as its title and description, the number of rows and columns, the organisms and genome builds involved, and whether the dataset has any pre-computed reduced dimensions. More details can be found in the Bioconductor metadata schema at https://github.com/ArtifactDB/bioconductor-metadata-index.

Value

A DataFrame where each row corresponds to a dataset and each column holds one metadata field. Some columns may be lists, to capture one-to-many mappings.
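
The following sketch shows one possible way to find out which columns of the result are list-like, without assuming any particular column names. It assumes that scRNAseq is installed and that the metadata database is reachable; is_listy and list_cols are illustrative names, not part of the package.

```r
library(scRNAseq)

res <- searchDatasets("brain")

# Flag list-like columns (base lists or S4Vectors List objects)
# by checking each column in turn.
is_listy <- vapply(
    colnames(res),
    function(nm) is.list(res[[nm]]) || is(res[[nm]], "List"),
    logical(1)
)
list_cols <- colnames(res)[is_listy]
list_cols

# Inspect the first list-like column for the first dataset, if there is one.
if (length(list_cols) > 0 && nrow(res) > 0) {
    res[[list_cols[1]]][[1]]
}
```

Each element of a list-like column may contain zero, one or several values for the corresponding dataset, so such columns usually need to be handled element by element.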

Author(s)

Aaron Lun

See Also

surveyDatasets, to easily obtain a listing of all available datasets.

Examples

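# Search the metadata for a plain string.
searchDatasets("brain")[,c("name", "title")]

# Partial match, where '%' acts as a wildcard.
searchDatasets(defineTextQuery("Neuro%", partial=TRUE))[,c("name", "title")]

# Restrict the search to one field (10090 is the NCBI taxonomy ID for mouse).
searchDatasets(defineTextQuery("10090", field="taxonomy_id"))[,c("name", "title")]

# Combine queries with & (AND) and | (OR).
searchDatasets(
   defineTextQuery("GRCm38", field="genome") &
   (defineTextQuery("neuro%", partial=TRUE) |
    defineTextQuery("pancrea%", partial=TRUE))
)[,c("name", "title")]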


LTLA/scRNAseq documentation built on April 24, 2024, 5:58 p.m.