get.sample.attributes.INSDC: Downloads all sequence sample attributes

Description Usage Arguments Details Value Author(s) See Also Examples

View source: R/Get_SequenceData_INSDC.R

Description

Downloads all sample attributes (that is, additional environmental or other associated data) from INSDC. Note: requires a user-specified API-key to acces the INSDC databases. see https://ncbiinsights.ncbi.nlm.nih.gov/2017/11/02/new-api-keys-for-the-e-utilities/ to generate an API-key

Usage

1
get.sample.attributes.INSDC(sampleID, apiKey, BioPrjct)

Arguments

sampleID

a list. a list of one ore more SRA sample IDs (that is "Run" numbers). This argument can be left blank if input is provided for the BioPrjct argument (see further)

apiKey

a character string. A personal API-key to the access the NCBI databases, and required to use the Entrez Programming Utilities (E-utilities). An API-key (API stands for application programming interface) is a unique identifier used to authenticate a user. A personal API-key can easily be generated at requested at https://ncbiinsights.ncbi.nlm.nih.gov/2017/11/02/new-api-keys-for-the-e-utilities/

BioPrjct

a character string or a vector with character strings. Providing the associated BioProject numbers helps to give more understandable error messages. Alternativey, if the sampleID argument is empty, all sample IDs from the given BioProject numbers will be given to the sampleID argument.

Details

Each sequence data sample ("Run") typically has additional measurements or metadata associated with it. However, these are difficult to find, and cannot be downloaded together with the nucleotide sequence data. The get.sample.attributes.INSDC function will fetch the all the metadata of the samples given to the sampleID (or BioPrjct) argument from the INSDC. To do this, get.sample.attributes.INSDC will use the Entrez Programming Utilities (E-utilities) from NCBI, and will access the databases with the API-key provided by the user. see https://ncbiinsights.ncbi.nlm.nih.gov/2017/11/02/new-api-keys-for-the-e-utilities/ for more info on getting an API-key.

Value

a dataframe with the data found

Author(s)

Maxime Sweetlove CC-0 2019

See Also

Other downloading data functions: download.sequences.INSDC(), get.BioProject.metadata.INSDC()

Examples

1
get.sample.attributes.INSDC(BioPrjct="PRJNA303951", apiKey="YouPersonalAPIKey")

biodiversity-aq/OmicsMetaData documentation built on Dec. 19, 2021, 9:44 a.m.