refdb_import_NCBI: Download and import NCBI Nucleotide records

View source: R/import_NCBI.R

refdb_import_NCBIR Documentation

Download and import NCBI Nucleotide records

Description

This function allows to search and download data from the the NCBI Nucleotide database. Additionally it uses the NCBI Taxonomy database to get the sequence taxonomic classification.

Usage

refdb_import_NCBI(
  query,
  full = FALSE,
  max_seq_length = 10000,
  seq_bin = 200,
  verbose = TRUE
)

Arguments

query

a character string with the query.

full

a logical. If FALSE (the default), only a subset of the most important fields is included in the result.

max_seq_length

a numeric giving the maximum length of sequences to retrieve. Useful to exclude complete genomes.

seq_bin

number of sequences to download at once.

verbose

print information in the console.

Details

This function uses several functions of the rentrez package to interface with the NCBI's EUtils API.*

Value

A tibble.

Errors

Error in curl::curl_fetch_memory(url, handle = handle) : transfer closed with outstanding read data remaining
This error seems to appear with long sequences. You can try to decrease max_seq_length to exclude them.

Examples

## Not run: 
silo_ncbi <- refdb_import_NCBI("Silo COI")

## End(Not run)


refdb documentation built on Sept. 22, 2022, 5:07 p.m.