euks: Eukaryotic genomes at NCBI

Description

Eukaryotic genome sequencing projects at NCBI

Usage

data(euks)

Format

A genomes data frame with observations on the following 21 variables.

pid

BioProject id

name

Organism name

status

Sequencing status

released

First public sequence release

taxid

Taxonomy id

acc

BioProject Accession number

group

Phylum

subgroup

Class level

size

Total length of DNA (Mb)

gc

Percent GC (guanine or cytosine)

assembly

Name of the genome assembly (from NCBI Assembly database)

chromosomes

Number of chromosomes

organelles

Number of organelles

plasmids

Number of plasmids

wgs

Four-letter Accession prefix followed by version

scaffolds

Number of scaffolds

genes

Number of genes

proteins

Number of proteins

modified

Last modification date

center

Sequencing center

biosample

BioSample Accession number

Details

Excludes projects that represent only organelles.

Source

Downloaded from ftp.ncbi.nlm.nih.gov/genomes/GENOME_REPORTS/eukaryotes.txt

Examples

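A minimal sketch of how a data frame with the columns listed under Format might be explored. It uses a small invented stand-in (`toy`, with placeholder organism names and made-up values) rather than the real euks data, so the numbers are illustrative only. With the package loaded, the same base R calls should work on `euks` itself, assuming its columns match the Format list.

```r
# Toy stand-in for euks: the rows are invented for illustration and are
# NOT real NCBI records. Column names follow the Format section.
toy <- data.frame(
  name   = c("Organism A", "Organism B", "Organism C"),
  status = c("Chromosome", "Scaffold", "Scaffold"),  # sequencing status
  group  = c("Group1", "Group2", "Group1"),          # placeholder group labels
  size   = c(12.1, 2500, 35.4),                      # total length of DNA (Mb)
  gc     = c(38.2, 41.0, 52.3),                      # percent GC
  stringsAsFactors = FALSE
)

# Count projects by sequencing status
status_counts <- table(toy$status)
status_counts

# Median genome size (Mb) within each group
median_size <- tapply(toy$size, toy$group, median)
median_size

# Projects with more than 100 Mb of DNA
large <- subset(toy, size > 100, select = c(name, size))
large
```

Only base R functions (`table`, `tapply`, `subset`) are used, so the sketch does not depend on any method the genomes2 package may or may not define for its genomes class.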

cstubben/genomes2 documentation built on May 12, 2017, 1:19 p.m.