Description Usage Format Details Source Examples
Prokaryotic genome sequencing projects at NCBI.
1 |
A genomes data frame with observations on the following 23 variables.
pidBioProject id
nameOrganism name
statusSequencing status
releasedFirst public sequence release
taxidTaxonomy id
accBioProject Accession number
groupPhylum
subgroupClass level
sizeTotal length of DNA (Mb)
gcPercent GC (guanine or cytosine)
refseqRefseq chromosome sequence accessions
insdcGenBank chromosome sequence accessions
plasmid.refseqRefseq plasmid sequence accessions
plasmid.insdcGenBank plasmid sequence accessions
wgsFour-letter WGS Accession prefix followed by version
scaffoldsNumber of scaffolds/contigs
genesNumber of genes
proteinsNumber of proteins
modifiedLast modification date
center Sequencing center
biosample BioSample Accession number
assemblyAssembly Accession number
referenceReference or representative genome
BioProject IDs are no longer unique and the table was modified on Nov 1, 2013 to include BioSample and Assembly accessions. See email on NCBI announcement regarding bacterial strain-level TaxID management for details
downloaded from ftp.ncbi.nlm.nih.gov/genomes/GENOME_REPORTS/prokaryotes.txt
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 | data(proks)
proks
#single row
t(proks[1,])
class(proks)
attributes(proks)[c("date","url")]
summary(proks)
## check for missing release dates
table2(proks$status,!is.na(proks$wgs), dnn=list("Status", "Has WGS acc?"))
plot(proks)
plotby(proks, log='y', las=1, top=2)
hist(proks$size[proks$size<15], br=50, main="", col="blue", xlab="Size (Mb)")
## download recent table from NCBI
## Not run: update(proks)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.