CCMP1375: 'Assessment' Objects for _Prochlorococcus marinus_ subsp....

Description

Objects of class Assessment and either subclass DataMap or subclass Results for Prochlorococcus marinus subsp. marinus strain CCMP1375

Usage

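CCMP1375_PreSaved_DataMapObj
CCMP1375_PreSaved_ResultsObj_GenBank
CCMP1375_PreSaved_ResultsObj_GeneMarkS2
CCMP1375_PreSaved_ResultsObj_Glimmer
CCMP1375_PreSaved_ResultsObj_Prodigal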

Format

All 5 objects have a list structure. For specifics on what the two types of Assessment objects contain, please see the AssessORF package.

Details

'CCMP1375_PreSaved_DataMapObj' is an object of subclass DataMap.

'CCMP1375_PreSaved_ResultsObj_GenBank' is an object of subclass Results with predicted genes from the GenBank database.

'CCMP1375_PreSaved_ResultsObj_GeneMarkS2' is an object of subclass Results with predicted genes from the program GeneMarkS-2.

'CCMP1375_PreSaved_ResultsObj_Glimmer' is an object of subclass Results with predicted genes from the program Glimmer.

'CCMP1375_PreSaved_ResultsObj_Prodigal' is an object of subclass Results with predicted genes from the program Prodigal.
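
As a minimal sketch of how the five objects might be loaded and their classes checked, the snippet below builds the object names from the strain identifier and loads them with base R's data(). It assumes the AssessORFData package is installed and that the objects are available as package datasets; the variable names (strain, gene_sources, object_names) are illustrative only.

```r
# Build the five documented object names from the strain identifier.
strain <- "CCMP1375"
gene_sources <- c("GenBank", "GeneMarkS2", "Glimmer", "Prodigal")
object_names <- c(
  paste0(strain, "_PreSaved_DataMapObj"),
  paste0(strain, "_PreSaved_ResultsObj_", gene_sources)
)

# Assumption: AssessORFData is installed and exposes these objects as datasets.
if (requireNamespace("AssessORFData", quietly = TRUE)) {
  data(list = object_names, package = "AssessORFData")
  for (nm in object_names) {
    if (exists(nm)) {
      # Each object should report class Assessment plus DataMap or Results.
      cat(nm, ":", paste(class(get(nm)), collapse = ", "), "\n")
    }
  }
}
```

If the package is not installed, the snippet only constructs the names and does nothing else.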

For all 5 objects, values of two key identifying list items within each object are listed below.

Mapping object

The mapping object, 'CCMP1375_PreSaved_DataMapObj', stores the mapping of proteomics evidence and evolutionary conservation evidence to the strain's genome. The raw proteomics mass spectra data came from ProteomeXchange dataset PXD005745. The related genomes used to determine evolutionary conservation all came from the order Synechococcales, and links to their sequences were downloaded from NCBI's Genome Browser. The object was built using the MapAssessmentData function from the AssessORF package.

Results objects

The 4 results objects, 'CCMP1375_PreSaved_ResultsObj_GenBank', 'CCMP1375_PreSaved_ResultsObj_GeneMarkS2', 'CCMP1375_PreSaved_ResultsObj_Glimmer', and 'CCMP1375_PreSaved_ResultsObj_Prodigal', store how much evidence supports or contradicts each gene in a set of predicted genes for the strain's genome. Each of the four objects was built by passing a set of predicted genes and the mapping object described here to the AssessGenes function from the AssessORF package.

For the GenBank object, the genes were downloaded from the corresponding record in the GenBank database. For the other three objects, GeneMarkS-2 (web server), Prodigal (2.6.3), and Glimmer (3.02) were each run at default settings to generate the set of predicted genes for the corresponding results object.

The GeneLeftPos, GeneRightPos, and GeneStrand list items within each of the four results objects provide positional information on the set of predicted genes used to generate that object.

The GeneSource list item within each of the four results objects describes where the predicted genes came from ("GenBank", "GeneMarkS2", "Glimmer", or "Prodigal", matching the object's name).
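
The list items described above can be read with ordinary list access. The sketch below defines a hypothetical helper, summarize_gene_positions (not part of AssessORF or AssessORFData), and applies it to a mock list with made-up values. It assumes inclusive gene coordinates, so that length is right minus left plus one, and the "+"/"-" strand encoding in the mock is likewise an assumption. The final block applies the same helper to the Prodigal results object if AssessORFData happens to be installed.

```r
# Hypothetical helper: summarize the positional list items of a results object.
summarize_gene_positions <- function(res) {
  # Assumption: coordinates are inclusive, so length = right - left + 1.
  gene_lengths <- res$GeneRightPos - res$GeneLeftPos + 1
  list(
    source = res$GeneSource,
    n_genes = length(res$GeneLeftPos),
    median_length = median(gene_lengths),
    strand_counts = table(res$GeneStrand)
  )
}

# Mock object with made-up values, only to show the expected shape.
mock_res <- list(
  GeneLeftPos = c(1, 100, 400),
  GeneRightPos = c(90, 249, 699),
  GeneStrand = c("+", "-", "+"),
  GeneSource = "Prodigal"
)
mock_summary <- summarize_gene_positions(mock_res)

# Assumption: AssessORFData is installed and exposes the object as a dataset.
if (requireNamespace("AssessORFData", quietly = TRUE)) {
  data(list = "CCMP1375_PreSaved_ResultsObj_Prodigal", package = "AssessORFData")
  if (exists("CCMP1375_PreSaved_ResultsObj_Prodigal")) {
    print(summarize_gene_positions(CCMP1375_PreSaved_ResultsObj_Prodigal))
  }
}
```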

Getting the strain's genome

Use SaveGenomeToPath("CCMP1375", <INSERT FILE PATH HERE>) to save the genome sequence for the strain to your local machine.
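
A concrete call might look like the sketch below, which writes to a temporary directory. The file name and its .fasta extension are assumptions (this page does not state the output format), as is the guess that SaveGenomeToPath is exported by AssessORFData and may need network access; the tryCatch only reports a failure instead of stopping the session.

```r
# Illustrative destination; the file name and extension are assumptions.
genome_path <- file.path(tempdir(), "CCMP1375_genome.fasta")

# Assumption: SaveGenomeToPath is exported by AssessORFData.
if (requireNamespace("AssessORFData", quietly = TRUE)) {
  tryCatch(
    AssessORFData::SaveGenomeToPath("CCMP1375", genome_path),
    error = function(e) message("Could not save genome: ", conditionMessage(e))
  )
}
```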

Source

Proteomics data: http://proteomecentral.proteomexchange.org/cgi/GetDataset?ID=PXD005745

NCBI's Genome Browser: https://www.ncbi.nlm.nih.gov/genome/browse/#!/prokaryotes/

GenBank record: https://www.ncbi.nlm.nih.gov/nuccore/NC_005042.1


DRK248/AssessORFData documentation built on Feb. 2, 2020, 11:06 p.m.