get.mgi.features: get.mgi.features

View source: R/get.mgi.features.R

get.mgi.featuresR Documentation

get.mgi.features

Description

Retrieve the MGI features within a genomic region. FALSEeatures include genes, gene models, non-coding RNA, etc., but not SNPs. This allows the user to filter by source and type of feature.

Usage

  get.mgi.features(file = "ftp://ftp.jax.org/SNPtools/genes/MGI.20140803.sorted.txt", 
  chr = NULL, start = NULL, end = NULL, source = c("all", "MGI", "VEGA", "ENSEMBL", 
  "Blat", "NCBI_Gene"), type = c("all", "gene", "pseudogenic_transcript", 
  "pseudogenic_exon", "pseudogene", "match", "match-part", "transcript", "exon", 
  "mRNA", "five_prime_UTR", "start_codon", "CDS", "stop_codon", "three_prime_UTR",
  "pseudogenic_mRNA", "pseudogenic_start_codon", "pseudogenic_CDS", 
  "pseudogenic_stop_codon", "pseudogenic_five_prime_UTR", "pseudogenic_three_prime_UTR",
  "sequence_feature"))

Arguments

file

character, the full path to the Tabix indexed and zipped MGI feature file. Default = "http://cgd.jax.org/tools/SNPtools/MGI/MGI.20130305.sorted.txt.gz" for internal JAX use.

chr

Numeric vector, chr for each start and end position. Chr, start and end must all have the same length.

start

Numeric vector, start position in Mb or bp for each chr. Chr, start and end must all have the same length.

end

Numeric vector, end position in Mb or bp for each chr. Must be greater than or equal to the corresponding start value. Chr, start and end must all have the same length.

source

Character vector, the source of the annotation. Options are ("all", "MGI", "VEGA", "ENSEMBL", "Blat", "NCBI_Gene"). "all" returns all features.

type

Character vector, the type of feature. Options are ("all", "gene", "pseudogenic_transcript", "pseudogenic_exon", "pseudogene", "match", "match-part", "transcript", "exon", "mRNA", "five_prime_UTR", "start_codon", "CDS", "stop_codon", "three_prime_UTR", "pseudogenic_mRNA", "pseudogenic_start_codon", "pseudogenic_CDS", "pseudogenic_stop_codon", "pseudogenic_five_prime_UTR", "pseudogenic_three_prime_UTR", "sequence_feature"). "all" returns all features.

Details

This function is designed to return features from the MGI gene feature file (GFALSEFALSE). You can select multiple regions on different chromosomes.

Value

A list of data.frames for each region requested. Each data.frame will contain 14 columns; seqid, source, type, start, stop, score, strand, phase, ID, Name, Parent, Dbxref, mgiName, bioType. If there is only one requested region, a single data frame is returned.

Author(s)

Daniel Gatti

References

The MGI GFALSEFALSE file is at ftp://ftp.informatics.jax.org/pub/mgigff/.

Examples

  ## Not run: 
  genes = get.mgi.features(chr = 7, start = 103 end = 105 source = "MGI", type = "gene")
  
## End(Not run)

dmgatti/DOQTL documentation built on April 7, 2024, 10:35 p.m.