Description Usage Arguments Value Note Author(s) Examples
This function connects to the UCSC Genome Browser public
database and downloads annotation elements (gene
co-ordinates, exon co-ordinates, gene identifications
etc.) for each of the supported organisms, but using UCSC
instead of Ensembl. See the help page of
metaseqr
for a list of supported organisms.
The function downloads annotation for an organism genes or
exons.
1 2 | get.ucsc.annotation(org, type, refdb="ucsc",
multic=FALSE)
|
org |
the organism for which to download annotation. |
type |
either |
refdb |
either |
multic |
a logical value indicating the
presence of multiple cores. Defaults to
|
A data frame with the canonical (not isoforms!) genes or
exons of the requested organism. When
type="genes"
, the data frame has the following
columns: chromosome, start, end, gene_id, gc_content,
strand, gene_name, biotype. When type="exon"
the
data frame has the following columns: chromosome, start,
end, exon_id, gene_id, strand, gene_name, biotype. The
gene_id and exon_id correspond to UCSC or RefSeq gene
and exon accessions respectively. The gene_name corresponds
to HUGO nomenclature gene names.
The data frame that is returned contains only "canonical"
chromosomes for each organism. It does not contain
haplotypes or random locations and does not contain
chromosome M. Note also that as the UCSC databases do
not contain biotype classifications like Ensembl. These
will be returned as NA
and as a result, some
quality control plots will not be available.
Panagiotis Moulos
1 2 | hg19.genes <- get.ucsc.annotation("hg19","gene","ucsc")
mm9.exons <- get.ucsc.annotation("mm9","exon","refseq")
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.