Description Usage Arguments Details Value Examples
This function returns a BAM file representing reads overlapping regions specified either as chromosomal regions or as gencode gene symbols.
1 2 3 |
uuid |
character(1) identifying the BAM file resource |
regions |
character() vector describing chromosomal regions,
e.g., |
symbols |
character() vector of gencode gene symbols, e.g.,
|
destination |
character(1) default |
overwrite |
logical(1) default FALSE can destination be overwritten? |
progress |
logical(1) default |
token |
character(1) security token allowing access to restricted data. Almost all BAM data is restricted, so a token is usually required. See https://gdc-docs.nci.nih.gov/API/Users_Guide/Authentication_and_Authorization/. |
legacy |
logical(1) whether or not to use the "legacy" archive, containing older, non-harmonized data. |
This function uses the Genomic Data Commons "slicing" API to get portions of a BAM file specified either using "regions" or using HGNC gene symbols.
character(1) destination to the downloaded BAM file
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 | slicing("df80679e-c4d3-487b-934c-fcc782e5d46e",
regions="chr17:75000000-76000000",
token=gdc_token())
# Get 10 BAM files.
bamfiles = files() %>%
filter(data_format=='BAM') %>%
results(size=10) %>% ids()
# Current alignments at the GDC are to GRCh38
library('TxDb.Hsapiens.UCSC.hg38.knownGene')
all_genes = genes(TxDb.Hsapiens.UCSC.hg38.knownGene)
first3genes = all_genes[1:3]
# remove strand info
strand(first3genes) = '*'
# We can get our regions easily now
as.character(first3genes)
# Use parallel downloads to speed processing
library(BiocParallel)
register(MulticoreParam())
fnames = bplapply(bamfiles, slicing, overwrite = TRUE,
regions=as.character(first3genes))
# 10 BAM files
fnames
library(GenomicAlignments)
lapply(unlist(fnames), readGAlignments)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.