downloadGEOdata: Download a GSE record from GEO

Description Usage Arguments Details Note Author(s) See Also Examples

View source: R/downloadGEOdata.R

Description

Downloads the SOFT files for the GSE, GPLs, GSMs, and GDSs corresponding to the GSE identifier provided by the user from GEO to the user's local machine

Usage

1
downloadGEOdata(GSEid, destdir = getwd())

Arguments

GSEid

character string specifying the GSE to be downloaded from GEO

destdir

directory where to locate the BigMac directory used for storing the SOFT files downloaded from GEO. The default directory is the current working directory

Details

In the Gene Expression Omnibus (GEO) high-throughput functional genomics data is stored in SOFT (Simple Omnibus Format in Text) file format. Examples are the series record (GSE), the sample record (GSM), the platform record (GPL), and the dataset record (GDS). More information about the different types of SOFT files can be found at http://www.ncbi.nlm.nih.gov/geo/info/overview.html.
The function downloadGEOdata uses (or creates, if it does not exist yet) a data directory called BigMac in a directory destdir specified by the user. The BigMac directory contains several subdirectories: annotation, COMPENDIUM, data and log. The data directory contains further subdirectories to store the downloaded .soft files corresponding to GSEs, GSMs, GPLs, and GDSs downloaded from GEO. More information about the structure of the BigMac directory can be found at http://wiki.bioinformaticslaboratory.nl/foswiki/bin/view/BioLab/CompendiumDB. If an existing BigMac directory is detected that already contains the necessary SOFT files, these files will not be downloaded from GEO again.

Note

If the BigMac directory already exists, the function downloadGEOdata will try to store the downloaded data in the existing directory structure. Therefore, in order to avoid errors, do not change BigMac's directory structure.

Author(s)

Umesh K. Nandal

See Also

loadDatabaseSchema, loadDataToCompendium

Examples

1
2
3
4
5
6
 ## Not run: 
  # Download the files related to the specified GSE from GEO to the BigMac directory 
  # in the user's current working directory
  downloadGEOdata(GSEid="GSE23183")
 
## End(Not run)

Example output

Loading required package: Biobase
Loading required package: BiocGenerics
Loading required package: parallel

Attaching package: 'BiocGenerics'

The following objects are masked from 'package:parallel':

    clusterApply, clusterApplyLB, clusterCall, clusterEvalQ,
    clusterExport, clusterMap, parApply, parCapply, parLapply,
    parLapplyLB, parRapply, parSapply, parSapplyLB

The following objects are masked from 'package:stats':

    IQR, mad, sd, var, xtabs

The following objects are masked from 'package:base':

    Filter, Find, Map, Position, Reduce, anyDuplicated, append,
    as.data.frame, basename, cbind, colMeans, colSums, colnames,
    dirname, do.call, duplicated, eval, evalq, get, grep, grepl,
    intersect, is.unsorted, lapply, lengths, mapply, match, mget,
    order, paste, pmax, pmax.int, pmin, pmin.int, rank, rbind,
    rowMeans, rowSums, rownames, sapply, setdiff, sort, table, tapply,
    union, unique, unsplit, which, which.max, which.min

Welcome to Bioconductor

    Vignettes contain introductory material; view with
    'browseVignettes()'. To cite Bioconductor, see
    'citation("Biobase")', and for packages 'citation("pkgname")'.

Loading required package: GEOquery
Setting options('download.file.method.GEOquery'='auto')
Setting options('GEOquery.inmemory.gpl'=FALSE)
Loading required package: RMySQL
Loading required package: DBI
sh: 1: perl: Permission denied

compendiumdb documentation built on May 2, 2019, 4:06 p.m.