DataProcessing: DataProcessing

Description

The DataProcessing function processes raw .CEL files into probe intensity values using functions from the aroma.affymetrix package. It returns a data frame and saves it as an .RData file.

Usage

DataProcessing(chipType = "HuEx-1_0-st-v2", tags = "coreR3,A20071112,EP",
  Name = "ColonCancer", ExonSummarization = TRUE,
  GeneSummarization = TRUE, FIRMA = TRUE, location = "test",
  verbose = TRUE)

Arguments

chipType

The name of the chip type of the array data.

tags

Tags that are added to the chipType.

Name

The name of the data.

ExonSummarization

Logical. Should the data be summarized at the exon level?

GeneSummarization

Logical. Should the data be summarized at the gene level?

FIRMA

Logical. Should the FIRMA model be fitted to the data?

location

The location where the .rda file is to be stored. Defaults to the current working directory.

verbose

Logical. If TRUE, messages are printed during the data processing.

Details

The DataProcessing function is a wrapper around several functions of the aroma.affymetrix package. To obtain the data on which the GDS model is run, the raw .CEL files are background corrected with RMA background correction and normalized with quantile normalization. For the function to run properly, a chipType and any tags that belong to it need to be specified.

It is also important to have the folder structure required by the aroma.affymetrix package. This implies the following: a rawData folder containing a folder named after the "Name" parameter. This "Name" folder should contain a folder named after the chipType, and the .CEL files should be placed in that folder. An annotationData folder should also be present. In it, a chipTypes folder should be made that contains one folder per chip type, each named after its chip type. The corresponding .cdf file should be saved in the folder of each chip type.

The processed data are saved at the specified location as a data frame with the gene IDs in the first column and the exon IDs in the second column. All other columns contain the sample values. The saved object also contains a vector of the unique gene IDs and a vector of the unique exon IDs. If requested, exon-level and gene-level summarization are performed and the results are saved as data frames at the specified location. The option is also provided to perform the FIRMA model on the data.
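The folder layout described above can be prepared with base R before any .CEL or .cdf files are copied in. The following is only a minimal sketch: the project root under tempdir() is a hypothetical placeholder, and the Name and chipType values are taken from the usage example.

```r
# Sketch: create the folder skeleton described in Details.
# 'root' is a hypothetical project directory; replace it with your own.
root      <- file.path(tempdir(), "aroma_project")
name      <- "ColonCancer"      # value of the Name argument
chip_type <- "HuEx-1_0-st-v2"   # value of the chipType argument

# rawData/<Name>/<chipType>/ is where the .CEL files go
raw_dir <- file.path(root, "rawData", name, chip_type)

# annotationData/chipTypes/<chipType>/ is where the .cdf file goes
annot_dir <- file.path(root, "annotationData", "chipTypes", chip_type)

for (d in c(raw_dir, annot_dir)) {
  dir.create(d, recursive = TRUE, showWarnings = FALSE)
}

# Show the resulting directory tree relative to the project root
print(list.dirs(root, full.names = FALSE, recursive = TRUE))
```

After this, the .CEL files would be copied into raw_dir and the .cdf file into annot_dir, and the working directory set to the project root before calling DataProcessing.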

Value

An .rda file that is saved at the specified location.
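A saved .rda file can be inspected without overwriting objects in the workspace by loading it into a separate environment. The sketch below uses a toy data frame as a stand-in for the processed data; the file name, object name, column names, and values are all hypothetical, since the names used inside the real output file are not documented here.

```r
# Toy stand-in for processed data: gene IDs first, exon IDs second,
# then one column per sample. All names and values are hypothetical.
toy <- data.frame(
  GeneID  = c("G1", "G1", "G2"),
  ExonID  = c("E1", "E2", "E3"),
  Sample1 = c(7.1, 6.8, 9.2),
  Sample2 = c(7.3, 6.5, 9.0),
  stringsAsFactors = FALSE
)
rda_path <- file.path(tempdir(), "toy_processed.rda")
save(toy, file = rda_path)

# Load into a fresh environment so existing objects are not clobbered;
# load() returns the names of the objects it restored.
env <- new.env()
loaded_names <- load(rda_path, envir = env)
print(loaded_names)

# Inspect the restored data frame and derive the unique gene IDs
str(env[[loaded_names[1]]])
unique_genes <- unique(env[[loaded_names[1]]]$GeneID)
print(unique_genes)
```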

Examples

## Not run: 
DataProcessing(chipType="HuEx-1_0-st-v2",tags="coreR3,A20071112,EP",
Name="ColonCancer",ExonSummarization=TRUE,GeneSummarization=TRUE,
FIRMA=TRUE,location="ColonCancer",verbose=TRUE)

## End(Not run)

GDS documentation built on May 2, 2019, 6:09 p.m.