Description Usage Arguments Value Author(s)
View source: R/convert.gct2submap.R
Converts a gct file and cls file so that:
- one row per gene symbol & gene symbol becomes the Name of each row,
rather than the probe id
– different methods of collapsing rows
- cls file starts from 1, rather than 0
- choose the most variable genes, either by a coefficient of variation
threshold (eg 50)
or the top N most variable genes, as defined by CV.
1 2 3 | convert.gct2submap(gct.file, cls.file, chip.file, gct.out, cls.out,
collapse.mode = c("max.avg", "mean", "median", "max.cv"),
cv.thresh = NULL, topN.thresh = NULL)
|
gct.file |
standard input files to be modified. |
cls.file |
standard input files to be modified. |
chip.file |
used to map probe ID's to gene symbols |
gct.out |
output files. |
cls.out |
output files. |
collapse.mode |
how to choose the value for each gene. |
cv.thresh |
[optional] CV threshold to use. CV's are percentages, so 50, or 100 is a good place to start. Thresholding is done after collapsing to unique genes. This overrides topN.thresh. |
topN.thresh |
[optional] take the top N genes when ranked by their CV's. Must leave cv.thresh=NULL. Thresholding is done after collapsing to unique genes. |
max.avg |
calculate the average for each probeset & choose the values from the probes with largest average value |
mean |
for N probesets mapping to the same gene, calculate the mean or median of those N values for each sample. |
median |
for N probesets mapping to the same gene, calculate the mean or median of those N values for each sample. |
max.cv |
calculate the CV for each probe & choose the values from the probe with highest cv |
creates a new gct & cls file. The gct file has 1 row per gene which passed the filters & has gene symbols in column 1 (the Name column). The cls file starts from '1' instead of '0' which means it may not work with other GenePattern modules!
Mark Cowley, 2010-07-06
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.