markerSelection: markers selection

Description Usage Arguments Details Value Author(s) See Also

View source: R/SNPselection.R

Description

This function selects, for each chromosome, the most relevant markers according to a response.

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
markerSelection(
  dataSetName,
  dataResponse,
  chromosome = 1:22,
  signal = c("CN", "fracB"),
  normalTumorArray,
  onlySNP = FALSE,
  nbFolds = 10,
  loss = c("logistic", "linear"),
  plot = TRUE,
  pkg = c("HDPenReg", "spikeslab"),
  ...
)

Arguments

dataSetName

The name of the data-set folder.

dataResponse

A csv files or a data.frame with 2 columns : "files" and "response". The column "files" contains the filename to extract and the second column the response associated to the file.

chromosome

A vector containing the number of the chromosomes for the SNPs selection.

signal

either "CN" or "fracB". corresponding to which signal will be analyzed (default="CN").

normalTumorArray

Only in the case of normal-tumor study. A csv file or a data.frame containing the mapping between normal and tumor files. The first column contains the name of normal files and the second the names of associated tumor files.

onlySNP

(only if signal="CN"). If TRUE, only the SNPs probes are used (default=FALSE).

nbFolds

number of folds in the cross validation (default=10).

loss

either "logistic" (binary response) or "linear" (quantitative response), default is "logistic"

plot

If TRUE, cross-validation mean squared error is plotted (default=TRUE).

pkg

Either "HDPenReg" or "spikeslab". Ued package in linear case.

...

Other parameters for HDlars, glmnet or spikeslab function.

Details

This function requires to use the aroma folder architecture. In your working directory, there must have the rawData folder and totalAndFracBData folder. This function launches the lars algorithm on the CN or fracB data and uses a cross-validation to select the most appropriate solution.

Value

a list containing length(chromosme) elements. Each element is a list containing

chr

chromosome corresponding to the signal.

markers.index

A vector containing the index of all selected markers.

markers.position

A vector containing the position of all selected markers.

markers.names

A vector containing the names of all selected markers.

coefficient

A vector containing the coefficients of all selected markers.

intercept

Intercept of the model.

Author(s)

Quentin Grimonprez

See Also

HDPenReg, glmnet, spikeslab


MPAgenomics documentation built on March 30, 2021, 5:13 p.m.