select.genes: Select (automatically) the most appropriate gene columns of...

Description Usage Arguments Examples

View source: R/datasets.R

Description

This function allows you to (automatically) select the most appropriate gene columns of your dataset.

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
select.genes(
  df,
  genes = NULL,
  selected.genes = NULL,
  features = NULL,
  max.genes = 100,
  selection.criteria = c("non.zeros", "mean", "variance", "random"),
  min.non.zeros = 10,
  cor = TRUE,
  cor.method = c("spearman", "kendall", "pearson"),
  cor.threshold = 0.25
)

Arguments

df

Dataset.

genes

Geneset with features (optional).

selected.genes

User-selected genes (optional).

features

Features you want to select (optional).

max.genes

Maximum number of selected genes (optional).

selection.criteria

Criteria to be applied when it is necessary to select a maximum number of genes from the previously filtered ones. Default: 'non.zeros'

min.non.zeros

Minimum number of non-zero values per gene (optional).

cor

Using correlation to exclude unconnected genes (optional).

cor.method

Correlation method: 'spearman', 'kendall', or 'pearson'. Default: 'spearman'

cor.threshold

Correlation threshold. Default: 0.1

Examples

1
2
df <- select.genes(df, max.genes=100, min.non.zeros=2)
df <- select.genes(df, genes=genes, features=c('tumor.suppressor', 'breast.cancer'), cor=TRUE, cor.threshold=0.7)

rlebron-bioinfo/gnlearn documentation built on July 25, 2020, 12:38 p.m.