input_miss: Process the dataset with missing values
In Biocomb: Feature Selection and Classification with the Embedded Validation Procedures for Biomedical Data Analysis

Description Usage Arguments Details Value References See Also Examples

The main function for handling with missing values. It performs the missing values imputation using two different approachs: imputation with mean values and using the nearest neighbour algorithm. It can handle both numerical and nominal values. The function also delete the features with the number of missing values more then specified threshold. The results is in the form of “list” with the processed dataset and the logical value, which indicates the success or failure of processing. The processed dataset can be used in the algorithms for feature selection “select.process” and classification “classifier.loop”.

1 2	input_miss(matrix,method.subst="near.value", attrs.nominal=numeric(),delThre=0.2)

`matrix`	a dataset, a matrix of feature values for several cases, the last column is for the class labels. Class labels could be numerical or character values. The maximal number of classes is ten.
`method.subst`	a method of missing value processing. There are two realized methods: substitution with mean value ('mean.value') and nearest neighbour algorithm ('near.value').
`attrs.nominal`	a numerical vector, containing the column numbers of the nominal features, selected for the analysis.
`delThre`	the minimal threshold for the deletion of features with missing values. It is in the interval [0,1], where for delThre=0 all features having at least one missing value will be deleted.

This function's main job is to handle the missing values in the dataset. See the “Value” section to this page for more details.

Data can be provided in matrix form, where the rows correspond to cases with feature values and class label. The columns contain the values of individual features and the last column must contain class labels. The maximal number of class labels equals 10. The class label features and all the nominal features must be defined as factors.

The data are provided with reasonable number of missing values that is preprocessed with one of the imputing methods.

A returned list consists of the the following fields:

`data`	a processed dataset
`flag.miss`	logical value; if TRUE the processing is successful, if FALSE the input dataset is returned without processing.

McShane LM, Radmacher MD, Freidlin B, Yu R, Li MC, Simon R. Methods for assessing reproducibility of clustering patterns observed in analyses of microarray data. Bioinformatics. 2002 Nov;18(11):1462-9.

select.process, classifier.loop

# example for dataset with missing values
data(leukemia_miss)
xdata=leukemia_miss

# class label must be factor
xdata[,ncol(xdata)]<-as.factor(xdata[,ncol(xdata)])

# nominal features must be factors
attrs.nominal=101
xdata[,attrs.nominal]<-as.factor(xdata[,attrs.nominal])

delThre=0.2
out=input_miss(xdata,"mean.value",attrs.nominal,delThre)
if(out$flag.miss)
{
 xdata=out$data
}

Biocomb documentation built on May 1, 2019, 9:38 p.m.

Biocomb index

Package overview

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

Biocomb
Feature Selection and Classification with the Embedded Validation Procedures for Biomedical Data Analysis

input_miss: Process the dataset with missing values
In Biocomb: Feature Selection and Classification with the Embedded Validation Procedures for Biomedical Data Analysis

Description

Usage

Arguments

Details

Value

References

See Also

Examples

Related to input_miss in Biocomb...

R Package Documentation

Browse R Packages

We want your feedback!

Biocomb Feature Selection and Classification with the Embedded Validation Procedures for Biomedical Data Analysis

input_miss: Process the dataset with missing values In Biocomb: Feature Selection and Classification with the Embedded Validation Procedures for Biomedical Data Analysis

Description

Usage

Arguments

Details

Value

References

See Also

Examples

Related to input_miss in Biocomb...

R Package Documentation

Browse R Packages

We want your feedback!

Biocomb
Feature Selection and Classification with the Embedded Validation Procedures for Biomedical Data Analysis

input_miss: Process the dataset with missing values
In Biocomb: Feature Selection and Classification with the Embedded Validation Procedures for Biomedical Data Analysis