generate.data.miss: Generate the dataset with missing values

Description Usage Arguments Details Value References See Also Examples

Description

The function for the generation the dataset with missing values from the input dataset with all the values. It is mainly intended for the testing purposes. The results is in the form of “data.frame” which corresponds to the input data.frame or matrix, where missing values are inserted. The percent of missing values is supplied as the input parameters. The processed dataset can be used in the algorithms for missing value imputation “input_miss” or for any other purposes.

Usage

1
generate.data.miss(data,percent=5,filename=NULL)

Arguments

data

a dataset, a matrix of feature values for several cases, the last column is for the class labels. Class labels could be numerical or character values. This data set has not missing values

percent

a numerical value for the percent of the missing values to be inserted into the dataset.

filename

a character name of the output file to save the dataset with missing values.

Details

This function's main job is to generate the dataset with missing values from the input dataset with all the values. See the “Value” section to this page for more details.

Data can be provided in matrix form, where the rows correspond to cases with feature values and class label. The columns contain the values of individual features and the last column must contain class labels. The maximal number of class labels equals 10. The class label features and all the nominal features must be defined as factors.

Value

A returned data.frame corresponds to the input dataset with inserted missing values.

References

McShane LM, Radmacher MD, Freidlin B, Yu R, Li MC, Simon R. Methods for assessing reproducibility of clustering patterns observed in analyses of microarray data. Bioinformatics. 2002 Nov;18(11):1462-9.

See Also

input_miss, select.process, classifier.loop

Examples

1
2
3
4
5
6
7
# example

data(leukemia72_2)

percent =5
f.name=NULL #file name to include
out=generate.data.miss(data=leukemia72_2,percent=percent,filename=f.name)

Example output

Loading required package: gtools
Loading required package: Rcpp
Warning messages:
1: In rgl.init(initValue, onlyNULL) : RGL: unable to open X11 display
2: 'rgl_init' failed, running with rgl.useNULL = TRUE 
3: .onUnload failed in unloadNamespace() for 'rgl', details:
  call: fun(...)
  error: object 'rgl_quit' not found 

Biocomb documentation built on May 1, 2019, 9:38 p.m.