Sim_Data: Simulation of data for use in NHEMOtree

In NHEMOtree: Non-hierarchical evolutionary multi-objective tree learner to perform cost-sensitive classification

Description Usage Arguments Details Author(s) See Also Examples

Simulates data with one grouping variable containing four classes and 20 explanatory variables. Variables X1 to X3 are informative for separating the four classes. Variable X1 separates class 1, X2 separates class 1 and class 2, and X3 separates class 3 from class 4. Variables X4, X5, and X6 are derived from X3 and can also be used to separate class 3 from class 4, but with decreased prediction accuracy.

Sim_Data(Obs, VG=1, VP1=0.05, VP2=0.1, VP3=0.3)

`Obs`	Number of observations.
`VG`	Overall accuracy of the data separation, in [0,1]; VG=1 (default) gives perfect separation.
`VP1`	Decrease of prediction accuracy for variable X4 in comparison with X3 to separate class 3 from class 4 (default: VP1=0.05).
`VP2`	Decrease of prediction accuracy for variable X5 in comparison with X3 to separate class 3 from class 4 (default: VP2=0.1).
`VP3`	Decrease of prediction accuracy for variable X6 in comparison with X3 to separate class 3 from class 4 (default: VP3=0.3).

This function simulates data with one grouping variable containing four classes and 20 explanatory variables X1 to X10.

Variable X1 separates class 1, X2 separates class 1 and class 2, and X3 separates class 3 from class 4. For all samples belonging to the corresponding classes, the explanatory variables X1 to X3 are drawn from a normal distribution with μ=80 and σ^2=25. Samples that do not belong to the corresponding class are drawn from a uniform distribution with minimum 0 and an adjustable maximum. For each variable, this maximum is the smallest value drawn from the normal distribution.
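The sampling scheme described above can be sketched in base R. This is only an illustration under stated assumptions, not the package's implementation: the helper `sim_informative`, the way class labels are drawn, and the seed are all hypothetical.

```r
# Illustrative sketch only; NOT the NHEMOtree source code.
set.seed(1)
n   <- 200
cls <- sample(1:4, n, replace = TRUE)  # hypothetical class labels

# Hypothetical helper: one variable that is informative for the `target` class(es).
sim_informative <- function(cls, target, mu = 80, sigma2 = 25) {
  x <- numeric(length(cls))
  in_target <- cls %in% target
  # Class members: normal distribution with mean 80 and variance 25 (sd = 5).
  x[in_target] <- rnorm(sum(in_target), mean = mu, sd = sqrt(sigma2))
  # Non-members: uniform on [0, upper], where upper is the smallest normal draw.
  upper <- min(x[in_target])
  x[!in_target] <- runif(sum(!in_target), min = 0, max = upper)
  x
}

x1 <- sim_informative(cls, target = 1)  # stand-in for X1, which separates class 1
```

Because the uniform maximum equals the smallest normal draw, every non-member value lies at or below every member value, which is what makes the variable perfectly separating before any noise is added.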

Variables X4, X5, and X6 are derived from X3 and also separate class 3 from class 4. However, the prediction accuracy of these variables decreases gradually. The size of the decrease is set by 'VP1', 'VP2', and 'VP3': the corresponding proportion of the discriminating class 3 samples is disturbed by assigning a value drawn from a uniform distribution. Accordingly, X4, X5, and X6 discriminate class 3 worse than X3 does. X7 to X10 are noise variables drawn from a normal distribution that contain no information.
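As a rough sketch of how such a degraded copy could be produced, the snippet below overwrites a proportion `vp` of the class 3 values with uniform draws. The helper `degrade`, the stand-in for X3, and the choice to reuse the same uniform range are assumptions made for illustration; the package may do this differently.

```r
# Illustrative sketch only; NOT the NHEMOtree source code.
set.seed(2)
n   <- 200
cls <- sample(1:4, n, replace = TRUE)  # hypothetical class labels
is3 <- cls == 3

# Stand-in for X3: normal draws for class 3, uniform draws below them otherwise.
x3 <- numeric(n)
x3[is3]  <- rnorm(sum(is3), mean = 80, sd = 5)
upper    <- min(x3[is3])
x3[!is3] <- runif(sum(!is3), min = 0, max = upper)

# Hypothetical helper: replace a proportion `vp` of the target-class values
# with uniform draws, so the copy discriminates worse than the original.
degrade <- function(x, is_target, vp, upper) {
  idx <- which(is_target)
  k   <- round(vp * length(idx))           # number of samples to disturb
  hit <- idx[sample.int(length(idx), k)]   # which target samples to disturb
  x[hit] <- runif(k, min = 0, max = upper)
  x
}

x4 <- degrade(x3, is3, vp = 0.05, upper = upper)  # mirrors default VP1
x5 <- degrade(x3, is3, vp = 0.10, upper = upper)  # mirrors default VP2
x6 <- degrade(x3, is3, vp = 0.30, upper = upper)  # mirrors default VP3
```

Larger values of `vp` push more class 3 samples into the range occupied by the other classes, so a split on the degraded variable misclassifies more of them.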

Noise is added to the class assignment by means of a binomial distribution. Each sample keeps its intended class with probability "VG" and is assigned to one of the other classes with probability 1-"VG".
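The label noise can be illustrated with a short base R sketch. It assumes that a mislabelled sample is moved to one of the three other classes with equal probability, which the documentation does not state; the variable names are hypothetical.

```r
# Illustrative sketch only; NOT the NHEMOtree source code.
set.seed(3)
n        <- 1000
true_cls <- sample(1:4, n, replace = TRUE)  # hypothetical intended classes
vg       <- 0.9                             # plays the role of VG

# One Bernoulli (binomial, size 1) draw per sample: TRUE means the class is kept.
keep <- rbinom(n, size = 1, prob = vg) == 1

noisy_cls <- true_cls
for (i in which(!keep)) {
  others <- setdiff(1:4, true_cls[i])
  # Assumption: the replacement class is chosen uniformly among the others.
  noisy_cls[i] <- others[sample.int(length(others), 1)]
}

mean(noisy_cls == true_cls)  # share of unchanged labels, roughly vg
```

With `vg = 1` every draw in `keep` is TRUE and the labels stay unchanged, which corresponds to the perfect separation described for the default VG=1.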

Variable costs correlate with prediction accuracy, so variables containing more information are more expensive than variables with less or no information. The costs of the variables are generated with the function "Sim_Costs".

Swaantje Casjens

Sim_Costs, NHEMOtree

library(NHEMOtree)
d <- Sim_Data(Obs=200)  # simulate 200 observations
head(d)                 # show the first rows

NHEMOtree documentation built on May 2, 2019, 7:32 a.m.
