generate.sample2: Sample2 generator of synthetic data

generate.sample2R Documentation

Sample2 generator of synthetic data

Description

Multivariate normally distributed data synthetic generator. Data sets with 2 clusters are randomly generated. n examples for each class are generated. n 10000-dimensional examples for each class are generated. All classes (each one of n examples) has only no-noisy features but there is substantial overlapping between classes The first class (first n examples) has its features centered in 1 (first 5000 features) and 2 (last 5000 features) The second class (second n examples) has its features centered in -1 (first 5000 features) and -2 (last 5000 features) The diagonal of the covariance matrix of the first class has its first 2500 element equal to 0.5, the next 2500 equal to 1, the next 2500 to 0.5 and the last to 1. The diagonal of the covariance matrix of the second class has its first 5000 element equal to 1, the next 5000 equal to 2

Usage

generate.sample2(n = 2)

Arguments

n

number of examples for each class

Value

a real data matrix with 10000 rows (variables) and n*2 columns (examples)

Author(s)

Giorgio Valentini valentini@di.unimi.it

Examples

generate.sample2()
generate.sample2(n = 20)

clusterv documentation built on June 8, 2025, 10:21 a.m.