generate_sample_data: Generate genetic sample data

Description Usage Details Value

Description

This function generates a data set that could be used for testing the algorithms.

Usage

1

Details

This function generates a set of unclassified data and a vector to evaluate the clustering. The data should be interpreted as two hidden clusters. For each cluster n=2000 individuals are generated, of which some become deseased. The disease appears, if

Value

The function returns a list

$data

A data.frame with 20 observed markers and about 630 individuals. The Markers are called

  • snpA, snpB, snpC, snpD with MAF 0.2

  • noise1 to noise4 with MAF 0.1

  • noise5 to noise12 with MAF 0.2

  • noise13 to noise16 with MAF 0.3

$classification

a vector containing for each observation, in which cluster it has been generated


imbs-hl/MDRDist documentation built on May 18, 2019, 4:45 a.m.