CreateData: Creates a data set

Description Usage Arguments Details Value Author(s) Examples

View source: R/CreateData.R

Description

This function creates an artificial data set based on either dimensional or categorical latent structure, which can vary according to a number of basic parameters. Such data can be useful for getting to know the taxometric programs and becoming familiar with their output by conducting analyses using data sets whose parameters are known.

Usage

1
2
CreateData(str, n = 600, k = 4, p = 0.5, d = 2, r = 0, r.tax = 0, r.comp = 0, 
g = 0, h = 0, cuts = 0, uniform = F, seed = 1)

Arguments

str

The type of data to be generated. Specify either "dim" for dimensional data or "cat" (or anything else) for categorical data.

n

Sample size. The default value is 600.

k

Number of variables. The default value is 4.

p

Taxon base rate. The default value is .5.

d

Standardized mean difference between groups. The default value is 2.

r

Correlation among variables. The default value is 0.

r.tax

Correlation among variables within the taxon. The default value is 0.

r.comp

Correlation among variables within the complement. The default value is 0.

g

Parameter used to control asymmetry (scalar); sign indicates direction and absolute value indicates magnitude of skew (e.g., +/- .30 yields substantial asymmetry).

h

Parameter used to control tail weight (scalar); positive values yield tails that are longer/thinner than a standard normal curve, negative values do the reverse (e.g., +/- .15 is a substantial departure from normality).

cuts

Parameter used to create ordered categorieas, if nonzero (scalar); number of categories will be cuts + 1.

uniform

Whether to generate random values (the program default) or use uniformly distributed quantiles (T/F).

seed

Random number seed; specifying the same seed enables users to generate and analyze identical data sets. The default value is 1.

Details

Users should call this function directly if they wish to create an artificial data set.

Value

Data matrix; k columns contain data, final column contains classification.

Author(s)

John Ruscio <[email protected]> and Shirley Wang <[email protected]> Maintainer: John Ruscio <[email protected]>

Examples

1
2
3
4
5
# creates a categorical data set
test.cat <- CreateData("cat")

# creates a dimensional data set
test.dim <- CreateData("dim")

RTaxometrics documentation built on Sept. 10, 2017, 1:03 a.m.