createExternalCovariates: Create covariates by sampling from an external file
In MikeKSmith/MSToolkit: The MSToolkit Library For Clinical Trial Design

View source: R/createExternalCovariates.R

createExternalCovariates

R Documentation

Create covariates by sampling from an external file

Description

Create covariates by sampling from an external file.

Usage

createExternalCovariates(
  subjects,
  names,
  file,
  sameRow = TRUE,
  subset = NULL,
  refCol = NULL,
  dataId = idCol,
  idCol = getEctdColName("Subject"),
  percent = 20,
  seed = .deriveFromMasterSeed(),
  includeIDCol = TRUE,
  refColSuffix = "refCol",
  workingPath = getwd()
)

Arguments

`subjects`	(Required) Number of subjects for which to sample sets of external covariate values
`names`	(Required) Names of the covariates to use from the `file`
`file`	(Required) Input file name. This should be either a valid csv file or a NONMEM data file containing all the variables given by `names`, `idCol`
`sameRow`	(Optional) A logical value). Should all the covariates be sampled from the same rows or should the sampling be done independantly for each covariate. Using `sameRow = TRUE` would maintain the multivariate structure of the imported dataset and is faster. TRUE by default
`subset`	(Optional) Any subset to be performed on the imported dataset before doing any sampling. The subset is parsed by the `parseRangeCode` function. No subsetting is performed by default
`refCol`	(Optional) The reference column in the data file. If given, the output dataset will also contain an additional column indicating the origin of each row from the original dataset. This option is not compatible with `sameRow = FALSE`. By default, reference variables are not used
`dataId`	(Optional) The subject variable in the input dataset, equal to `idCol` by default. By default, it will be the same as "idCol"
`idCol`	(Optional) The subject variable in the output dataset. "SUBJ" by default
`percent`	(Optional) When a subset is performed on the input data, if the number of rows remaining in the dataset after subset is less than `percent` \ used
`seed`	(Optional) Random generator seed to use. The current random seed is used by default
`includeIDCol`	(Optional) A logical value. Should the subject variable be included in the output dataset? When `createCovariates` calls this function, it does not need the subject variable. TRUE by default
`refColSuffix`	(Optional) The suffix to use when creating the `refCol` variable. If the `refCol` variable is "SUBJ", then in the output dataset it will be created as "SUBJ" suffixed with "refColSuffix". By default, "refCol" is used as the suffix
`workingPath`	(Optional) Working path from which to import covariate file. By default, the current working directory is used

Details

The sampling is always done with replacement.

The refCol setting is typically used with the parameter component, see createParameters or createExternalParameters to maintain consistency between imported covariates and imported parameters.

Value

A data frame containing the imported variables, and possibly a reference variable.

Author(s)

Mike K Smith mstoolkit@googlemail.com

Examples

## Not run: 

  # an example file from the unit tests of the MSToolkit package
  wPath <- system.file( "Runit", "data", "createCovariates", package = "MSToolkit" )

  # sample 20 subjects from the example file
  dat <- createExternalCovariates( 20, names = "X1",
    subset = c(".7 < X1 < .8", "-1 <= X2 <= 1"),
    file = "testCovariates.csv", workingPath = wPath )
  print( dat )

  # maintaining the origin of each row
  dat <- createExternalCovariates( 20, names = "X1, X2",
    subset = c(".7 < X1 < .8", "-1 <= X2 <= 1"),
    file = "testCovariates.csv", workingPath = wPath, refCol = "ID" )
  print( dat )


## End(Not run)

MikeKSmith/MSToolkit documentation built on July 3, 2024, 1:53 a.m.