sim.control: Set options that control gene drop simulation of relationship...
In CrypticIBDcheck: Identifying Cryptic Relatedness in Genetic Association Studies

Description Usage Arguments Details Value Author(s) See Also Examples

Set options that control gene drop simulation of relationship pairs.

sim.control(simulate=FALSE, 
    rships=c("unrelated","MZtwins","parent-offspring","full-sibs","half-sibs"),
    nsim=rep(200,length(rships)), userdat=NULL,
    geno.err=1/1000, hom2hom.err=0, fitLD=TRUE, LDfiles=NULL, cl=NULL)

`simulate`	Should data be simulated by gene drop to allow the user to assign relationships to outlying pairs? Default is FALSE. See Details for more information.
`rships`	A character vector specifying the relationships to simulate. The choices are currently `"unrelated"`, `"MZtwins"`, `"parent-offspring"`, `"full-sibs"`, `"half-sibs"`, `"cousins"`, or `"user"`, for unrelated, monozygotic-twin/duplicate, parent-offspring, full-sibling, half-sibling, first-cousin, or user-defined relationship pairs, respectively. The default relationships are `"unrelated"`, `"MZtwins"`, `"parent-offspring"`, `"full-sibs"` and `"half-sibs"`. Please note that half-sibling, avuncular and grandparent-grandchild relationships cannot be distinguished on the basis of IBD coefficients. Partial matches (e.g., `"par"` rather than `"parent-offspring"`) are allowed, as shown in the Examples.
`nsim`	Numeric vector of numbers of pairs to be simulated by gene drop for each relationship. The default is 200 for each relationship listed in `rships`.
`userdat`	A data frame of information on the pedigree to simulate for a user-defined relationship pair. The columns of this data frame must be named as follows: `ids`, the IDs of the members of the pedigree; `dadids`, the IDs of each pedigree member's father, or zero if the father is not in the pedigree; `momids`, the IDs of each pedigree member's mother, or zero if the mother is not in the pedigree; `gender`, the gender of each subject coded as 1 for male and 2 for female. See Examples for an example. Default is `NULL`.
`geno.err`	Genotyping error rate. Each genotype is sampled with probability `geno.err` to be measured incorrectly, according to a simple error model. In the error model, heterozygous genotypes are equally likely to be called as either of the homozygous genotypes. Homozygous genotypes are called as the other homozygous genotype with probability `hom2hom.err` and as heterozygous with probability 1-`hom2hom.err`. The default value of `geno.err` is 0.001.
`hom2hom.err`	The probability a homozygous genotype that is miscalled is miscalled as the other homozygous genotype. Default is 0, so that miscalled homozyous genotypes are always called heterozygous.
`fitLD`	Should an LD model be fit to the data for use in gene drop simulations? Default is TRUE. Ignored if `simulate=FALSE`. See Details for more information.
`LDfiles`	Character vector of the names of files containing LD models fit by `FitGMLD`. These may be present from a previous call to `IBDcheck`. There must be one file name for each chromosome of data in `snp.data`, and the files must be ordered by chromosome number. Default is `NULL`. Ignored if `fitLD=FALSE` or `simulate=FALSE`.
`cl`	A SNOW cluster that can be used to split fitting of LD models and gene drop simulations across a compute cluster. Default is `NULL`.

When simulate=TRUE, IBDcheck simulates data from pairs with known relationship that can be used to generate prediction ellipses on the graphical displays as a reference. Unrelated, parent-offspring, full sibling, half sibling, cousin, or user-defined relationships are simulated by gene drop and their estimated IBD coeficients are computed as for pairs of study subjects. Monozygotic twins/duplicates are not simulated by gene drop. Rather, they are simulated by randomly sampling a study individual and then applying the genotyping error model twice to make two copies. Gene drop simulations can be based on loci in linkage equilibrium (fitLD=FALSE) or on a fitted LD model that accounts for inter-locus correlation (fitLD=TRUE).

A list whose components are the function inputs.

Annick Joelle Nembot-Simo, Jinko Graham and Brad McNeney

IBDcheck

# Set simulation parameters to simulate unrelated, parent-offspring and
# full-sibling pairs. Leave other simulation parameters at their 
# default values (e.g., nsim=rep(200,length(rships)), fitLD=TRUE).
ss<-sim.control(simulate=TRUE,rships=c("unrel","parent","full"))
## Not run: 
# Create an IBD object to use as input to IBDcheck.
data(Nhlsim)
popsam<-Nhlsim$csct==0 # controls
# Use chromosomes 20, 21 and 22 only.
cind<-(Nhlsim$chromosome==20|Nhlsim$chromosome==21|Nhlsim$chromosome==22)
dat<-new.IBD(Nhlsim$snp.data[,cind],Nhlsim$chromosome[cind],
                Nhlsim$physmap[cind],popsam)
# Run IBDcheck
cibd<-IBDcheck(dat,simparams=ss)
# Save the names of the LD files for future simulations.
LDfiles<-cibd$simparams$LDfiles

# Use the fitted LD model from cibd to add 100 more simulated, unrelated pairs
# and save the updated IBD object in cibd2.
ss<-sim.control(simulate=TRUE,LDfiles=LDfiles,rships="unrelated",nsim=100) 
ff<-filter.control(filter=FALSE) # No need to re-filter the SNP data in cibd
cibd2<-IBDcheck(cibd2,filterparams=ff,simparams=ss)

# Add 200 simulated first-cousin pairs to cibd2, an IBD object which has no
# simulated first-cousin pairs. Save the updated IBD object in cibd3.
ss<-sim.control(simulate=TRUE,LDfiles=LDfiles,rships="cousins") 
ff<-filter.control(filter=FALSE) # No need to re-filter the SNP data in cibd2
cibd3<-IBDcheck(cibd2,filterparams=ff,simparams=ss)

# Add 200 simulated pairs having the user-specified mother-daughter relationship, 
# with mother and father being first cousins. See the package vignette 
# "CrypticIBDcheck", Figure 4, for a picture of this pedigree. Save the updated
# IBD object in cibd4. 
userdat<-data.frame(ids=1:9,
                    dadids=c(3,5,7,0,9,9,0,0,0),
                    momids=c(2,4,6,0,8,8,0,0,0),
                    gender=c(2,2,1,2,1,2,1,2,1))
ss<-sim.control(simulate=TRUE,LDfiles=LDfiles,rships=c("user"),userdat=userdat)
ff<-filter.control(filter=FALSE) # No need to re-filter the SNP data in cibd3
cibd4<-IBDcheck(cibd3,simparams=ss,filterparams=ff)

# Distribute fitting of LD models and gene drop simulations for each 
# chromosome across a cluster running on a local computer.  See the 
# package vignette, vignette("CrypticIBDcheck") for an example of running
# IBDcheck in batch mode on a compute cluster. Save the updated IBD object
# in cibd5.
library(parallel)
cl<-makeCluster(3,type="SOCK")
clusterEvalQ(cl,library("CrypticIBDcheck"))
ss<-sim.control(simulate=TRUE,cl=cl) # Leave all other sim params at defaults
cibd5<-IBDcheck(dat,simparams=ss)
stopCluster(cl)

## End(Not run)