getInput | R Documentation |
This function has GWAS summary statistics data files as inputs, perform genetic instrument selection and return matrices that are ready to use for GRAPPLE
getInput(
sel.files,
exp.files,
out.files,
plink_refdat,
max.p.thres = 0.01,
cal.cor = T,
p.thres.cor = 0.5,
get.marker.candidates = T,
marker.p.thres = 1e-05,
marker.p.source = "exposure",
clump_r2 = 0.001,
clump_r2_formarkers = 0.05,
plink_exe = NULL
)
sel.files |
A vector of the GWAS summary statistics file names for the risk factors SNP selection. Each GWAS file is a ".csv" or ".txt" file containing a data frame that at least has a column "SNP" for the SNP ids and "pval" for the p-values. The length of |
exp.files |
A vector of length |
out.files |
The GWAS summary statistics file name for the disease data, can be a vector of length |
plink_refdat |
The reference genotype files (.bed, .bim, .fam) for clumping using PLINK (loaded with –bfile). |
max.p.thres |
The upper threshold of the selection p-values for a SNP to be selected before clumping. It only requires that at least one of the p-values of the risk factors of the SNPs to be below the threshold. Default is |
cal.cor |
Whether calculate the |
p.thres.cor |
The lower threshold of the p-values for a SNP to be used in calculating the correlation matrix. It only select SNPs whose p-values are above the threshold for all risk factors. Default is |
get.marker.candidates |
Whether getting SNPs which are used for mode marker selection. Only applies to cases where the number of risk factors |
marker.p.thres |
P-value threshold of p-values in the exposure files for mode markers. Default is |
marker.p.source |
source of p-values of mode markers, a string of either "exposure" or "selection". Default is "exposure" for obtaining more markers. |
clump_r2 |
The clumping r2 threshold in PLINK for genetic instrument selection. Default is set to 0.001 for selection of independent SNPs. |
clump_r2_formarkers |
The clumping r2 threshold in PLINK. Default is set to 0.05 for selection of candidates for the marker SNPs. |
plink_exe |
The name of the plink exe. Default is NULL, which uses "plink". For users with Linux systems, one may want to have a different name, like "./plink" depending on where they install plink |
A list of selected summary statistics, which include
data |
A data frame of size |
marker.data |
A data frame for marker candidate SNPs, which has the same columns as |
.
cor.mat |
The estimated |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.