runDiagnostics: Data reduction procedure
In genphen: A tool for quantification of associations between genotypes and phenotypes in genome wide association studies (GWAS) with Bayesian inference and statistical learning

Description Usage Arguments Details Value Author(s) See Also Examples

The methods implemented in genphen are statistically superior to the ones implemented by most classical (frequentist) tools for GWAS. A major challenge, however, of our method is the substantially increased computational cost when analyzing thousands of SNPs. Inspired by the biological assumption that the major fraction of the studied SNPs are non-informative (genetic noise) with respect to the selected phenotype, various data reduction techniques can be implemented to quickly scan the SNP and discard a substantial portion of the the SNPs deemed to be clearly non-informative.

1	runDiagnostics(genotype, phenotype, phenotype.type, rf.trees)

`genotype`	Character matrix/data frame or a vector, containing SNPs/SAAPs as columns or alternatively as DNAMultipleAlignment or AAMultipleAlignment Biostrings object.
`phenotype`	Numerical vector.
`phenotype.type`	Character indicator of the type of the phenotype, with 'Q' for a quantitative, or 'D' for a dichotomous phenotype.
`rf.trees`	Number of random forest trees (default = 5,000).

The data reduction procedure includes the following steps:

The complete data (genotypes and a single phenotype) is used to train a random forest (RF) model, which will quantify the importance of each SNP/SAAP in explaining the phenotypeassociation between each SNP and the phenotype.
We can then plot the distribution of variable importances, to get an insight into the structure of the importances values and potentially disect the signal from the noise.
The main analysis can then be performed with runGenphen using a subset (based on their importance) of SNPs

`site`	id of the site (e.g. position in the provided sequence alignment)
`importance`	Magnitude of importance (impurity) of the site, estimated with random forest implemented in R package ranger

Simo Kitanovski <simo.kitanovski@uni-due.de>

runGenphen, runPhyloBiasCheck

# genotypes:
data(genotype.saap)
# quantitative phenotype:
data(phenotype.saap)

# run diagnostics
diag <- runDiagnostics(genotype = genotype.saap,
                       phenotype = phenotype.saap,
                       phenotype.type = "Q",
                       rf.trees = 5000)

genphen documentation built on Nov. 8, 2020, 5:03 p.m.

genphen index

README.md genphen overview

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

genphen
A tool for quantification of associations between genotypes and phenotypes in genome wide association studies (GWAS) with Bayesian inference and statistical learning

runDiagnostics: Data reduction procedure
In genphen: A tool for quantification of associations between genotypes and phenotypes in genome wide association studies (GWAS) with Bayesian inference and statistical learning

Description

Usage

Arguments

Details

Value

Author(s)

See Also

Examples

Related to runDiagnostics in genphen...

R Package Documentation

Browse R Packages

We want your feedback!

genphen A tool for quantification of associations between genotypes and phenotypes in genome wide association studies (GWAS) with Bayesian inference and statistical learning

runDiagnostics: Data reduction procedure In genphen: A tool for quantification of associations between genotypes and phenotypes in genome wide association studies (GWAS) with Bayesian inference and statistical learning

Description

Usage

Arguments

Details

Value

Author(s)

See Also

Examples

Related to runDiagnostics in genphen...

R Package Documentation

Browse R Packages

We want your feedback!

genphen
A tool for quantification of associations between genotypes and phenotypes in genome wide association studies (GWAS) with Bayesian inference and statistical learning

runDiagnostics: Data reduction procedure
In genphen: A tool for quantification of associations between genotypes and phenotypes in genome wide association studies (GWAS) with Bayesian inference and statistical learning