Calculates the Genomic Estimated Breeding Value by using Random Forest method.
STGS.rf(X, Y, r)
X is a design matrix of marker genotype of size n×p where n are no of Individuals under study (i.e. genotype, lines) and p are no of markers.
Y is a vector of individuals of size n×1.
fraction of testing data (ranges from (0-1)) used during model fitting (suppose if one want to use 75% of data for model training and remaining 25% for model testing so one has to define r=0.25).
This function fits model by dividing data into two part i.e. training sets and testing sets. Former one is used to build the models and later one for performance evaluation. The performance of model is evaluated by calculating model accuracy i.e. pearson correlation coefficient between actual phenotypic value and predicted phenotypic value. Whole procedures is repeated 25 times and accuracy is averaged.
$Pred GEBV's for genotype under study
$Accuracy model accuracy i.e. pearson correlation coefficient between actual phenotypic value and predicted phenotypic value
Breiman, L. (2001), Random Forests, Machine Learning 45(1), 5-32.
Breiman, L (2002), “Manual On Setting Up, Using, And Understanding Random Forests V3.1”,https://www.stat.berkeley.edu/~breiman/Using_random_forests_V3.1.pdf.
A. Liaw and M. Wiener (2002). Classification and Regression by randomForest. R News 2(3), 18-22
1 2 3 4 5 6 7 8 9 10 11
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.