README.md

Bilinear package for fitting bilinear models (AMMI, GGE, etc.)

v0.2 release

The new release allows for missing data to be imputed with an EM algorithm within bilinear(). I have fixed a few other issues, and will continue to update and fix any bugs that arise. Please post issues to the issue board here on this github page. I will get to them as soon as I can.

Package formatting

To install, first install devtools by entering an R environment and typing.

install.packages("devtools")

Load the devtools library with

library(devtools)

Then install bilinear by typing

install_github("nsantantonio/Bilinear")

You can now load the library as you would any other R package

library(Bilinear)

I highly reccomend that you do not download these files directly! Follow the installation instructions above.

Microsoft R open

I have had issues using devtools with Microsoft R open https://mran.microsoft.com/open. I have supplied a source tarball, Bilinear_0.2.1.tar.gz for those who use MRAN, which can be found in the v0.2.1 release. You can also just clone the repo and build on your own machine.

git clone https://github.com/nsantantonio/Bilinear

R CMD build Bilinear

R CMD INSTALL Bilinear

Missing cells are now allowed!

Missing genotype/environment combinations are now allowed, and are imputed with an expectation maximization algorithm within bilinear(). There is also functionality to allow the user to determine the accuracy of imputation, using the em() function (although this function is not required to be called by the user under typical circumstances). While I have imposed no limit on the degree of missingness, I would suggest avoiding data sets with poor interconnectivity (i.e. poor overlap of genotype/environment information). An extreme example would be one set of lines evaluated in one set of environments and another set of genotypes evaluated in another set of environments put together and connected with missing values.

Bilinear

Fit bilinear models using AMMI (Additive Main effects Multiplicative Interaction) or GGE/SREG (Genotype and Genotype by Environment /Sites REGression) and make 2D and 3D biplots. Also tests for the number of significant dimensions of GxE using a parameteric bootstrap test method suggested by Forkman and Piepho (2014). An additional test for significant multiplicative terms, the FR test from Piepho (1995), has been also been implemented in version 0.1.

type ?bilinear for some examples. Bilinear version 0.1 contains the relevant functions, namely bilinear() to fit model and run statistical tests for signficant dimensions, BBplot() to plot 2D and 3D biplots, and AMMIplot() to produce linear and winner plots for AMMI models such as those in Gauch & Zobel (1997).

Plant Breeders Disclaimer

It is this author's opinion that GGE type models are more difficult to interpret than AMMI type models, as the genotypic main effect will be present in one or more of the remaining dimensions of the residual matrix, but not necessarily the first (largest) dimension (e.g. when Var(GxE) > Var(G)). Therefore, I recommend using AMMI type models, as both genotype and environment main effects are removed from the residuals before GxE effects are assessed. However, GGE type models are provided here as a freely available resource for those who want to use them. The decorateGGE option of BBplot will draw the mega-environment delineations suggested by Yan et al. (2000).

Proceed with Caution

This software is in development and bugs may (almost certainly) be present. I have tried to make the package flexible to different data inputs, but this leads to difficulties in testing all the possible ways one might try to input data. See known bugs below.

Known issues that have been fixed

Known issues to be addressed

Soy data

The soy.Rdata included here is from Zobel, Wright and Gauch (1988) and is subset from the gauch.soy dataset included in the agridat package. This was included to show the equivalence between using within environment means and raw data.

Ontario data

The ontario.Rdata included here is the Onterio wheat data containing 18 genotypes evaluated in 9 environments from from Yan et al. (2000) transcribed directly from the Yan et al. (2007) paper.

References

Imputation using expectation maximization
Bootstrap test
F test
SVD sign ambiguity (might not be relevant for centered matrices...)

selected background

AMMI
GGE
EM for unbalanced data
Discussion of bilinear models and biplot interpretation. I highly recommend reading them in the order shown here:


nsantantonio/Bilinear documentation built on Aug. 18, 2020, 2:31 p.m.