This vignette provides instructions to access the marker data from The Triticaea Toolbox (T3) that were used in the simulations. T3 is a database for breeding data on wheat, oat, and barley. This database was developed as part of the Triticeae Coordinated Agricultural Project (T-CAP). During the course of this project and a related project (the Barley CAP), many barley breeding lines were genotyped using 3072 SNP markers. From two participatig breeding programs (UMN and NDSU), 96 six-row barley breeding lines were submitted per year for four years. We used the genotype data on these lines for our simulation.
library(GSSimTPUpdate)
data(CAP.lines)
Next, save the vector of line names to the clipboard by running the folowing
write.table(CAP.lines, "clipboard", quote = F, row.names = F, col.names = F)
Go to the website of The Triticeae Toolbox. There is no need to register to access the data.
Under "Quick Links" on the left-hand side, click on "Lines."
Paste the line names into the text box labelled "Name" and hit the "Search" button at the bottom. After a short while, the lines will have been populated in the box underneath the "Search" button. 764 lines should be present. Click the "Add to Selected" button to save the lines.
Under the "Download" tab at the top of the page, click on "Genotype and Phenotype Data."
Before downloading the information, select a genetic map by clicking the "genetic map" button. Select the map "Barley, 2011 OPA, Consensus."
Set the follwing filter criteria using the text boxes: Minimum MAF >= 0, Remove markers missing > 10% of data, and Remove lines missing > 10% of data.
Hit the "Create file" button for the rrBLUP encoding. The database may take a little while to create the download file. Grab some coffee.
When the file creation is complete, a "Download Zip file of results" button will appear at the bottom. Click it to download the file.
Unzip the file to retrive the contents. The file "genotype.hmp.txt" will
contain the genotype calls for each line at each SNP, as well as SNP information
such as name, allele, chromosome, and genetic position. This can be used in
the CAP_data_preparation.R
script.
Alternatively, the genotype data is already included in this package. See the
CAP_data_preparation
script for code to prepare the markers for the simulation.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.