Introduction

This vignette provides instructions to access the marker data from The Triticaea Toolbox (T3) that were used in the simulations. T3 is a database for breeding data on wheat, oat, and barley. This database was developed as part of the Triticeae Coordinated Agricultural Project (T-CAP). During the course of this project and a related project (the Barley CAP), many barley breeding lines were genotyped using 3072 SNP markers. From two participatig breeding programs (UMN and NDSU), 96 six-row barley breeding lines were submitted per year for four years. We used the genotype data on these lines for our simulation.

Data Access Steps

  1. First load the package in R with the command
library(GSSimTPUpdate)
  1. In the package there is a saved vector of six-row barley CAP line names. Call it with the command as so:
data(CAP.lines)
Next, save the vector of line names to the clipboard by running the folowing
write.table(CAP.lines, "clipboard", quote = F, row.names = F, col.names = F)
  1. Go to the website of The Triticeae Toolbox. There is no need to register to access the data.

  2. Under "Quick Links" on the left-hand side, click on "Lines."

  3. Paste the line names into the text box labelled "Name" and hit the "Search" button at the bottom. After a short while, the lines will have been populated in the box underneath the "Search" button. 764 lines should be present. Click the "Add to Selected" button to save the lines.

  4. Under the "Download" tab at the top of the page, click on "Genotype and Phenotype Data."

  5. Before downloading the information, select a genetic map by clicking the "genetic map" button. Select the map "Barley, 2011 OPA, Consensus."

  6. Set the follwing filter criteria using the text boxes: Minimum MAF >= 0, Remove markers missing > 10% of data, and Remove lines missing > 10% of data.

  7. Hit the "Create file" button for the rrBLUP encoding. The database may take a little while to create the download file. Grab some coffee.

  8. When the file creation is complete, a "Download Zip file of results" button will appear at the bottom. Click it to download the file.

  9. Unzip the file to retrive the contents. The file "genotype.hmp.txt" will contain the genotype calls for each line at each SNP, as well as SNP information such as name, allele, chromosome, and genetic position. This can be used in the CAP_data_preparation.R script.

Alternatively, the genotype data is already included in this package. See the CAP_data_preparation script for code to prepare the markers for the simulation.



UMN-BarleyOatSilphium/GSSimTPUpdate documentation built on May 9, 2019, 7:40 p.m.