knitr::opts_chunk$set(echo = TRUE, eval = FALSE)

Introduction

This vignette provides instructions to access the marker data from The Triticeae Toolbox (T3) that were used in several simulation projects.

Data Access Steps

  1. First load the package in R with the command
library(pbsimData)

Spring Two-Row CAP Lines

  1. In the package there is a saved vector of two-row barley CAP line names. Call it with the command as so:
data("s2_cap_lines", package = "pbsimData")

Next, save the vector of line names to the clipboard by running the folowing

write.table(s2_cap_lines, "clipboard", quote = F, row.names = F, col.names = F)
  1. Go to the website of The Triticeae Toolbox. There is no need to register to access the data.

  2. Under "Quick Links" on the left-hand side, click on "Lines."

  3. Paste the line names into the text box labelled "Name" and hit the "Search" button at the bottom. After a short while, the lines will have been populated in the box underneath the "Search" button. r length(s2_cap_lines) lines should be present. Click the "Add to Selected" button to save the lines.

  4. Under "Quick Links" on the left-hand side, click on "Markers."

  5. Under "search using pattern matching," use the following regular expression to search for markers (without quotes): ^[1-2]{2}. 3,140 markers should be selected (you will see this number next to the "Markers" link).

  6. Under the "Download" tab at the top of the page, click on "Genotype and Phenotype Data."

  7. Before downloading the information, select a genetic map by clicking the "genetic map" button. Select the map "Barley, 2011 OPA, Consensus."

  8. Set the follwing filter criteria using the text boxes: Minimum MAF >= 0, Remove markers missing > 100% of data, and Remove lines missing > 100% of data.

  9. Hit the "Create file" button for the rrBLUP encoding. The database may take a little while to create the download file. Grab some coffee.

  10. When the file creation is complete, a "Download Zip file of results" button will appear at the bottom. Click it to download the file.

  11. Unzip the file to retrive the contents. The file "genotype.hmp.txt" will contain the genotype calls for each line at each SNP, as well as SNP information such as name, allele, chromosome, and genetic position. This can be used in the CAP_data_preparation.R script.

Spring Six-Row CAP Lines

  1. In the package there is a saved vector of six-row barley CAP line names. Call it with the command as so:
data("s6_cap_lines", package = "pbsimData")

Next, save the vector of line names to the clipboard by running the folowing

write.table(s6_cap_lines, "clipboard", quote = F, row.names = F, col.names = F)
  1. Go to the website of The Triticeae Toolbox. There is no need to register to access the data.

  2. Under "Quick Links" on the left-hand side, click on "Lines."

  3. Paste the line names into the text box labelled "Name" and hit the "Search" button at the bottom. After a short while, the lines will have been populated in the box underneath the "Search" button. r length(s6_cap_lines) lines should be present. Click the "Add to Selected" button to save the lines.

  4. Under "Quick Links" on the left-hand side, click on "Markers."

  5. Under "search using pattern matching," use the following regular expression to search for markers (without quotes): ^[1-2]{2}

  6. Under the "Download" tab at the top of the page, click on "Genotype and Phenotype Data."

  7. Before downloading the information, select a genetic map by clicking the "genetic map" button. Select the map "Barley, 2011 OPA, Consensus."

  8. Set the follwing filter criteria using the text boxes: Minimum MAF >= 0, Remove markers missing > 100% of data, and Remove lines missing > 100% of data.

  9. Hit the "Create file" button for the rrBLUP encoding. The database may take a little while to create the download file. Grab some coffee.

  10. When the file creation is complete, a "Download Zip file of results" button will appear at the bottom. Click it to download the file.

  11. Unzip the file to retrive the contents. The file "genotype.hmp.txt" will contain the genotype calls for each line at each SNP, as well as SNP information such as name, allele, chromosome, and genetic position. This can be used in the CAP_data_preparation.R script.



neyhartj/pbsimData documentation built on May 31, 2019, 10:36 a.m.