divPart: Genetic differentiation statistics and their estimators

Description Usage Arguments Details Value Author(s) References Examples

Description

divPart (diversity partition), allows for the calculation of three main diversity partition statistics and their respective estimators. The function can be used to mainly explore locus values to identify 'outliers' and also to visualise pairwise differentiation between populations. Bootstrapped confidence intervals are calculated also. Results can. be optionally plotted for data exploration.

Usage

1
2
3
4
5
6
divPart(infile = NULL, outfile = NULL, gp = 3,
        pairwise = FALSE, WC_Fst = FALSE, 
        bs_locus = FALSE, 
        bs_pairwise = FALSE,
        bootstraps = 0, plot = FALSE, 
        parallel = FALSE)

Arguments

infile

Specifying the name of the ‘genepop’ (Rousset, 2008) file from which the statistics are to be calculated This file can be in either the 3 digit of 2 digit format, and must contain only one whitespace separator (e.g. “space” or “tab”) between each column including the individual names column. The number of columns must be equal to the number of loci + 1 (the individual names column). If this file is not in the working directory the file path must be given. The name must be a character string (i.e. enclosed in “” or '').

outfile

Allows users to specify a prefix for an output folder. Name must a character string enclosed in either “” or ''.

gp

Specifies the digit format of the infile. Either 3 (default) or 2.

pairwise

A logical argument indicating whether standard pairwise diversity statistics should be calculated and returned as a diagonal matrix.

WC_Fst

A Logical argument indicating whether Weir & Cockerham's 1984 F-statistics should be calculated. NOTE - Calculating these statistics adds significant time to analysis when carrying out pairwise comparisons.

bs_locus

Gives users the option to bootstrap locus statistics. Results will be written to .xlsx workbook by default if the package 'xlsx' is installed, and to a .html file if plot=TRUE. If 'xlsx' is not installed, results will be written to .txt files.

bs_pairwise

Gives users the option to bootstrap statistics across all loci for each pairwise population comparison. Results will be written to a .xlsx file by default if the package 'xlsx' is installed, and to a .html file if plot=TRUE. If 'xlsx' is not installed, results will be written to .txt files.

bootstraps

Determines the number of bootstrap iterations to be carried out. The default value is bootstraps = 0, this is only valid when all bootstrap options are false. There is no limit on the number of bootstrap iterations, however very large numbers of bootstrap iterations (< 1000) on even modest data sets (e.g. 265 individuals x 38 loci) will take over 30 minutes to run on a most PCs).

plot

Optional interactive .html image file of the plotted bootstrap results for loci if bs_locus = TRUE and pairwise population comparisons if bs_pairwise = TRUE. The default option is plot = FALSE.

parallel

A logical input, indicating whether your analysis should be run in parallel mode or sequentially. parallel = TRUE is only valid if the packages, parallel, doParallel and foreach are installed.

Details

All results will be written to a user defined folder ("working_directory/outfile"). The format of outputs will vary depending on the availability of the package 'xlsx' in the users local package library. If 'xlsx' is available, results will be written to an Excel workbook. If 'xlsx' is not available, results will be written to .txt files.

Value

standard

A matrix containing identical data to the Standard_stats worksheet in the .xlsx workbook.

estimate

A matrix containing identical data to the Estimated_stats worksheet in the .xlsx workbook.

pairwise

A group of six matrices containing population pairwise statistics. This object is identical to that written as 'pairwise-stats' in the .xlsx workbook.

bs_locus

A list containing six matrices of locus values for Gst, G'st, D(Jost), Gst-(est), G'st-(est), and D(Jost)-(est) along with their respective 95% confidence interval.

bs_pairwise

A list containing six matrices of pairwise values for Gst, G'st, D(Jost), Gst-(est), G'st-(est), and D(Jost)-(est) along with their respective 95% confidence intervals.

Author(s)

Kevin Keenan <kkeenan02@qub.ac.uk>

References

Dragulescu, A.D., “xlsx: Read, write, formal Excel 2007 and Excel 97/2000/xp/2003 files”, R package version 0.4.2, url:http://CRAN.R-project.org/package=xlsx, (2012).

Guile, D.P., Shepherd, L.A., Sucheston, L., Bruno, A., and Manly, K.F., “sendplot: Tool for sending interactive plots with tool-tip content.”, R package version 3.8.10, url: http://CRAN.R-project.org/package=sendplot, (2012).

Hedrick, P., “A standardized genetic differentiation measure,” Evolution, vol. 59, no. 8, pp. 1633-1638, (2005).

Jost, L., “G ST and its relatives do not measure differentiation,” Molec- ular Ecology, vol. 17, no. 18, pp. 4015-4026, (2008).

Manly, F.J., “Randomization, bootstrap and Monte Carlo methods in biology”, Chapman and Hall, London, 1997.

Nei, M. and Chesser, R., “Estimation of fixation indices and gene diver- sities,” Ann. Hum. Genet, vol. 47, no. Pt 3, pp. 253-259, (1983).

R Development Core Team (2011). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0, URL http://www.R-project.org/.

Revolution Analytics (2012). doParallel: Foreach parallel adaptor for the parallel package. R package version 1.0.1. http://CRAN.R-project.org/package=doParallel

Revolution Analytics (2012). foreach: Foreach looping construct for R. R package version 1.4.0. http://CRAN.R-project.org/package=foreach

Rousset, F., “genepop'007: a complete re-implementation of the genepop software for Windows and Linux.,” Molecular ecology resources, vol. 8, no. 1, pp. 103-6, (2008).

Weir, B.S. & Cockerham, C.C., Estimating F-Statistics, for the Analysis of Population Structure, Evolution, vol. 38, No. 6, pp. 1358-1370 (1984).

Examples

1
2
3
4
5
6
7
8
9
## Not run: 
# simply use the following format to run the function
 
test_result <- divPart(infile = 'mydata', outfile = "myresults', 
                       gp = 3, pairwise = TRUE, bs_locus = TRUE, 
                       bs_pairwise = TRUE, bootstraps = 1000, 
                       plot = TRUE)

## End(Not run)

diveRsity documentation built on May 1, 2019, 10:30 p.m.