dist.gene: Pairwise Distances from Genetic Data




Description

This function computes a matrix of distances between pairs of individuals from a matrix or a data frame of genetic data.


Usage

dist.gene(x, method = "pairwise", pairwise.deletion = FALSE,
          variance = FALSE)



Arguments

x
a matrix or a data frame (will be coerced to a matrix).

method
a character string specifying the method used to compute the distances; two choices are available: "pairwise" and "percentage", or any unambiguous abbreviation of these.

pairwise.deletion
a logical indicating whether to delete the columns with missing data on a pairwise basis. The default (FALSE) is to delete every column with at least one missing observation.

variance
a logical indicating whether the variance of the distances should be returned (defaults to FALSE).


Details

This function is meant to be very general and accepts different kinds of data (alleles, haplotypes, SNPs, DNA sequences, ...). The rows of the data matrix represent the individuals, and the columns the loci.

In the case of the pairwise method, the distance d between two individuals is the number of loci for which they differ, and the associated variance is d(L - d)/L, where L is the number of loci.

In the case of the percentage method, this distance is divided by L, and the associated variance is d(1 - d)/L, where d is now the proportion of loci that differ.
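The two formulas above can be illustrated with a minimal base R sketch. It does not call dist.gene and is not meant to reproduce its implementation; the toy genotypes and the variable names (x, pairs, d_pairwise, and so on) are hypothetical, and the data are assumed to contain no missing values.

```r
# Toy data: 3 individuals (rows) typed at 4 loci (columns); values are made up.
x <- rbind(ind1 = c("A", "C", "G", "T"),
           ind2 = c("A", "C", "T", "T"),
           ind3 = c("G", "C", "T", "A"))
L <- ncol(x)  # number of loci

# All pairs of row indices: (1,2), (1,3), (2,3).
pairs <- combn(nrow(x), 2)

# Pairwise method: number of loci at which the two individuals differ.
d_pairwise <- apply(pairs, 2, function(p) sum(x[p[1], ] != x[p[2], ]))
var_pairwise <- d_pairwise * (L - d_pairwise) / L

# Percentage method: the same count divided by L.
d_percentage <- d_pairwise / L
var_percentage <- d_percentage * (1 - d_percentage) / L

print(d_pairwise)    # 1 3 2
print(d_percentage)  # 0.25 0.75 0.50
```
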

For more elaborate distances with DNA sequences, see the function dist.dna.


Value

an object of class dist. If variance = TRUE, an attribute called "variance" is given to the returned object.
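A short usage sketch of how the returned object and its "variance" attribute might be inspected. It assumes the ape package is installed (the block does nothing otherwise), and the toy genotype matrix is hypothetical.

```r
# Runs only if ape is available; the data below are made-up genotypes.
if (requireNamespace("ape", quietly = TRUE)) {
  x <- rbind(ind1 = c("A", "C", "G", "T"),
             ind2 = c("A", "C", "T", "T"),
             ind3 = c("G", "C", "T", "A"))

  # Request the variances together with the distances.
  d <- ape::dist.gene(x, method = "pairwise", variance = TRUE)

  print(class(d))             # the result is a "dist" object
  print(as.matrix(d))         # full symmetric matrix, labelled by row names
  print(attr(d, "variance"))  # one variance per pair of individuals
}
```
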


Note

Missing data (NA) are coded and treated in R's usual way.
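To make the effect of the pairwise.deletion argument on missing data concrete, here is a minimal base R sketch of the two deletion strategies described for that argument. It is an illustration under stated assumptions rather than the function's own code: the toy data and all variable names (complete, d_global, d_pairdel, L_pairdel) are hypothetical.

```r
# Toy data with one missing genotype (NA) at locus 2 of ind2.
x <- rbind(ind1 = c("A", "C", "G", "T"),
           ind2 = c("A", NA,  "T", "T"),
           ind3 = c("G", "G", "T", "A"))
pairs <- combn(nrow(x), 2)  # pairs (1,2), (1,3), (2,3)

# Global deletion: first drop every column containing at least one NA.
complete <- colSums(is.na(x)) == 0
x_complete <- x[, complete, drop = FALSE]
d_global <- apply(pairs, 2, function(p) {
  sum(x_complete[p[1], ] != x_complete[p[2], ])
})

# Pairwise deletion: for each pair, skip only the loci missing in that pair.
# Comparing with NA yields NA, which na.rm = TRUE then ignores.
d_pairdel <- apply(pairs, 2, function(p) {
  sum(x[p[1], ] != x[p[2], ], na.rm = TRUE)
})
# Number of loci actually compared for each pair.
L_pairdel <- apply(pairs, 2, function(p) {
  sum(!is.na(x[p[1], ] != x[p[2], ]))
})

print(d_global)   # 1 3 2
print(d_pairdel)  # 1 4 2
print(L_pairdel)  # 3 4 3
```

In this toy case, locus 2 still contributes to the ind1 versus ind3 comparison under pairwise deletion, whereas global deletion discards it for every pair.
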


Author(s)

Emmanuel Paradis

See Also

dist.dna, cophenetic.phylo, dist

ape documentation built on Nov. 17, 2017, 5:18 a.m.