Skip to main content
Dryad

Dataset for: Guidelines for standardising the application of discriminant analysis of principal components to genotype data

Cite this dataset

Thia, Joshua (2022). Dataset for: Guidelines for standardising the application of discriminant analysis of principal components to genotype data [Dataset]. Dryad. https://doi.org/10.5061/dryad.b8gtht7f0

Abstract

Data and scripts required to replicate the analyses in Thia (2022) "Guidelines for standardising the application of discriminant analysis of principal components to genotype data" in Molecular Ecology.

This study aimed to address methodological misunderstandings and misuse of the DAPC method in population genetics. The analyses are used to illustrate that for genotype data comprising k effective populations, there are only k1 PC axes that describe populations structure, and that are biologically informative. These PC axes are the only suitable axes for modelling the among-population differences with a DA. Use of many more than k−1 PC axes leads to decreasing biological relevancy of the final DA solution, with implications for misinterpretations of population structure.

Methods

Metapopulations with different migrations rates and levels of genetic differentaition were simualted using fastsimcoal v2.7. 

Simulated individuals were imported into R v4.1.2 for further downstream analysis.

Usage notes

See the README for details on setting up computing environment, the working directory, and executing scripts.