Dataset for: Guidelines for standardising the application of discriminant analysis of principal components to genotype data
Citation
Thia, Joshua (2022), Dataset for: Guidelines for standardising the application of discriminant analysis of principal components to genotype data, Dryad, Dataset, https://doi.org/10.5061/dryad.b8gtht7f0
Abstract
Data and scripts required to replicate the analyses in Thia (2022) "Guidelines for standardising the application of discriminant analysis of principal components to genotype data" in Molecular Ecology.
This study aimed to address methodological misunderstandings and misuse of the DAPC method in population genetics. The analyses illustrate that, for genotype data comprising k effective populations, only k−1 PC axes describe population structure and are biologically informative. These PC axes are the only suitable axes for modelling the among-population differences with a DA. Using many more than k−1 PC axes decreases the biological relevance of the final DA solution and can lead to misinterpretation of population structure.
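The k−1 limit can be illustrated with a little linear algebra: once k population centroids are centred on their grand mean, they span at most k−1 dimensions, however many loci are measured. The following is a minimal Python sketch of that point only. It is not one of the scripts in this dataset (those run in R), the centroids are random placeholder values rather than simulated genotypes, the function names are hypothetical, and equal sample sizes per population are assumed.

```python
import random


def matrix_rank(rows, tol=1e-9):
    """Rank of a small dense matrix via Gaussian elimination with partial pivoting."""
    m = [list(r) for r in rows]
    n_rows, n_cols = len(m), len(m[0])
    rank = 0
    for col in range(n_cols):
        if rank == n_rows:
            break
        # Pick the row with the largest absolute entry in this column.
        pivot = max(range(rank, n_rows), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < tol:
            continue  # no usable pivot in this column
        m[rank], m[pivot] = m[pivot], m[rank]
        for r in range(rank + 1, n_rows):
            factor = m[r][col] / m[rank][col]
            for c in range(col, n_cols):
                m[r][c] -= factor * m[rank][c]
        rank += 1
    return rank


def centred_centroid_rank(k, n_loci, seed=1):
    """Number of dimensions spanned by k population centroids after centring."""
    rng = random.Random(seed)
    # Hypothetical per-population mean allele frequencies (placeholder data).
    centroids = [[rng.random() for _ in range(n_loci)] for _ in range(k)]
    # Grand mean across populations, assuming equal sample sizes.
    grand_mean = [sum(col) / k for col in zip(*centroids)]
    centred = [[x - g for x, g in zip(row, grand_mean)] for row in centroids]
    return matrix_rank(centred)


if __name__ == "__main__":
    for k in range(2, 7):
        print(k, "populations ->", centred_centroid_rank(k, n_loci=50), "dimensions")
```

Because the centred centroids sum to zero, one of them is always a linear combination of the others, which is why the among-population differences occupy no more than k−1 axes.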
Methods
Metapopulations with different migration rates and levels of genetic differentiation were simulated using fastsimcoal v2.7.
Simulated individuals were imported into R v4.1.2 for downstream analysis.
Usage notes
See the README for details on setting up the computing environment and the working directory, and on executing the scripts.