Genetic diversity and evolutionary patterns of Taraxacum kok-saghyz Rodin
Zhang, Zhibin (2022), Genetic diversity and evolutionary patterns of Taraxacum kok-saghyz Rodin, Dryad, Dataset, https://doi.org/10.5061/dryad.qnk98sfgh
Taraxacum kok-saghyz Rodin (TKS) is an important potential alternative source of natural inulin and rubber production, which has great significance for the production of industrial products. In this study, we sequenced 58 wild TKS individuals collected from four different geography regions worldwide to elucidate the population structure, genetic diversity and the patterns of evolution. Also, the first flowering time, crown diameter, morphological characteristics of leaf and scape of all TKS individuals were measured and evaluated statistically. Phylogenetic analysis based on SNPs and cluster analysis based on agronomic traits showed that all 58 TKS individuals could be roughly divided into three distinct groups: I) Zhaosu county in Xinjiang (population AB, including a few individuals from population C and D); II) Tekes county in Xinjiang (population C); and III) Tuzkol lake in Kazakhstan (population D). Population D exhibited a closer genetic relationship with population C compared to population AB. Genetic diversity analysis further revealed that population expansion from C and D to AB occurred, as well as gene flow between them. Additionally, some natural selection regions were identified in AB population. Function annotation of candidate genes identified in these regions revealed that they mainly participated in biological regulation processes, such as transporter activity, structural molecule activity and molecular function regulator. We speculated that the genes identified in selective sweep regions may contributed to TKS adaptation to the Yili River Valley of Xinjiang. In general, this study provides new insights in clarifying population structure and genetic diversity analysis of TKS using SNP molecular markers and agronomic traits.
Additional file 1: Table S1. Summary information of the 58 TKS individuals.
Additional file 2: Table S2. Statistics information of SNP detected in this study
Additional file 3: Table S3. Annotation information of 80 candidate genes in selective sweep regions.
Additional file 4: Table S4. Analysis of molecular variance.
Additional file 5: Table S5. Population differentiation index Fst.
Additional file 6: Table S6. KEGG pathway enrichment analysis for 80 candidate genes in the identified selective sweep regions.