Data from: Genetic diversity and population structure of the Taigan dog breed
Data files
Oct 08, 2025 version files 6.74 MB
-
README.md
2.59 KB
-
taigan_only.bed
344.23 KB
-
taigan_only.bim
6.40 MB
-
taigan_only.fam
163 B
Abstract
The Taigan is an ancient sighthound breed native to the Tien Shan Mountains in Kyrgyzstan and adapted to hunting at high altitudes and in rough terrain. Previous studies have provided insights into its phylogenetic relationships, but more data are needed to determine whether the Taigan is genetically distinct from related sighthounds. In this study, we conducted a comprehensive genetic analysis using short tandem repeat (STR) markers and high-density single nucleotide polymorphism (SNP) array data to assess genetic diversity, population structure, and differentiation from other sighthound breeds. The analysis showed high polymorphism and an excess of heterozygosity (F = -0.013), indicating a balanced genetic structure. Bayesian clustering identified five genetic clusters among the Taigans, with no dominant lineage, suggesting a diverse gene pool. PCA and ADMIXTURE assigned the Taigan to the eastern sighthound group, closely clustered with the Kazakh Tazy. Fst analysis identified 95 significant variants between the Taigan and the Kazakh Tazy. Functional annotation revealed enrichment in three signaling pathways: axon guidance, ABC transporter and basal cell carcinoma, indicating genetic differences related to neuronal development, detoxification and epidemal adaptation. These results provide new insights into the genetic traits of the Taigan and suggest that despite its close relationship with the Kazakh Tazy and Afghan Hound, selection pressure, geographic isolation and historical breeding practices may have contributed to its differentiation. Further studies with larger data sets are needed to determine the genetic divergence between the Taigan and the Kazakh Tazy.
Dataset DOI: 10.5061/dryad.zpc866tks
Description of the data and file structure
The data was collected as part of an experimental effort to investigate the genetic diversity and structure of Taigan dogs. SNP whole-genome genotyping was performed using the CanineHD Genotyping BeadChip, and SNP analysis was conducted to compare Taigans with other sighthound breeds.
Files and variables
File: taigan_only.bim
Description: This file contains variant (SNP) information in PLINK binary format.
Columns and Variables:
- CHR (Chromosome) – Chromosome number (1-38 for autosomes, X for sex chromosome, 0 for unknown).
- SNP_ID (SNP Identifier) – Unique identifier for the SNP (e.g., rsID or marker name).
- Genetic_Distance (cM) – Distance in centimorgans (often set to 0 if unknown).
- BP_Position (Base Pair Position) – Genomic coordinate of the SNP in base pairs (bp).
- Allele1 (A1, Reference Allele) – The allele coded as the minor/reference allele.
- Allele2 (A2, Alternate Allele) – The allele coded as the major/alternate allele.
File: taigan_only.fam
Description: This file contains sample information for individuals in the dataset.
Columns and Variables:
- FID (Family ID) – Family identifier (e.g., "Taigan").
- IID (Individual ID) – Unique identifier for each dog (e.g., "TG2", "TG3").
- PID (Paternal ID) – ID of the father (0 if unknown).
- MID (Maternal ID) – ID of the mother (0 if unknown).
- SEX (Sex Code) –
1= Male2= Female0= Unknown
- PHENO (Phenotype Code) –
1= Unaffected2= Affected-9= Missing phenotype (default in PLINK)
File: taigan_only.bed
Description: This is the binary genotype file that stores genotype data in a compressed PLINK format
Code/software
The dataset consists of PLINK binary format files (.bed, .bim, .fam), which require specialized bioinformatics software for processing and analysis.
PLINK (v1.9 or v2.0)
- Description: PLINK is an open-source whole-genome association analysis toolset designed for genetic data. It is used to filter, manipulate, and analyze SNP data from genotyping chips.
- Website: https://www.cog-genomics.org/plink/
Access information
Other publicly accessible locations of the data:
- n/a
Data was derived from the following sources:
- n/a
