Genomic diversity and selection in the racing greyhound of Great Britain
Data files
Feb 13, 2026 version files 1.74 GB
-
69_GB_Greys_GP99.bed
639.44 MB
-
69_GB_Greys_GP99.bim
1.03 GB
-
69_GB_Greys_GP99.fam
3.38 KB
-
Combined_Final.bed
51.98 MB
-
Combined_Final.bim
15.79 MB
-
Combined_Final.fam
10.47 KB
-
README.md
2.47 KB
Abstract
The Greyhound, originally used for hunting, has been selected for competitive racing. Here, we present the first comprehensive population genomic analysis of racing Greyhounds (n=54) in the context of 14 other dog breeds (n=352) using 473K SNP genotypes. Inbreeding in the Greyhound (FROH = 0.47) was higher than most other breeds, reflecting positive selection for athletic traits but also raising concerns about potential health impacts. Although very long runs of homozygosity (ROH) were less common in the Greyhound than some breeds, large ROH islands suggest that selection for advantageous traits is relatively recent. Genomic regions under strong selection overlapped with ROH islands on CFA2, CFA4, CFA9, CFA10, CFA25, and CFA28, and contained candidate genes with marked allele frequency differences relative to other breeds. Notably, the strongest selection signal on CFA25 overlapped with the longest ROH island, encompassing the SLC46A3, POLR1D, FLT3, SLC7A1, MTUS2, LHFPL6, USPL1, and USP12 genes that have biological functions in vision, craniofacial morphology, neurological adaptability, and skeletal, cardiac, and muscle biology, implicating them in shaping the Greyhound’s morphological and athletic phenotype. This study provides insights into the population genetic structure and selection pressures in the racing Greyhound, identifying key genomic regions and candidate genes that may underlie racing capabilities. Importantly for animal welfare, these results provide a framework for managing inbreeding to optimise health and performance for future generations.
Han, H., T.A. Blackett, M.L.H. Campbell, A.R. Holtby, B.A. McGivney, E.W. Hill
Dataset DOI: 10.5061/dryad.s4mw6m9k6
Dataset
This repository contains genotype datasets in PLINK binary format (.bed, .bim, .fam). Two datasets are provided:
- 69_GB_Greys_GP99
- Combined_Final
Each dataset consists of three coordinated files sharing the same filename prefix:
<dataset>.bed # Binary genotype file
<dataset>.bim # Variant information file
<dataset>.fam # Sample information file
These three files must always be used together.
- The .bed file contains the genotype matrix in compressed binary format and cannot be opened as plain text.
- The .bim and .fam files contain variant and sample metadata, respectively, and can be opened as plain text using a text editor.
The .bim file describes the variants (e.g., chromosome, position, alleles), and the .fam file describes the individuals (e.g., sample ID, sex, phenotype). The order of variants in .bim and individuals in .fam corresponds directly to the genotype data stored in .bed.
Detailed specifications of the PLINK binary file format are available at:
https://www.cog-genomics.org/plink/1.9/input#bed
Using PLINK
Basic summary statistics:
Plink --dog --bfile 69_GB_Greys_GP99 --freq
plink --dog --bfile Combined_Final –missing
File list
- 69_GB_Greys_GP99.bed
- 69_GB_Greys_GP99.bim
- 69_GB_Greys_GP99.fam
- Combined_Final.bed
- Combined_Final.bim
- Combined_Final.fam
File descriptions
69_GB_Greys_GP99:
- 69 Great Britain racing greyhounds
- Approximately 34 million SNPs
- Derived from imputed low-pass whole-genome sequencing after quality control for genotype probability (GP) scores > 0.99
- Imputed genotypes converted to hard genotype calls for PLINK export
- This dataset represents high-density genome-wide variation for Great Britain (GB) racing greyhounds and was generated to enable comparative population genomics analyses through merging with the other dog breeds datasets
Combined_Final:
- 547,156 SNPs
- Generated after merging:
- 69_GB_Greys_GP99
- Public dog dataset from Morrill et al., 2022
- SNP harmonization, filtering, and quality control were applied
- This dataset is prepared for population genetic and comparative analyses.
