Genomic prediction in the wild: a case study in Soay sheep
Data files
Nov 11, 2021 version files 151.08 MB
- coatcolour.bed (42.52 MB)
- coatcolour.bim (1.32 MB)
- coatcolour.fam (793.34 KB)
- coatpattern.bed (42.52 MB)
- coatpattern.bim (1.32 MB)
- coatpattern.fam (793.34 KB)
- foreleg.bed (10.22 MB)
- foreleg.bim (1.34 MB)
- foreleg.fam (382.09 KB)
- hindleg.bed (10.33 MB)
- hindleg.bim (1.34 MB)
- hindleg.fam (389.83 KB)
- hornlength.bed (4.30 MB)
- hornlength.bim (1.35 MB)
- hornlength.fam (173.44 KB)
- jaw.bed (8.19 MB)
- jaw.bim (1.34 MB)
- jaw.fam (305.25 KB)
- metacarpal.bed (8.12 MB)
- metacarpal.bim (1.35 MB)
- metacarpal.fam (303.42 KB)
- Readme.txt (898 B)
- weight.bed (10.61 MB)
- weight.bim (1.34 MB)
- weight.fam (397.62 KB)
Feb 23, 2022 version files 151.08 MB
Abstract
Genomic prediction, the technique whereby the genetic component of an individual’s phenotype is estimated from its genome, has revolutionised animal and plant breeding and medical genetics. However, despite being introduced nearly two decades ago, it has hardly been adopted by the evolutionary genetics community studying wild organisms. Here, genomic prediction is performed on eight traits in a wild population of Soay sheep. The population has been the focus of an evolutionary ecology study lasting more than 30 years, and there is already considerable understanding of the genetic architecture of the focal Mendelian and quantitative traits. We show that the accuracy of genomic prediction is high for all traits, but especially for those with segregating loci of large effect. Five different methods are compared, and the two methods that can accommodate zero-effect and large-effect loci in the same model tend to perform best. If the accuracy of genomic prediction is similar in other wild populations, then there is a real opportunity to enable pedigree-free molecular quantitative genetics research in many more wild populations; currently the literature is dominated by studies that have required decades of field data collection to generate sufficiently deep pedigrees. Finally, some of the potential applications of genomic prediction in wild populations are discussed.
Methods
The dataset contains SNP genotype and phenotype information in PLINK binary format, with one .bed/.bim/.fam file set per trait. The data were collected from a wild population of Soay sheep, as part of a long-term study that has been running since 1985. SNP data are from the Illumina Ovine SNP50 BeadChip array. Descriptions of the file contents are provided in Readme.txt.
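For readers who want to inspect the genotypes without installing PLINK, the following is a minimal Python sketch of how a binary file set can be decoded. It assumes the files follow the standard PLINK 1 variant-major layout (a three-byte header, then two bits per sample for each SNP); the function names `count_lines`, `decode_bed` and `read_plink` are illustrative and are not part of the dataset or of any library. Readme.txt remains the authoritative description of the file contents.

```python
import math

# Header of a PLINK 1 .bed file stored in variant-major order.
BED_MAGIC = bytes([0x6C, 0x1B, 0x01])

# Two-bit genotype code -> number of copies of allele A1 (None = missing).
CODE_TO_A1_COUNT = {0b00: 2, 0b01: None, 0b10: 1, 0b11: 0}


def count_lines(path):
    """Count non-empty lines: one per sample in .fam, one per SNP in .bim."""
    with open(path) as handle:
        return sum(1 for line in handle if line.strip())


def decode_bed(data, n_samples, n_variants):
    """Decode raw .bed bytes into one list of A1 allele counts per SNP."""
    if data[:3] != BED_MAGIC:
        raise ValueError("not a variant-major PLINK 1 .bed file")
    # Each SNP occupies a whole number of bytes, four samples per byte.
    bytes_per_variant = math.ceil(n_samples / 4)
    expected = 3 + bytes_per_variant * n_variants
    if len(data) != expected:
        raise ValueError(f"expected {expected} bytes, found {len(data)}")
    genotypes = []
    for v in range(n_variants):
        start = 3 + v * bytes_per_variant
        block = data[start:start + bytes_per_variant]
        row = []
        for s in range(n_samples):
            # Samples are packed starting from the low-order bits of each byte.
            code = (block[s // 4] >> (2 * (s % 4))) & 0b11
            row.append(CODE_TO_A1_COUNT[code])
        genotypes.append(row)
    return genotypes


def read_plink(prefix):
    """Read prefix.bed using sample and SNP counts from prefix.fam/.bim."""
    n_samples = count_lines(prefix + ".fam")
    n_variants = count_lines(prefix + ".bim")
    with open(prefix + ".bed", "rb") as handle:
        return decode_bed(handle.read(), n_samples, n_variants)
```

With the files downloaded to the working directory, a call such as `read_plink("weight")` would return a list with one entry per SNP, each holding one A1 allele count per individual. This pure-Python loop favours clarity over speed; for the larger file sets a dedicated reader or PLINK itself is likely to be more practical.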
Usage notes
Users are strongly advised to contact the corresponding author (j.slate@sheffield.ac.uk) for further details on the long-running St Kilda Soay sheep project and the methods used in the field and laboratory.