Skip to main content
Dryad

Population genomics of an emergent tri-species hybrid zone

Cite this dataset

Natola, Libby; Seneviratne, Sampath; Irwin, Darren (2022). Population genomics of an emergent tri-species hybrid zone [Dataset]. Dryad. https://doi.org/10.5061/dryad.prr4xgxpz

Abstract

Isolating barriers that drive speciation are commonly studied in the context of two-species hybrid zones. There is however evidence that more complex introgressive relationships are common in nature. Here, we use field observations and genomic analysis, including the sequencing and assembly of a novel reference genome, to study an emergent hybrid zone involving two colliding hybrid zones of three woodpecker species: red-breasted, red-naped, and yellow-bellied sapsuckers (Sphyrapicus ruber, S. nuchalis, and S. varius). Surveys of the area surrounding Prince George, British Columbia, Canada, show that all three species are sympatric, and Genotyping-by-Sequencing identifies hybrids from each species pair and birds with ancestry from all three species. Observations of phenotypes and genotypes of mated pairs provide evidence for assortative mating, though there is some heterospecific pairing. Hybridization is more extensive in this tri-species hybrid zone than in two di-species hybrid zones. However, there is no evidence of a hybrid swarm, and admixture is constrained to contact zones, so we classify this region as a tension zone and invoke selection against hybrids as a likely mechanism maintaining species boundaries. Analysis of sapsucker age classes does not show disadvantages in hybrid survival to adulthood, so we speculate the selection upholding the tension zone may involve hybrid fecundity. Gene flow among all sapsuckers in di-species hybrid zones suggests introgression likely occurred before the formation of this tri-species hybrid zone and might result from bridge hybridization, vagrancies, or other three-species interactions.

Methods

See README for details on use of data files and scripts. Multiplexed *.fastq.gz files are described in the data processing scripts, but for re-use please see individual demultiplexed files from NCBI SRA database number SUB11950717. The pg_variants* files are called within the R PCA scripts. Descriptions of all variables in .csv data files are included in csv_variables_descriptions.txt. 

Usage notes

Canu v 1.9, Purge Haplotigs, Repeat Masker v 4.1.1, BUSCO v 5.1.2, Satsuma v 2.0, VCFtools v 0.1.16, GATK v 3.8, ADMIXTURE v 1.3.0, R,  Ternary v 1.2.2, R scripts from Irwin et al. 2016 and 2018.

Funding

Natural Sciences and Engineering Research Council, Award: RGPIN-2017-03919

Natural Sciences and Engineering Research Council, Award: RGPAS-2017-507830

Society of Canadian Ornithologists, Award: Baillie Award

American Museum of Natural History, Award: Chapman Award

Explorers Club, Award: Mamont Grant

Society for the Study of Evolution, Award: RC Lewontin Early Award

Warner and Hildegard Hesse, Award: Graduate Research Award