Skip to main content
Dryad

Synchronized file of allele frequencies for: Using seasonal genomic changes to understand historical adaptation to new environments: parallel selection on stickleback in highly-variable estuaries

Cite this dataset

Garcia-Elfring, Alan et al. (2021). Synchronized file of allele frequencies for: Using seasonal genomic changes to understand historical adaptation to new environments: parallel selection on stickleback in highly-variable estuaries [Dataset]. Dryad. https://doi.org/10.5061/dryad.fbg79cntg

Abstract

Parallel evolution is considered strong evidence for natural selection. However, few studies have investigated the process of parallel selection as it plays out in real time. The common approach is to study historical signatures of selection in populations already well adapted to different environments. Here, to document selection under natural conditions, we study six populations of threespine stickleback (Gasterosteus aculeatus) inhabiting bar-built estuaries that undergo seasonal cycles of environmental changes. Estuaries are periodically isolated from the ocean due to sandbar formation during dry summer months, with concurrent environmental shifts that resemble the long-term changes associated with postglacial colonization of freshwater habitats by marine populations. We used pooled whole-genome sequencing (Pool-WGS) to track seasonal allele frequency changes in six of these populations and search for signatures of natural selection. We found consistent changes in allele frequency across estuaries, suggesting a potential role for parallel selection. Functional enrichment among candidate genes included transmembrane ion transport and calcium binding, which are important for osmoregulation and ion balance. The genomic changes that occur in threespine stickleback from bar-built estuaries could provide a glimpse into the early stages of adaptation that have occurred in many historical marine to freshwater transitions.

Methods

We sampled stickleback from six bar-built estuaries at two time points (i.e., 12 pools). We applied pooled sequencing of whole-genomes (12 pools). Fst estimates were produced using the fst-sliding.pl program of Popoolation2 (https://sourceforge.net/p/popoolation2/wiki/Manual/). The scripts used to analyze the results are available at https://github.com/garfring.

Usage notes

This is the synchronized file created using the mpileup2sync.jar program of Popoolation2. The synchronized file contains the allele frequencies for all positions in the reference genome and for all populations being analyzed. Only informative SNPs have been kept. Population order: Waddell T1 = column 1, Waddell T2 = column 7; Lombardi T1 = column 2, Lombardi T2 = column 4; Old Dairy T1 = column 3, Old Dairy T2 = column 6; Younger T1 = column 5, Younger T2 = column 12; Scott T1 = column 8, Scott T2 = column 9; Laguna T1 = column 10, Laguna T2 = column 11.

Funding

Canada Research Chairs

Natural Sciences and Engineering Research Council

NOAA Cooperative Institute for Marine Ecosystems and Climate

NOAA Cooperative Institute for Marine Ecosystems and Climate