Skip to main content

Natural selection shapes variation in genome-wide recombination rate in Drosophila pseudoobscura

Cite this dataset

Samuk, Kieran; Manzano-Winkler, Brenda; Ritz, Kathryn R.; Noor, Mohamed A.F. (2020). Natural selection shapes variation in genome-wide recombination rate in Drosophila pseudoobscura [Dataset]. Dryad.


While recombination is widely recognized to be a key modulator of numerous evolutionary phenomena, we have a poor understanding of how recombination rate itself varies and evolves within a species. Here, we performed a comprehensive study of recombination rate (rate of meiotic crossing over) in two natural populations of Drosophila pseudoobscura from Utah and Arizona, USA. We used an amplicon sequencing approach to obtain high-quality genotypes in approximately 8000 individual backcrossed offspring (17 mapping populations with roughly 530 individuals each), for which we then quantified crossovers. Interestingly, variation in recombination rate within and between populations largely manifested as differences in genome-wide recombination rate rather than remodeling of the local recombination landscape. Comparing populations, we discovered individuals from the Utah population displayed on average 8% higher crossover rates than the Arizona population, a statistically significant difference. Using a QST-FST analysis, we found that this difference in crossover rate was dramatically higher than expected under neutrality, indicating that this difference may have been driven by natural selection. Finally, using a combination of short and long read whole-genome sequencing, we found no significant association between crossover rate and structural variation at the 200-400kb scale. Our results demonstrate that (1) there is abundant variation in genome-wide crossover rate in natural populations, (2) at the 200-400kb scale, recombination rate appears to vary largely genome wide, rather than in specific intervals and (3) interpopulation differences in recombination rate may be the result of local adaptation.


- Primers were designed as described in Samuk et al. 2020

- We followed the GT-seq library prep protocol in Campbell et al. (2017)

- We sequenced the libraries in two NextSeq 500 lanes at the Duke Genomics Core

- We demultiplexed the libraries using fastq-multx, as shown here

Usage notes

- Data set contains paired-end amplicon reads for ~9800 individuals from 19 separate mapping populations of Drosophila pseudoobscura.

- Files are named like r1.AFC_14_15_A01.fq:

r1 = Read 1 (paired end)
AFC = The source population, AFC = American Fork Canyon, UT or MC = Madera Canyon, AZ.
14 = The inbred line ID (this is the identifier for the mapping population).
15 = A 96 well plate identifier (individuals were collected and prepped in plates)
A01 = A well indentifier, which also serves as an individual ID.


National Science Foundation, Award: DEB-1545627

National Science Foundation, Award: DEB-1754022

National Science Foundation, Award: DEB-1754439