Structural variants underlie parallel adaptation following global invasion
Data files
Aug 29, 2025 version files 45.12 KB
-
DatasetA_commongarden.csv
41.15 KB
-
README.md
3.97 KB
Abstract
Rapid adaptation during invasion has historically been considered limited and unpredictable. We leverage whole-genome sequencing of >2600 plants across six continents to investigate the relative roles of colonization history and adaptation during the worldwide invasion of Trifolium repens. Introduced populations contain high levels of genetic variation with independent colonization histories evident on different continents. Five large structural variants on three chromosomes exist as standing genetic variation within the native range, and exhibit strong signatures of parallel climate-associated adaptation across continents. Common gardens in the native and introduced ranges demonstrate that three structural variants exhibit patterns of selection consistent with local adaptation across each range. Our results provide strong evidence that rapid and parallel adaptation during invasion is caused by large-effect structural variants introduced throughout the world.
This is a comma delimited file (.csv) which contains fitness data for 472 indviduals from four field common gardens located in the native and introduced ranges of Trifolium repens, as described in the main text and methods of Battlay et al. in review. Briefly, seeds from 46 populations were collected along a latitudinal transect in North American by Lucas Albano and 47 populations were collected along a latitudinal transect in Western Europe by Simon Innes. Seeds from each of these populations were grown in a greenhouse refresher generation and crossed to other plants from the same population. The resulting lines were grown in four common gardens (Uppsula, Sweden; Montpellier, France; Mississauga, Canada; Lafayette, USA). The methods of Battlay et al. describes the fitness metrics measured for each individual. Tissue was taken for DNA extractions for each individual. Methods describing DNA extractions, library construction, and sequencing are described in the methods of Battlay et al..
Description of the Data and file structure
Each row of this spreadsheet represents an individual plant from one of the four common gardens except for the first row header. Missing data is coded as '999'
Questions? Please contact Nic Kooyers (nkooyers@gmail.com).
Format (Column -- Variable -- Description)
Column A -- Garden -- The common garden location (Lafayette, Toronto, Montpellier, Uppsula)
Column B -- Continent -- The continent where the common garden was located (NAM = North America or EUR = Europe)
Column C -- File -- The .bam file associated with the individual.
Column D -- Population -- The population where the individual originated.
Column E -- Latitude -- The latitude of the population from column D. (decimal degrees)
Column F -- Longitude -- The longitude of the population from column D. (decimal degrees)
Column H -- Hb7a1 -- The genotype at the hb7a1 haploblock. (0 = homozygous for the reference allele, 0.5 = heterozygous, 1 = homozygous for the alternative allele)
Column I -- Hb7a2 -- The genotype at the hb7a2 haploblock. (0 = homozygous for the reference allele, 0.5 = heterozygous, 1 = homozygous for the alternative allele)
Column J -- Hb7b -- The genotype at the hb7b haploblock. (0 = homozygous for the reference allele, 0.5 = heterozygous, 1 = homozygous for the alternative allele)
Column K -- Hb9 -- The genotype at the hb9 haploblock. (0 = homozygous for the reference allele, 0.5 = heterozygous, 1 = homozygous for the alternative allele)
Column L -- Hb13 -- The genotype at the hb13 haploblock. (0 = homozygous for the reference allele, 0.5 = heterozygous, 1 = homozygous for the alternative allele)
Column M -- Survived1st -- Whether the individual survived the first year (0 = No, 1= Yes)
Column N -- GrowthRate -- The growth rate of the individual during the first year of the experiment (cm/day)
Column O -- FloweredFull -- Whether the individual survived to flower during the experiment (0 = No, 1= Yes)
Column P -- FlowerHeadNumberFull -- The number of inflorescences an individual produced during the experiment
Column Q -- SeedSetMassFull -- The mass of seeds that an individual produced during the experiment (grams)
Sharing/access Information
Links to other publicly accessible locations of the data:
https://dataview.ncbi.nlm.nih.gov/object/PRJNA1081485?reviewer=f4t49mnf9207oiuc4i798vlkih
Was data derived from another source?
The allelic data for each structural variant was derived from low coverage Whole Genome Sequencing. The reads for this sequencing can be found at this link:
https://dataview.ncbi.nlm.nih.gov/object/PRJNA1081485?reviewer=f4t49mnf9207oiuc4i798vlkih
This dataset contains fitness data for 472 indviduals from four field common gardens located in the native and introduced ranges of Trifolium repens (white clover), as described in the main text and methods of Battlay et al.. Briefly, seeds from 46 populations were collected along a latitudinal transect in North American by Lucas Albano and 47 populations were collected along a latitudinal transect in Western Europe by Simon Innes. Seeds from each of these populations were grown in a greenhouse refresher generation and crossed to other plants from the same population. The resulting lines were grown in four common gardens (Uppsula, Sweden; Montpellier, France; Mississauga, Canada; Lafayette, USA). The methods of Battlay et al. describes the fitness metrics measured for each individual. Tissue was taken for DNA extractions for each individual. Methods describing DNA extractions, library construction, and sequencing are described in the methods of Battlay et al..
- Battlay, Paul; Hendrickson, Brandon T; Mendez-Reneau, Jonas I et al. (2024). Structural variants underlie parallel adaptation following global invasion [Preprint]. openRxiv. https://doi.org/10.1101/2024.07.09.602765
