Data from: Admixture and environment shape population genetic and phytochemical variation across a conifer hybrid zone
Data files
Feb 12, 2026 version files 19.10 MB
-
juniper_environmental_data.csv
94.61 KB
-
juniper_GCMS_data.csv
250.97 KB
-
juniper_genotype_probabilities.csv
18.75 MB
-
README.md
4.72 KB
Abstract
Ancestry variation in hybrid zones can reflect the causes and genetic basis of reproductive isolation and result in novel phenotypic variation with the potential for extended ecological effects. Junipers (Juniperus) are foundational tree species in many semi-arid landscapes of western North America and often hybridize in zones of secondary contact. Such hybridization can be ecologically significant in foundational tree species, due to the strong genetic control and ecological consequences of plant chemistry. We generated genetic and phytochemical data to analyze hybridization among Juniperus grandis, J. occidentalis, and J. osteosperma in western Nevada and its impact on plant chemistry. We used population genomic data (9,125 SNPs; 326 individuals; 25 populations) to quantify patterns of genetic variation across populations and species and to characterize ancestry variation in hybrids. While populations within species showed little genetic differentiation, the parental species formed distinct, monophyletic lineages with clear phenotypic and ecological differences. Hybrids occupied intermediate environments, contained ancestry from all three parents, and were mainly F1 or backcross hybrids. Phytochemical data (GC-MS; 163 terpenoid compounds) were likewise analyzed to understand the consequences of hybridization for plant chemistry. The parental species and hybrids displayed distinct phytochemical profiles, with hybrids often characterized by a combination of transgressive and intermediate chemical concentrations. Our results illustrate that geography and environment shape hybrid ancestry for a syngameon involving three Juniperus species, and that admixture generates novel phytochemical variation likely to have ecological consequences.
https://doi.org/10.5061/dryad.0vt4b8h91
Description of the data and file structure
This data set contains the genotype probabilities, environmental data, and phenotypic terpenoid data that was used in the analyses in Uckele et al., testing the relationships between geography, environment, and genetic variation across a natural juniper hybrid zone located in the western United States.
All files include a sample_ID column that uniquely identifies each individual sample. Each sample_ID follows this structured format:
- Population ID: A two-letter code (see Supplementary Table 1)
- Separator: An underscore (_)
- Species ID: A two-letter code, where:
- GR = Juniperus grandis
- HY = Hybrid
- OC = Juniperus occidentalis
- OS = Juniperus osteosperma
- Separator: An underscore (_)
- Sample Number: A unique numeric identifier for the individual sample
This standardized naming convention ensures consistency across all datasets and facilitates easy cross-referencing.
Files and variables
File: juniper_environmental_data.csv
Description: This file contains climate variables associated with the geographic coordinates (latitude/longitude) of each sampling population. For each geographic coordinate, 25 climate variables were extracted at 30-arcsec (approximately 800 meters) resolution from PRISM (PRISM Climate Group, 2007) and the Climate Water Deficit Toolbox (Dilts and Yang, 2015).
Variables
- sample_ID: unique identifier for each sample
- mintemp: minimum temperature (degree C)
- maxtemp: maximum temperature
- prcpann: annual precipitation (mm)
- temprang: temperature range
- WS.AETorSWB: Spring ratio of water supply (WS) to the greater of actual evapotranspiration (AET) or soil water balance (SWB)
- AET.CWD: ratio of actual evapotranspiration (AET) to climate water deficit (CWD)
- prcpsum: summer precipitation
- pdAET_SWB: positive difference between actual evapotranspiration (AET) and soil water balance (SWB)
- pcseas: precipitation seasonality
- cumlSWB: cumulative soil water balance (SWB)
- WS.AET: ratio of water supply (WS) to actual evapotranspiration (AET)
- SWB.AET: ratio of soil water balance (SWB) to actual evapotranspiration (AET)
- PET.AET: ratio of potential evapotranspiration (PET) to actual evapotranspiration (AET)
- prcpwin: winter precipitation
- prcpspr: spring precipitation
- prcpfal: fall precipitation
- pdWS_AETorSWB: positive difference between water supply (WS) and the greater of actual evapotranspiration (AET) or soil water balance (SWB)
- mxtmpwin: maximum winter temperature
- mxtmpsum: maximum summer temperature
- mxtmpspr: maximum spring temperature
- mxtmpfal: maximum fall temperature
- mntmpwin: minimum winter temperature
- mntmpsu: minimum summer temperature
- mntmpspr: minimum sprint temperature
- mntmpfal: minimum fall temperature
- cumlWS: cumulative water supply (WS)
- cumlPET: cumulative potential evapotranspiration (PET)
- cumlCWD: cumulative climate water deficit (CWD)
- cumlAET: cumulative actual evapotranspiration (AET)
- monsoon: monsoonality index
- latitude: latitude (degrees)
- longitude: longitude (degrees)
- elevation: elevation (meters)
File: juniper_GCMS_data.csv
Description: This file contains concentration data for 163 terpenoid compounds measured via gas chromatography. Each sample’s chemical profile is represented by concentration values associated with specific retention times.
Variables
- sample_ID: unique identifier for each sample
- Compound Data: The remaining columns correspond to retention times recorded during gas chromatography, with each column representing the concentration of a specific terpenoid compound. The retention time in the column header serves as an identifier for the compound.
File: juniper_genotype_probabilities.csv
Description: This file contains genotype probabilities derived from ddRADseq data using a Bayesian hierarchical model implemented in entropy (Gompert et al. 2014; Shastry et al. 2021). These probabilities provide a measure of certainty for genotype calls across multiple genomic loci.
Variables
- sample_ID: unique identifier for each sample
- Genotype Data: The remaining columns correspond to genomic loci. Each column represents the inferred genotype probabilities at a specific locus. Values range from 0 to 2:
- 0: Indicates a homozygous genotype call for the reference allele.
- 1: Indicates a heterozygous genotype call.
- 2: Indicates a homozygous genotype call for the alternative allele.
