Data from: Admixture and environment shape population genetic and phytochemical variation across a conifer hybrid zone

Published Feb 12, 2026 on Dryad. https://doi.org/10.5061/dryad.0vt4b8h91

Data files

Feb 12, 2026 version files 19.10 MB

juniper_environmental_data.csv

94.61 KB
juniper_GCMS_data.csv

250.97 KB
juniper_genotype_probabilities.csv

18.75 MB
README.md

4.72 KB

Abstract

Ancestry variation in hybrid zones can reflect the causes and genetic basis of reproductive isolation and result in novel phenotypic variation with the potential for extended ecological effects. Junipers (Juniperus) are foundational tree species in many semi-arid landscapes of western North America and often hybridize in zones of secondary contact. Such hybridization can be ecologically significant in foundational tree species, due to the strong genetic control and ecological consequences of plant chemistry. We generated genetic and phytochemical data to analyze hybridization among Juniperus grandis, J. occidentalis, and J. osteosperma in western Nevada and its impact on plant chemistry. We used population genomic data (9,125 SNPs; 326 individuals; 25 populations) to quantify patterns of genetic variation across populations and species and to characterize ancestry variation in hybrids. While populations within species showed little genetic differentiation, the parental species formed distinct, monophyletic lineages with clear phenotypic and ecological differences. Hybrids occupied intermediate environments, contained ancestry from all three parents, and were mainly F1 or backcross hybrids. Phytochemical data (GC-MS; 163 terpenoid compounds) were likewise analyzed to understand the consequences of hybridization for plant chemistry. The parental species and hybrids displayed distinct phytochemical profiles, with hybrids often characterized by a combination of transgressive and intermediate chemical concentrations. Our results illustrate that geography and environment shape hybrid ancestry for a syngameon involving three Juniperus species, and that admixture generates novel phytochemical variation likely to have ecological consequences.

https://doi.org/10.5061/dryad.0vt4b8h91

Description of the data and file structure

This data set contains the genotype probabilities, environmental data, and phenotypic terpenoid data that was used in the analyses in Uckele et al., testing the relationships between geography, environment, and genetic variation across a natural juniper hybrid zone located in the western United States.

All files include a sample_ID column that uniquely identifies each individual sample. Each sample_ID follows this structured format:

Population ID: A two-letter code (see Supplementary Table 1)
Separator: An underscore (_)
Species ID: A two-letter code, where:
- GR = Juniperus grandis
- HY = Hybrid
- OC = Juniperus occidentalis
- OS = Juniperus osteosperma
Separator: An underscore (_)
Sample Number: A unique numeric identifier for the individual sample

This standardized naming convention ensures consistency across all datasets and facilitates easy cross-referencing.

Files and variables

File: juniper_environmental_data.csv

Description: This file contains climate variables associated with the geographic coordinates (latitude/longitude) of each sampling population. For each geographic coordinate, 25 climate variables were extracted at 30-arcsec (approximately 800 meters) resolution from PRISM (PRISM Climate Group, 2007) and the Climate Water Deficit Toolbox (Dilts and Yang, 2015).

Variables

sample_ID: unique identifier for each sample
mintemp: minimum temperature (degree C)
maxtemp: maximum temperature
prcpann: annual precipitation (mm)
temprang: temperature range
WS.AETorSWB: Spring ratio of water supply (WS) to the greater of actual evapotranspiration (AET) or soil water balance (SWB)
AET.CWD: ratio of actual evapotranspiration (AET) to climate water deficit (CWD)
prcpsum: summer precipitation
pdAET_SWB: positive difference between actual evapotranspiration (AET) and soil water balance (SWB)
pcseas: precipitation seasonality
cumlSWB: cumulative soil water balance (SWB)
WS.AET: ratio of water supply (WS) to actual evapotranspiration (AET)
SWB.AET: ratio of soil water balance (SWB) to actual evapotranspiration (AET)
PET.AET: ratio of potential evapotranspiration (PET) to actual evapotranspiration (AET)
prcpwin: winter precipitation
prcpspr: spring precipitation
prcpfal: fall precipitation
pdWS_AETorSWB: positive difference between water supply (WS) and the greater of actual evapotranspiration (AET) or soil water balance (SWB)
mxtmpwin: maximum winter temperature
mxtmpsum: maximum summer temperature
mxtmpspr: maximum spring temperature
mxtmpfal: maximum fall temperature
mntmpwin: minimum winter temperature
mntmpsu: minimum summer temperature
mntmpspr: minimum sprint temperature
mntmpfal: minimum fall temperature
cumlWS: cumulative water supply (WS)
cumlPET: cumulative potential evapotranspiration (PET)
cumlCWD: cumulative climate water deficit (CWD)
cumlAET: cumulative actual evapotranspiration (AET)
monsoon: monsoonality index
latitude: latitude (degrees)
longitude: longitude (degrees)
elevation: elevation (meters)

File: juniper_GCMS_data.csv

Description: This file contains concentration data for 163 terpenoid compounds measured via gas chromatography. Each sample’s chemical profile is represented by concentration values associated with specific retention times.

Variables

sample_ID: unique identifier for each sample
Compound Data: The remaining columns correspond to retention times recorded during gas chromatography, with each column representing the concentration of a specific terpenoid compound. The retention time in the column header serves as an identifier for the compound.

File: juniper_genotype_probabilities.csv

Description: This file contains genotype probabilities derived from ddRADseq data using a Bayesian hierarchical model implemented in entropy (Gompert et al. 2014; Shastry et al. 2021). These probabilities provide a measure of certainty for genotype calls across multiple genomic loci.

Variables

sample_ID: unique identifier for each sample
Genotype Data: The remaining columns correspond to genomic loci. Each column represents the inferred genotype probabilities at a specific locus. Values range from 0 to 2:
- 0: Indicates a homozygous genotype call for the reference allele.
- 1: Indicates a heterozygous genotype call.
- 2: Indicates a homozygous genotype call for the alternative allele.

Data from: Admixture and environment shape population genetic and phytochemical variation across a conifer hybrid zone

Data files

Abstract

README: Data from: Admixture and environment shape population genetic and phytochemical variation across a conifer hybrid zone

Description of the data and file structure

Files and variables

File: juniper_environmental_data.csv

Variables

File: juniper_GCMS_data.csv

Variables

File: juniper_genotype_probabilities.csv

Variables