Genome-wide RAD sequencing resolves the evolutionary history of serrate leaf Juniperus and reveals discordance with chloroplast phylogeny
Uckele, Kathryn; Adams, Robert; Parchman, Thomas; Schwarzbach, Andrea (2021), Genome-wide RAD sequencing resolves the evolutionary history of serrate leaf Juniperus and reveals discordance with chloroplast phylogeny, Dryad, Dataset, https://doi.org/10.5061/dryad.qbzkh18df
Juniper (Juniperus) is an ecologically important conifer genus of the Northern Hemisphere, the members of which are often foundational tree species of arid regions. The serrate leaf margin clade is native to topologically variable regions in North America, where hybridization has likely played a prominent role in their diversification. Here we use a reduced-representation sequencing approach (ddRADseq) to generate a phylogenomic data set for 68 accessions representing all 22 species in the serrate leaf margin clade, as well as a number of close and distant relatives, to improve understanding of diversification in this group. Phylogenetic analyses using three methods (SVDquartets, maximum likelihood, and Bayesian) yielded highly congruent and well-resolved topologies. These phylogenies provided improved resolution relative to past analyses based on Sanger sequencing of nuclear and chloroplast DNA, and were largely consistent with taxonomic expectations based on geography and morphology. Calibration of a Bayesian phylogeny with fossil evidence produced divergence time estimates for the clade consistent with a late Oligocene origin in North America, followed by a period of elevated diversification between 12 and 5 Mya. Comparison of the ddRADseq phylogenies with a phylogeny based on Sanger-sequenced chloroplast DNA revealed five instances of pronounced discordance, illustrating the potential for chloroplast introgression, chloroplast transfer, or incomplete lineage sorting to influence organellar phylogeny. Our results improve understanding of the pattern and tempo of diversification in Juniperus, and highlight the utility of reduced-representation sequencing for resolving phylogenetic relationships in non-model organisms with reticulation and recent divergence.
We sampled leaf material from 68 Juniperus individuals representing 22/22 serrate juniper species, 2/5 serrate juniper varieties, and 14 additional outgroup taxa. Leaf samples were dried and stored in silica gel, and DNA was extracted with Qiagen DNeasy Plant Mini Kits (Qiagen Inc., Valencia, CA, USA). Reduced representation libraries for Illumina sequencing were constructed using a ddRADseq method. Libraries were size-selected for fragments between 350 and 450 bp in length and two lanes of single-end 100-base sequencing were executed using an Illumina HiSeq 2500 platform.
We used the tapioca pipeline (https://github.com/ncgr/tapioca) to identify and discard Illumina primer/adaptor sequences and potential biological sequence contaminants (e.g., PhiX, E. coli) and discarded all reads which aligned to the Juniperus squamata chloroplast genome (GenBank Accession Number MK085509). We demultiplexed reads to individual using a custom Perl script which trims restriction site-associated bases and parses the reads according to unique barcode sequences assigned to each individual.
We used ipyRAD (Eaton 2014) to de novo assemble our reads according to the parameter settings provided in the ipyrad param files. Two alignments were generated, one including all sampled taxa, and the other including a subset of only serrate juniper taxa: 1) All 68 individual samples were included in an assembly in which much more missing data was permitted and 2) 22 individual samples representing the 22 serrate juniper species were included in an assembly in which loci with missing data were more rigorously filtered.
Baylor University, Award: 032512
National Science Foundation, Award: 1650114