Skip to main content

The complex history of genome duplication and hybridization in North American gray treefrogs

Cite this dataset

Booker, William W. et al. (2020). The complex history of genome duplication and hybridization in North American gray treefrogs [Dataset]. Dryad.


Polyploid speciation has played an important role in evolutionary history across the tree of life, yet there remain large gaps in our understanding of how polyploid species form and persist. While systematic studies have been conducted in numerous polyploid complexes, recent advances in sequencing technology have demonstrated that conclusions from data-limited studies may be spurious and misleading. The North American gray treefrog complex, consisting of the diploid Hyla chrysoscelis and the tetraploid Hyla versicolor, has long been used as a model system in a variety of biological fields, yet all taxonomic studies to date were conducted with only a few loci from nuclear and mitochondrial genomes. Here, we utilized anchored hybrid enrichment and high-throughput sequencing to capture hundreds of loci along with whole mitochondrial genomes to investigate the evolutionary history of this complex. We used several phylogenetic and population genetic methods, including coalescent simulations and testing of polyploid speciation models with Approximate Bayesian Computation (ABC), to determine that H. versicolor was most likely formed via autopolyploidization from a now extinct lineage of H. chrysoscelis. We also uncovered evidence of significant hybridization between diploids and tetraploids where they co-occur, and show that historical hybridization between these groups led to the re-formation of distinct polyploid lineages following the initial whole genome duplication event. Our study indicates that a wide variety of methods and explicit model testing of polyploid histories can greatly facilitate efforts to uncover the evolutionary history of polyploid complexes.


Sequence data was collected using Anchored Hybrid Enrichment and sequenced on an Illumina Hi-seq. Data were processed using the standard Anchored pipeline as described in the article.

Usage notes

SNPs.rar: SNP files used for STRUCTURE and other SNP analyses. 

GTF_trees_alignments: Trees and alignments for all phylogenetic analyses.

Allelephaser.jar: Custom allele phaser script. First used in Pyron (2016) (doi:10.5061/ dryad.51v22).

GTF_Polyploid_ABC: All scripts and example data to conduct polyploid ABC model testing analyses.

ABC_locusinfo.txt: Loci and number of individuals for each lineage used for polyploid ABC analyses.


National Science Foundation, Award: DGE 1449440

National Science Foundation, Award: DEB 1120516

Missouri Research Council, Award: URC-15-106

Missouri Research Council, Award: URC-15-106