Data for: Additional complexity in historic and contemporary gene flow among hoary, Vancouver Island, and Olympic marmots revealed by microsatellites and ultraconserved elements
Data files
Mar 19, 2026 version files 11.62 GB
-
Clean_UCE_reads.zip
11.50 GB
-
Marmot_filtered_vcf75p.vcf.recode.vcf
2.15 MB
-
README.md
4.88 KB
-
UCE_contigs.zip
112.04 MB
Abstract
Alpine species are inordinately threatened by habitat loss and precipitation changes resulting from climate change. In North America’s Pacific Northwest, three closely related alpine mammal species – hoary, Olympic, and Vancouver Island marmots – may face greater negative impacts of climate change relative to species found at lower elevations. Phylogenetic studies have found these three species form a monophyletic complex; however, discordant evolutionary histories between mitochondrial and nuclear genes suggest that gene flow may have occurred between these marmot species. Furthermore, mitochondrial data find two reciprocally monophyletic mitochondrial clades (haploclades) of hoary marmots. Nuclear data do not recover this pattern, and interspecific relationships among the markers are not consistent. We used nine microsatellite loci and ultraconserved elements (UCEs) to explore patterns of nuclear gene flow among marmot species in the PNW. Analyses of microsatellite data indicate no current gene flow between hoary and Vancouver Island marmots or between hoary and Olympic marmots, but do reveal nuclear gene flow among hoary marmot haploclades. Additionally, UCE data reveal historic gene flow between hoary and Vancouver Island marmots. Overall, our results suggest that historic mitochondrial introgression between hoary and Vancouver Island marmots, as well as male-biased dispersal, are driving mito-nuclear discordance in this species complex.
https://doi.org/10.5061/dryad.tb2rbp0bc
Description of the data and file structure
This dataset accompanies the publication:
Additional complexity in historic and contemporary gene flow among hoary, Vancouver Island, and Olympic marmots revealed by microsatellites and ultraconserved elements. Natalie M. Hamilton, Nicholas J. Kerhoulas, Kathryn M. Everson, Aren M. Gunderson, and Link E. Olson
Files and variables
File: Supplementary_Table_S4_UCE.xlsx
Description: This file contains metadata for UCE reads, supplementary table from publication
Variables
- Species name abbreviations: M. caligata = Marmota caligata, M. flaviventris = Marmota flaviventris, M. olympus = Marmota olympus, M. vancouverensis = Marmota vancouverensis
- Collection ID:Museum Catalog Number. Additional information about each specimen can be found by using this information to search on VertNet.Org and/or Arctosdb.org. MSB = Museum of Southwestern Biology, UAM = University of Alaska Museum, UWBM = University of Washington Burke Museum, and ROM = Royal Ontario Museum.
- Collection ID also includes museum catalog numbers, typical of all museum collections.
- Country, State, Lat, Lon: provide sampling location information including the latitude (Lat) and longitude (Lon)
- Clade: this refers to two mitochondrial lineages that are found in the original text, con = continental mitochondrial lineage, cos = coastal mitochondrial lineage
File: Clean_UCE_reads.zip
Description: This file contains cleaned UCE reads from specimens included in the study. Cleaned reads refer to sequences where adapters and low-quality reads have been removed. Reads were cleaned using the illumiprocessor function in Phyluce. Cleaned UCE reads can be used in downstream analyses, such as SNP calling.
Variables:
- Additional information on sample ID can be found in Supplementary_Table_S4_UCE.xlsx, briefly, folders are named species_museumcode
- Folders named by sample ID
- Within each, folder "split-adapter-quality-trimmed" contains READ1 and READ2 files for each sample. These hold reads that remain in a pair (reads are in the same consecutive order in each file). The READ-singleton file holds READ1 or READ2 reads that lost their "mate" or "paired-read" because of trimming or removal.
File: UCE_contigs.zip
Description: This file contains assembled UCE reads corresponding to individuals in UCE_reads.zip, with the addition of the Marmota monax reference genome from GenBank (https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_021218885.2/). Individuals are named according to species and Museum Catalog Number.
- For more information on specimens see Supplementary_Table_S4_UCE.xlsx
File: Marmot_filtered_vcf75p.vcf.recode.vcf
Description: This file contains the input data for Dsuite analysis. This is a vcf file that has been filtered for quality reads that are present in 75% of the data. This is the filtered output of the SNP calling pipeline.
Code/software
UCE processing and analyses
A walkthrough of UCE processing can be found on GitHub: https://github.com/NmHamilton/Marmot_Historic_Contemporary_GeneFlow. Below is a short description of
- Associated workflow is available: https://github.com/NmHamilton/Marmot_Historic_Contemporary_GeneFlow?tab=readme-ov-file#Calling-snps
- Software needed: Phyluce, Python 2, BWA, GATK 3.8.1.0, GATK 3.46, vcftools, Dsuite
- Associated data: UCE_reads.zip, UCE_contigs.zip, Marmot_filtered_vcf75p.vcf.recode.vcf
Access information
Other publicly accessible locations of the data:
Data was derived from the following sources:
- UCE data: https://datadryad.org/stash/dataset/doi:10.5061/dryad.6hdr7sr56, public domain
DNA extraction and loci amplification
We amplified 15 microsatellite loci previously developed for use in marmots and closely related species (Supplementary Table S1). Amplification of microsatellite loci followed Schuelke (2000). For fragment analyses, reactions were combined by using 1.5 μL from each of three PCR reactions of independent loci (with different fluorescent dyes) for the same individual and brought to a final volume of 10 μL with 5.5 μL Hi-Di Formamide (Applied Biosystems, Foster City, CA). Microsatellite loci were analyzed at the DNA Facility on Science Hill at Yale University using a Liz-500 size standard (Applied Biosystems, Foster City, California). Alleles were scored using GeneMarker v. 2.6.0 (SoftGenetics, State College, PA).
To further delimit the distribution of hoary marmot mtDNA clades, we also extracted DNA from dried study skins or adventitious tissue removed from skeletal material from 138 museum specimens for which archived fresh tissues were not available (Supplementary Table S2). DNA extractions were conducted in the University of Alaska Museum’s ancient DNA extraction facility using the Promega DNA IQ system (Promega Corp., Madison, WI) following the manufacturer’s protocol. We used one general mammalian primer and one specifically designed for M. caligata to amplify and sequence overlapping fragments of the mitochondrial cytochrome b gene (Table S1). PCR thermal-cycling parameters followed Kerhoulas and Arbogast (2010) but used an annealing temperature of 48 degrees Celsius for 1.5 minutes (see Supplementary Data for additional details). We then determined clade assignments via Sanger sequencing and/or the use of a restriction enzyme. Then, we added 3.75 uL of NE Buffer 3 and 0.35 uL of restriction enzyme Bs1I (New England BioLabs, Ipswich, MA) to 25 uL of each successful PCR reaction and incubated at 55o C for 70 minutes, followed by 80 degree Celsius for 20 minutes, and finally held at 4 degree Celsius Selection of this restriction enzyme was based on sequence data from 167 M. caligata specimens with known clade membership (Kerhoulas et al. 2015). Gel electrophoresis was used to visualize the size and number of fragments for clade assignment. Members of the coastal haplotype clade lacked a binding site for this restriction enzyme and produced a single (222 bp) band, while members of the continental haplotype clade had a single binding site and produced bands at 91 and 131 bp.
Ultraconserved element data were provided by Kendall Mills and Kathryn Everson from Mills et al. 2023: https://doi.org/10.1016/j.ympev.2023.107785
