A comprehensive phylogeny of Pieridae butterflies reveals a strong correlation between diversification and temperature
Data files
Mar 29, 2024 version files 13.10 MB
Abstract
Temperature is thought to be a key factor influencing global species richness patterns. We investigate the link between temperature and diversification in the butterfly Family Pieridae by combining next-generation sequences and published molecular data with fine-grained distribution data. We sampled nearly 600 pierid species to build the most comprehensive molecular phylogeny of the Family and curated a distribution dataset of more than 800,000 occurrences. We found strong evidence that species in environments with more stable daily temperatures or cooler maximum temperatures in the warm seasons have higher speciation rates. Furthermore, speciation and extinction rates decreased in tandem with global temperatures through geological time, resulting in a constant net diversification.
README: A comprehensive phylogeny of Pieridae butterflies reveals a strong correlation between diversification and temperature
Tree inference
- Individual loci directory: folder containing 425 .fas files
- FcC_supermatrix.phy: Supermatrix concatenating the 425 locus files
- IQTREE_425.tre: Best of 100 IQ-TREE analyses partitioned and modeled according to ModelFinder
- ModelFinder_PartitionScheme_Models.nex: ModelFinder partition scheme and selected models of nucleotide evolution
- Sample_List.xlsx: Specimen information for samples included in this study. Locality and voucher institution information is provided for samples sequenced for this study and included in Kawahara et al. (2022). Taxonomy follows Lamas (2015).
- ABRI = African Butterfly Research Institute, Kenya
- AMNH = American Museum of Natural History, United States
- ANHRT = African Natural History Research Trust
- CCNY = City College of New York, United States
- CLDZ = Coleção de Lepidoptera, Departamento de Zoologia, Universidade Federal do Rio Grande do Sul, Brazil
- CSIC-UPF = Consejo Superior de Investigaciones Científicas, Universidad Pompeu Fabra, Spain
- MCZ = Museum of Comparative Zoology, Harvard University, United States
- MGCL = McGuire Center for Lepidoptera & Biodiversity, University of Florida, United States
- NCBS = National Centre for Biological Sciences, India
- UMD = University of Maryland, United States
- UMUT = University Museum, University of Tokyo, Japan
- ZFMK = Zoological Research Museum Alexander Koenig, Germany
- Some species have sequence data from more than one specimen. When available, specimen information for these taxa is separated with a forward slash.
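The relationship between the per-locus .fas files and the concatenated supermatrix can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the pipeline used to build FcC_supermatrix.phy: the function names `parse_fasta` and `concatenate` are hypothetical, the toy sequences are made up, and padding absent taxa with `-` is one common convention that may differ from the authors' settings.

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a {name: sequence} dict."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Keep only the first whitespace-delimited token as the taxon name
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {n: "".join(parts) for n, parts in records.items()}


def concatenate(loci):
    """Concatenate aligned loci; taxa absent from a locus are padded with '-'.

    Returns (supermatrix, partitions), where partitions holds 1-based,
    inclusive (start, end) column ranges, one per locus.
    """
    taxa = sorted({taxon for locus in loci for taxon in locus})
    supermatrix = {taxon: [] for taxon in taxa}
    partitions = []
    start = 1
    for locus in loci:
        lengths = {len(seq) for seq in locus.values()}
        if len(lengths) != 1:
            raise ValueError("sequences within a locus must be aligned")
        length = lengths.pop()
        for taxon in taxa:
            supermatrix[taxon].append(locus.get(taxon, "-" * length))
        partitions.append((start, start + length - 1))
        start += length
    return {t: "".join(p) for t, p in supermatrix.items()}, partitions


# Toy data, made up for illustration only
locus_a = parse_fasta(">sp1\nACGT\n>sp2\nAC-T\n")
locus_b = parse_fasta(">sp1\nGG\n>sp3\nGA\n")
matrix, parts = concatenate([locus_a, locus_b])
```

The returned column ranges correspond to the kind of per-locus boundaries that a partition scheme file such as ModelFinder_PartitionScheme_Models.nex builds on.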
Divergence time estimation
- dated_MCC_1Kite_renamed.tre: Consensus of 100 TreePL analyses using secondary calibrations derived from Kawahara et al. (2019)
- dated_espeland_renamed.tre: Consensus of 100 TreePL analyses using secondary calibrations derived from Espeland et al. (2018)
- dated_MCC_fossils_renamed.tre: Consensus of 100 TreePL analyses using primary fossil calibrations
Trait data
- Bio_variables.xlsx: Total number of records and temperature values per species for all WorldClim variables used in this study: annual mean temperature (BIO1), mean diurnal range (BIO2), maximum temperature in the warmest month (BIO5), minimum temperature in the coldest month (BIO6), and temperature annual range (BIO7).
- Record_references.xlsx: Original occurrence data sources with digital object identifiers (DOIs), gathered with the 'occCitation' function of the R package occCite.
Diversification
- BAMM_fractions.txt: Fraction file used with BAMM with percentages of species in the tree for each genus as well as an overall genus-level backbone sampling fraction.
- Pieridae_RevBayes_rates_MAP.tre: Maximum a posteriori (MAP) tree from the LSBDS analysis inferred with RevBayes, 10,000 generations
- QuaSSE_details.xlsx: Detailed results of QuaSSE correlations between speciation and the WorldClim temperature variables: annual mean temperature (BIO1), mean diurnal range (BIO2), maximum temperature in the warmest month (BIO5), minimum temperature in the coldest month (BIO6), and temperature annual range (BIO7). Parameters are shown for the best-fit model for each analysis.
- AIC: Akaike Information Criterion
- lambda: speciation rate
- l.m: slope (linear model)
- mu: extinction rate
- l.xmid: inflection point (sigmoidal plot)
- y0/l.y0: asymptotic value at low x
- y1/l.y1: asymptotic value at high x
- l.r: steepness of sigmoid
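To make the parameter list above concrete, the following Python sketch evaluates a linear and a sigmoidal speciation function of a temperature variable x. It assumes the `l.` prefix marks parameters of the speciation function and that the sigmoid has the usual four-parameter form used in QuaSSE-style analyses; the function names `linear_rate` and `sigmoid_rate`, the intercept `c`, and all numeric values are hypothetical and are not taken from QuaSSE_details.xlsx.

```python
import math


def linear_rate(x, c, m):
    """Linear speciation function: intercept c plus slope m (l.m) times x."""
    return c + m * x


def sigmoid_rate(x, y0, y1, xmid, r):
    """Sigmoidal speciation function.

    y0   : asymptotic rate at low x  (l.y0)
    y1   : asymptotic rate at high x (l.y1)
    xmid : inflection point          (l.xmid)
    r    : steepness of the sigmoid  (l.r)
    """
    return y0 + (y1 - y0) / (1.0 + math.exp(r * (xmid - x)))


# Made-up parameter values, for illustration only
params = dict(y0=0.1, y1=0.3, xmid=20.0, r=0.5)
rate_low = sigmoid_rate(-60.0, **params)   # far below the inflection point
rate_mid = sigmoid_rate(20.0, **params)    # exactly at the inflection point
rate_high = sigmoid_rate(100.0, **params)  # far above the inflection point
```

At x = l.xmid the rate sits halfway between the two asymptotes, and a larger l.r makes the transition between them sharper. When y1 is smaller than y0, the same function describes a speciation rate that declines as the variable increases.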
Methods
Sequence data were obtained through anchored hybrid enrichment de novo sequencing and from publicly available sequence data.