Coalescent species’ delimitation in mimetic beetles of the genus Ceroglossus Solier (Coleoptera: Carabidae): The importance of species’ delineation to the understanding of the drivers of phenotypic diversity
Data files
May 08, 2026 version files 9.60 GB
- barcode_file_pyrad.txt (2.17 KB)
- bu8ssp16ind_398L_1to10.txt (586.07 KB)
- bu8ssp16ind_500L.ctl (1.78 KB)
- bu8ssp16ind_ch2_398L_1to10.txt (660.34 KB)
- bu8ssp16ind_ch2_500L.ctl (1.81 KB)
- ch12ssp21ind_398L_1to10.txt (778.64 KB)
- ch12ssp21ind_500L.ctl (1.86 KB)
- ch12ssp21ind_dar7_398L_1to10.txt (852.92 KB)
- ch12ssp21ind_dar7_398L.ctl (1.89 KB)
- CML1A_R1.fastq.gz (9.59 GB)
- dar16_398L_1to10.txt (598.55 KB)
- dar16_398L.ctl (1.78 KB)
- dar8ssp16ind_398L_1to10.txt (601.05 KB)
- dar8ssp16ind_500L_guided.ctl (1.80 KB)
- dar8ssp16ind_ch2_398L_1to10.txt (676.21 KB)
- dar8ssp16ind_ch2_500L_guided.ctl (1.84 KB)
- Imap.bu8ssp16ind_500L.txt (105 B)
- Imap.bu8ssp16ind_ch2_500L.txt (118 B)
- Imap.ch12ssp21ind_500L.txt (144 B)
- Imap.ch12ssp21ind_dar7_398L.txt (160 B)
- Imap.dar16.txt (123 B)
- Imap.dar8ssp16ind_ch2.txt (137 B)
- Imap.dar8ssp16ind.txt (122 B)
- params.txt (3.78 KB)
- README.md (6.30 KB)
- SVDq_buq16ch2_c90d5m8p.20snp10.nex (142.96 KB)
- SVDq_chil21dar3_c90d5m8p.20snp10.nex (480.51 KB)
- SVDq_dar16_c90d5m8p3snp10.nex (286.36 KB)
- SVDq_dargroup18ch2_c90d5m8p.20snp10.nex (363.14 KB)
Abstract
This dataset contains the raw genomic data generated via RADseq from beetles of the genus Ceroglossus from southern Chile. It also includes the files used in the phylogenetic and species delimitation analyses conducted on these beetles.
The raw genomic data were generated from DNA extracted from 96 individuals belonging to 29 subspecies/color morphs and three species groups: C. chilensis, C. buqueti, and C. darwini. Specimens were collected in southern Chile and are preserved in 95% ethanol at the Museum of Zoology, University of Michigan (UMMZ).
DNA was extracted from the legs (usually one leg per individual) using a DNeasy Blood and Tissue Kit (Cat. No. 69581; Qiagen Inc.) following the manufacturer’s protocol.
DNA was double-digested with the EcoRI and MseI restriction enzymes, followed by ligation of Illumina adaptor sequences and unique 10-base-pair barcodes. Ligation products were pooled across samples, size-selected on a Pippin Prep (Sage Science) machine (fragments of 180-280 bp), and amplified with iProof™ High-Fidelity DNA Polymerase (BIO-RAD) for 12 cycles. The libraries were sequenced at The Centre for Applied Genomics (Hospital for Sick Children, Toronto, Canada) on the Illumina HiSeq2500 platform to generate 100-bp, single-end reads. To increase the data for some low-coverage samples, a second library, size-selected for fragments of 350-450 bp, was also sequenced.
More than 241 million reads were produced from the two lanes of Illumina sequencing (124 million and 116 million reads for the first and second libraries, respectively), which were subsequently pooled and processed using iPyrad to obtain files ready for phylogenetic analyses.
Dataset DOI: 10.5061/dryad.t76hdr8c2
Description of the data and file structure
The data consist of raw genomic data and files used for phylogenetics and species delimitation analyses.
Files and variables
File: params.txt
Description: file with all parameter settings used for processing the data in the pyRAD software.
File: ch12ssp21ind_500L.ctl
Description: CTL file for BPP species delimitation analysis for the Ceroglossus chilensis group without outgroup.
File: ch12ssp21ind_dar7_398L.ctl
Description: CTL file for BPP species delimitation analysis for the Ceroglossus chilensis group with outgroup.
File: bu8ssp16ind_ch2_500L.ctl
Description: CTL file for BPP species delimitation analysis for the Ceroglossus buqueti group with outgroup.
File: dar16_398L.ctl
Description: CTL file for BPP species delimitation analysis within the C. darwini darwini subspecies.
File: bu8ssp16ind_500L.ctl
Description: CTL file for BPP species delimitation analysis for the Ceroglossus buqueti group without outgroup.
File: SVDq_buq16ch2_c90d5m8p.20snp10.nex
Description: Input (.nex) file for SVDquartets phylogenetic analysis for the C. buqueti group. It contains the genetic data and subspecies assignments.
File: SVDq_dar16_c90d5m8p3snp10.nex
Description: Input (.nex) file for SVDquartets phylogenetic analysis. It contains the genetic data and subspecies assignments for the subspecies C. darwini darwini. This file was used in the delimitation of sites as putative species within a single color race.
File: dar8ssp16ind_500L_guided.ctl
Description: CTL file for BPP species delimitation analysis for the Ceroglossus darwini species group without outgroup.
File: dar8ssp16ind_ch2_500L_guided.ctl
Description: CTL file for BPP species delimitation analysis for the Ceroglossus darwini species group with outgroup.
File: SVDq_chil21dar3_c90d5m8p.20snp10.nex
Description: Input (.nex) file for SVDquartets phylogenetic analysis for the C. chilensis group. It contains the genetic data and subspecies assignments.
File: barcode_file_pyrad.txt
Description: File containing the barcodes with their associated sample labels. This file was used to sort the raw genomic data into specimen folders during the first step of pyRAD.
File: SVDq_dargroup18ch2_c90d5m8p.20snp10.nex
Description: Input (.nex) file for SVDquartets phylogenetic analysis for the C. darwini group. It contains the genetic data and subspecies assignments.
File: ch12ssp21ind_398L_1to10.txt
Description: loci file for BPP species delimitation analysis for the Ceroglossus chilensis group without outgroup.
File: bu8ssp16ind_398L_1to10.txt
Description: loci file for BPP species delimitation analysis for the Ceroglossus buqueti group without outgroup.
File: Imap.ch12ssp21ind_500L.txt
Description: Imap file for BPP species delimitation analysis for the Ceroglossus chilensis group without outgroup.
File: Imap.dar16.txt
Description: Imap file for BPP species delimitation analysis within a single C. darwini darwini subspecies to test whether population structure was picked up as species structure.
File: bu8ssp16ind_ch2_398L_1to10.txt
Description: loci file for BPP species delimitation analysis for the Ceroglossus buqueti group with outgroup.
File: Imap.bu8ssp16ind_500L.txt
Description: Imap file for BPP species delimitation analysis for the Ceroglossus buqueti group without outgroup.
File: ch12ssp21ind_dar7_398L_1to10.txt
Description: loci file for BPP species delimitation analysis for the Ceroglossus chilensis group with outgroup.
File: dar16_398L_1to10.txt
Description: loci file for BPP species delimitation analysis within C. darwini darwini subspecies.
File: dar8ssp16ind_398L_1to10.txt
Description: loci file for BPP species delimitation analysis for the Ceroglossus darwini species group without outgroup.
File: dar8ssp16ind_ch2_398L_1to10.txt
Description: loci file for BPP species delimitation analysis for the Ceroglossus darwini species group with outgroup.
File: Imap.ch12ssp21ind_dar7_398L.txt
Description: Imap file for BPP species delimitation analysis for the Ceroglossus chilensis group with outgroup.
File: Imap.bu8ssp16ind_ch2_500L.txt
Description: Imap file for BPP species delimitation analysis for the Ceroglossus buqueti group with outgroup.
File: Imap.dar8ssp16ind_ch2.txt
Description: Imap file for BPP species delimitation analysis for the Ceroglossus darwini species group with outgroup.
File: Imap.dar8ssp16ind.txt
Description: Imap file for BPP species delimitation analysis for the Ceroglossus darwini species group without outgroup.
File: CML1A_R1.fastq.gz
Description: Raw data for Ceroglossus species obtained from the first library, for which fragments of 180-280 bp were selected on the Pippin Prep. Raw data from a second round of sequencing, targeting fragments of 350-450 bp, are available from the corresponding author upon request.
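As a quick integrity check after downloading the raw data, the number of reads in the gzipped FASTQ file can be counted. The following is a minimal sketch, not part of the original pipeline; the function name `count_fastq_reads` is hypothetical, and it assumes the standard four-lines-per-record FASTQ layout.

```python
import gzip


def count_fastq_reads(path):
    """Count reads in a gzipped FASTQ file.

    Assumption: each record spans exactly four lines
    (header, sequence, separator, qualities).
    """
    n_lines = 0
    with gzip.open(path, "rt") as handle:
        for line in handle:
            # Ignore blank lines, e.g. a trailing newline at end of file.
            if line.strip():
                n_lines += 1
    if n_lines % 4 != 0:
        raise ValueError(
            f"{path}: {n_lines} non-empty lines is not a multiple of 4; "
            "the file may be truncated"
        )
    return n_lines // 4
```

For example, `count_fastq_reads("CML1A_R1.fastq.gz")` streams the file line by line, so it does not need to hold the 9.59 GB archive in memory, although a full pass will take some time.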
Code/software
Raw genomic data were processed in pyRAD v.3.0.5 (Eaton, 2014) to generate files ready to use in phylogenetic software.
SVDq*.nex files were used to run SVDquartets in PAUP* version 4.0 (Swofford, 2002) to estimate species trees. These trees were then used as guide trees in the BPP species delimitation analyses.
The BPP analyses required 3 files (these files are included for each of our analyses):
- A *.ctl file that contains all the parameters required to run the analysis. It also includes the names of the other two files required to run the BPP analysis.
- A loci file (*.txt) that contains the genetic data in PHYLIP format (https://phylipweb.github.io/phylip/).
- An Imap file (*.txt) containing the association between individuals and putative species.
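
To illustrate how an Imap file ties individuals to putative species, the sketch below groups individuals by species label. It assumes the usual two-column, whitespace-separated layout (individual identifier, then species label); the function name `parse_imap` and the identifiers in the example are hypothetical and are not taken from the deposited files.

```python
from collections import defaultdict


def parse_imap(text):
    """Group individual identifiers by putative species.

    Assumption: each non-empty line holds two whitespace-separated
    fields, the individual identifier followed by the species label.
    """
    groups = defaultdict(list)
    for line_number, line in enumerate(text.splitlines(), start=1):
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        if len(fields) != 2:
            raise ValueError(
                f"line {line_number}: expected 2 fields, found {len(fields)}"
            )
        individual, species = fields
        groups[species].append(individual)
    return dict(groups)


# Hypothetical Imap content, for illustration only.
example = """\
ind1 spA
ind2 spA
ind3 spB
"""

for species, individuals in parse_imap(example).items():
    print(species, len(individuals), individuals)
```

To inspect one of the deposited files, its contents can be read with `open("Imap.dar16.txt").read()` and passed to `parse_imap`; the per-species counts can then be compared with the species and sample numbers listed in the matching *.ctl file.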
