Supplementary data from: Species delimitation under allopatry: Genomic insights within and across continents in Lepidoptera
Data files
May 27, 2025 version files 73.55 MB
-
A.caja_species_samples.tsv
690 B
-
A.glandon_Barcodes_alignment_with_outgroup.fas
11.33 KB
-
A.glandon_Barcodes_alignment_with_outgroup.fas.treefile
768 B
-
A.glandon_species_samples.tsv
687 B
-
Agriades_glandon_new_astral.tre
817 B
-
Agriades_glandon_new_concat.fas
2.29 MB
-
Agriades_glandon_new_loci.treefile
355.87 KB
-
Agriades_glandon_partitioned_tree.treefile
976 B
-
Agriades_glandon_snps.vcf
2.03 MB
-
Arctia_caja_astral_corrected_filtered.tre
1.02 KB
-
Arctia_caja_Barcodes_alignment_with_outgroup.fas
13.93 KB
-
Arctia_caja_Barcodes_alignment_with_outgroup.fas.treefile
861 B
-
Arctia_caja_loci_corrected.treefile
833.30 KB
-
Arctia_caja_new_concat_corrected.fas
6.82 MB
-
Arctia_caja_partitioned_tree_corrected.treefile
4.85 KB
-
Arctia_caja_snps_filtered.vcf
689.21 KB
-
Arctia_caja_snps.vcf
4.99 MB
-
C_tullia_new_astral.tre
989 B
-
C_tullia_new_concat.fas
3.17 MB
-
C_tullia_new_loci.treefile
453.85 KB
-
C.sororiata_species_samples.tsv
899 B
-
C.tullia_Barcodes_alignment_with_outgroup_1.fas
11.94 KB
-
C.tullia_Barcodes_alignment_with_outgroup_1.fas.treefile
864 B
-
C.tullia_species_samples.tsv
790 B
-
Carsia_sororiata_Barcodes_alignment_with_outgroup.fas
14.81 KB
-
Carsia_sororiata_Barcodes_alignment_with_outgroup.fas.treefile
985 B
-
Carsia_sororiata_new_astral.tre
1.11 KB
-
Carsia_sororiata_new_concat.fas
4.49 MB
-
Carsia_sororiata_new_loci.treefile
641.43 KB
-
Carsia_sororiata_partitioned_tree.treefile
1.33 KB
-
Carsia_sororiata_snps_filtered.vcf
746.21 KB
-
Carsia_sororiata_snps.vcf
2.55 MB
-
Coenonympha_tullia_partitioned_tree.treefile
1.16 KB
-
Coenonympha_tullia_snps_filtered.vcf
770.84 KB
-
Coenonympha_tullia_snps.vcf
4.83 MB
-
E.sudetica_species_samples.tsv
669 B
-
E.vittaria_Barcodes_alignment_with_outgroup.fas
9.06 KB
-
E.vittaria_Barcodes_alignment_with_outgroup.fas.treefile
575 B
-
E.vittaria_species_samples.tsv
541 B
-
Elophos_vittaria_new_astral.tre
665 B
-
Elophos_vittaria_new_concat.fas
1.95 MB
-
Elophos_vittaria_new_loci.treefile
309.05 KB
-
Elophos_vittaria_partitioned_tree.treefile
791 B
-
Elophos_vittaria_snps_filtered.vcf
500.71 KB
-
Elophos_vittaria_snps.vcf
1.61 MB
-
Eudonia_sudetica_Barcodes_alignment_with_outgroup.fas
12.07 KB
-
Eudonia_sudetica_Barcodes_alignment_with_outgroup.fas.treefile
704 B
-
Eudonia_sudetica_new_astral.tre
910 B
-
Eudonia_sudetica_new_concat.fas
3.18 MB
-
Eudonia_sudetica_new_loci.treefile
429.83 KB
-
Eudonia_sudetica_partitioned_tree.treefile
1.10 KB
-
Eudonia_sudetica_snps_filtered.vcf
487.48 KB
-
Eudonia_sudetica_snps.vcf
2.12 MB
-
M_brunneata_new_astral.tre
1.16 KB
-
M_brunneata_new_concat.fas
3.81 MB
-
M_brunneata_new_loci.treefile
538.22 KB
-
M_brunneata_partitioned_tree.treefile
1.35 KB
-
M.brunneata_Barcodes_alignment_with_outgroup.fas
14.81 KB
-
M.brunneata_Barcodes_alignment_with_outgroup.fas.treefile
938 B
-
M.brunneata_species_samples.tsv
928 B
-
Macaria_brunneata_snps_filtered.vcf
757.46 KB
-
Macaria_brunneata_snps.vcf
2.63 MB
-
README.md
2.48 KB
-
S_hochenwarthi_new_astral.tre
955 B
-
S_hochenwarthi_new_concat.fas
3.63 MB
-
S_hochenwarthi_new_loci.treefile
550.84 KB
-
S_hochenwarthi_partitioned_tree.treefile
1.12 KB
-
S.hochenwarthi_Barcodes_alignment_with_outgroup.fas
12.09 KB
-
S.hochenwarthi_Barcodes_alignment_with_outgroup.fas.treefile
707 B
-
S.hochenwarthi_species_samples.tsv
773 B
-
Syngrapha_hochenwarthi_snps_filtered.vcf
609.87 KB
-
Syngrapha_hochenwarthi_snps.vcf
2.14 MB
-
X.lorezi_Barcodes_alignment_with_outgroup_new.fas
11.50 KB
-
X.lorezi_Barcodes_alignment_with_outgroup_new.fas.treefile
832 B
-
X.lorezi_species_samples.tsv
596 B
-
X.sepciosa_species_samples.tsv
675 B
-
X.speciosa_Barcodes_alignment_with_outgroup.fas
12.35 KB
-
X.speciosa_Barcodes_alignment_with_outgroup.fas.treefile
770 B
-
Xestia_lorezi_new_astral.tre
764 B
-
Xestia_lorezi_new_concat.fas
3.37 MB
-
Xestia_lorezi_new_loci.treefile
473.05 KB
-
Xestia_lorezi_partitioned_tree.treefile
918 B
-
Xestia_lorezi_snps.vcf
1.46 MB
-
Xestia_speciosa_new_astral.tre
857 B
-
Xestia_speciosa_new_concat.fas
3.33 MB
-
Xestia_speciosa_new_loci.treefile
498.67 KB
-
Xestia_speciosa_partitioned_tree.treefile
1.01 KB
-
Xestia_speciosa_snps_filtered.vcf
548.38 KB
-
Xestia_speciosa_snps.vcf
2.80 MB
Abstract
Delimitation of allopatric populations into species remains subjective and largely arbitrary. Many cold-adapted species from the subarctic and Central and Southern European Mountain systems display frequent allopatry with disjunct distributions of their populations. The same concerns Holarctic species which are many and which almost regularly show various degrees of differentiation between the continents. In this study, we analyze high-throughput target enrichment data for ten groups of arctic-alpine and Holarctic lepidopteran species sampled from four main regions across the Holarctic realm – Fennoscandia, North America, Alps and Altai. We first aimed to assess whether the genetic differences in the nuclear genome reflect observed DNA barcode divergences and second, whether the gap between population and species-level differences can be dissected using genomic data. We compared the phylogenetic trees and uncorrected pairwise genetic distances obtained from target enrichment and the mitochondrial cytochrome oxidase I (COI) barcodes for each of the study species. Additionally, we also performed a suite of population genetic and species delimitation analyses to further shed light on patterns of intraspecific variation using a large number of nuclear markers. We observed that in about one half of the cases, DNA barcodes tended to show phylogenetic relationships similar to the target enrichment markers. We report varying levels of nuclear genetic differentiation among the populations analyzed, starting from low differentiation of geographically separated populations to the deeper separation of some Nearctic population and further arctic-alpine disjunction. Given that no single consistent pattern emerged across different case studies, we demonstrate that the delimitation of allopatric populations into species could be done much more efficiently and in a consistent manner if based on a large set of universal genetic loci, which would help in reaching standards for taxonomic delimitation of allopatric populations.
Description of files
This folder contains the data files associated with the manuscript titled 'Species Delimitation Under Allopatry: Genomic Divergences Within and Across Continents in Lepidoptera'. The description of the contents of the folder is given below. (NOTE: Upon secondary inspection of individual locus alignments for Arctia caja, non-homologous sequences corresponding to the sample represented by a long-branch in ML phylogenetic tree were found for Arctia caja. Such sequences were removed and the concatenated and loci files were constructed again for Arctia caja).
- Files ending with '_new_astral.tre' - Contains phylogenetic trees inferred using ASTRAL for all 9 species except Arctia caja. The ASTRAL tree for Arctia caja is represented in file 'Arctia_caja_astral_corrected_filtered.tre'
- Files ending with '_Barcode_alignment_with_outgroup.fas' - Contains barcode alignments used as an input to infer barcode tree
- Files ending with '_Barcodes_alignment_with_outgroup.fas.treefile' - Contains Maximum Likelihood phylogenetic trees inferred for barcodes using IQtree
- Files ending with '_new_concat.fas' - Concatenated supermatrix and input file used to infer Maximum likelihood tree for target enrichment dataset using IQtree
- Files ending with '_new_loci.treefile' - Gene trees inferred for each study species using IQtree
- Files ending with '_partitioned_tree.treefile' - Maximum Likelihood phylogenetic trees inferred for each study species using IQtree
- vcf files - SNPs in vcf file format for each study species after first and second filtering (ending with _filtered) step respectively
- Supplementary_file_v.2.docx - Supplementary file
- tsv files ending with '_species_samples' contains a key for each species linking specimen names from treefiles above to their respective sample codes
- File 'Arctia_caja_new_concat_corrected.fas' is corrected concatenated supermatrix file for Arctia caja after removing non-homologous sequences from its alignment. Similarly, 'Arctia_caja_loci_corrected.treefile' is corrected loci file and 'Arctia_caja_astral_corrected_filtered.tre' is ASTRAL tree based on these loci.
The target enrichment raw data is available from SRA under BioProject PRJNA922447. All the newly generated barcodes from this study have been deposited on BOLD database.
The nuclear genomic data was generated using target enrichment method. Barcodes were generated through standard barcoding lab protocol and sanger sequencing. The raw target enrichment data was processed using TEnriAn pipeline (Mayer et al. 2021) which is specifically designed for processing of target enrichment data. The downstream phylogenetic and other analyses were carried out using commonly used programs such as IQtree (version 2), STRUCTURE, etc., and species delimitation analyses were carried out using programs tr2 and SODA.
The barcode sequences were processed using Geneious Prime to generate the multiple sequence alignments and phylogenetic trees were generated using IQtree 2.
For other analyses such as PCA and generating P-distance plots, R packages adegenet and ggplot2 were used.
To open the Maximum likelihood phylogenetic trees and ASTRAL trees (ending with .treefile or .tre), FigTree program would be needed. Nucleotide alignments can be viewed using any alignment editor such as AliView, Geneious. The vcf files can be read using vcfR package in R.
- Joshi, Mukta; Espeland, Marianne; Huemer, Peter et al. (2024). Species delimitation under allopatry: genomic insights within and across continents in Lepidoptera. Insect Systematics and Diversity. https://doi.org/10.1093/isd/ixae027
