Skip to main content
Dryad

Data from: Integrating megabarcoding and metabarcoding to unlock diversity and distribution data shortfalls in dark taxa

Data files

Jun 24, 2026 version files 731.90 MB

Click names to download individual files

Abstract

Persistent biodiversity data shortfalls undermine our capacity to detect species, map their distributions, and characterise their spatial genetic structure, limiting robust biogeographic analyses and the development of effective conservation strategies, particularly in hyperdiverse invertebrate groups where hidden diversity remains largely undocumented. This study develops and demonstrates the potential of an integrated high-throughput sequencing (HTS) framework to improve the representation of hidden diversity in regional species inventories and to help close critical gaps in our understanding of species distributions and genetic diversity from a conservation biogeography perspective. Focusing on the Canary Islands (Spain), the workflow combines megabarcoding of more than 4000 mesofauna specimens to generate a curated species-level molecular reference library with community DNA metabarcoding of 168 soil samples, enabling consistent taxonomic assignment across insular landscapes and increasing the spatial and genetic resolution of occurrence data. We identified 145 species of mites and springtails, including 49 species newly recorded for the archipelago and numerous genetically distinct lineages likely representing undescribed taxa. Integration of the barcode library with metabarcoding data produced 1,440 species occurrences, revealing extensive distributional gaps, multiple range expansions, and strong within-island phylogeographic structuring, indicating prevalent diversification at fine spatial scales. These results highlight a deep, taxonomically broad underestimation of soil biodiversity and demonstrate that this integrative approach provides a transferable model for advancing the biogeography, evolutionary understanding, and conservation of dark and cryptic taxa across broad taxonomic and conservation-relevant contexts.