Towards DNA metabarcoding-based haplotype for monitoring terrestrial arthropod communities
Data files
May 13, 2025 version files 22.37 MB
-
Metabarcoding_Arthropods_Traps.tar.xz
22.36 MB
-
README.md
1.21 KB
Abstract
Whole-organism community DNA (wocDNA) metabarcoding has become a necessary and powerful tool to study mixed communities from bulk samples by detecting short DNA sequences. This review integrates the use of arthropods as ecological indicators through wocDNA metabarcoding, covering key methodological aspects from sampling design to bioinformatics, and highlighting ASV-based (Amplicon Sequence Variants) pipelines for incorporating haplotype-level variation. The success of wocDNA metabarcoding relies on proper sampling methods, good laboratory practices, cleaning of bulk samples, selecting suitable arthropod sizes, efficient DNA extractions, accurate library quantification, appropriate genetic markers for accurate taxonomic identification, and effective sequence filtering. Incorporating haplotype data into monitoring unveils intraspecific variation that could complement species-level diversity, thus enhancing conventional analyses. Choosing abundance- or incidence-based approaches is key for selecting appropriate statistical methods. WocDNA metabarcoding provides biodiversity estimates for entire communities and allows researchers to derive insights that can inform the formulation or refinement of hypotheses. These refined hypotheses, based on the obtained biodiversity estimates, can then guide ecosystem conservation.
This dataset includes the sequences aligned, ASVs tables, and ASVs with Threshold generated from 33 whole communities of biodiversity soups in Nevado de Toluca using different trap types. For additional information on the sample codes and scripts, please see at GitHub repository https://github.com/Nancy-Galvez/MetaBC_Traps.
Data content:
1AlignedSeqcontains the results of0.4Searching_Stop_Codons.txt. This file contains each ASV dataset was aligned in Geneious for searching codon stops. You can use the FASTA files in the0.5to_get_ASV_tables.shscript.2ASVs_Tablecontains the results of the0.5to_get_ASV_tables.shanalysis. Each community table was generated with read-counts (haplotype abundance) of each retained ASV for the eight orders by matching ASVs against the complete collection of reads. After each ASV table, you can use0.6to_get_UPGMAtree_GMYC_MH_lineages_All.rand3.to_get_conservative_threhold_on_original_all.r3ASVThresholdCommunitiescontains all matrix tables used in4.to_get_Diversity_All.r
Enjoy it
