Supplementary data from: Monitoring insect biodiversity and comparison of sampling strategies using metabarcoding: a case study in the Yanshan Mountains, China
Data files
Mar 16, 2026 version files 832.95 MB
-
Original_data.zip
829.42 MB
-
README.md
4.76 KB
-
Table_S1__The_details_of_the_sampling_localities.xlsx
14.40 KB
-
Table_S2_The_details_for_solution_after_DNA_extraction.xlsx
21.97 KB
-
Table_S3_Characteristics_of_74_samples_after_sequencing_and_OTUs_picking.xlsx
13.20 KB
-
Table_S4__The_entire_OTUs_table_for_all_specimens_that_the_sequence_has_not_been_rarefied.xlsx
1.84 MB
-
Table_S5__The_insect_OTU_table_with_even_sequencing_depth.xlsx
1.59 MB
-
Table_S6__Alpha_diversity_indexes_for_each_sample.xlsx
14.02 KB
-
Table_S7_The_exclusively_detected_families_in_the_groups_collected_by_different_methods.xlsx
10.83 KB
-
Table_S8_The_exclusively_detected_families_in_the_groups_collected_from_different_habitats.xlsx
10.30 KB
-
Table_S9__Anosim_tests_for_composition_differences_among_different_habitat_categories.xlsx
9.98 KB
Abstract
Insects are the richest and most diverse group of animals and yet there remains a lack, not only of systematic research into their distribution across some key regions of the planet, but of standardized sampling strategies for their study. The Yanshan Mountains, being the boundary range between the Inner Mongolian Plateau and the North China Plain, present an indispensable piece of the insect biodiversity puzzle: both requiring systematic study and offering opportunities for the development of standardized methodologies. This is the first use of DNA metabarcoding to survey the insect biodiversity of the Yanshan Mountains. The study focuses on differences of community composition among samples collected via different methods and from different habitat types. In total, 74 bulk samples were collected from five habitat types (scrubland, woodland, wetland, farmland, and grassland) using three collection methods (sweep netting, Malaise traps, and light traps). After DNA extraction, PCR amplification, sequencing, and diversity analysis were performed, a total of 7,427 Operational Taxonomic Units (OTUs) at ≥ 97% sequence similarity level were delimited, of which, 7,083 OTUs were identified as belonging to Insecta. Orthoptera, Diptera, Coleoptera and Hemiptera were found to be the dominant orders according to community composition analysis. Nonmetric multidimensional scaling (NMDS) analysis based on Bray-Curtis distances revealed highly divergent estimates of insect community composition among samples differentiated by collection method (R = 0.524802, p = 0.001), but nonsignificant difference among samples differentiated according to habitat (R = 0.051102, p = 0.078). The study therefore appears to indicate that the concurrent use of varied collection methods is essential to the accurate monitoring of insect biodiversity.
Dataset DOI: 10.5061/dryad.x3ffbg7p7
Description of the data and file structure
A total of 74 bulk samples were collected from five habitat types (scrubland, woodland, wetland, farmland, and grassland) using three collection methods (sweep netting, Malaise traps, and light traps) from the Yanshan Mountains, China. After DNA extraction, PCR amplification, sequencing, and diversity analysis were performed. Nonmetric multidimensional scaling (NMDS) analysis based on Bray–Curtis distances revealed highly divergent estimates of insect community composition among samples differentiated by the collection method, but nonsignificant difference among samples differentiated according to habitat.
Files and variables
Table_S1__The_details_of_the_sampling_localities.xlsx
Description: S: sweep netting group, MT: Malaise trap group, LT: light trap group; E: East longitude, N: North latitude, ASL: Above Sea Level
Table_S2_The_details_for_solution_after_DNA_extraction.xlsx
Description: S: sweep netting group, MT: Malaise trap group, LT: light trap group.
Table_S3_Characteristics_of_74_samples_after_sequencing_and_OTUs_picking.xlsx
Description: S: sweep netting group, MT: Malaise trap group, LT: light trap group.
Table_S4__The_entire_OTUs_table_for_all_specimens_that_the_sequence_has_not_been_rarefied.xlsx
Description: S: sweep netting group, MT: Malaise trap group, LT: light trap group.
Table_S5__The_insect_OTU_table_with_even_sequencing_depth.xlsx
Description: S: sweep netting group, MT: Malaise trap group, LT: light trap group; HQS: High-quality sequences, OTUs: operational taxonomic units
Table_S6__Alpha_diversity_indexes_for_each_sample.xlsx
Description: S: sweep netting group, MT: Malaise trap group, LT: light trap group.
Table_S7_The_exclusively_detected_families_in_the_groups_collected_by_different_methods.xlsx
Description: S: sweep netting group, MT: Malaise trap group, LT: light trap group.
Table_S8_The_exclusively_detected_families_in_the_groups_collected_from_different_habitats.xlsx
Description: S: sweep netting group, MT: Malaise trap group, LT: light trap group.
Table_S9__Anosim_tests_for_composition_differences_among_different_habitat_categories.xlsx
Description: S: sweep netting group, MT: Malaise trap group, LT: light trap group.
File: Original_data.zip
Description: Metabarcoding sequencing data was in .fq format. It can be opened with notepad.
The raw data with the DNA sequences presented in this study are under Bioproject PRJNA804856, BioSample accession SAMN25810026-SAMN25810099 and NCBI SRA: SRR17956399-SRR17956326.
Files on Zenodo (Supplemental Information)
These were generated in personalbio genescloud of https://www.genescloud.cn.
- S: sweep netting group, MT: Malaise trap group, LT: light trap group.
File: FIGURE_S1_Statistics_of_insect_community_composition_at_the_family_level.jpg
File: FIGURE_S2_Statistics_of_insect_community_composition_at_the_genus_level.jpg
File: FIGURE_S3_Rarefaction_curves.jpg
File: FIGURE_S4_Venn_diagram_of_the_observed_OTUs_from_three_sampling_methods.jpg
File:FIGURE_S5_Analysis_of_NMDS_plots_based_on_Jaccard_distances_of_the_three_collection_methods.jpg
File:FIGURE_S6_Heat_map_displaying_the_relative_abundance_of_samples_comparing_three_different_collection_methods.jpg
File: FIGURE_S7_Alpha_diversity_estimates_of_the_five_different_habitats.jpg
File: FIGURE_S8_Analysis_of_NMDS_plots_based_on_Bray–Curtis_distance_of_the_five_different_habitats.jpg
- NMDS: Non-metric multidimensional scaling
File:FIGURE_S9_Analysis_of_NMDS_plots_based_on_Jaccard_distances_of_the_five_different_habitats.jpg
- NMDS: Non-metric multidimensional scaling
Code/software
All of the files (Figures S1-9 and Tables S1-9) are .jpg and .xlsx files, so Microsoft can be used to view the data.
The original data downloaded from NCBI https://www.ncbi.nlm.nih.gov/ can be opened with notepad.
Access information
Other publicly accessible locations of the data: No.
Data was derived from the following sources:
https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA804856
https://www.ncbi.nlm.nih.gov/biosample/?term=SAMN25810026 (-SAMN25810099)
https://www.ncbi.nlm.nih.gov/sra/?term=SRR17956399 (-SRR17956326)
