Estimating required sample sizes for gut content metabarcoding studies of dietary diversity in spiders
Data files
Apr 24, 2025 version files 970.77 MB
-
metadata.csv
4.42 KB
-
preyfamily.csv
39.02 KB
-
preyorder.csv
6.36 KB
-
preyspecies.csv
116.01 KB
-
README.md
1.22 KB
-
saturation_sequences.zip
970.61 MB
Abstract
Spiders are dominant predators in terrestrial ecosystems, regulating invertebrate biomass through predation. However, analysing predation patterns in the field is challenging, especially as spiders are extra-intestinal liquid feeders and often active at night. Gut content metabarcoding (GCM) provides a cost-effective method to study the prey composition of spiders. In this study, we determine the minimum sample size of spider specimens required to obtain representative estimates of local prey composition. Six common spider species were studied using GCM and we show that the number of analysed individuals per spider species influenced the saturation levels for prey at different taxonomic levels. To achieve 90% saturation at the prey species level, between 50 and over 150 individuals per spider species had to be analysed, which is challenging for ecological multi-site studies. Hence we argue for consideration of coarser taxonomic levels, such as family or order, which often provide sufficient ecological information to understand predator-prey interactions. At these levels, saturation can be achieved with much smaller sample sizes of 4 to 8 individuals per species, making ecological GCM studies more cost-effective. These results highlight the benefits of considering the taxonomic resolution in metabarcoding studies to address ecological research questions. We further provide information that will facilitate future investigations of predator-prey dynamics, not just in spiders, but also in other extra-intestinal liquid feeders like for example ground beetles.
https://doi.org/10.5061/dryad.3xsj3txpz
Data set contents:
Metadata:
file: metadata.csv
description: contains metadata of the samples
Variables
sample_id : unique sample identifier, (naming scheme corresponding only to collector, not any other quality of the sample)
type: sample type (spider or negative control)
species: species of the spider sample;
NA = not applicable to controls, as they contain no spider sample
Sequences:
file: saturation_sequences.zip
Description: This folder consists of 264 fastaq.gz for 132 samples of different spider species or control samples (2 files per sample, forward and reverse read each). The raw sequence data originates from Illumina sequencing in fastq format.
Prey species table
file: preyspecies.csv
Description: presence/absence data of prey species in every sample
Prey family table
file: preyfamily.csv
Description: presence/absence data of prey families in every sample
Prey order table
file: preyorder.csv
Description: presence/absence data of prey ordersin every sample
- Melcher, Anja Carina; Krehenwinkel, Henrik; Harms, Danilo; Birkhofer, Klaus (2025). Estimating Required Sample Sizes for Gut Content Metabarcoding Studies of Dietary Diversity in Spiders. Journal of Applied Entomology. https://doi.org/10.1111/jen.13433
