Data from: Fruitbody and root data infer different environmental niches for ectomycorrhizal fungi
Data files
May 27, 2026 version files 5.19 MB
-
66_species_info.csv
3.02 KB
-
bio1.tif
729.76 KB
-
bio12.tif
1.61 MB
-
bio3.tif
957.57 KB
-
nitrogen.tif
871.09 KB
-
ocs.tif
563.94 KB
-
ph.tif
446.55 KB
-
README.md
3.02 KB
Abstract
Ectomycorrhizal (ECM) fungi play a vital role in temperate and boreal forests where they are the dominant type of mycorrhizal fungi. Characterising the environmental niche (EN) of ECM fungi can help understand their distribution and predict their response to environmental change. Typically, occurrence records of fruitbodies from field observations or reproductive structures archived in collections are used to model species distributions. However, our understanding of ECM fungal distributions is hindered by species with cryptic life cycles or fruiting patterns and can be greatly improved using observations based on DNA sequences obtained from ECM roots or from soil. We assessed how well a single data source could predict the niche of ECM fungi for species with conspicuous and inconspicuous fruitbodies. We used fruitbody and root data from 66 common ECM fungal species in same geographic extent in dominant forests of Europe, classified into conspicuous and inconspicuous species. The fruitbody data were extracted from public databases, and the root data from soil sampling of 136 European forest monitoring plots. We estimated the niches for combined data sources (fruitbody and root data) and for each individual data source using six key environmental variables for ECM fungal composition. We then examined how estimated niche overlap and area (number of grid cells within niche) varied for the two data sources between conspicuous and inconspicuous species. We found that although the niches estimated using combined data from the two data sources had high overlap with the niches estimated from fruitbody data for conspicuous fungi, the niches estimated from fruitbody data had low or medium overlap with the niches estimated using root data for most species. The overlap between the two data sources for conspicuous species was significantly larger than that for inconspicuous species. Root data were important for estimating the niche of inconspicuous species, which had a high ratio of root data to fruitbody data. These results can guide future sampling and conservation of fungi.
Ectomycorrhizal fungal data and Environmental variables
We have submitted the name of studied ectomycorrhizal (ECM) fungi (66_species_info.csv), environmental variables (bio1.tif, bio3.tif, bio12.tif, nitrogen.tif, ocs.tif, ph.tif) and R script (species_environmental_niche.R).
Descriptions:
-
ECM fungal data
ECM fungal data includes 66 ECM fungal species names, classification, and current status in the Red List. The coordinates of these species cannot be obtained in the following ways.
-
66_species_info.csv:
Column 1: Species - Scientific name of ECM fungal species (as used in GBIF/PlutoF).
Column 2: Group - ECM fungal fruitbody visibility. Categories include: 1) 'conspicuous', which means ECM fungi produce above-ground mushrooms; 2) 'inconspicuous', which means ECM fungi produce below-ground truffles or crusts.
Column 3: Red List Status - Living status of species in the Red List. Categories include: 1) 'No describe', which means the species cannot be found in the Red List; 2) 'Least Concern', which means the species are widespread, abundant, and face no immediate, severe threats to their survival.
-
Fruitbody data with coordinates can be downloaded from GBIF and PlutoF by using the provided species names. The GBIF data were cleaned following recommendations from Zizka et al. (2019, 2020). The downloaded PlutoF data were cleaned by removing records without coordinate information, collected before 1971, or collected outside Europe.
-
Root data is obtained from van der Linde's study (van der Linde et al., 2018), which was collected from 136 ICP Forests long-term monitoring plots. It contains species coordinates (longitude and latitude), classification information,n, and collected environmental sampling data (i., host trees, throughfall nitrogen deposition, and mean annual temperature).
-
-
Environmental variables
Bioclimatic variables (current) were obtained from the CHELSA dataset (version 2.1) and resampled to the same spatial properties as soil variables.
- bio1.tif: Mean Annual Near-Surface Air Temperature (℃).
- bio3.tif: Isothermality (℃).
- bio12.tif: Annual Precipitation (kg m-2 month-1).
- nitrogen.tif: Soil total nitrogen (g/kg) at 5–15 cm depth (original resolution: 250 m in SoilGrids, resampled to 2.5 arc-minutes).
- ocs.tif: Soil organic carbon stock (kg/m²). Data were downloaded from an earlier version of SoilGrids, where a 5–15 cm depth layer was available. This dataset was retained to ensure consistency with other soil variables used in the analysis.
- ph.tif: Soil pH (H₂O) at 5–15 cm depth (original resolution: 250 m in SoilGrids, resampled to 2.5 arc-minutes).
Code/Software
RStudio is required to run species_environmental_niche.R; the script was created using version 4.1.2.
Note: Soil organic carbon stock (ocs) only has data at soil depth 0-30cm in SoilGrids now.
We used fungal records obtained from fruitbody and root data to estimate the environmental niche of 66 common ECM fungal species in European forests, classified into conspicuous and inconspicuous species based on the visibility of their fruitbodies, which may significantly affect both niche and distribution inferences for these fungi. The fruitbody data were extracted from public databases (GBIF, UNITE) while the root data (from individual ectomycorrhizas) were obtained from a dataset of 136 ICP Forests long-term monitoring plots. We estimated the niches for fruitbody data, root data, and their combined data separately using six key environmental variables known to affect ECM fungal community composition: mean annual air temperature, isothermality, annual precipitation amount, soil total nitrogen, soil pH, and soil organic carbon stock. We then examined how estimated niche overlap and area (number of grid cells within niche) varied for the two data sources between conspicuous and inconspicuous species.
All analyses are finished in R Studio.
- The 'CoordinateCleaner' R package** was used for GBIF data cleaning.
- The 'usdm' R package was used to test the multicollinearity of selected environmental variables.
- The 'ecospat' R package was used to calculate species environmental niche, including niche area, niche density, and niche overlap.
