Pedology and plant provenance can improve predictions of species distributions of the Australian native flora: a calibrated and validated modelling exercise on 5,033 species
Data files
May 22, 2025 version files 1.08 GB
-
BDW_000_005_EV_N_P_AU_TRN_N_20230607.tif
40.10 MB
-
bio01.tif
3.68 MB
-
bio02.tif
1.64 MB
-
bio04.tif
13.50 MB
-
bio05.tif
3.98 MB
-
bio06.tif
3.92 MB
-
bio12.tif
7.65 MB
-
bio13.tif
3.44 MB
-
bio14.tif
1.65 MB
-
bio15.tif
3.14 MB
-
CEC_000_005_EV_N_P_AU_TRN_N_20220826.tif
30.03 MB
-
CLY_000_005_EV_N_P_AU_TRN_N_20210902.tif
31.63 MB
-
DES_000_200_EV_N_P_AU_TRN_C_20190901.tif
47.25 MB
-
NTO_000_005_EV_N_P_AU_NAT_C_20140801.tif
56.97 MB
-
PTO_000_005_EV_N_P_AU_NAT_C_20140801.tif
57.86 MB
-
README.md
2.32 KB
-
Relief_twi_3s.tif
43.46 MB
-
SI_Ecoregion.pdf
684.45 MB
-
SOC_000_005_EV_N_P_AU_TRN_N_20220727.tif
46.61 MB
Abstract
Species distribution models (SDMs) are valuable tools for assessing species' responses to environmental factors and identifying areas suitable for their survival. The careful selection of input variables is critical, as their interactions, and correlations with other environmental factors can affect model performance. This study evaluates the influence of climate and soil variables on SDMs’ performance for 5,033 Australian plant species, selected to represent the largest phylogenetic diversity of native terrestrial vascular flora. Using an ensemble of correlative models, we assessed the predictive performance of climate and soil variables, individually and in combination, across four distinct ecoregions: Desert (n = 640 species), Mediterranean (n = 1,246), Temperate (n = 1,936), and Tropical (n = 1,211). Our results demonstrate that on a continental scale, climate variables have a greater influence on plant distributions than soil variables. Although incorporating soil and climate variables enhanced model performance in some ecoregions, our results indicate that relying solely on small-scale variables such as soil may increase the likelihood of overfitting. In soil-only models, Clay content (CLY), Nitrogen Total Organic (NTO), and Soil Organic Carbon (SOC) were important across modelled species, with their relevance varying by ecoregion. Our findings have significant implications for understanding the interplay between climate, soil, and plant distribution within diverse ecoregions. By highlighting the crucial role of climate in large-scale models, this study serves as a foundation for developing more accurate predictions of plant distributions, ultimately improving model accuracy for biodiversity assessments.
Dataset DOI: 10.5061/dryad.9cnp5hqwn
File Descriptions
This dataset supports a species distribution modeling (SDM) study and contains 17 environmental predictor variables, including bioclimatic variables from the WorldClim database and soil variables from the Soil and Landscape Grid of Australia (SLGA). All raster files are provided in .tif (GeoTIFF) format and can be opened with standard GIS software such as QGIS or ArcGIS, or processed in R using packages like terra or raster. In addition, the dataset includes:
- A Supplementary Information (SI) PDF describing additional methodology and results.
- A README.md file that documents the file contents.
- R scripts used in the modeling workflow.
Bioclimatic Variables (bioXX.tif
)
bio01.tif
– Annual Mean Temperaturebio02.tif
– Annual Mean Diurnal Rangebio04.tif
– Temperature Seasonalitybio05.tif
– Maximum Temperature of Warmest Monthbio06.tif
– Minimum Temperature of Coldest Monthbio12.tif
– Annual Precipitationbio13.tif
– Precipitation of Wettest Monthbio14.tif
– Precipitation of Driest Monthbio15.tif
– Precipitation Seasonality
Soil Variables
BDW_000_005_EV_N_P_AU_TRN_N_20230607.tif
– Bulk DensityCEC_000_005_EV_N_P_AU_TRN_N_20220826.tif
– Cation Exchange CapacityCLY_000_005_EV_N_P_AU_TRN_N_20210902.tif
– Clay ContentDES_000_200_EV_N_P_AU_TRN_C_20190901.tif
– Depth of SoilSOC_000_005_EV_N_P_AU_TRN_N_20220727.tif
– Soil Organic CarbonNTO_000_005_EV_N_P_AU_NAT_C_20140801.tif
– Nitrogen Total OrganicPTO_000_005_EV_N_P_AU_NAT_C_20140801.tif
– Total PhosphorusRelief_twi_3s.tif
– Topographic Wetness Index (TWI)
Code/Software
Modeling and data processing were conducted using R version 4.3.0. The script includes steps for:
- Preprocessing environmental rasters
- Preparing occurrence and background data
- Calibrating individual SDMs
- Creating ensemble models
Key packages used in the workflow: terra
, raster
, sf
, MASS
, covsel
, dismo
, flexsdm
, biomod2