Data and code from: Human activities and topographical heterogeneity shape Pinus L. species richness patterns in North America
Data files
May 19, 2026 version files 25.86 MB
-
code.R
1.66 KB
-
README.md
2.14 KB
-
Species_distribution_records.csv
24.52 MB
-
Species_richness_and_variable_values_in_each_grid_cell.csv
1.34 MB
Abstract
Understanding the distribution patterns of species richness and their underlying drivers is a fundamental issue in macroecology and biogeography. While climate, soil, topography, and human activities are widely recognized as key determinants, their interactive effects on species richness patterns remain underexplored, especially for ecologically and economically valuable Pinus species. Furthermore, the role of indirect effects among variables in shaping these patterns remains incompletely clarified. Here, we integrated distribution data for 60 North American Pinus species to investigate richness patterns across different range sizes. Using Structural Equation Modeling (SEM), we quantified the relative explanatory power of hypotheses related to environmental and anthropogenic factors. We also identified diversity hotspots using the complementary algorithm and the top 5% richness approach. Our results show that topographical heterogeneity is the primary direct correlative factor explaining richness patterns for overall species. Human activities are the dominant correlative factor for both overall and wide-ranged species: their total effect on overall species is primarily derived from indirect associations through modified environmental conditions, while they exert the strongest direct effect on wide-ranged species. The Janzen hypothesis better explains the richness patterns of narrow-ranged species, emphasizing the importance of climatic stability and habitat specificity. Additionally, the complementary algorithm outperformed the top 5% richness approach in identifying priority conservation areas by efficiently capturing all species. This study elucidates the divergent associative mechanisms shaping Pinus species richness across different range sizes, highlighting that human activities and topographical heterogeneity play context-dependent roles. These findings enhance our understanding of the multi-faceted mechanisms influencing Pinus species richness patterns and provide a targeted scientific basis for biodiversity conservation planning and adaptive management strategies under global climate change.
Description of the data and file structure
This dataset consists of two tables and an R code,one containing the distribution coordinates of each Pinus species, and the other recording the species richness and corresponding variable values of each grid cell. The R code provides the necessary code for statistical analysis conducted in this research.
File: code.R
Description: This R document provides the code involved in the statistical analysis conducted in this study.
File: Species_richness_and_variable_values_in_each_grid_cell.csv
Description: Species richness and corresponding variable values for each grid cell at a resolution of 50 × 50 km.
Variables
- Grid cells_id: Number of grid cells
- Overall species_SR: Species richness values for overall species
- Wide-ranged species_SR: Species richness values for wide-ranged species
- Narrow-ranged species_SR: Species richness values for narrow-ranged species
- MAT: Mean annual temperature (°C)
- Tmin: Min temperature of coldest month (°C)
- MTWQ: Mean temperatures of warmest quarter (°C)
- PET: Potential evapotranspiration (mm)
- MAP: Annual precipitation (mm)
- PDQ: Mean precipitation of driest quarter (mm)
- PWAQ: Mean precipitation of warmest quarter (mm)
- AET: Annual actual evapotranspiration (mm)
- TSN: Temperature seasonality
- PSN: Precipitation seasonality
- ELER: Elevational range (m)
- TA: Temperature absolute anomaly (°C)
- PA: Precipitation absolute anomaly (mm)
- BDOD: Bulk density (kg/dm³)
- CLAY: Clay content (%)
- SILT: Silt content (%)
- SAND: Sand content (%)
- CEC: Cation exchange capacity (cmolc/kg)
- SOC: Soil organic carbon content (g/kg)
- N: Total nitrogen (g/kg)
- pH: Soil pH
- HFP: The Human Footprint
File: Species_distribution_records.csv
Description: Coordinates of species distribution points.
Variables
- Species: Species name
- Longitude: Longitude (decimal degrees)
- Latitude: Latitude (decimal degrees)
