Sandel, Brody et al. (2021), Predicting intraspecific trait variation among California’s grasses, Dryad, Dataset,


1. Plant species can show considerable morphological and functional variation along environmental gradients. This intraspecific trait variation (ITV) can have important consequences for community assembly, biotic interactions, ecosystem functions and responses to global change. However, directly measuring ITV across many species and wide geographic areas is often infeasible. Thus, a method to predict spatial variation in a species’ functional traits could be valuable.

2. We measured specific leaf area (SLA), height and leaf area (LA) of grasses across California, covering 59 species at 230 sampling locations. We asked how these traits change along climate gradients within each species and used machine learning to predict local trait values for any species at any location based on phylogenetic position, local climate and that species’ mean traits. We then examined how much these local predictions alter patterns of assemblage-level trait variation across the state.

3. Most species exhibited higher SLA and grew taller at higher temperatures and produced larger leaves in drier conditions. The random forests predicted spatial variation in functional traits very accurately, with correlations up to 0.97. Because trait records were spatially biased towards warmer areas, and these areas tend to have higher SLA individuals within each species, species means of SLA were upwardly biased. As a result, using species means over-estimates SLA in the cooler regions of the state. Our results also suggest that height may be substantially under-predicted in the warmest areas.

4. Synthesis: Using only species mean traits to characterize the functional composition of communities risks introducing substantial error into trait-based estimates of ecosystem properties including decomposition rates or NPP. The high performance of random forests in predicting local trait values provides a way forward for estimating high-resolution patterns of ITV without a massive data collection effort.


These data were collected by the coauthors, working at sites across California. The dataset also includes some measurements from databases or published sources.

Usage Notes

The file includes 7 columns and 1860 rows, not including the header. The columns are:

Species: Name according to the Jepson Manual
Longitude: Longitude coordinate of the record
Latitude: Latitude coordinate of the record
SLA: Specific leaf area (m2/g)
Height: Height (cm)
Area: Area of a single leaf (cm2)
Dataset: Source for the record
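
The column layout above can be checked and summarised with a short script. The following is a minimal Python sketch, not part of the dataset itself: it assumes the file is comma-separated with a header row using exactly the seven column names listed, and that missing trait values appear as blank or non-numeric cells (such as "NA"). Neither assumption is confirmed by this record. The function names and the file name in the usage comment are hypothetical.

```python
import csv
import io
import math
from collections import defaultdict
from statistics import mean

# Column names as listed in the Usage Notes.
EXPECTED_COLUMNS = ["Species", "Longitude", "Latitude",
                    "SLA", "Height", "Area", "Dataset"]


def read_records(handle):
    """Read trait records from an open CSV handle and check the header."""
    reader = csv.DictReader(handle)
    header = reader.fieldnames or []
    missing = [name for name in EXPECTED_COLUMNS if name not in header]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    return list(reader)


def to_float(text):
    """Convert a cell to float; return None for blank, non-numeric or NaN cells.

    Assumption: missing values are encoded this way in the file.
    """
    try:
        value = float(text)
    except (TypeError, ValueError):
        return None
    return None if math.isnan(value) else value


def species_means(records, trait):
    """Average one trait column (e.g. "SLA") per species, skipping missing cells."""
    values = defaultdict(list)
    for row in records:
        value = to_float(row[trait])
        if value is not None:
            values[row["Species"]].append(value)
    return {species: mean(vals) for species, vals in values.items()}


# Example usage (hypothetical file name):
# with open("grass_traits.csv", newline="", encoding="utf-8") as handle:
#     records = read_records(handle)
# print(len(records))                      # 1860 expected per the Usage Notes
# print(species_means(records, "SLA"))
```

Species means computed this way are the quantity the abstract cautions about: because records are concentrated in warmer areas, a plain per-species average of SLA is likely to be biased upward relative to cooler parts of the state.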