Data from: Wealth does not buy richness: Plant and soil bacterial diversity in temperate suburban lawns are not influenced by income
Data files
Apr 20, 2026 version files 1.53 MB
-
otu_table.csv
590.53 KB
-
plant_data.csv
10.35 KB
-
README.md
5.62 KB
-
site_data.csv
1.50 KB
-
taxonomy_table.csv
925.92 KB
Abstract
Patterns of biodiversity in urban and suburban areas have been increasingly linked to the behaviors and affluence of their residents. Residents of affluent neighborhoods tend to plant more flowering plant species and generally benefit from cooler temperatures and more tree cover. These factors, among others, lead to increased plant, bird, arthropod, and lizard biodiversity in affluent neighborhoods. This dataset pertains to an investigation of this pattern in plant and soil bacterial communities in suburban lawns in Raleigh, North Carolina. We surveyed the diversity of plants and soil bacteria across 30 homes, located in census block groups with median household incomes from $22,000 to $208,000. Overall, we detected 144 plant genera and 7,908 bacterial OTUs. Although bacterial richness increased with soil pH, and soil pH increased weakly with household income, household income was nevertheless a poor predictor of plant and soil bacterial diversity. The lack of correlation between income and plant or bacterial diversity in our humid, subtropical study area is consistent with a broader literature suggesting that effects of affluence on biodiversity are most pronounced in arid climates. This dataset protects the location of individual sampling locations for the privacy of residents, but includes income information at the level of each census block group. For each of the 30 sites, the dataset provides median income of the census block group from 2014 census data; the number of OTUs observed in the rarefied sample; calculated diversity metrics, including Shannon's H and evenness, soil pH, and the number of plant genera detected in 12 one-meter quadrats. OTU abundance per site is provided in a separate file, and taxonomic information about each OTU is in a third file.
Dataset DOI: 10.5061/dryad.t76hdr8cj
Description of the data and file structure
This dataset contains information required to replicate analyses of Sow et al. (2026), testing the hypothesis that residential yards in wealthier neighborhoods would have higher biodiversity of plants and soil bacteria. The dataset includes biodiversity data for plants and soil bacteria sampled from 30 residential yards in Raleigh, North Carolina. Complementary data includes raw sequence data, measured soil pH at each site, and neighborhood median income from the 2014 Census. All sampling in residential yards was conducted with permission of residents and property owners.
Files and variables
File: site_data.csv
Description: Core dataset including biodiversity, income, and soil data for each of the 30 sites included in the study. Each site is one residential yard.
Variables
- Site_name: Unique identifier for each site; site names are ordered by neighborhood income such that S1 is in the census block group with the lowest median income of those sampled, and S30 is in the census block group with the highest median income of those sampled.
- Median_income: Median household income of the census block group in which the study site was located, from the 2014 US Census.
- Richness: Number of bacterial OTUs detected, after samples from all sites were rarefied to a shared sample size of 106,251 sequences.
- Shannon_H: Shannon's biodiversity index, calculated from the rarefied OTU data
- Evenness: OTU evenness, calculated from the rarefied OTU data
- mean_pH: soil pH (mean of two technical replicates for one pooled soil sample per site)
- Plant_genera: number of plant genera detected in 12 one-meter quadrats per site.
File: plant_data.csv
Description: Occurrence matrix for all detected plant genera across the 30 study sites. Plant genera are rows, sites are columns, and each cell contains '1' if the plant was detected at the site and '0' if it was not.
Variables
- Plant genus: Plant genus
- S1: S1 to S30 are the unique identifiers for each site, corresponding to column 'Site_name' in file 'site_data.csv'. Each cell contains '0' if the plant genus on that row was not detected, and '1' if the plant genus on that row was detected, at that site.
- S2: See explanation for S1.
- S3: See explanation for S1.
- S4: See explanation for S1.
- S5: See explanation for S1.
- S6: See explanation for S1.
- S7: See explanation for S1.
- S8: See explanation for S1.
- S9: See explanation for S1.
- S10: See explanation for S1.
- S11: See explanation for S1.
- S12: See explanation for S1.
- S13: See explanation for S1.
- S14: See explanation for S1.
- S15: See explanation for S1.
- S16: See explanation for S1.
- S17: See explanation for S1.
- S18: See explanation for S1.
- S19: See explanation for S1.
- S20: See explanation for S1.
- S21: See explanation for S1.
- S22: See explanation for S1.
- S23: See explanation for S1.
- S24: See explanation for S1.
- S25: See explanation for S1.
- S26: See explanation for S1.
- S27: See explanation for S1.
- S28: See explanation for S1.
- S29: See explanation for S1.
- S30: See explanation for S1.
File: otu_table.csv
Description: Abundance matrix for all bacterial OTUs across the 30 study sites. OTUs are rows, sites are columns, and each cell contains the number of sequences for that OTU detected at that site, after all samples were rarefied to a shared size of 106,251 sequences.
Variables
- OTU_ID: Unique identifier for each OTU.
- S1: S1:S30 are the unique identifiers for each site, corresponding to column 'Site_name' in file 'site_data.csv'. Each cell contains the number of sequences for the OTU on that row, at that site.
- S2: See explanation for S1.
- S3: See explanation for S1.
- S4: See explanation for S1.
- S5: See explanation for S1.
- S6: See explanation for S1.
- S7: See explanation for S1.
- S8: See explanation for S1.
- S9: See explanation for S1.
- S10: See explanation for S1.
- S11: See explanation for S1.
- S12: See explanation for S1.
- S13: See explanation for S1.
- S14: See explanation for S1.
- S15: See explanation for S1.
- S16: See explanation for S1.
- S17: See explanation for S1.
- S18: See explanation for S1.
- S19: See explanation for S1.
- S20: See explanation for S1.
- S21: See explanation for S1.
- S22: See explanation for S1.
- S23: See explanation for S1.
- S24: See explanation for S1.
- S25: See explanation for S1.
- S26: See explanation for S1.
- S27: See explanation for S1.
- S28: See explanation for S1.
- S29: See explanation for S1.
- S30: See explanation for S1.
File: taxonomy_table.csv
Description: Taxonomic information for each OTU
Variables
- OTU_ID: Unique identifier for each OTU; corresponds to OTU_ID in file 'otu_table.csv'.
- Kingdom: name of taxonomic kingdom
- Phylum: name of taxonomic phylum
- Class: name of taxonomic class
- Order: name of taxonomic order
- Family: name of taxonomic family
- Genus: name of taxonomic genus
- Species: specific epithet
- Confidence: Confidence score for OTU taxonomic assignment from 0 (no confidence) to 1 (perfect match)## Code/software
Data can be viewed in any software that reads .csv files, such as R, Excel, etc.
Access information
Data was derived from the following sources:
- Census block group median incomes were sourced from the 2014 US Census, which is in the public domain.
