Skip to main content

Data from: Symbioses with nitrogen-fixing bacteria: nodulation and phylogenetic data across legume genera

Cite this dataset

Afkhami, Michelle E. et al. (2018). Data from: Symbioses with nitrogen-fixing bacteria: nodulation and phylogenetic data across legume genera [Dataset]. Dryad.


How species interactions shape global biodiversity and influence diversification is a central – but also data-hungry – question in evolutionary ecology. Microbially-based mutualisms are widespread and could cause diversification by ameliorating stress and thus allowing organisms to colonize and adapt to otherwise unsuitable habitats. Yet the role of these interactions in generating species diversity has received limited attention, especially across large taxonomic groups. In the massive angiosperm family Leguminosae, plants often associate with root-nodulating bacteria that ameliorate nutrient stress by fixing atmospheric nitrogen. These symbioses are ecologically-important interactions, influencing community assembly, diversity, and succession, contributing ~100-290 million tons of N annually to natural ecosystems, and enhancing growth of agronomically-important forage and crop plants worldwide. In recent work attempting to determine whether mutualism with N-fixing bacteria led to increased diversification across legumes, we were unable to definitively resolve the relationship between diversification and nodulation. We did, however, succeed in compiling a very large searchable, analysis-ready database of nodulation data for 749 legume genera (98% of Leguminosae genera; LPWG 2017), which, along with associated phylogenetic information, will provide a valuable resource for future work addressing this question and others. For each legume genus, we provide information about the species richness, frequency of nodulation, subfamily association, and topological correspondence with an additional data set of 100 phylogenetic trees curated for database compatibility. We found 386 legume genera were confirmed nodulators (i.e., all species examined for nodulation nodulated), 116 were non-nodulating, 4 were variable (i.e., containing both confirmed nodulators and confirmed non-nodulators), and 243 had not been examined for nodulation in published studies. Interestingly, data exploration revealed that nodulating legume genera are ~3× more species-rich than non-nodulating genera, but we did not find evidence that this difference in diversity was due to differences in net diversification rate. Our metadata file describes in more detail the structure of these data that provide a foundational resource for future work as more nodulation data become available, and as greater phylogenetic resolution of this ca. 19,500-species family comes into focus.

Usage notes


National Science Foundation, Award: IOS-1401840