Machaerium acutifolium complex morphometry
Data files
Dec 17, 2025 version files 169.10 KB
-
README.md
6.34 KB
-
Script_ecological_analysis.R
11.99 KB
-
Script_morphometric_analysis.R
21.29 KB
-
Supplemental_TableS1.csv
104.98 KB
-
Supplemental_TableS2.csv
24.49 KB
Abstract
The neotropical jacarandá species of the Machaerium acutifolium complex (Leguminosae, Papilionoideae) are among the most widely distributed in the genus, occurring in six countries of South America, where it displays great ecological and economic importance. Across its distribution in almost all Brazilian biomes, M. acutifolium presents an evident phenotypic plasticity, mainly in the shape and size of the leaflets, that led to the description of different taxonomic entities over the years. However, the significant morphological variation observed within the complex’s species prompts consideration for further studies. Here, we re-evaluate the taxonomic delimitation within the M. acutifolium species complex using morphometric and bioclimatic data. A nomenclatural review and morphological assessment of M. acutifolium and related species were carried out to support the PCA morphometric analyses of 23 vegetative and 27 reproductive characters, from a total of 99 specimens as well as eco-geographic features of 131 specimens.
Dataset DOI: 10.5061/dryad.ngf1vhj6h
Description of the data and file structure
We selected 99 specimens representing “Machaerium acutifolium”, from which 79 specimens are trees from Central, Southeast and Southern regions of Brazil, and 20 specimens are shrubs and/or trees growing in the Northeast region. In this sample, 51 are flowering specimens and 48 represent fruiting specimens. We assessed 23 continuous vegetative characters and 27 continuous reproductive characters taken from mature samples.
To explore the ecological variation of species in the climatic space, we used 131 accurately georeferenced specimens to extract bioclimatic variables from the WorldClim v.2.1 model layers database
Files and variables
File: Supplemental_TableS1.xlsx
Description: All continuous morphometric variables were measured in centimeters (cm), except for ratios, which are dimensionless, and leaflet areas, which are expressed in square centimeters (cm²).
A total of 23 continuous vegetative and 27 continuous reproductive characters were measured from 166 mature samples.
Measurement scheme
- Each continuous variable was measured three times per specimen (columns with suffix 1, 2, and 3, e.g., PL1, PL2, PL3).
- The suffix M indicates the mean value calculated from the three measurements (e.g., PLM).
- Some cells in the dataset contain NULL values. These values indicate that the information was not available at the time of data collection or could not be obtained for a given specimen (e.g., damaged, incomplete, or missing structures). NULL values do not indicate that the variable is not applicable; rather, they represent missing observations. No data were intentionally hidden.
Vegetative characters
- PLM – Petiole length (cm)
- RLM – Rachis length (cm)
- BPLM – Basal pulvinule length (cm)
- LBLM – Basal leaflet length (cm)
- WBLM – Basal leaflet width at the widest point (cm)
- DBLM – Distance from leaflet base to point of maximum width, basal leaflet (cm)
- RLWBM – Ratio of basal leaflet length to width (unitless)
- RBWM – Ratio of distance from leaflet base to maximum width, basal leaflet (unitless)
- ABLM – Basal leaflet area (cm²)
- MPLM – Middle leaflet pulvinule length (cm)
- LMLM – Middle leaflet length (cm)
- WMLM – Middle leaflet width at the widest point (cm)
- DMLM – Distance from leaflet base to point of maximum width, middle leaflet (cm)
- RLWM – Ratio of middle leaflet length to width (unitless)
- RMWM – Ratio of distance from leaflet base to maximum width, middle leaflet (unitless)
- AMLM – Middle leaflet area (cm²)
- TPLM – Terminal leaflet pulvinule length (cm)
- LTLM – Terminal leaflet length (cm)
- WTLM – Terminal leaflet width at the widest point (cm)
- DTLM – Distance from leaflet base to point of maximum width, terminal leaflet (cm)
- RLWTM – Ratio of terminal leaflet length to width (unitless)
- RTWM – Ratio of distance from leaflet base to maximum width to width, terminal leaflet (unitless)
- ATLM – Terminal leaflet area (cm²)
Reproductive characters
- BRL – Bracteole length (cm)
- BRW – Bracteole width (cm)
- CLL – Calyx length (cm)
- CLW – Calyx width (cm)
- SPL – Standard petal length (cm)
- SPW – Standard petal width (cm)
- WPL – Wing petal length (cm)
- WPW – Wing petal width (cm)
- KPL – Keel petal length (cm)
- KPW – Keel petal width (cm)
- SSL – Staminal sheath length (cm)
- SFL – Staminal filament length (cm)
- ATHL – Anther length (cm)
- ATW – Anther width (cm)
- NEL – Nectary length (cm)
- NEW – Nectary width (cm)
- GSL – Gynoecium stipe length (cm)
- OVL – Ovary length (cm)
- OVW – Ovary width at the widest point (cm)
- STL – Style length (cm)
- SCL – Seed chamber length (cm)
- SCW – Seed chamber width (cm)
- WFLM – Fruit wing length (cm)
- WFWM – Fruit wing width (cm)
- TFL – Total fruit length (cm)
- RSCW – Ratio of seed chamber length to wing length (unitless)
File: Supplemental_TableS2.xlsx
Description: To explore the ecological variation of species in the climatic space, we used 131 accurately georeferenced specimens to extract bioclimatic variables from the WorldClim v.2.1 model layers database. The file Supplemental_TableS2 was used in the analysis to retrieve and associate WorldClim bioclimatic variables with each sample. This spreadsheet was employed by the analysis code as a reference to link specimen locality information with the corresponding bioclimatic data extracted from WorldClim.
The use of this file ensures consistency between the occurrence data and the environmental variables used in subsequent analyses.
Variables
- Annual Precipitation (BIO12)
- Coefficient of Variation (BIO 15)
Code/software
Input files:
The script Script_morphometric_analysis.R, uploaded in the Dryad repository, uses the input file Supplemental_TableS1.csv, which is also available in the same Dryad deposit.
The script Script_ecological_analysis.R, uploaded in Dryad, uses the input file Supplemental_TableS2.csv, which is likewise available in the same Dryad repository.
All analyses were made using the R packages ade4 (Dray and Dufour 2007), factoextra (Kassambara and Mundt 2020), FactoMineR (Sebastien et al. 2008), tibble (Müller and Wickham 2023), tidyverse (Wickham et al. 2019), and vegan (Oksanen et al. 2022), while the graphics were plotted using functions from the packages ggplot2 (Villanueva and Chen 2019), cowplot (Wilke 2020), and viridis (Garnier et al. 2023).
To produce the ecological analysis and the distribution map, we used the R packages rnaturalearth (Massicotte and South 2023), rnaturalearthdata (South 2017), sf (Pebesma 2018), and the graphics using functions from the packages ggplot2 (Villanueva and Chen 2019), cowplot (Wilke 2020), viridis (Garnier et al. 2023), ggspatial (Dunnington 2023), ggmap (Kahle and Wickham 2013).
