Skip to main content

Richness (species numbers) of butterflies in the Iberian Peninsula on a 50 x 50 km UTM grid

Cite this dataset

García-Barros, Enrique (2022). Richness (species numbers) of butterflies in the Iberian Peninsula on a 50 x 50 km UTM grid [Dataset]. Dryad.


This file contains the on richness (species numbers) of butterflies from the Iberian Peninsula as they were used to model richness on contemporary climatic and non-climatic variables, then to forecast richness values to future climate scenarios. This is a text *.csv file.


This data set was extracted fro the authors data set "ATLAMAR", not published in this version. The data derive those used for a distribution atlas (García-Barros et al., 2004), updated by the authors to 2019. The data cover the presence of ca. 230 species of Papilionidae, Hesperiidae, Pieridae, Lycaenidae, Riodinidae and Nymphalidae in the continental territories of Spain and Portugal. It contains over 400.000 records. For estimates of completeness, one "record" was estimated as the unique combination of species, location, coordinates, date and observer/collector.

The columns are: CELL (arbitrary cell number), LONdegrees (Longitude in decimal degrees of the MGRS cell centroid), LATdegrees (Latitude, same conditions as for the Longitude), Sobs (number of species presently recorded for that cell or square), SestC (richness as determined by accumulation curves), Sest (estimated richness as used as data input), FAIR (cells fitting the minimum of criteria required to fit the accumulation curve),WEIGHT (weight given to the cases to opertae using weighted regression), SELECTION_COMP>80% (cells with values of completeness equal or higher than 80%, to be eventually selected as the training set of cases), VALIDATIONset (cells with high  [>97%] value of Completeness, to be used as teh validation subset), COMPLETENESS% (percentage of cell completeness calculated as the ratio of observed/predicted richness as determined by the accumulation curves). The remaining are the standardised values of the explanatory variables selected in any of the models: Lon (Longitude), Lat (Latitude), DCOAST (minimum distance from the cell centroid to the coast), AREA (area of ecah cell actually occupied by land), TMINavg (average minimum temperature from WorldClim), TMINabs (absolute minimum temperature from WorldClim), TMAXavg (average maximum temperature from WorldClim), TMAXabs (maximum temperature from WorldClim), Tavg (average yearly temperature from WorldClim), Trange (temperature range, max-min from WorldClim), Pavg (mean yearly precipitation from WorlClim), Psummer (precipitation during the warmest quarter from WorldClim), Pran (range of precipitation in one year, max-min from WorldClim).

Usage notes

Missing values are coded as "NA". This apply only to the richness values estimated from the accumulation curves, for some cells that did not met the minimum criteria to allow this calculation in a reliable manner. For such cells, predicted values (present) and forecasts (future) were calculated.


JJ CC Castilla - La Mancha (Spain), Award: SBPLY/17/180501/000492

MINECO (Spain), Award: CGL2017-86926-P