Data and code from: Population assessment and long-term analysis of the endangered Comal Springs riffle beetle, Heterelmis comalensis
Data files
May 22, 2026 version files 475.53 KB
-
Comal-Springs-riffle-beetle-long-term-data_2025-11-05.csv
339.98 KB
-
cs_fs_df.csv
17.40 KB
-
csrb_custom_functions.R
12.19 KB
-
csrb_fall_2024.csv
1.91 KB
-
csrb_pop_df_JAGS.csv
21.75 KB
-
Pintar-Sullivan-CSRB-long-term-analysis.R
15.93 KB
-
R_Workflow_Local_Springflow_Covariate_Derivation_and_Relative_Abundance_Modeling.R
60.88 KB
-
README.md
5.49 KB
Abstract
This dataset accompanies the manuscript titled "Population assessment and long-term analysis of the endangered Comal Springs riffle beetle, Heterelmis comalensis" in the journal Insect Conservation and Diversity. Data files and code are associated with two components of this manuscript and include data on a second, co-occurring species, Microcylloepus pusillus. 1) A 1-year population study during five one-month sampling periods over the years 2023–2024 at 80 sites across the Comal Spring system (Comal County, Texas, United States). 2) A long-term dataset of all abundance data of the species collected with standardized methods from May 2004 through November 2025 in the Comal Springs system.
Description of data and code
csrb_pop_df_JAGS.csv
This is a CSV file of data used to perform final analyses in the population study
Column names (variables):
Date: date that the lure was retrieved (YYYY-MM-DD)Site: site number corresponding to StationStation: location at which local springflow (Station_Q) was measured (corresponds to locations listed on Figure S24)Sample.period: ordinal sampling period during the population study corresponding to periods listed in Table S1Sample: ordinal number assigned to each lure within each sampling period and localityHeterelmis.adults: number of Heterelmis comalensis adults on each lureHeterelmis.larvae: number of Heterelmis comalensis larvae on each lureMicrocylloepus.adults: number of Microcylloepus pusillus adults on each lureMicrocylloepus.larvae: number of Microcylloepus pusillus larvae on each lureBiofilm.total: estimated total coverage of biofilm on the lure (percent)Station_Q: measured local springflow at each station (cubic feet per second)Station_Q_30D: estimated 30-day local springflow average at each Station (cubic feet per second)
cs_fs_df.csv
This is a CSV file of springflow measurements used to generate the local springflow covariate
Column names (variables):
Date: date that the measurement was taken (YYYY-MM-DD)Station: location at which local springflow (Q) was measured (corresponds to locations listed on Figure S24)Station_No: number for each StationQ: measured springflow at each station (cubic feet per second)Unit: device name used to measure springflow
csrb_fall_2024.csv
This is a CSV file of the fall 2024 biomonitoring data used to illustrate the framework for the biological monitoring application in Appendix C.
Column names (variables):
Date: date that the lure was retrieved (YYYY-MM-DD)Site: site number corresponding to StationStation: location at which local springflow (Station_Q) was measured (corresponds to locations listed on Figure S24)Heterelmis.adults: number of Heterelmis comalensis adults on each lureBiofilm.total: estimated total coverage of biofilm on the lure (percent)Station_Q: measured local springflow at each station (cubic feet per second)Station_Q_30D: estimated 30-day local springflow average at each Station (cubic feet per second)
csrb_custom_functions.R
This is an R file with the code to generate custom functions for the population analysis.
R_Workflow_Local_Springflow_Covariate_Derivation_and_Relative_Abundance_Modeling.R
This is an R file with the code to run the analyses and generate the figures for the population analysis.
Comal-Springs-riffle-beetle-long-term-data_2025-11-05.csv
This is the full long-term dataset CSV file with all samples collected from 2004 to 2025 and associated covariates used in analyses.
Column names (variables):
N: continuous number (unique identifier) for each sampleSiteID: identifier for each individual site that was monitoredLocality: name of the locality in which a lure was setSector: unique identifier for spatial groupings of sites within each localityMethod: method by which each site was sampledrag: cotton rag used from 2004 to 2013lure: cotton lure used from 2012 to 2025
Date.start: date that the lure was set in a spring (if known) (YYYY-MM-DD)Date.end: date that the lure was removed from the spring (YYYY-MM-DD)Day.of.year: ordinal day of the year (1 = 1 January)Days.set: number of days that the lure was set in the systemDays.set.est: number of days that the lure was set in the system, with number of days estimated for sites whereDate.startwas not knownSample.project: name of the project for which lures were setLure.condition: final condition of the lure upon retrievaldry: lure was above waterexposed: lure was exposed to lightgood: lure was in good conditionlost: lure could not be foundmoved: lure was otherwise out of its original location
Disturbed: whether or not (yes, no) a lure was disturbed in the place it was setHeterelmis.larvae: number of Heterelmis comalensis larvae on each lureHeterelmis.adults: number of Heterelmis comalensis adults on each lureMicrocylloepus.larvae: number of Microcylloepus pusillus larvae on each lureMicrocylloepus.adults: number of Microcylloepus pusillus adults on each lureTemp180DayAv: average air temperature (degrees Celsius) over the previous 180 days from the date each lure was retrievedPrecip30Day: total precipitation (millimeters) over the previous 30 days from the date each lure was retrievedDis180Dav: average total system discharge (cubic feet per second) for Comal Springs over the previous 180 days from the date each lure was retrievedyear: ordinal year in which each lure was retrievedmonth: ordinal month of the year in which each lure was retrievedday: ordinal day of the month in which each lure was retrievedLocal_30Day_SF_Avg: average local springflow for each sector (cubic feet per second)
Pintar-Sullivan-CSRB-long-term-analysis.R
This is an R file with the code to run the analyses and generate the figures with the long-term dataset.
