Data from: Associations between stream habitat and energetic condition in juvenile coho salmon (Oncorhynchus kisutch)
Data files
Mar 25, 2026 version files 139.92 KB
-
CumulativeModelData_DRYAD.csv
110.87 KB
-
README.md
5.79 KB
-
Temporal_analysis_data_DRYAD.csv
23.26 KB
Abstract
Spatiotemporal variation in habitat quality and quantity impacts individual fish energetics by influencing growth, energy storage, and survival, ultimately shaping population dynamics. However, few studies have explicitly linked individual energetic condition to variation in habitat across both space and time. We investigated the influence of freshwater habitat characteristics and season on both physiological (e.g., percent lipid, energy density) and morphological (e.g., relative condition factor) metrics of juvenile coho salmon energetic condition in interior British Columbia, Canada. Physiological metrics responded to spatial and temporal variation in habitat characteristics. Among sites, a higher percent lipid was associated with lower water temperature and higher energy density, and elevated stream nutrient concentrations. Across seasons, energetic condition declined over summer (July-September) and again over winter (September-April). Post-winter percent lipid and energy density values converged on proposed lower thresholds for survival. Collectively, this research shows the potential for using physiological energetic condition metrics as indicators of spatial and temporal variation in habitat quality. Quantifying how habitat changes affect juvenile salmon energetic condition could improve the evaluation of different habitat protection and restoration actions.
Dataset DOI: 10.5061/dryad.4f4qrfjsb
Description of the data and file structure
Files and variables
File: CumulativeModelData_DRYAD.csv
Description: This dataset contains the physiological, morphological, and habitat data used to evaluate spatial drivers of energetic condition in juvenile coho salmon (Oncorhynchus kisutch) across multiple sites in the North Thompson watershed (2019–2023).
Variables
(not all variables were used in the model, site variables were tested for collinearity etc..)
-
site: site name (not gazetted)
-
lat: latitude of site
-
long: longitude of site
-
date: date sampled
-
unique_id: unique identifier of individual fish
-
weight_g: mass of fish (grams)
-
final_age: age (from scale analysis, in years)
-
fork_length_mm: field fork length (to the nearest mm)
-
year: year sampled
-
fished_reach_m: reach length fished within each site (in m)
-
gradient: gradient of site
-
lwd_vol_m3: large wood debris volume in fished reach (in volume, m3)
-
lwd_pieces: large woody pieces, in fished reach
-
pool_riffle_ratio: ratio of pool to riffle in site reach
-
lwd_pieces_area: Number of large wood debris pieces divided by the total area (in metres) of the reach
-
site type: Classification of site (e.g. wetland, tributary)
-
density: conspecific density calculated.
-
ATU_2: accumulated thermal units (daily units summed over the 30 days prior to sample date)
-
days_of_data: days used to calculate accumulated thermal units.
-
TP_conc: Total phosphorus concentration (in ug/L)
-
elevation_ft: elevation of site (in feet)
-
mean_h2o: mean percentage moisture content in individual sample (average of two replicates)
-
mean_carbon: mean percentage carbon content in individual sample (average of two replicates)
-
std_dev_h2o: Standard deviation moisture (standard deviation between two replicates)
-
cv_h2o: coefficient of variation moisture (CV between two replicates)
-
st_dev_carbon: Standard deviation carbon (standard deviation between two replicates)
-
cv_carbon:coefficient of variation carbon (CV between two replicates)
-
h2o_pass_fail: (CV <20% across replicates), otherwise fail and was not used in data analysis
-
carbon_pass_fail: (CV <20% across replicates), otherwise fail and was not used in data analysis
-
thaw_wt: lab-measured fish weight (g)
-
lab_fork_length_mm: lab measured fork length (mm)
-
final_lipid_weight: final whole-body lipid weight
-
final_lipid_percent: final whole-body percent lipid (% gg -1 body weight)
-
cv_lipid: coefficient of variation percent lipid
-
lipid_pass_fail: (CV <20% across replicates), otherwise fail and was not used in data analysis
-
total_body_lipid: lipid mass (g) divided by total body mass (in grams)
-
condition_factor_K: Fulton's condition factor
-
Kn: Le Cren's condition factor K (see Supplementary for equation)
-
j_date: Julian Date of field collection
-
percentprotein: calculated percent protein (see Supplementary for equation) (% gg -1
weight)
-
ed: calculated energy density (see Supplementary for equation) (kJg -1 )
File: Temporal_analysis_data_DRYAD.csv
Description: This dataset includes the information required to evaluate seasonal changes in juvenile coho salmon energetic condition for Age-0+ and Age-1+ fish. Seasonal analyses test hypotheses about how condition changes between midsummer (July–September) and the following spring (April), and how these seasonal patterns differ among sites.
Variables
-
unique_id: unique fish ID
-
mean_h2o: mean % moisture between two replicates
-
mean_carbon: mean % carbon between two replicates
-
std_dev_h2o: standard deviation % moisture between two replicates
-
st_dev_carbon: standard deviation % carbon between two replicates
-
cv_h2o: coefficient of variation % moisture between two replicates
-
cv_carbon: coefficient of variation % carbon between two replicates
-
carbon_pass_fail: (CV <20% across replicates), otherwise fail
-
h2o_pass_fail: (CV <20% across replicates), otherwise fail
-
date: date sampled
-
weight_g: individual fish weight in grams
-
fork_length_mm: fish fork length in millimetres (field measured)
-
site: site name (not gazetted)
-
month: month fish was sampled
-
thaw_wt: mass whole body fish (in grams)
-
final_lipid_percent: whole-body percent lipid (% gg -1
body weight)
-
final_lipid_weight: lipid whole-body weight (gg -1
body weight)
-
lab_fork_length_mm: measured lab fork length, na if not measurable
-
cv_lipid: coefficient of variation % lipid
-
std_dev_lipid: standard deviation % lipid
-
lipid_pass_fail: Pass (CV <20% across replicates), otherwise fail and was not included in data analysis
-
per_protein:percent protein (% gg -1
body weight), (see Supplementary for equation)
-
ed: Energy density, (see Supplementary for equation) (kJg -1 )
-
Kn: le Cren's condition factor (see Supplementary for equation)
NA- not applicable. Missing or NA values in the dataset indicate either samples could not be measured (e.g. for lab fork length due to damage to fins following freezing), or an inadequate sample size/ large variation in replicate values meant value was not measured/excluded from dataset (e.g. carbon and moisture, percent protein and energy density.
Code/software
All data compilation and statistical analyses were performed using R 2023.12.1 (R Core Team, 2024), in RStudio (Posit team, 2024).
We performed modelling using glmmTMB, and model averaging using the top candidate models when there was no clear top model (weights <0.9) (Symonds & Moussalli, 2011, R package MuMin, Bartoń, 2024).
Spatial Model Data:
This dataset (Cumulative_Model.csv) contains the physiological, morphological, and habitat data used to evaluate spatial drivers of energetic condition in juvenile coho salmon (Oncorhynchus kisutch) across multiple sites in the North Thompson watershed (2019–2023). The data support analyses examining how local habitat characteristics influence percent lipid, protein, energy density, and relative body condition (Kn) in Age-0+ and Age-1+ juveniles. The dataset includes raw physiological measurements, derived condition metrics, site-level habitat covariates, and model outputs used in hierarchical spatial analyses.
Physiological measurements represent the whole-body composition of juvenile salmon and include lipid content, moisture, protein, carbon content, and derived energy density. Lipid content was quantified using a modified chloroform–methanol extraction suitable for small-bodied juveniles (0.76–14.88 g). Each individual was homogenized, and duplicate ~0.20 g subsamples were extracted. Whole-body lipid mass and percent lipid were calculated from extraction ratios scaled to total wet mass. Replicates with high variability (CV > 20%) were re-run, and averaged values were retained.
Moisture, protein, and carbon content were derived from duplicate ~0.30 g subsamples dehydrated at 80°C for 24 hours and combusted to ash at 500°C. Percent protein was estimated following Trudel et al. (2005) as the proportion of tissue not accounted for by water, lipid, and carbon. Total energy density (kJ g⁻¹ wet weight) was calculated using standard lipid and protein caloric conversion factors specific to coho salmon (Brett & Groves 1979; Trudel et al. 2005).
The dataset includes fork length and body mass measurements for each individual. These measurements were used to derive Le Cren’s relative condition factor (Kn), a mass-independent index of body condition that compares observed mass to expected mass for a given length. Expected mass was generated from population-specific allometric parameters (a, b) estimated from >5,000 individuals collected during long-term mark–recapture surveys (2019–2023). Kn > 1 indicates better-than-average condition for a given length, while Kn < 1 indicates poorer condition. Fork length measured in the field was used to avoid fin-damage issues affecting lab measurements.
Each fish is associated with site-level environmental variables used to explain spatial variation in condition. Variables include accumulated thermal units (ATUs), conspecific density (fish/m²), large wood counts, pool–riffle ratio, elevation, total phosphorus, and stream gradient. Juvenile coho salmon densities in tributary sites were estimated using a hierarchical Bayesian closed mark-recapture model (HBM) developed by Korman et al. (2016) that was based on Wyatt (2002). All covariates were standardized (mean-centered and scaled by one standard deviation). A Pearson correlation matrix was used to exclude highly collinear variables (|r| < 0.80 retained).
The dataset was fed into hierarchical Gaussian (energy density and K condition) and beta regression models (for percent lipid and protein) describing condition metrics as a function of individual traits (mass, age, and their interaction) and site-level habitat features. Models were fitted using R (2023.12.1) and RStudio.
The general model structure includes individual-level fixed effects (mass, age, mass × age), site-level random intercepts, allowing individuals from the same site to share similarities (e.g., genetic, environmental), and site-level predictors: environmental variables influencing the site-level intercept.
Model selection was based on AICc across all subsets of habitat covariates (global model dredged). When no single best model was identified (AICc weight < 0.9), model-averaged parameter estimates and unconditional confidence intervals were generated using the natural-average method. Relative variable importance was calculated from summed AICc weights for each variable across the model set. Predicted condition values and model performance metrics, including RMSPE (%), were calculated for each condition metric.
Across condition metrics, fish mass was consistently the strongest predictor of energetic condition—4× stronger than any habitat variable for percent lipid, and 6× stronger for both Kn and energy density. Age and mass showed clear interactive effects, with age-1+ fish approximately 25% heavier and exhibiting different allometric scaling relationships than age-0+. Habitat features exhibited more modest effects, and conspecific density did not significantly influence condition.
Seasonal Data
This dataset (Temporal_analysis_data.csv) includes the information required to evaluate seasonal changes in juvenile coho salmon energetic condition for Age-0+ and Age-1+ fish. Seasonal analyses test hypotheses about how conditions change between midsummer (July–September) and the following spring (April), and how these seasonal patterns differ among sites.
For each site, the dataset contains measurements of mass, percent lipid, percent protein, relative condition factor (Kn), and energy density (ED) collected in July, August, September, and April across multiple years. Each record includes the month of capture, allowing month-specific condition estimates and comparisons.
To quantify seasonal trends in condition, a separate regression model was built for each condition metric at each site. Mass, month, and their interaction were included as predictors to account for allometric changes in condition and the possibility that the mass–condition relationship differs across months.
Percent lipid and percent protein, which are proportional measures, were modeled using beta regression with a logit link. Energy density and Kn, which are continuous metrics, were modeled using Gaussian linear regression. Mass was centered prior to analysis to improve the interpretability of month-specific effects.
Months were treated as a categorical variable (July, August, September, April), enabling comparison of both the average condition in each month and the month-specific relationship between condition and mass. For each site and condition metric, the dataset includes:
- Month-specific mean condition for an average-mass individual (calculated from the model intercept and month coefficients).
- Month-specific allometric slopes represent strongly condition scales with body mass in each month.
- Interaction effects identifying whether the mass–condition relationship changes seasonally.
To assess statistical differences among months, pairwise comparisons were performed with Bonferroni correction to control family-wise error when testing multiple month combinations.
