Dryad

Data from: Spatial modeling of sociodemographic risk for COVID-19 mortality

Data files

Jul 21, 2023 version files 181.15 MB
Feb 28, 2024 version files 182.18 MB
Sep 12, 2024 version files 133.38 MB

Abstract

Background: In early 2020, Coronavirus Disease 2019 (COVID-19) spread rapidly across the United States (US), exhibiting significant geographic variability. While several studies have examined how various factors predict COVID-19 outcomes, few have looked at the spatiotemporal variation of COVID-19 deaths at refined geographic scales.

Methods: This analysis examines the spatiotemporal variation in COVID-19 deaths with respect to socioeconomic, health, demographic, and political factors. We apply multivariate regression to Health and Human Services (HHS) regions and fit nationwide county-level geographically weighted random forest (GWRF) models. Analyses were performed on data from three separate time frames that correspond to the spread of distinct viral variants in the US: pandemic onset until May 2021, May 2021 through November 2021, and December 2021 until April 2022. Spatial autocorrelation was additionally examined using local and global Moran’s I test statistics.
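
To illustrate the spatial autocorrelation statistics named in the Methods, the following is a minimal sketch of global and local Moran's I using NumPy. It is not the authors' implementation, which this record does not describe. The function names, the four-unit toy adjacency matrix, and the choice of row-standardized weights are all assumptions made for illustration, and no significance testing (for example, permutation tests) is included.

```python
import numpy as np


def row_standardize(w):
    """Scale each row of a spatial weights matrix to sum to 1 (assumed weighting scheme)."""
    w = np.asarray(w, dtype=float)
    row_sums = w.sum(axis=1, keepdims=True)
    # Units with no neighbors keep an all-zero row instead of dividing by zero.
    return np.divide(w, row_sums, out=np.zeros_like(w), where=row_sums > 0)


def global_morans_i(values, w):
    """Global Moran's I: (n / S0) * (z' W z) / (z' z), with z the mean-centered values."""
    x = np.asarray(values, dtype=float)
    z = x - x.mean()
    s0 = w.sum()
    return (len(x) / s0) * (z @ w @ z) / (z @ z)


def local_morans_i(values, w):
    """Local Moran's I per unit: (z_i / m2) * sum_j w_ij z_j, with m2 = z'z / n."""
    x = np.asarray(values, dtype=float)
    z = x - x.mean()
    m2 = (z @ z) / len(x)
    return (z / m2) * (w @ z)


# Hypothetical example: four areal units arranged in a chain (0-1-2-3).
adjacency = np.array([
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
])
weights = row_standardize(adjacency)

clustered = [1.0, 1.0, 5.0, 5.0]    # similar values sit next to each other
alternating = [1.0, 5.0, 1.0, 5.0]  # dissimilar values sit next to each other

i_clustered = global_morans_i(clustered, weights)
i_alternating = global_morans_i(alternating, weights)
local_clustered = local_morans_i(clustered, weights)
```

In this sketch, a positive global value indicates that similar values neighbor each other, while the local values show which units contribute to that clustering. A unit with a positive local value and an above-average observation corresponds to a hot spot, and one with a positive local value and a below-average observation corresponds to a cold spot.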

Results: Multivariate regression results for all regions across the three time windows suggest that existing measures of social vulnerability for disaster preparedness (SVI) are predictive of higher COVID-19 mortality. In comparison, GWRF models provide a more robust evaluation of feature importance and prediction, exposing the predictive value of local features, such as obesity, that is obscured by coarse-grained analysis. Spatial autocorrelation indicates positive spatial clustering, with a progression from positively clustered low deaths for liberal counties (cold spots) to positively clustered high deaths for conservative counties (hot spots).

Conclusion: GWRF results indicate that a more nuanced modeling strategy is more useful for determining spatial variation than regional modeling approaches, which may not capture feature clustering along border areas. Spatially explicit modeling approaches, such as GWRF, provide a more robust feature importance assessment of sociodemographic risk factors in predicting COVID-19 mortality.