Skip to main content
Dryad

Data from: Multi-perspective predictive modeling for acute kidney injury in general hospital populations using electronic medical records

Cite this dataset

He, Jianqin et al. (2018). Data from: Multi-perspective predictive modeling for acute kidney injury in general hospital populations using electronic medical records [Dataset]. Dryad. https://doi.org/10.5061/dryad.qc90qb4

Abstract

Objective: Acute kidney injury (AKI) in hospitalized patients puts them at much higher risk for developing future health problems such as chronic kidney disease, stroke, and heart disease. Accurate AKI prediction would allow timely prevention and intervention. However, current AKI prediction researches pay less attention to model building strategies that meet complex clinical application scenario. This study aims to build and evaluate AKI prediction models from multiple perspectives that reflect different clinical applications. Material and Methods: A retrospective cohort of 76,957 encounters and relevant clinical variables were extracted from a tertiary care, academic hospital electronic medical record (EMR) system between November 2007 and December 2016. Five machine learning methods were used to build prediction models. Prediction tasks from four clinical perspectives with different modeling and evaluation strategies were designed to build and evaluate the models. Results: Experimental analysis of the AKI prediction models built from four different clinical perspectives suggest a realistic prediction performance in cross-validated AUC ranging from 0.720 to 0.764. Discussion: Results show that models built at admission is effective for predicting AKI events in the next day; models built using data with a fixed lead time to AKI onset is still effective in the dynamic clinical application scenario in which each patient’s lead time to AKI onset is different. Conclusion: To our best knowledge, this is the first systematic study to explore multiple clinical perspectives in building predictive models for AKI in the general inpatient population to reflect real performance in clinical application.

Usage notes