Lake Victoria region block sub-county level risk data
Data files
Sep 10, 2025 version files 63.82 KB
-
README.md
14.01 KB
-
schri_all.csv
49.81 KB
Abstract
The health risks of climate change need to be identified to inform the prioritization of adaptation efforts. This is particularly true within low- and middle-income countries (LMICs) with limited resources, heterogenous climates, and varying degrees of social vulnerability. In Kenya, diarrheal disease is one of the leading causes of death and identifying risk factors of diarrheal disease is critical. This research aims to characterize factors associated with a high risk of diarrheal disease in western Kenya by developing a risk index based on the Intergovernmental Panel on Climate Change (IPCC) risk framework. We developed a conceptual model of risk factors based on prior research with risk factors grouped into the four components of the IPCC risk framework: hazard, exposure, and vulnerability (which is comprised of sensitivity and adaptive capacity). We obtained 30 data elements corresponding to the four components for 99 sub-counties in 14 western Kenya counties. We conducted principal component analysis (PCA) to develop a risk index for diarrheal disease. Our risk index aligns with epidemiological literature, including precipitation, temperature, water sanitation and hygiene (WASH), sensitive populations, education, poverty, and health facilities. Within counties, we found that the risk varied substantially, and a geographic cluster of high-risk sub-counties was identified. Our findings should be useful for policymakers and health officials in Kenya to prioritize efforts to prepare communities for health impacts of climate change. The process may be useful for standardizing approaches to assessing the risk of climate-sensitive health outcomes.
https://doi.org/10.5061/dryad.crjdfn3dj
Description of the data and file structure
In this research we aim to estimate a risk of diarrheal disease on a subnational scale in western Kenya. Following a search of publicly available data we were able to obtain 30 variables on the county or sub-county level. Weather data such as average, and extreme precipitation and temperature, were obtained from the Kenya Meteorological Department (KMD). The average monthly maximum temperature, minimum temperature, and total precipitation were obtained for 2010 to 2022 on a daily scale and averaged by month. Data were not obtained for years before 2010 because changes in county boundaries that occurred between 2009 and 2010. Climate variability was measured as the average standard deviation of the monthly maximum temperature, minimum temperature, and total precipitation from 2010 to 2022. Extreme events were measured as the frequency of days over the 95th percentile of precipitation, maximum temperature, and minimum temperature per month from 2014 to 2022, to match the time of adaptive capacity and sensitivity data sources. Adaptive capacity, sensitivity and exposure variables were abstracted from census data from the Kenya National Bureau of Statistics (KNBS), Kenya Ministry of Health, Food and Agriculture Organization, National Imagery and Mapping Agency of the US, and the peer reviewed literature.
Files and variables
File: schri_all.csv
Description: CSV data set of variables used to calculate risk of diarrheal disease on a sub-county level in the Lake Victoria Region Block of Kenya
Sub County Level Data:
Variable | Description |
---|---|
County | County |
Sub_County | Sub_county within the county |
Total_Population | Total number of people in the sub county |
Rural_Population | Number of people living in rural areas |
Urban_Population | Number of people living in urban areas |
Child_Population | Number of children under the age of 18 |
Elderly_Population | Number of elderly people, over the age of 65 |
Housing_Type_permanent | Number of households with permanent housing – houses which are built with bricks and has iron roofs |
Housing_Type__non-permanent | Number of households with non-permanent housing – houses are built with either mud walls or grass thatched roof |
Education_Level___Primary | Number of adults with primary education |
Education_Level___Secondary | Number of adults with secondary education |
Education_Level___tvet | Number of adults with technical and vocational education and training (TVET) education |
Education_Level___University | Number of adults with university education |
Education_Level___adult_basic | Number of adults with adult basic education |
Education_Level___madrasa_duksi | Number of adults with madrasa duksi education |
Literacy_Rate_litrate_ | Percentage of the population over 18 that can read and write |
Improved_Sanitation_Facility | Number of households with access to improved sanitation |
Electricity | Number of households with electricity |
Average_Household_size | Average size of a household |
Improved_Drinking_Water_Source | Number of households with access to improved drinking water |
Population_female | Number of females |
Population_Density_No_per_Sq_ | Population per land area |
Number_of_Households | Number of households |
Land_Area_sq_km_ | Land area in the sub county per square kilometer |
Sex_Ratio_No_of_Males_per_100 | Number of males per 100 females |
Dispensary | Number of health facilities classified as a dispensary |
Health_Centre | Number of health facilities classified as a health centre |
Hospitals | Number of health facilities classified as a hospital |
Medical _Center | Number of health facilities classified as a medical centre |
Medical_Clinic | Number of health facilities classified as a medical clinic |
Nursing_Home | Number of health facilities classified as a nursing home |
Stand_Alone | Number of health facilities classified as a stand alone |
Primary_Health | Number of health facilities classified as primary health |
NoHealthFacilities | 1 represents that there are no health facilities in that sub county |
Total_bedsandcots | Number of cots and beds in each sub county at any type of health facility |
Children | Proportion of the total population that is a child |
Elderly | Proportion of the total population that is elderly |
Female | Proportion of the total population that is female |
Rural | Proportion of the total population that is rural |
Urban | Proportion of the total population that is urban |
Adult_Population | Count of the adult population, total pop minus child pop and elderly pop |
Adult | Proportion of the total population that is an adult |
Primary_Ed | Proportion of the adult population with a primary education |
Secondary_Ed | Proportion of the adult population with a secondary education |
Univ_Ed | Proportion of the adult population with a university education |
Adult_Ed | Proportion of the adult population with an adult basic education |
Madrasa_duksi_Ed | Proportion of the adult population with a madrasa Duksi education |
TVET_Ed | Proportion of the adult population with a TVET education |
HDX | Whether or not this sub-county is present in the HDX shapefile, used for mapping and spatial data |
Distance_to_Urban_Center | Average distance in minutes to an urban center in the subcounty |
Number_of_rivers | Number of distinct river segments in the sub-county |
Number_of_floodplains | Number of flood plains in the sub-county |
TP_informal | Number of people living in informal settlements |
Pop_InformalSettlements | Proportion of the total population living in informal settlements |
County Level Only Data:
Variable | Description |
---|---|
Poverty | Following up with SWAP about this |
StuntingRate | Rate, following up with SWAP about this |
Infant_MR | Probability of a child born dying before age 1 per 1000 live births |
Under_5_MR | Probability of a child dying before age 5 per 1000 live births |
AdultMR_Male | Probability of dying between age 15 and 60 per 1000 population for males |
AdultMR_Female | Probability of dying between age 15 and 60 per 1000 population for females |
ElderlyMR_Male | Probability of dying after age 60 per 1000 population for males |
Elderly_MR_Female | Probability of dying after age 60 per 1000 population for females |
CIDP_Total_Score | Total score from CIDP evaluation |
CIDP_Group | Low, medium or high connection groups based on CIDP total |
Density_of_doctors_nurses_and_clinicalofficers | Number of all health professionals per 10,000 people |
Density_of_doctors | Number of doctors per 10,000 |
Density_of_nurses | Number of nurses per 10,000 |
Density_of_clinicalofficers | Number of clinical officers per 10,000 |
Weather Data:
Variable | Description |
---|---|
Std_pre | Standard deviation in precipitation in mm |
std_tmax | Standard deviation in max temperature |
std_tmin | Standard deviation in minimum temperature |
averagetempdiff | Average daily difference in minimum and maximum temperature |
avg_pre | Average daily precipitation in mm |
avg_Tmax | Average daily maximum temperature in C |
avg_Tmin | Average daily minimum temperature in C |
freq_eh | Average number of extreme heat days |
freq_ec | Average number of extreme cold days |
freq_er | Average number of extreme rain days |
** **
Code/software
SAS Studio
Access information
Other publicly accessible locations of the data:
- NA
Data was derived from the following sources:
- Kenya National Bureau of Statistics
- Kenya Meterological Department
- Kenya Ministry of Health
- Food and Agriculture Organization of the United Nations
- National Imagery and Mapping Agency of the United States
- Published Literature