Analysis of
2014 Behavioral Risk Factor Surveillance System
Healthcare Analytics
Table of Contents
- Executive Summary
- BRFSS Overview + Scope of Analysis
- Exploratory Data Analysis
- Modeling
- Recommendations + Follow Up Research
Executive Summary
- BRFSS is an annnual survey of americans over the age of 18
- Survey data is tricky (sampling + survey designs)
- Based on our predictive model, we would estimate 39/1000 people have diabetes
- Out of those without diabetes, we can incentivize 38/961 people who are at high risk of diabetes
BRFSS Overview + Scope of Analysis
- Behaviorial Risk Factor Surveillance System (BRFSS) is an annual survey corrdinated by State Level Health Departments and the Center for Disease Control (CDC)
- Phone surveys are conducted in all states asking questions pertaining to demographics, risk behaviors, chronic disease and conditions, etc…
- The aim of this study is to predict the probability of a survey participant have diabetes using common health survey variables and identify those at probable risk of diabetes
Exploratory Data Analysis
Exploratory Data Analysis (Continued)
Exploratory Data Analysis (Continued)
Exploratory Data Analysis (Continued)
Modeling
diabetes ~ ["distance_bmi"^2, "age", "physical_activity"]
- Accuracy : .86
- AUC: .66
- Count Predicted True: 3567
- Count High Risk (Prob between 45%-49.9%): 2845
diabetes ~ ["distance_bmi"^2, OHOT("age", "gen_health", "income_level"), "difficulty_walk", "high_bp", "high_chol"]
- Accuracy : .87
- AUC: .72
- Count Predicted True: 9902
- Count High Risk (Prob between 45%-49.9%): 4542