SlideShare a Scribd company logo
1 of 8
Obesity and
Environmental Factors
A MACHINE LEARNING APPROACH
Ballard, Harnagel, Peterson
Obesity is a
Serious Problem
 Obesity is weight that is higher than what is
considered as a healthy weight for a given height
 Nearly 40% of adults (approximately 94 million)
 Nearly 20% of children (approximately 13 million)
 Remarkably, according to the USDA, it is getting
worse— quickly
 USDA studies indicate that by 2030 nearly one-half of
the U.S. population will be obese
 Obesity is linked with undesirable medical issues,
including:
 Heart disease
 Diabetes
 Cancer
 The costs are also staggering with estimates of $147
billion per year
Research Goals
 Determine if machine learning (ML) could be used to predict the obesity of
an out-of-sample observation.
 If ML works, develop the most performant model(s):
 Most performant single model
 Most performant ensemble
 Interpret the relationship between (features) environmental factors and
obesity (target)
 The astute amongst you have already identified a dilemma:
 Generally the most performant models are the least interpretable
 We pursued a dual modeling approach
The Data
 Source: United States Department of Agriculture (USDA)
 Geo-centric, each observation is a state and county combination.
 Wide variety of features from a broad spectrum of perspectives:
 access and proximity to grocery stores
 restaurant availability and expenditures
 food assistance
 food prices and taxes
 health and physical activity
 socioeconomic characteristics
 Wrangling:
 Removed obvious sources of data leakage
 Removed rows & columns missing over 90% of data
 Missing values imputed with median
 Few true strings. Coerced some features into numeric dtypes in pandas.
Interpretation
 Research goal was to
understand relationship
between features and target
 There is no one perfect method
 Fisher score was selected due to
broad usage and general good
performance
Performance
 Research goal was to develop
most performant models:
 Single model
 Ensemble
 LogicPlum leveraged to focus
on the findings
Performance
 Research goal was to develop
most performant models:
 Single model
 Ensemble
 LogicPlum leveraged to focus
on the findings
Conclusions
ML METHODS ARE POWERFUL IN
PREDICTING OBESITY
ML METHODS PROVIDE INSIGHT
INTO CORRELATIONS

More Related Content

Similar to Obesity and Environmental Factors; A Machine Learning Approach

Preeclampsia research presentation
Preeclampsia research presentationPreeclampsia research presentation
Preeclampsia research presentationMeriel
 
Note; you didn’t corrected this question network models and netw.docx
Note; you didn’t corrected this question network models and netw.docxNote; you didn’t corrected this question network models and netw.docx
Note; you didn’t corrected this question network models and netw.docxcurwenmichaela
 
Gender and a food secure future: What do we need to know? What do we need to do?
Gender and a food secure future: What do we need to know? What do we need to do?Gender and a food secure future: What do we need to know? What do we need to do?
Gender and a food secure future: What do we need to know? What do we need to do?CGIAR
 
INTRODUCTION TO HEALTHCARE RESEARCH METHODS: Correlational Studies, Case Seri...
INTRODUCTION TO HEALTHCARE RESEARCH METHODS: Correlational Studies, Case Seri...INTRODUCTION TO HEALTHCARE RESEARCH METHODS: Correlational Studies, Case Seri...
INTRODUCTION TO HEALTHCARE RESEARCH METHODS: Correlational Studies, Case Seri...Dr. Khaled OUANES
 
Healthy Food Accessibility and Obesity: Case Study of Pennsylvania, USA
Healthy Food Accessibility and Obesity: Case Study of Pennsylvania, USAHealthy Food Accessibility and Obesity: Case Study of Pennsylvania, USA
Healthy Food Accessibility and Obesity: Case Study of Pennsylvania, USArsmahabir
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Chirag Patel
 
The Internet and Information· One of the most effective strate.docx
The Internet and Information· One of the most effective strate.docxThe Internet and Information· One of the most effective strate.docx
The Internet and Information· One of the most effective strate.docxarnoldmeredith47041
 
Summarization Techniques in Association Rule Data Mining For Risk Assessment ...
Summarization Techniques in Association Rule Data Mining For Risk Assessment ...Summarization Techniques in Association Rule Data Mining For Risk Assessment ...
Summarization Techniques in Association Rule Data Mining For Risk Assessment ...IJTET Journal
 
BSRS2010 Final Presentation: mHealth course (ppt)
BSRS2010 Final Presentation: mHealth course (ppt)BSRS2010 Final Presentation: mHealth course (ppt)
BSRS2010 Final Presentation: mHealth course (ppt)Heather Zornetzer
 
missing-data-and-multiple-imputation-in-clinical-epidemiolog
missing-data-and-multiple-imputation-in-clinical-epidemiolog missing-data-and-multiple-imputation-in-clinical-epidemiolog
missing-data-and-multiple-imputation-in-clinical-epidemiolog simbycris
 
The Mis-measure of Health Care: Can Measurement, Improvement, and Cost Reduct...
The Mis-measure of Health Care: Can Measurement, Improvement, and Cost Reduct...The Mis-measure of Health Care: Can Measurement, Improvement, and Cost Reduct...
The Mis-measure of Health Care: Can Measurement, Improvement, and Cost Reduct...The Commonwealth Fund
 
1Running head OBESITY 3Running head OBESITY.docx
1Running head OBESITY 3Running head OBESITY.docx1Running head OBESITY 3Running head OBESITY.docx
1Running head OBESITY 3Running head OBESITY.docxfelicidaddinwoodie
 
A Comparative Analysis of Hybridized Genetic Algorithm in Predictive Models o...
A Comparative Analysis of Hybridized Genetic Algorithm in Predictive Models o...A Comparative Analysis of Hybridized Genetic Algorithm in Predictive Models o...
A Comparative Analysis of Hybridized Genetic Algorithm in Predictive Models o...Shakas Technologies
 
Hss4303b mortality and morbidity
Hss4303b   mortality and morbidityHss4303b   mortality and morbidity
Hss4303b mortality and morbiditycoolboy101pk
 
Diabetes Mellitus Prediction System Using Data Mining
Diabetes Mellitus Prediction System Using Data MiningDiabetes Mellitus Prediction System Using Data Mining
Diabetes Mellitus Prediction System Using Data Miningpaperpublications3
 
Khoury ashg2014
Khoury ashg2014Khoury ashg2014
Khoury ashg2014muink
 
East Effectiveness of Nurse Practitioner Coordinated Team.pdf
East Effectiveness of Nurse Practitioner Coordinated Team.pdfEast Effectiveness of Nurse Practitioner Coordinated Team.pdf
East Effectiveness of Nurse Practitioner Coordinated Team.pdfsdfghj21
 
1Running head OBESITY 4Running head OBESITY.docx
1Running head OBESITY 4Running head OBESITY.docx1Running head OBESITY 4Running head OBESITY.docx
1Running head OBESITY 4Running head OBESITY.docxvickeryr87
 
DIAGNOSIS OF OBESITY LEVEL BASED ON BAGGING ENSEMBLE CLASSIFIER AND FEATURE S...
DIAGNOSIS OF OBESITY LEVEL BASED ON BAGGING ENSEMBLE CLASSIFIER AND FEATURE S...DIAGNOSIS OF OBESITY LEVEL BASED ON BAGGING ENSEMBLE CLASSIFIER AND FEATURE S...
DIAGNOSIS OF OBESITY LEVEL BASED ON BAGGING ENSEMBLE CLASSIFIER AND FEATURE S...ijaia
 

Similar to Obesity and Environmental Factors; A Machine Learning Approach (20)

Preeclampsia research presentation
Preeclampsia research presentationPreeclampsia research presentation
Preeclampsia research presentation
 
Note; you didn’t corrected this question network models and netw.docx
Note; you didn’t corrected this question network models and netw.docxNote; you didn’t corrected this question network models and netw.docx
Note; you didn’t corrected this question network models and netw.docx
 
Gender and a food secure future: What do we need to know? What do we need to do?
Gender and a food secure future: What do we need to know? What do we need to do?Gender and a food secure future: What do we need to know? What do we need to do?
Gender and a food secure future: What do we need to know? What do we need to do?
 
INTRODUCTION TO HEALTHCARE RESEARCH METHODS: Correlational Studies, Case Seri...
INTRODUCTION TO HEALTHCARE RESEARCH METHODS: Correlational Studies, Case Seri...INTRODUCTION TO HEALTHCARE RESEARCH METHODS: Correlational Studies, Case Seri...
INTRODUCTION TO HEALTHCARE RESEARCH METHODS: Correlational Studies, Case Seri...
 
Healthy Food Accessibility and Obesity: Case Study of Pennsylvania, USA
Healthy Food Accessibility and Obesity: Case Study of Pennsylvania, USAHealthy Food Accessibility and Obesity: Case Study of Pennsylvania, USA
Healthy Food Accessibility and Obesity: Case Study of Pennsylvania, USA
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416
 
The Internet and Information· One of the most effective strate.docx
The Internet and Information· One of the most effective strate.docxThe Internet and Information· One of the most effective strate.docx
The Internet and Information· One of the most effective strate.docx
 
Summarization Techniques in Association Rule Data Mining For Risk Assessment ...
Summarization Techniques in Association Rule Data Mining For Risk Assessment ...Summarization Techniques in Association Rule Data Mining For Risk Assessment ...
Summarization Techniques in Association Rule Data Mining For Risk Assessment ...
 
BSRS2010 Final Presentation: mHealth course (ppt)
BSRS2010 Final Presentation: mHealth course (ppt)BSRS2010 Final Presentation: mHealth course (ppt)
BSRS2010 Final Presentation: mHealth course (ppt)
 
missing-data-and-multiple-imputation-in-clinical-epidemiolog
missing-data-and-multiple-imputation-in-clinical-epidemiolog missing-data-and-multiple-imputation-in-clinical-epidemiolog
missing-data-and-multiple-imputation-in-clinical-epidemiolog
 
HBM- vehicular emissions
HBM- vehicular emissionsHBM- vehicular emissions
HBM- vehicular emissions
 
The Mis-measure of Health Care: Can Measurement, Improvement, and Cost Reduct...
The Mis-measure of Health Care: Can Measurement, Improvement, and Cost Reduct...The Mis-measure of Health Care: Can Measurement, Improvement, and Cost Reduct...
The Mis-measure of Health Care: Can Measurement, Improvement, and Cost Reduct...
 
1Running head OBESITY 3Running head OBESITY.docx
1Running head OBESITY 3Running head OBESITY.docx1Running head OBESITY 3Running head OBESITY.docx
1Running head OBESITY 3Running head OBESITY.docx
 
A Comparative Analysis of Hybridized Genetic Algorithm in Predictive Models o...
A Comparative Analysis of Hybridized Genetic Algorithm in Predictive Models o...A Comparative Analysis of Hybridized Genetic Algorithm in Predictive Models o...
A Comparative Analysis of Hybridized Genetic Algorithm in Predictive Models o...
 
Hss4303b mortality and morbidity
Hss4303b   mortality and morbidityHss4303b   mortality and morbidity
Hss4303b mortality and morbidity
 
Diabetes Mellitus Prediction System Using Data Mining
Diabetes Mellitus Prediction System Using Data MiningDiabetes Mellitus Prediction System Using Data Mining
Diabetes Mellitus Prediction System Using Data Mining
 
Khoury ashg2014
Khoury ashg2014Khoury ashg2014
Khoury ashg2014
 
East Effectiveness of Nurse Practitioner Coordinated Team.pdf
East Effectiveness of Nurse Practitioner Coordinated Team.pdfEast Effectiveness of Nurse Practitioner Coordinated Team.pdf
East Effectiveness of Nurse Practitioner Coordinated Team.pdf
 
1Running head OBESITY 4Running head OBESITY.docx
1Running head OBESITY 4Running head OBESITY.docx1Running head OBESITY 4Running head OBESITY.docx
1Running head OBESITY 4Running head OBESITY.docx
 
DIAGNOSIS OF OBESITY LEVEL BASED ON BAGGING ENSEMBLE CLASSIFIER AND FEATURE S...
DIAGNOSIS OF OBESITY LEVEL BASED ON BAGGING ENSEMBLE CLASSIFIER AND FEATURE S...DIAGNOSIS OF OBESITY LEVEL BASED ON BAGGING ENSEMBLE CLASSIFIER AND FEATURE S...
DIAGNOSIS OF OBESITY LEVEL BASED ON BAGGING ENSEMBLE CLASSIFIER AND FEATURE S...
 

Recently uploaded

CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlkumarajju5765
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 

Recently uploaded (20)

CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 

Obesity and Environmental Factors; A Machine Learning Approach

  • 1. Obesity and Environmental Factors A MACHINE LEARNING APPROACH Ballard, Harnagel, Peterson
  • 2. Obesity is a Serious Problem  Obesity is weight that is higher than what is considered as a healthy weight for a given height  Nearly 40% of adults (approximately 94 million)  Nearly 20% of children (approximately 13 million)  Remarkably, according to the USDA, it is getting worse— quickly  USDA studies indicate that by 2030 nearly one-half of the U.S. population will be obese  Obesity is linked with undesirable medical issues, including:  Heart disease  Diabetes  Cancer  The costs are also staggering with estimates of $147 billion per year
  • 3. Research Goals  Determine if machine learning (ML) could be used to predict the obesity of an out-of-sample observation.  If ML works, develop the most performant model(s):  Most performant single model  Most performant ensemble  Interpret the relationship between (features) environmental factors and obesity (target)  The astute amongst you have already identified a dilemma:  Generally the most performant models are the least interpretable  We pursued a dual modeling approach
  • 4. The Data  Source: United States Department of Agriculture (USDA)  Geo-centric, each observation is a state and county combination.  Wide variety of features from a broad spectrum of perspectives:  access and proximity to grocery stores  restaurant availability and expenditures  food assistance  food prices and taxes  health and physical activity  socioeconomic characteristics  Wrangling:  Removed obvious sources of data leakage  Removed rows & columns missing over 90% of data  Missing values imputed with median  Few true strings. Coerced some features into numeric dtypes in pandas.
  • 5. Interpretation  Research goal was to understand relationship between features and target  There is no one perfect method  Fisher score was selected due to broad usage and general good performance
  • 6. Performance  Research goal was to develop most performant models:  Single model  Ensemble  LogicPlum leveraged to focus on the findings
  • 7. Performance  Research goal was to develop most performant models:  Single model  Ensemble  LogicPlum leveraged to focus on the findings
  • 8. Conclusions ML METHODS ARE POWERFUL IN PREDICTING OBESITY ML METHODS PROVIDE INSIGHT INTO CORRELATIONS