SlideShare a Scribd company logo
1 of 43
Predicting Obesity Rates in the US
D3M
Discrete Choice Models
 Regression models we were studying so far had a continuous
dependent variable (e.g. sales of a product, House prices etc.)
o Predictor variables could be continuous or discrete (dummy variables)
 Often the phenomenon of interest (i.e. our dependent
variable) is discrete
o Vote or not
o Customer acquisition/defection
o Buy/no-buy
o Click on a banner Ad
o Survive/Don’t survive
With discrete outcomes, we are predicting probabilities (of say
customer defection).
Properties of probabilities?
3
• With binary or categorical dependent variables
standard regression analysis is not appropriate
• Example
• binary dependent variable y coded to be zero for non-purchases and
one for purchases
• X is a continuous metric say price
• Problems
 The error terms are heteroskedastic (variance of the dependent
variable is different with different values of the independent variables
 Does not meet the assumptions of standard ols regression
 Prediction often below zero and values above one
Why Regression does not work
with
0 for non purchases
1 for purchases
y x
y
    

 

4
Discrete choice models
 Generalize the regression model for the situations
where y is a non-metric variable
o a binary (0-1) variable or
o an ordinal variable (like a questionnaire item assuming the
values completely disagree, disagree, neither, agree,
completely agree) or
o a categorical variable (for example a nominal variable
recording the preferred Brand).
 The right-hand side variables can be discrete or continuous
 Similar to linear regression but interpretations are different
i iy x    
5
Logit: We want our predictions to be a probability
Solution: instead of estimating
we estimate the model
which, after rearranging, equals
nn xcxcxcc
yp
yp



...
)1(1
)1(
ln 22110
nn
nn
xcxcc
xcxcc
e
e
yp 


 ...
...
110
110
1
)1(
Case Study: Predicting Obesity Rates
8
Obesity Trends* Among U.S. Adults
BRFSS, 1990
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14%
9
Obesity Trends* Among U.S. Adults
BRFSS, 1991
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14% 15%–19%
10
Obesity Trends* Among U.S. Adults
BRFSS, 1992
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14% 15%–19%
11
Obesity Trends* Among U.S. Adults
BRFSS, 1993
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14% 15%–19%
12
Obesity Trends* Among U.S. Adults
BRFSS, 1994
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14% 15%–19%
13
Obesity Trends* Among U.S. Adults
BRFSS, 1995
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14% 15%–19%
14
Obesity Trends* Among U.S. Adults
BRFSS, 1996
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14% 15%–19%
15
Obesity Trends* Among U.S. Adults
BRFSS, 1997
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14% 15%–19% ≥20%
16
Obesity Trends* Among U.S. Adults
BRFSS, 1998
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14% 15%–19% ≥20%
17
Obesity Trends* Among U.S. Adults
BRFSS, 1999
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14% 15%–19% ≥20%
18
Obesity Trends* Among U.S. Adults
BRFSS, 2000
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14% 15%–19% ≥20%
19
Obesity Trends* Among U.S. Adults
BRFSS, 2001
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14% 15%–19% 20%–24% ≥25%
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
Obesity Trends* Among U.S. Adults
BRFSS, 2002
No Data <10% 10%–14% 15%–19% 20%–24% ≥25%
20
21
Obesity Trends* Among U.S. Adults
BRFSS, 2003
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14% 15%–19% 20%–24% ≥25%
Obesity Trends* Among U.S. Adults
BRFSS, 2004
22
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14% 15%–19% 20%–24% ≥25%
Obesity Trends* Among U.S. Adults
BRFSS, 2005
23
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14 15%–19% 20%–24% 25%–29% ≥30%
Obesity Trends* Among U.S. Adults
BRFSS, 2006
24
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14 15%–19% 20%–24% 25%–29% ≥30%
Obesity Trends* Among U.S. Adults
BRFSS, 2007
25
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14 15%–19% 20%–24% 25%–29% ≥30%
Obesity Trends* Among U.S. Adults
BRFSS, 2008
26
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14 15%–19% 20%–24% 25%–29% ≥30%
Obesity Trends* Among U.S. Adults
BRFSS, 2009
27
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14 15%–19% 20%–24% 25%–29% ≥30%
Obesity Trends* Among U.S. Adults
BRFSS, 2010
28
(*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person)
No Data <10% 10%–14 15%–19% 20%–24% 25%–29% ≥30%
Data Source
BRFSS Survey by CDC
BRFSS Data at CDC
• Behavioral Risk Factor Surveillance System
(BRFSS) at CDC:
 World’s largest survey
• Monthly telephone interviews 18 years old of age or
older living in households
• 50 states, the District of Columbia, Puerto Rico,
Guam and the Virgin Islands
 Pooled data for 2001-2010.
 Approximately 3 million observations.
 We are using a random sample from 2006-2010
Data
Always start by summary statistics
OLS Coefficients
OLS
Logit Coefficients
Logit
Marginal Effects
Interpretation
 For continuous variables: Change in probability of being
Obese for a 1 unit change in the variable. In our case, only
AGE is continuous. Increasing AGE by 1 year, lowers
probability of being obese by 0.1%. Small effect but see later.
 For dummy variables, change in probability compared to the
reference category. Person with “No High School” has 7%
higher likelihood of being obese compared to a person with
college degree (holding everything else fixed)
Capturing non-linear Age Effects
OLS Regression
Capturing non-linear Age Effects
Logit
Marginal Effects
What do we conclude about Age now?
Age Effect is non-linear
Obesity

More Related Content

What's hot (20)

Obesity
ObesityObesity
Obesity
 
Obesity
ObesityObesity
Obesity
 
Obesity
ObesityObesity
Obesity
 
My seminar Obesity by Hani
My seminar Obesity by HaniMy seminar Obesity by Hani
My seminar Obesity by Hani
 
Insel10ebrup Ppt Ch11
Insel10ebrup Ppt Ch11Insel10ebrup Ppt Ch11
Insel10ebrup Ppt Ch11
 
Obesity
ObesityObesity
Obesity
 
obesity
obesityobesity
obesity
 
Obesity
ObesityObesity
Obesity
 
Obesity
ObesityObesity
Obesity
 
Obesity
ObesityObesity
Obesity
 
obesity management
 obesity management obesity management
obesity management
 
OBESITY
OBESITY OBESITY
OBESITY
 
Obesity in women by Dr. Sharda Jain presented on 17th August 14 at DMA Cente...
Obesity in women by Dr. Sharda Jain presented  on 17th August 14 at DMA Cente...Obesity in women by Dr. Sharda Jain presented  on 17th August 14 at DMA Cente...
Obesity in women by Dr. Sharda Jain presented on 17th August 14 at DMA Cente...
 
Obesity power point 2018
Obesity power point 2018Obesity power point 2018
Obesity power point 2018
 
Obesity and Cardiovascular Diseases
Obesity and Cardiovascular DiseasesObesity and Cardiovascular Diseases
Obesity and Cardiovascular Diseases
 
OBESITY & OVERWEIGHT ‘a modern day havoc ’
OBESITY & OVERWEIGHT‘a modern day havoc ’OBESITY & OVERWEIGHT‘a modern day havoc ’
OBESITY & OVERWEIGHT ‘a modern day havoc ’
 
OBESITY
OBESITYOBESITY
OBESITY
 
Obesity
ObesityObesity
Obesity
 
Obesity
Obesity Obesity
Obesity
 
Obesity
ObesityObesity
Obesity
 

Viewers also liked

Aiello - IMQ - 15° Convegno Europeo CSG
Aiello - IMQ - 15° Convegno Europeo CSGAiello - IMQ - 15° Convegno Europeo CSG
Aiello - IMQ - 15° Convegno Europeo CSGCentro Studi Galileo
 
Rumah Junior - Kelas Motivasi
Rumah Junior - Kelas MotivasiRumah Junior - Kelas Motivasi
Rumah Junior - Kelas MotivasiRumah Junior
 
Watching cat videos
Watching cat videosWatching cat videos
Watching cat videosmackzukein
 
Cci depliant apprentissage 2016
Cci depliant apprentissage 2016Cci depliant apprentissage 2016
Cci depliant apprentissage 2016Handirect 05
 
Funny cat videos and dogs
Funny cat videos and dogsFunny cat videos and dogs
Funny cat videos and dogscridrando
 
Goodman Final Presentation_French_Jeremy
Goodman Final Presentation_French_JeremyGoodman Final Presentation_French_Jeremy
Goodman Final Presentation_French_JeremyJeremy French
 
I cms e la legge Stanca
I cms e la legge StancaI cms e la legge Stanca
I cms e la legge StancaGianluigi Cogo
 
Carto parcours de-maintien-en_emploi-2
Carto parcours de-maintien-en_emploi-2Carto parcours de-maintien-en_emploi-2
Carto parcours de-maintien-en_emploi-2Handirect 05
 
Voigt EPEE - 15th European Conference CSG
Voigt EPEE - 15th European Conference CSGVoigt EPEE - 15th European Conference CSG
Voigt EPEE - 15th European Conference CSGCentro Studi Galileo
 
Brand Asset Case Study
Brand Asset Case StudyBrand Asset Case Study
Brand Asset Case Studyveesingh
 
N4 Communication - Basic Communication Principles for N4 students at TVET Col...
N4 Communication - Basic Communication Principles for N4 students at TVET Col...N4 Communication - Basic Communication Principles for N4 students at TVET Col...
N4 Communication - Basic Communication Principles for N4 students at TVET Col...Varsity College
 
Dalla partecipazione al governo condiviso
Dalla partecipazione al governo condivisoDalla partecipazione al governo condiviso
Dalla partecipazione al governo condivisoGianluigi Cogo
 
Городская неделя профилактики употребления алкоголя «Будущее в моих руках!»
 Городская неделя  профилактики употребления алкоголя  «Будущее в моих руках!» Городская неделя  профилактики употребления алкоголя  «Будущее в моих руках!»
Городская неделя профилактики употребления алкоголя «Будущее в моих руках!»Андрей Афанасьев
 
N4 Communication - Organisational Communication for students at TVET Colleges...
N4 Communication - Organisational Communication for students at TVET Colleges...N4 Communication - Organisational Communication for students at TVET Colleges...
N4 Communication - Organisational Communication for students at TVET Colleges...Varsity College
 
Finite and non finite verbs
Finite and non finite verbsFinite and non finite verbs
Finite and non finite verbsJaya Prabu
 

Viewers also liked (19)

Aiello - IMQ - 15° Convegno Europeo CSG
Aiello - IMQ - 15° Convegno Europeo CSGAiello - IMQ - 15° Convegno Europeo CSG
Aiello - IMQ - 15° Convegno Europeo CSG
 
Rumah Junior - Kelas Motivasi
Rumah Junior - Kelas MotivasiRumah Junior - Kelas Motivasi
Rumah Junior - Kelas Motivasi
 
Watching cat videos
Watching cat videosWatching cat videos
Watching cat videos
 
Cci depliant apprentissage 2016
Cci depliant apprentissage 2016Cci depliant apprentissage 2016
Cci depliant apprentissage 2016
 
Kontrak Berkala Emas UBS
Kontrak Berkala Emas UBSKontrak Berkala Emas UBS
Kontrak Berkala Emas UBS
 
Impact of internal factors on strategic planning
Impact of internal factors on strategic planningImpact of internal factors on strategic planning
Impact of internal factors on strategic planning
 
Funny cat videos and dogs
Funny cat videos and dogsFunny cat videos and dogs
Funny cat videos and dogs
 
Goodman Final Presentation_French_Jeremy
Goodman Final Presentation_French_JeremyGoodman Final Presentation_French_Jeremy
Goodman Final Presentation_French_Jeremy
 
I cms e la legge Stanca
I cms e la legge StancaI cms e la legge Stanca
I cms e la legge Stanca
 
210
210210
210
 
Carto parcours de-maintien-en_emploi-2
Carto parcours de-maintien-en_emploi-2Carto parcours de-maintien-en_emploi-2
Carto parcours de-maintien-en_emploi-2
 
Voigt EPEE - 15th European Conference CSG
Voigt EPEE - 15th European Conference CSGVoigt EPEE - 15th European Conference CSG
Voigt EPEE - 15th European Conference CSG
 
Brand Asset Case Study
Brand Asset Case StudyBrand Asset Case Study
Brand Asset Case Study
 
N4 Communication - Basic Communication Principles for N4 students at TVET Col...
N4 Communication - Basic Communication Principles for N4 students at TVET Col...N4 Communication - Basic Communication Principles for N4 students at TVET Col...
N4 Communication - Basic Communication Principles for N4 students at TVET Col...
 
Dalla partecipazione al governo condiviso
Dalla partecipazione al governo condivisoDalla partecipazione al governo condiviso
Dalla partecipazione al governo condiviso
 
Городская неделя профилактики употребления алкоголя «Будущее в моих руках!»
 Городская неделя  профилактики употребления алкоголя  «Будущее в моих руках!» Городская неделя  профилактики употребления алкоголя  «Будущее в моих руках!»
Городская неделя профилактики употребления алкоголя «Будущее в моих руках!»
 
Script
ScriptScript
Script
 
N4 Communication - Organisational Communication for students at TVET Colleges...
N4 Communication - Organisational Communication for students at TVET Colleges...N4 Communication - Organisational Communication for students at TVET Colleges...
N4 Communication - Organisational Communication for students at TVET Colleges...
 
Finite and non finite verbs
Finite and non finite verbsFinite and non finite verbs
Finite and non finite verbs
 

Similar to Obesity

Childhood obesity j_fw-audio 3-23-11 final
Childhood obesity j_fw-audio 3-23-11 finalChildhood obesity j_fw-audio 3-23-11 final
Childhood obesity j_fw-audio 3-23-11 finalbethanybutcher
 
Paul Resnick, "Healthier Together: Social Approaches to Health and Wellness"
Paul Resnick, "Healthier Together: Social Approaches to Health and Wellness"Paul Resnick, "Healthier Together: Social Approaches to Health and Wellness"
Paul Resnick, "Healthier Together: Social Approaches to Health and Wellness"summersocialwebshop
 
Metabolism
MetabolismMetabolism
Metabolismcallr
 
Obesity trends 2009
Obesity trends 2009Obesity trends 2009
Obesity trends 2009Joe Fahs
 
www.Bariatric-Surgery-Source.com : Obesity Statistics in America: 1985 - 2009
www.Bariatric-Surgery-Source.com : Obesity Statistics in America: 1985 - 2009www.Bariatric-Surgery-Source.com : Obesity Statistics in America: 1985 - 2009
www.Bariatric-Surgery-Source.com : Obesity Statistics in America: 1985 - 2009Quinlan2
 
Obesity Trends in U.S. from 1985 through 2010
Obesity Trends in U.S. from 1985 through 2010Obesity Trends in U.S. from 1985 through 2010
Obesity Trends in U.S. from 1985 through 2010Art Rothafel
 
Obesity Trends 2008
Obesity Trends 2008Obesity Trends 2008
Obesity Trends 2008y2kemo
 
obesity_trends_2009
obesity_trends_2009obesity_trends_2009
obesity_trends_2009ravikolli
 
Obesity trends 2009
Obesity trends 2009Obesity trends 2009
Obesity trends 2009Dane Conrad
 
Obesity trends 2010
Obesity trends 2010Obesity trends 2010
Obesity trends 2010kellybolton
 
Obesity trends 2010
Obesity trends 2010Obesity trends 2010
Obesity trends 2010misteraugie
 
Tackling Childhood Obesity The Role Of Good Communications
Tackling Childhood Obesity   The Role Of Good CommunicationsTackling Childhood Obesity   The Role Of Good Communications
Tackling Childhood Obesity The Role Of Good Communicationsbevpostma
 
Obesity Final Presentation2
Obesity Final Presentation2Obesity Final Presentation2
Obesity Final Presentation2pwyncess
 
ENG 101 Slidecast
ENG 101 SlidecastENG 101 Slidecast
ENG 101 Slidecastsamanthayer
 
Obesity trends 2010
Obesity trends 2010Obesity trends 2010
Obesity trends 2010mbluestone94
 
Obesity trends 2010
Obesity trends 2010Obesity trends 2010
Obesity trends 2010efranck047
 
Obesity trends 2010
Obesity trends 2010Obesity trends 2010
Obesity trends 2010mbluestone94
 

Similar to Obesity (20)

Childhood obesity j_fw-audio 3-23-11 final
Childhood obesity j_fw-audio 3-23-11 finalChildhood obesity j_fw-audio 3-23-11 final
Childhood obesity j_fw-audio 3-23-11 final
 
Paul Resnick, "Healthier Together: Social Approaches to Health and Wellness"
Paul Resnick, "Healthier Together: Social Approaches to Health and Wellness"Paul Resnick, "Healthier Together: Social Approaches to Health and Wellness"
Paul Resnick, "Healthier Together: Social Approaches to Health and Wellness"
 
Metabolism
MetabolismMetabolism
Metabolism
 
Obesity trends 2009
Obesity trends 2009Obesity trends 2009
Obesity trends 2009
 
www.Bariatric-Surgery-Source.com : Obesity Statistics in America: 1985 - 2009
www.Bariatric-Surgery-Source.com : Obesity Statistics in America: 1985 - 2009www.Bariatric-Surgery-Source.com : Obesity Statistics in America: 1985 - 2009
www.Bariatric-Surgery-Source.com : Obesity Statistics in America: 1985 - 2009
 
CDC U.S. Obesity Trends Maps,1999-2009
CDC U.S. Obesity Trends  Maps,1999-2009CDC U.S. Obesity Trends  Maps,1999-2009
CDC U.S. Obesity Trends Maps,1999-2009
 
Obesity Trends in U.S. from 1985 through 2010
Obesity Trends in U.S. from 1985 through 2010Obesity Trends in U.S. from 1985 through 2010
Obesity Trends in U.S. from 1985 through 2010
 
Obesity Trends 2008
Obesity Trends 2008Obesity Trends 2008
Obesity Trends 2008
 
Obesity trends 2009
Obesity trends 2009Obesity trends 2009
Obesity trends 2009
 
obesity_trends_2009
obesity_trends_2009obesity_trends_2009
obesity_trends_2009
 
Obesity trends 2009
Obesity trends 2009Obesity trends 2009
Obesity trends 2009
 
Obesity trends 2010
Obesity trends 2010Obesity trends 2010
Obesity trends 2010
 
Obesity trends 2010
Obesity trends 2010Obesity trends 2010
Obesity trends 2010
 
Tackling Childhood Obesity The Role Of Good Communications
Tackling Childhood Obesity   The Role Of Good CommunicationsTackling Childhood Obesity   The Role Of Good Communications
Tackling Childhood Obesity The Role Of Good Communications
 
Obesity Final Presentation2
Obesity Final Presentation2Obesity Final Presentation2
Obesity Final Presentation2
 
ENG 101 Slidecast
ENG 101 SlidecastENG 101 Slidecast
ENG 101 Slidecast
 
Obesity trends 2010
Obesity trends 2010Obesity trends 2010
Obesity trends 2010
 
Obesity trends 2010
Obesity trends 2010Obesity trends 2010
Obesity trends 2010
 
Obesity trends 2010
Obesity trends 2010Obesity trends 2010
Obesity trends 2010
 
Overvekt i USA: 1985 - 2010
Overvekt i USA: 1985 - 2010Overvekt i USA: 1985 - 2010
Overvekt i USA: 1985 - 2010
 

More from veesingh

Brand Analytics
Brand AnalyticsBrand Analytics
Brand Analyticsveesingh
 
Store segmentation progresso
Store segmentation progressoStore segmentation progresso
Store segmentation progressoveesingh
 
Pricing strategy progresso
Pricing strategy progressoPricing strategy progresso
Pricing strategy progressoveesingh
 
Regressioin mini case
Regressioin mini caseRegressioin mini case
Regressioin mini caseveesingh
 
Identification1
Identification1Identification1
Identification1veesingh
 
Pricing Strategies for Brands
Pricing Strategies for BrandsPricing Strategies for Brands
Pricing Strategies for Brandsveesingh
 
Fat Tax Slideshow
Fat Tax SlideshowFat Tax Slideshow
Fat Tax Slideshowveesingh
 
Correlation causality
Correlation causalityCorrelation causality
Correlation causalityveesingh
 
Unsupervised learning
Unsupervised learningUnsupervised learning
Unsupervised learningveesingh
 
Field experiments
Field experimentsField experiments
Field experimentsveesingh
 
Brand mining
Brand miningBrand mining
Brand miningveesingh
 
D3M Commodity
D3M Commodity D3M Commodity
D3M Commodity veesingh
 
D3M Online Reviews
D3M Online ReviewsD3M Online Reviews
D3M Online Reviewsveesingh
 
D3M Politics
D3M PoliticsD3M Politics
D3M Politicsveesingh
 

More from veesingh (15)

Slalom
SlalomSlalom
Slalom
 
Brand Analytics
Brand AnalyticsBrand Analytics
Brand Analytics
 
Store segmentation progresso
Store segmentation progressoStore segmentation progresso
Store segmentation progresso
 
Pricing strategy progresso
Pricing strategy progressoPricing strategy progresso
Pricing strategy progresso
 
Regressioin mini case
Regressioin mini caseRegressioin mini case
Regressioin mini case
 
Identification1
Identification1Identification1
Identification1
 
Pricing Strategies for Brands
Pricing Strategies for BrandsPricing Strategies for Brands
Pricing Strategies for Brands
 
Fat Tax Slideshow
Fat Tax SlideshowFat Tax Slideshow
Fat Tax Slideshow
 
Correlation causality
Correlation causalityCorrelation causality
Correlation causality
 
Unsupervised learning
Unsupervised learningUnsupervised learning
Unsupervised learning
 
Field experiments
Field experimentsField experiments
Field experiments
 
Brand mining
Brand miningBrand mining
Brand mining
 
D3M Commodity
D3M Commodity D3M Commodity
D3M Commodity
 
D3M Online Reviews
D3M Online ReviewsD3M Online Reviews
D3M Online Reviews
 
D3M Politics
D3M PoliticsD3M Politics
D3M Politics
 

Recently uploaded

Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxBoston Institute of Analytics
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degreeyuu sss
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 

Recently uploaded (20)

Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 

Obesity

  • 1. Predicting Obesity Rates in the US D3M
  • 2. Discrete Choice Models  Regression models we were studying so far had a continuous dependent variable (e.g. sales of a product, House prices etc.) o Predictor variables could be continuous or discrete (dummy variables)  Often the phenomenon of interest (i.e. our dependent variable) is discrete o Vote or not o Customer acquisition/defection o Buy/no-buy o Click on a banner Ad o Survive/Don’t survive With discrete outcomes, we are predicting probabilities (of say customer defection). Properties of probabilities?
  • 3. 3 • With binary or categorical dependent variables standard regression analysis is not appropriate • Example • binary dependent variable y coded to be zero for non-purchases and one for purchases • X is a continuous metric say price • Problems  The error terms are heteroskedastic (variance of the dependent variable is different with different values of the independent variables  Does not meet the assumptions of standard ols regression  Prediction often below zero and values above one Why Regression does not work with 0 for non purchases 1 for purchases y x y         
  • 4. 4 Discrete choice models  Generalize the regression model for the situations where y is a non-metric variable o a binary (0-1) variable or o an ordinal variable (like a questionnaire item assuming the values completely disagree, disagree, neither, agree, completely agree) or o a categorical variable (for example a nominal variable recording the preferred Brand).  The right-hand side variables can be discrete or continuous  Similar to linear regression but interpretations are different i iy x    
  • 5. 5
  • 6. Logit: We want our predictions to be a probability Solution: instead of estimating we estimate the model which, after rearranging, equals nn xcxcxcc yp yp    ... )1(1 )1( ln 22110 nn nn xcxcc xcxcc e e yp     ... ... 110 110 1 )1(
  • 7. Case Study: Predicting Obesity Rates
  • 8. 8 Obesity Trends* Among U.S. Adults BRFSS, 1990 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14%
  • 9. 9 Obesity Trends* Among U.S. Adults BRFSS, 1991 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14% 15%–19%
  • 10. 10 Obesity Trends* Among U.S. Adults BRFSS, 1992 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14% 15%–19%
  • 11. 11 Obesity Trends* Among U.S. Adults BRFSS, 1993 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14% 15%–19%
  • 12. 12 Obesity Trends* Among U.S. Adults BRFSS, 1994 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14% 15%–19%
  • 13. 13 Obesity Trends* Among U.S. Adults BRFSS, 1995 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14% 15%–19%
  • 14. 14 Obesity Trends* Among U.S. Adults BRFSS, 1996 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14% 15%–19%
  • 15. 15 Obesity Trends* Among U.S. Adults BRFSS, 1997 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14% 15%–19% ≥20%
  • 16. 16 Obesity Trends* Among U.S. Adults BRFSS, 1998 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14% 15%–19% ≥20%
  • 17. 17 Obesity Trends* Among U.S. Adults BRFSS, 1999 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14% 15%–19% ≥20%
  • 18. 18 Obesity Trends* Among U.S. Adults BRFSS, 2000 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14% 15%–19% ≥20%
  • 19. 19 Obesity Trends* Among U.S. Adults BRFSS, 2001 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14% 15%–19% 20%–24% ≥25%
  • 20. (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) Obesity Trends* Among U.S. Adults BRFSS, 2002 No Data <10% 10%–14% 15%–19% 20%–24% ≥25% 20
  • 21. 21 Obesity Trends* Among U.S. Adults BRFSS, 2003 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14% 15%–19% 20%–24% ≥25%
  • 22. Obesity Trends* Among U.S. Adults BRFSS, 2004 22 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14% 15%–19% 20%–24% ≥25%
  • 23. Obesity Trends* Among U.S. Adults BRFSS, 2005 23 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14 15%–19% 20%–24% 25%–29% ≥30%
  • 24. Obesity Trends* Among U.S. Adults BRFSS, 2006 24 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14 15%–19% 20%–24% 25%–29% ≥30%
  • 25. Obesity Trends* Among U.S. Adults BRFSS, 2007 25 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14 15%–19% 20%–24% 25%–29% ≥30%
  • 26. Obesity Trends* Among U.S. Adults BRFSS, 2008 26 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14 15%–19% 20%–24% 25%–29% ≥30%
  • 27. Obesity Trends* Among U.S. Adults BRFSS, 2009 27 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14 15%–19% 20%–24% 25%–29% ≥30%
  • 28. Obesity Trends* Among U.S. Adults BRFSS, 2010 28 (*BMI ≥30, or ~ 30 lbs. overweight for 5’ 4” person) No Data <10% 10%–14 15%–19% 20%–24% 25%–29% ≥30%
  • 30. BRFSS Data at CDC • Behavioral Risk Factor Surveillance System (BRFSS) at CDC:  World’s largest survey • Monthly telephone interviews 18 years old of age or older living in households • 50 states, the District of Columbia, Puerto Rico, Guam and the Virgin Islands  Pooled data for 2001-2010.  Approximately 3 million observations.  We are using a random sample from 2006-2010
  • 31. Data
  • 32. Always start by summary statistics
  • 34. OLS
  • 36. Logit
  • 38. Interpretation  For continuous variables: Change in probability of being Obese for a 1 unit change in the variable. In our case, only AGE is continuous. Increasing AGE by 1 year, lowers probability of being obese by 0.1%. Small effect but see later.  For dummy variables, change in probability compared to the reference category. Person with “No High School” has 7% higher likelihood of being obese compared to a person with college degree (holding everything else fixed)
  • 39. Capturing non-linear Age Effects OLS Regression
  • 40. Capturing non-linear Age Effects Logit
  • 41. Marginal Effects What do we conclude about Age now?
  • 42. Age Effect is non-linear