SlideShare a Scribd company logo
1 of 19
Data Science from the Perspective of an Applied Economist Scott Nicholson – @scootrous
This Talk A 30 minute Applied Economics PhD Will make you a better data scientist Exhibits the value-add of econometrician on a data science team
Recent Research by Economists Why Do Mothers Breastfeed Girls Less than Boys? Evidence and Implications for Child Health in India Family Violence and Football: The Effect of Unexpected Emotional Cues on Violent Behavior Does Terrorism Work? Racial Discrimination Among NBA Referees The Effects of Lottery Prizes on Winners and Their Neighbors: Evidence from the Dutch Postcode Lottery
What Makes an Applied Economist? Intuition Methods Curiosity about human decision-making Attention to underlying mechanisms
If you care about prediction, think like a computer scientist. If you care about causality,  think like an economist.
Gradations of Identifying Causal Relationships Randomized controlled experiments Natural experiments Regression discontinuity Panel data econometrics Instrumental variables
Randomized Controlled Experiment The Gold Standard
Natural Experiment How does having been a child soldier in Uganda affect lifetime earnings and likelihood of voting?
  Natural Experiment 	How does a 100 point decrease in SAT score affect likelihood of entering a ‘top’ school?
Regression Discontinuity Does voting increase the likelihood of voting in the next election? Turnout rate in 2004 election Just eligible to vote in 2000 election Just NOT eligible to vote in 2000 election
Regression Discontinuity Does being a prisoner in a maximum security prison increase the likelihood of prisoner misconduct?
Panel Data Econometrics Which site activities are predictive of future engagement?
Panel Data Econometrics Do voters experience ‘fatigue’ from long ballots?
Instrumental Variables We believe that LinkedIn helps people find better professional opportunities. Can the weather help us establish causation?
What Do Economists Think About the Most?
If you care about prediction, think like a computer scientist. If you care about causality, think like an economist.
Sources Blattman, Christoper; Jeannie Annan. 2010. The Consequences of Child Soldiering. The Review of Economics and Statistics, November 2010, 92(4): 882–898 Meredith, Marc. 2009. Persistence in Political Participation. Quarterly Journal of Political Science 4(3): 186-208 Richard A. Berk; Jan de Leeuw. 1999. An Evaluation of California's Inmate Classification System Using a Generalized Regression Discontinuity Design. Journal of the American Statistical Association, Vol. 94, No. 448. (Dec., 1999), pp. 1045-1052 Augenblick, Ned; Scott Nicholson. 2011. Ballot Position, Choice Fatigue, and Voter Behavior. Submitted, under review. http://faculty.haas.berkeley.edu/ned/Choice_Fatigue.pdf Photo credit (cats): Eric Cheng / Lytro
We’re hiring! snicholson@linkedin.com
Thank You! Scott Nicholson – @scootrous

More Related Content

Similar to Data Science from the Perspective of an Applied Economist

Difference in gender attitude in investment decision making in india
Difference in gender attitude in investment decision making in indiaDifference in gender attitude in investment decision making in india
Difference in gender attitude in investment decision making in indiaAlexander Decker
 
11.difference in gender attitude in investment decision making in india
11.difference in gender attitude in investment decision making in india11.difference in gender attitude in investment decision making in india
11.difference in gender attitude in investment decision making in indiaAlexander Decker
 
Machine Learning Crash Course 2017 - Genova - DIBRIS - IIT - MIT
Machine Learning Crash Course 2017 - Genova - DIBRIS - IIT - MITMachine Learning Crash Course 2017 - Genova - DIBRIS - IIT - MIT
Machine Learning Crash Course 2017 - Genova - DIBRIS - IIT - MITPietro Leo
 
Discussion # 1 Due Weds 081921Wk 1 Discussion 1 - Statistics [
Discussion # 1 Due Weds 081921Wk 1 Discussion 1 - Statistics [Discussion # 1 Due Weds 081921Wk 1 Discussion 1 - Statistics [
Discussion # 1 Due Weds 081921Wk 1 Discussion 1 - Statistics [AlyciaGold776
 
Contemporary Political Statement
Contemporary Political StatementContemporary Political Statement
Contemporary Political StatementGina Alfaro
 
AAPOR 2012 Langer Probability
AAPOR 2012 Langer ProbabilityAAPOR 2012 Langer Probability
AAPOR 2012 Langer ProbabilityLangerResearch
 
Essay On Cricket Match For 9Th Class In Hindi
Essay On Cricket Match For 9Th Class In HindiEssay On Cricket Match For 9Th Class In Hindi
Essay On Cricket Match For 9Th Class In HindiNatasha Smith
 
Essay On The World Has Become A Global Village
Essay On The World Has Become A Global VillageEssay On The World Has Become A Global Village
Essay On The World Has Become A Global VillageMegan Sanchez
 
Data: Past, Present, and Future (Lecture 1, Spring 2018)
Data: Past, Present, and Future (Lecture 1, Spring 2018)Data: Past, Present, and Future (Lecture 1, Spring 2018)
Data: Past, Present, and Future (Lecture 1, Spring 2018)chris wiggins
 
mHealth Israel conference_Professor Erez Shmueli_MIT Media Lab_social physics...
mHealth Israel conference_Professor Erez Shmueli_MIT Media Lab_social physics...mHealth Israel conference_Professor Erez Shmueli_MIT Media Lab_social physics...
mHealth Israel conference_Professor Erez Shmueli_MIT Media Lab_social physics...Levi Shapiro
 
Running head IDENTITY THEFT1IDENTITY THEFT 4Identit.docx
Running head IDENTITY THEFT1IDENTITY THEFT 4Identit.docxRunning head IDENTITY THEFT1IDENTITY THEFT 4Identit.docx
Running head IDENTITY THEFT1IDENTITY THEFT 4Identit.docxwlynn1
 
Lab Write-Up Rubric Lab Write-Ups are worth 10 points tota.docx
Lab Write-Up Rubric Lab Write-Ups are worth 10 points tota.docxLab Write-Up Rubric Lab Write-Ups are worth 10 points tota.docx
Lab Write-Up Rubric Lab Write-Ups are worth 10 points tota.docxsmile790243
 
Modeling Social Data, Lecture 2: Introduction to Counting
Modeling Social Data, Lecture 2: Introduction to CountingModeling Social Data, Lecture 2: Introduction to Counting
Modeling Social Data, Lecture 2: Introduction to Countingjakehofman
 
Human Sciences for ToK
Human Sciences for ToKHuman Sciences for ToK
Human Sciences for ToKplangdale
 
Why UBI is Necessary to Restore Trust and Save Democracy
Why UBI is Necessary to Restore Trust and Save DemocracyWhy UBI is Necessary to Restore Trust and Save Democracy
Why UBI is Necessary to Restore Trust and Save DemocracyScott Santens
 

Similar to Data Science from the Perspective of an Applied Economist (20)

Difference in gender attitude in investment decision making in india
Difference in gender attitude in investment decision making in indiaDifference in gender attitude in investment decision making in india
Difference in gender attitude in investment decision making in india
 
11.difference in gender attitude in investment decision making in india
11.difference in gender attitude in investment decision making in india11.difference in gender attitude in investment decision making in india
11.difference in gender attitude in investment decision making in india
 
Machine Learning Crash Course 2017 - Genova - DIBRIS - IIT - MIT
Machine Learning Crash Course 2017 - Genova - DIBRIS - IIT - MITMachine Learning Crash Course 2017 - Genova - DIBRIS - IIT - MIT
Machine Learning Crash Course 2017 - Genova - DIBRIS - IIT - MIT
 
Discussion # 1 Due Weds 081921Wk 1 Discussion 1 - Statistics [
Discussion # 1 Due Weds 081921Wk 1 Discussion 1 - Statistics [Discussion # 1 Due Weds 081921Wk 1 Discussion 1 - Statistics [
Discussion # 1 Due Weds 081921Wk 1 Discussion 1 - Statistics [
 
Statistics Exericse 29
Statistics Exericse 29Statistics Exericse 29
Statistics Exericse 29
 
Sais.34.1
Sais.34.1Sais.34.1
Sais.34.1
 
Essay Pharmacy
Essay PharmacyEssay Pharmacy
Essay Pharmacy
 
Contemporary Political Statement
Contemporary Political StatementContemporary Political Statement
Contemporary Political Statement
 
AAPOR 2012 Langer Probability
AAPOR 2012 Langer ProbabilityAAPOR 2012 Langer Probability
AAPOR 2012 Langer Probability
 
Essay On Cricket Match For 9Th Class In Hindi
Essay On Cricket Match For 9Th Class In HindiEssay On Cricket Match For 9Th Class In Hindi
Essay On Cricket Match For 9Th Class In Hindi
 
Essay On The World Has Become A Global Village
Essay On The World Has Become A Global VillageEssay On The World Has Become A Global Village
Essay On The World Has Become A Global Village
 
Data: Past, Present, and Future (Lecture 1, Spring 2018)
Data: Past, Present, and Future (Lecture 1, Spring 2018)Data: Past, Present, and Future (Lecture 1, Spring 2018)
Data: Past, Present, and Future (Lecture 1, Spring 2018)
 
mHealth Israel conference_Professor Erez Shmueli_MIT Media Lab_social physics...
mHealth Israel conference_Professor Erez Shmueli_MIT Media Lab_social physics...mHealth Israel conference_Professor Erez Shmueli_MIT Media Lab_social physics...
mHealth Israel conference_Professor Erez Shmueli_MIT Media Lab_social physics...
 
Running head IDENTITY THEFT1IDENTITY THEFT 4Identit.docx
Running head IDENTITY THEFT1IDENTITY THEFT 4Identit.docxRunning head IDENTITY THEFT1IDENTITY THEFT 4Identit.docx
Running head IDENTITY THEFT1IDENTITY THEFT 4Identit.docx
 
Lab Write-Up Rubric Lab Write-Ups are worth 10 points tota.docx
Lab Write-Up Rubric Lab Write-Ups are worth 10 points tota.docxLab Write-Up Rubric Lab Write-Ups are worth 10 points tota.docx
Lab Write-Up Rubric Lab Write-Ups are worth 10 points tota.docx
 
Ijsrp p10682
Ijsrp p10682Ijsrp p10682
Ijsrp p10682
 
Modeling Social Data, Lecture 2: Introduction to Counting
Modeling Social Data, Lecture 2: Introduction to CountingModeling Social Data, Lecture 2: Introduction to Counting
Modeling Social Data, Lecture 2: Introduction to Counting
 
Andre blais 1
Andre blais 1Andre blais 1
Andre blais 1
 
Human Sciences for ToK
Human Sciences for ToKHuman Sciences for ToK
Human Sciences for ToK
 
Why UBI is Necessary to Restore Trust and Save Democracy
Why UBI is Necessary to Restore Trust and Save DemocracyWhy UBI is Necessary to Restore Trust and Save Democracy
Why UBI is Necessary to Restore Trust and Save Democracy
 

Recently uploaded

"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 

Recently uploaded (20)

"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 

Data Science from the Perspective of an Applied Economist

  • 1. Data Science from the Perspective of an Applied Economist Scott Nicholson – @scootrous
  • 2. This Talk A 30 minute Applied Economics PhD Will make you a better data scientist Exhibits the value-add of econometrician on a data science team
  • 3. Recent Research by Economists Why Do Mothers Breastfeed Girls Less than Boys? Evidence and Implications for Child Health in India Family Violence and Football: The Effect of Unexpected Emotional Cues on Violent Behavior Does Terrorism Work? Racial Discrimination Among NBA Referees The Effects of Lottery Prizes on Winners and Their Neighbors: Evidence from the Dutch Postcode Lottery
  • 4. What Makes an Applied Economist? Intuition Methods Curiosity about human decision-making Attention to underlying mechanisms
  • 5. If you care about prediction, think like a computer scientist. If you care about causality, think like an economist.
  • 6. Gradations of Identifying Causal Relationships Randomized controlled experiments Natural experiments Regression discontinuity Panel data econometrics Instrumental variables
  • 8. Natural Experiment How does having been a child soldier in Uganda affect lifetime earnings and likelihood of voting?
  • 9. Natural Experiment How does a 100 point decrease in SAT score affect likelihood of entering a ‘top’ school?
  • 10. Regression Discontinuity Does voting increase the likelihood of voting in the next election? Turnout rate in 2004 election Just eligible to vote in 2000 election Just NOT eligible to vote in 2000 election
  • 11. Regression Discontinuity Does being a prisoner in a maximum security prison increase the likelihood of prisoner misconduct?
  • 12. Panel Data Econometrics Which site activities are predictive of future engagement?
  • 13. Panel Data Econometrics Do voters experience ‘fatigue’ from long ballots?
  • 14. Instrumental Variables We believe that LinkedIn helps people find better professional opportunities. Can the weather help us establish causation?
  • 15. What Do Economists Think About the Most?
  • 16. If you care about prediction, think like a computer scientist. If you care about causality, think like an economist.
  • 17. Sources Blattman, Christoper; Jeannie Annan. 2010. The Consequences of Child Soldiering. The Review of Economics and Statistics, November 2010, 92(4): 882–898 Meredith, Marc. 2009. Persistence in Political Participation. Quarterly Journal of Political Science 4(3): 186-208 Richard A. Berk; Jan de Leeuw. 1999. An Evaluation of California's Inmate Classification System Using a Generalized Regression Discontinuity Design. Journal of the American Statistical Association, Vol. 94, No. 448. (Dec., 1999), pp. 1045-1052 Augenblick, Ned; Scott Nicholson. 2011. Ballot Position, Choice Fatigue, and Voter Behavior. Submitted, under review. http://faculty.haas.berkeley.edu/ned/Choice_Fatigue.pdf Photo credit (cats): Eric Cheng / Lytro
  • 19. Thank You! Scott Nicholson – @scootrous

Editor's Notes

  1. what i want to do...applied econ phd in less than 30 minuteswhat i'm going to talk about is a set of intuition and methodoligies that economists use to answer a certain set of questionsand in the process make you a better data scientist AND understand the contributions economists can make to DS teamsthe type of questions that we're going to talk about is teasing causation from correlationthe typical toolkit of data scientists of machine learning algorithms or fitting statistical models is insufficient for identifying causality from observational dataTypically we use A/B tests to send the right email, find the best UX, make the most $, but what if we can’t run an A/B test?if you can't run an A/B test, what are the options availble to you to get causation out of data?My perspective…about me
  2. Economists are interested in a wide variety of topics where data can inform us of the world through better understanding incentives and individuals’ decision making processes.For applied economists doing these kinds of research, what is in their toolkit?
  3. If you want to predict whether or not someone will vote or what a child’s score on a standardized test will be, think like a CS.To find causal effects of how changes to one variable affect another variable, think like an economist.You need to look for random variations in the data that allow you to identify causal effects, not just the prediction of what school a student will end up in.
  4. Spectrum…Decreasing in confidence of gaining causality
  5. This technique needs no explanation. We are all familiar with controlled experiments either in the lab, an email or a UX on the web. This is the gold standard when you have the ability/time/resources to construct the experiment. What if you only have observational data?What if you only have data from the past and need to disentangle causality from correlation?What if the experiment you want to run is not feasible or unethical?Example: examining the effects of pre-kindergarten classes on student achievement.
  6. Natural experiment: treatment groups were assigned without researcher interventionAnother method for disentangling causality from correlation is to exploit natural variation in the data.Look for random sources of variation that are correlated with the outcome variable but uncorrelated with the explanatory variable (feature)What is the value of an extra 100 points on the SAT? We can follow outcomes of these students to find out.Email outageVoter fatigueServer outages, search results
  7. Regression discontinuity: assignment to treatment/control determined by a threshold that is exogenously decided by external factorsQuestion: How much does voting in one election affect your likelihood of voting in the next election?Problem:Also correlated with age. Older people exhibit higher turnout.Selection issues for why people choose to voteVoting rights are in the constitution! Can’t randomly vary them.What if you turned 18 on the last day eligible voters were able to register for a presidential election. Let’s say 2008 where Obama really inspired a lot of young people. What if your friend turned 18 the day AFTER the final registration date. You were able to vote and your friend wasn’t. Turns out you are 1) more likely to vote in subsequent elections and 2) more likely to have the same party affiliation as who you voted for in that previous election.
  8. QUESTIONDoes being assigned to a high-security prison make a prison more likely to engage in misconduct?PROBLEMMore dangerous prisoners tend to be assigned to higher-security prisonsSOLUTIONClassification score…similarly-dangerous prisoners, but sent to prisons of different security levelsIMPLEMENTInteract classification score with cutoff
  9. Panel data: Following observations over time allows us to control for subject-specific (unobservable) effects Going further away from the gold standard of A/B testing and moving closer to establishing predictive power
  10. The next level of gradations…QUESTIONDo voters tire and not vote on some contests as they move down the ballot?PROBLEMInfeasible to run a RCEContests less salient as you move down the ballotSome precincts may be more likely to just not vote SOLUTIONPanel data: Following observations over time allows us to control for subject-specific (unobservable) effects Plus: natural experiment allows us to observe a contest at different positions on the ballotThis one is actually a combination of panel data & natural experimentVoter fatigue confounded with lower information contests appearing further down the ballotSolutionFor the same state proposition, we observe variation in ballot position across voters in different precincts due to different sets of local offices on ballot. Controlling for some other stuff, we can estimate the causal effect from voter fatigue from moving a contest 1 position further down the ballot.MethodologyFixed and randomeffects estimators
  11. Instrumental variables: For your predictor that is correlated with a confounding factor, find an “instrument” that is correlated with your predictor and dependent variable but not the confounding variableDisentangling causation from correlation really means that we need to deal with the confounding factor that is correlated with both our outcome variable and our explanatory variable. Finding an instrument means to find a variable that is correlated with the explanatory variable
  12. At this slide, wrap it all up. Economists bring a specialized skill set to the table, think about causality before all else. Some skills gap but