SlideShare a Scribd company logo
1 of 17
© 2016 The MITRE Corporation. All rights reserved.
Chr i s t ophe r Te i x e i r a
Le a d Da t a Sc i e nt is t, MI TRE
Life of a Data Scientist
Tw i t te r: @ CT_ An a l yt i c s
Li nk e dI n: h t t p : //l nke d .i n/CTe i xe i r a
The author's affiliation with The MITRE Corporation is provided for identification purposes only, and
is not intended to convey or imply MITRE's concurrence with, or support for, the positions, opinions
or viewpoints expressed by the author.
| 2 |
© 2016 The MITRE Corporation. All rights reserved.
Math + Baseball = Future Red Sox General Manager…
 Mathematics BS from Worcester Polytechnic Institute
– Concentration in Applied Statistics
– Capstone Project: Statistical Analysis of Defensive
Production in Major League Baseball
 Operations Research MS from George Mason University
– Concentration in Decision Analysis
– Capstone Project: Team Organization for Senior Softball
Approved for Public Release; Distribution Unlimited. Case Number 16-2345
| 3 |
© 2016 The MITRE Corporation. All rights reserved.
How many industries use data science?
 Operations Research Analyst at SAIC
– NASA: Planning Missions to the Moon
– DHS: Securing our Public Transit Systems
 Advanced Analytics Senior Consultant at IBM
– AETNA: Regionalized Fraud Analysis
– DOD: Defeating IEDs
 Senior Analytic Consultant at Epsilon
– SunTrust: Real-time marketing
– Bank of America: Understanding customer sentiment
Approved for Public Release; Distribution Unlimited. Case Number 16-2345
| 4 |
© 2016 The MITRE Corporation. All rights reserved.
Lead Data Scientist at MITRE
 Using my skills from both school and
previous jobs
– Discrete Event Simulation, System Dynamics,
Agent Based Modeling
– Data Analysis and Processing
– Statistical Analysis and
Systems Engineering Techniques
– Data Visualization
 Problems I help solve:
– How can we use math and statistics to better
serve Veterans waiting for benefits?
– Can simulations help us to plan how clean up
America’s nuclear waste more safely,
effectively, and within a reasonable cost?
– How can we make use of data and models to
run our own organization more effectively?
– Can predictive analytics be used to help
increase child welfare? Chris Teixeira at the White House for the White House Foster
Care & Technology Hackathon, May 28, 2016
Approved for Public Release; Distribution Unlimited. Case Number 16-2345
| 5 |
© 2016 The MITRE Corporation. All rights reserved.
Typical Day in the Life of a Data Scientist
Data Analysis
20%
Cleaning Data
40%
Statistical or
Machine Learning
Model Building
25%
Creating
Visualization
10%
Presenting Analysis
5%
Typical Day's Activities
| 6 |
© 2016 The MITRE Corporation. All rights reserved.
Typical Day in the Life of a Lead Data Scientist
Data Analysis
20%
Cleaning Data
10%
Statistical or Machine
Learning Model
Building
25%
Creating
Visualization
10%
Presenting Analysis
35%
Typical Day's Activities
| 7 |
© 2016 The MITRE Corporation. All rights reserved.
Evaluating Children at Risk
Source: https://www.behance.net/gallery/3751117/Stop-Child-Abuse
Approved for Public Release; Distribution Unlimited. Case Number 16-2345
| 8 |
© 2016 The MITRE Corporation. All rights reserved.
National Public-Private Partnership to
Eliminate Abuse and Neglect Fatalities
ABUSE AND
NEGLECT
FATALITIES
FFRDC-operated,
trusted analytic
environment
PROTECT
OUR KIDS
ACT OF 2012
and
CECANF Objectives
Identify
Analyze
Control
Manage
Predictive Risk Modeling, Data
Visualization Tools, Monitoring
and Reporting
Integrated Stakeholder
and Child Welfare Data
Approved for Public Release; Distribution Unlimited. Case Number 16-2345
© 2016 The MITRE Corporation. All rights reserved.For Internal MITRE Use.
| 9 |
Not t he s oc c e r c ompa ny…
But t he be s t pl a c e t o w or k you ha ve pr oba bl y ne ve r he a r d of !
The MITRE Corporation
Approved for Public Release; Distribution Unlimited. Case Number 16-2345
| 10 |
© 2016 The MITRE Corporation. All rights reserved.
Established to Serve the Public Interest
Approved for Public Release; Distribution Unlimited. Case Number 16-2345
| 11 |
© 2016 The MITRE Corporation. All rights reserved.
Today We Operate Seven FFRDCs
Approved for Public Release; Distribution Unlimited. Case Number 16-2345
| 12 |
© 2016 The MITRE Corporation. All rights reserved.
Understanding FFRDCs
Approved for Public Release; Distribution Unlimited. Case Number 16-2345
| 13 |
© 2016 The MITRE Corporation. All rights reserved.
Our Employees
Approved for Public Release; Distribution Unlimited. Case Number 16-2345
© 2016 The MITRE Corporation. All rights reserved.For Internal MITRE Use.
| 14 |
Advice on Getting into
Data Science at MITRE
| 15 |
© 2016 The MITRE Corporation. All rights reserved.
Necessary Skills for a Data Scientist
 Background in at least one of the following fields:
– Statistics
– Mathematics
– Operations Research
– Computer Science
 Other skills:
– Excellent written and verbal communication skills
– Demonstrated ability to manipulate large datasets with at least one modern programming
language (e.g., Python, SAS, MATLAB, C++, R, Java)
– Experience leveraging COTS tools or writing programs to visualize multi-dimensional data
– Ability to apply, modify and formulate algorithms and processes to solve challenging
problems
– Prior experience working with databases (e.g., Oracle, MySQL, MongoDB)
Junior Data Scientist (27721BR)
www.mitre.org/careers
| 16 |
© 2016 The MITRE Corporation. All rights reserved.
Advice on Getting a Data Science position
Don’t be afraid to be a bit geeky!
Start a Github account
Learn something new and show it off
–A new language (e.g. JavaScript / R / Python)
–Analyze your own data (e.g. Fitbit or Apple Health)
Compete on a kaggle team
Volunteer at DataKind
© 2016 The MITRE Corporation. All rights reserved.For Internal MITRE Use.
| 17 |
Questions?

More Related Content

What's hot

Career in Data Science
Career in Data ScienceCareer in Data Science
Career in Data ScienceActonRoy
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data sciencebhavesh lande
 
Data Con LA 2020 Keynote - Bryan Kirschner
Data Con LA 2020 Keynote - Bryan KirschnerData Con LA 2020 Keynote - Bryan Kirschner
Data Con LA 2020 Keynote - Bryan KirschnerData Con LA
 
Saket Saurabh at AI Frontiers: Data Operations or: How I Learned to Stop Data...
Saket Saurabh at AI Frontiers: Data Operations or: How I Learned to Stop Data...Saket Saurabh at AI Frontiers: Data Operations or: How I Learned to Stop Data...
Saket Saurabh at AI Frontiers: Data Operations or: How I Learned to Stop Data...AI Frontiers
 
Data analytics ppt
Data analytics pptData analytics ppt
Data analytics ppteveilleux18
 
Getting Ready For 3rd Generation Platform
Getting Ready For 3rd Generation PlatformGetting Ready For 3rd Generation Platform
Getting Ready For 3rd Generation PlatformData Science Thailand
 
Analytics, Big Data and The Cloud II Conference - Kiribatu Labs
Analytics, Big Data and The Cloud II Conference - Kiribatu LabsAnalytics, Big Data and The Cloud II Conference - Kiribatu Labs
Analytics, Big Data and The Cloud II Conference - Kiribatu LabsPawel Brzeminski
 
NUS-ISS Learning Day 2018- Sentiment analysis in finance
NUS-ISS Learning Day 2018- Sentiment analysis in financeNUS-ISS Learning Day 2018- Sentiment analysis in finance
NUS-ISS Learning Day 2018- Sentiment analysis in financeNUS-ISS
 
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...Edureka!
 
big data analytics pgpmx2015
big data analytics pgpmx2015big data analytics pgpmx2015
big data analytics pgpmx2015Sanmeet Dhokay
 
Data Science Courses - BigData VS Data Science
Data Science Courses - BigData VS Data ScienceData Science Courses - BigData VS Data Science
Data Science Courses - BigData VS Data ScienceDataMites
 
Societal Impact of Applied Data Science on the Big Data Stack
Societal Impact of Applied Data Science on the Big Data StackSocietal Impact of Applied Data Science on the Big Data Stack
Societal Impact of Applied Data Science on the Big Data StackStealth Project
 
Data science & data scientist
Data science & data scientistData science & data scientist
Data science & data scientistVijayMohan Vasu
 
Big data analysis the new big thing
Big data analysis  the new big thing Big data analysis  the new big thing
Big data analysis the new big thing Alexis Martinez
 
Data Science Popup Austin: Data Meet Product
Data Science Popup Austin: Data Meet Product Data Science Popup Austin: Data Meet Product
Data Science Popup Austin: Data Meet Product Domino Data Lab
 
Synthetic data generation for machine learning
Synthetic data generation for machine learningSynthetic data generation for machine learning
Synthetic data generation for machine learningQuantUniversity
 

What's hot (20)

Career in Data Science
Career in Data ScienceCareer in Data Science
Career in Data Science
 
Data Science
Data ScienceData Science
Data Science
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data science
 
Data Con LA 2020 Keynote - Bryan Kirschner
Data Con LA 2020 Keynote - Bryan KirschnerData Con LA 2020 Keynote - Bryan Kirschner
Data Con LA 2020 Keynote - Bryan Kirschner
 
Saket Saurabh at AI Frontiers: Data Operations or: How I Learned to Stop Data...
Saket Saurabh at AI Frontiers: Data Operations or: How I Learned to Stop Data...Saket Saurabh at AI Frontiers: Data Operations or: How I Learned to Stop Data...
Saket Saurabh at AI Frontiers: Data Operations or: How I Learned to Stop Data...
 
Equity Valuation (AI Augmented Diligence using BayesDB)
Equity Valuation (AI Augmented  Diligence using BayesDB)Equity Valuation (AI Augmented  Diligence using BayesDB)
Equity Valuation (AI Augmented Diligence using BayesDB)
 
What is Data?
What is Data?What is Data?
What is Data?
 
Data analytics ppt
Data analytics pptData analytics ppt
Data analytics ppt
 
Getting Ready For 3rd Generation Platform
Getting Ready For 3rd Generation PlatformGetting Ready For 3rd Generation Platform
Getting Ready For 3rd Generation Platform
 
Analytics, Big Data and The Cloud II Conference - Kiribatu Labs
Analytics, Big Data and The Cloud II Conference - Kiribatu LabsAnalytics, Big Data and The Cloud II Conference - Kiribatu Labs
Analytics, Big Data and The Cloud II Conference - Kiribatu Labs
 
NUS-ISS Learning Day 2018- Sentiment analysis in finance
NUS-ISS Learning Day 2018- Sentiment analysis in financeNUS-ISS Learning Day 2018- Sentiment analysis in finance
NUS-ISS Learning Day 2018- Sentiment analysis in finance
 
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
Python for Data Science | Python Data Science Tutorial | Data Science Certifi...
 
big data analytics pgpmx2015
big data analytics pgpmx2015big data analytics pgpmx2015
big data analytics pgpmx2015
 
Data Science Courses - BigData VS Data Science
Data Science Courses - BigData VS Data ScienceData Science Courses - BigData VS Data Science
Data Science Courses - BigData VS Data Science
 
Societal Impact of Applied Data Science on the Big Data Stack
Societal Impact of Applied Data Science on the Big Data StackSocietal Impact of Applied Data Science on the Big Data Stack
Societal Impact of Applied Data Science on the Big Data Stack
 
Data science & data scientist
Data science & data scientistData science & data scientist
Data science & data scientist
 
Big data analysis the new big thing
Big data analysis  the new big thing Big data analysis  the new big thing
Big data analysis the new big thing
 
Data Science Popup Austin: Data Meet Product
Data Science Popup Austin: Data Meet Product Data Science Popup Austin: Data Meet Product
Data Science Popup Austin: Data Meet Product
 
Synthetic data generation for machine learning
Synthetic data generation for machine learningSynthetic data generation for machine learning
Synthetic data generation for machine learning
 
Data science Big Data
Data science Big DataData science Big Data
Data science Big Data
 

Similar to WIT Career Lecture Series - CTeixeira Data Scientist

Rady School Master of Science Business Analytics (MSBA) Program Overview
Rady School Master of Science Business Analytics (MSBA) Program OverviewRady School Master of Science Business Analytics (MSBA) Program Overview
Rady School Master of Science Business Analytics (MSBA) Program OverviewUC San Diego Rady School of Management
 
Minne analytics presentation 2018 12 03 final compressed
Minne analytics presentation 2018 12 03 final   compressedMinne analytics presentation 2018 12 03 final   compressed
Minne analytics presentation 2018 12 03 final compressedBonnie Holub
 
Modernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data StrategyModernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data StrategyCloudera, Inc.
 
Minne analytics presentation 2018 12 03 final compressed
Minne analytics presentation 2018 12 03 final   compressedMinne analytics presentation 2018 12 03 final   compressed
Minne analytics presentation 2018 12 03 final compressedBonnie Holub
 
L3 Big Data and Application.pptx
L3  Big Data and Application.pptxL3  Big Data and Application.pptx
L3 Big Data and Application.pptxShambhavi Vats
 
Impact of big data on analytics
Impact of big data on analyticsImpact of big data on analytics
Impact of big data on analyticsCapgemini
 
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...IJSCAI Journal
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...gerogepatton
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...gerogepatton
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...gerogepatton
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...ijscai
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...gerogepatton
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...ijscai
 
[MU630] 004. Business Intelligence & Decision Support
[MU630] 004. Business Intelligence & Decision Support[MU630] 004. Business Intelligence & Decision Support
[MU630] 004. Business Intelligence & Decision SupportAriantoMuditomo
 
Analytics trends 2016 the next evolution
Analytics trends 2016 the next evolutionAnalytics trends 2016 the next evolution
Analytics trends 2016 the next evolutionYann Lecourt
 
Analytics Trends 2016: The next evolution
Analytics Trends 2016: The next evolutionAnalytics Trends 2016: The next evolution
Analytics Trends 2016: The next evolutionDeloitte United States
 
Digital Strategy Lecture 3
Digital Strategy Lecture 3Digital Strategy Lecture 3
Digital Strategy Lecture 3Neil Kelley
 
Cool vs Creepy - Ethics and Data Science - Cooper 2Feb
Cool vs Creepy - Ethics and Data Science - Cooper 2FebCool vs Creepy - Ethics and Data Science - Cooper 2Feb
Cool vs Creepy - Ethics and Data Science - Cooper 2FebCathy Cooper
 
Breakout 3. AI for Sustainable Development and Human Rights: Inclusion, Diver...
Breakout 3. AI for Sustainable Development and Human Rights: Inclusion, Diver...Breakout 3. AI for Sustainable Development and Human Rights: Inclusion, Diver...
Breakout 3. AI for Sustainable Development and Human Rights: Inclusion, Diver...Saurabh Mishra
 

Similar to WIT Career Lecture Series - CTeixeira Data Scientist (20)

Rady School Master of Science Business Analytics (MSBA) Program Overview
Rady School Master of Science Business Analytics (MSBA) Program OverviewRady School Master of Science Business Analytics (MSBA) Program Overview
Rady School Master of Science Business Analytics (MSBA) Program Overview
 
Minne analytics presentation 2018 12 03 final compressed
Minne analytics presentation 2018 12 03 final   compressedMinne analytics presentation 2018 12 03 final   compressed
Minne analytics presentation 2018 12 03 final compressed
 
Modernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data StrategyModernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data Strategy
 
Minne analytics presentation 2018 12 03 final compressed
Minne analytics presentation 2018 12 03 final   compressedMinne analytics presentation 2018 12 03 final   compressed
Minne analytics presentation 2018 12 03 final compressed
 
L3 Big Data and Application.pptx
L3  Big Data and Application.pptxL3  Big Data and Application.pptx
L3 Big Data and Application.pptx
 
Impact of big data on analytics
Impact of big data on analyticsImpact of big data on analytics
Impact of big data on analytics
 
Data Mining With Big Data
Data Mining With Big DataData Mining With Big Data
Data Mining With Big Data
 
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
 
[MU630] 004. Business Intelligence & Decision Support
[MU630] 004. Business Intelligence & Decision Support[MU630] 004. Business Intelligence & Decision Support
[MU630] 004. Business Intelligence & Decision Support
 
Analytics trends 2016 the next evolution
Analytics trends 2016 the next evolutionAnalytics trends 2016 the next evolution
Analytics trends 2016 the next evolution
 
Analytics Trends 2016: The next evolution
Analytics Trends 2016: The next evolutionAnalytics Trends 2016: The next evolution
Analytics Trends 2016: The next evolution
 
Digital Strategy Lecture 3
Digital Strategy Lecture 3Digital Strategy Lecture 3
Digital Strategy Lecture 3
 
Cool vs Creepy - Ethics and Data Science - Cooper 2Feb
Cool vs Creepy - Ethics and Data Science - Cooper 2FebCool vs Creepy - Ethics and Data Science - Cooper 2Feb
Cool vs Creepy - Ethics and Data Science - Cooper 2Feb
 
Breakout 3. AI for Sustainable Development and Human Rights: Inclusion, Diver...
Breakout 3. AI for Sustainable Development and Human Rights: Inclusion, Diver...Breakout 3. AI for Sustainable Development and Human Rights: Inclusion, Diver...
Breakout 3. AI for Sustainable Development and Human Rights: Inclusion, Diver...
 

WIT Career Lecture Series - CTeixeira Data Scientist

  • 1. © 2016 The MITRE Corporation. All rights reserved. Chr i s t ophe r Te i x e i r a Le a d Da t a Sc i e nt is t, MI TRE Life of a Data Scientist Tw i t te r: @ CT_ An a l yt i c s Li nk e dI n: h t t p : //l nke d .i n/CTe i xe i r a The author's affiliation with The MITRE Corporation is provided for identification purposes only, and is not intended to convey or imply MITRE's concurrence with, or support for, the positions, opinions or viewpoints expressed by the author.
  • 2. | 2 | © 2016 The MITRE Corporation. All rights reserved. Math + Baseball = Future Red Sox General Manager…  Mathematics BS from Worcester Polytechnic Institute – Concentration in Applied Statistics – Capstone Project: Statistical Analysis of Defensive Production in Major League Baseball  Operations Research MS from George Mason University – Concentration in Decision Analysis – Capstone Project: Team Organization for Senior Softball Approved for Public Release; Distribution Unlimited. Case Number 16-2345
  • 3. | 3 | © 2016 The MITRE Corporation. All rights reserved. How many industries use data science?  Operations Research Analyst at SAIC – NASA: Planning Missions to the Moon – DHS: Securing our Public Transit Systems  Advanced Analytics Senior Consultant at IBM – AETNA: Regionalized Fraud Analysis – DOD: Defeating IEDs  Senior Analytic Consultant at Epsilon – SunTrust: Real-time marketing – Bank of America: Understanding customer sentiment Approved for Public Release; Distribution Unlimited. Case Number 16-2345
  • 4. | 4 | © 2016 The MITRE Corporation. All rights reserved. Lead Data Scientist at MITRE  Using my skills from both school and previous jobs – Discrete Event Simulation, System Dynamics, Agent Based Modeling – Data Analysis and Processing – Statistical Analysis and Systems Engineering Techniques – Data Visualization  Problems I help solve: – How can we use math and statistics to better serve Veterans waiting for benefits? – Can simulations help us to plan how clean up America’s nuclear waste more safely, effectively, and within a reasonable cost? – How can we make use of data and models to run our own organization more effectively? – Can predictive analytics be used to help increase child welfare? Chris Teixeira at the White House for the White House Foster Care & Technology Hackathon, May 28, 2016 Approved for Public Release; Distribution Unlimited. Case Number 16-2345
  • 5. | 5 | © 2016 The MITRE Corporation. All rights reserved. Typical Day in the Life of a Data Scientist Data Analysis 20% Cleaning Data 40% Statistical or Machine Learning Model Building 25% Creating Visualization 10% Presenting Analysis 5% Typical Day's Activities
  • 6. | 6 | © 2016 The MITRE Corporation. All rights reserved. Typical Day in the Life of a Lead Data Scientist Data Analysis 20% Cleaning Data 10% Statistical or Machine Learning Model Building 25% Creating Visualization 10% Presenting Analysis 35% Typical Day's Activities
  • 7. | 7 | © 2016 The MITRE Corporation. All rights reserved. Evaluating Children at Risk Source: https://www.behance.net/gallery/3751117/Stop-Child-Abuse Approved for Public Release; Distribution Unlimited. Case Number 16-2345
  • 8. | 8 | © 2016 The MITRE Corporation. All rights reserved. National Public-Private Partnership to Eliminate Abuse and Neglect Fatalities ABUSE AND NEGLECT FATALITIES FFRDC-operated, trusted analytic environment PROTECT OUR KIDS ACT OF 2012 and CECANF Objectives Identify Analyze Control Manage Predictive Risk Modeling, Data Visualization Tools, Monitoring and Reporting Integrated Stakeholder and Child Welfare Data Approved for Public Release; Distribution Unlimited. Case Number 16-2345
  • 9. © 2016 The MITRE Corporation. All rights reserved.For Internal MITRE Use. | 9 | Not t he s oc c e r c ompa ny… But t he be s t pl a c e t o w or k you ha ve pr oba bl y ne ve r he a r d of ! The MITRE Corporation Approved for Public Release; Distribution Unlimited. Case Number 16-2345
  • 10. | 10 | © 2016 The MITRE Corporation. All rights reserved. Established to Serve the Public Interest Approved for Public Release; Distribution Unlimited. Case Number 16-2345
  • 11. | 11 | © 2016 The MITRE Corporation. All rights reserved. Today We Operate Seven FFRDCs Approved for Public Release; Distribution Unlimited. Case Number 16-2345
  • 12. | 12 | © 2016 The MITRE Corporation. All rights reserved. Understanding FFRDCs Approved for Public Release; Distribution Unlimited. Case Number 16-2345
  • 13. | 13 | © 2016 The MITRE Corporation. All rights reserved. Our Employees Approved for Public Release; Distribution Unlimited. Case Number 16-2345
  • 14. © 2016 The MITRE Corporation. All rights reserved.For Internal MITRE Use. | 14 | Advice on Getting into Data Science at MITRE
  • 15. | 15 | © 2016 The MITRE Corporation. All rights reserved. Necessary Skills for a Data Scientist  Background in at least one of the following fields: – Statistics – Mathematics – Operations Research – Computer Science  Other skills: – Excellent written and verbal communication skills – Demonstrated ability to manipulate large datasets with at least one modern programming language (e.g., Python, SAS, MATLAB, C++, R, Java) – Experience leveraging COTS tools or writing programs to visualize multi-dimensional data – Ability to apply, modify and formulate algorithms and processes to solve challenging problems – Prior experience working with databases (e.g., Oracle, MySQL, MongoDB) Junior Data Scientist (27721BR) www.mitre.org/careers
  • 16. | 16 | © 2016 The MITRE Corporation. All rights reserved. Advice on Getting a Data Science position Don’t be afraid to be a bit geeky! Start a Github account Learn something new and show it off –A new language (e.g. JavaScript / R / Python) –Analyze your own data (e.g. Fitbit or Apple Health) Compete on a kaggle team Volunteer at DataKind
  • 17. © 2016 The MITRE Corporation. All rights reserved.For Internal MITRE Use. | 17 | Questions?

Editor's Notes

  1. Talking points: Here today to discuss how math can serve the public interest and support the federal government.
  2. Talking Points: Go to college to support getting your dream job, for me that’s becoming the GM for the Red Sox. Obviously, I didn’t do that, so what did I end up doing with a math degree? References: Spray charts from fangraphs.com
  3. Talking points: I became a “consultant”… Started with planning missions to the moon under the Constellation Program Then after 9/11, helped analyze and quantify ways to keep people safe as they ride public transit Graduated with my masters and wanted bigger challenges working directly with customers. After getting married, moved back to Boston and work on applying math to marketing problems outside of the classic approaches of determining who was likely to respond to promotions. But, there are other ways to use math and help out the public.
  4. Talking Points: So I took my talents to MITRE where I’ve had the opportunity to serve the public interest in many facets. From helping the Veterans Administration serve veterans better to helping the Department of Energy plan to treat nuclear waste. Even traveling to the White House to tackle Foster Care and ways to hack a technology solution together.
  5. Talking Points: First task is exploring a dataset I get to understand what I received. Next is manipulating it to a point where it’s useful. Building and tuning the model. Visualizing the data. Then communicating the results.
  6. Talking Points: First task is exploring a dataset I get to understand what I received. Next is manipulating it to a point where it’s useful. Building and tuning the model. Visualizing the data. Then communicating the results.
  7. Talking point: MITRE can step in using internal research budgets to figure out how we can step in to use math to tackle this problem. Using a combination of statistics, optimization and data visualization, we can use our unique position to bring together lots of data sources to identify systemic issues in child welfare. How can MITRE do this? Let me tell you a bit about what it means to operate FFRDCs
  8. MITRE is a not-for-profit company chartered in 1958 to work solely in the public interest. Among a number of companies established in the late 1940s and 1950s to bring public sector resources in science and technology to work for govt. in an independent, conflict-free environment. This led to the establishment of an ecosystem of federal research centers. RAND, the Lincoln Lab, Aerospace, and the National Labs are among the 40 federally funded research and development centers that exist today. At present, MITRE only operates federally funded research and development centers.
  9. Today we operate seven FFRDCs. This puts MITRE in a unique position to serve as a bridge among agencies, facilitating collaboration and sharing common solutions broadly. National Security Engineering Center NSEC helps the government make choices based on objective technical assessments, mission requirements, and budgetary constraints. We also transfer the prototypes or system improvements that our own staff develops either directly to our sponsors or to commercial companies for production. Center for Advanced Aviation System Development CAASD provides the FAA with advanced technical capabilities in systems engineering, mathematics, and computer science. We also apply in-depth domain knowledge in air traffic management and airspace user operations relevant to the National Airspace System (NAS) as well as international aviation. Center for Enterprise Modernization CEM, sponsored by the Internal Revenue Service and co-sponsored by the Department of Veterans Affairs, takes on the challenge of reshaping and modernizing the technology infrastructure as well as mission-critical business and management functions of civilian federal government agencies. Homeland Security Systems Engineering and Development Institute HS SEDI helps the Department of Homeland Security improve its performance in critical functions, such as acquisition processes, risk and program management, information technology engineering, and decision-making capabilities. Judiciary Engineering and Modernization Center JEMC provides objective assessments of the technical challenges the judiciary faces, analyzing the impact and risks of both available and emerging systems. CMS Alliance to Modernize Healthcare CAMH works across the health community on a range of business, policy, technology, and operational challenges. National Cybersecurity Center of Excellence The National Cybersecurity Center of Excellence is the first FFRDC solely dedicated to enhancing the security of the nation’s information systems.
  10. Federally funded research and development centers play an important role in working with government and industry to deliver game-changing solutions to complex challenges. FFRDCs must: Meet a "special long-term research and development need" that cannot be met by in-house staff or traditional contractor resources. "Operate in the public interest with objectivity and independence" and "be free from organizational conflicts of interest." Receive access to sensitive and proprietary data "beyond that which is common to the normal contractual relationship." This ensures that sponsoring organizations receive fully informed guidance that reflects an understanding of all critical points of view. Form channels of expertise from multiple sources to advance government missions. These characteristics enable FFRDCs to act as long-term strategic partners with the government in such areas as: Systems engineering and integration Research and development Study and analysis FFRDCs are an especially valuable resource when an agency confronts a challenge for which there is no obvious solution or a situation in which there are many viable solutions. In these cases, and independent analysis is required to determine which choice offers the agency the greatest benefits in terms of efficiency, effectiveness, and affordability.
  11.   Total number of employees – 7,300 ·         Average years of experience – 25 ·         Average tenure with MITRE – 12 ·         Percentage of employees with advanced degrees – 67% with masters degrees; 12% with doctorates  
  12. Taken from Model Based Analytics Job Req 27721BR
  13. Apple health logo: https://cdn.macworld.co.uk/cmsdata/features/3525196/Health-Icon_800_thumb800.jpg Fitbit logo: https://s2.q4cdn.com/857130097/files/images/Fitbit-logo-RGB.png Github logo: https://github.com/logos Kaggle logo: https://www.kaggle.com/contact DataKind logo: http://www.datakind.org/static/images/icons/DataKind_orange.png