SlideShare a Scribd company logo
1 of 10
Competitive Advantage. Elegantly Engineered.
A Different
Data Science Methodology
We use data, analytics, and design to help clients
perform at their best.
Machine Intelligence catalyzes innovation, engineers machine learning
applications, and builds enduring capabilities.
We’re creative, rigorous, and efficient. We bring the sophistication of a
large strategy firm with the speed and value of a focused boutique.
We apply proven techniques, designs, and world-class expertise to:
• Improve how companies engage customers
• Optimize machine performance
• Enhance process results
Models reproduce how questions are answered
in training data.
Business, not IT, should design training data.
Most project time is used understanding how
data is generated and building training data sets.
Machine Learning is Simple
Real
world
Training
Data Results
Generally a subset of
scenarios in the real
world.
Data trains models that
reproduce decisions in
the training data with
80-95% accuracy.
The full set of all
consumers, machines, or
business results that a
model will forecast.
A Different Data Science Methodology
Many data science projects jump into
algorithms and technology.
We reverse the usual approach by first
rigorously defining the business question
and understanding data.
The methodology:
• Aligns the whole business
• Sets practical expectations
• Leads change
• Builds sustaining capabilities
Data
Technology
Business question
Business
goals
Time
and
focus
Data
Technology
Steps
Foundation
• Align change across the business
• Understand data
• Define the business question
Results
• Sustain capabilities
• Communicate value
• Build application
Model
• Iterate production model
• Pilot models
• Build training data
1.
2.
3.
Project Phasing
• Most time is spent understanding data and building training data.
• An early pilot is key to refining to training data and building support for change.
• Developing the full application starts early with a UX for the pilot model.
1. Set Foundation
A. Define the business question
B. Align change
C. Understand data
• Learn and set expectations on the data science process and cloud hosting.
• Define precise business questions.
• Model how answering the business question delivers results.
• Link business and regulatory needs to training data design and algorithm selection, e.g. does a
model require easy explainability?
• Build a coalition of sponsors and communicate the vision.
• Define roles for compliance, customer service, finance, marketing, product, and sales.
• Understand the data generating process: genchi genbutsu.
• Visualize the “shape of the data”: distributions, sensitivity, clusters, anomalies, and
sparseness. Identify quality issues.
• Capture rules and map data flows from source systems.
2. Build Models
A. Build training data
B. Pilot models
C. Iterate production models
• Form business and IT team: roles, super-labelers, biases.
• Design the data set’s scenarios and set quality criteria.
• Visualize attributes and confirm with business sponsors.
• Define rules to pre-process data and select open source algorithms.
• Visualize and communicate results. Show an early win. Ideally, prototype the UX.
• Plan enhancements to training data, algos, and applications.
• Refine data (feature shaping and dimensionality reduction).
• Customize rules and algorithms.
• Connect into the broader application starting with the data model.
3. Deliver Results
A. Build application
B. Communicate value
C. Sustain capabilities
• Visualize UX, define data model and APIs.
• Set non-functional requirements such as scalability, latency, and security.
• Define test plan.
• Communicate how the solution makes jobs better and brings value to customers
• Build understanding and support with key influencers
• Use multiple channels (meetings, email, calls) repeatedly to ensure reaching people
• Optimize costs and scalability. Plan for decreased costs.
• Confirm team skills and capacity to evolve the models.
• Set plan for and automate re-training models. Set expectations that models may expand the
range of scenarios covered and/or may improve precision.
Contact
Machine Intelligence Partners LLC serves clients
globally. Our people are centered in Boston,
Bozeman, Grand Rapids, London, New York, San
Francisco, and Washington, D.C.
Client relationship leaders:
New York
Jeremy Lehman
917.225.2011
jeremy.lehman@machineintel.com
Washington, D.C.
Philippe Berckmans
804.405.6009
philippe.berckmans@machineintel.com
Machine Intelligence is an Amazon Technology Partner
and member of the Microsoft Partner Network.
We are a veteran-owned small business.

More Related Content

What's hot

MonetizingStatistics
MonetizingStatisticsMonetizingStatistics
MonetizingStatistics
Aaron Sankey
 

What's hot (15)

Managing uncertainty in ai performance target setting
Managing uncertainty in ai performance target settingManaging uncertainty in ai performance target setting
Managing uncertainty in ai performance target setting
 
Learn How to Make Machine Learning Work
Learn How to Make Machine Learning WorkLearn How to Make Machine Learning Work
Learn How to Make Machine Learning Work
 
Establish the right practices for Effective AI
Establish the right practices for Effective AIEstablish the right practices for Effective AI
Establish the right practices for Effective AI
 
Indhu resume
Indhu resumeIndhu resume
Indhu resume
 
MonetizingStatistics
MonetizingStatisticsMonetizingStatistics
MonetizingStatistics
 
Integrating A.I. and Machine Learning with your Demand Forecast
Integrating A.I. and Machine Learning with your Demand ForecastIntegrating A.I. and Machine Learning with your Demand Forecast
Integrating A.I. and Machine Learning with your Demand Forecast
 
Resume
ResumeResume
Resume
 
Sudheera_Profile
Sudheera_ProfileSudheera_Profile
Sudheera_Profile
 
Top 5 high demand jobs in data science
Top 5 high demand jobs in data scienceTop 5 high demand jobs in data science
Top 5 high demand jobs in data science
 
Business intelligence prof nikhat fatma mumtaz husain shaikh
Business intelligence  prof nikhat fatma mumtaz husain shaikhBusiness intelligence  prof nikhat fatma mumtaz husain shaikh
Business intelligence prof nikhat fatma mumtaz husain shaikh
 
ceresume
ceresumeceresume
ceresume
 
Experiment idea poster-p2
Experiment idea poster-p2Experiment idea poster-p2
Experiment idea poster-p2
 
This is AI doing – applying artificial intelligence to business problems by H...
This is AI doing – applying artificial intelligence to business problems by H...This is AI doing – applying artificial intelligence to business problems by H...
This is AI doing – applying artificial intelligence to business problems by H...
 
resume
resumeresume
resume
 
New patterns of innovation
New patterns of innovationNew patterns of innovation
New patterns of innovation
 

Similar to Machine intelligence data science methodology 060420

[DSC Europe 23] Josip Saban - Leading AI teams.pptx
[DSC Europe 23] Josip Saban - Leading AI teams.pptx[DSC Europe 23] Josip Saban - Leading AI teams.pptx
[DSC Europe 23] Josip Saban - Leading AI teams.pptx
DataScienceConferenc1
 

Similar to Machine intelligence data science methodology 060420 (20)

DS Life Cycle
DS Life CycleDS Life Cycle
DS Life Cycle
 
DS Life Cycle
DS Life CycleDS Life Cycle
DS Life Cycle
 
AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...
AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...
AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...
 
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjnWHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
 
Smart Data Module 4 d drive_business models
Smart Data Module 4 d drive_business modelsSmart Data Module 4 d drive_business models
Smart Data Module 4 d drive_business models
 
Doing Analytics Right - Designing and Automating Analytics
Doing Analytics Right - Designing and Automating AnalyticsDoing Analytics Right - Designing and Automating Analytics
Doing Analytics Right - Designing and Automating Analytics
 
Starter Kit for Collaboration from Karuana @ Microsoft IT
Starter Kit for Collaboration from Karuana @ Microsoft ITStarter Kit for Collaboration from Karuana @ Microsoft IT
Starter Kit for Collaboration from Karuana @ Microsoft IT
 
How to Build an AI/ML Product and Sell it by SalesChoice CPO
How to Build an AI/ML Product and Sell it by SalesChoice CPOHow to Build an AI/ML Product and Sell it by SalesChoice CPO
How to Build an AI/ML Product and Sell it by SalesChoice CPO
 
[DSC Europe 23] Josip Saban - Leading AI teams.pptx
[DSC Europe 23] Josip Saban - Leading AI teams.pptx[DSC Europe 23] Josip Saban - Leading AI teams.pptx
[DSC Europe 23] Josip Saban - Leading AI teams.pptx
 
Get your data analytics strategy right!
Get your data analytics strategy right!Get your data analytics strategy right!
Get your data analytics strategy right!
 
Machine Learning in Customer Analytics
Machine Learning in Customer AnalyticsMachine Learning in Customer Analytics
Machine Learning in Customer Analytics
 
Business Analytics Training Catalog - QueBIT Trusted Experts in Business Anal...
Business Analytics Training Catalog - QueBIT Trusted Experts in Business Anal...Business Analytics Training Catalog - QueBIT Trusted Experts in Business Anal...
Business Analytics Training Catalog - QueBIT Trusted Experts in Business Anal...
 
A Guide to Machine Learning Developer in 2024.pdf
A Guide to Machine Learning Developer in 2024.pdfA Guide to Machine Learning Developer in 2024.pdf
A Guide to Machine Learning Developer in 2024.pdf
 
how to successfully implement a data analytics solution.pdf
how to successfully implement a data analytics solution.pdfhow to successfully implement a data analytics solution.pdf
how to successfully implement a data analytics solution.pdf
 
Simplify Your Analytics Strategy
Simplify Your Analytics StrategySimplify Your Analytics Strategy
Simplify Your Analytics Strategy
 
Embedded Analytics
Embedded AnalyticsEmbedded Analytics
Embedded Analytics
 
Machine Learning: The First Salvo of the AI Business Revolution
Machine Learning: The First Salvo of the AI Business RevolutionMachine Learning: The First Salvo of the AI Business Revolution
Machine Learning: The First Salvo of the AI Business Revolution
 
Simplify your analytics strategy
Simplify your analytics strategySimplify your analytics strategy
Simplify your analytics strategy
 
Data Science Introduction by Emerging India Analytics
Data Science Introduction by Emerging India AnalyticsData Science Introduction by Emerging India Analytics
Data Science Introduction by Emerging India Analytics
 
Dhrub_Resume_New
Dhrub_Resume_NewDhrub_Resume_New
Dhrub_Resume_New
 

Recently uploaded

Displacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second DerivativesDisplacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second Derivatives
23050636
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
wsppdmt
 
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
pwgnohujw
 
Abortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
Abortion pills in Doha {{ QATAR }} +966572737505) Get CytotecAbortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
Abortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
mikehavy0
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Bertram Ludäscher
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
ahmedjiabur940
 
Abortion pills in Jeddah |+966572737505 | get cytotec
Abortion pills in Jeddah |+966572737505 | get cytotecAbortion pills in Jeddah |+966572737505 | get cytotec
Abortion pills in Jeddah |+966572737505 | get cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
sourabh vyas1222222222222222222244444444
sourabh vyas1222222222222222222244444444sourabh vyas1222222222222222222244444444
sourabh vyas1222222222222222222244444444
saurabvyas476
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
acoha1
 
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
acoha1
 
Simplify hybrid data integration at an enterprise scale. Integrate all your d...
Simplify hybrid data integration at an enterprise scale. Integrate all your d...Simplify hybrid data integration at an enterprise scale. Integrate all your d...
Simplify hybrid data integration at an enterprise scale. Integrate all your d...
varanasisatyanvesh
 

Recently uploaded (20)

Capstone in Interprofessional Informatic // IMPACT OF COVID 19 ON EDUCATION
Capstone in Interprofessional Informatic  // IMPACT OF COVID 19 ON EDUCATIONCapstone in Interprofessional Informatic  // IMPACT OF COVID 19 ON EDUCATION
Capstone in Interprofessional Informatic // IMPACT OF COVID 19 ON EDUCATION
 
jll-asia-pacific-capital-tracker-1q24.pdf
jll-asia-pacific-capital-tracker-1q24.pdfjll-asia-pacific-capital-tracker-1q24.pdf
jll-asia-pacific-capital-tracker-1q24.pdf
 
Displacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second DerivativesDisplacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second Derivatives
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
 
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
 
Predictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting TechniquesPredictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting Techniques
 
Abortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
Abortion pills in Doha {{ QATAR }} +966572737505) Get CytotecAbortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
Abortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
 
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
 
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
 
Abortion pills in Jeddah |+966572737505 | get cytotec
Abortion pills in Jeddah |+966572737505 | get cytotecAbortion pills in Jeddah |+966572737505 | get cytotec
Abortion pills in Jeddah |+966572737505 | get cytotec
 
sourabh vyas1222222222222222222244444444
sourabh vyas1222222222222222222244444444sourabh vyas1222222222222222222244444444
sourabh vyas1222222222222222222244444444
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
 
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarjSCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
 
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
 
DAA Assignment Solution.pdf is the best1
DAA Assignment Solution.pdf is the best1DAA Assignment Solution.pdf is the best1
DAA Assignment Solution.pdf is the best1
 
Simplify hybrid data integration at an enterprise scale. Integrate all your d...
Simplify hybrid data integration at an enterprise scale. Integrate all your d...Simplify hybrid data integration at an enterprise scale. Integrate all your d...
Simplify hybrid data integration at an enterprise scale. Integrate all your d...
 

Machine intelligence data science methodology 060420

  • 1. Competitive Advantage. Elegantly Engineered. A Different Data Science Methodology
  • 2. We use data, analytics, and design to help clients perform at their best. Machine Intelligence catalyzes innovation, engineers machine learning applications, and builds enduring capabilities. We’re creative, rigorous, and efficient. We bring the sophistication of a large strategy firm with the speed and value of a focused boutique. We apply proven techniques, designs, and world-class expertise to: • Improve how companies engage customers • Optimize machine performance • Enhance process results
  • 3. Models reproduce how questions are answered in training data. Business, not IT, should design training data. Most project time is used understanding how data is generated and building training data sets. Machine Learning is Simple Real world Training Data Results Generally a subset of scenarios in the real world. Data trains models that reproduce decisions in the training data with 80-95% accuracy. The full set of all consumers, machines, or business results that a model will forecast.
  • 4. A Different Data Science Methodology Many data science projects jump into algorithms and technology. We reverse the usual approach by first rigorously defining the business question and understanding data. The methodology: • Aligns the whole business • Sets practical expectations • Leads change • Builds sustaining capabilities Data Technology Business question Business goals Time and focus Data Technology
  • 5. Steps Foundation • Align change across the business • Understand data • Define the business question Results • Sustain capabilities • Communicate value • Build application Model • Iterate production model • Pilot models • Build training data 1. 2. 3.
  • 6. Project Phasing • Most time is spent understanding data and building training data. • An early pilot is key to refining to training data and building support for change. • Developing the full application starts early with a UX for the pilot model.
  • 7. 1. Set Foundation A. Define the business question B. Align change C. Understand data • Learn and set expectations on the data science process and cloud hosting. • Define precise business questions. • Model how answering the business question delivers results. • Link business and regulatory needs to training data design and algorithm selection, e.g. does a model require easy explainability? • Build a coalition of sponsors and communicate the vision. • Define roles for compliance, customer service, finance, marketing, product, and sales. • Understand the data generating process: genchi genbutsu. • Visualize the “shape of the data”: distributions, sensitivity, clusters, anomalies, and sparseness. Identify quality issues. • Capture rules and map data flows from source systems.
  • 8. 2. Build Models A. Build training data B. Pilot models C. Iterate production models • Form business and IT team: roles, super-labelers, biases. • Design the data set’s scenarios and set quality criteria. • Visualize attributes and confirm with business sponsors. • Define rules to pre-process data and select open source algorithms. • Visualize and communicate results. Show an early win. Ideally, prototype the UX. • Plan enhancements to training data, algos, and applications. • Refine data (feature shaping and dimensionality reduction). • Customize rules and algorithms. • Connect into the broader application starting with the data model.
  • 9. 3. Deliver Results A. Build application B. Communicate value C. Sustain capabilities • Visualize UX, define data model and APIs. • Set non-functional requirements such as scalability, latency, and security. • Define test plan. • Communicate how the solution makes jobs better and brings value to customers • Build understanding and support with key influencers • Use multiple channels (meetings, email, calls) repeatedly to ensure reaching people • Optimize costs and scalability. Plan for decreased costs. • Confirm team skills and capacity to evolve the models. • Set plan for and automate re-training models. Set expectations that models may expand the range of scenarios covered and/or may improve precision.
  • 10. Contact Machine Intelligence Partners LLC serves clients globally. Our people are centered in Boston, Bozeman, Grand Rapids, London, New York, San Francisco, and Washington, D.C. Client relationship leaders: New York Jeremy Lehman 917.225.2011 jeremy.lehman@machineintel.com Washington, D.C. Philippe Berckmans 804.405.6009 philippe.berckmans@machineintel.com Machine Intelligence is an Amazon Technology Partner and member of the Microsoft Partner Network. We are a veteran-owned small business.