What is MLOps?🚀
Machine Learning Operations
Leonardo Moraes
Speaker
www.amaris.com
Leonardo Moraes
● Team Leader &
Experienced Consultant
Amaris Consulting, 2023
● PhD in Computer Science
USP, Brazil, in progress
● MSc in Computer Science
USP, Brazil, 2020
● Bachelor in Computer Science
UFMS, Brazil, 2017
Agenda
● How does Data Science work?
● How does MLOps support Data Science?
● How can I join this wave?
1. What is Data Science?
New tech world
Without realizing it, people generate data all the time
on the Internet
● Marketplaces - Site, payment method, search...
● Media player - Search, recommendations, advertisements...
Offline
● Supermarket - What we buy, how to organize the aisles…
● Payment - Debit card, credit card, PIX, TED, DOC...
According to Gartner Research
● In 2020, an estimated 44 trillion
gigabytes (44 zettabytes) of data in the world
● 2.2 million terabytes generated per day
New tech world
How can we analyze it?
1. Business Specialist
2. Data Analyst
3. Data Scientist
Example - Borrowing money
Traditional design
● If you are 25 years old or older
and have an income of R$3,000.00,
● you can borrow R$15,000.00, to be repaid over 5 years
Scientific design
● Fix an acceptable risk (e.g., 5%)
so that the bank does not lose money
● Input: age, gender, gross salary, ...
● Output: loan amount and financing period.
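The contrast between the two designs can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Applicant` fields, the `estimated_default_risk` formula, and the step and cap values are hypothetical stand-ins, not a real credit model. In practice the risk estimate would come from a model trained on historical data.

```python
from dataclasses import dataclass


@dataclass
class Applicant:
    age: int
    monthly_income: float  # in R$


def traditional_rule(applicant: Applicant) -> float:
    """Fixed business rule from the slide: age >= 25 and income >= R$3,000."""
    if applicant.age >= 25 and applicant.monthly_income >= 3000.0:
        return 15000.0
    return 0.0


def estimated_default_risk(applicant: Applicant, amount: float, years: int) -> float:
    """Hypothetical stand-in for a trained risk model (assumption, not a real formula)."""
    yearly_income = 12 * applicant.monthly_income
    burden = (amount / years) / yearly_income  # share of income spent on repayment
    age_factor = 1.2 if applicant.age < 25 else 1.0
    return min(1.0, 0.25 * burden * age_factor)


def scientific_offer(applicant: Applicant, max_risk: float = 0.05, years: int = 5,
                     step: float = 1000.0, cap: float = 100000.0) -> float:
    """Largest amount (in multiples of `step`) whose estimated risk stays within max_risk."""
    best = 0.0
    amount = step
    while amount <= cap and estimated_default_risk(applicant, amount, years) <= max_risk:
        best = amount
        amount += step
    return best
```

The rule returns the same amount for everyone who passes the thresholds, while the risk-based version adapts the offer to each applicant for a chosen risk level.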
What is Data Science?
Data science is a multidisciplinary
field that uses scientific processes,
algorithms, and systems to extract
knowledge and insights from data.
by Google Bard
Logo generated by ideogram
2. How does Data Science work?
Knowledge Discovery in Databases
Data Science Procedure
Advances in Knowledge Discovery and Data Mining.
Fayyad, U. M., et al. (1996)
Procedure executed by
● Data Scientist
● Data Engineer
● ML Engineer
Selection
● Identification of the subset
of data that should be
considered in the process
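As a minimal sketch of the selection step, assuming the data is a list of dictionaries (the records, column names, and the `select` helper below are invented for illustration), selection keeps only the rows and columns relevant to the question being asked:

```python
# Hypothetical raw records; the fields are illustrative assumptions.
records = [
    {"customer_id": 1, "age": 31, "income": 4200.0, "favorite_color": "blue", "year": 2023},
    {"customer_id": 2, "age": 45, "income": 8800.0, "favorite_color": "red", "year": 2019},
    {"customer_id": 3, "age": 28, "income": 3000.0, "favorite_color": "green", "year": 2024},
]


def select(rows, columns, predicate):
    """Keep rows that satisfy `predicate`, and only the requested columns."""
    return [{c: row[c] for c in columns} for row in rows if predicate(row)]


# Only recent customers, and only the attributes useful for a credit analysis.
selected = select(records, ["age", "income"], lambda row: row["year"] >= 2023)
```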
Preprocessing
● Data cleaning; covers any
processing related to data
quality and data integrity
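A small cleaning routine might look like the sketch below. The three rules (drop exact duplicates, drop impossible ages, fill missing incomes with the median) and the field names are assumptions chosen for illustration; real projects pick rules that match their own data.

```python
from statistics import median


def clean(rows):
    """Illustrative cleaning: deduplicate, enforce integrity, impute missing values."""
    # 1. Drop exact duplicates while preserving order.
    seen = set()
    unique = []
    for row in rows:
        key = tuple(sorted(row.items()))
        if key not in seen:
            seen.add(key)
            unique.append(dict(row))

    # 2. Integrity check: an age must be non-negative.
    valid = [row for row in unique if row["age"] >= 0]

    # 3. Quality fix: replace missing incomes with the median of the known ones.
    known = [row["income"] for row in valid if row["income"] is not None]
    fill = median(known) if known else None
    for row in valid:
        if row["income"] is None:
            row["income"] = fill
    return valid


raw = [
    {"age": 31, "income": 4200.0},
    {"age": 31, "income": 4200.0},   # duplicate
    {"age": -5, "income": 3000.0},   # impossible age
    {"age": 45, "income": None},     # missing income
    {"age": 28, "income": 3000.0},
]
cleaned = clean(raw)
```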
Transformation
● Data aggregation and
transformation; encode
data into inputs that
algorithms can consume
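Two common encodings can be sketched as follows, assuming one numeric column (`age`) and one categorical column (`channel`); both names and the helper functions are hypothetical. Numeric values are rescaled to the range 0 to 1, and categories become one-hot vectors.

```python
def min_max(values):
    """Rescale numbers to the [0, 1] range."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def one_hot(value, categories):
    """Encode a category as a vector with a single 1.0."""
    return [1.0 if value == c else 0.0 for c in categories]


def encode(rows):
    """Turn each row into a numeric feature vector: [scaled age, one-hot channel...]."""
    categories = sorted({row["channel"] for row in rows})
    ages = min_max([row["age"] for row in rows])
    return [[age] + one_hot(row["channel"], categories) for row, age in zip(rows, ages)]


rows = [
    {"age": 20, "channel": "web"},
    {"age": 30, "channel": "store"},
    {"age": 40, "channel": "web"},
]
features = encode(rows)
```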
Data Mining
● Practice of analyzing large
databases in order to
generate new information
● Tasks: classification,
clustering, image
recognition, etc.
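To make the classification task concrete, here is a minimal nearest-centroid classifier in plain Python. It is one simple technique picked for illustration, not the method used in the talk, and the toy data and labels are invented.

```python
from collections import defaultdict
from math import dist


def fit_centroids(X, y):
    """Compute the mean feature vector (centroid) of each class."""
    groups = defaultdict(list)
    for features, label in zip(X, y):
        groups[label].append(features)
    return {
        label: [sum(column) / len(column) for column in zip(*rows)]
        for label, rows in groups.items()
    }


def predict(centroids, features):
    """Assign the label of the closest centroid (Euclidean distance)."""
    return min(centroids, key=lambda label: dist(centroids[label], features))


# Toy training data: two features per example, two classes.
X = [[1.0, 1.0], [1.5, 2.0], [8.0, 8.0], [9.0, 9.5]]
y = ["low", "low", "high", "high"]
centroids = fit_centroids(X, y)
```

Calling `predict(centroids, [2.0, 2.0])` assigns a new example to the class whose centroid is nearest.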
Interpretation
● Transform identified/inferred
patterns into knowledge
● Make new knowledge
available to customers
● Data Engineer - Prepare
and organize the data
● Data Scientist - Generate
knowledge and insights
● ML Engineer - Automate
and deliver a product
3. How does MLOps support Data Science?
MLOps
In the world of data science and machine
learning, the process of developing, deploying,
and maintaining models can be complex and
challenging. MLOps, short for Machine Learning
Operations, has emerged as a crucial discipline
that aims to streamline this process and make it
more manageable, efficient, and effective.
Essential MLOps (2023),
by Data Science Horizons
MLOps in Data Science
MLOps: Continuous delivery and
automation pipelines in machine learning
(Google, NIPS 2014 Workshop)
Challenge
● Technical debt
● High monetary risk factors
Goal
● apply DevOps principles to
ML systems (MLOps)
● automation and monitoring
MLOps x DevOps
● CI is no longer only about testing and validating code, but
also testing and validating data, data schemas, and models.
● CT is a new property, unique to ML systems, that's concerned
with automatically retraining and serving the models.
● CD is no longer about a single software package or a service,
but an ML training pipeline that should automatically deploy
another service (model prediction service).
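The CI point above, testing data, schemas, and models alongside code, can be sketched as two checks that a pipeline might run before accepting a change. The schema, the column names, and the accuracy gate are illustrative assumptions; production teams typically rely on dedicated validation tooling.

```python
# Hypothetical expected schema for incoming training data.
EXPECTED_SCHEMA = {"age": int, "income": float, "channel": str}


def validate_schema(rows, schema):
    """Return a list of human-readable schema violations (empty means OK)."""
    errors = []
    for i, row in enumerate(rows):
        missing = schema.keys() - row.keys()
        if missing:
            errors.append(f"row {i}: missing columns {sorted(missing)}")
        for column, expected_type in schema.items():
            if column in row and not isinstance(row[column], expected_type):
                errors.append(f"row {i}: {column} should be {expected_type.__name__}")
    return errors


def validate_model(candidate_accuracy, baseline_accuracy):
    """Model gate: a candidate must be at least as accurate as the current baseline."""
    return candidate_accuracy >= baseline_accuracy


good = [{"age": 31, "income": 4200.0, "channel": "web"}]
bad = [{"age": "31", "income": 4200.0}]
```

A CI job would fail the build when `validate_schema` returns any error or when `validate_model` returns `False`.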
MLOps Maturity Levels
MLOps: Continuous delivery and automation
pipelines in machine learning (Google)
MLOps Maturity Model with
Azure Machine Learning (Azure)
Level 0 - No MLOps
● Training process: untracked file
● Release process: manual, hand-off
● Technology: manual builds and deployments; manual testing of model and application; no tracking of model performance
Level 1 - DevOps, no MLOps
● Training process: untracked file
● Release process: semi-automated
● Technology: automated builds; automated tests for application code
Level 2 - Automated Training
● Training process: tracked, with run results and model artifacts
● Release process: automated release; code is version controlled
● Technology: automated model training; tracking of model training performance; model orchestration and management
Level 3 - Full MLOps, Automated Retraining
● Training process: tracked, with run results and model artifacts; retraining set up based on metrics
● Release process: automated; CI/CD pipeline set up; A/B testing has been added
● Technology: automated model training and testing; centralized metrics from deployed model; automated tests for all code
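At the highest level, retraining is triggered by metrics rather than by hand. A minimal sketch of such a trigger is shown below; the window size, the tolerance, and the function name are assumptions, and a real setup would read the accuracy values from a monitoring system.

```python
def should_retrain(recent_accuracy, baseline_accuracy, tolerance=0.05, window=3):
    """Trigger retraining when the average of the last `window` accuracy
    measurements falls more than `tolerance` below the baseline."""
    if len(recent_accuracy) < window:
        return False  # not enough evidence yet
    latest = recent_accuracy[-window:]
    return sum(latest) / len(latest) < baseline_accuracy - tolerance
```

For example, with a baseline of 0.92, a model measured at 0.85, 0.80, and 0.82 would be scheduled for retraining, while one measured at 0.91, 0.90, and 0.92 would not.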
Maturity L.0 - no MLOps
Blogpost - MLOps Maturity Model with Azure Machine Learning (Azure)
Maturity L.1 - DevOps no MLOps
Blogpost - MLOps Maturity Model with Azure Machine Learning (Azure)
Maturity L.2 - Automated Training
Blogpost - MLOps Maturity Model with Azure Machine Learning (Azure)
Maturity L.3 - Full MLOps
Blogpost - MLOps Maturity Model with Azure Machine Learning (Azure)
4. How can I join this wave?
Machine Learning Engineer
1. Learn the basics of Data Science
2. Strong programming skills: Python, CI/CD, Git, Linux
General
1. Build a portfolio: personal projects, contributions
2. Networking: participate in communities and events
3. Stay updated: lifelong learning!
How To Become MLOps Engineer in 2024
by Asad Iqbal
MLOps challenge 🎯
Driving Innovation: Leveraging Machine Learning Platforms
by Leonardo Moraes, 2024
5. Conclusion
End!
MLOps makes ML more efficient, reliable, and scalable.
● Observability: The ability to monitor models helps
identify degraded performance or anomalies quickly.
● Optimization: enhance the performance, efficiency,
and scalability of machine learning systems.
● Automation: allows for smooth and lower-risk
rollouts of new ML model versions.
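As one concrete reading of the observability point, the sketch below compares live feature values against the training data and reports how far the live mean has moved, measured in training standard deviations. The function name and the alert threshold of 3 are assumptions for illustration; many other drift measures exist.

```python
from statistics import mean, stdev


def drift_score(training_values, live_values):
    """Shift of the live mean from the training mean, in training standard deviations."""
    mu = mean(training_values)
    sigma = stdev(training_values)
    shift = abs(mean(live_values) - mu)
    if sigma == 0:
        return 0.0 if shift == 0 else float("inf")
    return shift / sigma


def is_anomalous(training_values, live_values, threshold=3.0):
    """Flag the feature when the shift exceeds the (assumed) alert threshold."""
    return drift_score(training_values, live_values) > threshold


training = [10, 12, 11, 13, 12]
```

A monitoring job could run `is_anomalous` on each feature at regular intervals and raise an alert when it returns `True`.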
Team Leader &
Experienced Consultant
Leonardo Moraes
Thank you for
your attention!
