SlideShare a Scribd company logo
1 of 24
Agile Data Science &
Machine Learning
Operations
Introduction
Key takeaways
Q & A
Agenda
© 2024 Wavicle Data Solutions 2
Introduction
Agile transformations – my view!
4
© 2024 Wavicle Data
Solutions
• Agile is awesome!
• Agile processes provide better visibility to customer stakeholders
• There’s a framework for all needs
• Better suitable for distributed teams
• Removes a lot of boilerplate
Classic Agile Framework
5
© 2024 Wavicle Data Solutions
Product Backlog Sprint Backlog
Daily Scrum
Meeting
24 hours
2 to 4 weeks
Sprint
Retrospective
Potentially Shippable
Product Increment
Classic Agile Framework
6
© 2024 Wavicle Data Solutions
Product Backlog Sprint Backlog
Daily Scrum
Meeting
24 hours
2 to 4
weeks
Sprint
Retrospective
Potentially Shippable
Product Increment
Scrum Of
Scrum
(SoS)
Product Backlog Sprint Backlog
Daily Scrum
Meeting
24 hours
2 to 4
weeks
Sprint
Retrospective
Potentially Shippable
Product Increment
Product Backlog Sprint Backlog
Daily Scrum
Meeting
24 hours
2 to 4
weeks
Sprint
Retrospective
Potentially Shippable
Product Increment
Product Backlog Sprint Backlog
Daily Scrum
Meeting
24 hours
2 to 4
weeks
Sprint
Retrospective
Potentially Shippable
Product Increment
Scrum
Team1
Scrum
Team2
Scrum
Team4
Scrum
Team3
Benefits
Relevant Output Better Time-to-Market Real Feedback Cut Losses Early
7
© 2024 Wavicle Data Solutions
By defining requirements
just before development
(as opposed to all
upfront in a project), the
features are more likely
to meet the most current
needs. The stakeholders’
initial requests often do
not map to their real
needs. Agile practices
help you discover the
true needs earlier in the
process.
By delivering incremental
product features such as
exploratory data reports,
dashboards, or Minimal
Viable Models, users gain
value before the project’s
end.
By soliciting feedback on
the functional product,
the data scientists can
more accurately assess
whether their deliverables
work “in the real-world”.
Meanwhile, the product
manager can assess
whether the deliverables
provide the intended
business value.
No matter what you do,
some data science
projects will not succeed.
The sooner you get
feedback that you are
headed to a dead end, the
sooner you can pivot to
related objectives or a
completely new project.
Key Takeaways
5
2
1 4
3
The verdict
Challenges
Why Data Science? Agile DS Solution
The preparation
© 2024 Wavicle Data Solutions 9
1. Why Data Science?
1
0
© 2024 Wavicle Data
Solutions
• Data Science is a study of data to gain insights.
• Data types – raw, structured, unstructured etc
• Use of scientific methods and algorithms
• Data is useless without Data science
11
© 2024 Wavicle Data Solutions
© 2023 Wavicle Data
Solutions
1
1
2. Is Agile for Data Science?
• Yes! (In most cases)
1
2
© 2024 Wavicle Data
Solutions
• Data Science is naturally non-linear and fits well into the
Agile Framework. It does tend to lack clear upfront
understanding; therefore, the flexibility is required to
pivot based on findings.
• There are some variations in expectations and process
including:
- More ambiguity in planning due to exploratory nature of
data
- Not every iteration will deliver an expected result
- Fully staffed team must include Scrum Master, Data
Analyst, Business Analyst, Data Scientist, and ML
Engineer to be successful
3. Preparing for Agile
Data Science
Required Roles
14
© 2024 Wavicle Data Solutions
Role Definition Allocation
Business Analyst
The business analyst (BA) is responsible for documenting all the business interviews and technology interactions
as they relate to use case development and business expectations. The BA is responsible for documenting the
use cases.
50%
Scrum Master
The scrum Master is responsible for supporting the Agile environment by holding all Agile ceremonies,
developing the feature and sprint backlogs, and helping the team remove blockers.
50%
Data Analyst
The data analyst (DA) is responsible for the exploration and understanding of the data sets. They produce
artifacts like data dictionaries and data visualizations to help determine importance or bias of variables.
100%
Product Owner
The product owner (PO) is responsible for the deliverables required and has knowledge of the business need.
They will act as the main point for escalation as needed. They are responsible for reviewing project status,
helping clear roadblocks identified, and providing feedback on outputs.
50%
Business
Stakeholders and
SMEs
The business stakeholders and SMEs are members of the business affected by/and have knowledge of the need.
They are responsible for working with the development team to create and prioritize use cases, as well as
provide definition around the data and the value derived from outputs.
25%
Data Scientist
The data scientist (DSc) is responsible for performing exploratory data analysis, determining the list of features,
and building/testing their models.
100%
ML Ops Engineer
The ML Ops engineer is responsible for all ML Operations such as deploying machine learning models to
production with appropriate governance, monitoring, and CI/CD.
100%
Data Value Pyramid
Action!
Gain Insight
Explore & Visualize
Validate, Define, Organize
Identify & Access Raw Data
The World of Data!
Developing policy and making decisions based on generated insights
Action
1
2
3
4
Teams need to determine what is the data relevant to the industry/study and
how to ingest into internal systems for analysis
Identify & Access Raw Data
5
Data Science, Machine Learning, Generative AI processes to gain business
insights
Gain Insight
Determine how the data is trending and what picture is it making
Explore & Visualize
This includes defining fields, organizing into schema, and applying Feature
Stores in the case for Data Science
Validate, Define, Organize
© 2024 Wavicle Data Solutions 15
ML Workflow Decomposition
1
6
© 2024 Wavicle Data
Solutions
MLOps
1
7
© 2024 Wavicle Data
Solutions
Gotchas in Agile Data Science
• Watch out for the following and try to avoid:
1
8
© 2024 Wavicle Data Solutions
1. Misunderstanding of the process and lack of buy-in buy stakeholders and
team members
2. Applying software specific framework expectations. This can inhibit the
exploratory nature of data science
3. Expecting hard deadlines. Due to the R&D nature of data science, hard
deadlines and inflexible management may derail the value of the effort.
4. Limiting exploratory activity. Data science research often requires longer
time horizons that are difficult to know during the planning phase.
5. Improper staffing of pods. In most cases, each pod requires a business
analyst, data analyst, data scientist, and machine learning engineer in
order to successfully complete all tasks in the process.
Cross Industry Standard
Process for the
development of ML
Applications with
Quality Assurance
4. The Solution - CRISP-ML(Q)
© 2024 Wavicle Data Solutions 19
• MLOps Frameworks
• Define business
objectives
• Translate business
objectives into ML
objectives
• Collect and verify
data
• Assess the project
feasibility
• Create POC
CRISP-ML(Q) Phase
•
• Business & Data
• Feature Selection
• Data Selection
• Class Balancing
• Cleaning Data
• Feature Engineering
• Data Augmentation
• Data Standardization
•
• Data Engineering
• Define quality
measure of the
model
• ML Algorithm
selection
• Domain Knowledge
• Model Training
• Transfer Learning
• Model Compression
• Ensemble Learning
• Model
Documentation
•
• ML Model Engineering
• Validate model’s
performance
• Determine
robustness
• Increase model’s
explain ability
• Model deployment
• Model evaluation
documentation
•
• ML Model Evaluation
CRISP-ML(Q) Phase
Model Deployment Model Monitoring and Maintenance
2
© 2024 Wavicle Data Solutions
Evaluate model under production condition
User acceptance and usability
Model governance
Deploy according to the selected strategy (A/B
Testing, Multi-armed bandits)
Monitor the efficiency and efficacy of the model
prediction serving
Compare to the previously specified success criteria
(thresholds)
Retrain model if required
Collect new data
Perform labelling of the new data points
Repeat tasks from the Model Engineering
and Model Evaluation phases
Continuous, integration, training and
deployment of the model
5. This is the way!
• Agile
Q & A
• Venkatesa Prasannaa Selvaraj
• Director – Data Analytics
• Venkatesa.prasannaa@wavicledata.com
Thank you!
© 2024 Wavicle Data Solutions 24

More Related Content

Similar to ANIn Coimbatore Sep 2023 | Agile for data science by Venkatesa Prasanna Selvaraj

Best practice for_agile_ds_projects
Best practice for_agile_ds_projectsBest practice for_agile_ds_projects
Best practice for_agile_ds_projectsKhalid Kahloot
 
Sricharan_Sana_11yrs_MDM_DM_CRM
Sricharan_Sana_11yrs_MDM_DM_CRMSricharan_Sana_11yrs_MDM_DM_CRM
Sricharan_Sana_11yrs_MDM_DM_CRMsricharan sana
 
All Together Now: A Recipe for Successful Data Governance
All Together Now: A Recipe for Successful Data GovernanceAll Together Now: A Recipe for Successful Data Governance
All Together Now: A Recipe for Successful Data GovernanceInside Analysis
 
Patterns for Successful Data Science Projects (Spark AI Summit)
Patterns for Successful Data Science Projects (Spark AI Summit)Patterns for Successful Data Science Projects (Spark AI Summit)
Patterns for Successful Data Science Projects (Spark AI Summit)Bill Chambers
 
Data-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality EngineeringData-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality EngineeringDATAVERSITY
 
The Death of the Star Schema
The Death of the Star SchemaThe Death of the Star Schema
The Death of the Star SchemaDATAVERSITY
 
Data science for business leaders executive program
Data science for business leaders executive programData science for business leaders executive program
Data science for business leaders executive programmjitu309
 
The Value of Predictive Analytics and Decision Modeling
The Value of Predictive Analytics and Decision ModelingThe Value of Predictive Analytics and Decision Modeling
The Value of Predictive Analytics and Decision ModelingDecision Management Solutions
 
Data-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling FundamentalsData-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling FundamentalsDATAVERSITY
 
Crafting a Compelling Data Science Resume
Crafting a Compelling Data Science ResumeCrafting a Compelling Data Science Resume
Crafting a Compelling Data Science ResumeArushi Prakash, Ph.D.
 
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced AnalyticsADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced AnalyticsDATAVERSITY
 
Getting Data Quality Right
Getting Data Quality RightGetting Data Quality Right
Getting Data Quality RightDATAVERSITY
 
Transform Data into Action
Transform Data into ActionTransform Data into Action
Transform Data into ActionWorkday, Inc.
 
Analytics Rising: Plan for Success
Analytics Rising: Plan for SuccessAnalytics Rising: Plan for Success
Analytics Rising: Plan for SuccessLewandog, Inc,
 
Most Common Data Governance Challenges in the Digital Economy
Most Common Data Governance Challenges in the Digital EconomyMost Common Data Governance Challenges in the Digital Economy
Most Common Data Governance Challenges in the Digital EconomyRobyn Bollhorst
 

Similar to ANIn Coimbatore Sep 2023 | Agile for data science by Venkatesa Prasanna Selvaraj (20)

Best practice for_agile_ds_projects
Best practice for_agile_ds_projectsBest practice for_agile_ds_projects
Best practice for_agile_ds_projects
 
Sricharan_Sana_11yrs_MDM_DM_CRM
Sricharan_Sana_11yrs_MDM_DM_CRMSricharan_Sana_11yrs_MDM_DM_CRM
Sricharan_Sana_11yrs_MDM_DM_CRM
 
All Together Now: A Recipe for Successful Data Governance
All Together Now: A Recipe for Successful Data GovernanceAll Together Now: A Recipe for Successful Data Governance
All Together Now: A Recipe for Successful Data Governance
 
Patterns for Successful Data Science Projects (Spark AI Summit)
Patterns for Successful Data Science Projects (Spark AI Summit)Patterns for Successful Data Science Projects (Spark AI Summit)
Patterns for Successful Data Science Projects (Spark AI Summit)
 
Data-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality EngineeringData-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality Engineering
 
2014 dqe handouts
2014 dqe handouts2014 dqe handouts
2014 dqe handouts
 
The Death of the Star Schema
The Death of the Star SchemaThe Death of the Star Schema
The Death of the Star Schema
 
Data science for business leaders executive program
Data science for business leaders executive programData science for business leaders executive program
Data science for business leaders executive program
 
SPGregory_Resume
SPGregory_ResumeSPGregory_Resume
SPGregory_Resume
 
The Value of Predictive Analytics and Decision Modeling
The Value of Predictive Analytics and Decision ModelingThe Value of Predictive Analytics and Decision Modeling
The Value of Predictive Analytics and Decision Modeling
 
Data-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling FundamentalsData-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling Fundamentals
 
Ajit Kumar
Ajit KumarAjit Kumar
Ajit Kumar
 
Ajit Kumar
Ajit KumarAjit Kumar
Ajit Kumar
 
Heidi's Resume_2016
Heidi's Resume_2016Heidi's Resume_2016
Heidi's Resume_2016
 
Crafting a Compelling Data Science Resume
Crafting a Compelling Data Science ResumeCrafting a Compelling Data Science Resume
Crafting a Compelling Data Science Resume
 
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced AnalyticsADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
 
Getting Data Quality Right
Getting Data Quality RightGetting Data Quality Right
Getting Data Quality Right
 
Transform Data into Action
Transform Data into ActionTransform Data into Action
Transform Data into Action
 
Analytics Rising: Plan for Success
Analytics Rising: Plan for SuccessAnalytics Rising: Plan for Success
Analytics Rising: Plan for Success
 
Most Common Data Governance Challenges in the Digital Economy
Most Common Data Governance Challenges in the Digital EconomyMost Common Data Governance Challenges in the Digital Economy
Most Common Data Governance Challenges in the Digital Economy
 

More from AgileNetwork

ANIn Chennai April 2024 |Agile Engineering: Modernizing Legacy Systems by Ana...
ANIn Chennai April 2024 |Agile Engineering: Modernizing Legacy Systems by Ana...ANIn Chennai April 2024 |Agile Engineering: Modernizing Legacy Systems by Ana...
ANIn Chennai April 2024 |Agile Engineering: Modernizing Legacy Systems by Ana...AgileNetwork
 
ANIn Chennai April 2024 |Beyond Big Bang: Technical Agility in Vintage Produc...
ANIn Chennai April 2024 |Beyond Big Bang: Technical Agility in Vintage Produc...ANIn Chennai April 2024 |Beyond Big Bang: Technical Agility in Vintage Produc...
ANIn Chennai April 2024 |Beyond Big Bang: Technical Agility in Vintage Produc...AgileNetwork
 
ANIn Gurugram April 2024 |Agile Adaptation: Driving Progress in Generative AI...
ANIn Gurugram April 2024 |Agile Adaptation: Driving Progress in Generative AI...ANIn Gurugram April 2024 |Agile Adaptation: Driving Progress in Generative AI...
ANIn Gurugram April 2024 |Agile Adaptation: Driving Progress in Generative AI...AgileNetwork
 
ANIn Noida Oct 2023 |AI Usage in Agile Transformation Journey by Kunal
ANIn Noida Oct 2023 |AI Usage in Agile Transformation Journey by KunalANIn Noida Oct 2023 |AI Usage in Agile Transformation Journey by Kunal
ANIn Noida Oct 2023 |AI Usage in Agile Transformation Journey by KunalAgileNetwork
 
ANIn Kolkata April 2024 |Ethics of AI by Abhishek Nandy
ANIn Kolkata April 2024 |Ethics of AI by Abhishek NandyANIn Kolkata April 2024 |Ethics of AI by Abhishek Nandy
ANIn Kolkata April 2024 |Ethics of AI by Abhishek NandyAgileNetwork
 
ANIn Kolkata April 2024 | AI Enabled Reflection in Agile Delivery by Indranil...
ANIn Kolkata April 2024 | AI Enabled Reflection in Agile Delivery by Indranil...ANIn Kolkata April 2024 | AI Enabled Reflection in Agile Delivery by Indranil...
ANIn Kolkata April 2024 | AI Enabled Reflection in Agile Delivery by Indranil...AgileNetwork
 
ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...
ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...
ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...AgileNetwork
 
ANIn Pune April 2024 |L&D Accelerating business growth by Mukta Nalke
ANIn Pune April 2024 |L&D Accelerating business growth by Mukta NalkeANIn Pune April 2024 |L&D Accelerating business growth by Mukta Nalke
ANIn Pune April 2024 |L&D Accelerating business growth by Mukta NalkeAgileNetwork
 
ANIn Pune April 2024 | Meeting Modern Learning Needs with Innovation by Ankit...
ANIn Pune April 2024 | Meeting Modern Learning Needs with Innovation by Ankit...ANIn Pune April 2024 | Meeting Modern Learning Needs with Innovation by Ankit...
ANIn Pune April 2024 | Meeting Modern Learning Needs with Innovation by Ankit...AgileNetwork
 
ANIn Ahmedabad April 2024 | Powering Big Wins with Small, Agile Teams by Yoge...
ANIn Ahmedabad April 2024 | Powering Big Wins with Small, Agile Teams by Yoge...ANIn Ahmedabad April 2024 | Powering Big Wins with Small, Agile Teams by Yoge...
ANIn Ahmedabad April 2024 | Powering Big Wins with Small, Agile Teams by Yoge...AgileNetwork
 
ANIn Coimbatore March 2024 | Unlocking Agility with Gen AI by Balaprasanna S
ANIn Coimbatore March 2024 | Unlocking Agility with Gen AI by Balaprasanna SANIn Coimbatore March 2024 | Unlocking Agility with Gen AI by Balaprasanna S
ANIn Coimbatore March 2024 | Unlocking Agility with Gen AI by Balaprasanna SAgileNetwork
 
ANIn Coimbatore March 2024 | Agile & AI in Project Management by Dhilipkumar ...
ANIn Coimbatore March 2024 | Agile & AI in Project Management by Dhilipkumar ...ANIn Coimbatore March 2024 | Agile & AI in Project Management by Dhilipkumar ...
ANIn Coimbatore March 2024 | Agile & AI in Project Management by Dhilipkumar ...AgileNetwork
 
ANIn Chennai March 2024 |Conversational AI for Agility in Healthcare by Shine...
ANIn Chennai March 2024 |Conversational AI for Agility in Healthcare by Shine...ANIn Chennai March 2024 |Conversational AI for Agility in Healthcare by Shine...
ANIn Chennai March 2024 |Conversational AI for Agility in Healthcare by Shine...AgileNetwork
 
ANIn Chennai March 2024 |Oxygenating AI ecosystem with Agility by Gowtham Bal...
ANIn Chennai March 2024 |Oxygenating AI ecosystem with Agility by Gowtham Bal...ANIn Chennai March 2024 |Oxygenating AI ecosystem with Agility by Gowtham Bal...
ANIn Chennai March 2024 |Oxygenating AI ecosystem with Agility by Gowtham Bal...AgileNetwork
 
ANIn Ahmedabad March 2024 | The Power of Retrospection by Rakesh Mehta
ANIn Ahmedabad March 2024 | The Power of Retrospection by Rakesh MehtaANIn Ahmedabad March 2024 | The Power of Retrospection by Rakesh Mehta
ANIn Ahmedabad March 2024 | The Power of Retrospection by Rakesh MehtaAgileNetwork
 
ANIn Pune March 2024 | Customer Stratification for Business Growth by Manish ...
ANIn Pune March 2024 | Customer Stratification for Business Growth by Manish ...ANIn Pune March 2024 | Customer Stratification for Business Growth by Manish ...
ANIn Pune March 2024 | Customer Stratification for Business Growth by Manish ...AgileNetwork
 
ANIn Coimbatore July 2023 | Business Agility in Data Science by Dr.Selvaraaju...
ANIn Coimbatore July 2023 | Business Agility in Data Science by Dr.Selvaraaju...ANIn Coimbatore July 2023 | Business Agility in Data Science by Dr.Selvaraaju...
ANIn Coimbatore July 2023 | Business Agility in Data Science by Dr.Selvaraaju...AgileNetwork
 
ANIn Coimbatore May 2023 | Agile and Beyond by Nithya Sitharam
ANIn Coimbatore May 2023 | Agile and Beyond by Nithya SitharamANIn Coimbatore May 2023 | Agile and Beyond by Nithya Sitharam
ANIn Coimbatore May 2023 | Agile and Beyond by Nithya SitharamAgileNetwork
 
ANIn Hyderabad Jun 2023 |Humanizing Agile Transformation Beyond Process and T...
ANIn Hyderabad Jun 2023 |Humanizing Agile Transformation Beyond Process and T...ANIn Hyderabad Jun 2023 |Humanizing Agile Transformation Beyond Process and T...
ANIn Hyderabad Jun 2023 |Humanizing Agile Transformation Beyond Process and T...AgileNetwork
 
ANIn Coimbatore Jul 2023 |The Importance of Business Agility in the Current L...
ANIn Coimbatore Jul 2023 |The Importance of Business Agility in the Current L...ANIn Coimbatore Jul 2023 |The Importance of Business Agility in the Current L...
ANIn Coimbatore Jul 2023 |The Importance of Business Agility in the Current L...AgileNetwork
 

More from AgileNetwork (20)

ANIn Chennai April 2024 |Agile Engineering: Modernizing Legacy Systems by Ana...
ANIn Chennai April 2024 |Agile Engineering: Modernizing Legacy Systems by Ana...ANIn Chennai April 2024 |Agile Engineering: Modernizing Legacy Systems by Ana...
ANIn Chennai April 2024 |Agile Engineering: Modernizing Legacy Systems by Ana...
 
ANIn Chennai April 2024 |Beyond Big Bang: Technical Agility in Vintage Produc...
ANIn Chennai April 2024 |Beyond Big Bang: Technical Agility in Vintage Produc...ANIn Chennai April 2024 |Beyond Big Bang: Technical Agility in Vintage Produc...
ANIn Chennai April 2024 |Beyond Big Bang: Technical Agility in Vintage Produc...
 
ANIn Gurugram April 2024 |Agile Adaptation: Driving Progress in Generative AI...
ANIn Gurugram April 2024 |Agile Adaptation: Driving Progress in Generative AI...ANIn Gurugram April 2024 |Agile Adaptation: Driving Progress in Generative AI...
ANIn Gurugram April 2024 |Agile Adaptation: Driving Progress in Generative AI...
 
ANIn Noida Oct 2023 |AI Usage in Agile Transformation Journey by Kunal
ANIn Noida Oct 2023 |AI Usage in Agile Transformation Journey by KunalANIn Noida Oct 2023 |AI Usage in Agile Transformation Journey by Kunal
ANIn Noida Oct 2023 |AI Usage in Agile Transformation Journey by Kunal
 
ANIn Kolkata April 2024 |Ethics of AI by Abhishek Nandy
ANIn Kolkata April 2024 |Ethics of AI by Abhishek NandyANIn Kolkata April 2024 |Ethics of AI by Abhishek Nandy
ANIn Kolkata April 2024 |Ethics of AI by Abhishek Nandy
 
ANIn Kolkata April 2024 | AI Enabled Reflection in Agile Delivery by Indranil...
ANIn Kolkata April 2024 | AI Enabled Reflection in Agile Delivery by Indranil...ANIn Kolkata April 2024 | AI Enabled Reflection in Agile Delivery by Indranil...
ANIn Kolkata April 2024 | AI Enabled Reflection in Agile Delivery by Indranil...
 
ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...
ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...
ANIn Gurugram April 2024 |Can Agile and AI work together? by Pramodkumar Shri...
 
ANIn Pune April 2024 |L&D Accelerating business growth by Mukta Nalke
ANIn Pune April 2024 |L&D Accelerating business growth by Mukta NalkeANIn Pune April 2024 |L&D Accelerating business growth by Mukta Nalke
ANIn Pune April 2024 |L&D Accelerating business growth by Mukta Nalke
 
ANIn Pune April 2024 | Meeting Modern Learning Needs with Innovation by Ankit...
ANIn Pune April 2024 | Meeting Modern Learning Needs with Innovation by Ankit...ANIn Pune April 2024 | Meeting Modern Learning Needs with Innovation by Ankit...
ANIn Pune April 2024 | Meeting Modern Learning Needs with Innovation by Ankit...
 
ANIn Ahmedabad April 2024 | Powering Big Wins with Small, Agile Teams by Yoge...
ANIn Ahmedabad April 2024 | Powering Big Wins with Small, Agile Teams by Yoge...ANIn Ahmedabad April 2024 | Powering Big Wins with Small, Agile Teams by Yoge...
ANIn Ahmedabad April 2024 | Powering Big Wins with Small, Agile Teams by Yoge...
 
ANIn Coimbatore March 2024 | Unlocking Agility with Gen AI by Balaprasanna S
ANIn Coimbatore March 2024 | Unlocking Agility with Gen AI by Balaprasanna SANIn Coimbatore March 2024 | Unlocking Agility with Gen AI by Balaprasanna S
ANIn Coimbatore March 2024 | Unlocking Agility with Gen AI by Balaprasanna S
 
ANIn Coimbatore March 2024 | Agile & AI in Project Management by Dhilipkumar ...
ANIn Coimbatore March 2024 | Agile & AI in Project Management by Dhilipkumar ...ANIn Coimbatore March 2024 | Agile & AI in Project Management by Dhilipkumar ...
ANIn Coimbatore March 2024 | Agile & AI in Project Management by Dhilipkumar ...
 
ANIn Chennai March 2024 |Conversational AI for Agility in Healthcare by Shine...
ANIn Chennai March 2024 |Conversational AI for Agility in Healthcare by Shine...ANIn Chennai March 2024 |Conversational AI for Agility in Healthcare by Shine...
ANIn Chennai March 2024 |Conversational AI for Agility in Healthcare by Shine...
 
ANIn Chennai March 2024 |Oxygenating AI ecosystem with Agility by Gowtham Bal...
ANIn Chennai March 2024 |Oxygenating AI ecosystem with Agility by Gowtham Bal...ANIn Chennai March 2024 |Oxygenating AI ecosystem with Agility by Gowtham Bal...
ANIn Chennai March 2024 |Oxygenating AI ecosystem with Agility by Gowtham Bal...
 
ANIn Ahmedabad March 2024 | The Power of Retrospection by Rakesh Mehta
ANIn Ahmedabad March 2024 | The Power of Retrospection by Rakesh MehtaANIn Ahmedabad March 2024 | The Power of Retrospection by Rakesh Mehta
ANIn Ahmedabad March 2024 | The Power of Retrospection by Rakesh Mehta
 
ANIn Pune March 2024 | Customer Stratification for Business Growth by Manish ...
ANIn Pune March 2024 | Customer Stratification for Business Growth by Manish ...ANIn Pune March 2024 | Customer Stratification for Business Growth by Manish ...
ANIn Pune March 2024 | Customer Stratification for Business Growth by Manish ...
 
ANIn Coimbatore July 2023 | Business Agility in Data Science by Dr.Selvaraaju...
ANIn Coimbatore July 2023 | Business Agility in Data Science by Dr.Selvaraaju...ANIn Coimbatore July 2023 | Business Agility in Data Science by Dr.Selvaraaju...
ANIn Coimbatore July 2023 | Business Agility in Data Science by Dr.Selvaraaju...
 
ANIn Coimbatore May 2023 | Agile and Beyond by Nithya Sitharam
ANIn Coimbatore May 2023 | Agile and Beyond by Nithya SitharamANIn Coimbatore May 2023 | Agile and Beyond by Nithya Sitharam
ANIn Coimbatore May 2023 | Agile and Beyond by Nithya Sitharam
 
ANIn Hyderabad Jun 2023 |Humanizing Agile Transformation Beyond Process and T...
ANIn Hyderabad Jun 2023 |Humanizing Agile Transformation Beyond Process and T...ANIn Hyderabad Jun 2023 |Humanizing Agile Transformation Beyond Process and T...
ANIn Hyderabad Jun 2023 |Humanizing Agile Transformation Beyond Process and T...
 
ANIn Coimbatore Jul 2023 |The Importance of Business Agility in the Current L...
ANIn Coimbatore Jul 2023 |The Importance of Business Agility in the Current L...ANIn Coimbatore Jul 2023 |The Importance of Business Agility in the Current L...
ANIn Coimbatore Jul 2023 |The Importance of Business Agility in the Current L...
 

Recently uploaded

Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 

Recently uploaded (20)

Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 

ANIn Coimbatore Sep 2023 | Agile for data science by Venkatesa Prasanna Selvaraj

  • 1. Agile Data Science & Machine Learning Operations
  • 2. Introduction Key takeaways Q & A Agenda © 2024 Wavicle Data Solutions 2
  • 4. Agile transformations – my view! 4 © 2024 Wavicle Data Solutions • Agile is awesome! • Agile processes provide better visibility to customer stakeholders • There’s a framework for all needs • Better suitable for distributed teams • Removes a lot of boilerplate
  • 5. Classic Agile Framework 5 © 2024 Wavicle Data Solutions Product Backlog Sprint Backlog Daily Scrum Meeting 24 hours 2 to 4 weeks Sprint Retrospective Potentially Shippable Product Increment
  • 6. Classic Agile Framework 6 © 2024 Wavicle Data Solutions Product Backlog Sprint Backlog Daily Scrum Meeting 24 hours 2 to 4 weeks Sprint Retrospective Potentially Shippable Product Increment Scrum Of Scrum (SoS) Product Backlog Sprint Backlog Daily Scrum Meeting 24 hours 2 to 4 weeks Sprint Retrospective Potentially Shippable Product Increment Product Backlog Sprint Backlog Daily Scrum Meeting 24 hours 2 to 4 weeks Sprint Retrospective Potentially Shippable Product Increment Product Backlog Sprint Backlog Daily Scrum Meeting 24 hours 2 to 4 weeks Sprint Retrospective Potentially Shippable Product Increment Scrum Team1 Scrum Team2 Scrum Team4 Scrum Team3
  • 7. Benefits Relevant Output Better Time-to-Market Real Feedback Cut Losses Early 7 © 2024 Wavicle Data Solutions By defining requirements just before development (as opposed to all upfront in a project), the features are more likely to meet the most current needs. The stakeholders’ initial requests often do not map to their real needs. Agile practices help you discover the true needs earlier in the process. By delivering incremental product features such as exploratory data reports, dashboards, or Minimal Viable Models, users gain value before the project’s end. By soliciting feedback on the functional product, the data scientists can more accurately assess whether their deliverables work “in the real-world”. Meanwhile, the product manager can assess whether the deliverables provide the intended business value. No matter what you do, some data science projects will not succeed. The sooner you get feedback that you are headed to a dead end, the sooner you can pivot to related objectives or a completely new project.
  • 9. 5 2 1 4 3 The verdict Challenges Why Data Science? Agile DS Solution The preparation © 2024 Wavicle Data Solutions 9
  • 10. 1. Why Data Science? 1 0 © 2024 Wavicle Data Solutions • Data Science is a study of data to gain insights. • Data types – raw, structured, unstructured etc • Use of scientific methods and algorithms • Data is useless without Data science
  • 11. 11 © 2024 Wavicle Data Solutions © 2023 Wavicle Data Solutions 1 1
  • 12. 2. Is Agile for Data Science? • Yes! (In most cases) 1 2 © 2024 Wavicle Data Solutions • Data Science is naturally non-linear and fits well into the Agile Framework. It does tend to lack clear upfront understanding; therefore, the flexibility is required to pivot based on findings. • There are some variations in expectations and process including: - More ambiguity in planning due to exploratory nature of data - Not every iteration will deliver an expected result - Fully staffed team must include Scrum Master, Data Analyst, Business Analyst, Data Scientist, and ML Engineer to be successful
  • 13. 3. Preparing for Agile Data Science
  • 14. Required Roles 14 © 2024 Wavicle Data Solutions Role Definition Allocation Business Analyst The business analyst (BA) is responsible for documenting all the business interviews and technology interactions as they relate to use case development and business expectations. The BA is responsible for documenting the use cases. 50% Scrum Master The scrum Master is responsible for supporting the Agile environment by holding all Agile ceremonies, developing the feature and sprint backlogs, and helping the team remove blockers. 50% Data Analyst The data analyst (DA) is responsible for the exploration and understanding of the data sets. They produce artifacts like data dictionaries and data visualizations to help determine importance or bias of variables. 100% Product Owner The product owner (PO) is responsible for the deliverables required and has knowledge of the business need. They will act as the main point for escalation as needed. They are responsible for reviewing project status, helping clear roadblocks identified, and providing feedback on outputs. 50% Business Stakeholders and SMEs The business stakeholders and SMEs are members of the business affected by/and have knowledge of the need. They are responsible for working with the development team to create and prioritize use cases, as well as provide definition around the data and the value derived from outputs. 25% Data Scientist The data scientist (DSc) is responsible for performing exploratory data analysis, determining the list of features, and building/testing their models. 100% ML Ops Engineer The ML Ops engineer is responsible for all ML Operations such as deploying machine learning models to production with appropriate governance, monitoring, and CI/CD. 100%
  • 15. Data Value Pyramid Action! Gain Insight Explore & Visualize Validate, Define, Organize Identify & Access Raw Data The World of Data! Developing policy and making decisions based on generated insights Action 1 2 3 4 Teams need to determine what is the data relevant to the industry/study and how to ingest into internal systems for analysis Identify & Access Raw Data 5 Data Science, Machine Learning, Generative AI processes to gain business insights Gain Insight Determine how the data is trending and what picture is it making Explore & Visualize This includes defining fields, organizing into schema, and applying Feature Stores in the case for Data Science Validate, Define, Organize © 2024 Wavicle Data Solutions 15
  • 16. ML Workflow Decomposition 1 6 © 2024 Wavicle Data Solutions
  • 17. MLOps 1 7 © 2024 Wavicle Data Solutions
  • 18. Gotchas in Agile Data Science • Watch out for the following and try to avoid: 1 8 © 2024 Wavicle Data Solutions 1. Misunderstanding of the process and lack of buy-in buy stakeholders and team members 2. Applying software specific framework expectations. This can inhibit the exploratory nature of data science 3. Expecting hard deadlines. Due to the R&D nature of data science, hard deadlines and inflexible management may derail the value of the effort. 4. Limiting exploratory activity. Data science research often requires longer time horizons that are difficult to know during the planning phase. 5. Improper staffing of pods. In most cases, each pod requires a business analyst, data analyst, data scientist, and machine learning engineer in order to successfully complete all tasks in the process.
  • 19. Cross Industry Standard Process for the development of ML Applications with Quality Assurance 4. The Solution - CRISP-ML(Q) © 2024 Wavicle Data Solutions 19 • MLOps Frameworks
  • 20. • Define business objectives • Translate business objectives into ML objectives • Collect and verify data • Assess the project feasibility • Create POC CRISP-ML(Q) Phase • • Business & Data • Feature Selection • Data Selection • Class Balancing • Cleaning Data • Feature Engineering • Data Augmentation • Data Standardization • • Data Engineering • Define quality measure of the model • ML Algorithm selection • Domain Knowledge • Model Training • Transfer Learning • Model Compression • Ensemble Learning • Model Documentation • • ML Model Engineering • Validate model’s performance • Determine robustness • Increase model’s explain ability • Model deployment • Model evaluation documentation • • ML Model Evaluation
  • 21. CRISP-ML(Q) Phase Model Deployment Model Monitoring and Maintenance 2 © 2024 Wavicle Data Solutions Evaluate model under production condition User acceptance and usability Model governance Deploy according to the selected strategy (A/B Testing, Multi-armed bandits) Monitor the efficiency and efficacy of the model prediction serving Compare to the previously specified success criteria (thresholds) Retrain model if required Collect new data Perform labelling of the new data points Repeat tasks from the Model Engineering and Model Evaluation phases Continuous, integration, training and deployment of the model
  • 22. 5. This is the way! • Agile
  • 23. Q & A
  • 24. • Venkatesa Prasannaa Selvaraj • Director – Data Analytics • Venkatesa.prasannaa@wavicledata.com Thank you! © 2024 Wavicle Data Solutions 24

Editor's Notes

  1. Agile does not make sense for data science in academic settings.
  2. Agile does not make sense for data science in academic settings.
  3. The Agile Data Science process will take place at this level of the data value pyramid. Therefore, the other foundational items must be set before implementing.