The lean principles of data ops

Lars Albertsson
Lars AlbertssonFounder & Data Engineer
www.scling.com
The lean principles of
DataOps
Berlin Buzzwords, 2020-06-08
Lars Albertsson, Founder, Scling
Christopher Bergh, CEO & Head Chef, DataKitchen
1
www.scling.com
Scling - data-value-as-a-service
2
Data lake
Stream storage
● Extract value from your data
● Data platform + custom data pipelines
● Imitate data leaders:
○ Quick idea-to-production
○ Operational efficiency
Our marketing strategy:
● Promiscuously share knowledge
○ On slides devoid of glossy polish
www.scling.com
1994: OS/2 Warp CID installation
3
Grmbl, who
reinstalled my
machine?
www.scling.com
IT craft to factory
4
Security Waterfall
Application
delivery
Traditional
operations
Traditional
QA
Infrastructure
DevSecOps Agile
Containers
DevOps CI/CD
Infrastructure
as code
www.scling.com
Security Waterfall
Data factories
5
Application
delivery
Traditional
operations
Traditional
QA
Infrastructure
DB-oriented
architecture
DevSecOps Agile
Containers
DevOps CI/CD
Infrastructure
as code
Data factories,
data pipelines,
DataOps
www.scling.com
The Toyota Way
Selected lean principles:
● Long-term over short-term
● The right process will produce the right results
● Eliminate waste (muda)
● Continuous improvement (kaizen)
● Use pull systems to avoid unnecessary production
● Quality takes precedence (jidoka)
○ Stop to fix problems
● Standardised tasks and processes
● Reliable technology that serves people and process
● Develop your people
● Decisions slowly by consensus
● Relentless reflection (hansei), organisational learning
6
www.scling.com
Common waste species
● Cognitive waste
● Delivery waste
● Operational waste
● Product waste
7
www.scling.com
Cognitive waste
● Why do we have 25 time formats?
○ ISO 8601, UTC assumed
○ ISO 8601 + timezone
○ Millis since epoch, UTC
○ Nanos since epoch, UTC
○ Millis since epoch, user local time
○ …
○ Float of seconds since epoch, as string.
WTF?!?
● my-kafka-topic-name, your_topic_name
8
● Definition of an order:
○ Abandoned cart?
○ Payment refused?
○ Returned goods?
○ Free promotion?
● Data entity source of truth
○ MySQL, Kafka, data lake?
www.scling.com
What causes cognitive waste?
● We are autonomous!
○ Teams can choose technology, format, process, ...
● Cognitive debt
○ Short-term over long-term
○ Decisions without consensus
● Recognition and rewards
○ "You have made a similar independent pipeline, great work!"
9
www.scling.com
Avoiding cognitive waste
● Reusing semantic definitions
● Reusing code & technical definitions
○ Code transparency & sharing
○ Standardised technology
○ Document decisions & consensus process
● Read-only sharing not enough
○ Must be empowered to change for reuse and to improve quality
○ Standardised processes
10
www.scling.com
Eliminating cognitive waste
● Refactoring code, semantics, docs
● Low risk - what will I break downstream?
○ Standardised, automated, trusted QA process
○ End-to-end pipeline testing
● "Creating a pipeline - one day! Replace old pipeline - 18 months."
11
www.scling.com
Delivery waste
● Friction from code to production
○ Ideal: Idea, research, write code+tests, done. Everything else is friction.
● Code inventory
○ Code not yet fully utilised
● Data inventory
○ Data not yet fully processed
12
www.scling.com
Data product quality assurance
● Product quality = f(code, data)
○ Cannot do full QA on code only
○ Only real data is production data
● Test in production
○ Quick QA cycle = quick production deployment
○ Measure, monitor, validate
13
www.scling.com
Eliminating delivery friction
14
● In theory simple - scrutinise everything
○ Positive engineering: writing code, tests, docs, refactor, improve
○ All else is negative
● You are limited by your assumptions
○ State of practice far from state of art
But the test suite
takes 3 hours.
We have this
checklist.
Security must
approve.
X must be
released before Y.
That is another
team's job.
We don't have
access.
We must test in
staging first.
We haven't
performance
tested yet.
www.scling.com
So get rid of the waste. Resources:
No tradeoff between speed and quality!
15
www.scling.com
● Code not yet fully utilised
● Code on its way to production
○ In a notebook
○ Waiting for approval
○ Waiting for release
○ Internally released, waiting
for dependants to upgrade
● Tests not fully used
○ Cover code (shared component),
but not yet executed
Code inventory
16
www.scling.com
Data inventory
● Data collected, but not yet fully processed
○ Traditional lazy joins & SQL processing at runtime
● Eliminate with eager processing = pipeline
○ Process, join, denormalise
● Fatal problems → offline crash
○ "Andon" cord - stop and fix before significant harm is done
17
www.scling.com
Operational waste
● Friction in operational manoeuvres
○ Fear of mistakes
● Cost of incidents
○ Time to recovery
○ Impact of incident
○ Frequency of incidents
18
www.scling.com
Separating offline and online
19
Raw
19
Fraud
serviceFraud
model
Orders Orders
Replication /
Backup
Standard procedures Standard proceduresLightweight procedures
● QA driven by internal efficiency
● Continuous deployment
● New pipeline < 1 day
● Upgrade < 1 hour
● Bug recovery < 1 hour
Careful handover Careful handover
www.scling.com
20
Cost of a software error
Online
● User impact
● Data corruption
● Cascading corruption
● Unbounded recovery
www.scling.com
21
Cost of a software error
Nearline
● Data corruption
● Downstream impact
● Bounded recovery
Online
● User impact
● Data corruption
● Cascading corruption
● Unbounded recovery
Job
Stream
Stream
Job
Stream
www.scling.com
22
Cost of a software error
Nearline
● Data corruption
● Downstream impact
● Bounded recovery
Offline
● Temporary data
corruption
● Downstream impact
● Easy recovery
Online
● User impact
● Data corruption
● Cascading corruption
● Unbounded recovery
Job
Stream
Stream
Job
Stream
www.scling.com
Data speed Innovation speed
23
Nearline
Data processing tradeoff
23
Job
Stream
OfflineOnline
Stream
Job
Stream
www.scling.com
Product waste
● Work not driven by use case
● Unrealised data potential due to friction
○ Unawareness of data
○ Difficulty to use data
● Hidden quality problems
● Collaboration and communication overhead
24
Data democratisation -
making data accessible
and usable
Copyright 2020 by DataKitchen, Inc. All Rights Reserved.
Waste: Your Team’s Time Not Well Spent
25
Percentage
Time Team
Spends Per
Week
Current
Errors &
Operational Tasks
New Features &
Data For Customers
Improvements & Debt
Challenges:
• Complex roles
• Complex organizations
• Complex toolchains
• Complex data
• Complex collaboration
Copyright 2020 DataKitchen, Inc.
Waste: Data Analytics is like the US Auto
Industry in the 1970s
Current
High Errors
Production
Errors
Data Analytics
Team
Deployment
Latency
Weeks, Months
Dev Prod
Challenges:
• Slow to add new features,
rapidly address consumer
requests, changing data sets
• Lack of trust by data
consumers
• Slow model deployment, slow
to move to cloud
• Team morale
26
Copyright 2020 by DataKitchen, Inc.  All Rights Reserved.
Waste: Conway’s Law and Data Pipelines
Data Analytics Follows Conway's Law
The structure of how teams are organized to do Data Science, Data
Engineering, Analytics, and Production is reflected in their data
pipelines.
Copyright 2020 by DataKitchen, Inc.  All Rights Reserved.
Waste: A cornucopia of collaboration complexity
D D
P
D
D
D D
D
D
D
P
D
P
P
D Development - Data Analytic Team P Production - Data Analytic Team
Centralized Dev Centralized Dev & Prod Decentralized Dev Decentralized Dev & Prod
How do we create
together without conflicts?
(Data Engineer & Data
Scientist)
How do we deploy safely
and rapidly? (Data Team and
Production Team)
How to balance centralized
control vs self service freedom?
(Home Office Data Team and
Line of Business Analysts)
How to reuse/incorporate what
another team deployed?
(Multiple Data & Production
Teams in Many Orgs)
DE
DS
BI
Copyright 2020 by DataKitchen, Inc. All Rights Reserved.
Why? Data Teams Are Suffering
Data teams are caught between three competing forces:
• Unaware Data Providers – unaware that they send
crappy, late, and error prone data sets
• Demanding Data Consumers – demand trusted, original
insight at the speed of Amazon delivery
• Critical Supporting Teams – need flawless ongoing
production and collaboration with other teams/people
Make for:
• A beaten down, distraught, disempowered work
environment
• Teams that cannot create and innovate
• Lack of trust all around
29
Unaware Data
Providers
Demanding Data
Consumers
Critical Supporting
Teams
Copyright 2020 by DataKitchen, Inc. All Rights Reserved.
DataOps – Solution To That Suffering
DataOps – The technical practices,
cultural norms, and architecture
that enable:
• Rapid cycles of experimentation
and innovation to delivery of new
insights to our customers
• Low error rates
• Collaboration across complex sets
of people, technology, and
environments
• Clear measurement and monitoring
of results
30Source: Gartner
“Organizations that adopt a DevOps- and DataOps-based
approach are more successful in implementing end-to-end,
reliable, robust, scalable and repeatable solutions.”
Sumit Pal, Gartner, November 2018
People,
Process,
Organization
Technical
Environment
Copyright 2020 by DataKitchen, Inc.  All Rights Reserved.
DataOps Benefit: Lower Cost, More Insight
31
After DataOps
Percentage
Time Team
Spends Per
Week
Before DataOps
New Features &
Data For Customers
Errors &
Operational Tasks
New Features &
Data For Customers
Improvements & Debt
Errors & Operational
Tasks
Process Improvements
& Tech Debt Reduction
Copyright 2020 by DataKitchen, Inc.  All Rights Reserved.
DataOps Benefit: Faster, Better & Happier
32
After DataOpsBefore DataOps
High Errors
Production
Errors Low Errors
Data Analytics
Team
Deployment
Latency
Weeks, Months
Dev Prod
Hours & Mins
Dev Prod
Copyright 2020 by DataKitchen, Inc.  All Rights Reserved.
DevOps vs DataOps (and all those *Opses)
Lean, Learning Origination, and W Edwards Deming Principles: Focus on Low Errors, Cycle Time,
Collaboration, and Measurement
Industrial Manufacturing
Teams
Business
Management
Concept
Data Science, Engineering
and Analytics Teams
IT and Software TeamsOrganization
Team Management Agile, Kanban, Scrum, DA, etc.
Team Management Six Sigma,
Total Quality Management
Organizational
Management
Method
Technical
Environment and
Process DevOps
AIOps
DevSecOps
DataOps
ModelOps
MLOps
…
GitOps
Copyright 2020 by DataKitchen, Inc.  All Rights Reserved.
DevOps vs DataOps (and all those *Opses)
Lean, Learning Origination, and W Edwards Deming Principles: Focus on Low Errors, Cycle Time,
Collaboration, and Measurement
Industrial Manufacturing
Teams
Business
Management
Concept
Data Science, Engineering
and Analytics Teams
IT and Software TeamsOrganization
Team Management Agile, Kanban, Scrum, DA, etc.
Team Management Six Sigma,
Total Quality Management
Organizational
Management
Method
Technical
Environment and
Process DevOps
AIOps
DevSecOps
DataOps
ModelOps
MLOps
…
GitOps
Copyright 2020 by DataKitchen, Inc.  All Rights Reserved.
DevOps vs DataOps (and all those *Opses)
Lean, Learning Origination, and W Edwards Deming Principles: Focus on Low Errors, Cycle Time,
Collaboration, and Measurement
Industrial Manufacturing
Teams
Business
Management
Concept
Data Science, Engineering
and Analytics Teams
IT and Software TeamsOrganization
Team Management Agile, Kanban, Scrum, DA, etc.
Team Management Six Sigma,
Total Quality Management
Organizational
Management
Method
Technical
Environment and
Process DevOps
AIOps
DevSecOps
DataOps
ModelOps
MLOps
…
GitOps
Copyright 2020 by DataKitchen, Inc.  All Rights Reserved.
DevOps vs DataOps (and all those *Opses)
Lean, Learning Origination, and W Edwards Deming Principles: Focus on Low Errors, Cycle Time,
Collaboration, and Measurement
Industrial Manufacturing
Teams
Business
Management
Concept
Data Science, Engineering
and Analytics Teams
IT and Software TeamsOrganization
Team Management Agile, Kanban, Scrum, DA, etc.
Team Management Six Sigma,
Total Quality Management
Organizational
Management
Method
Technical
Environment and
Process DevOps
AIOps
DevSecOps
DataOps
ModelOps
MLOps
…
GitOps
Copyright 2020 by DataKitchen, Inc. All Rights Reserved.
What You Do Is Much Less Important Than
How You Do It
37
“We realized that the true problem, the true difficulty, and where
the greatest potential is – is building the machine that makes
the machine. It’s building the factory.” – Elon Musk
94% of causes were common cause. We often attribute problems
to a specific case, and look for a person to blame, rather than
focusing on the underlying process – Dr Deming
www.scling.com
Questions?
38
1 of 38

Recommended

Washington DC DataOps Meetup -- Nov 2019 by
Washington DC DataOps Meetup   -- Nov 2019Washington DC DataOps Meetup   -- Nov 2019
Washington DC DataOps Meetup -- Nov 2019DataKitchen
3.3K views70 slides
Understanding DataOps and Its Impact on Application Quality by
Understanding DataOps and Its Impact on Application QualityUnderstanding DataOps and Its Impact on Application Quality
Understanding DataOps and Its Impact on Application QualityDevOps.com
534 views22 slides
Data ops in practice by
Data ops in practiceData ops in practice
Data ops in practiceLars Albertsson
3K views26 slides
seven steps to dataops @ dataops.rocks conference Oct 2019 by
seven steps to dataops @ dataops.rocks conference Oct 2019seven steps to dataops @ dataops.rocks conference Oct 2019
seven steps to dataops @ dataops.rocks conference Oct 2019DataKitchen
1K views45 slides
Low-tech, Low-cost data management: Six insights from national reporting on f... by
Low-tech, Low-cost data management: Six insights from national reporting on f...Low-tech, Low-cost data management: Six insights from national reporting on f...
Low-tech, Low-cost data management: Six insights from national reporting on f...srjbridge
400 views14 slides
Dataiku Data Science Studio (datasheet) by
Dataiku Data Science Studio (datasheet)Dataiku Data Science Studio (datasheet)
Dataiku Data Science Studio (datasheet)John Cann
619 views16 slides

More Related Content

What's hot

Future of Data Strategy (ASEAN) by
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Denodo
190 views35 slides
Dsc 2021 presentation_radovan_bacovic by
Dsc 2021 presentation_radovan_bacovicDsc 2021 presentation_radovan_bacovic
Dsc 2021 presentation_radovan_bacovicRadovan Baćović
126 views67 slides
Testing the Data Warehouse—Big Data, Big Problems by
Testing the Data Warehouse—Big Data, Big ProblemsTesting the Data Warehouse—Big Data, Big Problems
Testing the Data Warehouse—Big Data, Big ProblemsTechWell
1.8K views39 slides
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,... by
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...Mihai Criveti
316 views56 slides
Measuring Data Quality with DataOps by
Measuring Data Quality with DataOpsMeasuring Data Quality with DataOps
Measuring Data Quality with DataOpsSteven Ensslen
504 views30 slides
Agile, Automated, Aware: How to Model for Success by
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessInside Analysis
906 views38 slides

What's hot(20)

Future of Data Strategy (ASEAN) by Denodo
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)
Denodo 190 views
Testing the Data Warehouse—Big Data, Big Problems by TechWell
Testing the Data Warehouse—Big Data, Big ProblemsTesting the Data Warehouse—Big Data, Big Problems
Testing the Data Warehouse—Big Data, Big Problems
TechWell1.8K views
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,... by Mihai Criveti
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
Mihai Criveti316 views
Measuring Data Quality with DataOps by Steven Ensslen
Measuring Data Quality with DataOpsMeasuring Data Quality with DataOps
Measuring Data Quality with DataOps
Steven Ensslen504 views
Agile, Automated, Aware: How to Model for Success by Inside Analysis
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for Success
Inside Analysis906 views
Architecting for analytics by Rob Winters
Architecting for analyticsArchitecting for analytics
Architecting for analytics
Rob Winters311 views
You're the New CDO, Now What? by Caserta
You're the New CDO, Now What?You're the New CDO, Now What?
You're the New CDO, Now What?
Caserta 2.3K views
Michael Stonebraker: Big Data, Disruption, and the 800 Pound Gorilla in the ... by TamrMarketing
Michael Stonebraker:  Big Data, Disruption, and the 800 Pound Gorilla in the ...Michael Stonebraker:  Big Data, Disruption, and the 800 Pound Gorilla in the ...
Michael Stonebraker: Big Data, Disruption, and the 800 Pound Gorilla in the ...
TamrMarketing262 views
Open Data Science Conference Agile Data by DataKitchen
Open Data Science Conference Agile DataOpen Data Science Conference Agile Data
Open Data Science Conference Agile Data
DataKitchen1.5K views
Webinar: Attaining Excellence in Big Data Integration by SnapLogic
Webinar: Attaining Excellence in Big Data IntegrationWebinar: Attaining Excellence in Big Data Integration
Webinar: Attaining Excellence in Big Data Integration
SnapLogic5.8K views
The 3 Key Barriers Keeping Companies from Deploying Data Products by Dataiku
The 3 Key Barriers Keeping Companies from Deploying Data Products The 3 Key Barriers Keeping Companies from Deploying Data Products
The 3 Key Barriers Keeping Companies from Deploying Data Products
Dataiku1.4K views
Dataiku, Pitch at Data-Driven NYC, New York City, September 17th 2013 by Dataiku
Dataiku, Pitch at Data-Driven NYC, New York City, September 17th 2013Dataiku, Pitch at Data-Driven NYC, New York City, September 17th 2013
Dataiku, Pitch at Data-Driven NYC, New York City, September 17th 2013
Dataiku2.6K views
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017 by Caserta
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
Caserta 1.9K views
Data Ops at TripActions by Rob Winters
Data Ops at TripActionsData Ops at TripActions
Data Ops at TripActions
Rob Winters147 views
MLUC 2011 XQuery Enigma by Peter O'Kelly
MLUC 2011 XQuery EnigmaMLUC 2011 XQuery Enigma
MLUC 2011 XQuery Enigma
Peter O'Kelly1.3K views
An Ounce of Prevention: Forging Healthy BI by Inside Analysis
An Ounce of Prevention: Forging Healthy BIAn Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BI
Inside Analysis457 views
Architecting Agile Data Applications for Scale by Databricks
Architecting Agile Data Applications for ScaleArchitecting Agile Data Applications for Scale
Architecting Agile Data Applications for Scale
Databricks424 views
CSNI: How State Medicaid Agencies Can Use Analytics to Predict Opioid Abuse a... by Seeling Cheung
CSNI: How State Medicaid Agencies Can Use Analytics to Predict Opioid Abuse a...CSNI: How State Medicaid Agencies Can Use Analytics to Predict Opioid Abuse a...
CSNI: How State Medicaid Agencies Can Use Analytics to Predict Opioid Abuse a...
Seeling Cheung2K views
The Emerging Role of the Data Lake by Caserta
The Emerging Role of the Data LakeThe Emerging Role of the Data Lake
The Emerging Role of the Data Lake
Caserta 1.4K views

Similar to The lean principles of data ops

DataOps - Lean principles and lean practices by
DataOps - Lean principles and lean practicesDataOps - Lean principles and lean practices
DataOps - Lean principles and lean practicesLars Albertsson
787 views29 slides
Data ops in practice - Swedish style by
Data ops in practice - Swedish styleData ops in practice - Swedish style
Data ops in practice - Swedish styleLars Albertsson
408 views59 slides
Crossing the data divide by
Crossing the data divideCrossing the data divide
Crossing the data divideLars Albertsson
3 views31 slides
Holistic data application quality by
Holistic data application qualityHolistic data application quality
Holistic data application qualityLars Albertsson
396 views30 slides
Overcoming Digital Transformation Pain Points by
Overcoming Digital Transformation Pain PointsOvercoming Digital Transformation Pain Points
Overcoming Digital Transformation Pain PointsInductive Automation
172 views40 slides
Developing and Implementing a QA Plan During Your Legacy Data to S1000D by
Developing and Implementing a QA Plan During Your Legacy Data to S1000DDeveloping and Implementing a QA Plan During Your Legacy Data to S1000D
Developing and Implementing a QA Plan During Your Legacy Data to S1000Ddclsocialmedia
807 views33 slides

Similar to The lean principles of data ops(20)

DataOps - Lean principles and lean practices by Lars Albertsson
DataOps - Lean principles and lean practicesDataOps - Lean principles and lean practices
DataOps - Lean principles and lean practices
Lars Albertsson787 views
Data ops in practice - Swedish style by Lars Albertsson
Data ops in practice - Swedish styleData ops in practice - Swedish style
Data ops in practice - Swedish style
Lars Albertsson408 views
Holistic data application quality by Lars Albertsson
Holistic data application qualityHolistic data application quality
Holistic data application quality
Lars Albertsson396 views
Developing and Implementing a QA Plan During Your Legacy Data to S1000D by dclsocialmedia
Developing and Implementing a QA Plan During Your Legacy Data to S1000DDeveloping and Implementing a QA Plan During Your Legacy Data to S1000D
Developing and Implementing a QA Plan During Your Legacy Data to S1000D
dclsocialmedia807 views
Self-Service Analytics with Guard Rails by Denodo
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard Rails
Denodo 507 views
Data engineering in 10 years.pdf by Lars Albertsson
Data engineering in 10 years.pdfData engineering in 10 years.pdf
Data engineering in 10 years.pdf
Lars Albertsson840 views
Data Con LA 2022 - Practical Solutions to Complex Supply Chain Problems by Data Con LA
Data Con LA 2022 - Practical Solutions to Complex Supply Chain ProblemsData Con LA 2022 - Practical Solutions to Complex Supply Chain Problems
Data Con LA 2022 - Practical Solutions to Complex Supply Chain Problems
Data Con LA11 views
Introduction for Embedding Infobright for OEMs by Infobright
Introduction for Embedding Infobright for OEMsIntroduction for Embedding Infobright for OEMs
Introduction for Embedding Infobright for OEMs
Infobright431 views
451 Research + NuoDB: What It Means to be a Container-Native SQL Database by NuoDB
451 Research + NuoDB: What It Means to be a Container-Native SQL Database451 Research + NuoDB: What It Means to be a Container-Native SQL Database
451 Research + NuoDB: What It Means to be a Container-Native SQL Database
NuoDB236 views
Why Your Data Science Architecture Should Include a Data Virtualization Tool ... by Denodo
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Denodo 25 views
Secure software supply chain on a shoestring budget by Lars Albertsson
Secure software supply chain on a shoestring budgetSecure software supply chain on a shoestring budget
Secure software supply chain on a shoestring budget
Lars Albertsson268 views
David García, Rubén Aguilera Díaz-Heredero | A microservices experience in th... by Codemotion
David García, Rubén Aguilera Díaz-Heredero | A microservices experience in th...David García, Rubén Aguilera Díaz-Heredero | A microservices experience in th...
David García, Rubén Aguilera Díaz-Heredero | A microservices experience in th...
Codemotion153 views
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E... by DATAVERSITY
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
DATAVERSITY744 views
Cubodrom profile by cubodrom
Cubodrom profileCubodrom profile
Cubodrom profile
cubodrom251 views
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla... by Precisely
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Precisely93 views
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions by Looker
Power to the People: A Stack to Empower Every User to Make Data-Driven DecisionsPower to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Looker1.2K views
Democratizing Data Science in the Enterprise by Jesus Rodriguez
Democratizing Data Science in the EnterpriseDemocratizing Data Science in the Enterprise
Democratizing Data Science in the Enterprise
Jesus Rodriguez973 views

More from Lars Albertsson

Schema management with Scalameta by
Schema management with ScalametaSchema management with Scalameta
Schema management with ScalametaLars Albertsson
6 views50 slides
How to not kill people - Berlin Buzzwords 2023.pdf by
How to not kill people - Berlin Buzzwords 2023.pdfHow to not kill people - Berlin Buzzwords 2023.pdf
How to not kill people - Berlin Buzzwords 2023.pdfLars Albertsson
34 views51 slides
The 7 habits of data effective companies.pdf by
The 7 habits of data effective companies.pdfThe 7 habits of data effective companies.pdf
The 7 habits of data effective companies.pdfLars Albertsson
252 views44 slides
Ai legal and ethics by
Ai   legal and ethicsAi   legal and ethics
Ai legal and ethicsLars Albertsson
199 views6 slides
The right side of speed - learning to shift left by
The right side of speed - learning to shift leftThe right side of speed - learning to shift left
The right side of speed - learning to shift leftLars Albertsson
202 views44 slides
Mortal analytics - Covid-19 and the problem of data quality by
Mortal analytics - Covid-19 and the problem of data qualityMortal analytics - Covid-19 and the problem of data quality
Mortal analytics - Covid-19 and the problem of data qualityLars Albertsson
416 views43 slides

More from Lars Albertsson(20)

How to not kill people - Berlin Buzzwords 2023.pdf by Lars Albertsson
How to not kill people - Berlin Buzzwords 2023.pdfHow to not kill people - Berlin Buzzwords 2023.pdf
How to not kill people - Berlin Buzzwords 2023.pdf
Lars Albertsson34 views
The 7 habits of data effective companies.pdf by Lars Albertsson
The 7 habits of data effective companies.pdfThe 7 habits of data effective companies.pdf
The 7 habits of data effective companies.pdf
Lars Albertsson252 views
The right side of speed - learning to shift left by Lars Albertsson
The right side of speed - learning to shift leftThe right side of speed - learning to shift left
The right side of speed - learning to shift left
Lars Albertsson202 views
Mortal analytics - Covid-19 and the problem of data quality by Lars Albertsson
Mortal analytics - Covid-19 and the problem of data qualityMortal analytics - Covid-19 and the problem of data quality
Mortal analytics - Covid-19 and the problem of data quality
Lars Albertsson416 views
Eventually, time will kill your data processing by Lars Albertsson
Eventually, time will kill your data processingEventually, time will kill your data processing
Eventually, time will kill your data processing
Lars Albertsson413 views
Taming the reproducibility crisis by Lars Albertsson
Taming the reproducibility crisisTaming the reproducibility crisis
Taming the reproducibility crisis
Lars Albertsson521 views
Eventually, time will kill your data pipeline by Lars Albertsson
Eventually, time will kill your data pipelineEventually, time will kill your data pipeline
Eventually, time will kill your data pipeline
Lars Albertsson936 views
Test strategies for data processing pipelines, v2.0 by Lars Albertsson
Test strategies for data processing pipelines, v2.0Test strategies for data processing pipelines, v2.0
Test strategies for data processing pipelines, v2.0
Lars Albertsson2.7K views
10 ways to stumble with big data by Lars Albertsson
10 ways to stumble with big data10 ways to stumble with big data
10 ways to stumble with big data
Lars Albertsson1.4K views
Protecting privacy in practice by Lars Albertsson
Protecting privacy in practiceProtecting privacy in practice
Protecting privacy in practice
Lars Albertsson9.8K views
Testing data streaming applications by Lars Albertsson
Testing data streaming applicationsTesting data streaming applications
Testing data streaming applications
Lars Albertsson4K views
A primer on building real time data-driven products by Lars Albertsson
A primer on building real time data-driven productsA primer on building real time data-driven products
A primer on building real time data-driven products
Lars Albertsson951 views

Recently uploaded

[DSC Europe 23] Ivana Sesic - Use of AI in Public Health.pptx by
[DSC Europe 23] Ivana Sesic - Use of AI in Public Health.pptx[DSC Europe 23] Ivana Sesic - Use of AI in Public Health.pptx
[DSC Europe 23] Ivana Sesic - Use of AI in Public Health.pptxDataScienceConferenc1
5 views15 slides
Organic Shopping in Google Analytics 4.pdf by
Organic Shopping in Google Analytics 4.pdfOrganic Shopping in Google Analytics 4.pdf
Organic Shopping in Google Analytics 4.pdfGA4 Tutorials
16 views13 slides
CRIJ4385_Death Penalty_F23.pptx by
CRIJ4385_Death Penalty_F23.pptxCRIJ4385_Death Penalty_F23.pptx
CRIJ4385_Death Penalty_F23.pptxyvettemm100
6 views24 slides
Survey on Factuality in LLM's.pptx by
Survey on Factuality in LLM's.pptxSurvey on Factuality in LLM's.pptx
Survey on Factuality in LLM's.pptxNeethaSherra1
7 views9 slides
Chapter 3b- Process Communication (1) (1)(1) (1).pptx by
Chapter 3b- Process Communication (1) (1)(1) (1).pptxChapter 3b- Process Communication (1) (1)(1) (1).pptx
Chapter 3b- Process Communication (1) (1)(1) (1).pptxayeshabaig2004
7 views30 slides
Data about the sector workshop by
Data about the sector workshopData about the sector workshop
Data about the sector workshopinfo828217
12 views27 slides

Recently uploaded(20)

Organic Shopping in Google Analytics 4.pdf by GA4 Tutorials
Organic Shopping in Google Analytics 4.pdfOrganic Shopping in Google Analytics 4.pdf
Organic Shopping in Google Analytics 4.pdf
GA4 Tutorials16 views
CRIJ4385_Death Penalty_F23.pptx by yvettemm100
CRIJ4385_Death Penalty_F23.pptxCRIJ4385_Death Penalty_F23.pptx
CRIJ4385_Death Penalty_F23.pptx
yvettemm1006 views
Survey on Factuality in LLM's.pptx by NeethaSherra1
Survey on Factuality in LLM's.pptxSurvey on Factuality in LLM's.pptx
Survey on Factuality in LLM's.pptx
NeethaSherra17 views
Chapter 3b- Process Communication (1) (1)(1) (1).pptx by ayeshabaig2004
Chapter 3b- Process Communication (1) (1)(1) (1).pptxChapter 3b- Process Communication (1) (1)(1) (1).pptx
Chapter 3b- Process Communication (1) (1)(1) (1).pptx
ayeshabaig20047 views
Data about the sector workshop by info828217
Data about the sector workshopData about the sector workshop
Data about the sector workshop
info82821712 views
[DSC Europe 23][Cryptica] Martin_Summer_Digital_central_bank_money_Ideas_init... by DataScienceConferenc1
[DSC Europe 23][Cryptica] Martin_Summer_Digital_central_bank_money_Ideas_init...[DSC Europe 23][Cryptica] Martin_Summer_Digital_central_bank_money_Ideas_init...
[DSC Europe 23][Cryptica] Martin_Summer_Digital_central_bank_money_Ideas_init...
3196 The Case of The East River by ErickANDRADE90
3196 The Case of The East River3196 The Case of The East River
3196 The Case of The East River
ErickANDRADE9016 views
Cross-network in Google Analytics 4.pdf by GA4 Tutorials
Cross-network in Google Analytics 4.pdfCross-network in Google Analytics 4.pdf
Cross-network in Google Analytics 4.pdf
GA4 Tutorials6 views
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M... by DataScienceConferenc1
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
[DSC Europe 23] Stefan Mrsic_Goran Savic - Evolving Technology Excellence.pptx by DataScienceConferenc1
[DSC Europe 23] Stefan Mrsic_Goran Savic - Evolving Technology Excellence.pptx[DSC Europe 23] Stefan Mrsic_Goran Savic - Evolving Technology Excellence.pptx
[DSC Europe 23] Stefan Mrsic_Goran Savic - Evolving Technology Excellence.pptx
SUPER STORE SQL PROJECT.pptx by khan888620
SUPER STORE SQL PROJECT.pptxSUPER STORE SQL PROJECT.pptx
SUPER STORE SQL PROJECT.pptx
khan88862013 views
PRIVACY AWRE PERSONAL DATA STORAGE by antony420421
PRIVACY AWRE PERSONAL DATA STORAGEPRIVACY AWRE PERSONAL DATA STORAGE
PRIVACY AWRE PERSONAL DATA STORAGE
antony4204215 views
[DSC Europe 23][AI:CSI] Dragan Pleskonjic - AI Impact on Cybersecurity and P... by DataScienceConferenc1
[DSC Europe 23][AI:CSI]  Dragan Pleskonjic - AI Impact on Cybersecurity and P...[DSC Europe 23][AI:CSI]  Dragan Pleskonjic - AI Impact on Cybersecurity and P...
[DSC Europe 23][AI:CSI] Dragan Pleskonjic - AI Impact on Cybersecurity and P...

The lean principles of data ops

  • 1. www.scling.com The lean principles of DataOps Berlin Buzzwords, 2020-06-08 Lars Albertsson, Founder, Scling Christopher Bergh, CEO & Head Chef, DataKitchen 1
  • 2. www.scling.com Scling - data-value-as-a-service 2 Data lake Stream storage ● Extract value from your data ● Data platform + custom data pipelines ● Imitate data leaders: ○ Quick idea-to-production ○ Operational efficiency Our marketing strategy: ● Promiscuously share knowledge ○ On slides devoid of glossy polish
  • 3. www.scling.com 1994: OS/2 Warp CID installation 3 Grmbl, who reinstalled my machine?
  • 4. www.scling.com IT craft to factory 4 Security Waterfall Application delivery Traditional operations Traditional QA Infrastructure DevSecOps Agile Containers DevOps CI/CD Infrastructure as code
  • 6. www.scling.com The Toyota Way Selected lean principles: ● Long-term over short-term ● The right process will produce the right results ● Eliminate waste (muda) ● Continuous improvement (kaizen) ● Use pull systems to avoid unnecessary production ● Quality takes precedence (jidoka) ○ Stop to fix problems ● Standardised tasks and processes ● Reliable technology that serves people and process ● Develop your people ● Decisions slowly by consensus ● Relentless reflection (hansei), organisational learning 6
  • 7. www.scling.com Common waste species ● Cognitive waste ● Delivery waste ● Operational waste ● Product waste 7
  • 8. www.scling.com Cognitive waste ● Why do we have 25 time formats? ○ ISO 8601, UTC assumed ○ ISO 8601 + timezone ○ Millis since epoch, UTC ○ Nanos since epoch, UTC ○ Millis since epoch, user local time ○ … ○ Float of seconds since epoch, as string. WTF?!? ● my-kafka-topic-name, your_topic_name 8 ● Definition of an order: ○ Abandoned cart? ○ Payment refused? ○ Returned goods? ○ Free promotion? ● Data entity source of truth ○ MySQL, Kafka, data lake?
  • 9. www.scling.com What causes cognitive waste? ● We are autonomous! ○ Teams can choose technology, format, process, ... ● Cognitive debt ○ Short-term over long-term ○ Decisions without consensus ● Recognition and rewards ○ "You have made a similar independent pipeline, great work!" 9
  • 10. www.scling.com Avoiding cognitive waste ● Reusing semantic definitions ● Reusing code & technical definitions ○ Code transparency & sharing ○ Standardised technology ○ Document decisions & consensus process ● Read-only sharing not enough ○ Must be empowered to change for reuse and to improve quality ○ Standardised processes 10
  • 11. www.scling.com Eliminating cognitive waste ● Refactoring code, semantics, docs ● Low risk - what will I break downstream? ○ Standardised, automated, trusted QA process ○ End-to-end pipeline testing ● "Creating a pipeline - one day! Replace old pipeline - 18 months." 11
  • 12. www.scling.com Delivery waste ● Friction from code to production ○ Ideal: Idea, research, write code+tests, done. Everything else is friction. ● Code inventory ○ Code not yet fully utilised ● Data inventory ○ Data not yet fully processed 12
  • 13. www.scling.com Data product quality assurance ● Product quality = f(code, data) ○ Cannot do full QA on code only ○ Only real data is production data ● Test in production ○ Quick QA cycle = quick production deployment ○ Measure, monitor, validate 13
  • 14. www.scling.com Eliminating delivery friction 14 ● In theory simple - scrutinise everything ○ Positive engineering: writing code, tests, docs, refactor, improve ○ All else is negative ● You are limited by your assumptions ○ State of practice far from state of art But the test suite takes 3 hours. We have this checklist. Security must approve. X must be released before Y. That is another team's job. We don't have access. We must test in staging first. We haven't performance tested yet.
  • 15. www.scling.com So get rid of the waste. Resources: No tradeoff between speed and quality! 15
  • 16. www.scling.com ● Code not yet fully utilised ● Code on its way to production ○ In a notebook ○ Waiting for approval ○ Waiting for release ○ Internally released, waiting for dependants to upgrade ● Tests not fully used ○ Cover code (shared component), but not yet executed Code inventory 16
  • 17. www.scling.com Data inventory ● Data collected, but not yet fully processed ○ Traditional lazy joins & SQL processing at runtime ● Eliminate with eager processing = pipeline ○ Process, join, denormalise ● Fatal problems → offline crash ○ "Andon" cord - stop and fix before significant harm is done 17
  • 18. www.scling.com Operational waste ● Friction in operational manoeuvres ○ Fear of mistakes ● Cost of incidents ○ Time to recovery ○ Impact of incident ○ Frequency of incidents 18
  • 19. www.scling.com Separating offline and online 19 Raw 19 Fraud serviceFraud model Orders Orders Replication / Backup Standard procedures Standard proceduresLightweight procedures ● QA driven by internal efficiency ● Continuous deployment ● New pipeline < 1 day ● Upgrade < 1 hour ● Bug recovery < 1 hour Careful handover Careful handover
  • 20. www.scling.com 20 Cost of a software error Online ● User impact ● Data corruption ● Cascading corruption ● Unbounded recovery
  • 21. www.scling.com 21 Cost of a software error Nearline ● Data corruption ● Downstream impact ● Bounded recovery Online ● User impact ● Data corruption ● Cascading corruption ● Unbounded recovery Job Stream Stream Job Stream
  • 22. www.scling.com 22 Cost of a software error Nearline ● Data corruption ● Downstream impact ● Bounded recovery Offline ● Temporary data corruption ● Downstream impact ● Easy recovery Online ● User impact ● Data corruption ● Cascading corruption ● Unbounded recovery Job Stream Stream Job Stream
  • 23. www.scling.com Data speed Innovation speed 23 Nearline Data processing tradeoff 23 Job Stream OfflineOnline Stream Job Stream
  • 24. www.scling.com Product waste ● Work not driven by use case ● Unrealised data potential due to friction ○ Unawareness of data ○ Difficulty to use data ● Hidden quality problems ● Collaboration and communication overhead 24 Data democratisation - making data accessible and usable
  • 25. Copyright 2020 by DataKitchen, Inc. All Rights Reserved. Waste: Your Team’s Time Not Well Spent 25 Percentage Time Team Spends Per Week Current Errors & Operational Tasks New Features & Data For Customers Improvements & Debt Challenges: • Complex roles • Complex organizations • Complex toolchains • Complex data • Complex collaboration
  • 26. Copyright 2020 DataKitchen, Inc. Waste: Data Analytics is like the US Auto Industry in the 1970s Current High Errors Production Errors Data Analytics Team Deployment Latency Weeks, Months Dev Prod Challenges: • Slow to add new features, rapidly address consumer requests, changing data sets • Lack of trust by data consumers • Slow model deployment, slow to move to cloud • Team morale 26
  • 27. Copyright 2020 by DataKitchen, Inc.  All Rights Reserved. Waste: Conway’s Law and Data Pipelines Data Analytics Follows Conway's Law The structure of how teams are organized to do Data Science, Data Engineering, Analytics, and Production is reflected in their data pipelines.
  • 28. Copyright 2020 by DataKitchen, Inc.  All Rights Reserved. Waste: A cornucopia of collaboration complexity D D P D D D D D D D P D P P D Development - Data Analytic Team P Production - Data Analytic Team Centralized Dev Centralized Dev & Prod Decentralized Dev Decentralized Dev & Prod How do we create together without conflicts? (Data Engineer & Data Scientist) How do we deploy safely and rapidly? (Data Team and Production Team) How to balance centralized control vs self service freedom? (Home Office Data Team and Line of Business Analysts) How to reuse/incorporate what another team deployed? (Multiple Data & Production Teams in Many Orgs) DE DS BI
  • 29. Copyright 2020 by DataKitchen, Inc. All Rights Reserved. Why? Data Teams Are Suffering Data teams are caught between three competing forces: • Unaware Data Providers – unaware that they send crappy, late, and error prone data sets • Demanding Data Consumers – demand trusted, original insight at the speed of Amazon delivery • Critical Supporting Teams – need flawless ongoing production and collaboration with other teams/people Make for: • A beaten down, distraught, disempowered work environment • Teams that cannot create and innovate • Lack of trust all around 29 Unaware Data Providers Demanding Data Consumers Critical Supporting Teams
  • 30. Copyright 2020 by DataKitchen, Inc. All Rights Reserved. DataOps – Solution To That Suffering DataOps – The technical practices, cultural norms, and architecture that enable: • Rapid cycles of experimentation and innovation to delivery of new insights to our customers • Low error rates • Collaboration across complex sets of people, technology, and environments • Clear measurement and monitoring of results 30Source: Gartner “Organizations that adopt a DevOps- and DataOps-based approach are more successful in implementing end-to-end, reliable, robust, scalable and repeatable solutions.” Sumit Pal, Gartner, November 2018 People, Process, Organization Technical Environment
  • 31. Copyright 2020 by DataKitchen, Inc.  All Rights Reserved. DataOps Benefit: Lower Cost, More Insight 31 After DataOps Percentage Time Team Spends Per Week Before DataOps New Features & Data For Customers Errors & Operational Tasks New Features & Data For Customers Improvements & Debt Errors & Operational Tasks Process Improvements & Tech Debt Reduction
  • 32. Copyright 2020 by DataKitchen, Inc.  All Rights Reserved. DataOps Benefit: Faster, Better & Happier 32 After DataOpsBefore DataOps High Errors Production Errors Low Errors Data Analytics Team Deployment Latency Weeks, Months Dev Prod Hours & Mins Dev Prod
  • 33. Copyright 2020 by DataKitchen, Inc.  All Rights Reserved. DevOps vs DataOps (and all those *Opses) Lean, Learning Origination, and W Edwards Deming Principles: Focus on Low Errors, Cycle Time, Collaboration, and Measurement Industrial Manufacturing Teams Business Management Concept Data Science, Engineering and Analytics Teams IT and Software TeamsOrganization Team Management Agile, Kanban, Scrum, DA, etc. Team Management Six Sigma, Total Quality Management Organizational Management Method Technical Environment and Process DevOps AIOps DevSecOps DataOps ModelOps MLOps … GitOps
  • 34. Copyright 2020 by DataKitchen, Inc.  All Rights Reserved. DevOps vs DataOps (and all those *Opses) Lean, Learning Origination, and W Edwards Deming Principles: Focus on Low Errors, Cycle Time, Collaboration, and Measurement Industrial Manufacturing Teams Business Management Concept Data Science, Engineering and Analytics Teams IT and Software TeamsOrganization Team Management Agile, Kanban, Scrum, DA, etc. Team Management Six Sigma, Total Quality Management Organizational Management Method Technical Environment and Process DevOps AIOps DevSecOps DataOps ModelOps MLOps … GitOps
  • 35. Copyright 2020 by DataKitchen, Inc.  All Rights Reserved. DevOps vs DataOps (and all those *Opses) Lean, Learning Origination, and W Edwards Deming Principles: Focus on Low Errors, Cycle Time, Collaboration, and Measurement Industrial Manufacturing Teams Business Management Concept Data Science, Engineering and Analytics Teams IT and Software TeamsOrganization Team Management Agile, Kanban, Scrum, DA, etc. Team Management Six Sigma, Total Quality Management Organizational Management Method Technical Environment and Process DevOps AIOps DevSecOps DataOps ModelOps MLOps … GitOps
  • 36. Copyright 2020 by DataKitchen, Inc.  All Rights Reserved. DevOps vs DataOps (and all those *Opses) Lean, Learning Origination, and W Edwards Deming Principles: Focus on Low Errors, Cycle Time, Collaboration, and Measurement Industrial Manufacturing Teams Business Management Concept Data Science, Engineering and Analytics Teams IT and Software TeamsOrganization Team Management Agile, Kanban, Scrum, DA, etc. Team Management Six Sigma, Total Quality Management Organizational Management Method Technical Environment and Process DevOps AIOps DevSecOps DataOps ModelOps MLOps … GitOps
  • 37. Copyright 2020 by DataKitchen, Inc. All Rights Reserved. What You Do Is Much Less Important Than How You Do It 37 “We realized that the true problem, the true difficulty, and where the greatest potential is – is building the machine that makes the machine. It’s building the factory.” – Elon Musk 94% of causes were common cause. We often attribute problems to a specific case, and look for a person to blame, rather than focusing on the underlying process – Dr Deming