SlideShare a Scribd company logo
Data DevOps
Scott W. Ambler
Consulting Methodologist | Agile Data Coach
Ambysoft Inc.
© Ambysoft Inc. All rights reserved.
1
Download these slides at Slideshare.net/ScottWAmbler/
Agenda
© Ambysoft Inc. All rights reserved. 2
• Data DevOps
• Agile data development techniques
• Agile data operations techniques
• Parting thoughts
• Questions and answers
Scott Ambler
© Ambysoft Inc. All rights reserved. 3
• scott@scottambler.com
• Twitter: @scottwambler
• linkedin.com/in/sambler/
Consulting
Methodologist
Ambysoft.com
Thought Leader
AgileData.org
Thought Leader
AgileModeling.com
© Ambysoft Inc. All rights reserved. 4
Data
Dev
Data
Ops
Defining Data DevOps
Data Development (Data Dev)
The effort to develop and evolve the data aspects of your solutions
Data Operations (Data Ops)
The effort to operate, support, and govern the data aspects of your solutions
Data DevOps
The streamlined combination of Data Dev + Data Ops, part of your overall DevOps
strategy
© Ambysoft Inc. All rights reserved. 5
© Ambysoft Inc. All rights reserved. 6
Data
Dev
Data
Ops
Agile data
modeling
Database
refactoring
Continuous
database
integration (CDI)
Automated
database
regression
testing
Continuous database
deployment (CDD)
Operational data
quality assurance
Data
security
Critical Data DevOps Techniques
© Ambysoft Inc. All rights reserved. 7
Data modeling is the act of exploring
data-oriented structures
Evolutionary data modeling is data
modeling performed in an iterative
and incremental manner
Agile data modeling is evolutionary
data modeling done in a collaborative
manner
Source: AgileData.org/essays/agileDataModeling.html
Agile Data Modeling
Construction Sprint
JIT Modeling
Detailed physical data
modeling (to generate DDL)
Sprint 0:
Initial Modeling
High-level conceptual modeling
Test-Driven
Development
Detailed specification
© Ambysoft Inc. All rights reserved. 8
Source: AgileData.org/essays/lookAheadDataAnalysis.html
Agile Data Modeling: Look-Ahead Data Analysis
There are several factors that will determine how far ahead you need to perform look-ahead data analysis:
• The complexity of the data source(s).
• Your ability to gain access to the data source(s).
• The difficulty of the question being asked.
• The skill, experience, and knowledge of the data analyst(s).
• The availability of the data analyst(s).
• Your data profiling tools.
Sometimes it takes several days, even weeks, to perform data analytics before you can implement
a question story.
© Ambysoft Inc. All rights reserved. 9
A database refactoring is a simple change to a database
schema that improves its design while retaining both its
behavioral and informational semantics
A database schema includes structural aspects such as
table and view definitions; functional aspects such as
stored procedures and triggers; and informational
aspects such as the data itself
Source: AgileData.org/essays/databaseRefactoring.html
Database Refactoring
© Ambysoft Inc. All rights reserved. 10
Source: AgileData.org/essays/databaseRefactoring.html
Customer
CustomerID <<PK>>
Fname
Customer
CustomerID <<PK>>
Fname
FirstName
SynchronizeFirstName()
Customer
CustomerID <<PK>>
FirstName
Original
Schema:
Interim
Schema:
Final
Schema:
Database Refactoring: Rename Column
© Ambysoft Inc. All rights reserved. 11
• Part of building the system is building the
database (if it changed)
• Challenge: Tests SHOULD put the database back
into a known state, but sometimes don’t
• You will want to rebuild the (non-production)
database from scratch every so often
• Challenge: Database accesses take time
• Some test suites will test against DB mocks
• You still need to test the actual database
occasionally
• Source:
AgileData.org/essays/continuousIntegration.html
Continuous Database Integration (CDI)
© Ambysoft Inc. All rights reserved. 12
Source: AgileData.org/essays/databaseTesting.html
Database Regression Testing
© Ambysoft Inc. All rights reserved. 13
Continuous Deployment between Sandboxes
Development Integration 1 Integration N Production
• Each integration DB must know it’s version, so as to support database refactoring
• CDI differs between sandboxes
• Development sandboxes often implement database mocks
• Integration sandboxes should test against the database
• Production has limited tests and checks for performance reasons
Source: AgileData.org/essays/sandboxes.html
Operational Data Quality Assurance
There are many techniques available to ensure operational data quality
• Data transformation
• Data masking
• Data stewardship
• Data privacy
• Data governance
• Data source backup
• Data lineage
• Data history
• Data cleansing
Challenges:
• Increasing volume of data
• Increasing data arrival rate
© Ambysoft Inc. All rights reserved. 14
Data Security
There are a myriad of data security
issues to consider:
• Data encryption
• Data privacy
• Access control
• Secure storage
• Secure deletion
• And many more…
© Ambysoft Inc. All rights reserved. 15
Parting Thoughts
© Ambysoft Inc. All rights reserved. 16
© Ambysoft Inc. All rights reserved. 17
The increasing pace of change,
increasing complexity,
and increasing volume of data
demands nothing less than
complete data agility
Thank You!
© Ambysoft Inc. All rights reserved. 18
• scott@scottambler.com
• Twitter: @scottwambler
• linkedin.com/in/sambler/
Consulting Methodologist
Ambysoft.com
Thought Leader
AgileData.org
Thought Leader
AgileModeling.com
The Agile Data Site
AgileData.org
Key articles:
• Data Technical Debt: How to Address Quality Problems in Data Sources
• Database Techniques Stack
• The Agile Data Architect
© Ambysoft Inc. All rights reserved. 19
Look-Ahead Data Analysis: Agile
© Ambysoft Inc. All rights reserved. 20
Scenario: You want to implement three question stories in sprint #9
You need to:
• Have a definition of ready (DoR) indicating the amount of data analysis work required
• Guesstimate the amount of data analysis required for each one, and then perform the analysis
sufficiently before sprint #9
• Have sufficient capacity to perform look-ahead data analysis
• Interleave data analysis for other sprints into the work of the people performing it
Note: Staffing your team with specialists will exacerbate work scheduling challenges. Consider
generalizing specialists (AgileModeling.com/essays/generalizingSpecialists.htm) instead.
Look-Ahead Data Analysis: Continuous Delivery
© Ambysoft Inc. All rights reserved. 21
Development – QS 9a
Development – QS 9b
Development – QS 9c
Scenario: You want to implement the same three question stories
• You are not constrained by organizing the work into sprints
• Development = Data analysis + other implementation work
• Work can be brought into the team as capacity permits
• Value is delivered when it is available
• Average cycle time to deliver stories is shorter
• Staffing your team with generalizing specialists may enable you to swarm around data analysis
work and shorten the development time of a question story
Agile Data Ways of Thinking (WoT)
© Ambysoft Inc. All rights reserved. 22
Agile Data Ways of Thinking (WoT)
AgileData.org/essays/philosophies.html
Look Beyond Data
Collaborate Closely
Be Quality Infected
Embrace Evolution
Be Enterprise Aware
Fit-For-Purpose
Everyone Agile
© Ambysoft Inc. All rights reserved. 23

More Related Content

Similar to Data DevOps: An Overview

Part 3: Models in Production: A Look From Beginning to End
Part 3: Models in Production: A Look From Beginning to EndPart 3: Models in Production: A Look From Beginning to End
Part 3: Models in Production: A Look From Beginning to End
Cloudera, Inc.
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data Integration
DATAVERSITY
 
Managing Large Amounts of Data with Salesforce
Managing Large Amounts of Data with SalesforceManaging Large Amounts of Data with Salesforce
Managing Large Amounts of Data with Salesforce
Sense Corp
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
Eric Kavanagh
 
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
DATAVERSITY
 
Geek Sync | Is Your Database Environment Ready for DevOps?
Geek Sync | Is Your Database Environment Ready for DevOps?Geek Sync | Is Your Database Environment Ready for DevOps?
Geek Sync | Is Your Database Environment Ready for DevOps?
IDERA Software
 
Data Science in the Enterprise
Data Science in the EnterpriseData Science in the Enterprise
Data Science in the Enterprise
The Hive
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
DATAVERSITY
 
IBM Cloud Day January 2021 - A well architected data lake
IBM Cloud Day January 2021 - A well architected data lakeIBM Cloud Day January 2021 - A well architected data lake
IBM Cloud Day January 2021 - A well architected data lake
Torsten Steinbach
 
The Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They NeedThe Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They Need
Dunn Solutions Group
 
How Celtra Optimizes its Advertising Platform with Databricks
How Celtra Optimizes its Advertising Platformwith DatabricksHow Celtra Optimizes its Advertising Platformwith Databricks
How Celtra Optimizes its Advertising Platform with Databricks
Grega Kespret
 
Chapter 6: Data Operations Management
Chapter 6: Data Operations ManagementChapter 6: Data Operations Management
Chapter 6: Data Operations Management
Ahmed Alorage
 
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
DATAVERSITY
 
Hadoop Application Architectures tutorial at Big DataService 2015
Hadoop Application Architectures tutorial at Big DataService 2015Hadoop Application Architectures tutorial at Big DataService 2015
Hadoop Application Architectures tutorial at Big DataService 2015
hadooparchbook
 
DataOps , cbuswaw April '23
DataOps , cbuswaw April '23DataOps , cbuswaw April '23
DataOps , cbuswaw April '23
Jason Packer
 
The Key to Big Data Modeling: Collaboration
The Key to Big Data Modeling: CollaborationThe Key to Big Data Modeling: Collaboration
The Key to Big Data Modeling: Collaboration
Embarcadero Technologies
 
Challenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in ProductionChallenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in Production
iguazio
 
Big Data Boom
Big Data BoomBig Data Boom
Data Engineering.pdf
Data Engineering.pdfData Engineering.pdf
Data Engineering.pdf
Datacademy.ai
 
Resume Of David Kragness
Resume Of David KragnessResume Of David Kragness
Resume Of David KragnessOmahaDBA
 

Similar to Data DevOps: An Overview (20)

Part 3: Models in Production: A Look From Beginning to End
Part 3: Models in Production: A Look From Beginning to EndPart 3: Models in Production: A Look From Beginning to End
Part 3: Models in Production: A Look From Beginning to End
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data Integration
 
Managing Large Amounts of Data with Salesforce
Managing Large Amounts of Data with SalesforceManaging Large Amounts of Data with Salesforce
Managing Large Amounts of Data with Salesforce
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
 
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
 
Geek Sync | Is Your Database Environment Ready for DevOps?
Geek Sync | Is Your Database Environment Ready for DevOps?Geek Sync | Is Your Database Environment Ready for DevOps?
Geek Sync | Is Your Database Environment Ready for DevOps?
 
Data Science in the Enterprise
Data Science in the EnterpriseData Science in the Enterprise
Data Science in the Enterprise
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
IBM Cloud Day January 2021 - A well architected data lake
IBM Cloud Day January 2021 - A well architected data lakeIBM Cloud Day January 2021 - A well architected data lake
IBM Cloud Day January 2021 - A well architected data lake
 
The Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They NeedThe Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They Need
 
How Celtra Optimizes its Advertising Platform with Databricks
How Celtra Optimizes its Advertising Platformwith DatabricksHow Celtra Optimizes its Advertising Platformwith Databricks
How Celtra Optimizes its Advertising Platform with Databricks
 
Chapter 6: Data Operations Management
Chapter 6: Data Operations ManagementChapter 6: Data Operations Management
Chapter 6: Data Operations Management
 
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
 
Hadoop Application Architectures tutorial at Big DataService 2015
Hadoop Application Architectures tutorial at Big DataService 2015Hadoop Application Architectures tutorial at Big DataService 2015
Hadoop Application Architectures tutorial at Big DataService 2015
 
DataOps , cbuswaw April '23
DataOps , cbuswaw April '23DataOps , cbuswaw April '23
DataOps , cbuswaw April '23
 
The Key to Big Data Modeling: Collaboration
The Key to Big Data Modeling: CollaborationThe Key to Big Data Modeling: Collaboration
The Key to Big Data Modeling: Collaboration
 
Challenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in ProductionChallenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in Production
 
Big Data Boom
Big Data BoomBig Data Boom
Big Data Boom
 
Data Engineering.pdf
Data Engineering.pdfData Engineering.pdf
Data Engineering.pdf
 
Resume Of David Kragness
Resume Of David KragnessResume Of David Kragness
Resume Of David Kragness
 

More from Scott W. Ambler

Artificial Intelligence for Project Managers: Are You Ready?
Artificial Intelligence for Project Managers: Are You Ready?Artificial Intelligence for Project Managers: Are You Ready?
Artificial Intelligence for Project Managers: Are You Ready?
Scott W. Ambler
 
Applying Disciplined Agile: Become a Learning Organization
Applying Disciplined Agile: Become a Learning OrganizationApplying Disciplined Agile: Become a Learning Organization
Applying Disciplined Agile: Become a Learning Organization
Scott W. Ambler
 
EDGY: A Disciplined Look
EDGY: A Disciplined LookEDGY: A Disciplined Look
EDGY: A Disciplined Look
Scott W. Ambler
 
Agile Data Warehousing (DW)/Business Intelligence (BI): Addressing the Hard P...
Agile Data Warehousing (DW)/Business Intelligence (BI): Addressing the Hard P...Agile Data Warehousing (DW)/Business Intelligence (BI): Addressing the Hard P...
Agile Data Warehousing (DW)/Business Intelligence (BI): Addressing the Hard P...
Scott W. Ambler
 
Technical Debt: A Management Problem That Requires a Management Solution
Technical Debt: A Management Problem That Requires a Management SolutionTechnical Debt: A Management Problem That Requires a Management Solution
Technical Debt: A Management Problem That Requires a Management Solution
Scott W. Ambler
 
Working Smarter: Learn, Optimize, Accelerate
Working Smarter: Learn, Optimize, AccelerateWorking Smarter: Learn, Optimize, Accelerate
Working Smarter: Learn, Optimize, Accelerate
Scott W. Ambler
 
No frameworks: How we can take agile back
No frameworks: How we can take agile backNo frameworks: How we can take agile back
No frameworks: How we can take agile back
Scott W. Ambler
 
Agile transformations: The good, the bad, and the ugly
Agile transformations: The good, the bad, and the uglyAgile transformations: The good, the bad, and the ugly
Agile transformations: The good, the bad, and the ugly
Scott W. Ambler
 
Choose Your Way of Working (WoW)!
Choose Your Way of Working (WoW)!Choose Your Way of Working (WoW)!
Choose Your Way of Working (WoW)!
Scott W. Ambler
 
Choose Your WoW! DevOps in the Enterprise
Choose Your WoW!  DevOps in the EnterpriseChoose Your WoW!  DevOps in the Enterprise
Choose Your WoW! DevOps in the Enterprise
Scott W. Ambler
 
Disciplined Agile Data Management
Disciplined Agile Data ManagementDisciplined Agile Data Management
Disciplined Agile Data Management
Scott W. Ambler
 
Agile Modeling: A Disciplined Approach to Modelling and Documentation
Agile Modeling: A Disciplined Approach to Modelling and DocumentationAgile Modeling: A Disciplined Approach to Modelling and Documentation
Agile Modeling: A Disciplined Approach to Modelling and Documentation
Scott W. Ambler
 
Measuring Agile: A Disciplined Approach To Metrics
Measuring Agile: A Disciplined Approach To MetricsMeasuring Agile: A Disciplined Approach To Metrics
Measuring Agile: A Disciplined Approach To Metrics
Scott W. Ambler
 
Agile enterprise architecture
Agile enterprise architectureAgile enterprise architecture
Agile enterprise architecture
Scott W. Ambler
 
(In Agile) Where Do All The Managers Go?
(In Agile) Where Do All The Managers Go?(In Agile) Where Do All The Managers Go?
(In Agile) Where Do All The Managers Go?
Scott W. Ambler
 
Disciplined agile business analysis
Disciplined agile business analysisDisciplined agile business analysis
Disciplined agile business analysis
Scott W. Ambler
 
Crushed by technical debt
Crushed by technical debtCrushed by technical debt
Crushed by technical debt
Scott W. Ambler
 
Disciplined Agile Outsourcing: Making it work for both the customer and the s...
Disciplined Agile Outsourcing: Making it work for both the customer and the s...Disciplined Agile Outsourcing: Making it work for both the customer and the s...
Disciplined Agile Outsourcing: Making it work for both the customer and the s...
Scott W. Ambler
 
Disciplined Agile Business Analysis
Disciplined Agile Business AnalysisDisciplined Agile Business Analysis
Disciplined Agile Business Analysis
Scott W. Ambler
 
Continuous Architecture and Emergent Design: Disciplined Agile Strategies
Continuous Architecture and Emergent Design: Disciplined Agile StrategiesContinuous Architecture and Emergent Design: Disciplined Agile Strategies
Continuous Architecture and Emergent Design: Disciplined Agile Strategies
Scott W. Ambler
 

More from Scott W. Ambler (20)

Artificial Intelligence for Project Managers: Are You Ready?
Artificial Intelligence for Project Managers: Are You Ready?Artificial Intelligence for Project Managers: Are You Ready?
Artificial Intelligence for Project Managers: Are You Ready?
 
Applying Disciplined Agile: Become a Learning Organization
Applying Disciplined Agile: Become a Learning OrganizationApplying Disciplined Agile: Become a Learning Organization
Applying Disciplined Agile: Become a Learning Organization
 
EDGY: A Disciplined Look
EDGY: A Disciplined LookEDGY: A Disciplined Look
EDGY: A Disciplined Look
 
Agile Data Warehousing (DW)/Business Intelligence (BI): Addressing the Hard P...
Agile Data Warehousing (DW)/Business Intelligence (BI): Addressing the Hard P...Agile Data Warehousing (DW)/Business Intelligence (BI): Addressing the Hard P...
Agile Data Warehousing (DW)/Business Intelligence (BI): Addressing the Hard P...
 
Technical Debt: A Management Problem That Requires a Management Solution
Technical Debt: A Management Problem That Requires a Management SolutionTechnical Debt: A Management Problem That Requires a Management Solution
Technical Debt: A Management Problem That Requires a Management Solution
 
Working Smarter: Learn, Optimize, Accelerate
Working Smarter: Learn, Optimize, AccelerateWorking Smarter: Learn, Optimize, Accelerate
Working Smarter: Learn, Optimize, Accelerate
 
No frameworks: How we can take agile back
No frameworks: How we can take agile backNo frameworks: How we can take agile back
No frameworks: How we can take agile back
 
Agile transformations: The good, the bad, and the ugly
Agile transformations: The good, the bad, and the uglyAgile transformations: The good, the bad, and the ugly
Agile transformations: The good, the bad, and the ugly
 
Choose Your Way of Working (WoW)!
Choose Your Way of Working (WoW)!Choose Your Way of Working (WoW)!
Choose Your Way of Working (WoW)!
 
Choose Your WoW! DevOps in the Enterprise
Choose Your WoW!  DevOps in the EnterpriseChoose Your WoW!  DevOps in the Enterprise
Choose Your WoW! DevOps in the Enterprise
 
Disciplined Agile Data Management
Disciplined Agile Data ManagementDisciplined Agile Data Management
Disciplined Agile Data Management
 
Agile Modeling: A Disciplined Approach to Modelling and Documentation
Agile Modeling: A Disciplined Approach to Modelling and DocumentationAgile Modeling: A Disciplined Approach to Modelling and Documentation
Agile Modeling: A Disciplined Approach to Modelling and Documentation
 
Measuring Agile: A Disciplined Approach To Metrics
Measuring Agile: A Disciplined Approach To MetricsMeasuring Agile: A Disciplined Approach To Metrics
Measuring Agile: A Disciplined Approach To Metrics
 
Agile enterprise architecture
Agile enterprise architectureAgile enterprise architecture
Agile enterprise architecture
 
(In Agile) Where Do All The Managers Go?
(In Agile) Where Do All The Managers Go?(In Agile) Where Do All The Managers Go?
(In Agile) Where Do All The Managers Go?
 
Disciplined agile business analysis
Disciplined agile business analysisDisciplined agile business analysis
Disciplined agile business analysis
 
Crushed by technical debt
Crushed by technical debtCrushed by technical debt
Crushed by technical debt
 
Disciplined Agile Outsourcing: Making it work for both the customer and the s...
Disciplined Agile Outsourcing: Making it work for both the customer and the s...Disciplined Agile Outsourcing: Making it work for both the customer and the s...
Disciplined Agile Outsourcing: Making it work for both the customer and the s...
 
Disciplined Agile Business Analysis
Disciplined Agile Business AnalysisDisciplined Agile Business Analysis
Disciplined Agile Business Analysis
 
Continuous Architecture and Emergent Design: Disciplined Agile Strategies
Continuous Architecture and Emergent Design: Disciplined Agile StrategiesContinuous Architecture and Emergent Design: Disciplined Agile Strategies
Continuous Architecture and Emergent Design: Disciplined Agile Strategies
 

Recently uploaded

Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Product School
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 

Recently uploaded (20)

Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 

Data DevOps: An Overview

  • 1. Data DevOps Scott W. Ambler Consulting Methodologist | Agile Data Coach Ambysoft Inc. © Ambysoft Inc. All rights reserved. 1 Download these slides at Slideshare.net/ScottWAmbler/
  • 2. Agenda © Ambysoft Inc. All rights reserved. 2 • Data DevOps • Agile data development techniques • Agile data operations techniques • Parting thoughts • Questions and answers
  • 3. Scott Ambler © Ambysoft Inc. All rights reserved. 3 • scott@scottambler.com • Twitter: @scottwambler • linkedin.com/in/sambler/ Consulting Methodologist Ambysoft.com Thought Leader AgileData.org Thought Leader AgileModeling.com
  • 4. © Ambysoft Inc. All rights reserved. 4 Data Dev Data Ops
  • 5. Defining Data DevOps Data Development (Data Dev) The effort to develop and evolve the data aspects of your solutions Data Operations (Data Ops) The effort to operate, support, and govern the data aspects of your solutions Data DevOps The streamlined combination of Data Dev + Data Ops, part of your overall DevOps strategy © Ambysoft Inc. All rights reserved. 5
  • 6. © Ambysoft Inc. All rights reserved. 6 Data Dev Data Ops Agile data modeling Database refactoring Continuous database integration (CDI) Automated database regression testing Continuous database deployment (CDD) Operational data quality assurance Data security Critical Data DevOps Techniques
  • 7. © Ambysoft Inc. All rights reserved. 7 Data modeling is the act of exploring data-oriented structures Evolutionary data modeling is data modeling performed in an iterative and incremental manner Agile data modeling is evolutionary data modeling done in a collaborative manner Source: AgileData.org/essays/agileDataModeling.html Agile Data Modeling Construction Sprint JIT Modeling Detailed physical data modeling (to generate DDL) Sprint 0: Initial Modeling High-level conceptual modeling Test-Driven Development Detailed specification
  • 8. © Ambysoft Inc. All rights reserved. 8 Source: AgileData.org/essays/lookAheadDataAnalysis.html Agile Data Modeling: Look-Ahead Data Analysis There are several factors that will determine how far ahead you need to perform look-ahead data analysis: • The complexity of the data source(s). • Your ability to gain access to the data source(s). • The difficulty of the question being asked. • The skill, experience, and knowledge of the data analyst(s). • The availability of the data analyst(s). • Your data profiling tools. Sometimes it takes several days, even weeks, to perform data analytics before you can implement a question story.
  • 9. © Ambysoft Inc. All rights reserved. 9 A database refactoring is a simple change to a database schema that improves its design while retaining both its behavioral and informational semantics A database schema includes structural aspects such as table and view definitions; functional aspects such as stored procedures and triggers; and informational aspects such as the data itself Source: AgileData.org/essays/databaseRefactoring.html Database Refactoring
  • 10. © Ambysoft Inc. All rights reserved. 10 Source: AgileData.org/essays/databaseRefactoring.html Customer CustomerID <<PK>> Fname Customer CustomerID <<PK>> Fname FirstName SynchronizeFirstName() Customer CustomerID <<PK>> FirstName Original Schema: Interim Schema: Final Schema: Database Refactoring: Rename Column
  • 11. © Ambysoft Inc. All rights reserved. 11 • Part of building the system is building the database (if it changed) • Challenge: Tests SHOULD put the database back into a known state, but sometimes don’t • You will want to rebuild the (non-production) database from scratch every so often • Challenge: Database accesses take time • Some test suites will test against DB mocks • You still need to test the actual database occasionally • Source: AgileData.org/essays/continuousIntegration.html Continuous Database Integration (CDI)
  • 12. © Ambysoft Inc. All rights reserved. 12 Source: AgileData.org/essays/databaseTesting.html Database Regression Testing
  • 13. © Ambysoft Inc. All rights reserved. 13 Continuous Deployment between Sandboxes Development Integration 1 Integration N Production • Each integration DB must know it’s version, so as to support database refactoring • CDI differs between sandboxes • Development sandboxes often implement database mocks • Integration sandboxes should test against the database • Production has limited tests and checks for performance reasons Source: AgileData.org/essays/sandboxes.html
  • 14. Operational Data Quality Assurance There are many techniques available to ensure operational data quality • Data transformation • Data masking • Data stewardship • Data privacy • Data governance • Data source backup • Data lineage • Data history • Data cleansing Challenges: • Increasing volume of data • Increasing data arrival rate © Ambysoft Inc. All rights reserved. 14
  • 15. Data Security There are a myriad of data security issues to consider: • Data encryption • Data privacy • Access control • Secure storage • Secure deletion • And many more… © Ambysoft Inc. All rights reserved. 15
  • 16. Parting Thoughts © Ambysoft Inc. All rights reserved. 16
  • 17. © Ambysoft Inc. All rights reserved. 17 The increasing pace of change, increasing complexity, and increasing volume of data demands nothing less than complete data agility
  • 18. Thank You! © Ambysoft Inc. All rights reserved. 18 • scott@scottambler.com • Twitter: @scottwambler • linkedin.com/in/sambler/ Consulting Methodologist Ambysoft.com Thought Leader AgileData.org Thought Leader AgileModeling.com
  • 19. The Agile Data Site AgileData.org Key articles: • Data Technical Debt: How to Address Quality Problems in Data Sources • Database Techniques Stack • The Agile Data Architect © Ambysoft Inc. All rights reserved. 19
  • 20. Look-Ahead Data Analysis: Agile © Ambysoft Inc. All rights reserved. 20 Scenario: You want to implement three question stories in sprint #9 You need to: • Have a definition of ready (DoR) indicating the amount of data analysis work required • Guesstimate the amount of data analysis required for each one, and then perform the analysis sufficiently before sprint #9 • Have sufficient capacity to perform look-ahead data analysis • Interleave data analysis for other sprints into the work of the people performing it Note: Staffing your team with specialists will exacerbate work scheduling challenges. Consider generalizing specialists (AgileModeling.com/essays/generalizingSpecialists.htm) instead.
  • 21. Look-Ahead Data Analysis: Continuous Delivery © Ambysoft Inc. All rights reserved. 21 Development – QS 9a Development – QS 9b Development – QS 9c Scenario: You want to implement the same three question stories • You are not constrained by organizing the work into sprints • Development = Data analysis + other implementation work • Work can be brought into the team as capacity permits • Value is delivered when it is available • Average cycle time to deliver stories is shorter • Staffing your team with generalizing specialists may enable you to swarm around data analysis work and shorten the development time of a question story
  • 22. Agile Data Ways of Thinking (WoT) © Ambysoft Inc. All rights reserved. 22
  • 23. Agile Data Ways of Thinking (WoT) AgileData.org/essays/philosophies.html Look Beyond Data Collaborate Closely Be Quality Infected Embrace Evolution Be Enterprise Aware Fit-For-Purpose Everyone Agile © Ambysoft Inc. All rights reserved. 23