Virtual data provided by Delphix can eliminate data as a constraint in application development by enabling:
1) Fast provisioning of full-sized development databases in minutes from production data without moving large amounts of data. This allows development and testing to parallelize and find bugs earlier.
2) Self-service access to consistent, masked data for multiple use cases like development, security and cloud migration. Masking only needs to be done once before cloning databases.
3) Optimized data movement to the cloud through compression, encryption and replication of thin-cloned data sets roughly one third the size of full production databases. This improves cloud migration and enables active-active disaster recovery across sites.
Virtualized storage is fast becoming the new norm.
Nobody can justify provisioning non-production environments the way they have until now.
This presentation is about how Delphix removes the biggest bottleneck in IT operations, development, and QA by virtualizing data. It identifies the bottleneck and the impact on IT, then describes how Delphix removes it to enable DevOps continuous delivery.
Accelerating DevOps via Data Virtualization | Delphix
“Accelerating DevOps Using Data Virtualization,” presented at the Collaborate 2016 conference in Las Vegas. It discusses the inevitability of data virtualization and its many use cases.
11. Put your energy into the constraint
Top 5 constraints in IT:
1. Dev environment setup
2. QA setup
3. Code architecture
4. Development
5. Product management
- Gene Kim, from surveys of 14,000 companies and hundreds of CIOs
17. Development Pipeline for QA
[Chart: a 24-hour timeline for physical data. Each test run waits on a full data reset/refresh; refresh wait time consumes more than 80% of the cycle, leaving less than 20% for actual testing.]
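To make the chart concrete, here is the arithmetic as a minimal Python sketch; the 24-hour cycle and the >80% refresh share are taken from the slide, not measured here.

```python
# Arithmetic behind the chart: a 24-hour QA cycle in which more than 80%
# of the time goes to resetting/refreshing physical data (slide figures).
cycle_hours = 24
refresh_fraction = 0.80                   # share of the cycle spent refreshing
testing_hours = cycle_hours * (1 - refresh_fraction)
print(f"Testing time per cycle: {testing_hours:.1f} h "
      f"({1 - refresh_fraction:.0%} of the day)")   # -> 4.8 h (20% of the day)
```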
18. Data Management not Agile
• 20% of SDLC time lost waiting for data
• 60% of dev/QA time consumed by data tasks
Conclusion: data management does not scale to Agile.
- Infosys
Data is the Constraint
25. 2. Bad data leads to bugs: late stage bugs
Dev → QA → UAT → Production
26. 2. Bad data leads to bugs: late stage bugs
[Chart: number of bugs found at each stage, Dev through Production.]
27. 2. Bad data leads to bugs: late stage bugs
[Chart: cost to correct a defect rises steeply from Dev through Testing and UAT to Production. Source: Barry Boehm, Software Engineering Economics (1981).]
28. 3. Slow environment builds: delays
[Diagram: the hand-off chain for a new database environment. The developer asks for a DB; a manager approves; the DBA requests a system and sets up the DB; the system admin requests storage and sets up the machine; the storage admin allocates storage (takes a snapshot); only then does the developer get access.]
29. 3. Slow environment builds: delays. Why are hand-offs so expensive?
[Diagram: about 1 hour of actual work accumulates waits of 1 day and then 9 days as the request queues between teams.]
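A rough way to see why hand-offs dominate lead time is to add up work versus queue time across the chain. The per-step numbers below are assumptions chosen to match the slide's 1-hour/1-day/9-day contrast, not measurements:

```python
# Toy lead-time model for the environment-request chain above. Work and
# queue figures are illustrative assumptions, not measured values.
steps = [
    # (role, task, days of actual work, days queued before that team acts)
    ("Manager",       "approve request",  0.1, 1.0),
    ("DBA",           "set up database",  0.5, 3.0),
    ("System admin",  "set up machine",   0.5, 3.0),
    ("Storage admin", "allocate storage", 0.1, 2.0),
]

work = sum(w for _, _, w, _ in steps)
wait = sum(q for _, _, _, q in steps)
print(f"Actual work: {work:.1f} days; queue time: {wait:.1f} days")
print(f"Lead time: {work + wait:.1f} days "
      f"({wait / (work + wait):.0%} of it spent waiting)")
```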
51. Physical Data: late stage bugs
[Charts: with physical data, bugs discovered climb from a handful in Dev to several hundred in Production (y-axis to 500), while Boehm's cost-to-correct curve shows those late-stage bugs are the most expensive to fix.]
52. Physical Data: find bugs fast
[Chart: the same cost-to-correct curve, making the case for shifting bug discovery to the early, cheap stages (Dev and Testing).]
53. Virtual Data: Fast Refresh
[Chart: over the same 24-hour window, virtual data completes many refresh/test cycles (bookmark, reset in minutes) where physical data completes barely one.]
99% less downtime: quickly refresh • Version control: bookmark and branch • Data federation: sync across data sources
54. Virtual Data: Version Control
[Diagram: production time flow with dev branches at versions 2.1 and 2.2.]
Live-archive data for years:
• Archive EBS R11 before upgrade to R12
• Sarbanes-Oxley
• Dodd-Frank
• Financial stress tests
62. Traditional Protection: Network & Perimeter
Perimeter defense, endpoint defense, encryption, and network intrusion detection protect the edges, but the interior still needs protection.
“Organizations should use data masking to protect sensitive data at rest and in transit from insiders' and outsiders' attacks.”
- Gartner, Magic Quadrant for Data Masking Technology
63. Insider Threats Are Costly
Average annualized cybercrime cost, weighted by attack frequency (consolidated view, n = 252 companies):
• Botnets: $1,075
• Viruses, worms: $1,900
• Malware: $7,378
• Stolen devices: $33,565
• Malicious code: $81,500
• Phishing & social engineering: $85,959
• Web-based attacks: $96,424
• Denial of services: $126,545
• Malicious insiders: $144,542
- 2015 Global Cost of Cyber Crime Study, Ponemon Institute
64. [Slide contrasts the pains of physical masking with the goal: it costs more, quality is lower, it is hard to mask consistently, and moving data from prod to non-prod takes a long time, versus ease of use, instant data, and consistency.]
65. Virtual Data Masking
• Automates discovery
• Provides different masking algorithms for different data types
• Mask once, clone many with thin cloning
[Diagram: traditionally each masked copy costs an 18-hour physical clone plus a 6-hour mask; with thin cloning, data is masked once (about 4 hours) and each further masked clone takes about 15 minutes.]
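The payoff of mask-once/clone-many is easy to quantify. Using the timings on the slide (and an assumed need for five masked copies), a back-of-envelope comparison looks like this:

```python
# Back-of-envelope timings from the slide: traditionally each masked copy
# needs an 18 h physical clone plus a 6 h mask; with thin cloning, one 4 h
# mask is followed by 15-minute clones. N = 5 copies is an assumption.
N = 5
traditional_hours = N * (18 + 6)        # clone + mask, repeated per copy
virtual_hours = 4 + N * 0.25            # mask once, then thin clones
print(f"Traditional: {traditional_hours} h for {N} masked copies")
print(f"Mask once, clone many: {virtual_hours:.2f} h for {N} masked copies")
```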
80. 9 TB database, 1 TB of change per day, 30 days
[Chart: storage required (TB) over four weeks. Full physical Oracle copies grow toward 60-70 TB, while Delphix stays near the size of the compressed original.]
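The storage math behind the chart can be sketched as follows; the database size and change rate come from the slide, while the copy count and the roughly 3:1 compression ratio are assumptions based on figures elsewhere in the talk:

```python
# Storage needed for dev/test copies of a 9 TB database changing 1 TB/day.
db_tb, change_tb_per_day, days = 9, 1, 28
copies = 4                 # number of full-size copies needed (assumption)
compression = 3.0          # ~3:1 compression ratio (assumption)

physical_tb = copies * db_tb                    # every copy is a full image
virtual_tb = (db_tb + days * change_tb_per_day) / compression  # shared by clones

print(f"{copies} physical copies: {physical_tb} TB")
print(f"Compressed baseline + {days} days of changes, shared by {copies} "
      f"virtual clones: {virtual_tb:.1f} TB")
```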
81. RPO & RTO
• RPO: any time in the last 30 days, down to the second
• RTO: minutes, push-button
[Chart: storage (TB) over four weeks, Delphix retention vs. the original.]
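In other words, because changed blocks are captured continuously and retained for the whole window, any second in the last 30 days is a valid restore point. A minimal sketch of that recovery-point check (the dates are made up for the example):

```python
# RPO sketch: with a continuous 30-day retention window, any requested
# point in time inside the window can be provisioned, down to the second.
from datetime import datetime, timedelta

RETENTION = timedelta(days=30)

def restorable(requested: datetime, now: datetime) -> bool:
    return now - RETENTION <= requested <= now

now = datetime(2016, 4, 12, 9, 30, 0)
print(restorable(datetime(2016, 3, 20, 14, 59, 59), now))  # True: inside window
print(restorable(datetime(2016, 1, 1, 0, 0, 0), now))      # False: outside window
```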
87. Virtual Data Quotes
• Projects went “12 months to 6 months.” - New York Life
• Insurance product went “about 50 days ... to about 23 days.” - Presbyterian Health
• “Can't imagine working without it.” - State of California
90. “A database refresh in 15 minutes? That is mind blowing! Delphix nailed it for us.”
- Matt Lawrence, Sr Director, Wind River (Intel)
“It took 3 weeks to build a dev environment; now with Delphix it takes less than a day, and the DB part is less than 15 minutes.”
- Marty Boos, StubHub (eBay)
“Delphix goes beyond storage. Delphix is so much more than we thought it was.”
- Michael Brown, State of Colorado
91. “Worth investing in this product: the technology is strong and the value prop is high.”
- Deloitte
“I'm convinced by Delphix's technology. Delphix can really increase the quality of Dev/QA.”
- Oaktable Member
“Delphix allows us to move fast and set up database copies in seconds. Delphix is powerful and allowed us to scale from 2 projects to 11. We need Delphix to scale our agile environment.”
- Tim Campos, CIO, Facebook
92. The Goal: eliminate the constraint
“Improvement not made at the constraint is an illusion.”
- Theory of Constraints
“If you look at what's really impeding flow from development to operations to the customer, it's typically IT operations. Operations can never deliver environments on demand; you have to wait months or quarters to get a test environment. When that happens, terrible things happen: people hoard environments, they invite people onto their teams because they know they have a reputation for having a cluster of test environments, and people end up testing on environments that are years old, which doesn't actually achieve the goal.
One of the most powerful things that organizations can do is to enable development and testing to get the environments they need when they need them.”
One of the best predictors of DevOps performance is that IT Operations can make environments available on demand to Development and Test, so that they can build and test the application in an environment that is synchronized with Production.
Eliyahu Goldratt
IT bottlenecks
Setting priorities
Company goals
Defining metrics
Fast iterations
The IT version of “The Goal” by E. Goldratt
“One of the most powerful things that organizations can do is to enable development and testing to get the environments they need when they need them.”
What happens now in the industry? Typically the application development life cycle is something like this:
We have some production database with production applications running on top of it, and we have developers either customizing that application or writing new functionality for it.
We need copies of that data to make sure our code runs correctly when it gets to production.
We have teams of people (DBAs, sys admins, storage admins, etc.) making these copies. It's slow work to copy all this data. It's tedious work. All the while we have developers and QA testers waiting for these copies.
1. Not enough resources: contention on shared environments; lack of enough environments.
2. Late-stage bug discovery: faulty data leading to bugs (subsets, synthetic data, old data).
3. Slow environment builds: delays; developers waiting; QA slow and expensive.
Not sure if you've run into this, but I have personally experienced the following.
When I was talking to one development group at eBay, they shared a single copy of the production database between the developers on that team. Sharing a single copy of production meant that whenever a developer wanted to modify that database, they had to submit their changes to code review, and that code review took 1 to 2 weeks. I don't know about you, but that kind of delay would stifle my motivation, and I have direct experience with the kind of disgruntlement it can cause.
When I was last a DBA, all schema changes went through me. It took me about half a day to process schema changes. That delay was too much, so the developers unilaterally decided to go to an EAV (entity-attribute-value) schema, which meant they could add new fields without consulting me and without stepping on each other's feet. It also meant the SQL code was unreadable and performance was atrocious.
Besides creating developer frustration, sharing a database also makes refreshing the data difficult: it takes a while to refresh the full copy, and even longer to coordinate a time when everyone stops using the copy so the refresh can happen. The result is that the copy rarely gets refreshed and the data gets old and unreliable.
To circumvent the problems of sharing a single copy of production, many shops we talk to create subsets. One company we talked to spends 50% of its time copying databases; they have to subset because there isn't enough storage, and the subsetting process constantly needs fixing and modification.
Now what happens when developers use subsets?
We talked to Presbyterian Healthcare, and they told us that they spend 96% of their QA cycle time building the QA environment and only 4% actually running the QA suite. This happens for every QA run, meaning that for every dollar spent on QA there are only 4 cents of actual QA value; the other 96% is infrastructure time and overhead.
Internet vs browser
Automate or die – the revolution will be automated
The worst enemy of companies today is thinking that they have the best processes that exist, that their IT organizations are using the latest and greatest technology, and that nothing better exists in the field. This mentality will be the undoing of many companies.
http://www.kylehailey.com/automate-or-die-the-revolution-will-be-automated/
Data IS the constraint
Business skeptics are saying to themselves that data processes are just a rounding error in most of their project timelines, and that they are sure their IT has developed processes to fix that. That's the fundamental mistake. The very large and often hidden data tax lies in all the ways that we've optimized our software, data protection, and decision systems around the expectation that data is simply not virtual. The belief that there is no agility problem is part of the problem.
http://www.kylehailey.com/data-is-the-constraint/
Due to the constraints of building clone-copy database environments, one ends up in a “culture of no,” where developers stop asking for a copy of a production database because the answer is always “no.” If developers need to debug an anomaly seen on production, or need to write a custom module that requires a copy of production, they know not to even ask, and just give up.
The fastest query is the query not run.
In the physical database world, 3 clones take up 3x the storage. In the virtual world, 3 clones take up 1/3 the storage, thanks to block sharing and compression. Delphix radically changes this paradigm.
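The reason clones are so cheap is copy-on-write block sharing. Delphix's file system is proprietary, but the underlying idea can be shown in a few lines of Python: clones share the parent's blocks and store only the blocks they overwrite.

```python
# Toy copy-on-write store: each clone keeps only its own modified blocks,
# so three clones cost a fraction of one full physical copy.
class ThinClone:
    def __init__(self, parent_blocks):
        self.parent = parent_blocks      # shared, read-only baseline
        self.delta = {}                  # private copies of modified blocks

    def read(self, i):
        return self.delta.get(i, self.parent[i])

    def write(self, i, data):
        self.delta[i] = data             # copy-on-write: divergence stays local

baseline = {i: f"block{i}" for i in range(100_000)}   # the one full copy
clones = [ThinClone(baseline) for _ in range(3)]
clones[0].write(42, "changed")
print(clones[0].read(42), clones[1].read(42))   # 'changed' 'block42'
print(sum(len(c.delta) for c in clones), "private blocks across 3 clones")
```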
Delphix is software that we provide as a virtual machine OVA file that you spin up on any commodity Intel hardware. You give it any storage, and Delphix maps its own proprietary file system onto that storage. Through the web UI you can point it at any database or data source, such as Oracle, SQL Server, Sybase, Postgres, flat files, etc.
At link time we take one full copy; we only do it once and never again. We compress the data, so if the data is 3 TB on the source it will be about 1 TB on Delphix. From then on, forever, we just pull in the changed blocks. With the changed blocks, Delphix builds up a timeline of data versions. The default window is 2 weeks, but you can configure it to be 2 months or 2 years, and you can spin up a copy of the data, down to the second, at any point in that time window.
Now, with a few clicks of a mouse and in a few minutes, we can spin up copies on developer machines, QA machines, UAT, etc. When we make copies, no data is moved; we just point the copies to data that already exists on Delphix. There is no data on the target machines; all the data is on Delphix, which looks like a NAS or NFS file server to the target machines. We give them a read-writeable point-in-time snapshot of the data.
We also track all the block changes on the virtual databases. With that block change tracking we can do cool things like roll them back, branch them, version them, share them, and bookmark the data.
All this is super simple to run; Delphix can generally be run by a junior DBA in a quarter of their time. The coolest thing, especially for a DevOps process, is the self-service interface for developers and testers, where they can refresh data from production, roll back changes, and bookmark and share data between dev and QA. We can treat data the way we treat code.
For example, StubHub went from 5 copies of production in development to 120, giving each developer their own copy. StubHub estimated a 20% reduction in bugs that made it to production.
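The “timeline of data versions” works the same way: after the one initial sync, only changed blocks are captured, and a virtual copy can be materialized as of any captured moment. A hypothetical sketch of the idea (the names and structure are illustrative, not Delphix's implementation):

```python
# Toy "time flow": a baseline plus timestamped change sets; provisioning a
# copy as of time ts replays changes up to ts onto the baseline.
import bisect

class TimeFlow:
    def __init__(self, baseline):
        self.baseline = dict(baseline)
        self.changes = []                 # (timestamp, {block: data}), in order

    def capture(self, ts, changed_blocks):
        self.changes.append((ts, dict(changed_blocks)))

    def provision(self, ts):
        """Return the data image as of ts (baseline + changes up to ts)."""
        image = dict(self.baseline)
        cutoff = bisect.bisect_right([t for t, _ in self.changes], ts)
        for _, delta in self.changes[:cutoff]:
            image.update(delta)
        return image

tf = TimeFlow({1: "a", 2: "b"})
tf.capture(100, {2: "b2"})
tf.capture(200, {1: "a2"})
print(tf.provision(150))   # {1: 'a', 2: 'b2'}: the state as of t = 150
```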
Slowdowns mean bottlenecks.
Physically independent but logically correlated.
Cloning multiple source databases at the same point in time can be a daunting task. One example from our customers is Informatica, who had a project to integrate 6 databases into one central database. The project was estimated at 12 months, with much of that time coming from orchestrating copies of the 6 databases at the same point in time. Like herding cats.
Walmart.com
Informatica had a 12-month project to integrate 6 databases. After installing Delphix they did it in 6 months: “I delivered this early. I generated more revenue. I freed up money and put it into innovation.” They won an award from Ventana Research for this project.
Data masking should be a budgeted item in enterprise IT spending. JP Morgan, joined by other banks and major companies, is going to spend a large amount on cybersecurity, yet still doesn't feel that this sum is enough. Why is that?
Traditional security is network security, a.k.a. perimeter defense; it keeps the exterior protected. It is enhanced by endpoint defense, which locks down phones and laptops in this era of bring-your-own-device (BYOD). That said, organizations are taking increasingly longer to detect network and system intrusions: according to a Trustwave survey, in 80% of cases an external party informed the company of the breach.
That's why it's so important to protect the interior, i.e., the data itself. As an analogy, perimeter security is like building castle walls, but protecting the interior means strong body armor for all of the knights you send out onto the open battlefield.
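For illustration, masking at the data layer often uses deterministic pseudonymization, so the same input always masks to the same output and joins across tables stay intact. This is a generic sketch of that technique, not Delphix's specific algorithm set; the salt and name list are made up:

```python
# Deterministic masking sketch: hash-based substitution keeps referential
# integrity (the same real value always maps to the same masked value).
import hashlib

FAKE_NAMES = ["Alice", "Bob", "Carol", "Dave", "Erin", "Frank"]

def mask_name(real: str, salt: str = "per-project-secret") -> str:
    h = hashlib.sha256((salt + real).encode()).digest()
    return FAKE_NAMES[h[0] % len(FAKE_NAMES)]

def mask_ssn(real: str, salt: str = "per-project-secret") -> str:
    h = hashlib.sha256((salt + real).encode()).hexdigest()
    d = "".join(str(int(c, 16) % 10) for c in h[:9])
    return f"{d[:3]}-{d[3:5]}-{d[5:]}"

print(mask_name("Kyle"), mask_ssn("123-45-6789"))
print(mask_name("Kyle") == mask_name("Kyle"))   # True: deterministic
```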
Unshackle yourself from massive infrastructure drag and bureaucratic quagmires, and put a jetpack on your IT organizations and application development projects.
Moving the data IS the big gorilla. Eliminating the data tax is crucial to the success of your company, and if huge databases can be ready at target data centers in minutes, the rest of the excuses are flimsy.
Virtual data, i.e., virtualized data, uses a small footprint. A truly virtual data platform can deliver full-size datasets cheaper than subsets; it can move the time or location pointer on its data very rapidly; and it can store any version that's needed in a library at an unbelievably low cost. It can also massively improve app quality by making it reliable and dead simple to return to a common baseline for one or many databases in a very short amount of time.
Applications delivered with agile data can afford a lot more full-size virtual copies, eliminating the wait time and extra work caused by sharing, as well as its side effects. With the cost of data falling so dramatically, businesses can radically increase their utilization of existing hardware and storage, delivering much more rapidly without any additional cost. An agile data platform presents data so rapidly and reliably that the data becomes commoditized, and servers that sit idle because it would take too long to rebuild them can now switch roles on demand.
Now let's look at Delphix Data as a Service. With Delphix and Data as a Service, provisioning copies of data becomes push-button functionality that finishes in minutes. How does this work?
Delphix is provided as software: a virtual machine. The Delphix virtual machine manages storage and maps its own advanced specialized file system onto that storage; Delphix can be used with any storage, such as EMC, NetApp, Fujitsu, JBODs, etc. Once Delphix is installed and has been allocated storage, it can be pointed at a data source. Once, and only once, Delphix pulls in a full copy of the data source and compresses it. From then on, Delphix just pulls in the changed data blocks and stores them, creating a timeline of data.
From that timeline, clone copies of production can be spun up in minutes on target machines. The clones can be made at any point in time in the Delphix time flow, down to the second. Each clone is, for all intents and purposes, a completely independent, full-size, read/write copy of production. Delphix can typically be managed by a single person in just a fraction of their time.
Delphix provides a special developer-centric self-service interface for developers and QA, where developers can provision their own copies of data and have access to typical developer features such as rollback, bookmark, branching, and refresh.
Icon made by Freepik (http://www.freepik.com) from www.flaticon.com, licensed under CC BY 3.0 (http://creativecommons.org/licenses/by/3.0/).