SlideShare a Scribd company logo
1 of 32
Test Data Management
Harald Kikkers, Maarten Urbach & Bert Nienhuis
DATPROF
Data IntegrationTest Data Management
• Dutch Software supplier
• Founded in 1998
• Partners: ITCG, Sogeti, …
…and many more!
MANY
ORGANISATIONS
USE MULTIPLE COPIES OF
PRODUCTION DATABASES
PURPOSES:
• TESTING
• DEVELOPMENT
• OUTSOURCING
• MARKETING
• TRAINING
Agile Development
• Building the right product
• Room for change
• Every 2-4 weeks working increments of the software
• Progress in development
How to test all these iterations?
And… what data to use?
Team 1 Team 2 Team 3
6 TB 500 GB
Production
10 GB
6 TB 500 GB
Test
10 GB 6 TB 500 GB
Development
10 GB
Total
19,53 TB
Team 1 Team 2 Team 3
6 TB 500 GB
Production
10 GB
6 TB 500 GB
Test
10 GB 6 TB 500 GB
Development
10 GB
Total
19,53 TB
Team 1 Team 2 Team 3
Test
Team 1 Team 2 Team 3
Development
Team 1 Team 2 Team 3
6 TB 500 GB
Production
10 GB
6 TB 500 GB
Development
10 GB
6 TB 500 GB
Test
10 GB
6 TB 500 GB
Development
10 GB
6 TB 500 GB
Test
10 GB
6 TB 500 GB
Development
10 GB
6 TB 500 GB
Test
10 GB
Total
45,57 TB
Team 1 Team 2 Team 3
6 TB 500 GB
Production
10 GB
600 GB 50 GB
Development
1 GB
600 GB 50 GB
Test
1 GB
600 GB 50 GB
Development
1 GB
600 GB 50 GB
Test
1 GB
600 GB 50 GB
Development
1 GB
600 GB 50 GB
Test
1 GB
Total
10.4 TB
10 % Subset 10 % Subset 10 % Subset
Development
Test
Development
Test
Development
Test
How to protect
sensitive customer data?
Test Test Test
Development Development Development
Minimize data usage
Save on hardware & infra
Reduce throughput times
Efficient data management
Protect customer information
Comply with regislation
Prevent brand damage
Maintain competitive advantages
Subsetting Anonymizing
Advantages of subsetting data Advantages of scrambling & masking data
DBA Tools ETL Suites
100$ tools IBM, Informatica, Oracle
DBA Tools ETL Suites
?
DBA Tools ETL Suites
- User Experience
- Default templates
- Easy to maintain
- Smart functionality
- Chain support
DBA Tools ETL Suites
Production Test/Development
Source Database Target Database
Data model classification
Subset – Process data
Example: Customers, Orders, Contracts, Invoices, Transactions
Full – Master data
Example: Application data, configuration, master tables
Embty – Logging, non relevant history
Example: Logging tables, temp tabellen
Determine data to be subsetted
Chain of systems
Method for deriving consistent subsets from multiple systems
Production Test/Development
Start Filter
All customers from The
Netherlands
Start Filter
All orders from customers in
the previous subset.
Import
Meta data Classification Deployment
Anonymization of sensitive data
- Bank account balance
- Dept
- Medication
- Illness
- Religion
- Political preference
- Salary
- Phone history
- Et cetera…
- Name
- Date of birth
- Email
- Bank account number
- Social security number
- Adress
- Insurance number
- Cellphone number
- Et cetera..
Personal data
Identifying Characteristics
“Any information relating to an identified or identifiable natural person ("data subject")
Source: Data Protection Directive - Directive 95/46/EC
Techniques
Shuffle
Shuffle values within same column
Conditional
Manipulate specified rows+
First name Last name Type
John
Max
Joe
Clark
Smith
Williams
DATPROF
Customer
Customer
Customer
Company
321
First name Last name Type Comment E-Mail
John
Max
Joe
Smith
Williams
Clark
Blank
Delete values from columns
Scramble
Replace existing characters
j.clark@live.com
Smith_max@mail.com
i_am@JoeWilliams.de
“Brother of J. Clark”
“Has dept”
Customer
Customer
Customer
CompanyDATPROF
Nr. First name Last name Type Co.. E-mail Date of Birth
John
Max
Joe
Smith
Williams
Clark
DATPROF
123
Customer
Customer
Customer
Company
321
789
456
First day
Change dates to first day within same month and year
01-02-1954
01-11-1984
01-03-1974
Postal code
Date of Birth 1st day of month 1st day of year
87% 3.7% 0.04%
Source: research anonimity by Prof. Dr. Latanya Sweeney (Harvard University)
x.xxxxx@xxxx...
Xxxxx_xxx@xx...
x_xx@XxxXxxx...
Nr. First name Last name Type .. E-mail Date of birth
123
321
789
01-02-1954
01-11-1984
01-03-1974
Look-up
Replace values with values from a lookup table
James
Adrian
Thomas
John
Max
Joe
First names
Chris
Thomas
James
Ruben
Adrian
Michael
David
Reference data
Smith
Williams
Clark
DATPROF
Customer
Customer
Customer
Company
x.xxxxx@xxxx...
Xxxxx_xxx@xx...
x_xx@XxxXxxx...
Nr. First name Last name Type Comment E-mail Date of birth
Thomas
James
Adrian
Smith
Williams
Clark
DATPROF
123
Customer
Customer
Customer
Company
321
789
456
01-02-1954
01-11-1984
01-03-1974
Expression
Use custom made functions
Scrambled T.Smith@datprof.com
J.Willams@datprof.com
A.Clark@datprof.com
Scrambled
Scrambled
Import
Meta data
Define masking
rules
3. Deployment

More Related Content

What's hot

Automate data warehouse etl testing and migration testing the agile way
Automate data warehouse etl testing and migration testing the agile wayAutomate data warehouse etl testing and migration testing the agile way
Automate data warehouse etl testing and migration testing the agile wayTorana, Inc.
 
Test Data Management a Managed Service for Software Quality Assurance
Test Data Management a Managed Service for Software Quality AssuranceTest Data Management a Managed Service for Software Quality Assurance
Test Data Management a Managed Service for Software Quality AssuranceSoftware Testing Solution
 
Joseph Ours - The Scourge Of Testing: Test Data Management
Joseph Ours - The Scourge Of Testing: Test Data ManagementJoseph Ours - The Scourge Of Testing: Test Data Management
Joseph Ours - The Scourge Of Testing: Test Data ManagementQA or the Highway
 
Test Data Management and Its Role in DevOps
Test Data Management and Its Role in DevOpsTest Data Management and Its Role in DevOps
Test Data Management and Its Role in DevOpsTechWell
 
Testing the Data Warehouse―Big Data, Big Problems
Testing the Data Warehouse―Big Data, Big ProblemsTesting the Data Warehouse―Big Data, Big Problems
Testing the Data Warehouse―Big Data, Big ProblemsTechWell
 
Data summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data opsData summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data opsRyan Gross
 
Transforming Business Intelligence Testing
Transforming Business Intelligence TestingTransforming Business Intelligence Testing
Transforming Business Intelligence TestingMethod360
 
Data Warehouse Testing: It’s All about the Planning
Data Warehouse Testing: It’s All about the PlanningData Warehouse Testing: It’s All about the Planning
Data Warehouse Testing: It’s All about the PlanningTechWell
 
How to prepare data before a data migration
How to prepare data before a data migrationHow to prepare data before a data migration
How to prepare data before a data migrationETLSolutions
 
Testing Big Data: Automated Testing of Hadoop with QuerySurge
Testing Big Data: Automated  Testing of Hadoop with QuerySurgeTesting Big Data: Automated  Testing of Hadoop with QuerySurge
Testing Big Data: Automated Testing of Hadoop with QuerySurgeRTTS
 
Deliver Trusted Data by Leveraging ETL Testing
Deliver Trusted Data by Leveraging ETL TestingDeliver Trusted Data by Leveraging ETL Testing
Deliver Trusted Data by Leveraging ETL TestingCognizant
 
T2 Tech Group Overview PowerPoint
T2 Tech Group Overview PowerPointT2 Tech Group Overview PowerPoint
T2 Tech Group Overview PowerPointKevin Torf
 
15 Tips on Salesforce Data Migration - Naveen Gabrani & Jonathan Osgood
15 Tips on Salesforce Data Migration - Naveen Gabrani & Jonathan Osgood15 Tips on Salesforce Data Migration - Naveen Gabrani & Jonathan Osgood
15 Tips on Salesforce Data Migration - Naveen Gabrani & Jonathan OsgoodSalesforce Admins
 
Test Data Management: A Healthcare Industry Case Study
Test Data Management: A Healthcare Industry Case StudyTest Data Management: A Healthcare Industry Case Study
Test Data Management: A Healthcare Industry Case StudyTechWell
 
20171019 data migration (rk)
20171019 data migration (rk)20171019 data migration (rk)
20171019 data migration (rk)Ruud Kapteijn
 
"How to document your decisions", Dmytro Ovcharenko
"How to document your decisions", Dmytro Ovcharenko "How to document your decisions", Dmytro Ovcharenko
"How to document your decisions", Dmytro Ovcharenko Fwdays
 
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...Mihai Criveti
 

What's hot (20)

Automate data warehouse etl testing and migration testing the agile way
Automate data warehouse etl testing and migration testing the agile wayAutomate data warehouse etl testing and migration testing the agile way
Automate data warehouse etl testing and migration testing the agile way
 
Test Data Management a Managed Service for Software Quality Assurance
Test Data Management a Managed Service for Software Quality AssuranceTest Data Management a Managed Service for Software Quality Assurance
Test Data Management a Managed Service for Software Quality Assurance
 
Joseph Ours - The Scourge Of Testing: Test Data Management
Joseph Ours - The Scourge Of Testing: Test Data ManagementJoseph Ours - The Scourge Of Testing: Test Data Management
Joseph Ours - The Scourge Of Testing: Test Data Management
 
Test Data Management and Its Role in DevOps
Test Data Management and Its Role in DevOpsTest Data Management and Its Role in DevOps
Test Data Management and Its Role in DevOps
 
Testing the Data Warehouse―Big Data, Big Problems
Testing the Data Warehouse―Big Data, Big ProblemsTesting the Data Warehouse―Big Data, Big Problems
Testing the Data Warehouse―Big Data, Big Problems
 
Test Automation for Data Warehouses
Test Automation for Data Warehouses Test Automation for Data Warehouses
Test Automation for Data Warehouses
 
Data summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data opsData summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data ops
 
Transforming Business Intelligence Testing
Transforming Business Intelligence TestingTransforming Business Intelligence Testing
Transforming Business Intelligence Testing
 
Fraction ERP Overview
Fraction ERP OverviewFraction ERP Overview
Fraction ERP Overview
 
Data Quality Everywhere
Data Quality EverywhereData Quality Everywhere
Data Quality Everywhere
 
Data Warehouse Testing: It’s All about the Planning
Data Warehouse Testing: It’s All about the PlanningData Warehouse Testing: It’s All about the Planning
Data Warehouse Testing: It’s All about the Planning
 
How to prepare data before a data migration
How to prepare data before a data migrationHow to prepare data before a data migration
How to prepare data before a data migration
 
Testing Big Data: Automated Testing of Hadoop with QuerySurge
Testing Big Data: Automated  Testing of Hadoop with QuerySurgeTesting Big Data: Automated  Testing of Hadoop with QuerySurge
Testing Big Data: Automated Testing of Hadoop with QuerySurge
 
Deliver Trusted Data by Leveraging ETL Testing
Deliver Trusted Data by Leveraging ETL TestingDeliver Trusted Data by Leveraging ETL Testing
Deliver Trusted Data by Leveraging ETL Testing
 
T2 Tech Group Overview PowerPoint
T2 Tech Group Overview PowerPointT2 Tech Group Overview PowerPoint
T2 Tech Group Overview PowerPoint
 
15 Tips on Salesforce Data Migration - Naveen Gabrani & Jonathan Osgood
15 Tips on Salesforce Data Migration - Naveen Gabrani & Jonathan Osgood15 Tips on Salesforce Data Migration - Naveen Gabrani & Jonathan Osgood
15 Tips on Salesforce Data Migration - Naveen Gabrani & Jonathan Osgood
 
Test Data Management: A Healthcare Industry Case Study
Test Data Management: A Healthcare Industry Case StudyTest Data Management: A Healthcare Industry Case Study
Test Data Management: A Healthcare Industry Case Study
 
20171019 data migration (rk)
20171019 data migration (rk)20171019 data migration (rk)
20171019 data migration (rk)
 
"How to document your decisions", Dmytro Ovcharenko
"How to document your decisions", Dmytro Ovcharenko "How to document your decisions", Dmytro Ovcharenko
"How to document your decisions", Dmytro Ovcharenko
 
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
 

Viewers also liked

Test Data Management 101—Featuring a Tour of CA Test Data Manager (Formerly G...
Test Data Management 101—Featuring a Tour of CA Test Data Manager (Formerly G...Test Data Management 101—Featuring a Tour of CA Test Data Manager (Formerly G...
Test Data Management 101—Featuring a Tour of CA Test Data Manager (Formerly G...CA Technologies
 
Linked Data for Information Extraction Challenge - Tasks and Results @ ISWC 2014
Linked Data for Information Extraction Challenge - Tasks and Results @ ISWC 2014Linked Data for Information Extraction Challenge - Tasks and Results @ ISWC 2014
Linked Data for Information Extraction Challenge - Tasks and Results @ ISWC 2014Robert Meusel
 
Case Study: Manheim Implements Test Data Management to Reduce Testing Time an...
Case Study: Manheim Implements Test Data Management to Reduce Testing Time an...Case Study: Manheim Implements Test Data Management to Reduce Testing Time an...
Case Study: Manheim Implements Test Data Management to Reduce Testing Time an...CA Technologies
 
Test Automation NYC 2014
Test Automation NYC 2014Test Automation NYC 2014
Test Automation NYC 2014Kishore Bhatia
 
Ibm test data_management_v0.4
Ibm test data_management_v0.4Ibm test data_management_v0.4
Ibm test data_management_v0.4Rosario Cunha
 
Tech Vision: Next-Generation Performance Testing With BlazeMeter, Service Vir...
Tech Vision: Next-Generation Performance Testing With BlazeMeter, Service Vir...Tech Vision: Next-Generation Performance Testing With BlazeMeter, Service Vir...
Tech Vision: Next-Generation Performance Testing With BlazeMeter, Service Vir...CA Technologies
 
How to define mobile automation strategy
How to define mobile automation strategyHow to define mobile automation strategy
How to define mobile automation strategySelin Gungor
 
Test Data Management: The Underestimated Pain
Test Data Management: The Underestimated PainTest Data Management: The Underestimated Pain
Test Data Management: The Underestimated PainChelsea Frischknecht
 
ISTQB, ISEB Lecture Notes- 2
ISTQB, ISEB Lecture Notes- 2ISTQB, ISEB Lecture Notes- 2
ISTQB, ISEB Lecture Notes- 2onsoftwaretest
 
OSI Referans Modeli ve Katmanları - Alican Uzunhan
OSI Referans Modeli ve Katmanları - Alican UzunhanOSI Referans Modeli ve Katmanları - Alican Uzunhan
OSI Referans Modeli ve Katmanları - Alican UzunhanMesut Güneş
 
ISTQB Foundation Level Basic
ISTQB Foundation Level BasicISTQB Foundation Level Basic
ISTQB Foundation Level BasicSelin Gungor
 
Performance Testing
Performance TestingPerformance Testing
Performance TestingSelin Gungor
 
Software development life cycle yazılım geliştirme yaşam döngüsü
Software development life cycle   yazılım geliştirme yaşam döngüsüSoftware development life cycle   yazılım geliştirme yaşam döngüsü
Software development life cycle yazılım geliştirme yaşam döngüsüMesut Günes
 
ISTQB Projelerde Spesifikasyona Dayalı Test Teknikleri
ISTQB Projelerde Spesifikasyona Dayalı Test TeknikleriISTQB Projelerde Spesifikasyona Dayalı Test Teknikleri
ISTQB Projelerde Spesifikasyona Dayalı Test TeknikleriPEM Proje Eğitim Merkezi
 
Qtp 9.5 Tutorials by www.onsoftwaretest.com
Qtp 9.5 Tutorials by www.onsoftwaretest.comQtp 9.5 Tutorials by www.onsoftwaretest.com
Qtp 9.5 Tutorials by www.onsoftwaretest.comonsoftwaretest
 

Viewers also liked (20)

Comparación
ComparaciónComparación
Comparación
 
Test Data Management 101—Featuring a Tour of CA Test Data Manager (Formerly G...
Test Data Management 101—Featuring a Tour of CA Test Data Manager (Formerly G...Test Data Management 101—Featuring a Tour of CA Test Data Manager (Formerly G...
Test Data Management 101—Featuring a Tour of CA Test Data Manager (Formerly G...
 
Linked Data for Information Extraction Challenge - Tasks and Results @ ISWC 2014
Linked Data for Information Extraction Challenge - Tasks and Results @ ISWC 2014Linked Data for Information Extraction Challenge - Tasks and Results @ ISWC 2014
Linked Data for Information Extraction Challenge - Tasks and Results @ ISWC 2014
 
Case Study: Manheim Implements Test Data Management to Reduce Testing Time an...
Case Study: Manheim Implements Test Data Management to Reduce Testing Time an...Case Study: Manheim Implements Test Data Management to Reduce Testing Time an...
Case Study: Manheim Implements Test Data Management to Reduce Testing Time an...
 
Agile Testing
Agile Testing Agile Testing
Agile Testing
 
Need for scaling agile
Need for scaling agileNeed for scaling agile
Need for scaling agile
 
Test Automation NYC 2014
Test Automation NYC 2014Test Automation NYC 2014
Test Automation NYC 2014
 
Ibm test data_management_v0.4
Ibm test data_management_v0.4Ibm test data_management_v0.4
Ibm test data_management_v0.4
 
Tech Vision: Next-Generation Performance Testing With BlazeMeter, Service Vir...
Tech Vision: Next-Generation Performance Testing With BlazeMeter, Service Vir...Tech Vision: Next-Generation Performance Testing With BlazeMeter, Service Vir...
Tech Vision: Next-Generation Performance Testing With BlazeMeter, Service Vir...
 
Scrum best practices
Scrum best practicesScrum best practices
Scrum best practices
 
How to define mobile automation strategy
How to define mobile automation strategyHow to define mobile automation strategy
How to define mobile automation strategy
 
Test Data Management: The Underestimated Pain
Test Data Management: The Underestimated PainTest Data Management: The Underestimated Pain
Test Data Management: The Underestimated Pain
 
ISTQB, ISEB Lecture Notes- 2
ISTQB, ISEB Lecture Notes- 2ISTQB, ISEB Lecture Notes- 2
ISTQB, ISEB Lecture Notes- 2
 
OSI Referans Modeli ve Katmanları - Alican Uzunhan
OSI Referans Modeli ve Katmanları - Alican UzunhanOSI Referans Modeli ve Katmanları - Alican Uzunhan
OSI Referans Modeli ve Katmanları - Alican Uzunhan
 
ISTQB Foundation Level Basic
ISTQB Foundation Level BasicISTQB Foundation Level Basic
ISTQB Foundation Level Basic
 
Performance Testing
Performance TestingPerformance Testing
Performance Testing
 
Software development life cycle yazılım geliştirme yaşam döngüsü
Software development life cycle   yazılım geliştirme yaşam döngüsüSoftware development life cycle   yazılım geliştirme yaşam döngüsü
Software development life cycle yazılım geliştirme yaşam döngüsü
 
ISTQB PROJELERDE HATA YÖNETİMİ
ISTQB PROJELERDE HATA YÖNETİMİISTQB PROJELERDE HATA YÖNETİMİ
ISTQB PROJELERDE HATA YÖNETİMİ
 
ISTQB Projelerde Spesifikasyona Dayalı Test Teknikleri
ISTQB Projelerde Spesifikasyona Dayalı Test TeknikleriISTQB Projelerde Spesifikasyona Dayalı Test Teknikleri
ISTQB Projelerde Spesifikasyona Dayalı Test Teknikleri
 
Qtp 9.5 Tutorials by www.onsoftwaretest.com
Qtp 9.5 Tutorials by www.onsoftwaretest.comQtp 9.5 Tutorials by www.onsoftwaretest.com
Qtp 9.5 Tutorials by www.onsoftwaretest.com
 

Similar to DATPROF Test data Management (data privacy & data subsetting) - English

Lauri Pietarinen - What's Wrong With My Test Data
Lauri Pietarinen - What's Wrong With My Test DataLauri Pietarinen - What's Wrong With My Test Data
Lauri Pietarinen - What's Wrong With My Test DataTEST Huddle
 
DATA Warehousing & Data Mining
DATA Warehousing & Data MiningDATA Warehousing & Data Mining
DATA Warehousing & Data Miningcpjcollege
 
Business Analytics Paradigm Change
Business Analytics Paradigm ChangeBusiness Analytics Paradigm Change
Business Analytics Paradigm ChangeDmitry Anoshin
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?RTTS
 
UNIT - 1 : Part 1: Data Warehousing and Data Mining
UNIT - 1 : Part 1: Data Warehousing and Data MiningUNIT - 1 : Part 1: Data Warehousing and Data Mining
UNIT - 1 : Part 1: Data Warehousing and Data MiningNandakumar P
 
Splunk Business Analytics
Splunk Business AnalyticsSplunk Business Analytics
Splunk Business AnalyticsCleverDATA
 
Optimize IT Infrastructure
Optimize IT InfrastructureOptimize IT Infrastructure
Optimize IT InfrastructureScalar Decisions
 
Introduction to Big Data Analytics
Introduction to Big Data AnalyticsIntroduction to Big Data Analytics
Introduction to Big Data AnalyticsUtkarsh Sharma
 
Improve the Health of Your Data
Improve the Health of Your DataImprove the Health of Your Data
Improve the Health of Your DataRTTS
 
Biz Nova It Project Bonus Slides
Biz Nova It Project Bonus SlidesBiz Nova It Project Bonus Slides
Biz Nova It Project Bonus SlidesTyHowardPMP
 
Data analytics and analysis trends in 2015 - Webinar
Data analytics and analysis trends in 2015 - WebinarData analytics and analysis trends in 2015 - Webinar
Data analytics and analysis trends in 2015 - WebinarAli Zeeshan
 
Big Data – Shining the Light on Enterprise Dark Data
Big Data – Shining the Light on Enterprise Dark DataBig Data – Shining the Light on Enterprise Dark Data
Big Data – Shining the Light on Enterprise Dark DataHitachi Vantara
 
Data Warehousing Datamining Concepts
Data Warehousing Datamining ConceptsData Warehousing Datamining Concepts
Data Warehousing Datamining Conceptsraulmisir
 
The key to unlocking the Value in the IoT? Managing the Data!
The key to unlocking the Value in the IoT? Managing the Data!The key to unlocking the Value in the IoT? Managing the Data!
The key to unlocking the Value in the IoT? Managing the Data!DataWorks Summit/Hadoop Summit
 
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014ALTER WAY
 
Intro to Data Science Big Data
Intro to Data Science Big DataIntro to Data Science Big Data
Intro to Data Science Big DataIndu Khemchandani
 
Testing the Data Warehouse―Big Data, Big Problems
Testing the Data Warehouse―Big Data, Big ProblemsTesting the Data Warehouse―Big Data, Big Problems
Testing the Data Warehouse―Big Data, Big ProblemsTechWell
 
Gulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And MiningGulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And Mininggulab sharma
 

Similar to DATPROF Test data Management (data privacy & data subsetting) - English (20)

Lauri Pietarinen - What's Wrong With My Test Data
Lauri Pietarinen - What's Wrong With My Test DataLauri Pietarinen - What's Wrong With My Test Data
Lauri Pietarinen - What's Wrong With My Test Data
 
DATA Warehousing & Data Mining
DATA Warehousing & Data MiningDATA Warehousing & Data Mining
DATA Warehousing & Data Mining
 
Business Analytics Paradigm Change
Business Analytics Paradigm ChangeBusiness Analytics Paradigm Change
Business Analytics Paradigm Change
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?
 
Manage your Datasets
Manage your DatasetsManage your Datasets
Manage your Datasets
 
UNIT - 1 : Part 1: Data Warehousing and Data Mining
UNIT - 1 : Part 1: Data Warehousing and Data MiningUNIT - 1 : Part 1: Data Warehousing and Data Mining
UNIT - 1 : Part 1: Data Warehousing and Data Mining
 
Splunk Business Analytics
Splunk Business AnalyticsSplunk Business Analytics
Splunk Business Analytics
 
Optimize IT Infrastructure
Optimize IT InfrastructureOptimize IT Infrastructure
Optimize IT Infrastructure
 
Introduction to Big Data Analytics
Introduction to Big Data AnalyticsIntroduction to Big Data Analytics
Introduction to Big Data Analytics
 
Improve the Health of Your Data
Improve the Health of Your DataImprove the Health of Your Data
Improve the Health of Your Data
 
Biz Nova It Project Bonus Slides
Biz Nova It Project Bonus SlidesBiz Nova It Project Bonus Slides
Biz Nova It Project Bonus Slides
 
Data analytics and analysis trends in 2015 - Webinar
Data analytics and analysis trends in 2015 - WebinarData analytics and analysis trends in 2015 - Webinar
Data analytics and analysis trends in 2015 - Webinar
 
Big Data – Shining the Light on Enterprise Dark Data
Big Data – Shining the Light on Enterprise Dark DataBig Data – Shining the Light on Enterprise Dark Data
Big Data – Shining the Light on Enterprise Dark Data
 
Data Warehousing Datamining Concepts
Data Warehousing Datamining ConceptsData Warehousing Datamining Concepts
Data Warehousing Datamining Concepts
 
IT Ready - DW: 1st Day
IT Ready - DW: 1st Day IT Ready - DW: 1st Day
IT Ready - DW: 1st Day
 
The key to unlocking the Value in the IoT? Managing the Data!
The key to unlocking the Value in the IoT? Managing the Data!The key to unlocking the Value in the IoT? Managing the Data!
The key to unlocking the Value in the IoT? Managing the Data!
 
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
 
Intro to Data Science Big Data
Intro to Data Science Big DataIntro to Data Science Big Data
Intro to Data Science Big Data
 
Testing the Data Warehouse―Big Data, Big Problems
Testing the Data Warehouse―Big Data, Big ProblemsTesting the Data Warehouse―Big Data, Big Problems
Testing the Data Warehouse―Big Data, Big Problems
 
Gulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And MiningGulabs Ppt On Data Warehousing And Mining
Gulabs Ppt On Data Warehousing And Mining
 

More from DATPROF

Test Data Management in an agile environment
Test Data Management in an agile environmentTest Data Management in an agile environment
Test Data Management in an agile environmentDATPROF
 
Test automatisering en test data management | data subsets
Test automatisering en test data management | data subsetsTest automatisering en test data management | data subsets
Test automatisering en test data management | data subsetsDATPROF
 
Gebruikerssessie DATPROF februari 2015
Gebruikerssessie DATPROF februari 2015Gebruikerssessie DATPROF februari 2015
Gebruikerssessie DATPROF februari 2015DATPROF
 
Breakfast Session: Test data Management
Breakfast Session: Test data ManagementBreakfast Session: Test data Management
Breakfast Session: Test data ManagementDATPROF
 
Presentatie Privacy Paleis anonimiseringstool PIA
Presentatie Privacy Paleis anonimiseringstool PIAPresentatie Privacy Paleis anonimiseringstool PIA
Presentatie Privacy Paleis anonimiseringstool PIADATPROF
 
Presentatie Agile en Testdata van Bert Nienhuis | DATPROF
Presentatie Agile en Testdata van Bert Nienhuis | DATPROF Presentatie Agile en Testdata van Bert Nienhuis | DATPROF
Presentatie Agile en Testdata van Bert Nienhuis | DATPROF DATPROF
 
Test Tool Event van Sogeti | DATPROF Testdata Management
Test Tool Event van Sogeti | DATPROF Testdata Management Test Tool Event van Sogeti | DATPROF Testdata Management
Test Tool Event van Sogeti | DATPROF Testdata Management DATPROF
 
Testdata kennissessie: Pas op: Persoonsgegevens?!
Testdata kennissessie: Pas op: Persoonsgegevens?!Testdata kennissessie: Pas op: Persoonsgegevens?!
Testdata kennissessie: Pas op: Persoonsgegevens?!DATPROF
 
Datprof tmap 2013 handout
Datprof tmap 2013 handoutDatprof tmap 2013 handout
Datprof tmap 2013 handoutDATPROF
 
20130918 kennis sessie-handout
20130918 kennis sessie-handout20130918 kennis sessie-handout
20130918 kennis sessie-handoutDATPROF
 
Dutchtestingconference2013 slideshare
Dutchtestingconference2013 slideshareDutchtestingconference2013 slideshare
Dutchtestingconference2013 slideshareDATPROF
 
20130318 datprof privacy & subset
20130318   datprof privacy & subset20130318   datprof privacy & subset
20130318 datprof privacy & subsetDATPROF
 
Datprof privacy & subset 2.0webslideshare
Datprof privacy & subset 2.0webslideshareDatprof privacy & subset 2.0webslideshare
Datprof privacy & subset 2.0webslideshareDATPROF
 
Privacy webslideshare
Privacy webslidesharePrivacy webslideshare
Privacy webslideshareDATPROF
 
20121119 tmapprivacy
20121119 tmapprivacy20121119 tmapprivacy
20121119 tmapprivacyDATPROF
 

More from DATPROF (15)

Test Data Management in an agile environment
Test Data Management in an agile environmentTest Data Management in an agile environment
Test Data Management in an agile environment
 
Test automatisering en test data management | data subsets
Test automatisering en test data management | data subsetsTest automatisering en test data management | data subsets
Test automatisering en test data management | data subsets
 
Gebruikerssessie DATPROF februari 2015
Gebruikerssessie DATPROF februari 2015Gebruikerssessie DATPROF februari 2015
Gebruikerssessie DATPROF februari 2015
 
Breakfast Session: Test data Management
Breakfast Session: Test data ManagementBreakfast Session: Test data Management
Breakfast Session: Test data Management
 
Presentatie Privacy Paleis anonimiseringstool PIA
Presentatie Privacy Paleis anonimiseringstool PIAPresentatie Privacy Paleis anonimiseringstool PIA
Presentatie Privacy Paleis anonimiseringstool PIA
 
Presentatie Agile en Testdata van Bert Nienhuis | DATPROF
Presentatie Agile en Testdata van Bert Nienhuis | DATPROF Presentatie Agile en Testdata van Bert Nienhuis | DATPROF
Presentatie Agile en Testdata van Bert Nienhuis | DATPROF
 
Test Tool Event van Sogeti | DATPROF Testdata Management
Test Tool Event van Sogeti | DATPROF Testdata Management Test Tool Event van Sogeti | DATPROF Testdata Management
Test Tool Event van Sogeti | DATPROF Testdata Management
 
Testdata kennissessie: Pas op: Persoonsgegevens?!
Testdata kennissessie: Pas op: Persoonsgegevens?!Testdata kennissessie: Pas op: Persoonsgegevens?!
Testdata kennissessie: Pas op: Persoonsgegevens?!
 
Datprof tmap 2013 handout
Datprof tmap 2013 handoutDatprof tmap 2013 handout
Datprof tmap 2013 handout
 
20130918 kennis sessie-handout
20130918 kennis sessie-handout20130918 kennis sessie-handout
20130918 kennis sessie-handout
 
Dutchtestingconference2013 slideshare
Dutchtestingconference2013 slideshareDutchtestingconference2013 slideshare
Dutchtestingconference2013 slideshare
 
20130318 datprof privacy & subset
20130318   datprof privacy & subset20130318   datprof privacy & subset
20130318 datprof privacy & subset
 
Datprof privacy & subset 2.0webslideshare
Datprof privacy & subset 2.0webslideshareDatprof privacy & subset 2.0webslideshare
Datprof privacy & subset 2.0webslideshare
 
Privacy webslideshare
Privacy webslidesharePrivacy webslideshare
Privacy webslideshare
 
20121119 tmapprivacy
20121119 tmapprivacy20121119 tmapprivacy
20121119 tmapprivacy
 

Recently uploaded

Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Hr365.us smith
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEEVICTOR MAESTRE RAMIREZ
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Matt Ray
 
Introduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfIntroduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfFerryKemperman
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtimeandrehoraa
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesŁukasz Chruściel
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....kzayra69
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024StefanoLambiase
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfAlina Yurenko
 
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odishasmiwainfosol
 

Recently uploaded (20)

Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEE
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
 
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort ServiceHot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
 
Introduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfIntroduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdf
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtime
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New Features
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
2.pdf Ejercicios de programación competitiva
2.pdf Ejercicios de programación competitiva2.pdf Ejercicios de programación competitiva
2.pdf Ejercicios de programación competitiva
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
 
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
 

DATPROF Test data Management (data privacy & data subsetting) - English

  • 1. Test Data Management Harald Kikkers, Maarten Urbach & Bert Nienhuis
  • 2. DATPROF Data IntegrationTest Data Management • Dutch Software supplier • Founded in 1998 • Partners: ITCG, Sogeti, …
  • 4. MANY ORGANISATIONS USE MULTIPLE COPIES OF PRODUCTION DATABASES
  • 5. PURPOSES: • TESTING • DEVELOPMENT • OUTSOURCING • MARKETING • TRAINING
  • 6. Agile Development • Building the right product • Room for change • Every 2-4 weeks working increments of the software • Progress in development
  • 7. How to test all these iterations? And… what data to use?
  • 8. Team 1 Team 2 Team 3 6 TB 500 GB Production 10 GB 6 TB 500 GB Test 10 GB 6 TB 500 GB Development 10 GB Total 19,53 TB
  • 9. Team 1 Team 2 Team 3 6 TB 500 GB Production 10 GB 6 TB 500 GB Test 10 GB 6 TB 500 GB Development 10 GB Total 19,53 TB Team 1 Team 2 Team 3 Test Team 1 Team 2 Team 3 Development
  • 10. Team 1 Team 2 Team 3 6 TB 500 GB Production 10 GB 6 TB 500 GB Development 10 GB 6 TB 500 GB Test 10 GB 6 TB 500 GB Development 10 GB 6 TB 500 GB Test 10 GB 6 TB 500 GB Development 10 GB 6 TB 500 GB Test 10 GB Total 45,57 TB
  • 11. Team 1 Team 2 Team 3 6 TB 500 GB Production 10 GB 600 GB 50 GB Development 1 GB 600 GB 50 GB Test 1 GB 600 GB 50 GB Development 1 GB 600 GB 50 GB Test 1 GB 600 GB 50 GB Development 1 GB 600 GB 50 GB Test 1 GB Total 10.4 TB 10 % Subset 10 % Subset 10 % Subset
  • 13. Test Test Test Development Development Development
  • 14. Minimize data usage Save on hardware & infra Reduce throughput times Efficient data management Protect customer information Comply with regislation Prevent brand damage Maintain competitive advantages Subsetting Anonymizing Advantages of subsetting data Advantages of scrambling & masking data
  • 15. DBA Tools ETL Suites 100$ tools IBM, Informatica, Oracle
  • 16. DBA Tools ETL Suites ?
  • 17. DBA Tools ETL Suites - User Experience - Default templates - Easy to maintain - Smart functionality - Chain support
  • 18. DBA Tools ETL Suites
  • 20. Data model classification Subset – Process data Example: Customers, Orders, Contracts, Invoices, Transactions Full – Master data Example: Application data, configuration, master tables Embty – Logging, non relevant history Example: Logging tables, temp tabellen Determine data to be subsetted
  • 21.
  • 22. Chain of systems Method for deriving consistent subsets from multiple systems Production Test/Development Start Filter All customers from The Netherlands Start Filter All orders from customers in the previous subset.
  • 25. - Bank account balance - Dept - Medication - Illness - Religion - Political preference - Salary - Phone history - Et cetera… - Name - Date of birth - Email - Bank account number - Social security number - Adress - Insurance number - Cellphone number - Et cetera.. Personal data Identifying Characteristics “Any information relating to an identified or identifiable natural person ("data subject") Source: Data Protection Directive - Directive 95/46/EC
  • 27. Shuffle Shuffle values within same column Conditional Manipulate specified rows+ First name Last name Type John Max Joe Clark Smith Williams DATPROF Customer Customer Customer Company
  • 28. 321 First name Last name Type Comment E-Mail John Max Joe Smith Williams Clark Blank Delete values from columns Scramble Replace existing characters j.clark@live.com Smith_max@mail.com i_am@JoeWilliams.de “Brother of J. Clark” “Has dept” Customer Customer Customer CompanyDATPROF
  • 29. Nr. First name Last name Type Co.. E-mail Date of Birth John Max Joe Smith Williams Clark DATPROF 123 Customer Customer Customer Company 321 789 456 First day Change dates to first day within same month and year 01-02-1954 01-11-1984 01-03-1974 Postal code Date of Birth 1st day of month 1st day of year 87% 3.7% 0.04% Source: research anonimity by Prof. Dr. Latanya Sweeney (Harvard University) x.xxxxx@xxxx... Xxxxx_xxx@xx... x_xx@XxxXxxx...
  • 30. Nr. First name Last name Type .. E-mail Date of birth 123 321 789 01-02-1954 01-11-1984 01-03-1974 Look-up Replace values with values from a lookup table James Adrian Thomas John Max Joe First names Chris Thomas James Ruben Adrian Michael David Reference data Smith Williams Clark DATPROF Customer Customer Customer Company x.xxxxx@xxxx... Xxxxx_xxx@xx... x_xx@XxxXxxx...
  • 31. Nr. First name Last name Type Comment E-mail Date of birth Thomas James Adrian Smith Williams Clark DATPROF 123 Customer Customer Customer Company 321 789 456 01-02-1954 01-11-1984 01-03-1974 Expression Use custom made functions Scrambled T.Smith@datprof.com J.Willams@datprof.com A.Clark@datprof.com Scrambled Scrambled

Editor's Notes

  1. Doordat het team zelf bepaald hoeveel werk zij van de backlog aankunnen en daarvoor commitment afgeven. Plus het feit dat na de sprint de rest van de organisatie zien wat hun voortgang is, zorgt voor een onzettend gemotiveerd en effectief team. Het bouwen van software is ontzettend veranderlijk. Gebruikers weten vaak niet precies wat ze willen totdat ze het voor hun zien of ermee kunnen werken. Daarvoor is prototype ontwikkeling en de mogelijk om na een sprint bij te sturen onzettend belangrijk. Zeggen Scrum te doen, maar niet doen…….. Uitleggen welke fouten
  2. - Test varianten -
  3. Verhouding tussen productie en test Nu 60-40