Copyright © 2014, SAS Institute Inc. All rights reserved.
DATA QUALITY IN A BIG DATA WORLD
Jos van Dongen
SAS Nederland
Barcelona
INCORPORATE DATA GOVERNANCE
DEFINE RULES AND POLICIES GOVERNING DATA
Who is responsible for maintaining this data? And where?
Where can I get this information?
Is the quality of data improving?
How am I supposed to use this data?
What data quality standards should this data comply with?
Who can approve a change to the business data model or reference data?
Are we compliant with security, privacy and risk regulations?
How do we leverage the value of this data?
Are we making the most out of our data?
Data Quality?
DATA MANAGEMENT BUILDING BLOCKS
DATA QUALITY
BUSINESS USER BUSINESS GLOSSARY
Trace data from source to consumer and all the steps in between
Document what has been done to data and how it has been transformed
Govern who has access to data and who has consumed data
DATA QUALITY GOVERNANCE CYCLE
Iterative process where Business and IT work together on Data Governance
DATA QUALITY PROFILE
Interactively and quickly discover anomalies in the data
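As a rough illustration of what profiling can surface, here is a minimal sketch in plain Python. It is not the SAS tooling shown in the talk; the function names `value_pattern` and `profile_column` and the postal-code sample are assumptions made up for this example. The idea is to count missing values, distinct values, and character patterns so that outliers stand out.

```python
from collections import Counter


def value_pattern(value: str) -> str:
    """Map letters to 'A' and digits to '9'; keep every other character."""
    return "".join(
        "A" if ch.isalpha() else "9" if ch.isdigit() else ch
        for ch in value
    )


def profile_column(values):
    """Return basic profile statistics for one column of raw string values."""
    # Treat None and blank strings as missing.
    non_empty = [v for v in values if v is not None and v.strip() != ""]
    patterns = Counter(value_pattern(v.strip()) for v in non_empty)
    return {
        "count": len(values),
        "missing": len(values) - len(non_empty),
        "distinct": len(set(non_empty)),
        "patterns": patterns.most_common(),
    }


# Hypothetical sample: postal codes with a few anomalies.
postcodes = ["1234 AB", "5678 CD", "", "9012EF", None, "1234 AB"]
profile = profile_column(postcodes)
print(profile)
```

A rare pattern (here the code without a space) is a candidate anomaly worth a closer look.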
DATA QUALITY BUSINESS RULE VALIDATION
Validate whether the data complies with quality standards
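Rule validation of this kind can be sketched as a set of named predicates applied to every record. The rules, field names, and sample records below are hypothetical and only illustrate the idea; a real deployment would take its rules from the agreed data policies.

```python
import re

EMAIL = re.compile(r"[^@\s]+@[^@\s]+\.[^@\s]+")

# Hypothetical rules: each name maps to a predicate over one record (a dict).
RULES = {
    "email_has_valid_format": lambda r: EMAIL.fullmatch(r.get("email") or "") is not None,
    "age_between_0_and_120": lambda r: isinstance(r.get("age"), int) and 0 <= r["age"] <= 120,
    "country_is_known": lambda r: r.get("country") in {"NL", "BE", "DE"},
}


def validate(records):
    """Return (record index, rule name) for every rule a record violates."""
    return [
        (index, name)
        for index, record in enumerate(records)
        for name, rule in RULES.items()
        if not rule(record)
    ]


records = [
    {"email": "jan@example.com", "age": 42, "country": "NL"},
    {"email": "not-an-email", "age": 130, "country": "NL"},
    {"email": None, "age": 30, "country": "FR"},
]
violations = validate(records)
for index, name in violations:
    print(f"record {index} violates {name}")
```

Keeping each rule as a named, independent check makes it easy to report which standard a record fails, not just that it fails.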
DATA QUALITY DATA CLEANSING: PARSING & STANDARDIZING
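To make parsing and standardizing concrete, the sketch below splits a free-text address into parts and rewrites abbreviations to one standard form. The address layout (house number first), the abbreviation table, and the function name `standardize_address` are assumptions for illustration, not the method used in the presentation.

```python
import re

# Hypothetical lookup table mapping abbreviations to a standard form.
ABBREVIATIONS = {
    "st": "Street", "st.": "Street",
    "rd": "Road", "rd.": "Road",
    "ave": "Avenue", "ave.": "Avenue",
}

# Assumed layout: house number, optional one-letter suffix, then the street name.
ADDRESS = re.compile(
    r"^\s*(?P<number>\d+)\s*(?P<suffix>[A-Za-z]?)\s+(?P<street>.+?)\s*$"
)


def standardize_address(raw):
    """Parse a raw address string and return its standardized parts."""
    match = ADDRESS.match(raw)
    if match is None:
        return None  # unparseable values are left for manual review
    words = [
        ABBREVIATIONS.get(word.lower(), word.capitalize())
        for word in match.group("street").split()
    ]
    return {
        "number": match.group("number"),
        "suffix": match.group("suffix").upper(),
        "street": " ".join(words),
    }


print(standardize_address("  221b  baker st. "))
print(standardize_address("10 downing   st"))
print(standardize_address("unknown"))
```

Parsing first and standardizing second keeps the two concerns separate: the pattern decides what the parts are, the lookup table decides how each part is written.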
DATA QUALITY REMEDIATION
Review and resolve issues on a case-by-case basis
DATA QUALITY DASHBOARD
Real-time information when data is out of compliance with established data policies
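One simple way to drive such a dashboard is to compute a pass rate per rule and flag any rule that drops below an agreed threshold. The sketch below assumes a 95% threshold and made-up rule names and outcomes; both are illustrative, not figures from the talk.

```python
def compliance_report(results, threshold=0.95):
    """Summarize pass rates per rule.

    `results` maps a rule name to a list of booleans (True = record passed).
    The default threshold is an assumed policy value.
    """
    report = {}
    for rule, outcomes in results.items():
        # An empty outcome list is treated as fully compliant.
        rate = sum(outcomes) / len(outcomes) if outcomes else 1.0
        report[rule] = {"rate": rate, "compliant": rate >= threshold}
    return report


# Hypothetical validation outcomes for two rules over 100 records each.
results = {
    "email_has_valid_format": [True] * 97 + [False] * 3,
    "country_is_known": [True] * 90 + [False] * 10,
}
report = compliance_report(results)
alerts = sorted(rule for rule, row in report.items() if not row["compliant"])
print(alerts)
```

Recomputing this report whenever new data arrives is what turns a periodic quality check into a near-real-time signal.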
Conclusion
#BigData = Data (duh…)
…or is it?
Traditional data management
§ Most data assets come from within the company
§ Focus on structured data
§ Look at data to assess what occurred in the past
§ The goal is that every single record is correct
§ Good database design requires years
§ Pay attention to "data stocks"*
§ Business users have to ask IT for analysis
§ There are clearly defined information requirements for each business process

Big Data Analytics World
§ A large proportion of data comes from outside
§ Focus on structured and unstructured data
§ Real-time analysis to improve the outcome
§ The goal is that analytics results are accurate
§ Database as moving target, quick cycles
§ Pay attention to "data flows"*
§ Business users conduct analysis themselves
§ All internal and external data sources are used to gain the best insight in a given situation

Source: Alexander Borek, Data Quality Strategy in a Big Data Analytics World
“By 2017, 50% of all companies in regulated industries will have a Chief Data Officer.”
SAS INFORMATION MANAGEMENT
A single platform. A singular approach to better data.
ANY QUESTIONS?

 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Detecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachDetecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning Approach
 

Data donderdag data quality sas

  • 1. Copyright © 2014, SAS Institute Inc. All rights reserved. DATA QUALITY IN A BIG DATA WORLD Jos van Dongen SAS Nederland
  • 3. Barcelona
  • 6. INCORPORATE DATA GOVERNANCE: DEFINE RULES AND POLICIES GOVERNING DATA. Who is responsible for maintaining this data, and where? Where can I get this information? Is the quality of the data improving? How am I supposed to use this data? What data quality standards should this data comply with? Who can approve a change to the business data model or reference data? Are we compliant with security, privacy and risk regulations? How can we leverage the value of this data? Are we making the most of our data?
  • 7. Data Quality?
  • 8. DATA MNGT BUILDING BLOCKS DATA QUALITY
  • 9. BUSINESS USER BUSINESS GLOSSARY Trace data from source to consumer and all the steps in between Document what has been done to data and how it has been transformed Govern who has access to data and who has consumed data
  • 10. DATA QUALITY GOVERNANCE CYCLE Iterative process where Business and IT work together on Data Governance
  • 11. DATA QUALITY PROFILE Interactively and quickly discover anomalies in the data
  • 12. DATA QUALITY BUSINESS RULE VALIDATION Validate whether the data complies with quality standards
  • 13. DATA QUALITY DATA CLEANSING: PARSING & STANDARDIZING
  • 14. DATA QUALITY REMEDIATION Review and resolve issues on a case-by-case basis
  • 15. DATA QUALITY DASHBOARD Real-time information when data is out of compliance with established data policies
  • 16. Conclusion #BigData = Data (duh…)
  • 17. …or is it?
      Traditional data management:
        • Most data assets come from within the company
        • Focus on structured data
        • Look at data to assess what occurred in the past
        • The goal is that every single record is correct
        • Good database design requires years
        • Pay attention to "data stocks"*
        • Business users have to ask IT for analysis
        • There are clearly defined information requirements for each business process
      Big Data Analytics World:
        • A large proportion of data comes from outside
        • Focus on structured and unstructured data
        • Real-time analysis to improve the outcome
        • The goal is that analytics results are accurate
        • Database as a moving target, quick cycles
        • Pay attention to "data flows"*
        • Business users conduct analysis themselves
        • All internal and external data sources are used to gain the best insight in a given situation
      Source: Alexander Borek, Data Quality Strategy in a Big Data Analytics World
  • 18. “By 2017, 50% of all companies in regulated industries will have a Chief Data Officer.”
  • 19. SAS INFORMATION MANAGEMENT A single platform. A singular approach to better data.
  • 21. ANY MORE QUESTIONS?