SlideShare a Scribd company logo
1 of 21
Download to read offline
Copyright © 2014, SAS Institute Inc. All rights reserved.
DATA QUALITY IN A
BIG DATA WORLD
Jos van Dongen
SAS Nederland
Copyright © 2014, SAS Institute Inc. All rights reserved.
Copyright © 2014, SAS Institute Inc. All rights reserved.
Barcelona
Copyright © 2014, SAS Institute Inc. All rights reserved.
Copyright © 2014, SAS Institute Inc. All rights reserved.
Copyright © 2014, SAS Institute Inc. All rights reserved.
INCORPORATE DATA
GOVERNANCE
DEFINE RULES AND POLICIES GOVERNING DATA
Who is
responsible
to maintain
this data?
And where?
Where can I
get this
information?
Is the
quality of
data
improving?
How am I
supposed
to use this
data?
What data
quality
standards
should this
data comply
to?
Who can
approve a
change to the
business
data model or
reference
data?
Are we
compliant
with
security,
privacy and
risk
regulations
?
How to
leverage
the value of
this data?
Are we
making the
most out of
our data?
Copyright © 2014, SAS Institute Inc. All rights reserved.
Data Quality?
Copyright © 2014, SAS Institute Inc. All rights reserved.
DATA MNGT
BUILDING BLOCKS
DATA QUALITY
Copyright © 2014, SAS Institute Inc. All rights reserved.
BUSINESS USER BUSINESS GLOSSARY
Trace data from source
to consumer and all the
steps in between
Document what has been
done to data and how it
has been transformed
Govern who has access
to data and who has
consumed data
Copyright © 2014, SAS Institute Inc. All rights reserved.
DATA QUALITY GOVERNANCE CYCLE
Iterative process
where Business
and IT work
together on
Data Governance
Copyright © 2014, SAS Institute Inc. All rights reserved.
DATA QUALITY PROFILE
Interactively
quickly discover
anomalies in the
data
Copyright © 2014, SAS Institute Inc. All rights reserved.
DATA QUALITY BUSINESS RULE VALIDATION
Validate whether the
data complies to
quality standards
Copyright © 2014, SAS Institute Inc. All rights reserved.
DATA QUALITY DATA CLEANSING: PARSING & STANDARDIZING
Copyright © 2014, SAS Institute Inc. All rights reserved.
DATA QUALITY REMEDIATION
Review and resolve
issues on a case by
case basis
Copyright © 2014, SAS Institute Inc. All rights reserved.
DATA QUALITY DASHBOARD
Real-time information
when data is out of
compliance with
established data
policies
Copyright © 2014, SAS Institute Inc. All rights reserved.
Conclusion
#BigData = Data (duh…)
Copyright © 2014, SAS Institute Inc. All rights reserved.
…or is it?
§  Most data assets come from within company
§ Focus on structured data
§ Look at data to assess what occurred in past
§ The goal is that each single record is correct
§ Good database design requires years
§ Pay attention to „data stocks“*
§ Business users have to ask IT for analysis
§ There are clearly defined information
requirements for each business process
§  A large proportion of data come from outside
§ Focus on structured and unstructured data
§ Real-time analysis to improve the outcome
§ The goal is that analytics results are accurate
§ Database as moving target, quick cycles
§ Pay attention to „data flows“*
§ Business users conduct analysis themselves
§ All internal and external data sources are
used to gain best insight in a given situation
Traditional data management Big Data Analytics World
Source: Alexander Borek, Data Quality Strategy in a Big Data Analytics World
Copyright © 2014, SAS Institute Inc. All rights reserved.
“By 2017, 50% of all companies
in regulated industries will have a
Chief Data Officer.”
Copyright © 2014, SAS Institute Inc. All rights reserved.
SAS INFORMATION MANAGEMENT
A single platform. A singular
approach to better data.
Copyright © 2014, SAS Institute Inc. All rights reserved.
Copyright © 2014, SAS Institute Inc. All rights reserved.
NOG VRAGEN???

More Related Content

What's hot

Geek Sync I Agile Data Management vs. Agile Data Modeling
Geek Sync I Agile Data Management vs. Agile Data ModelingGeek Sync I Agile Data Management vs. Agile Data Modeling
Geek Sync I Agile Data Management vs. Agile Data ModelingIDERA Software
 
Eclipse day Sydney 2014 BIG data presentation
Eclipse day Sydney 2014 BIG data presentationEclipse day Sydney 2014 BIG data presentation
Eclipse day Sydney 2014 BIG data presentationSai Paravastu
 
Creating Your Data Governance Dashboard
Creating Your Data Governance DashboardCreating Your Data Governance Dashboard
Creating Your Data Governance DashboardTrillium Software
 
Data quality and bi
Data quality and biData quality and bi
Data quality and bijeffd00
 
Notes On Single View Of The Customer
Notes On Single View Of The CustomerNotes On Single View Of The Customer
Notes On Single View Of The CustomerAlan McSweeney
 
Building the enterprise data architecture
Building the enterprise data architectureBuilding the enterprise data architecture
Building the enterprise data architectureCosta Pissaris
 
Geek Sync I The Importance of Data Model Change Management
Geek Sync I The Importance of Data Model Change ManagementGeek Sync I The Importance of Data Model Change Management
Geek Sync I The Importance of Data Model Change ManagementIDERA Software
 
02. Information solution outline template
02. Information solution outline template02. Information solution outline template
02. Information solution outline templateAlan D. Duncan
 
Unlocking Success in the 3 Stages of Master Data Management
Unlocking Success in the 3 Stages of Master Data ManagementUnlocking Success in the 3 Stages of Master Data Management
Unlocking Success in the 3 Stages of Master Data ManagementPerficient, Inc.
 
Implementing Digital Signatures in an FDA-Regulated Environment
Implementing Digital Signatures in an FDA-Regulated EnvironmentImplementing Digital Signatures in an FDA-Regulated Environment
Implementing Digital Signatures in an FDA-Regulated EnvironmentPerficient, Inc.
 
Leveraging Information Steward
Leveraging Information StewardLeveraging Information Steward
Leveraging Information StewardMethod360
 
Telelogic Dashboard Cmmi Presentation
Telelogic Dashboard Cmmi PresentationTelelogic Dashboard Cmmi Presentation
Telelogic Dashboard Cmmi PresentationBill Duncan
 
Agile Enterprise Data Model & Data Management Solution
Agile Enterprise Data Model & Data Management SolutionAgile Enterprise Data Model & Data Management Solution
Agile Enterprise Data Model & Data Management SolutionA.I. Consultancy Ltd
 
Enterprise Data Architect Job Description
Enterprise Data Architect Job DescriptionEnterprise Data Architect Job Description
Enterprise Data Architect Job DescriptionLars E Martinsson
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best PracticesBoris Otto
 
CDO Webinar: Metadata and the CDO
CDO Webinar: Metadata and the CDOCDO Webinar: Metadata and the CDO
CDO Webinar: Metadata and the CDODATAVERSITY
 
Improving the customer experience using big data customer-centric measurement...
Improving the customer experience using big data customer-centric measurement...Improving the customer experience using big data customer-centric measurement...
Improving the customer experience using big data customer-centric measurement...Business Over Broadway
 

What's hot (20)

Geek Sync I Agile Data Management vs. Agile Data Modeling
Geek Sync I Agile Data Management vs. Agile Data ModelingGeek Sync I Agile Data Management vs. Agile Data Modeling
Geek Sync I Agile Data Management vs. Agile Data Modeling
 
Eclipse day Sydney 2014 BIG data presentation
Eclipse day Sydney 2014 BIG data presentationEclipse day Sydney 2014 BIG data presentation
Eclipse day Sydney 2014 BIG data presentation
 
Creating Your Data Governance Dashboard
Creating Your Data Governance DashboardCreating Your Data Governance Dashboard
Creating Your Data Governance Dashboard
 
Data quality and bi
Data quality and biData quality and bi
Data quality and bi
 
Data Quality
Data QualityData Quality
Data Quality
 
Notes On Single View Of The Customer
Notes On Single View Of The CustomerNotes On Single View Of The Customer
Notes On Single View Of The Customer
 
Building the enterprise data architecture
Building the enterprise data architectureBuilding the enterprise data architecture
Building the enterprise data architecture
 
Geek Sync I The Importance of Data Model Change Management
Geek Sync I The Importance of Data Model Change ManagementGeek Sync I The Importance of Data Model Change Management
Geek Sync I The Importance of Data Model Change Management
 
02. Information solution outline template
02. Information solution outline template02. Information solution outline template
02. Information solution outline template
 
Reference Data Management
Reference Data Management Reference Data Management
Reference Data Management
 
Unlocking Success in the 3 Stages of Master Data Management
Unlocking Success in the 3 Stages of Master Data ManagementUnlocking Success in the 3 Stages of Master Data Management
Unlocking Success in the 3 Stages of Master Data Management
 
Implementing Digital Signatures in an FDA-Regulated Environment
Implementing Digital Signatures in an FDA-Regulated EnvironmentImplementing Digital Signatures in an FDA-Regulated Environment
Implementing Digital Signatures in an FDA-Regulated Environment
 
Leveraging Information Steward
Leveraging Information StewardLeveraging Information Steward
Leveraging Information Steward
 
Telelogic Dashboard Cmmi Presentation
Telelogic Dashboard Cmmi PresentationTelelogic Dashboard Cmmi Presentation
Telelogic Dashboard Cmmi Presentation
 
Data Quality Presentation
Data Quality PresentationData Quality Presentation
Data Quality Presentation
 
Agile Enterprise Data Model & Data Management Solution
Agile Enterprise Data Model & Data Management SolutionAgile Enterprise Data Model & Data Management Solution
Agile Enterprise Data Model & Data Management Solution
 
Enterprise Data Architect Job Description
Enterprise Data Architect Job DescriptionEnterprise Data Architect Job Description
Enterprise Data Architect Job Description
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best Practices
 
CDO Webinar: Metadata and the CDO
CDO Webinar: Metadata and the CDOCDO Webinar: Metadata and the CDO
CDO Webinar: Metadata and the CDO
 
Improving the customer experience using big data customer-centric measurement...
Improving the customer experience using big data customer-centric measurement...Improving the customer experience using big data customer-centric measurement...
Improving the customer experience using big data customer-centric measurement...
 

Similar to Data donderdag data quality sas

Becoming a data-driven organization in a fast-moving world - SAS italy
Becoming a data-driven organization in a fast-moving world - SAS italyBecoming a data-driven organization in a fast-moving world - SAS italy
Becoming a data-driven organization in a fast-moving world - SAS italySAS Italy
 
Data Management for High Performance Analytics
Data Management for High Performance AnalyticsData Management for High Performance Analytics
Data Management for High Performance AnalyticsMary Snyder
 
Extending BI with Big Data Analytics
Extending BI with Big Data AnalyticsExtending BI with Big Data Analytics
Extending BI with Big Data AnalyticsDatameer
 
Understand Your Customer Buying Journey with Big Data
Understand Your Customer Buying Journey with Big Data Understand Your Customer Buying Journey with Big Data
Understand Your Customer Buying Journey with Big Data Datameer
 
Sqrrl March Webinar: How to Build a Big App
Sqrrl March Webinar: How to Build a Big AppSqrrl March Webinar: How to Build a Big App
Sqrrl March Webinar: How to Build a Big AppSqrrl
 
IW14 Session: Mike Gualtieri, Forrester Research
IW14 Session: Mike Gualtieri, Forrester ResearchIW14 Session: Mike Gualtieri, Forrester Research
IW14 Session: Mike Gualtieri, Forrester ResearchSoftware AG
 
A Better Understanding: Solving Business Challenges with Data
A Better Understanding: Solving Business Challenges with DataA Better Understanding: Solving Business Challenges with Data
A Better Understanding: Solving Business Challenges with DataEric Kavanagh
 
Forrester’s View on Accelerating Analytics and Insights with Data Prep
Forrester’s View on Accelerating Analytics and Insights with Data PrepForrester’s View on Accelerating Analytics and Insights with Data Prep
Forrester’s View on Accelerating Analytics and Insights with Data PrepDatawatchCorporation
 
What is the Value of SAS Analytics?
What is the Value of SAS Analytics?What is the Value of SAS Analytics?
What is the Value of SAS Analytics?SAS Canada
 
The Model Enterprise: A Blueprint for Enterprise Data Governance
The Model Enterprise: A Blueprint for Enterprise Data GovernanceThe Model Enterprise: A Blueprint for Enterprise Data Governance
The Model Enterprise: A Blueprint for Enterprise Data GovernanceEric Kavanagh
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneySai Paravastu
 
How Guided Data Discovery Leads Users to Better Data Insights
How Guided Data Discovery Leads Users to Better Data InsightsHow Guided Data Discovery Leads Users to Better Data Insights
How Guided Data Discovery Leads Users to Better Data InsightsDenodo
 
Presumption of Abundance: Architecting the Future of Success
Presumption of Abundance: Architecting the Future of SuccessPresumption of Abundance: Architecting the Future of Success
Presumption of Abundance: Architecting the Future of SuccessInside Analysis
 
Business Agility Must Be Based on a New Flexible and Agile Data Approach
Business Agility Must Be Based on a New Flexible and Agile Data ApproachBusiness Agility Must Be Based on a New Flexible and Agile Data Approach
Business Agility Must Be Based on a New Flexible and Agile Data ApproachDenodo
 
Analyze Your Data, Transform Your Business
Analyze Your Data, Transform Your BusinessAnalyze Your Data, Transform Your Business
Analyze Your Data, Transform Your BusinessDATAVERSITY
 
Relational Databases are Evolving To Support New Data Capabilities
Relational Databases are Evolving To Support New Data CapabilitiesRelational Databases are Evolving To Support New Data Capabilities
Relational Databases are Evolving To Support New Data CapabilitiesEDB
 
Getting Started with Data Governance? Use Process Models!
Getting Started with Data Governance? Use Process Models!Getting Started with Data Governance? Use Process Models!
Getting Started with Data Governance? Use Process Models!DATAVERSITY
 
The Emerging Data Lake IT Strategy
The Emerging Data Lake IT StrategyThe Emerging Data Lake IT Strategy
The Emerging Data Lake IT StrategyThomas Kelly, PMP
 

Similar to Data donderdag data quality sas (20)

Becoming a data-driven organization in a fast-moving world - SAS italy
Becoming a data-driven organization in a fast-moving world - SAS italyBecoming a data-driven organization in a fast-moving world - SAS italy
Becoming a data-driven organization in a fast-moving world - SAS italy
 
Data Management for High Performance Analytics
Data Management for High Performance AnalyticsData Management for High Performance Analytics
Data Management for High Performance Analytics
 
Extending BI with Big Data Analytics
Extending BI with Big Data AnalyticsExtending BI with Big Data Analytics
Extending BI with Big Data Analytics
 
Understand Your Customer Buying Journey with Big Data
Understand Your Customer Buying Journey with Big Data Understand Your Customer Buying Journey with Big Data
Understand Your Customer Buying Journey with Big Data
 
Sqrrl March Webinar: How to Build a Big App
Sqrrl March Webinar: How to Build a Big AppSqrrl March Webinar: How to Build a Big App
Sqrrl March Webinar: How to Build a Big App
 
IW14 Session: Mike Gualtieri, Forrester Research
IW14 Session: Mike Gualtieri, Forrester ResearchIW14 Session: Mike Gualtieri, Forrester Research
IW14 Session: Mike Gualtieri, Forrester Research
 
A Better Understanding: Solving Business Challenges with Data
A Better Understanding: Solving Business Challenges with DataA Better Understanding: Solving Business Challenges with Data
A Better Understanding: Solving Business Challenges with Data
 
Forrester’s View on Accelerating Analytics and Insights with Data Prep
Forrester’s View on Accelerating Analytics and Insights with Data PrepForrester’s View on Accelerating Analytics and Insights with Data Prep
Forrester’s View on Accelerating Analytics and Insights with Data Prep
 
What is the Value of SAS Analytics?
What is the Value of SAS Analytics?What is the Value of SAS Analytics?
What is the Value of SAS Analytics?
 
High performance organisation
High performance organisationHigh performance organisation
High performance organisation
 
IT Ready - DW: 1st Day
IT Ready - DW: 1st Day IT Ready - DW: 1st Day
IT Ready - DW: 1st Day
 
The Model Enterprise: A Blueprint for Enterprise Data Governance
The Model Enterprise: A Blueprint for Enterprise Data GovernanceThe Model Enterprise: A Blueprint for Enterprise Data Governance
The Model Enterprise: A Blueprint for Enterprise Data Governance
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
 
How Guided Data Discovery Leads Users to Better Data Insights
How Guided Data Discovery Leads Users to Better Data InsightsHow Guided Data Discovery Leads Users to Better Data Insights
How Guided Data Discovery Leads Users to Better Data Insights
 
Presumption of Abundance: Architecting the Future of Success
Presumption of Abundance: Architecting the Future of SuccessPresumption of Abundance: Architecting the Future of Success
Presumption of Abundance: Architecting the Future of Success
 
Business Agility Must Be Based on a New Flexible and Agile Data Approach
Business Agility Must Be Based on a New Flexible and Agile Data ApproachBusiness Agility Must Be Based on a New Flexible and Agile Data Approach
Business Agility Must Be Based on a New Flexible and Agile Data Approach
 
Analyze Your Data, Transform Your Business
Analyze Your Data, Transform Your BusinessAnalyze Your Data, Transform Your Business
Analyze Your Data, Transform Your Business
 
Relational Databases are Evolving To Support New Data Capabilities
Relational Databases are Evolving To Support New Data CapabilitiesRelational Databases are Evolving To Support New Data Capabilities
Relational Databases are Evolving To Support New Data Capabilities
 
Getting Started with Data Governance? Use Process Models!
Getting Started with Data Governance? Use Process Models!Getting Started with Data Governance? Use Process Models!
Getting Started with Data Governance? Use Process Models!
 
The Emerging Data Lake IT Strategy
The Emerging Data Lake IT StrategyThe Emerging Data Lake IT Strategy
The Emerging Data Lake IT Strategy
 

More from Cre-Aid

Smart thermometer-niek
Smart thermometer-niekSmart thermometer-niek
Smart thermometer-niekCre-Aid
 
Senso run data donderdag 191115 v1
Senso run data donderdag 191115 v1Senso run data donderdag 191115 v1
Senso run data donderdag 191115 v1Cre-Aid
 
Startup pitch-what ifolution
Startup pitch-what ifolutionStartup pitch-what ifolution
Startup pitch-what ifolutionCre-Aid
 
Piek vossen-data-donderdag
Piek vossen-data-donderdagPiek vossen-data-donderdag
Piek vossen-data-donderdagCre-Aid
 
Leeruniek
Leeruniek Leeruniek
Leeruniek Cre-Aid
 
Predictive policing
Predictive policingPredictive policing
Predictive policingCre-Aid
 
150423 data donderdag presentatie
150423 data donderdag presentatie 150423 data donderdag presentatie
150423 data donderdag presentatie Cre-Aid
 
BEYOND ballet why and how
BEYOND ballet why and howBEYOND ballet why and how
BEYOND ballet why and howCre-Aid
 
Kick off 6e editie data donderdag
Kick off 6e editie data donderdagKick off 6e editie data donderdag
Kick off 6e editie data donderdagCre-Aid
 
Slides data donderdag #6
Slides data donderdag #6Slides data donderdag #6
Slides data donderdag #6Cre-Aid
 
Presentatie datadonderdag nationale denktank
Presentatie datadonderdag nationale denktankPresentatie datadonderdag nationale denktank
Presentatie datadonderdag nationale denktankCre-Aid
 
Pieter Winsemius - Rafael Dialoog 29 januari 2015
Pieter Winsemius - Rafael Dialoog 29 januari 2015Pieter Winsemius - Rafael Dialoog 29 januari 2015
Pieter Winsemius - Rafael Dialoog 29 januari 2015Cre-Aid
 
Presentaie brabantse agrofood 15 01-29
Presentaie brabantse agrofood 15 01-29Presentaie brabantse agrofood 15 01-29
Presentaie brabantse agrofood 15 01-29Cre-Aid
 
Fred van Eenennaam - Rafael Dialoog 28 januari 2015
Fred van Eenennaam - Rafael Dialoog 28 januari 2015Fred van Eenennaam - Rafael Dialoog 28 januari 2015
Fred van Eenennaam - Rafael Dialoog 28 januari 2015Cre-Aid
 
Martin Scholten - Rafael Dialoog 28 januari 2015
Martin Scholten - Rafael Dialoog 28 januari 2015Martin Scholten - Rafael Dialoog 28 januari 2015
Martin Scholten - Rafael Dialoog 28 januari 2015Cre-Aid
 
Data donderdag 30 oktober 2014 - DIME
Data donderdag 30 oktober 2014 - DIMEData donderdag 30 oktober 2014 - DIME
Data donderdag 30 oktober 2014 - DIMECre-Aid
 
Data donderdag 30 oktober 2014 - MapR
Data donderdag 30 oktober 2014 - MapRData donderdag 30 oktober 2014 - MapR
Data donderdag 30 oktober 2014 - MapRCre-Aid
 
Data donderdag 30 oktober 2014 - Rob Dielemans
Data donderdag 30 oktober 2014 - Rob DielemansData donderdag 30 oktober 2014 - Rob Dielemans
Data donderdag 30 oktober 2014 - Rob DielemansCre-Aid
 
Big Data Schiphol Group Meetup 30 10 2014
Big Data Schiphol Group Meetup 30 10 2014Big Data Schiphol Group Meetup 30 10 2014
Big Data Schiphol Group Meetup 30 10 2014Cre-Aid
 
Pim Stouten (LexisNexis BIS), Big Data Business as usual? - Data Donderdag
Pim Stouten (LexisNexis BIS), Big Data Business as usual? - Data DonderdagPim Stouten (LexisNexis BIS), Big Data Business as usual? - Data Donderdag
Pim Stouten (LexisNexis BIS), Big Data Business as usual? - Data DonderdagCre-Aid
 

More from Cre-Aid (20)

Smart thermometer-niek
Smart thermometer-niekSmart thermometer-niek
Smart thermometer-niek
 
Senso run data donderdag 191115 v1
Senso run data donderdag 191115 v1Senso run data donderdag 191115 v1
Senso run data donderdag 191115 v1
 
Startup pitch-what ifolution
Startup pitch-what ifolutionStartup pitch-what ifolution
Startup pitch-what ifolution
 
Piek vossen-data-donderdag
Piek vossen-data-donderdagPiek vossen-data-donderdag
Piek vossen-data-donderdag
 
Leeruniek
Leeruniek Leeruniek
Leeruniek
 
Predictive policing
Predictive policingPredictive policing
Predictive policing
 
150423 data donderdag presentatie
150423 data donderdag presentatie 150423 data donderdag presentatie
150423 data donderdag presentatie
 
BEYOND ballet why and how
BEYOND ballet why and howBEYOND ballet why and how
BEYOND ballet why and how
 
Kick off 6e editie data donderdag
Kick off 6e editie data donderdagKick off 6e editie data donderdag
Kick off 6e editie data donderdag
 
Slides data donderdag #6
Slides data donderdag #6Slides data donderdag #6
Slides data donderdag #6
 
Presentatie datadonderdag nationale denktank
Presentatie datadonderdag nationale denktankPresentatie datadonderdag nationale denktank
Presentatie datadonderdag nationale denktank
 
Pieter Winsemius - Rafael Dialoog 29 januari 2015
Pieter Winsemius - Rafael Dialoog 29 januari 2015Pieter Winsemius - Rafael Dialoog 29 januari 2015
Pieter Winsemius - Rafael Dialoog 29 januari 2015
 
Presentaie brabantse agrofood 15 01-29
Presentaie brabantse agrofood 15 01-29Presentaie brabantse agrofood 15 01-29
Presentaie brabantse agrofood 15 01-29
 
Fred van Eenennaam - Rafael Dialoog 28 januari 2015
Fred van Eenennaam - Rafael Dialoog 28 januari 2015Fred van Eenennaam - Rafael Dialoog 28 januari 2015
Fred van Eenennaam - Rafael Dialoog 28 januari 2015
 
Martin Scholten - Rafael Dialoog 28 januari 2015
Martin Scholten - Rafael Dialoog 28 januari 2015Martin Scholten - Rafael Dialoog 28 januari 2015
Martin Scholten - Rafael Dialoog 28 januari 2015
 
Data donderdag 30 oktober 2014 - DIME
Data donderdag 30 oktober 2014 - DIMEData donderdag 30 oktober 2014 - DIME
Data donderdag 30 oktober 2014 - DIME
 
Data donderdag 30 oktober 2014 - MapR
Data donderdag 30 oktober 2014 - MapRData donderdag 30 oktober 2014 - MapR
Data donderdag 30 oktober 2014 - MapR
 
Data donderdag 30 oktober 2014 - Rob Dielemans
Data donderdag 30 oktober 2014 - Rob DielemansData donderdag 30 oktober 2014 - Rob Dielemans
Data donderdag 30 oktober 2014 - Rob Dielemans
 
Big Data Schiphol Group Meetup 30 10 2014
Big Data Schiphol Group Meetup 30 10 2014Big Data Schiphol Group Meetup 30 10 2014
Big Data Schiphol Group Meetup 30 10 2014
 
Pim Stouten (LexisNexis BIS), Big Data Business as usual? - Data Donderdag
Pim Stouten (LexisNexis BIS), Big Data Business as usual? - Data DonderdagPim Stouten (LexisNexis BIS), Big Data Business as usual? - Data Donderdag
Pim Stouten (LexisNexis BIS), Big Data Business as usual? - Data Donderdag
 

Recently uploaded

Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 

Recently uploaded (20)

Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 

Data donderdag data quality sas

  • 1. Copyright © 2014, SAS Institute Inc. All rights reserved. DATA QUALITY IN A BIG DATA WORLD Jos van Dongen SAS Nederland
  • 2. Copyright © 2014, SAS Institute Inc. All rights reserved.
  • 3. Copyright © 2014, SAS Institute Inc. All rights reserved. Barcelona
  • 4. Copyright © 2014, SAS Institute Inc. All rights reserved.
  • 5. Copyright © 2014, SAS Institute Inc. All rights reserved.
  • 6. Copyright © 2014, SAS Institute Inc. All rights reserved. INCORPORATE DATA GOVERNANCE DEFINE RULES AND POLICIES GOVERNING DATA Who is responsible to maintain this data? And where? Where can I get this information? Is the quality of data improving? How am I supposed to use this data? What data quality standards should this data comply to? Who can approve a change to the business data model or reference data? Are we compliant with security, privacy and risk regulations ? How to leverage the value of this data? Are we making the most out of our data?
  • 7. Copyright © 2014, SAS Institute Inc. All rights reserved. Data Quality?
  • 8. Copyright © 2014, SAS Institute Inc. All rights reserved. DATA MNGT BUILDING BLOCKS DATA QUALITY
  • 9. Copyright © 2014, SAS Institute Inc. All rights reserved. BUSINESS USER BUSINESS GLOSSARY Trace data from source to consumer and all the steps in between Document what has been done to data and how it has been transformed Govern who has access to data and who has consumed data
  • 10. Copyright © 2014, SAS Institute Inc. All rights reserved. DATA QUALITY GOVERNANCE CYCLE Iterative process where Business and IT work together on Data Governance
  • 11. Copyright © 2014, SAS Institute Inc. All rights reserved. DATA QUALITY PROFILE Interactively quickly discover anomalies in the data
  • 12. Copyright © 2014, SAS Institute Inc. All rights reserved. DATA QUALITY BUSINESS RULE VALIDATION Validate whether the data complies to quality standards
  • 13. Copyright © 2014, SAS Institute Inc. All rights reserved. DATA QUALITY DATA CLEANSING: PARSING & STANDARDIZING
  • 14. Copyright © 2014, SAS Institute Inc. All rights reserved. DATA QUALITY REMEDIATION Review and resolve issues on a case by case basis
  • 15. Copyright © 2014, SAS Institute Inc. All rights reserved. DATA QUALITY DASHBOARD Real-time information when data is out of compliance with established data policies
  • 16. Copyright © 2014, SAS Institute Inc. All rights reserved. Conclusion #BigData = Data (duh…)
  • 17. Copyright © 2014, SAS Institute Inc. All rights reserved. …or is it? §  Most data assets come from within company § Focus on structured data § Look at data to assess what occurred in past § The goal is that each single record is correct § Good database design requires years § Pay attention to „data stocks“* § Business users have to ask IT for analysis § There are clearly defined information requirements for each business process §  A large proportion of data come from outside § Focus on structured and unstructured data § Real-time analysis to improve the outcome § The goal is that analytics results are accurate § Database as moving target, quick cycles § Pay attention to „data flows“* § Business users conduct analysis themselves § All internal and external data sources are used to gain best insight in a given situation Traditional data management Big Data Analytics World Source: Alexander Borek, Data Quality Strategy in a Big Data Analytics World
  • 18. Copyright © 2014, SAS Institute Inc. All rights reserved. “By 2017, 50% of all companies in regulated industries will have a Chief Data Officer.”
  • 19. Copyright © 2014, SAS Institute Inc. All rights reserved. SAS INFORMATION MANAGEMENT A single platform. A singular approach to better data.
  • 20. Copyright © 2014, SAS Institute Inc. All rights reserved.
  • 21. Copyright © 2014, SAS Institute Inc. All rights reserved. NOG VRAGEN???