SlideShare a Scribd company logo
1 of 32
Sourcing Good Data 
10 best practices
Welcome 
Why is data quality 
important? 
Our 10 best practices 
Agenda:
Data Quality Story 
Overbooked 10,000 tickets for event 
Manual spreadsheet error 
- telegraph.co.uk
Your data has reach… 
Where data from a report is used: % of data in spreadsheets that influences CEO 
* Panko and Port, 2012 
Inter-departmental 
69% 
Within 
department 
31% 
42%
Just how much of an issue is data quality? 
1 in 10 organisations rate their data 
quality as “excellent” 
Poor data quality accounts for 
20% of business process costs 
$611bn The cost of poor data quality to US 
companies each year 
* Gartner, TDWI
And we want more… 
2009 – enough data to fill a stack of DVDs 
to the moon and back 
2020 – Grow by 44x 
Less than 1% of available data is 
analysed 
93% of execs believe they are losing 
revenue as a result of not fully leveraging 
the information they collect 
* IDC, Oracle and EMC 
1% 
x44 by 2020
What is data quality? 
HOW 
RELIABLE 
IS YOUR 
DATA? 
TRUSTED 
AND 
CREDIBLE 
Complete 
Accurate 
Available 
Consistent
Why is data quality important? 
“It supports accountability” 
“It gives us accurate and timely 
information to manage our business” 
“It ensures the best use of our resources” 
“It increases our efficiency” 
“It reduces the cost of rework” 
“It can increase customer satisfaction” 
“It ensures we have the best possible 
understanding of our customers and employees” 
“It improves the success rate of enterprise initiatives 
like Business Intelligence…”
Building high quality “supply chains” of data 
MEASURE 
FOR QUALITY 
GET THE 
RIGHT DATA 
BE AGILE
Focus on the outcome 
Analysis Paralysis 
Letting data dictate what is 
“important” 
Limited time and energy 
to focus 
1 
ISSUES
1 Focus on the outcome 
Start with 
the 
outcome… 
…then the 
data. 
Focus on 
what matters 
RECOMMENDATIONS
2 Profile your data 
Data supplier doesn’t know 
your data needs 
The data you source is as 
good as the information 
you provide to the 
supplier… 
ISSUES
2 Profile your data 
Write your data profile 
Structure, Format, Frequency, Age, Delivery Method 
Communicate it to data providers 
Opportunity to identify issues and gaps 
RECOMMENDATIONS
3 Get as close to the source as possible 
When your source data is somebody else’s 
spreadsheet…. 
Human Error Risk 
Availability of data 
Unexpected Changes 
Additional effort and complexity 
ISSUES
3 Get as close to the source as possible 
CAUTION 
Be cautious of 
manual 
spreadsheets 
Skip the 
spreadsheet as a 
source 
PLAN 
Communicate and 
measure for quality 
RECOMMENDATIONS
4 Streamline data sources 
Using multiple sources 
Redundant data 
Increased complexity and quality risk 
ISSUES
4 Streamline data sources 
Identify redundant data 
Focus on the essentials 
Cut out the stuff you don’t need 
RECOMMENDATIONS
5 Set data quality expectations 
Perfectionism  Burnout 
You can’t expect to focus on everything 
ISSUES
5 Set data quality expectations 
Focus on high impact data 
Employ tolerances and ranges for quality and accuracy 
RECOMMENDATIONS 
RELAX 
(a little)
6 Catch data quality issues early 
Early 
$1 
$10 
$100 
If found in the 
middle of the 
journey 
If found at the end 
Late of the journey 
* Total Quality Management 
If found at the 
start of journey 
1-10-100 Rule: 
ISSUES
6 Catch data quality issues early 
Implement quality measures near the start of 
the data supply chain 
Use the “start” as a reference point when 
checking data further down the journey 
RECOMMENDATIONS
7 Actively measure quality 
ISSUES 
Invalid Assumption: 
If the data meets our expectations today, it will 
going forward 
No simple way to identify if data is correct 
What happens when we do find an issue?
7 Actively measure quality 
OK 
GOOD 
NOT GOOD 
Define metrics for your data quality 
Measure for quality on a consistent basis 
Address consistent issues with strategic 
solutions (e.g. data cleansing) 
RECOMMENDATIONS
8 Expect Change. Embrace It. 
We all know change is coming 
Business activity, changes in 
strategies and systems 
So rigid that you need to “reset” 
ISSUES
8 Expect Change. Embrace It. 
Likelihood 
Impact 
L 
H 
L 
H 
Score and rank potential changes 
Focus on high likelihood/impact 
changes 
Have a plan in place for high risk items 
RECOMMENDATIONS
9 Plan for change 
A change occurs, then what? 
Lack of clear policies and rules on who 
needs to do what… 
Knowledge resting in the minds of key 
individuals 
ISSUES
9 Plan for change 
RECOMMENDATIONS 
CAUTION 
In the event 
of a change 
the following 
people will… 
Policies and rules Documentation Tracking Changes
10 Controlled human interaction 
Value of human interaction with data… 
… at the cost of data quality 
Uncontrolled manipulation of data 
ISSUES
10 Controlled human interaction 
Avoid uncontrolled manipulation 
Facilitate controlled and discrete changes 
Make sure it is traceable 
RECOMMENDATIONS
Recap 
1 Focus on the outcome 
2 Profile your data 
3 Get close to the source 
4 Streamline data sources 
5 Set data quality expectations
Recap 
6 Catch data quality issues early 
7 Measure quality 
8 Expect and embrace change 
9 Plan for change 
10 Controlled human interaction
Thank You

More Related Content

What's hot

Analytics Staffing Models of Health Systems That Compete Well Using Data
Analytics Staffing Models of Health Systems That Compete Well Using DataAnalytics Staffing Models of Health Systems That Compete Well Using Data
Analytics Staffing Models of Health Systems That Compete Well Using DataThotWave
 
Zach Frank: Pitfalls of Predicative Models in People Analytics
Zach Frank: Pitfalls of Predicative Models in People AnalyticsZach Frank: Pitfalls of Predicative Models in People Analytics
Zach Frank: Pitfalls of Predicative Models in People AnalyticsEdunomica
 
Predictive Data Analytics to Help Your Customers
Predictive Data Analytics to Help Your CustomersPredictive Data Analytics to Help Your Customers
Predictive Data Analytics to Help Your CustomersExperian_US
 
Jones Lang Lasalle at The Chief Analytics Officer Forum, Europe
Jones Lang Lasalle at The Chief Analytics Officer Forum, EuropeJones Lang Lasalle at The Chief Analytics Officer Forum, Europe
Jones Lang Lasalle at The Chief Analytics Officer Forum, EuropeChief Analytics Officer Forum
 
Data Management as a Strategic Initiative for Government
Data Management as a Strategic Initiative for GovernmentData Management as a Strategic Initiative for Government
Data Management as a Strategic Initiative for GovernmentSAS Institute India Pvt. Ltd
 
1120 track 3 prendki_using our laptop
1120 track 3 prendki_using our laptop1120 track 3 prendki_using our laptop
1120 track 3 prendki_using our laptopRising Media, Inc.
 
Business analytics in healthcare & life science
Business analytics in healthcare & life scienceBusiness analytics in healthcare & life science
Business analytics in healthcare & life scienceSanjay Choubey
 
How To Improve Profitability & Outperform Your Competition: the Guide to Data...
How To Improve Profitability & Outperform Your Competition: the Guide to Data...How To Improve Profitability & Outperform Your Competition: the Guide to Data...
How To Improve Profitability & Outperform Your Competition: the Guide to Data...A.J. Riedel
 
Self-service Analytic for Business Users-19july2017-final
Self-service Analytic for Business Users-19july2017-finalSelf-service Analytic for Business Users-19july2017-final
Self-service Analytic for Business Users-19july2017-finalstelligence
 
Data Driven Decision Making Presentation
Data Driven Decision Making PresentationData Driven Decision Making Presentation
Data Driven Decision Making PresentationRussell Kunz
 

What's hot (20)

Data driven decision making
Data driven decision makingData driven decision making
Data driven decision making
 
Analytics Staffing Models of Health Systems That Compete Well Using Data
Analytics Staffing Models of Health Systems That Compete Well Using DataAnalytics Staffing Models of Health Systems That Compete Well Using Data
Analytics Staffing Models of Health Systems That Compete Well Using Data
 
Zach Frank: Pitfalls of Predicative Models in People Analytics
Zach Frank: Pitfalls of Predicative Models in People AnalyticsZach Frank: Pitfalls of Predicative Models in People Analytics
Zach Frank: Pitfalls of Predicative Models in People Analytics
 
High performance organisation
High performance organisationHigh performance organisation
High performance organisation
 
Predictive Data Analytics to Help Your Customers
Predictive Data Analytics to Help Your CustomersPredictive Data Analytics to Help Your Customers
Predictive Data Analytics to Help Your Customers
 
Jones Lang Lasalle at The Chief Analytics Officer Forum, Europe
Jones Lang Lasalle at The Chief Analytics Officer Forum, EuropeJones Lang Lasalle at The Chief Analytics Officer Forum, Europe
Jones Lang Lasalle at The Chief Analytics Officer Forum, Europe
 
Lingaro
LingaroLingaro
Lingaro
 
Data Management as a Strategic Initiative for Government
Data Management as a Strategic Initiative for GovernmentData Management as a Strategic Initiative for Government
Data Management as a Strategic Initiative for Government
 
1120 track 3 prendki_using our laptop
1120 track 3 prendki_using our laptop1120 track 3 prendki_using our laptop
1120 track 3 prendki_using our laptop
 
1120 track1 grossman
1120 track1 grossman1120 track1 grossman
1120 track1 grossman
 
Business analytics in healthcare & life science
Business analytics in healthcare & life scienceBusiness analytics in healthcare & life science
Business analytics in healthcare & life science
 
PWC at The Chief Analytics Officer Forum, Europe
PWC at The Chief Analytics Officer Forum, EuropePWC at The Chief Analytics Officer Forum, Europe
PWC at The Chief Analytics Officer Forum, Europe
 
How To Improve Profitability & Outperform Your Competition: the Guide to Data...
How To Improve Profitability & Outperform Your Competition: the Guide to Data...How To Improve Profitability & Outperform Your Competition: the Guide to Data...
How To Improve Profitability & Outperform Your Competition: the Guide to Data...
 
The Future of Information - Experian Knows Big Data Analytics
The Future of Information - Experian Knows Big Data AnalyticsThe Future of Information - Experian Knows Big Data Analytics
The Future of Information - Experian Knows Big Data Analytics
 
Analytics - Trends and Prospects
Analytics - Trends and ProspectsAnalytics - Trends and Prospects
Analytics - Trends and Prospects
 
Unlocking the Strategic Value of your Data
Unlocking the Strategic Value of your Data Unlocking the Strategic Value of your Data
Unlocking the Strategic Value of your Data
 
Mighty Guides- Data Disruption
Mighty Guides- Data Disruption Mighty Guides- Data Disruption
Mighty Guides- Data Disruption
 
Self-service Analytic for Business Users-19july2017-final
Self-service Analytic for Business Users-19july2017-finalSelf-service Analytic for Business Users-19july2017-final
Self-service Analytic for Business Users-19july2017-final
 
Data Driven Decision Making Presentation
Data Driven Decision Making PresentationData Driven Decision Making Presentation
Data Driven Decision Making Presentation
 
1115 track1 ramirez_whiting
1115 track1 ramirez_whiting1115 track1 ramirez_whiting
1115 track1 ramirez_whiting
 

Similar to How to source good data

Building a Data Quality Program from Scratch
Building a Data Quality Program from ScratchBuilding a Data Quality Program from Scratch
Building a Data Quality Program from Scratchdmurph4
 
Analytics from data to better decision
Analytics   from data to better decisionAnalytics   from data to better decision
Analytics from data to better decisionFrehiwot Mulugeta
 
NTEN Your Analytics doesn't have to be dramatic to be useful
NTEN Your Analytics doesn't have to be dramatic to be usefulNTEN Your Analytics doesn't have to be dramatic to be useful
NTEN Your Analytics doesn't have to be dramatic to be usefulAndrew Patricio
 
From Compliance to Customer 360: Winning with Data Quality & Data Governance
From Compliance to Customer 360: Winning with Data Quality & Data GovernanceFrom Compliance to Customer 360: Winning with Data Quality & Data Governance
From Compliance to Customer 360: Winning with Data Quality & Data GovernancePrecisely
 
Creating a Data-Driven Organization, Data Day Texas, January 2016
Creating a Data-Driven Organization, Data Day Texas, January 2016Creating a Data-Driven Organization, Data Day Texas, January 2016
Creating a Data-Driven Organization, Data Day Texas, January 2016Carl Anderson
 
Optimize Your Healthcare Data Quality Investment: Three Ways to Accelerate Ti...
Optimize Your Healthcare Data Quality Investment: Three Ways to Accelerate Ti...Optimize Your Healthcare Data Quality Investment: Three Ways to Accelerate Ti...
Optimize Your Healthcare Data Quality Investment: Three Ways to Accelerate Ti...Health Catalyst
 
Applying Data Quality Best Practices at Big Data Scale
Applying Data Quality Best Practices at Big Data ScaleApplying Data Quality Best Practices at Big Data Scale
Applying Data Quality Best Practices at Big Data ScalePrecisely
 
U5 a1 stages in the decision making process
U5 a1 stages in the decision making processU5 a1 stages in the decision making process
U5 a1 stages in the decision making processPeter R Breach
 
Creating a Data-Driven Organization, Crunchconf, October 2015
Creating a Data-Driven Organization, Crunchconf, October 2015Creating a Data-Driven Organization, Crunchconf, October 2015
Creating a Data-Driven Organization, Crunchconf, October 2015Carl Anderson
 
Creating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetupCreating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetupCarl Anderson
 
Your AI and ML Projects Are Failing – Key Steps to Get Them Back on Track
Your AI and ML Projects Are Failing – Key Steps to Get Them Back on TrackYour AI and ML Projects Are Failing – Key Steps to Get Them Back on Track
Your AI and ML Projects Are Failing – Key Steps to Get Them Back on TrackPrecisely
 
Big Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data QualityBig Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data QualityBigDataExpo
 
Building a Data Warehouse at Clover (PDF)
Building a Data Warehouse at Clover (PDF)Building a Data Warehouse at Clover (PDF)
Building a Data Warehouse at Clover (PDF)Otis Anderson
 
You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?Precisely
 
You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?Precisely
 
Data Quality: The Cornerstone Of High-Yield Technology Investments
Data Quality: The Cornerstone Of High-Yield Technology InvestmentsData Quality: The Cornerstone Of High-Yield Technology Investments
Data Quality: The Cornerstone Of High-Yield Technology InvestmentsshaileshShetty34
 
Data-Ed Webinar: Data Quality Success Stories
Data-Ed Webinar: Data Quality Success StoriesData-Ed Webinar: Data Quality Success Stories
Data-Ed Webinar: Data Quality Success StoriesDATAVERSITY
 
Data Science by Chappuis Halder & Co.
Data Science by Chappuis Halder & Co.Data Science by Chappuis Halder & Co.
Data Science by Chappuis Halder & Co.Genest Benoit
 

Similar to How to source good data (20)

Building a Data Quality Program from Scratch
Building a Data Quality Program from ScratchBuilding a Data Quality Program from Scratch
Building a Data Quality Program from Scratch
 
Analytics from data to better decision
Analytics   from data to better decisionAnalytics   from data to better decision
Analytics from data to better decision
 
do_dq.pdf
do_dq.pdfdo_dq.pdf
do_dq.pdf
 
NTEN Your Analytics doesn't have to be dramatic to be useful
NTEN Your Analytics doesn't have to be dramatic to be usefulNTEN Your Analytics doesn't have to be dramatic to be useful
NTEN Your Analytics doesn't have to be dramatic to be useful
 
From Compliance to Customer 360: Winning with Data Quality & Data Governance
From Compliance to Customer 360: Winning with Data Quality & Data GovernanceFrom Compliance to Customer 360: Winning with Data Quality & Data Governance
From Compliance to Customer 360: Winning with Data Quality & Data Governance
 
Creating a Data-Driven Organization, Data Day Texas, January 2016
Creating a Data-Driven Organization, Data Day Texas, January 2016Creating a Data-Driven Organization, Data Day Texas, January 2016
Creating a Data-Driven Organization, Data Day Texas, January 2016
 
Optimize Your Healthcare Data Quality Investment: Three Ways to Accelerate Ti...
Optimize Your Healthcare Data Quality Investment: Three Ways to Accelerate Ti...Optimize Your Healthcare Data Quality Investment: Three Ways to Accelerate Ti...
Optimize Your Healthcare Data Quality Investment: Three Ways to Accelerate Ti...
 
Applying Data Quality Best Practices at Big Data Scale
Applying Data Quality Best Practices at Big Data ScaleApplying Data Quality Best Practices at Big Data Scale
Applying Data Quality Best Practices at Big Data Scale
 
U5 a1 stages in the decision making process
U5 a1 stages in the decision making processU5 a1 stages in the decision making process
U5 a1 stages in the decision making process
 
Creating a Data-Driven Organization, Crunchconf, October 2015
Creating a Data-Driven Organization, Crunchconf, October 2015Creating a Data-Driven Organization, Crunchconf, October 2015
Creating a Data-Driven Organization, Crunchconf, October 2015
 
Creating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetupCreating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetup
 
Your AI and ML Projects Are Failing – Key Steps to Get Them Back on Track
Your AI and ML Projects Are Failing – Key Steps to Get Them Back on TrackYour AI and ML Projects Are Failing – Key Steps to Get Them Back on Track
Your AI and ML Projects Are Failing – Key Steps to Get Them Back on Track
 
Big Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data QualityBig Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data Quality
 
Building a Data Warehouse at Clover (PDF)
Building a Data Warehouse at Clover (PDF)Building a Data Warehouse at Clover (PDF)
Building a Data Warehouse at Clover (PDF)
 
Big Data: How does it fit in your data strategy?
Big Data: How does it fit in your data strategy?Big Data: How does it fit in your data strategy?
Big Data: How does it fit in your data strategy?
 
You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?
 
You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?
 
Data Quality: The Cornerstone Of High-Yield Technology Investments
Data Quality: The Cornerstone Of High-Yield Technology InvestmentsData Quality: The Cornerstone Of High-Yield Technology Investments
Data Quality: The Cornerstone Of High-Yield Technology Investments
 
Data-Ed Webinar: Data Quality Success Stories
Data-Ed Webinar: Data Quality Success StoriesData-Ed Webinar: Data Quality Success Stories
Data-Ed Webinar: Data Quality Success Stories
 
Data Science by Chappuis Halder & Co.
Data Science by Chappuis Halder & Co.Data Science by Chappuis Halder & Co.
Data Science by Chappuis Halder & Co.
 

Recently uploaded

From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 

Recently uploaded (20)

From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 

How to source good data

  • 1. Sourcing Good Data 10 best practices
  • 2. Welcome Why is data quality important? Our 10 best practices Agenda:
  • 3. Data Quality Story Overbooked 10,000 tickets for event Manual spreadsheet error - telegraph.co.uk
  • 4. Your data has reach… Where data from a report is used: % of data in spreadsheets that influences CEO * Panko and Port, 2012 Inter-departmental 69% Within department 31% 42%
  • 5. Just how much of an issue is data quality? 1 in 10 organisations rate their data quality as “excellent” Poor data quality accounts for 20% of business process costs $611bn The cost of poor data quality to US companies each year * Gartner, TDWI
  • 6. And we want more… 2009 – enough data to fill a stack of DVDs to the moon and back 2020 – Grow by 44x Less than 1% of available data is analysed 93% of execs believe they are losing revenue as a result of not fully leveraging the information they collect * IDC, Oracle and EMC 1% x44 by 2020
  • 7. What is data quality? HOW RELIABLE IS YOUR DATA? TRUSTED AND CREDIBLE Complete Accurate Available Consistent
  • 8. Why is data quality important? “It supports accountability” “It gives us accurate and timely information to manage our business” “It ensures the best use of our resources” “It increases our efficiency” “It reduces the cost of rework” “It can increase customer satisfaction” “It ensures we have the best possible understanding of our customers and employees” “It improves the success rate of enterprise initiatives like Business Intelligence…”
  • 9. Building high quality “supply chains” of data MEASURE FOR QUALITY GET THE RIGHT DATA BE AGILE
  • 10. Focus on the outcome Analysis Paralysis Letting data dictate what is “important” Limited time and energy to focus 1 ISSUES
  • 11. 1 Focus on the outcome Start with the outcome… …then the data. Focus on what matters RECOMMENDATIONS
  • 12. 2 Profile your data Data supplier doesn’t know your data needs The data you source is as good as the information you provide to the supplier… ISSUES
  • 13. 2 Profile your data Write your data profile Structure, Format, Frequency, Age, Delivery Method Communicate it to data providers Opportunity to identify issues and gaps RECOMMENDATIONS
  • 14. 3 Get as close to the source as possible When your source data is somebody else’s spreadsheet…. Human Error Risk Availability of data Unexpected Changes Additional effort and complexity ISSUES
  • 15. 3 Get as close to the source as possible CAUTION Be cautious of manual spreadsheets Skip the spreadsheet as a source PLAN Communicate and measure for quality RECOMMENDATIONS
  • 16. 4 Streamline data sources Using multiple sources Redundant data Increased complexity and quality risk ISSUES
  • 17. 4 Streamline data sources Identify redundant data Focus on the essentials Cut out the stuff you don’t need RECOMMENDATIONS
  • 18. 5 Set data quality expectations Perfectionism  Burnout You can’t expect to focus on everything ISSUES
  • 19. 5 Set data quality expectations Focus on high impact data Employ tolerances and ranges for quality and accuracy RECOMMENDATIONS RELAX (a little)
  • 20. 6 Catch data quality issues early Early $1 $10 $100 If found in the middle of the journey If found at the end Late of the journey * Total Quality Management If found at the start of journey 1-10-100 Rule: ISSUES
  • 21. 6 Catch data quality issues early Implement quality measures near the start of the data supply chain Use the “start” as a reference point when checking data further down the journey RECOMMENDATIONS
  • 22. 7 Actively measure quality ISSUES Invalid Assumption: If the data meets our expectations today, it will going forward No simple way to identify if data is correct What happens when we do find an issue?
  • 23. 7 Actively measure quality OK GOOD NOT GOOD Define metrics for your data quality Measure for quality on a consistent basis Address consistent issues with strategic solutions (e.g. data cleansing) RECOMMENDATIONS
  • 24. 8 Expect Change. Embrace It. We all know change is coming Business activity, changes in strategies and systems So rigid that you need to “reset” ISSUES
  • 25. 8 Expect Change. Embrace It. Likelihood Impact L H L H Score and rank potential changes Focus on high likelihood/impact changes Have a plan in place for high risk items RECOMMENDATIONS
  • 26. 9 Plan for change A change occurs, then what? Lack of clear policies and rules on who needs to do what… Knowledge resting in the minds of key individuals ISSUES
  • 27. 9 Plan for change RECOMMENDATIONS CAUTION In the event of a change the following people will… Policies and rules Documentation Tracking Changes
  • 28. 10 Controlled human interaction Value of human interaction with data… … at the cost of data quality Uncontrolled manipulation of data ISSUES
  • 29. 10 Controlled human interaction Avoid uncontrolled manipulation Facilitate controlled and discrete changes Make sure it is traceable RECOMMENDATIONS
  • 30. Recap 1 Focus on the outcome 2 Profile your data 3 Get close to the source 4 Streamline data sources 5 Set data quality expectations
  • 31. Recap 6 Catch data quality issues early 7 Measure quality 8 Expect and embrace change 9 Plan for change 10 Controlled human interaction