Data Quality APAC Congress 2011 In Pursuit of Data Quality:  When the Business Demands Results Tom Kunz Data Manager, Downstream, Shell Finance Operations - Data Use this area for cover image (height 6.5cm, width 8cm)
Today’s Agenda   Who is Shell? Can a professional data organization exist in a big company? Can a practical data governance structure really be created? How can metadata accelerate data quality improvement?  Why would I want to use six sigma and lean techniques to solve data quality issues? Does knowing the cost of poor data quality really help? What are the key takeaways? In pursuit of Data Quality:  When the Business demands results
Business Overview Some Data About Shell Who is Shell? 1.0
Business Overview UPGRADER PLANT ON AND OFFSHORE OIL AND GAS REFINERY GAS TO LIQUIDS PLANT BIOFUELS PLANT CHEMICAL PLANT LNG LIQUEFACTION PLANT LNG REGASIFICATION TERMINAL WIND TURBINES POWER STATION CHEMICAL PRODUCTS USED FOR: Plastics Coatings Detergents REFINED OIL PRODUCTS (Bio) Fuels Lubricants Bitumen Liquefied petroleum gas GAS AND ELECTRICITY Industrial use Domestic use
FACTS AND FIGURES – SHELL PERFORMANCE IN 2009 Source: 2009 Annual Report 2010 data available March 15th
The problem The solution The new opportunities Can a professional data organization exist in a big company? 2.0
The Problem:  Does Data Quality Matter? Missing aircraft info could pose security threat NEW YORK (AP) — The  Federal Aviation Administration 's aircraft registry is missing key information on who owns one-third of the 357,000 private and commercial planes in the U.S. — a gap the agency fears could be  exploited by terrorists and drug trafficker s. While he served abroad, his credit was under siege Federal Reserve plays major role in fate of 2006 market Homeland Security contributed bad data to military intelligence database Greenspan is probably one of the most-intuitive economists because  he concluded the Fed had bad data . Mr. Baur said that those operating the database had  misinterpreted their mandate  and that what was intended as an antiterrorist database became, in some respects, a catch-all for leads on possible disruptions and threats against military installations in the United States, including protests against the military presence in Iraq. Report: Low oil spill estimates rested on "unexplained assumptions" Bad data? Infection Prevention groups reject federal Healthcare Associated Infections report.  'An  outdated and incomplete picture  of HAIs'  Faced with a critical federal report on the lack of progress against healthcare associated infections, the nation's leading infection prevention groups find themselves in the thankless position of having to challenge the methodology of the report without appearing to be in denial about HAIs.  A 2005 survey by the U.S. Public Interest Research Group found  79% of credit reports contained errors , and 25% contained enough mistakes to prevent the individual from obtaining credit. Once the credit system accepts bad data, it can be next to impossible to clear. The reports authors say they cannot tell if the low estimates actually slowed the response to the oil spill, but say they likely  undermined public confidence in BP  and the federal response team, regardless.
The Problem Fr a gm ent a tio n Here a touch… There a touch… Everywhere a touch, touch…
The Solution: Manage Data as a Process in Finance P0 Data risk management Data quality assurance Meta-data management Data lifecycle management Audit and reporting Controls & compliance  Assess, quantify and maximize the business value of enterprise data assets across the value chain (including suppliers, partners, customers)  Capture, use, maintain, archive and delete data Define, measure, improve, and certify the quality (accuracy, validity, completeness, timeliness) of data Identify, assess, avoid, accept, mitigate, or transfer out risks Identify and establish control requirements for data and ensure compliance (including privacy, security, regulatory aspects) Measure and monitor data quality, risks, and efficacy of governance Capture, use, maintain semantic definitions for business terms and data models Create Value
The Solution: A process-based data management organization Data Manager Data Manager Data Manager Business Facing Data Manager Process  Manager Process  Manager Process  Manager Process  Manager “ Certification” Upstream  Downstream Projects &  Technology Finance, HR,  Corporate, legal Businesses Process owners Accounts Aligned by Data Process Assets & Projects Organisation & People Real Estate Contracts Convenience Retail Products B2B Customers Card Customers Retail Site Customers Facilities and Equipment Materials and Services Vendors Procurement Contracts Lubes Products Etc… Data Competency Framework Data Teams
New Opportunities A Third-quartile data Top-quartile data Migrate:  De-fragment and migrate data activities into a single team of dedicated data professionals Operate & measure: Operate and measure end-to-end data process performance: KPIs, controls, quality standards. Improve: Continuously improve data quality by addressing processes, tools, capabilities, quality standards 1 2 3 B
The Impossible Dream The Long and Winding Road I’m A Believer Can a practical data governance structure really be created? 3.0
The Impossible Dream Where everything just works….. Footer: Title may be placed here or disclaimer if required. May sit up to two lines in depth. Business understands master data Business takes ownership for data quality Process designers are valued Continuous improvement is a mindset Results are more important than politics E2E process is understood Data gatherers know what to do Data processes are managed Feedback is welcomed
The Long and Winding Road Business Sponsored Go where the need is Keep the scope narrow Slippery Slopes Compromise Go slow at times … and then start  again
I’m a Believer Data Value Owner Data Gatherer Process Manager Data Operations Business understands master data and its processes Business takes ownership for data quality Process designers are valued Continuous improvement is a mindset Results are more important than politics E2E process is understood Data gatherers know what to do Data processes are managed Feedback is welcomed
What it is How we used it What we learned How can metadata accelerate data quality improvement? 4.0
Metadata:  What it is Data about Data Describes the contents of the information Provides documentation or information about a specific piece of information Include elements and attributes such as a name, size or type Can represent the location or ownership of the file Any other information that needs to be noted about the data Can be information about frequency or volume of updates
Metadata:  How we use it Fields with a significant number of updates in a given period Identification of fields not used in the design, but actually have data in them Fields critical to the success of a particular process but not covered by a current data quality standard
Metadata:  What we are learning… Frequency and number of updates to each field in the customer master Fields with data in them, but not used in the design Fields included in the data quality compliance standards Discover fields that are candidates for mass upload tools Reduce effort by no longer populating unused fields Identify which fields are not in data quality standards that should be Data about Data:
Danger:  Low Hanging Fruit! Structuring for success Delivering the goods Why would I want to use six sigma and lean techniques to solve data quality issues? 5.0
Danger:  Low Hanging Fruit! What happens when you pick it and it just grows back?
Structuring for Success Operations Improvement Logs Business Pain Points Prioritization of Improvement Projects Project Charter Project Charter Project Charter Operations Business Black Belt Coaching Develop Greenbelts Develop Greenbelts
Delivering the Goods Reducing Costs Increasing speed Improving quality
Everybody has a model What works for us When it just doesn’t matter…much Does knowing the cost of poor data quality really help? 6.0
Everybody has a model
What works for us – FMEA  (Failure Mode Effect Analysis) Cost of providing the data PLUS Cost of compliance to the standard VS. Cost of non-compliance to the standard (Requires RISK BASED analysis
When COPDQ just doesn’t matter…. much Business is energized Resources are available Hot spots are known
What are the key takeaways? 7.0 In Pursuit of Data Quality:  When the business demands results
Takeaways Footer: Title may be placed here or disclaimer if required. May sit up to two lines in depth. May appear on Title pg. In pursuit of Data Quality:  When the Business demands results
Q   &   A
 

Tom Kunz

  • 1.
    Data Quality APACCongress 2011 In Pursuit of Data Quality: When the Business Demands Results Tom Kunz Data Manager, Downstream, Shell Finance Operations - Data Use this area for cover image (height 6.5cm, width 8cm)
  • 2.
    Today’s Agenda  Who is Shell? Can a professional data organization exist in a big company? Can a practical data governance structure really be created? How can metadata accelerate data quality improvement? Why would I want to use six sigma and lean techniques to solve data quality issues? Does knowing the cost of poor data quality really help? What are the key takeaways? In pursuit of Data Quality:  When the Business demands results
  • 3.
    Business Overview SomeData About Shell Who is Shell? 1.0
  • 4.
    Business Overview UPGRADERPLANT ON AND OFFSHORE OIL AND GAS REFINERY GAS TO LIQUIDS PLANT BIOFUELS PLANT CHEMICAL PLANT LNG LIQUEFACTION PLANT LNG REGASIFICATION TERMINAL WIND TURBINES POWER STATION CHEMICAL PRODUCTS USED FOR: Plastics Coatings Detergents REFINED OIL PRODUCTS (Bio) Fuels Lubricants Bitumen Liquefied petroleum gas GAS AND ELECTRICITY Industrial use Domestic use
  • 5.
    FACTS AND FIGURES– SHELL PERFORMANCE IN 2009 Source: 2009 Annual Report 2010 data available March 15th
  • 6.
    The problem Thesolution The new opportunities Can a professional data organization exist in a big company? 2.0
  • 7.
    The Problem: Does Data Quality Matter? Missing aircraft info could pose security threat NEW YORK (AP) — The Federal Aviation Administration 's aircraft registry is missing key information on who owns one-third of the 357,000 private and commercial planes in the U.S. — a gap the agency fears could be exploited by terrorists and drug trafficker s. While he served abroad, his credit was under siege Federal Reserve plays major role in fate of 2006 market Homeland Security contributed bad data to military intelligence database Greenspan is probably one of the most-intuitive economists because he concluded the Fed had bad data . Mr. Baur said that those operating the database had misinterpreted their mandate and that what was intended as an antiterrorist database became, in some respects, a catch-all for leads on possible disruptions and threats against military installations in the United States, including protests against the military presence in Iraq. Report: Low oil spill estimates rested on "unexplained assumptions" Bad data? Infection Prevention groups reject federal Healthcare Associated Infections report. 'An outdated and incomplete picture of HAIs' Faced with a critical federal report on the lack of progress against healthcare associated infections, the nation's leading infection prevention groups find themselves in the thankless position of having to challenge the methodology of the report without appearing to be in denial about HAIs. A 2005 survey by the U.S. Public Interest Research Group found 79% of credit reports contained errors , and 25% contained enough mistakes to prevent the individual from obtaining credit. Once the credit system accepts bad data, it can be next to impossible to clear. The reports authors say they cannot tell if the low estimates actually slowed the response to the oil spill, but say they likely undermined public confidence in BP and the federal response team, regardless.
  • 8.
    The Problem Fra gm ent a tio n Here a touch… There a touch… Everywhere a touch, touch…
  • 9.
    The Solution: ManageData as a Process in Finance P0 Data risk management Data quality assurance Meta-data management Data lifecycle management Audit and reporting Controls & compliance Assess, quantify and maximize the business value of enterprise data assets across the value chain (including suppliers, partners, customers) Capture, use, maintain, archive and delete data Define, measure, improve, and certify the quality (accuracy, validity, completeness, timeliness) of data Identify, assess, avoid, accept, mitigate, or transfer out risks Identify and establish control requirements for data and ensure compliance (including privacy, security, regulatory aspects) Measure and monitor data quality, risks, and efficacy of governance Capture, use, maintain semantic definitions for business terms and data models Create Value
  • 10.
    The Solution: Aprocess-based data management organization Data Manager Data Manager Data Manager Business Facing Data Manager Process Manager Process Manager Process Manager Process Manager “ Certification” Upstream Downstream Projects & Technology Finance, HR, Corporate, legal Businesses Process owners Accounts Aligned by Data Process Assets & Projects Organisation & People Real Estate Contracts Convenience Retail Products B2B Customers Card Customers Retail Site Customers Facilities and Equipment Materials and Services Vendors Procurement Contracts Lubes Products Etc… Data Competency Framework Data Teams
  • 11.
    New Opportunities AThird-quartile data Top-quartile data Migrate: De-fragment and migrate data activities into a single team of dedicated data professionals Operate & measure: Operate and measure end-to-end data process performance: KPIs, controls, quality standards. Improve: Continuously improve data quality by addressing processes, tools, capabilities, quality standards 1 2 3 B
  • 12.
    The Impossible DreamThe Long and Winding Road I’m A Believer Can a practical data governance structure really be created? 3.0
  • 13.
    The Impossible DreamWhere everything just works….. Footer: Title may be placed here or disclaimer if required. May sit up to two lines in depth. Business understands master data Business takes ownership for data quality Process designers are valued Continuous improvement is a mindset Results are more important than politics E2E process is understood Data gatherers know what to do Data processes are managed Feedback is welcomed
  • 14.
    The Long andWinding Road Business Sponsored Go where the need is Keep the scope narrow Slippery Slopes Compromise Go slow at times … and then start again
  • 15.
    I’m a BelieverData Value Owner Data Gatherer Process Manager Data Operations Business understands master data and its processes Business takes ownership for data quality Process designers are valued Continuous improvement is a mindset Results are more important than politics E2E process is understood Data gatherers know what to do Data processes are managed Feedback is welcomed
  • 16.
    What it isHow we used it What we learned How can metadata accelerate data quality improvement? 4.0
  • 17.
    Metadata: Whatit is Data about Data Describes the contents of the information Provides documentation or information about a specific piece of information Include elements and attributes such as a name, size or type Can represent the location or ownership of the file Any other information that needs to be noted about the data Can be information about frequency or volume of updates
  • 18.
    Metadata: Howwe use it Fields with a significant number of updates in a given period Identification of fields not used in the design, but actually have data in them Fields critical to the success of a particular process but not covered by a current data quality standard
  • 19.
    Metadata: Whatwe are learning… Frequency and number of updates to each field in the customer master Fields with data in them, but not used in the design Fields included in the data quality compliance standards Discover fields that are candidates for mass upload tools Reduce effort by no longer populating unused fields Identify which fields are not in data quality standards that should be Data about Data:
  • 20.
    Danger: LowHanging Fruit! Structuring for success Delivering the goods Why would I want to use six sigma and lean techniques to solve data quality issues? 5.0
  • 21.
    Danger: LowHanging Fruit! What happens when you pick it and it just grows back?
  • 22.
    Structuring for SuccessOperations Improvement Logs Business Pain Points Prioritization of Improvement Projects Project Charter Project Charter Project Charter Operations Business Black Belt Coaching Develop Greenbelts Develop Greenbelts
  • 23.
    Delivering the GoodsReducing Costs Increasing speed Improving quality
  • 24.
    Everybody has amodel What works for us When it just doesn’t matter…much Does knowing the cost of poor data quality really help? 6.0
  • 25.
  • 26.
    What works forus – FMEA (Failure Mode Effect Analysis) Cost of providing the data PLUS Cost of compliance to the standard VS. Cost of non-compliance to the standard (Requires RISK BASED analysis
  • 27.
    When COPDQ justdoesn’t matter…. much Business is energized Resources are available Hot spots are known
  • 28.
    What are thekey takeaways? 7.0 In Pursuit of Data Quality: When the business demands results
  • 29.
    Takeaways Footer: Titlemay be placed here or disclaimer if required. May sit up to two lines in depth. May appear on Title pg. In pursuit of Data Quality:  When the Business demands results
  • 30.
    Q & A
  • 31.

Editor's Notes

  • #5 With around 102,000 employees in more than 100 countries and territories, Shell helps to meet the world’s growing demand for energy in economically, environmentally and socially responsible ways. UPSTREAM Our Exploration & Production business searches for and recovers oil and natural gas around the world. Many of these activities are carried out as joint venture partnerships, often with national oil companies. Our Gas & Power business liquefies natural gas and transports it to customers across the world. Its gas to liquids (GTL) process turns natural gas into cleaner-burning synthetic fuel and other products. It develops wind power to generate electricity and is involved in solar power technology. It also licenses our coal gasification technology, enabling coal to be used as a chemical feedstock and for more efficient generation of electricity. DOWNSTREAM Our Oil Sands business, the Athabasca Oil Sands Project, extracts bitumen – an especially thick, heavy oil – from oil sands in Alberta, western Canada, and converts it to synthetic crude oils that can be turned into a range of products. Our Oil Products business makes, moves and sells a range of petroleum-based products around the world for domestic, industrial and transport use. Its Future Fuels and CO2 business unit develops biofuels and hydrogen and markets the synthetic fuel and products made from the GTL process. It also leads company-wide activities in CO2 management. With around 45,000 service stations, ours is the world’s largest single-branded fuel retail network. Our Chemicals business produces petrochemicals for industrial customers. They include the raw materials for plastics, coatings and detergents used in the manufacture of textiles, medical supplies and computers.
  • #6 We are active in more than 90 countries and territories, and employ the equivalent of 101,000 full-time employees worldwide. (The number of people with Shell employment contracts is higher than 101,000, because the number includes part-time employees.) For more than 100 years our logo, the name ‘ Shell ’ and our distinctive red and yellow colours have identified the Shell brand. We believe that oil and gas will be integral to the global energy needs for economic development for many decades to come. Our role is to ensure that we extract and deliver them in economically, environmentally and socially responsible ways.