Gis Data Quality Profiles 2005

Loading...

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

0 comments

Post a comment

    Post a comment
    Embed Video
    Edit your comment Cancel

    2 Favorites

    Gis Data Quality Profiles 2005 - Presentation Transcript

    1. Address: Data Quality Dots on a map improve data quality Author: Peter Benza © 2005
    2. Input Files:
        • Duplicate input files or records must be merged for accurate plotting.
        • If not “dot stacking” and inaccurate aggregate summary datasets will result.
    3. Duplicates
      • Duplicates are present in virtually every database.
      • Geo-spatial tools help you recognize and eliminate duplicates to assure data quality.
    4. Data Quality: Example
      • Customer names and addresses in small geographical require special data processing.
      • Lets visualize (geocode) the name Jose Gonzalez across our enterprise from disparate source files.
    5. We isolated 4 variations in our customer master.
        • 4 customer names:
          • Jose Gonzal es
          • Jose D. Gonzal ez
          • J. Gonzal ez
          • Jose Gonzal ez
    6. Analysts Reported:
      • After identification and removal of duplicates:
      1 Customer, Not 4
    7. Using Best Practices:
      • If the customer database was limited to a “trade area”, and these 4 customer records were spatially plotted,
      • ...What should have been one dot on a map - would have been 4 dots. (An erroneous depiction.)
    8. Imagine The Impact:
      • were this actually an entire trade area!
        • Customer metrics
        • Market share
        • Compliance
    9. Customer Master: 1 Mile 1,683 Address Points (1 Mile) Multiple Input Databases? Pluralization Phonetic Matching Multiple Languages Vowel Management Initials: First, Middle, Last Same Surname Dots Address and Name Frequency First Name
    10. Gonzale xx : 1 Mile Map Symbols “ 7” Have A “J” “ 16” End With EZ vs. ES “ 22” Families
    11. Assessment Criteria... Pre-Directional Post Directional Suffix Name House Number Primary Address Secondary Address First, Middle, Last Gender Determination Case Management Input Files: Customer Master 1 Customer Master 2 Customer Master 3 Customer Master 4 Customer Master 5 Data Quality (Errors, Omissions) Address Elements Name Number Of Input Files Factors Considered: Phonetics Languages Match Control Data Enhancement Data Transformation Customer Keys
    12. What Does This Mean?
      • Simply plotting data, without cleansing the data, results in erroneous plotting – poor data quality.
      • Data quality software is critical and visualizing helps.
      • Each data quality function must be assessed for each client.
    13. Profiling A Database* Street Address Surname First Name Middle Initial *one mile radius
    14. Data Quality Tools...
      • Many vendors offer data quality tools today – which one is right for your organization?

    + pab571pab571, 3 years ago

    custom

    1519 views, 2 favs, 3 embeds more stats

    There are many different ways to assess the quality more

    More info about this document

    © All Rights Reserved

    Go to text version

    • Total Views 1519
      • 1376 on SlideShare
      • 143 from embeds
    • Comments 0
    • Favorites 2
    • Downloads 0
    Most viewed embeds
    • 89 views on http://enterprisedataquality.com
    • 53 views on http://datahygiene.wordpress.com
    • 1 views on http://enterprisedataquality.wordpress.com

    more

    All embeds
    • 89 views on http://enterprisedataquality.com
    • 53 views on http://datahygiene.wordpress.com
    • 1 views on http://enterprisedataquality.wordpress.com

    less

    Flagged as inappropriate Flag as inappropriate
    Flag as inappropriate

    Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

    Cancel
    File a copyright complaint
    Having problems? Go to our helpdesk?

    Categories