KBACE Data Quality Management Webinar

733 views

Published on

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
733
On SlideShare
0
From Embeds
0
Number of Embeds
5
Actions
Shares
0
Downloads
0
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

KBACE Data Quality Management Webinar

  1. 1. De-duplicate Dirty Data Now and Forever Using Oracle Data Quality Management Rita Beck, Senior Principal Consultant March 13th, 2009 1 © 2009 KBACE Technologies, Inc.
  2. 2. Agenda • Data Quality Management (DQM) Basics • DQM Tools • Smart Search • Batch Duplicate Identification • Conclusion 2 © 2009 KBACE Technologies, Inc.
  3. 3. Data Quality Management (DQM) Basics 3 © 2009 KBACE Technologies, Inc.
  4. 4. Why Use Data Quality Management? • Inconsistent Information • Inaccurate Financial Reporting • Customer Dissatisfaction • Inefficient Sales and Marketing 4 © 2009 KBACE Technologies, Inc.
  5. 5. Inaccurate Financial Reporting • Scenario 1 • Average Sales Volume = $100,000 Comp. aBc Company abc $100,000 $100,000 Company Abc $100,000 ABC Company Company abC $100,000 $100,000 Corp. ABC $100,000 5 © 2009 KBACE Technologies, Inc.
  6. 6. Inaccurate Financial Reporting • Scenario 2 • Average Sales Volume = $600,000 Company ABC $600,000 6 © 2009 KBACE Technologies, Inc.
  7. 7. Duplicate Customers Company Abc Company a.b.c. Comp. aBc ABC Company Company abC Corp. ABC 7 © 2009 KBACE Technologies, Inc.
  8. 8. What is Data Quality Management? • Prevents Future Duplicates from Entering the System • Manually or via Import • Identifies Existing Duplicates 8 © 2009 KBACE Technologies, Inc.
  9. 9. How Does DQM Work? • Transforms and Standardizes TCA Registry data • Copies standardized data into separate staged schema tables • Performs user-defined searches • Within the TCA Registry • Between the TCA Registry and other sets of data • Determines potential duplicate records 9 © 2009 KBACE Technologies, Inc.
  10. 10. DQM Tools 10 © 2009 KBACE Technologies, Inc.
  11. 11. Data Quality Management Tools • Word Replacements • Entities • Attributes • Transformations • Match Rules 11 © 2009 KBACE Technologies, Inc.
  12. 12. Data Quality Management Tools • Word Replacements • Entities • Attributes • Transformations • Match Rules 12 © 2009 KBACE Technologies, Inc.
  13. 13. Word Replacements • Provide Standardization Bob Robert Rob Robert Robbie Robert Roberto Robert Bobby Robert 13 © 2009 KBACE Technologies, Inc.
  14. 14. Word Replacements 14 © 2009 KBACE Technologies, Inc.
  15. 15. Data Quality Management Tools • Word Replacements • Entities • Attributes • Transformations • Match Rules 15 © 2009 KBACE Technologies, Inc.
  16. 16. Entities • Party • Address • Contact • Contact Point 16 © 2009 KBACE Technologies, Inc.
  17. 17. Data Quality Management Tools • Word Replacements • Entities • Attributes • Transformations • Match Rules 17 © 2009 KBACE Technologies, Inc.
  18. 18. Attributes • Derived from columns within the TCA Registry tables • Attributes make up an Entity • Used for matching purposes between an Input Record and the TCA Registry data 18 © 2009 KBACE Technologies, Inc.
  19. 19. Attributes 19 © 2009 KBACE Technologies, Inc.
  20. 20. Data Quality Management Tools • Word Replacements • Entities • Attributes • Transformations • Match Rules 20 © 2009 KBACE Technologies, Inc.
  21. 21. Transformations D’ Angello • Capitalize all letters D’ ANGELLO • Remove non-alphanumeric characters • Reduce all instances of white space D ANGELLO to a single white space • Remove double letters • Remove vowels except initial vowels D ANGELLO D ANGELO D ANGL 21 © 2009 KBACE Technologies, Inc.
  22. 22. Transformations 22 © 2009 KBACE Technologies, Inc.
  23. 23. Data Quality Management Tools • Word Replacements • Entities • Attributes • Transformations • Match Rules 23 © 2009 KBACE Technologies, Inc.
  24. 24. Match Rule Purposes • Search • Used for Search User Interfaces • Expanded Duplicate Identification • Used for Identifying and Preventing Duplicates • Bulk Duplicate Identification • Used for Identifying Duplicates 24 © 2009 KBACE Technologies, Inc.
  25. 25. Score Based Matching • Acquisition • Provides an initial set of potential matches • Scoring • Assigns scores to further filter matches 25 © 2009 KBACE Technologies, Inc.
  26. 26. Example Match Rule 26 © 2009 KBACE Technologies, Inc.
  27. 27. Smart Search 27 © 2009 KBACE Technologies, Inc.
  28. 28. What is Smart Search? • Used to identify records within the TCA tables that are potential duplicates of user entered data • Match Rule Purpose = Search 28 © 2009 KBACE Technologies, Inc.
  29. 29. Smart Search Process Step 1 Transformations TCA TCA Applied Staged Registry Schema DQM Staging Program 29 © 2009 KBACE Technologies, Inc.
  30. 30. Smart Search Process Step 2 Smart Search Match Rule Transformations Standardized User Input Applied User Input 30 © 2009 KBACE Technologies, Inc.
  31. 31. Smart Search Process Step 3 Smart Search Match Rule TCA Staged Acquisition and Duplicates Schema Scoring Between Match Criteria User Input and Thresholds and Applied TCA Registry Standardized User Input 31 © 2009 KBACE Technologies, Inc.
  32. 32. Smart Search Example: Existing TCA Records • TCA Registry Record #1 KBACE 6 Trafalgar Square Nashua, NH 03063 • TCA Registry Record #2 KBACE Technologies, Incorporated Six Trafalgar Sq. Neshua, NH 03063 32 © 2009 KBACE Technologies, Inc.
  33. 33. DQM Searching 33 © 2009 KBACE Technologies, Inc.
  34. 34. DQM Search Results 34 © 2009 KBACE Technologies, Inc.
  35. 35. Non-DQM Search 35 © 2009 KBACE Technologies, Inc.
  36. 36. DQM Smart Search 36 © 2009 KBACE Technologies, Inc.
  37. 37. Smart Search – Entering New Record 37 © 2009 KBACE Technologies, Inc.
  38. 38. Batch Duplicate Identification 38 © 2009 KBACE Technologies, Inc.
  39. 39. What is Batch Duplicate Identification? • Used to identify duplicate parties that already exist in the TCA Registry • Match Rule Purposes • Bulk Duplicate Identification • Expanded Duplicate Identification 39 © 2009 KBACE Technologies, Inc.
  40. 40. Batch Duplicate Identification Process Step 1 Transformations TCA TCA Applied Staged Registry Schema DQM Staging Program 40 © 2009 KBACE Technologies, Inc.
  41. 41. Batch Duplicate Identification Process Step 2 Bulk (or Expanded) Duplication Identification Match Rule TCA Staged Schema Acquisition and Scoring Duplicates (Self Join) Match Criteria Within and Thresholds TCA Registry Applied TCA Staged Schema 41 © 2009 KBACE Technologies, Inc.
  42. 42. Define Duplicate Identification Batch 42 © 2009 KBACE Technologies, Inc.
  43. 43. Duplication Identification Batch Results 43 © 2009 KBACE Technologies, Inc.
  44. 44. Duplication Identification Batch Details 44 © 2009 KBACE Technologies, Inc.
  45. 45. Duplication Identification Batch Results 45 © 2009 KBACE Technologies, Inc.
  46. 46. Products Using DQM Functionality 1. Marketing Online (AMS) 2. Receivables (AR) 3. Sales (ASN) 4. TeleSales (AST) 5. Customers Online (OCO) 6. Inventory (INV) 7. Lease Management (OKL) 8. Partner Management (PV) 9. Sales for Communications (XNC) 10. Healthcare Transaction Base (HTB) 11. CRM Foundation (JTF) 46 © 2009 KBACE Technologies, Inc.
  47. 47. Let Data Quality Management Work for You! • Enhance Search Results • Prevent Future Duplication • Identify and Merge Existing Duplicates 47 © 2009 KBACE Technologies, Inc.
  48. 48. Q U E S T I O N S A N S W E R S 48 © 2009 KBACE Technologies, Inc.
  49. 49. For Additional Information • For the recording and presentation, please visit: http://kbace.com/Services/Webinars.aspx • Contact Rita Beck at rbeck@kbace.com 49 © 2009 KBACE Technologies, Inc.

×