<Insert Picture Here> <I t Pi t H >Oracle Data Profiling and Quality 11gR1Oracle PartnerNetwork EnablementUgo Pollio, Principal Sales Consultant, DIS EMEAFX Ni l Nicolas, Principal P d t M P i i l Product Manager, DIS
Agenda g• Introduction to Data Quality• Oracle Data Profiling and Quality Overview• Data Quality Use Cases• Demonstration D t ti• Q&A
Managing the Information Sensitive Business Value of Trusted Information Why do I have so Why are my many duplicate Why can’t I get applications copies of data? access the data I referring to last Much of it is need for decision week’s numbers? inaccurate! making ? Accessible Up-to-Date Trusted Information Information Information• Available • Fast access • Accurate• Secure • Multiple sources • Consistent• Reliable • Actionable • Quality
What is Data Quality? y• Degree of Excellence of the Data• Process of making and keeping data in a state of completeness, validity, consistency, timeliness and accuracy that makes it appropriate for a specific use.
Example of Data Quality Issues p y Matching Records Non Standard formatsName Address City State Zip Phone EmailBob Williams 36 Jones Avenue Newton MA 02106 617 555 000 firstname.lastname@example.orgRobert Williams 36 Jones Av. MA 02106 617555000Burkes, Mike and Ilda 38 Jones av. Nweton MA 02106 617-532-9550 email@example.comJason Bourne, 76 East 51st Newton MA 617-536-5480 6175541329Bourne & Cie.… … … … … … … Mis-fielded data Multiple Names Mixed business Typos and contact Missing Data names
Two Facts about Data Quality y Clear view on data• The Data Quality Challenge is an iceberg • The biggest DQ threats are the ones we do not see. gg Q Data Profiling lowers the water line and draws a clear view of the quality issues Risk manageable Business rules tractable Known Data Issues Expectations clear E t ti l High business user involvement Suspected Data Issues Unexpected Data Issues Risk unmanageable Business rules unknowable Missed expectations Mi d t ti Little business user involvement
Two Facts about Data Quality y Data quality decays• Data value decays • Data is an asset which value decays over time y • Business events can make this worse • Quality is not a one shot p y process but a constant effort in the enterprise processes. Data Quality needs to be pervasive and continuous. Pervasive Oracle Data Integrator Supplies Standard, Inline Data Quality and Data Profiling Capabilities with Every ETL and E-LT Job Continuous Oracle Data Profiling and Oracle Data g Quality for Data Integrator support Integrated Workflow, Recycling, and Steward GUIs
Oracle Data Quality Products y• Oracle Data Integrator • Integrate Data between sources and targets with inline data g g integrity check • Call Data Quality processes• Oracle Data Profiling • Investigate quality issues• Oracle Data Quality for Oracle Data Integrator • Cleanse, Parse, Match & Merge , , g• Oracle Product Data Quality • Semantic Approach to product data quality
1. Investigate - Oracle Data Profiling g g ODP/ODQ Client • Profiling/Investigation • Quality monitoring • Investigate the Data • Structure • Content • Values, statistics, Sampling & S li frequencies, frequencies ranges Analysis Metabase • Data Relationships • Dependencies, keys, p y joins • Assess Data Compliance • Report & Alerts • Monitor Quality Over Time
2. Cleanse - Oracle Data Quality y ODP/ODQ Client • Quality Project Design • Rules Tuning • Proven, scalable DQ engines • Rich global content for cleansing, standardization, validation Metabase • Packaged Quality Rules • Delivered Out-of-the-Box Out of the Box by Oracle • For 60+ Countries • Extensible & Customizable Rules Route Parse Match Link MergeData Quality Server
3. Run with Oracle Data Integrator g Oracle Data Integrator • Moves & Transforms Data • Integration handled by • Calls Quality Processes ODI • Advanced quality processing handled by ODQQ • Pre-built Knowledge Modules • For Metadata Exchange • Tool for DQ process invocation Route Parse Match Link MergeData Quality Server
What’s New in ODP/ODQ 11gR1 g• Business Rules Library• User-Defined Templates User Defined• Enhanced International Support • Multi lingual client (user interface for design environment) Multi-lingual • Tuning facilities for global data, including double-byte data• Improved Geographic Support • E.g.: Expanded Japanese Postal Matcher Options • Latitude/Longitude appends Globally appends, Globally.• User Interface Enhancements
Data Quality for Business Intelligence Change Data into valuable Information • The Business Issue • BI Reports are not trustable, because of the state of source data DO NOT • Reduce risks TRUST THIS • Improve data q p quality by integrating y y g g DATA ! cleansing as part of the processProfiling• Investigate • Eliminate data redundancies Cleansing • Standardize, Enrich, Match • Improve Business Insights Control • Improved business insight with improved • Govern over time data quality q y • Better profiling of data to eliminate gaps in insight
Data Quality Dashboards in OBIEE yAttribute Analysis, Historical Analysis, DQ Stats
Data Quality Firewall y Profile, Repair, Check, Alert, Report Sources Database COBOL DQ Dashboard Copybooks Route Parse Match Link Merge Files• DWH, improving reliability and quality• New ERP/CRM installation, and legacy data Discarded integration Records R d• Master Data Management projects• Data synchronization projects Human Workflow
Data Quality for Migration ODP + ODQ + ODI CRM Route Parse Match Link Merge g COBOLCopybooks M&A Analyze Build & Test & g & Design Cleanse Validate MDM Iterations ERP 1 Assumptions 2 Identified Failure(s) DWH 3 Files Documentation Metadata Business InputToday Tomorrow The Truth Content Agreed Structure Plan (less risk) Scope Relationship Managed Resources Timescale Quality Productive Cost
For More Information Quote Attribution Title, Company Get Started Resources• Visit the Oracle Fusion Middleware 11g g • Datasheet: web site at http://www.oracle.com/products/middle http://www.oracle.com/goto/fmw11g/ind ware/odi/docs/odiee-datasheet.pdf ex.html • Blog:• Oracle Data Integration on oracle.com http://blogs.oracle.com/dataintegration www.oracle.com/goto/odi • Technical information available at:• Oracle Fusion Middleware on OTN http://www.oracle.com/technology/prod http://otn.oracle.com/middleware ucts/oracle-data-integrator/index.html• Information on GoldenGate: • Data Integration Events http://www.oracle.com/goldengate http://www.oracle.com/events
A particular slide catching your eye?
Clipping is a handy way to collect important slides you want to go back to later.