This document provides an overview of new features in IBM InfoSphere MDM version 11.3, including: improved integration with other IBM offerings like BigInsights; enhanced Salesforce integration; new healthcare, clinical data, and application licensing capabilities; improved DataStage integration; and core MDM enhancements. Key new features include probabilistic matching for BigInsights, Salesforce integration improvements, healthcare provider data warehousing enables, clinical data services, and application-based licensing options.
10 Trends Likely to Shape Enterprise Technology in 2024
Présentation IBM InfoSphere MDM 11.3
1. InfoSphere User Group France MDM v11.3 New Features
Aomar BARIZ Information Governance Client Technical Professional Mobile:+33 6 73 48 40 72 E-mail: Aomar.Bariz@fr.ibm.com
2. InfoSphere MDM – Release Timeline
eGA: June 2013
Target eGA: Q4 2014
eGA: Oct. 2013
This Release Target eGA: June 27, 2014
v11.0
refresh
vNext
v11.0
V11.3
•Continued Integration of Development organization (all MDM Flavors)
•Simultaneous delivery of all supported platforms
•Simultaneous release with a number of InfoSphere offerings (incl. Information Server)
2
3. •InfoSphere MDM probabilistic matching engine for BigInsights: Enhances bulk search and performance for BigInsights matching.
•Salesforce.com integration: Improves Salesforce.com experience by leveraging InfoSphere MDM for search, remediation, and integration with external data. Invokes virtual MDM.
•IBM Health Care Provider data warehouse: Enables patient-centric analytics by pushing a 360-degree view of the patient into the warehouse.
•Clinical data services: Integrate patient clinical data for research, care coordination, and wellness initiatives.
•Application-based licensing: Offers new licensing and pricing options when acquiring MDM in support of other IBM products.
•InfoSphere DataStage integration: Reduces implementation time and cost by speeding and simplifying integration development both for initial load and ongoing production integration.
MDM Industry /LOB Solutions
•Collaboration Server free text search: Improves user experience with a more intuitive search facility powered by IBM Watson Explorer.
•Virtual MDM performance: Enhances control of record linking to reduce the risk of system performance issues.
Core MDM Enhancements
Overview of Key InfoSphere MDM v11.3 Features
MDM for Big Data
IIG Portfolio Integration
3
4. InfoSphere MDM v11.3 (Bali) Collaborative MDM Free Text Search with Watson Explorer
4
5. Positioning – Free Text Search with Watson Explorer
5
What are the business or technical benefits of solving this problem(s)?
What problem(s) and for whom (role, industry) does this capability solve?
How is this capability different in solving this problem than competitive offerings?
How does this capability solve this problem(s)?
•Searching PIM systems is not intuitive and often requires training
•Search performance becomes an issue when dealing with large volumes
•Providing external access to PIM data is complex and costly
•All roles across all industries are impacted:
•Analysts, Admins, Stewards, Executives, etc
•Retail, Banking, Industrial, Telco, All
•Intuitive access to data with Google-like capabilities
•Improved search performance with larger volumes
•Externalize search and access outside of the PIM system
•Data is pushed to Watson Explorer and indexed for high performance
•New static search bar on every screen allows users to search by key words and phrases
•Results page allows for opening a single record or multiple records for mass edit
•Competitors are not currently offering a free text search capability
•Watson search leverages best in class Big Data capabilities for high performance
•Only offering to provide a true web experience for search.
10. Search can be externalized for Enterprise access
Search directly on Watson Explorer
10
11. Capabilities
Enable or disable through configuration file
Static search bar on every screen
Search examples
–Term
–Term1 OR Term2
–Term1 AND Term2 (Term1 Term2 also works)
–Catalog:CatalogName Term
–Combine the above for complex searches
Search across specific catalogs
Launch in single edit or bulk edit
Limited use license of Watson Explorer included in the v11.3 bundle
11
13. Matching Approaches
Deterministic Rules-based fuzzy matching
Apply logical rules sequentially or hierarchically
Deterministic Rules-based exact matching
Deterministic Scoring-based matching
Probabilistic Self-learning algorithms
Compare records attribute by attribute.
Assign a score for each attribute match.
If the total score is high enough, they match.
How do you decide how much a partial date match should be worth? An edit distance of 2? A nickname match?
Account for misspellings and typographical errors
Metaphone
Edit distance
The algorithm learns how to score attributes from the data itself
How common are partial date matches within your data?
How common are nicknames?
Tuned to your data
Big Data needs more sophisticated capability
13
14. Using out of the box fuzzy functions to enable accurate data searching/matching in your Hadoop environment
Nov 6,
Phonetics Mohammed vs. Mahmoud
Synonyms Andrew = Andy George = Jorge 1st = First
Abbreviations AIG = American International Group Road = Rd
Concatenation Van de Velde = Vandevelde
Misalignment Kim Jung-il = Kim il Jung
Edit Distance 867-5309 ~ 876- 5309
Region Specific トヨダ = トヨタ株式会社
Date Similarity 01/01/1973 ~ 01/03/1973
Proximity Geocodes and great-circle distance
Noise Words Roadster Inc. = Roadster
Typographical Errors John Smith vs. John Snith
14
15. C. Johnson 123 Main Street 512-545-1234
CRM
Supply Chain
Fulfillment
Support Ticketing
External Sources
3rd Party
Chris Johnston 123 Main Street 512-554-1234 Shipping: 456 Pine Ave
Christine. Johnson 123 Main Street Call length Semi-structured notes Satisfaction
C. Johnson Main Street 512-554-1234
C. Johnson 125 Main Street 512-554-1234
ChrisJohnson65 “Likes” Clothes, Camping Gear
@ChristyJohnson65
Christy65 Circle / Network data
Order Mgmt.
Internal / Structured
External / Unstructured
Web
Chris.johnson@cj.net
Big Match empowers customer analytics at Hadoop scale
Big Match matches all these records
Big Match combines the MDM probabilistic matching engine & pre-built algorithms & BigInsights for customer matching natively within Hadoop
Increased Value of Customer only if…
Christine Johnson Married 1 child 4/15/74
Christy65 Mail Order responder Specialty Apparel Partner Sales data
VIP: Gold Customer Sat: 80% Influence Score: 8/10
15
16. What is Big Match
Big Match allows you to run the MDM probabilistic matching engine natively within IBM’s open source Hadoop distribution (Infosphere BigInsights)
Your clients are implementing customer analytics projects using Hadoop today
Use Big Match to differentiate the IBM stack – no other vendor has it
Infosphere Master Data Management (Advanced Edition, Standard Edition, Collaborative Edition)
16
17. Big Match as a foundation of your customer analytics in Hadoop
Accurate – Matches via statistical learning algorithms based on your data (customer see improvements between 5-15%)
Simple & Fast Time to Value - Hours to use configurable pre-built customer algorithms, instead of weeks or months of developing code
Performance - Hours to match initial data sets of big data volumes via use of MapReduce distributed processing
Proven - Leverages the experience of over 10 years and 900 customers across worldwide deployments dealing with individuals and organizations
Is your client using Hadoop within customer analytics? Then they need Big Match
17
18. InfoSphere MDM v11.3 (Bali) IBM Stewardship Center for - Physical MDM, Individual Domain
19. Three takeaways
1.Deliver a differentiating, prescriptive user experience for LOB users
WHY?
1.LOB users need to explore and discover how master data can help their business
2.Knowledgeable LOB users make the most informed data quality decisions & their involvement increases their confidence in master data
20
Stewardship Center is a physical MDM application for LOB users and stewards
20. Positioning – IBM Stewardship Center
21
What are the business or technical benefits of solving this problem(s)?
What problem(s) and for whom (role, industry) does this capability solve?
How is this capability different in solving this problem than competitive offerings?
How does this capability solve this problem(s)?
•Data quality decisions are low quality because they do not include LOB user insight
•LOB users do not trust master data because they struggle to understand how their system’s data contributes to the golden record
•Stewardship managers struggle to show their team’s value & contribution to the business
•All roles across all industries are impacted:
•Analysts, Admins, Stewards, etc
•Retail, Banking, Industrial, Telco, All
•LOB discovers MDM value by browsing and investigating MDM, gaining new insights
•If issues are identified, LOB users can make master data updates directly
•Direct data quality decisions to the right LOB users at the right time
•Stewardship dashboard enables stewardship managers to demonstrate efficacy and make informed decisions to improve team performance
•IBM Design Thinking brings the prescriptive UX necessary to leverage knowledge workers
•Enable data quality users to collaborate using social and mobile features
•Business rules tailor which user is assigned a given task based on task type and entity segment
•The dashboard displays task breakdown and team/individual performance by reporting from the Stewardship Center’s data warehouse
•Prescriptive OOTB UX allows Stewards and LOB users to commune on data quality decisions using social collaboration and mobile
•Keep the business connected to master data with mobile stewardship, approval, & notifications
•Dashboard allows for the quick assessment the data quality metrics, team monitoring and rerouting
•Intelligent Inbox prioritizes work w/ auto-escalation
•Quickly extend or customize the Stewardship Center’s WF or UI using MDM AT & IBM BPM
•Options for Cloud deployment
21. 22
LOB Knowledge Workers
LOB Owners & Governance Team
IT Stewards
Web
Mobile
Social Collaboration
Data Quality Application
Business Processes
Analytics
Stewardship Center
Customer Centricity
Know Your Customer
Operational Excellence
Dashboard
Workflow & Rules
Comprehensive data quality application delivering business confidence
22. Capabilities needed to deliver data quality to the business?
Ensure most knowledgeable LOB users contribute in quality decisions
Provide LOB users business context and prescriptive experience
Align stewards and LOB users to efficiently remediate data quality tasks
Include the right participant at the right time within the data quality process
Ensure task ownership and accountability with traceability
Demonstrate team performance and provide management insight
23
23. Stewardship Center keeps the business connected to master data, driving ownership
LOB users
Explore, learn, and discover master data
–Discover relationships
–Data quality root cause analysis, take corrective action
–Master data survivorship
–Review/approval and notifications for critical data issues
–Stay connected with mobile stewardship
–Only view appropriate information
24
24. Stewardship Center increases data quality through LOB user and steward collaboration
Stewards and LOB user
Cross team connectedness with social collaboration
Data quality workflow assigns tasks to the right user at the right time
Automate common decisions reduce time and cost of human involvement
Increase throughput by including LOB users while infusing business knowledge into DQ decisions
Increase business confidence and ownership of master data
Prescriptive business tools for data maintenance and matching records
25
25. Stewardship Center provides visibility and insight to ensure effective team performance
Data Steward Manager
Dashboard optimized for data steward manager activities
Quickly assess areas of risk and take corrective action
Identify active stewards and commune
Track at risk/high priority tasks for better team mgmt and resource loading
Identify data quality trend/bottlenecks and make informed improvements
Ensure task ownership and accountability along with traceability
26
26. Complete task visibility
Identify data quality trends
View team and status
Commune with team
View team’s tasks and manage
Data Steward Management Dashboard
27
28. All CEOs; n = 229
Which technology-enabled capabilities will be an important area of investment to improve your business over the next five years?
Source: Gartner Report - CRM in a Sea of Change 2013
29
Gartner Technology Investment Survey 2013
29. What are the primary objectives of your 2013 CRM programs?
0 10 20 30 40 50
Improve customer data quality
Increase customer loyalty
Improve lead quality and conversion
Increase customer retention
Create a single view of the customer
Enhance cross-sell or upsell of products
and services
Increase customer satisfaction
Increase sales revenue
Increase acquisition of new customers
Enhance customer experience
Percentage of Respondents
Revenue
Information
Loyalty/
Satisfaction
Source: Gartner Report - CRM in a Sea of Change 2013
n = 190
Gartner CRM Survey 2013: Top 10 CRM Objectives in the U.S.
30
30. Source: Aberdeen Group, July 2011
A CRM assessment report published by Aberdeen in 2011 showed that Peak CRM Performance is directly related to the accuracy and availability of customer records
n = 261
Gartner believes that bad data quality is the #1 reason why CRM projects to fail
Data Quality and Accessibility – by Best in Class
31
33. Customer Relationship Mgmt
Improve win-rates & seller productivity
Contact
• Identify duplicate customer and prospect records & reduce duplication at the point of entry
• Find the right customer faster by leveraging advanced search capabilities from MDM
• Enrich customer data in Salesforce with collective knowledge from internal & external data sources
• Identify relationships between customers/entities
InfoSphere MDM can help organizations optimize client-focused
initiatives by delivering a Single View of Customer
34
34. An InfoSphere MDM powered Salesforce initiative can deliver real business benefits
35
35. InfoSphere MDM – SFDC solution capabilities
36
InfoSphere MDM Powered Probabilistic Search
Publish enriched master information from InfoSphere MDM to SFDC
Event Notifications
Search for Accounts from SFDC as well as from other sources
Enrich Account information in SFDC by leveraging the broader enterprise master information in InfoSphere MDM
Automatically receive MDM events in SFDC whenever a SFDC record gets added, updated or linked in MDM
Enhanced Security using WebSphere Cast Iron
Secure data and calls between InfoSphere MDM (on-premise) and SFDC (an external SaaS application)
Perform Bulk data ops using InfoSphere Information Server (SF-Pack)
Perform Bulk data movement from MDM to SFDC using Information Server (SFPack)
36. IBM MDM
No Inbound No In-out bound
IS-SP
HTTPS
JMS
Queue
HTTPS
SSL
Tunnel
1
2
3
1
Real-time Sync
2
Near Real-time Sync with reliable
queuing
3
Batch Import/Export using InfoSphere
Server Pack (IS-SP) for SFDC
Customer Firewall
InfoSphere MDM-Salesforce Integration Architecture
37
38. User Technologies
41
An MDM Hub is only useful when integrated with the “Extended” enterprise.
MDM projects require a high performance ETL solution for loading data into and extracting data from an MDM hub.
The ETL solution should be integrated so as to hide the complexity of the underlying MDM data model and to make integration easy.
IBM MDM by itself, not provide a fully integrated, end –to-end integration , quality and data governance for master data
The bundled 3rd party Clover ETL is just a point solution for ETL
Clover ETL is not integrated with other IBM IIS components
MDM bundles only IIS for Data Quality
IBM MDM by itself, does not provide end-to-end metadata management for MDM data
Changes in the MDM model are not automatically shared across integration components and governance tools.
MDM assets are not „natively‟ available to the Information Server Information Governance Catalog (IGC)
The Problem?
41
39. 42
IBM IIS Enterprise Edition bundled with IBM MDM (with license restrictions) providing end to end ETL, governance and data quality
Out-of-the-box MDM Connector Stage within InfoSphere DataStage/QualityStage Designer that simplifies MDM load and extract for DS/QS developers
InfoSphere MDM metadata in IIS drives automation and makes integration configurable so very little development is required for MDM data load and/or extract
Enables configurable data integration for MDM
Provides governance for the MDM information supply chain
Provides design lineage through MDM metadata in IGC
Clover ETL is removed from MDM V11.3 as a bundled component
Customers can obtain licensing and support directly from Javelin in order to use existing Clover Graphs with MDM V11.3
The Solution?
42
40. User Technologies
Build a MDM metadata model to describe assets in the IS repository Enables all tools in the suite, particularly IMAM, IGC, DataStage and DataClick to work with MDM assets
Build export functions into the MDM workbench Allows individual MDM users to document and manage their projects metadata assets for use by IS repository
Build a MDM Connector that can consume the MDM metadata and enable the memget and memput MDM interactions memget and memput interactions are broadly used to read and write data resp. Implement the MDM Connector over the Java Integration Stage Leverage the Parallel engine capability
Bundle Information Server Enterprise Edition with MDM Limited license terms Gives MDM customers the relevant Information Server functions out-of-the-box
How are we integrating?
43
41. Metadata Admin
DataStage/Quality Stage
Job Developer
XMeta
(MDM, ASCL)
MDM Developer
MDM Workbench
Exports the project metadata as XMI files for use by MDM Connector
XMI Files
SCM or DevOps Repository
Import XMI files, analyze, preview and upload to Metadata Server
IMAM Asset Manager
[ MDM Design MetaData ]
DataStage/ QualityStage Designer
Configure MDM Stage with MDM Hub Connection and other settings
Query MDM Model in XMeta
MDM Model Bridge
XMeta (DSX)
Persist Stage configuration in DSX Model
Compile, Deploy & Test Jobs
[ Job MetaData ]
MDM Connector Usage (Design time)
45
43. 47
Notable:
No support for Power Linux
Windows only supported for Standard Edition
WAS only (started previously)
No FireFox support for Collaborative Edition
Information Server v11.3 release has two delivery tiers (platforms)
Latest information online – Refer to InfoSphere MDM System Requirements on ibm.com
Component
Flavor
Version(s)
Notes
Operating System
AIX (Power)
Solaris (SPARC)
Linux (RHEL)
Linux (SLES)
zLinux (RHEL)
zLinux (SLES)
Windows
v6.1 and v7.1
v10
v6
v11
v6
v11
2008R2, 2012
X86-64 only
X86-64 only
SE ONLY
App Server
WAS
v8.5.5
Database
DB2 LUW
DB2 for z/OS
Oracle
v10.1, v10.5
v10.1, v11
11g R2, 12c
Web Browser
IE
FireFox
9, 10
ESR 24
Not CE
Other
Information Server
MQ
BPM
Portal Server
v11.3
v7.5
v8.5.0.1
V8.0.0.1
Tiered Release
InfoSphere MDM v11.3 – Supported Platforms
44. 48
Makes it easier for customers to deploy an MDM environment (e.g. Development)
There should not be an expectation that all deployment scenarios are supported with the Supporting Programs … Additional licensing (e.g. RAD) is often required!
Limitations/Restrictions:
–Primary Limitation – Only when in support of InfoSphere MDM
–See LI for others
Important Note: The LIs for the Supporting Programs is also in effect
Supporting Programs for InfoSphere MDM v11.3
IBM Rational Application Developer for WebSphere Software v9.0
IBM DB2 Enterprise Server Edition V10.5
IBM Content Integrator 8.6
IBM Cognos Business Intelligence V10.1.1
IBM Cognos Business Intelligence Modeling v10.1.1
IBM Cognos Business Intelligence Samples v10.1.1
IBM Cognos Supplementary Language Documentation v10.1.1
IBM Process Server Standard v8.5
IBM Process Server Standard for Non-production Environment v8.5
IBM Process Center Standard v8.5
IBM Process Designer v8.5
IBM InfoSphere Information Server Enterprise Edition v11.3 (for MDM Editions)
IBM InfoSphere Information Server for Data Quality v11.3 (for RDM and CDH Stand-alone)
IBM InfoSphere Data Explorer v9.0
IBM InfoSphere BigInsights Standard Edition v2.1.2
IBM InfoSphere Blueprint Director v2.2
IBM WebSphere Message Broker v8.0.0.3
IBM WebSphere Message Broker Connectivity for Healthcare v8.0
IBM WebSphere Application Server Network Deployment V8.5.5
IBM WebSphere Application Server Base 8.5.5
IBM WebSphere MQ V7.5
IBM WebSphere Portal Server 8.0.0.1
IBM Installation Manager & IBM Packaging Utility for Rational Software Development Platform v1.7
IBM Security Directory Server v6.3.1
IBM Support Assistant Data Collector v2.0.1
Supporting Programs – v11.3