Information Management Trends and Some History

  • Products from the combined research and development efforts … blended with customer requirements … to meet specific sets of customer requirements. An example would be IDS Express … we had an existing product, IDS … but our customers wanted a lower-cost option with many of the same features as the full IDS product …
  • Information Management Trends and Some History

    1. 1. Information Management Trends and Some History C. Mohan , PhD IBM Fellow & IBM India Chief Scientist Member, IBM Software Group, Asset Architecture & Information Management Architecture Boards http://www.almaden.ibm.com/u/mohan/ [email_address]
    2. 2. Key Customer Pain Points <ul><li>Can’t Find Information – Discovery </li></ul><ul><li>Can’t combine Information – Integration </li></ul><ul><li>Can’t extract value from Information – Insight </li></ul><ul><li>Can’t consume Information – Dissemination </li></ul>
    3. 3. The Information Challenge Information is in Silos… Trusted Information is Not Available Today’s business challenges mandate a fresh approach to managing information Managing information in silos has become obsolete Multiple Versions of the Truth Inaccurate, Untimely Inconsistent Incomplete, Inaccessible Out of Context… Globalization, M&As Risk & Compliance, Eroding Customer Loyalty, Supply Chain Complexity, Industry Transformations, Cost Cutting… 70% of people’s time can be spent searching for relevant information 60%+ of CEOs: Need to do a better job leveraging information Sources: IBM Attributes & Capabilities Study, 2005; Client Interviews 2004; IBM CFO Study, 2006 5X More Value creation by organizations effective at using Information as an Asset Information Must Become a Strategic Asset
    4. 4. Information Management Trends <ul><li>Information Intensive Applications </li></ul><ul><ul><li>Shift from transaction-centric to information-intensive applications </li></ul></ul><ul><li>Information Diversity </li></ul><ul><ul><li>Delivering insight over increasingly diverse sources of information </li></ul></ul><ul><li>New Business & Delivery Models </li></ul><ul><ul><li>Information as a Service, Outsourcing, New Licensing Models </li></ul></ul><ul><li>Democratization of Information </li></ul><ul><ul><li>Changing User Expectations & the “Parent Test” </li></ul></ul><ul><li>Massive Collaboration & Societal Intelligence </li></ul><ul><ul><li>Collaboration over shared information to create business insight </li></ul></ul>
    5. 5. Information as a Strategic Asset Presentation Services EDW Legacy Legacy Portals, Browsers, and/or Devices Strategic APPL Event Processing Tactical APPL Tx APPL App Server Discovery APPL Master Data APPL Process Services Information Integration Services Analytic Services Master Data Services Transaction Application Services Analytic Application Services Business Process Management Federation Discovery Services ECW Content Services Collaboration Services Notes Email Enterprise Service Bus Metadata Services Master data Hubs Product Customer Supplier Location Transaction Services OLTP2 OLTP1 OLTP Business Rules Business Monitoring Streaming Batch Metadata
    6. 6. Master Data Compliance & Risk Mgmt. Sales and Marketing – Closed Loop Campaign Mgmt. Customer Service Data Stewardship & Administration Compliance Marketing Account Administration Privacy Management Web Self-Service Wireless Self-Service Distributor IVR Self-Service Branch / Sales Office Call Center Browser-based Unlimited Attributes Multiple Categorizations Multi-enterprise Standards-based Security and Audit New Business Processing Privacy and Data Mgmt. Marketing Insight Customer Facing Channels Internal Users Customer Master Data Integration
    7. 7. IBM Information Management Software Delivering Value Beyond Traditional Repositories <ul><li>Data Services </li></ul><ul><ul><li>Databases, Warehouses, Tools… </li></ul></ul><ul><li>Content & Discovery Services </li></ul><ul><ul><li>Content Mgmt. & Integration Services </li></ul></ul><ul><ul><li>Discovery Services… </li></ul></ul><ul><li>Information Integration Services </li></ul><ul><ul><li>Quality Services </li></ul></ul><ul><ul><li>Transformation Services </li></ul></ul><ul><ul><li>Federation Services </li></ul></ul><ul><ul><li>Metadata Services… </li></ul></ul><ul><li>Information Accelerators </li></ul><ul><ul><li>Master Data Management </li></ul></ul><ul><ul><li>Entity Analytics </li></ul></ul><ul><ul><li>Information Warehousing </li></ul></ul><ul><ul><li>Customizable Dashboards </li></ul></ul><ul><ul><li>Industry Data Models… </li></ul></ul>Information Delivered On Demand Based on Services Oriented Architecture
    8. 8. DB2 9 – A Pure XML, Relational Hybrid XML Developer “I see a sophisticated XML repository that also supports SQL.” SQL Developer “I see a sophisticated RDBMS that also supports XML.” Familiar Programming Models Optimized Storage Models Mature Services Familiar Tooling Optimized Performance & Scale
    9. 9. Integration of XML & Relational Capabilities DB2 SERVER CLIENT SQL/XML XQuery DB2 Engine XML Interface Relational Interface XML DB2 Storage : DB2 Client / Customer Client Application <ul><ul><li>Applications combine XML & relational data </li></ul></ul><ul><ul><li>Native XML data type (server & client side) </li></ul></ul><ul><ul><li>XML Capabilities in all DB2 components </li></ul></ul>Relational
    10. 10. DB2 V9 pureXML support XQuery SQL/XML APIs/Client XML Indexes XML Schema support Native Storage XML Load Import/Export <ul><li>Native XML support in DB2 with more to come </li></ul><ul><li>Seamless integration with the relational world </li></ul>New XML Join Methods Tools And all the relational stuff
    11. 11. DB2 V9 pureXML support <ul><li>XML as a native data type </li></ul><ul><li>Pure XML storage and indexing </li></ul><ul><li>XQuery and SQL/XML support </li></ul><ul><li>XML Schema Repository </li></ul><ul><li>Schema validation </li></ul><ul><li>Application Support (Java, C/C++, .NET, PHP, etc.) </li></ul><ul><li>Visual Tooling, Control Center Enhancements </li></ul><ul><li>Annotated schema shredding </li></ul><ul><li>DB2 Utilities: Import/Export, HADR, etc. </li></ul><ul><li>… and more </li></ul>Secure and Resilient Infrastructure for a New Breed of Agile Applications DB2 9
    12. 12. Some of Our Info Mgmt Research Legacy <ul><li>Invention of Relational Model/Technology & SQL </li></ul><ul><li>Research prototypes </li></ul><ul><ul><li>System R </li></ul></ul><ul><ul><li>R* Distributed DBMS </li></ul></ul><ul><ul><li>Starburst Extensible Object-Relational DBMS </li></ul></ul><ul><ul><li>Garlic Heterogeneous DBMS </li></ul></ul><ul><li>Product Contributions </li></ul><ul><ul><li>Data sharing on DB2 390 Sysplex </li></ul></ul><ul><ul><li>DB2 UDB Query Processor </li></ul></ul><ul><ul><li>Intelligent Miner </li></ul></ul><ul><ul><li>Lotus Notes R5 Recovery </li></ul></ul><ul><ul><li>Discovery Link & DB2 Information Integrator </li></ul></ul><ul><li>6 IBM Fellows from team of < 50 </li></ul>
    13. 13. Why We Have Experience with Customers <ul><li>Over 2 decades of partnership with SWG Toronto & SVL </li></ul><ul><ul><li>Incorporation of Starburst prototype into DB2 </li></ul></ul><ul><ul><li>Component Owners of DB2 for LUW’s Query Compiler </li></ul></ul><ul><ul><li>Versions 2 – 5 (1992-1997) </li></ul></ul><ul><ul><li>Dealt with customer APARs, Visits, & Presentations </li></ul></ul><ul><li>Responsible for many DB2 innovations </li></ul><ul><ul><li>Query Graph Model (internal query representation, key to extensibility) </li></ul></ul><ul><ul><li>Query ReWrite and Optimizer technology </li></ul></ul><ul><ul><li>ARIES recovery and locking methods </li></ul></ul><ul><ul><li>Triggers and Constraints </li></ul></ul><ul><ul><li>Star Join and Hash Join </li></ul></ul><ul><ul><li>Object-relational features </li></ul></ul><ul><ul><li>Automatic Summary Tables (materialized views) </li></ul></ul><ul><ul><li>Visual Explain </li></ul></ul><ul><ul><li>Index Advisor </li></ul></ul><ul><li>Respected for our vision </li></ul><ul><ul><li>World-class publications in leading database conferences </li></ul></ul><ul><ul><li>Cognizant of industry trends </li></ul></ul>
    14. 14. Leveraging Technology and People IMS Development DB2 Development IDS / U2 Development Customer Requirements IBM Products IBM Research
    15. 15. SVL DB2 UDB for z/OS & OS/390 IMS Business Intelligence Content Management DB2 Everyplace Red Brick Icing Traditional AD Languages Boeblingen DB2 Text Extenders SAP/R3 Enablement Intelligent Miner for Data Intelligent Miner for Text Somers Hawthorne Advanced Technology Almaden Advanced Technology Austin GBIS Portland XPS & DB2 Lenexa IDS Boulder & Denver Content Management U2 Datablades Boca Raton & Miami EMMS LA Informix Support Rochester DB2 UDB for AS/400 Toronto DB2 UDB for UNIX, Windows, & OS/2 IBM Information Management Teams Beijing Information Integration DB2 for zOS Content Management DB2 and IMS tools Las Vegas Entity Analytics Over 6000 employees worldwide Menlo Park & Oakland IDS XPS JDBC Visionary Cloudscape Datablades Object Connect & Translator Content Management India DB2 UDB Service Business Intelligence IDS Yamato High Speed Inverted Index Search Business Intelligence Content Management Hursley Enterprise Master Data Solutions <ul><li>India Software Lab </li></ul><ul><ul><li>3000 employees </li></ul></ul><ul><ul><li>Broad range of skills – all SWG Brands </li></ul></ul><ul><ul><li>Linux Competency Center </li></ul></ul><ul><li>DB2 Lab within ISL </li></ul><ul><ul><li>100+ developers </li></ul></ul><ul><ul><li>Lab based services teams – DB2, CM, BI </li></ul></ul><ul><li>Other Resources </li></ul><ul><ul><li>India Research Lab </li></ul></ul><ul><ul><li>Solution Porting Center </li></ul></ul><ul><ul><li>Education Center for IBM Software </li></ul></ul><ul><ul><li>IBM Academic Initiative </li></ul></ul>
    16. 16. A Spectrum of Info Serving Requirements Platform: Mobile Desktop Small Servers Large Servers Data Size: Micro Compact Large Extremely Large Workload: Batch Online Transactions Real-time Analysis Data Mining Structure: Hierarchical Relational Multi-Value XML OS: Symbian PalmOS Windows Linux Unix i5/OS z/OS Scope: Embedded Intra-application Single application Multi-application Support: None Web/E-mail Business hours 24x7
    17. 17. Products to Match the Spectrum of Data Serving Needs DB2 Everyplace OLTP Relational Mobile Embedded Linux PalmOS Symbian Cloudscape OLTP Relational Intra-App / Single-App Java IDS OLTP Relational Intra-App / Single-App AIX, etc. Linux Windows DB2 OLTP & Analysis Relational & XML Single / Multi-App z/OS I5/OS AIX, etc. Linux Windows IMS OLTP Hierarchical Single / Multi-App z/OS U2 OLTP Multi-Value Intra-App / Single-App AIX, etc. Linux Windows Superior capabilities across the spectrum of requirements
    18. 18. DB2 for z/OS <ul><li>The power and function of an open, industry standard data server with zSeries’ industry leading availability, performance, and security </li></ul><ul><li>What it takes to be the industry’s most extreme data server </li></ul><ul><li>Continuous application availability measured in years </li></ul><ul><li>Ability to process over 1B SQL transactions per hour </li></ul><ul><li>Uninterrupted growth from 1 byte to over a peta-byte </li></ul><ul><li>Serving 100s of applications for 100,000s of users </li></ul><ul><li>US Government’s highest security classification (zSeries) </li></ul><ul><li>Support for industry standards: XML, Web services, Java, C, COBOL </li></ul><ul><li>Support for complex business applications: SAP, PeopleSoft, Siebel </li></ul>Extreme qualities of service XML and Relational data server
    19. 19. Technology Evolution with Mainframe Specialty Engines Integrated Facility for Linux (IFL) 2001 IBM System z9 Integrated Information Processor (IBM zIIP) planned for 2006 System z9 Application Assist Processor (zAAP) 2004 <ul><li>Building on a strong track record of technology innovation with specialty engines, IBM intends to introduce the System z9 Integrated Information Processor </li></ul><ul><li>Support for new workloads and open standards </li></ul><ul><li>Designed to help improve resource optimization for eligible data workloads within the enterprise </li></ul><ul><li>Centralized data sharing across mainframes </li></ul><ul><li>Incorporation of JAVA into existing mainframe solutions </li></ul>Internal Coupling Facility (ICF) 1997
    20. 20. Data Challenges <ul><li>Variety, Velocity, and Volume </li></ul><ul><li>New composite applications need data from multiple sources </li></ul><ul><ul><li>Consumers expect holistic, personalized, and value-added content </li></ul></ul><ul><ul><li>Relational, XML, packaged applications, content repositories, file systems all contain critical business information </li></ul></ul><ul><li>Increasing emphasis on current data </li></ul><ul><ul><li>Real-time analytics </li></ul></ul><ul><ul><li>Business activity monitoring </li></ul></ul><ul><li>Petabytes will be the measure of available online data </li></ul><ul><ul><li>All client interactions are important ( e.g., instant messages, audio records, web traffic,…) </li></ul></ul><ul><ul><li>Internet and intranet content </li></ul></ul>The world produces 250MB of information every year for every man, woman and child on earth. 10-100GB 100s GB - 1TB 1 - 20 GBs 100s MB 100s KB 1999 1s TB 1s TB 100s TB 100s TB 1s TB 1s TB 10s GB 10s GB 1s GB 1s GB 2004 10X 100X 100X 1,000X 10,000X Common Database Sizes Common Database Sizes Transactions Warehouses Marts Mobile Pervasive 37% CGR Disk Growth ’96-’07 70,000 TB of TV and Radio content in 2002 alone; 30% growth/year
    21. 21. Addressing the Changing Characteristics of Data Actionability Heterogeneity Scale Satellite & Surveillance Images and Video Gene Sequences Transactions Text and Web Increasing need to manage and analyze new data types Protein Folding
    22. 22. Research in Information and Interaction Drive our leadership technologies for search, structured and unstructured information processing and analytics, natural language processing, and conversational and multimodal interaction, across multiple tiers of business activities in SWG products and solutions. Foster the exploitation of components with these leading research technologies in IGS services offerings. CM Information Retrieval NLP Analytics Video Analysis Conversational and Multimodal Interactions Unstructured Information Management Information Management Database Synthesis Information Integration Metadata Speech Recognition
    23. 23. Worlds of Structured & Unstructured Data Come Together Analytical Complexity Collect Store Retrieve Drill Mine ETL Warehouse SQL OLAP Cluster, Classify, .. Crawl ECM Search Navigate Cluster, Classify, .. Solutions II Structured Data Unstructured Data
    24. 24. Need for Business Intelligence <ul><li>Loyalty </li></ul><ul><li>Profitability </li></ul><ul><li>Buyer Behavior </li></ul><ul><li>Targeted Offers </li></ul>Homeland Security <ul><li>Internet Buzz </li></ul><ul><li>Anti-Money Laundering </li></ul><ul><li>Border Control </li></ul><ul><li>Crime Information </li></ul><ul><li>Globalization </li></ul><ul><li>Business Controls </li></ul><ul><li>Mergers and Acquisitions </li></ul><ul><li>Supply Chain Efficiencies </li></ul>Accountability and Compliance Customer Knowledge Business Performance <ul><li>Risk Management </li></ul><ul><li>Fraud and Abuse </li></ul><ul><li>Public Protection </li></ul>HIPAA Basel II Patriot Act Sarbanes-Oxley Capitalism and Its Troubles: A Survey of International Finance -May 24, 2002 Preparing for terror How scared should you be? Nov 28th 2002 From The Economist print edition
    25. 25. SOA Reference Architecture Business Innovation & Optimization Services Development Services Integrated environment for design and creation of solution assets Manage and secure services, applications & resources Facilitates better decision-making with real-time business information IT Service Management Infrastructure Services Optimizes throughput, availability and performance Partner Services Business App Services Access Services Connect with trading partners Build on a robust, scalable, and secure services environment Facilitates interactions with existing information and application assets Interaction Services Process Services Information Services Enables collaboration between people, processes & information Orchestrate and automate business processes Manages diverse data and content in a unified manner ESB Facilitates communication between services Apps & Info Assets
    26. 26. Information as a Service The SOA Lifecycle Mapped to Information Needs <ul><li>Understand Information Assets and Link to Business Context </li></ul><ul><ul><li>Discover information metadata </li></ul></ul><ul><ul><li>Map information to business processes </li></ul></ul><ul><ul><li>Develop data & content models </li></ul></ul><ul><li>Compose Information Services Across Heterogeneous Sources </li></ul><ul><ul><li>Extract, federate & transform heterogeneous information </li></ul></ul><ul><li>Service Information Requests </li></ul><ul><ul><li>Deliver unified data & content </li></ul></ul><ul><ul><li>Deliver business context </li></ul></ul><ul><ul><li>Discover relationships </li></ul></ul><ul><li>Ensure Performance, Availability & Security Meet Service Levels </li></ul><ul><li>Define & Refine Information Management Rules & Policies </li></ul><ul><ul><li>Monitor information usage over time </li></ul></ul>
    27. 27. Information as a Service Moving From a Project-Based to a Flexible Architecture (SOA) and more… abc… DB2 IBM Content Manager Oracle xyz… Heterogeneous Applications & Information Insight Information as a Service Optimize, Virtualize, Integrate, Accelerate Master Data, Entity Analytics, Decision Portals, Executive Dashboards, Industry Data Models Extracted or Real-time Standards-based e.g., XQuery, JSR170, JDBC, Web Services... Processes People Tools & Applications Data & Content Business Context Insightful Relationships
    28. 28. Information Services for SOA Unprecedented Business Flexibility <ul><li>Store Information </li></ul><ul><ul><li>DB2 Viper </li></ul></ul><ul><ul><ul><li>Optimized XML storage </li></ul></ul></ul><ul><li>Virtualize Information Access </li></ul><ul><ul><li>WebSphere Information Server </li></ul></ul><ul><li>Integrate Information </li></ul><ul><ul><li>WebSphere Information Server </li></ul></ul><ul><li>Accelerate Master Information </li></ul><ul><ul><li>WebSphere Customer Center </li></ul></ul><ul><ul><li>WebSphere Product Center </li></ul></ul><ul><ul><li>IBM Entity Analytics </li></ul></ul><ul><ul><li>Industry Models </li></ul></ul>
    29. 29. Industry Solutions Deliver Insight On Demand <ul><li>Law Enforcement </li></ul><ul><ul><li>Crime Information Warehouse </li></ul></ul><ul><ul><li>Entity Resolution </li></ul></ul><ul><ul><li>Anti Money Laundering </li></ul></ul><ul><li>Banking </li></ul><ul><li>Basel II and Banking Data Warehouse </li></ul><ul><li>Entity Resolution </li></ul><ul><li>Health Care </li></ul><ul><li>Aligned Clinical Environment </li></ul><ul><li>Retail </li></ul><ul><li>RFID </li></ul><ul><li>Retail Data Model </li></ul><ul><li>Telco </li></ul><ul><li>Telco Data Warehouse </li></ul><ul><li>Insurance </li></ul><ul><li>Customer Insight </li></ul><ul><li>IIW </li></ul><ul><li>Automotive </li></ul><ul><li>Quality Insight Early Warning </li></ul><ul><li>Life Sciences </li></ul><ul><li>Drug Discovery </li></ul>
    30. 30. OmniFind Key Technologies Content Crawling <ul><li>Scalable Web crawler </li></ul><ul><li>Data Source crawlers </li></ul><ul><li>Content Push </li></ul>Parsing/ Tokenizing <ul><li>HTML/XML </li></ul><ul><li>200+ Doc Filters </li></ul><ul><li>Advanced Linguistics </li></ul>Search Collections Categorization <ul><li>Taxonomy </li></ul><ul><li>Rule-based </li></ul>Annotation <ul><li>Text Analytics </li></ul><ul><li>Plug-in </li></ul>Indexing <ul><li>Global Analysis </li></ul><ul><li>Static Ranking </li></ul><ul><li>Store </li></ul><ul><li>Dynamic Ranking </li></ul><ul><li>Fielded Search </li></ul><ul><li>Dynamic Summary </li></ul><ul><li>Parametric Search </li></ul><ul><li>Spell Checking </li></ul>Searching Security
    31. 31. Content Management Portfolio Strategy <ul><li>Capture, store, and manage all forms of content </li></ul><ul><li>Complete and scalable, content management functionality </li></ul><ul><ul><li>Document management </li></ul></ul><ul><ul><li>Image management </li></ul></ul><ul><ul><li>Digital asset management </li></ul></ul><ul><ul><li>Report management </li></ul></ul><ul><ul><li>Web content management </li></ul></ul><ul><ul><li>Records management </li></ul></ul><ul><ul><li>Digital rights management </li></ul></ul><ul><ul><li>Email/Messaging archiving and management </li></ul></ul><ul><ul><li>Collaboration tools </li></ul></ul><ul><ul><li>… </li></ul></ul><ul><li>Enterprise-scale business process management </li></ul><ul><li>Cross-portfolio, out-of-the-box integration </li></ul><ul><li>Rich, common client platform </li></ul>
    32. 32. IBM Content Management Platform Roadmap 4Q2004 1Q2005 2005 2006 … and Beyond WebSphere Portal V5.1 Embeds DB2 Content Manager Runtime Edition (JCR) Records Manager V4.1.1 A Dynamic RM Infrastructure Workplace Web Content Management V2.0 Leveraging DB2 Content Manager and WebSphere Portal Framework DB2 Content Manager V8.3 Enhance Doc Routing Enable BPM Extend Integration Capabilities Seamless RM DB2 Document Manager V8.3 Compliance/RM Extending Native Language Support DB2 CommonStore V8.3 Full-Text Search Seamless RM First Step ECM Unified Client New Portlets J2EE Web Components Extend to DPM Extend Document Management Email/Messaging Archiving and Management Enhancements Physical Records Management Virtual Records Management WCM Leveraging Workplace and DB2 Content Manager Runtime (JCR) Common Content Repository Workplace Unified End-User Experience (Client) Event Framework Integrated / Interoperable DPM/BPM Extended ECM Capabilities as Add-On Features Enterprise JCR IBM CM SDK Enterprise Content Integration – JSR170 DB2 Content Manager Runtime in ISV Applications LDDM* Fully Supports JSR170 Autonomic Capabilities Content Preservation Content Intelligence Pervasive Enablement … and More * Lotus Domino Document Manager
    33. 33. Query Optimization <ul><li>Industry-Leading Optimization </li></ul><ul><li>Extensible – SQL to XQuery! </li></ul><ul><li>Optimizes for Parallel </li></ul><ul><ul><li>I/O accesses </li></ul></ul><ul><ul><li>Within a node (SMP) </li></ul></ul><ul><ul><li>Between nodes (MPP) </li></ul></ul><ul><li>Powerful for complex OLAP & BI queries </li></ul><ul><li>Industry-Strength Engineering </li></ul><ul><li>Portable </li></ul><ul><ul><li>Across HW & SW platforms </li></ul></ul><ul><ul><li>Databases of 1 GB to > 300 TB </li></ul></ul><ul><li>Continuing "technology pump" of improvements from Research </li></ul>
    34. 34. Unstructured Information Management Architecture <ul><li>Common Research infrastructure for advancing Text Analysis and NLP capability </li></ul><ul><ul><li>Promotes re-use of best-of-breed components </li></ul></ul><ul><ul><li>Promotes combination hypothesis through ease of integration </li></ul></ul>Unstructured Information Application Libraries Specialized Application Libraries Provide basic functions common to a broad class of application libraries & applications (e.g. Glossary Extraction Taxonomy Generation, Classification, Translation, etc.) Question Answering e-Commerce Semantic Search Engine Token and Concept Indexing Query Key words, concepts, spans, ranges -> Ranked Hit List National & Intelligence Business Bioinformatics Technical Support Document & Meta Data Store Documents with meta data based on key-value pairs Enables view & collection management (Text) Analysis Engine (TAEs) Combination of analysis engines employing a variety of analytical techniques and strategies Structured Knowledge Access Knowledge Source Adapters - (KSAs) deliver content from many structured knowledge sources according to central ontologies Collection Processing Manager KSA Directory Service Dynamic query & delivery of KSAs TAE Directory Service Dynamic query & delivery of TAEs UIMA Standard Application Libraries Relevant Application Knowledge UIM Solutions Structured Data
    35. 35. Analytics bridge the Unstructured & Structured worlds Unstructured Information UIMA High-Value Most Current Content Fastest Growing BUT ... Buried in Huge Volumes – Lots of Noise Implicit Semantics Inefficient Search Explicit Structure Explicit Semantics Efficient Search Focused Content Text , Chat, Email, Audio, Video Indices DBs KBs <ul><li>Identify Semantic Entities, Induce Structure </li></ul><ul><ul><li>Chats, Phone Calls, Transfers </li></ul></ul><ul><ul><li>People, Places, Org, Events </li></ul></ul><ul><ul><li>Times, Topics, Opinions, Relationships </li></ul></ul><ul><ul><li>Threats, Plots, etc. </li></ul></ul>UIMA - The Big Picture Structured Information
    36. 36. Evolution of Metadata Hierarchical Data Model Rigid Metadata Single Application Domain Specific Ontologies Flexible Metadata Cross Industry Integration Increased Business Value of Metadata Syntactic annotation of data: what this data represents Semantic annotations of data: what this data means Relational Data Model Rigid Metadata Integration Within Enterprise Extensible Data Model (XML) Flexible Metadata Integration Within Industry 1970 1990 2000 2010 1980
    37. 37. Integrated Metadata Enables Shared Understanding Data-driven analysis, reporting, monitoring, data rule & integration specification Data Analysts Business context mapped to information technology assets Subject Matter Experts, Data Stewards <ul><li>Simplify integration </li></ul>Metadata and data-driven data modeling and management Architects <ul><li>Increase trust and confidence in information </li></ul><ul><li>Increase compliance to standards </li></ul><ul><li>Facilitate change management & reuse </li></ul>Database application and transformation development Implementers Data Administrators Development Data Modeling Data Stewardship Metadata Server Source System Analysis Information Server Business Glossary Data Architect Information Analyzer DataStage QualityStage
    38. 38. How Does Metadata Make Information Services Different? <ul><li>Information Services provide a basis for trust in information – providing visibility into lineage, relationships to other systems, and business definition </li></ul>Other Data Sources Content Repositories ? WSDL WSDL Traditional Service Information Service <ul><li>Where does the information come from? </li></ul><ul><li>What happens to it along the way? </li></ul><ul><li>How does this fit into how the business defines things? </li></ul><ul><li>How do I know I’m using the right service? </li></ul>Get customer Get customer
    39. 39. Metadata-driven Design for Integration Web Service Build These Using These New Business Process New Integrated View Legacy and packaged apps Relational databases XML documents New DataFlow WBI II ETL 40% of IT budgets may be spent on integration 30% of people’s time is searching for relevant information 30% of development time is copy management <ul><li>Remember It </li></ul><ul><li>Remember relationships and dependencies </li></ul><ul><li>Find It </li></ul><ul><li>Find and visualize related information </li></ul><ul><li>Connect It </li></ul><ul><li>Generate the integration glue </li></ul>
    40. 40. Metadata Will Be Used to Facilitate Information and Application Integration <ul><li>Today – manual integration, custom hard-wired integration </li></ul><ul><li>Tomorrow – semi-automated integration by using tools and connectors </li></ul><ul><li>Future – automated integration through metadata standards and tools </li></ul><ul><ul><li>Dictionaries </li></ul></ul><ul><ul><li>Taxonomies </li></ul></ul><ul><ul><li>Ontologies </li></ul></ul>
    41. 41. Thank You Merci Grazie Gracias Obrigado Danke Japanese French Russian German Italian Spanish Brazilian Portuguese Arabic Traditional Chinese Simplified Chinese Hindi Tamil Thai Korean
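
Slide 9 says that applications combine XML and relational data in one store. The following is only a rough sketch of that idea, not DB2 code: it uses Python's standard `sqlite3` and `xml.etree.ElementTree` modules as stand-ins, keeps the XML document as plain text in a column, and filters on an XML element in application code. A hybrid engine such as the one described on the slides would presumably evaluate that kind of predicate inside the server via SQL/XML or XQuery. The `customer` table, its columns, and the sample rows are all hypothetical.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical schema: ordinary relational columns plus one column that
# holds an XML document as text (a stand-in for a native XML column).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT, info TEXT)")
conn.executemany(
    "INSERT INTO customer (id, name, info) VALUES (?, ?, ?)",
    [
        (1, "Acme", "<customer><city>Toronto</city><phone>555-0100</phone></customer>"),
        (2, "Globex", "<customer><city>San Jose</city><phone>555-0199</phone></customer>"),
    ],
)


def customers_in_city(connection, city):
    """Return names of customers whose XML document has a matching <city> element."""
    matches = []
    # Relational part: select the rows. XML part: parse and inspect each document.
    for name, info in connection.execute("SELECT name, info FROM customer ORDER BY id"):
        doc = ET.fromstring(info)
        if doc.findtext("city") == city:
            matches.append(name)
    return matches


print(customers_in_city(conn, "Toronto"))
```

Filtering in application code, as done here, means every document is fetched and parsed; the XML indexes and XML join methods listed on slide 10 are the kind of engine-side support that would avoid that cost.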
