SlideShare a Scribd company logo
1 of 12
Big Data Stories

Dirk deRoos
IBM World Wide Technical Sales, IBM Big Data Platform
dderoos@ca.ibm.com
   @Dirk_deRoos




                                                        © 2012 IBM Corporation
Harnessing the Largest Predictive Focus Group in the World


    Purpose
     • Understand public sentiment towards an event:
       movie trailers
     • Deeply understand the potential customer profile:
       gender, occupation, intent to watch
     • Alter marketing launch plans based on insight



    Background
     • 1.1 Billion Tweets analyzed
     • 5.7 Million blogs/forum posts
     • 3.5 million messages
     • Also: Facebook, Google+, Tumblr, Flickr




2                                                          © 2012 IBM Corporation
Media & Entertainment Social Media Analytics




3                                              © 2012 IBM Corporation
Asian telco reduces
    billing costs and improves
    customer satisfaction




    Real-time mediation and analysis of
     6B CDRs per day
    Data processing time reduced from
     12 hrs to 1 sec

    Hardware cost reduced to 1/8th
    Proactively address issues
      (e.g. dropped calls) impacting
      customer satisfaction
4                             © 2012 IBM Corporation
Pacific Northwest Smart Grid
Demonstration Project


Capabilities:

     Stream Computing – real-time
     control system

     Deep Analytics Appliance – analyze
     massive data sets




Demonstrates scalability from 100
to 500K homes while retaining 10
years’ historical data



Accommodates ad hoc analysis of price
fluctuation, energy consumption profiles,
risk, fraud detection, grid health, etc.


 5                                          © 2012 IBM Corporation
Watson’s advanced analytic capabilities can sort through the equivalent of
6
    200 MILLION pages of data to uncover an answer in 3 SECONDS.Corporation
                                                               © 2012 IBM
The Jeopardy! Challenge – Question Answering Solution


         Broad/Open               $200                        $1000
           Domain         If you're standing, it's       The first person
                        the direction you should      mentioned by name in
                                                       ‘The Man in the Iron
                          look to check out the
    Complex                    wainscoting.
                                                      Mask’ is this hero of a
                                                      previous book by the
    Language                                              same author.

            High
          Precision                                           $2000
                                  $600
                                                      Of the 4 countries in the
                         The colorful nickname
     Accurate                                         world that the U.S. does
                         for the country whose
                                                        not have diplomatic
    Confidence          elite footballers let their    relations with, the one
                           egos prevent them            that’s farthest north
                               from winning
             High
            Speed
7                                                                 © 2012 IBM Corporation
Brief History of IBM Watson
      IBM        Jeopardy!         Watson              Watson           Watson
    Research       Grand             for             for Financial      Industry
     Project     Challenge        Healthcare           Services         Solutions
    (2006 – )    (Feb 2011)       (Aug 2011 –)       (Mar 2012 – )       (2012 – )




                                                                      Cross-industry
                                                      Expansion        Applications

                                Commercialization
                Demonstration                       From inspiration and invention,
                                                            through innovation and
      R&D                                             industrialization, ending with
                                                           industry transformation.
8                                                                       © 2012 IBM Corporation
IBM Watson brings together a set of transformational
technologies to drive optimized outcomes

                                                  2 Generates and
                                                          evaluates
                                                    hypothesis for
1 Understands                                      better outcomes
    natural                                                         99%
    language and                                                    60%
                                                                    10%

    human speech




     3 Adapts and
       Learns from                        …built on a massively parallel
       user selections                     probabilistic evidence-based
       and responses                architecture optimized for POWER7


9                                                             © 2012 IBM Corporation
Healthcare industry is beset with some of the most complex
information challenges we collectively face


        Medical information
        is doubling every 5
        years, much of
        which is
        unstructured

        81% of physicians
        report spending 5
        hours or less per
        month reading
        medical journals

     “Medicine has become too complex (and only) about 20% of the knowledge clinicians use
     today is evidence-base.”   Steven Shapiro, Chief Medical & Scientific Officer, UPMC

10   Source: International Journal of Circumpolar Health,                               © 2012 IBM Corporation
     DoctorDirectory.com, Institute for Medicine"
IBM and Seton Health put
     Ready for Watson to work

     • Proactively target care management that
       reduces re-admission of congestive heart
       failure patients
     • Improve patient quality of life, reduce cost
       and mortality rates
     • Analyze unstructured data (e.g., physician
       notes) and provide an integrated view of
       clinical and operational information
     • Analysis revealed:
          – 18 top indicators determined
          – 2 key re-admission factors




11                                   © 2012 IBM Corporation
12   © 2012 IBM Corporation

More Related Content

Similar to VINT Symposium 2012: Recorded Future | Dirk de Roos (IBM)

CII Panel Discussion on Cloud Computing
CII Panel Discussion on Cloud ComputingCII Panel Discussion on Cloud Computing
CII Panel Discussion on Cloud Computing
Anand Deshpande
 
Summerschool+ 2012 Ibm Kees Donker future of learning
Summerschool+ 2012 Ibm Kees Donker future of learningSummerschool+ 2012 Ibm Kees Donker future of learning
Summerschool+ 2012 Ibm Kees Donker future of learning
Kennisnet
 
Constellation's Research Outlook For 2011
Constellation's Research Outlook For 2011Constellation's Research Outlook For 2011
Constellation's Research Outlook For 2011
R "Ray" Wang
 
Intel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick KnupfferIntel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick Knupffer
IntelAPAC
 
Insync 10 session jd edwards strategy and roadmap anz (a4) - final
Insync 10 session   jd edwards strategy and roadmap anz (a4) - finalInsync 10 session   jd edwards strategy and roadmap anz (a4) - final
Insync 10 session jd edwards strategy and roadmap anz (a4) - final
InSync Conference
 

Similar to VINT Symposium 2012: Recorded Future | Dirk de Roos (IBM) (20)

Healthcare as an industry in transition
Healthcare as an industry in transitionHealthcare as an industry in transition
Healthcare as an industry in transition
 
CII Panel Discussion on Cloud Computing
CII Panel Discussion on Cloud ComputingCII Panel Discussion on Cloud Computing
CII Panel Discussion on Cloud Computing
 
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
 
IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!
IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!
IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!
 
Scenari evolutivi nello snellimento dei sistemi informativi
Scenari evolutivi nello snellimento dei sistemi informativiScenari evolutivi nello snellimento dei sistemi informativi
Scenari evolutivi nello snellimento dei sistemi informativi
 
Opening Keynote: Putting IBM Watson to Work
Opening Keynote: Putting IBM Watson to WorkOpening Keynote: Putting IBM Watson to Work
Opening Keynote: Putting IBM Watson to Work
 
Smarter planet and mega trends presentation 2012
Smarter planet and mega trends presentation 2012Smarter planet and mega trends presentation 2012
Smarter planet and mega trends presentation 2012
 
Accelerate Return on Data
Accelerate Return on DataAccelerate Return on Data
Accelerate Return on Data
 
Summerschool+ 2012 Ibm Kees Donker future of learning
Summerschool+ 2012 Ibm Kees Donker future of learningSummerschool+ 2012 Ibm Kees Donker future of learning
Summerschool+ 2012 Ibm Kees Donker future of learning
 
Constellation's Research Outlook For 2011
Constellation's Research Outlook For 2011Constellation's Research Outlook For 2011
Constellation's Research Outlook For 2011
 
What is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use CasesWhat is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use Cases
 
Intel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick KnupfferIntel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick Knupffer
 
Vision et Stratégie d'Hitachi Data Systems Randy DEMONT, Executive Vice Presi...
Vision et Stratégie d'Hitachi Data Systems Randy DEMONT, Executive Vice Presi...Vision et Stratégie d'Hitachi Data Systems Randy DEMONT, Executive Vice Presi...
Vision et Stratégie d'Hitachi Data Systems Randy DEMONT, Executive Vice Presi...
 
DAMA Presentation
DAMA PresentationDAMA Presentation
DAMA Presentation
 
The Network of Truth
The Network of TruthThe Network of Truth
The Network of Truth
 
How companies are achieving competitive advantage through analytics
How companies are achieving competitive advantage through analyticsHow companies are achieving competitive advantage through analytics
How companies are achieving competitive advantage through analytics
 
Insync 10 session jd edwards strategy and roadmap anz (a4) - final
Insync 10 session   jd edwards strategy and roadmap anz (a4) - finalInsync 10 session   jd edwards strategy and roadmap anz (a4) - final
Insync 10 session jd edwards strategy and roadmap anz (a4) - final
 
Latest news phoenix
Latest news phoenixLatest news phoenix
Latest news phoenix
 
IBM CEC Big Data 2011 06-11 final
IBM CEC Big Data 2011 06-11 finalIBM CEC Big Data 2011 06-11 final
IBM CEC Big Data 2011 06-11 final
 
Carlo Colicchio: Big Data for business
Carlo Colicchio: Big Data for businessCarlo Colicchio: Big Data for business
Carlo Colicchio: Big Data for business
 

More from VINTlabs | The Sogeti Trendlab

Design to Disrupt: New Digital Competition - Sogeti VINT
Design to Disrupt: New Digital Competition - Sogeti VINTDesign to Disrupt: New Digital Competition - Sogeti VINT
Design to Disrupt: New Digital Competition - Sogeti VINT
VINTlabs | The Sogeti Trendlab
 
SMACT and the City: How will the Internet of Things Change our Cities?
SMACT and the City: How will the Internet of Things Change our Cities?SMACT and the City: How will the Internet of Things Change our Cities?
SMACT and the City: How will the Internet of Things Change our Cities?
VINTlabs | The Sogeti Trendlab
 
Exponential Innovation | Federico Pistono @ VINT symposium THINGS 2013
Exponential Innovation | Federico Pistono @ VINT symposium THINGS 2013Exponential Innovation | Federico Pistono @ VINT symposium THINGS 2013
Exponential Innovation | Federico Pistono @ VINT symposium THINGS 2013
VINTlabs | The Sogeti Trendlab
 
Internet of Scenarios | Sander Duivestein @ VINT symposium THINGS 2013
Internet of Scenarios | Sander Duivestein @ VINT symposium THINGS 2013Internet of Scenarios | Sander Duivestein @ VINT symposium THINGS 2013
Internet of Scenarios | Sander Duivestein @ VINT symposium THINGS 2013
VINTlabs | The Sogeti Trendlab
 
Digital Innovation: The next step in the digital journey | Alberto Prado @ VI...
Digital Innovation: The next step in the digital journey | Alberto Prado @ VI...Digital Innovation: The next step in the digital journey | Alberto Prado @ VI...
Digital Innovation: The next step in the digital journey | Alberto Prado @ VI...
VINTlabs | The Sogeti Trendlab
 
Tom tom - Location services and privacy | Simon Hania @ VINT symposium THINGS...
Tom tom - Location services and privacy | Simon Hania @ VINT symposium THINGS...Tom tom - Location services and privacy | Simon Hania @ VINT symposium THINGS...
Tom tom - Location services and privacy | Simon Hania @ VINT symposium THINGS...
VINTlabs | The Sogeti Trendlab
 

More from VINTlabs | The Sogeti Trendlab (20)

Design to Disrupt: New Digital Competition - Sogeti VINT
Design to Disrupt: New Digital Competition - Sogeti VINTDesign to Disrupt: New Digital Competition - Sogeti VINT
Design to Disrupt: New Digital Competition - Sogeti VINT
 
Design to Disrupt: An Executive Introduction - Sogeti VINT
Design to Disrupt: An Executive Introduction - Sogeti VINTDesign to Disrupt: An Executive Introduction - Sogeti VINT
Design to Disrupt: An Executive Introduction - Sogeti VINT
 
Design2Disrupt: Patrick Naef (Emirates) - A 30 year old start up
Design2Disrupt: Patrick Naef (Emirates) -  A 30 year old start upDesign2Disrupt: Patrick Naef (Emirates) -  A 30 year old start up
Design2Disrupt: Patrick Naef (Emirates) - A 30 year old start up
 
Design2Disrupt: Mark Plakias (Orange Institute) - Unicorns, Startups & Giants
Design2Disrupt: Mark Plakias (Orange Institute) -   Unicorns, Startups & GiantsDesign2Disrupt: Mark Plakias (Orange Institute) -   Unicorns, Startups & Giants
Design2Disrupt: Mark Plakias (Orange Institute) - Unicorns, Startups & Giants
 
Design2Disrupt: Patrick Kalaher (Frog) - A Dirty New World
Design2Disrupt: Patrick Kalaher (Frog) - A Dirty New WorldDesign2Disrupt: Patrick Kalaher (Frog) - A Dirty New World
Design2Disrupt: Patrick Kalaher (Frog) - A Dirty New World
 
Design2Disrupt Summit: Tim O'brien - The New Rule of IT
Design2Disrupt Summit: Tim O'brien -  The New Rule of ITDesign2Disrupt Summit: Tim O'brien -  The New Rule of IT
Design2Disrupt Summit: Tim O'brien - The New Rule of IT
 
Design2Disrupt Summit: Ted Schadler - The Mobile Mind Shift
Design2Disrupt Summit: Ted Schadler - The Mobile Mind ShiftDesign2Disrupt Summit: Ted Schadler - The Mobile Mind Shift
Design2Disrupt Summit: Ted Schadler - The Mobile Mind Shift
 
Design2Disrupt: Hannu Kauppinen - The Programmable World
Design2Disrupt: Hannu Kauppinen - The Programmable WorldDesign2Disrupt: Hannu Kauppinen - The Programmable World
Design2Disrupt: Hannu Kauppinen - The Programmable World
 
Design2Disrupt: Menno van Doorn - Setting the Stage
Design2Disrupt: Menno van Doorn - Setting the StageDesign2Disrupt: Menno van Doorn - Setting the Stage
Design2Disrupt: Menno van Doorn - Setting the Stage
 
Design2Disrupt Summit: Joseph Pine - Destructive Recreation
Design2Disrupt Summit: Joseph Pine  - Destructive RecreationDesign2Disrupt Summit: Joseph Pine  - Destructive Recreation
Design2Disrupt Summit: Joseph Pine - Destructive Recreation
 
Design2Disrupt Summit: John Hogan - The New Epoch of Computing
Design2Disrupt Summit: John Hogan - The New Epoch of ComputingDesign2Disrupt Summit: John Hogan - The New Epoch of Computing
Design2Disrupt Summit: John Hogan - The New Epoch of Computing
 
Design2Disrupt Summit: Didier Bonnet - Leading Digital
Design2Disrupt Summit: Didier Bonnet - Leading DigitalDesign2Disrupt Summit: Didier Bonnet - Leading Digital
Design2Disrupt Summit: Didier Bonnet - Leading Digital
 
SMACT and the City: How will the Internet of Things Change our Cities?
SMACT and the City: How will the Internet of Things Change our Cities?SMACT and the City: How will the Internet of Things Change our Cities?
SMACT and the City: How will the Internet of Things Change our Cities?
 
The Fourth Industrial Revolution – Internet of Things to Tighten the Link bet...
The Fourth Industrial Revolution – Internet of Things to Tighten the Link bet...The Fourth Industrial Revolution – Internet of Things to Tighten the Link bet...
The Fourth Industrial Revolution – Internet of Things to Tighten the Link bet...
 
Sander Duivestein @ VINT Symposium 2014
Sander Duivestein @ VINT Symposium 2014Sander Duivestein @ VINT Symposium 2014
Sander Duivestein @ VINT Symposium 2014
 
Empathic Computing: From Wearables to Biohacking
Empathic Computing: From Wearables to BiohackingEmpathic Computing: From Wearables to Biohacking
Empathic Computing: From Wearables to Biohacking
 
Exponential Innovation | Federico Pistono @ VINT symposium THINGS 2013
Exponential Innovation | Federico Pistono @ VINT symposium THINGS 2013Exponential Innovation | Federico Pistono @ VINT symposium THINGS 2013
Exponential Innovation | Federico Pistono @ VINT symposium THINGS 2013
 
Internet of Scenarios | Sander Duivestein @ VINT symposium THINGS 2013
Internet of Scenarios | Sander Duivestein @ VINT symposium THINGS 2013Internet of Scenarios | Sander Duivestein @ VINT symposium THINGS 2013
Internet of Scenarios | Sander Duivestein @ VINT symposium THINGS 2013
 
Digital Innovation: The next step in the digital journey | Alberto Prado @ VI...
Digital Innovation: The next step in the digital journey | Alberto Prado @ VI...Digital Innovation: The next step in the digital journey | Alberto Prado @ VI...
Digital Innovation: The next step in the digital journey | Alberto Prado @ VI...
 
Tom tom - Location services and privacy | Simon Hania @ VINT symposium THINGS...
Tom tom - Location services and privacy | Simon Hania @ VINT symposium THINGS...Tom tom - Location services and privacy | Simon Hania @ VINT symposium THINGS...
Tom tom - Location services and privacy | Simon Hania @ VINT symposium THINGS...
 

Recently uploaded

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 

VINT Symposium 2012: Recorded Future | Dirk de Roos (IBM)

  • 1. Big Data Stories Dirk deRoos IBM World Wide Technical Sales, IBM Big Data Platform dderoos@ca.ibm.com @Dirk_deRoos © 2012 IBM Corporation
  • 2. Harnessing the Largest Predictive Focus Group in the World Purpose • Understand public sentiment towards an event: movie trailers • Deeply understand the potential customer profile: gender, occupation, intent to watch • Alter marketing launch plans based on insight Background • 1.1 Billion Tweets analyzed • 5.7 Million blogs/forum posts • 3.5 million messages • Also: Facebook, Google+, Tumblr, Flickr 2 © 2012 IBM Corporation
  • 3. Media & Entertainment Social Media Analytics 3 © 2012 IBM Corporation
  • 4. Asian telco reduces billing costs and improves customer satisfaction Real-time mediation and analysis of 6B CDRs per day Data processing time reduced from 12 hrs to 1 sec Hardware cost reduced to 1/8th Proactively address issues (e.g. dropped calls) impacting customer satisfaction 4 © 2012 IBM Corporation
  • 5. Pacific Northwest Smart Grid Demonstration Project Capabilities: Stream Computing – real-time control system Deep Analytics Appliance – analyze massive data sets Demonstrates scalability from 100 to 500K homes while retaining 10 years’ historical data Accommodates ad hoc analysis of price fluctuation, energy consumption profiles, risk, fraud detection, grid health, etc. 5 © 2012 IBM Corporation
  • 6. Watson’s advanced analytic capabilities can sort through the equivalent of 6 200 MILLION pages of data to uncover an answer in 3 SECONDS.Corporation © 2012 IBM
  • 7. The Jeopardy! Challenge – Question Answering Solution Broad/Open $200 $1000 Domain If you're standing, it's The first person the direction you should mentioned by name in ‘The Man in the Iron look to check out the Complex wainscoting. Mask’ is this hero of a previous book by the Language same author. High Precision $2000 $600 Of the 4 countries in the The colorful nickname Accurate world that the U.S. does for the country whose not have diplomatic Confidence elite footballers let their relations with, the one egos prevent them that’s farthest north from winning High Speed 7 © 2012 IBM Corporation
  • 8. Brief History of IBM Watson IBM Jeopardy! Watson Watson Watson Research Grand for for Financial Industry Project Challenge Healthcare Services Solutions (2006 – ) (Feb 2011) (Aug 2011 –) (Mar 2012 – ) (2012 – ) Cross-industry Expansion Applications Commercialization Demonstration From inspiration and invention, through innovation and R&D industrialization, ending with industry transformation. 8 © 2012 IBM Corporation
  • 9. IBM Watson brings together a set of transformational technologies to drive optimized outcomes 2 Generates and evaluates hypothesis for 1 Understands better outcomes natural 99% language and 60% 10% human speech 3 Adapts and Learns from …built on a massively parallel user selections probabilistic evidence-based and responses architecture optimized for POWER7 9 © 2012 IBM Corporation
  • 10. Healthcare industry is beset with some of the most complex information challenges we collectively face Medical information is doubling every 5 years, much of which is unstructured 81% of physicians report spending 5 hours or less per month reading medical journals “Medicine has become too complex (and only) about 20% of the knowledge clinicians use today is evidence-base.” Steven Shapiro, Chief Medical & Scientific Officer, UPMC 10 Source: International Journal of Circumpolar Health, © 2012 IBM Corporation DoctorDirectory.com, Institute for Medicine"
  • 11. IBM and Seton Health put Ready for Watson to work • Proactively target care management that reduces re-admission of congestive heart failure patients • Improve patient quality of life, reduce cost and mortality rates • Analyze unstructured data (e.g., physician notes) and provide an integrated view of clinical and operational information • Analysis revealed: – 18 top indicators determined – 2 key re-admission factors 11 © 2012 IBM Corporation
  • 12. 12 © 2012 IBM Corporation

Editor's Notes

  1. We have been working with an Indian telco client for some time now to help reduce their billing costs and improve customer satisfaction. Challenge: Call Detail Record (CDR) processing within their data warehouse was sub-optimal, Could not achieve real time billing which required handling billions of CDRs per day and de-duplication against 15 days worth of CDR data Unable to support for future IT and Business with real-time analytics Solution: Single platform for mediation and real time analytics reduces IT complexity The PMML standard is used to import data mining models from InfoSphere Warehouse. Offloaded the CDR processing to InfoSphere Streams resulting in enhanced data warehouse performance and improved TCO Each incoming CDR is analyzed using these data mining models, allowing immediate detection of events (ex: dropped calls) that might create customer satisfaction issues. Business Benefit: Data now processed at the speed of Business - from 12 hours to 1 second HW Costs reduced to 1/8th Support for future growth without the need to re-architect, more data, more analysis Platform in-place for real-time analytics to drive revenue
  2. IBM has been working with one of the leading non-profit research institutes leading a regional project to prove the viability and benefits of smart grid technology and test the concept of demand-based electrical power pricing Background: The project is the largest initiative of its kind in the US and is designed to test and quantify smart grid costs and benefits with over 60,000 consumers in five states - Washington, Oregon, Idaho, Montana and Wyoming. The smart grid technique uses an incentive and a feedback signal to help coordinate smart grid resources. The two-way communication of this information - from power source to destination - allows intelligent devices and consumers to make smart decisions about using this energy. The requirements of the project call for a robust infrastructure that facilitates two-way data flow and computing power capable of continuously processing petabytes of data. Solution: IBM is building the infrastructure to disseminate the project ’ s transactive incentive signal and interlace it with the participants ’ responsive assets. The solution consists of: - IBM streams computing software running on IBM x86 servers to allow for the effective streaming of data - IBM data warehouse appliance provide to analyze and understand the project data (up to 10 petabytes) in minutes Benefits: • Enabled a town to avoid a power outage by using a two-way advanced meter system to shut off home water heaters during peak periods, reducing strain on an unreliable underwater cable • Empowers consumers to make educated choices about how and when to use electricity, and at what price • Increases grid efficiency and reliability through system self-monitoring and feedback
  3. Most of you know of Watson, our computing system designed to compete on the Jeopardy game show. Watson represents a breakthrough in terms of volume of information stored, and the ability to access it quickly (answering natural language questions). I think Watson is impressive, because there are many commercial uses for this technology – and the technology exists today! The game Jeopardy provides the ultimate challenge for Watson because the game’s clues involve analyzing subtle meanings, irony, riddles, and other complexities in which humans excel and computers traditionally do not. If you think about Deep Blue, the 1997 IBM machine that defeated the reigning world chess champion, Watson is yet another major leap in capability of IT systems to identify patterns, gain critical insight and enhance decision-making despite daunting complexities. While Deep Blue was amazing, it was an achievement of the application of compute power to a computationally well-defined and well-bound game: Chess. Watson, on the other hand, faces a challenge that is open-ended, defies the well-bounded mathematical formulation of a game like Chess. Watson has to operate in the near limitless, ambiguous, and high contextual domain of human language and knowledge.   Watson answers a Grand Challenge: Can IBM design a computing system that rivals a human’s ability to answer questions posed in natural language by interpreting meaning and context and then retrieving, analyzing and understanding vast amounts of information in real-time? IBM Watson is a breakthrough in analytic innovation, proving that it is possible to harness vast amounts of information and rival a human’s ability to answer questions posted in natural language in real-time. But it doesn't matter how good the machine is if we don’t have good information to feed it. We live in a time where a computer can compete against humans at answering questions in plain English, based on storing, retrieving, analyzing and understanding vast amounts of information at real-time speeds. These same capabilities can enable you to improve and optimize your business, too. IBM just showed the value of putting that information to work by creating a computing system capable of competing on Jeopardy Well there ’ s a lot of technology that went into Watson – and a lot of Big Data technology in there as well. Now take a moment and think about how this iconic game show is played: you have to answer a question within three seconds. The technology used to analyze and return answers in Watson was a pre-cursor to the Streams technology, in fact, Streams was invented because that technology used in Watson wasn’t fast enough for some of the in-motion requirements needed by companies today. Jeopardy questions are not straight forward, they have pun and tricks to make them harder – so some of our text analytic technology with natural language processing, which is part of the IBM Big Data platform, is in there too (that ’ s yet another MAJOR DIFFERENTIATOR for IBM in Big Data: our Text   Analytic Toolkit, which you will hear more about later in this presentation). It wasn’t always smooth sailing for Watson, the big breakthrough came when they started to use machine learning (ML), and the IBM Big Data platform will further differentiate itself from the field in 2012 when a corresponding toolkit came to market just like the text analytics toolkit. Finally, Watson had to have access to a heck of a lot of data – and Big Data technologies were used to load and index over 200 million pages of data; Watson had everything from encyclopedias, to the bible, to the world famous music and movie databases, etc.   All these technologies mentioned in the previous paragraph had to work together as well. So IBM clearly has some inflection point understanding of these technologies and how to get them working together. In the case of the text analytics and machine learning – well we have to make that easier to consume because you don ’ t have the world ’ s largest commercial research organization for math at your fingertips. So we need to build tooling, and optimization, and accelerators around that and put these technologies inside consumable toolkits: which are we doing now.
  4. In order to know we are making progress on scientific problems like open-domain QA well-defined challenges help demonstrate we can solve concrete & difficult tasks. As you might know Jeopardy! Is a long-standing, well-regarded and highly challenging Television quiz show in the US that demands human contestants to quickly understand and answer richly expressed natural language questions over a staggering array of topics. The Jeopardy! Challenge uniquely provides a palpable, compelling and notable way to drive the technology of Question Answering along key dimensions If you are familiar with the quiz show it asks an I incredibly broad range of questions over a huge variety of topics. In a single round there is a grid of 6 Categories and for each category 5 rows with increasing $ values. Once a cell is chosen by 1 of three players, A question, or what is often called a Clue is revealed. Here you see some example questions. <read some of the questions>   Jeopardy uses complex and often subtle language to describe what is being asked.   To win you have to be extraordinarily precise. You must deliver the exact answer – no more and no less – it is not good enough for it be somewhere in the top 2, 10 or 20 documents – you must know it exactly and get it in first place – otherwise no credit – in fact you loose points.   You must demonstrate Accurate Confidences -- That is -- you must know what you know – if you “buzz –in” and then get it wrong you lose the $$ value of the question.   And you have to do this all very quickly – deeply analyze huge volumes of content, consider many possible answers, compute your confidence and buzz in – all in just seconds. As we shall see compete with human champions at this game represents a Grand Challenge in Automatic Open-Domain Question Answering. <STOP> <NEXT SLIDE>
  5. 01/18/12 IOD2011 4/9/12 GS302_ManojSaxena_v7
  6. Main point: At the core of what makes Watson different are three powerful technologies - natural language, hypothesis generation, and evidence based learning. But Watson is more than the sum of its individual parts. Watson is about bringing these capabilities together in a way that ’s never been done before resulting in a fundamental change in the way businesses look at quickly solving problems Further speaking points: . Looking at these one by one, understanding natural language and the way we speak breaks down the communication barrier that has stood in the way between people and their machines for so long. Hypothesis generation bypasses the historic deterministic way that computers function and recognizes that there are various probabilities of various outcomes rather than a single definitive ‘right’ response. And adaptation and learning helps Watson continuously improve in the same way that humans learn….it keeps track of which of its selections were selected by users and which responses got positive feedback thus improving future response generation Additional information : The result is a machine that functions along side of us as an assistant rather than something we wrestle with to get an adequate outcome
  7. Challenge Reduce the occurrence of high cost Congestive Heart Failure (CHF) readmissions by proactively identifying patients likely to be readmitted on an emergent basis. Solution Seton Healthcare is a not-for-profit organization, the Seton Family is the leading provider of healthcare services in Central Texas, serving an 11-county population of 1.9 million Target and understand high-risk CHF patients for care management programs using natural language processing. Used predictive models that have demonstrated high positive predictive value against extracted structured and unstructured data Results Proactively targeted care management which will reduce re-admission of CHF patients. Identified patients likely for re-admission and introduced early interventions which will reduce cost, mortality rates, and improve patient quality of life. Background Seton Healthcare is a not-for-profit organization, the Seton Family is the leading provider of healthcare services in Central Texas, serving an 11-county population of 1.8 million. Seton Healthcare identified an opportunity to significantly reduce the occurrence of high cost CHF readmissions by proactively identifying patients likely to be readmitted on an emergent basis. Objectives Seton will partner with IBM to implement content and predictive analytics to identify patients who should receive proactive medical case management and intervention. The expectation is that Seton can reduce the occurrence of costly readmissions, mortality rates and improve the quality of life for these patients. Project Description CHF prevention and reduced re-admission is a main focuses of Seton’s Clinical Design Center. The key clinical, financial, and contextual data for CHF patients span many applications and are stored in both structured and unstructured content. To achieve the Design Center objectives, the following capabilities are needed: Integrate these data into longitudinal patient records Identify important information in the unstructured data Develop predictive models that show Likelihood of readmission Likelihood of ambulatory-sensitive ED visits and admissions Forecasted next year costs Display predictive model results along with aggregated patient record data in an visual, easily-navigable system IOD2011_BA KEYNOTEIBM IOD 2011 06/19/12 D1_BA Keynote_v4