SlideShare a Scribd company logo
1 of 6
Download to read offline
IBM Systems and Technology                        February 2011
An IBM White Paper




Watson – A System
Designed for Answers
The future of workload optimized systems design
2   Watson – A System Designed for Answers




Executive summary
Over the last century, IBM has achieved numerous scientific
breakthroughs through its commitment to research and its tradi-
tion of Grand Challenges. These Grand Challenges—such as
Deep Blue®, which was designed to rival world chess champion
Gary Kasparov—work to push science in ways that weren’t
thought possible before. Watson is the latest IBM Research
Grand Challenge, designed to further the science of natural
language processing through advances in question and answer
technology.

Watson is a workload optimized system based on IBM DeepQA
architecture running on a cluster of IBM® POWER7®
processor-based servers. After four years of intense research and
development by a team of IBM researchers, Watson competed
on Jeopardy! in February 2011, performing at the level of human     Today, with companies increasingly capturing critical business
experts in terms of precision, confidence and speed against two      information in natural language documentation, there is growing
of the best-known and most successful Jeopardy! Champions,          interest in workload optimized systems that deeply analyze the
Ken Jennings and Brad Rutter. This white paper explains             content of natural language questions to answer those questions
Watson’s workload optimized system design, how it’s emblematic      with precision. Advances in question answering (QA) technology
of the future of systems design, and why this represents a new      will increasingly help support professionals in critical and timely
computing paradigm.                                                 decision making in areas such as health care, business intelli-
                                                                    gence, knowledge discovery, enterprise knowledge management,
Jeopardy! The IBM challenge                                         and customer support.
In 1997, Deep Blue, the computer chess-playing system devel-
oped by IBM Research, captured worldwide attention by com-          With QA in mind, IBM settled on a challenge to build a com-
peting successfully against world chess champion Gary Kasparov.     puter system called “Watson” (after Thomas J. Watson, the
It was the culmination of a grand challenge to advance the sci-     founder of IBM), which could compete at the human champion
ence of computing in a way that created great popular interest.     level in real time on the American TV quiz show Jeopardy! The
                                                                    program, which has been broadcast in the United States for
                                                                    more than 25 years, pits three human contestants against one
                                                                    another to answer rich natural language questions over a broad
IBM Systems and Technology   3




range of topics, with penalties for wrong answers. In this three-   Watson competed against two of the most well-known and suc-
person competition, confidence, precision and answering speed        cessful Jeopardy! champions—Ken Jennings and Brad Rutter—in
are of critical importance, as contestants usually come up with     a two-match contest aired over three consecutive nights begin-
their answers in the few seconds it takes for the host to read a    ning on February 14, 2011.
clue. To compete in this game at human-champion levels, a
computer system would need to answer roughly 70 percent of          IBM DeepQA
the questions asked with greater than 80 percent precision in       DeepQA is a massively parallel probabilistic evidence-based
three seconds or less.                                              architecture. For the Jeopardy! Challenge, more than 100 differ-
                                                                    ent techniques are used to analyze natural language, identify
Watson represents an impressive leap forward in systems design      sources, find and generate hypotheses, find and score evidence,
and analytics. It runs IBM’s DeepQA technology, a new kind of       and merge and rank hypotheses. Far more important than any
analytics capability that can perform thousands of simultaneous     particular technique is the way all these techniques are combined
tasks in seconds to provide precise answers to questions.           in DeepQA such that overlapping approaches can bring their
Powered by IBM POWER7 processor technology, Watson is an            strengths to bear and contribute to improvements in accuracy,
example of the complex analytics workloads that are becoming        confidence, or speed.
increasingly common and critical to business success and
competitiveness in today’s data-intensive environment.
4   Watson – A System Designed for Answers




DeepQA is an architecture with an accompanying methodology,          of Apache UIMA, enables the scale-out of UIMA applications
but it is not specific to the Jeopardy! Challenge. IBM has begun      using asynchronous messaging. Watson uses UIMA-AS to scale
adapting it to different business applications and additional        out across 2,880 POWER7 cores in a cluster of 90 IBM Power®
exploratory challenge problems including medicine, enterprise        750 servers. UIMA_AS manages all of the inter-process commu-
search and gaming.                                                   nication using the open JMS standard. The UIMA-AS deploy-
                                                                     ment on POWER7 enabled Watson to deliver answers in one to
The overarching principles in DeepQA are:                            six seconds.

1. Massive parallelism: Exploit massive parallelism in the con-      Watson has roughly 200 million pages of natural language
   sideration of multiple interpretations and hypotheses.            content (equivalent to reading 1 million books). Watson uses the
2. Many experts: Facilitate the integration, application and con-    Apache Hadoop framework to facilitate preprocessing the large
   textual evaluation of a wide range of loosely coupled proba-      volume of data in order to create in-memory datasets used at
   bilistic question and content analytics.                          run-time. Watson’s DeepQA UIMA annotators were deployed
3. Pervasive confidence estimation: No single component               as mappers in the Hadoop map-reduce framework, which dis-
   commits to an answer; all components produce features and         tributed them across processors in the cluster. Hadoop con-
   associated confidences, scoring different question and content     tributes to optimal CPU utilization and also provides convenient
   interpretations. An underlying confidence processing substrate     tools for deploying, managing, and monitoring the data
   learns how to stack and combine the scores.                       analysis process.
4. Integrate shallow and deep knowledge: Balance the use of
   strict semantics and shallow semantics, leveraging many           Harnessing POWER7
   loosely formed ontologies.                                        Watson harnesses the massive parallel processing performance
                                                                     of its POWER7 processors to execute its thousands of
Speed and scale-out                                                  DeepQA tasks simultaneously on individual processor cores.
DeepQA is developed using Apache UIMA, a framework                   Each of Watson’s 90 clustered IBM Power 750 servers features
implementation of the Unstructured Information Management            32 POWER7 cores running at 3.55 GHz. Running the Linux®
Architecture. UIMA was designed to support interoperability          operating system, the servers are housed in 10 racks along with
and scale-out of text and multimodal analysis applications. All of   associated I/O nodes and communications hubs. The system has
the components in DeepQA are implemented as UIMA annota-             a combined total of 16 Terabytes of memory and can operate at
tors. These are components that analyze text and produce anno-       over 80 Teraflops (trillions of operations per second).
tations or assertions about the text. Over time Watson has
evolved so that the system now has hundred of components.            With its innovative, eight-core processor design, POWER7 is
UIMA facilitated rapid component integration, testing and            ideally suited for massively parallel processing of Watson’s
evaluation.                                                          analytics algorithms. POWER7 also features 500 gigabytes of
                                                                     on-chip communications bandwidth, contributing to exceptional
Early implementations of Watson ran on a single processor,           efficiency of both memory and processor utilization. And since
which required two hours to answer a single question. The            each server packs 32 high performance POWER7 cores with up
DeepQA computation is embarrassing parallel, however, and so         to 512 GB of memory, the Power 750 makes an ideal platform
it can be divided into a number of independent parts, each of        for Watson’s processor and memory-hungry Java processes.
which can be executed by a separate processor. UIMA-AS, part
IBM Systems and Technology   5




Designing Watson on commercially available Power 750 servers
was a deliberate choice to ensure more rapid adoption of opti-
mized systems in industries such as healthcare and financial serv-
ices. That goal was a fundamental difference between Watson
and Deep Blue, which was a highly customized supercomputer.
Deep Blue was based on an earlier generation of Power proces-
sor technology, featuring a 30 node RS/6000 SP system, with
each node containing a single 120 MHz POWER2 processor.
But in addition to the regular POWER2 processors, Deep Blue’s
performance was enhanced with 480 special purpose chess
processor chips.

The same Power 750 server used by Watson is already
deployed today by thousands of organizations in optimized
systems that provide for both complex analytics and transaction
processing. Rice University in Houston, Texas, for example, uses
IBM Power 750 systems to accelerate the understanding of the         human-expert levels in terms of precision, confidence and speed.
molecular basis of cancer through the application of genome          The project has advanced the fields of unstructured data
analysis technologies. POWER7 systems have given Rice more           analytics, natural language processing, and the design of work-
flexibility and efficiency, enabling them to pursue a broader         load optimized systems. Beyond Jeopardy!, the technology
range of research challenges on a single system than was possible    behind Watson can be adapted to solve business and societal
before. And GHY International, a customs brokerage firm in            problems—for example, diagnosing disease, handling online
Canada, migrated to a new Power 750 running Power AIX®,              technical support questions, and parsing vast tracts of legal
Power i and Power Linux to better support their clients’             documents—and to drive progress across industries.
increased engagement in international trading. With
PowerVM™ virtualization, GHY is now able to deploy new               Watson’s ability to understand the meaning and context of
capabilities in as little as five minutes to support their clients’   human language, and rapidly process information to find precise
changing needs.                                                      answers to complex questions, holds enormous potential to
                                                                     transform how computers can help people accomplish tasks in
A system designed for answers                                        business and their personal lives.
After four years of intense research and development by a team
of IBM researchers, Watson has demonstrated its ability to
compete on Jeopardy! against champion players, performing at
For more information
To learn more about Watson, POWER7 and workload opti-
mized systems, please contact your IBM marketing representa-
tive or IBM Business Partner, or visit the following websites:
●   ibm.com/systems/power/advantages/watson
●   ibm.com/systems/power                                        © Copyright IBM Corporation 2011

                                                                 IBM Systems and Technology Group
                                                                 Route 100
                                                                 Somers, NY 10589

                                                                 Produced in the United States of America
                                                                 February 2011
                                                                 All Rights Reserved

                                                                 IBM, the IBM logo, ibm.com, Power, POWER7 and DEEP BLUE are
                                                                 trademarks of International Business Machines Corporation in the
                                                                 United States, other countries or both. If these and other IBM trademarked
                                                                 terms are marked on their first occurrence in this information with a
                                                                 trademark symbol (® or ™), these symbols indicate U.S. registered or
                                                                 common law trademarks owned by IBM at the time this information was
                                                                 published. Such trademarks may also be registered or common law
                                                                 trademarks in other countries. A current list of IBM trademarks is
                                                                 available on the web at “Copyright and trademark information” at
                                                                 ibm.com/legal/copytrade.shtml

                                                                 Other company, product or service names may be trademarks or service
                                                                 marks of others.


                                                                          Please Recycle




                                                                                                                     POW03061-USEN-00

More Related Content

What's hot

Characterization of hadoop jobs using unsupervised learning
Characterization of hadoop jobs using unsupervised learningCharacterization of hadoop jobs using unsupervised learning
Characterization of hadoop jobs using unsupervised learningJoão Gabriel Lima
 
Machine programming
Machine programmingMachine programming
Machine programmingDESMOND YUEN
 
Cache and consistency in nosql
Cache and consistency in nosqlCache and consistency in nosql
Cache and consistency in nosqlJoão Gabriel Lima
 
MongoDB on Windows Azure
MongoDB on Windows AzureMongoDB on Windows Azure
MongoDB on Windows AzureJeremy Taylor
 
MongoDB on Windows Azure
MongoDB on Windows AzureMongoDB on Windows Azure
MongoDB on Windows AzureJeremy Taylor
 
A SURVEY OF NEURAL NETWORK HARDWARE ACCELERATORS IN MACHINE LEARNING
A SURVEY OF NEURAL NETWORK HARDWARE ACCELERATORS IN MACHINE LEARNING A SURVEY OF NEURAL NETWORK HARDWARE ACCELERATORS IN MACHINE LEARNING
A SURVEY OF NEURAL NETWORK HARDWARE ACCELERATORS IN MACHINE LEARNING mlaij
 
A Survey of Machine Learning Methods Applied to Computer ...
A Survey of Machine Learning Methods Applied to Computer ...A Survey of Machine Learning Methods Applied to Computer ...
A Survey of Machine Learning Methods Applied to Computer ...butest
 
Attaching cloud storage to a campus grid using parrot, chirp, and hadoop
Attaching cloud storage to a campus grid using parrot, chirp, and hadoopAttaching cloud storage to a campus grid using parrot, chirp, and hadoop
Attaching cloud storage to a campus grid using parrot, chirp, and hadoopJoão Gabriel Lima
 
Near Data Computing Architectures: Opportunities and Challenges for Apache Spark
Near Data Computing Architectures: Opportunities and Challenges for Apache SparkNear Data Computing Architectures: Opportunities and Challenges for Apache Spark
Near Data Computing Architectures: Opportunities and Challenges for Apache SparkAhsan Javed Awan
 
Presentation
PresentationPresentation
Presentationbutest
 
Increasing Throughput per Node for Content Delivery Networks
Increasing Throughput per Node for Content Delivery NetworksIncreasing Throughput per Node for Content Delivery Networks
Increasing Throughput per Node for Content Delivery NetworksDESMOND YUEN
 
How Apollo Group Evaluted MongoDB
How Apollo Group Evaluted MongoDBHow Apollo Group Evaluted MongoDB
How Apollo Group Evaluted MongoDBJeremy Taylor
 
Cag Corporate Dossier May 2012
Cag Corporate Dossier May 2012Cag Corporate Dossier May 2012
Cag Corporate Dossier May 2012fastmpj
 
Gschwind, PowerAI: A Co-Optimized Software Stack for AI on Power
Gschwind, PowerAI: A Co-Optimized Software Stack for AI on PowerGschwind, PowerAI: A Co-Optimized Software Stack for AI on Power
Gschwind, PowerAI: A Co-Optimized Software Stack for AI on PowerMichael Gschwind
 
Cisco and Greenplum Partner to Deliver High-Performance Hadoop Reference ...
Cisco and Greenplum  Partner to Deliver  High-Performance  Hadoop Reference  ...Cisco and Greenplum  Partner to Deliver  High-Performance  Hadoop Reference  ...
Cisco and Greenplum Partner to Deliver High-Performance Hadoop Reference ...EMC
 

What's hot (18)

Characterization of hadoop jobs using unsupervised learning
Characterization of hadoop jobs using unsupervised learningCharacterization of hadoop jobs using unsupervised learning
Characterization of hadoop jobs using unsupervised learning
 
Machine programming
Machine programmingMachine programming
Machine programming
 
Cache and consistency in nosql
Cache and consistency in nosqlCache and consistency in nosql
Cache and consistency in nosql
 
MongoDB on Windows Azure
MongoDB on Windows AzureMongoDB on Windows Azure
MongoDB on Windows Azure
 
MongoDB on Windows Azure
MongoDB on Windows AzureMongoDB on Windows Azure
MongoDB on Windows Azure
 
A SURVEY OF NEURAL NETWORK HARDWARE ACCELERATORS IN MACHINE LEARNING
A SURVEY OF NEURAL NETWORK HARDWARE ACCELERATORS IN MACHINE LEARNING A SURVEY OF NEURAL NETWORK HARDWARE ACCELERATORS IN MACHINE LEARNING
A SURVEY OF NEURAL NETWORK HARDWARE ACCELERATORS IN MACHINE LEARNING
 
gfs-sosp2003
gfs-sosp2003gfs-sosp2003
gfs-sosp2003
 
A Survey of Machine Learning Methods Applied to Computer ...
A Survey of Machine Learning Methods Applied to Computer ...A Survey of Machine Learning Methods Applied to Computer ...
A Survey of Machine Learning Methods Applied to Computer ...
 
PowerAI Deep dive
PowerAI Deep divePowerAI Deep dive
PowerAI Deep dive
 
Attaching cloud storage to a campus grid using parrot, chirp, and hadoop
Attaching cloud storage to a campus grid using parrot, chirp, and hadoopAttaching cloud storage to a campus grid using parrot, chirp, and hadoop
Attaching cloud storage to a campus grid using parrot, chirp, and hadoop
 
Near Data Computing Architectures: Opportunities and Challenges for Apache Spark
Near Data Computing Architectures: Opportunities and Challenges for Apache SparkNear Data Computing Architectures: Opportunities and Challenges for Apache Spark
Near Data Computing Architectures: Opportunities and Challenges for Apache Spark
 
Presentation
PresentationPresentation
Presentation
 
Deep Learning
Deep LearningDeep Learning
Deep Learning
 
Increasing Throughput per Node for Content Delivery Networks
Increasing Throughput per Node for Content Delivery NetworksIncreasing Throughput per Node for Content Delivery Networks
Increasing Throughput per Node for Content Delivery Networks
 
How Apollo Group Evaluted MongoDB
How Apollo Group Evaluted MongoDBHow Apollo Group Evaluted MongoDB
How Apollo Group Evaluted MongoDB
 
Cag Corporate Dossier May 2012
Cag Corporate Dossier May 2012Cag Corporate Dossier May 2012
Cag Corporate Dossier May 2012
 
Gschwind, PowerAI: A Co-Optimized Software Stack for AI on Power
Gschwind, PowerAI: A Co-Optimized Software Stack for AI on PowerGschwind, PowerAI: A Co-Optimized Software Stack for AI on Power
Gschwind, PowerAI: A Co-Optimized Software Stack for AI on Power
 
Cisco and Greenplum Partner to Deliver High-Performance Hadoop Reference ...
Cisco and Greenplum  Partner to Deliver  High-Performance  Hadoop Reference  ...Cisco and Greenplum  Partner to Deliver  High-Performance  Hadoop Reference  ...
Cisco and Greenplum Partner to Deliver High-Performance Hadoop Reference ...
 

Viewers also liked (8)

Pasos fundamentales para el éxito de tu negocio Tiens
Pasos fundamentales para el éxito de tu negocio TiensPasos fundamentales para el éxito de tu negocio Tiens
Pasos fundamentales para el éxito de tu negocio Tiens
 
Genética Alvaro
Genética AlvaroGenética Alvaro
Genética Alvaro
 
Mat. gestión. modulo 6
Mat. gestión. modulo 6Mat. gestión. modulo 6
Mat. gestión. modulo 6
 
Analise das Demonstracoes
Analise das DemonstracoesAnalise das Demonstracoes
Analise das Demonstracoes
 
Ensayo tics
Ensayo ticsEnsayo tics
Ensayo tics
 
Centro de investigaciones
Centro de investigacionesCentro de investigaciones
Centro de investigaciones
 
LETT Årsrapport 2013
LETT Årsrapport 2013LETT Årsrapport 2013
LETT Årsrapport 2013
 
6820 19027-1-pb
6820 19027-1-pb6820 19027-1-pb
6820 19027-1-pb
 

Similar to Watson A System Designed For Answers

2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit Mumbai2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit MumbaiAnand Haridass
 
Breaking the Silos: Storage for Analytics & AI
Breaking the Silos: Storage for Analytics & AIBreaking the Silos: Storage for Analytics & AI
Breaking the Silos: Storage for Analytics & AIDataWorks Summit
 
AI Scalability for the Next Decade
AI Scalability for the Next DecadeAI Scalability for the Next Decade
AI Scalability for the Next DecadePaula Koziol
 
IBM Power Systems: Designed for Data
IBM Power Systems: Designed for DataIBM Power Systems: Designed for Data
IBM Power Systems: Designed for DataIBM Power Systems
 
GPU 101: The Beast In Data Centers
GPU 101: The Beast In Data CentersGPU 101: The Beast In Data Centers
GPU 101: The Beast In Data CentersRommel Garcia
 
Introduction of Deep Learning
Introduction of Deep LearningIntroduction of Deep Learning
Introduction of Deep LearningMyungjin Lee
 
Ανδρέας Τσαγκάρης, 5th Digital Banking Forum
Ανδρέας Τσαγκάρης, 5th Digital Banking ForumΑνδρέας Τσαγκάρης, 5th Digital Banking Forum
Ανδρέας Τσαγκάρης, 5th Digital Banking ForumStarttech Ventures
 
Cluster Tutorial
Cluster TutorialCluster Tutorial
Cluster Tutorialcybercbm
 
IBM POWER - An ideal platform for scale-out deployments
IBM POWER - An ideal platform for scale-out deploymentsIBM POWER - An ideal platform for scale-out deployments
IBM POWER - An ideal platform for scale-out deploymentsthinkASG
 
Covid-19 Response Capability with Power Systems
Covid-19 Response Capability with Power SystemsCovid-19 Response Capability with Power Systems
Covid-19 Response Capability with Power SystemsGanesan Narayanasamy
 
Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.Karl Roche
 
Machine Learning and Power AI Workshop v4
Machine Learning and Power AI Workshop v4Machine Learning and Power AI Workshop v4
Machine Learning and Power AI Workshop v4LennartF
 
How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors
How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors
How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors DataWorks Summit/Hadoop Summit
 
Optimizing Hortonworks Apache Spark machine learning workloads for contempora...
Optimizing Hortonworks Apache Spark machine learning workloads for contempora...Optimizing Hortonworks Apache Spark machine learning workloads for contempora...
Optimizing Hortonworks Apache Spark machine learning workloads for contempora...Indrajit Poddar
 
OpenPOWER/POWER9 Webinar from MIT and IBM
OpenPOWER/POWER9 Webinar from MIT and IBM OpenPOWER/POWER9 Webinar from MIT and IBM
OpenPOWER/POWER9 Webinar from MIT and IBM Ganesan Narayanasamy
 

Similar to Watson A System Designed For Answers (20)

IBM Watson
IBM WatsonIBM Watson
IBM Watson
 
2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit Mumbai2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit Mumbai
 
Breaking the Silos: Storage for Analytics & AI
Breaking the Silos: Storage for Analytics & AIBreaking the Silos: Storage for Analytics & AI
Breaking the Silos: Storage for Analytics & AI
 
AI Scalability for the Next Decade
AI Scalability for the Next DecadeAI Scalability for the Next Decade
AI Scalability for the Next Decade
 
IBM Power Systems: Designed for Data
IBM Power Systems: Designed for DataIBM Power Systems: Designed for Data
IBM Power Systems: Designed for Data
 
GPU 101: The Beast In Data Centers
GPU 101: The Beast In Data CentersGPU 101: The Beast In Data Centers
GPU 101: The Beast In Data Centers
 
Introduction of Deep Learning
Introduction of Deep LearningIntroduction of Deep Learning
Introduction of Deep Learning
 
Ανδρέας Τσαγκάρης, 5th Digital Banking Forum
Ανδρέας Τσαγκάρης, 5th Digital Banking ForumΑνδρέας Τσαγκάρης, 5th Digital Banking Forum
Ανδρέας Τσαγκάρης, 5th Digital Banking Forum
 
Cluster Tutorial
Cluster TutorialCluster Tutorial
Cluster Tutorial
 
IBM Watson
IBM Watson IBM Watson
IBM Watson
 
IBM POWER - An ideal platform for scale-out deployments
IBM POWER - An ideal platform for scale-out deploymentsIBM POWER - An ideal platform for scale-out deployments
IBM POWER - An ideal platform for scale-out deployments
 
kumarResume
kumarResumekumarResume
kumarResume
 
Covid-19 Response Capability with Power Systems
Covid-19 Response Capability with Power SystemsCovid-19 Response Capability with Power Systems
Covid-19 Response Capability with Power Systems
 
IBM Watson
IBM WatsonIBM Watson
IBM Watson
 
Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.
 
Machine Learning and Power AI Workshop v4
Machine Learning and Power AI Workshop v4Machine Learning and Power AI Workshop v4
Machine Learning and Power AI Workshop v4
 
RESUME_FULLTIME
RESUME_FULLTIMERESUME_FULLTIME
RESUME_FULLTIME
 
How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors
How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors
How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors
 
Optimizing Hortonworks Apache Spark machine learning workloads for contempora...
Optimizing Hortonworks Apache Spark machine learning workloads for contempora...Optimizing Hortonworks Apache Spark machine learning workloads for contempora...
Optimizing Hortonworks Apache Spark machine learning workloads for contempora...
 
OpenPOWER/POWER9 Webinar from MIT and IBM
OpenPOWER/POWER9 Webinar from MIT and IBM OpenPOWER/POWER9 Webinar from MIT and IBM
OpenPOWER/POWER9 Webinar from MIT and IBM
 

Watson A System Designed For Answers

  • 1. IBM Systems and Technology February 2011 An IBM White Paper Watson – A System Designed for Answers The future of workload optimized systems design
  • 2. 2 Watson – A System Designed for Answers Executive summary Over the last century, IBM has achieved numerous scientific breakthroughs through its commitment to research and its tradi- tion of Grand Challenges. These Grand Challenges—such as Deep Blue®, which was designed to rival world chess champion Gary Kasparov—work to push science in ways that weren’t thought possible before. Watson is the latest IBM Research Grand Challenge, designed to further the science of natural language processing through advances in question and answer technology. Watson is a workload optimized system based on IBM DeepQA architecture running on a cluster of IBM® POWER7® processor-based servers. After four years of intense research and development by a team of IBM researchers, Watson competed on Jeopardy! in February 2011, performing at the level of human Today, with companies increasingly capturing critical business experts in terms of precision, confidence and speed against two information in natural language documentation, there is growing of the best-known and most successful Jeopardy! Champions, interest in workload optimized systems that deeply analyze the Ken Jennings and Brad Rutter. This white paper explains content of natural language questions to answer those questions Watson’s workload optimized system design, how it’s emblematic with precision. Advances in question answering (QA) technology of the future of systems design, and why this represents a new will increasingly help support professionals in critical and timely computing paradigm. decision making in areas such as health care, business intelli- gence, knowledge discovery, enterprise knowledge management, Jeopardy! The IBM challenge and customer support. In 1997, Deep Blue, the computer chess-playing system devel- oped by IBM Research, captured worldwide attention by com- With QA in mind, IBM settled on a challenge to build a com- peting successfully against world chess champion Gary Kasparov. puter system called “Watson” (after Thomas J. Watson, the It was the culmination of a grand challenge to advance the sci- founder of IBM), which could compete at the human champion ence of computing in a way that created great popular interest. level in real time on the American TV quiz show Jeopardy! The program, which has been broadcast in the United States for more than 25 years, pits three human contestants against one another to answer rich natural language questions over a broad
  • 3. IBM Systems and Technology 3 range of topics, with penalties for wrong answers. In this three- Watson competed against two of the most well-known and suc- person competition, confidence, precision and answering speed cessful Jeopardy! champions—Ken Jennings and Brad Rutter—in are of critical importance, as contestants usually come up with a two-match contest aired over three consecutive nights begin- their answers in the few seconds it takes for the host to read a ning on February 14, 2011. clue. To compete in this game at human-champion levels, a computer system would need to answer roughly 70 percent of IBM DeepQA the questions asked with greater than 80 percent precision in DeepQA is a massively parallel probabilistic evidence-based three seconds or less. architecture. For the Jeopardy! Challenge, more than 100 differ- ent techniques are used to analyze natural language, identify Watson represents an impressive leap forward in systems design sources, find and generate hypotheses, find and score evidence, and analytics. It runs IBM’s DeepQA technology, a new kind of and merge and rank hypotheses. Far more important than any analytics capability that can perform thousands of simultaneous particular technique is the way all these techniques are combined tasks in seconds to provide precise answers to questions. in DeepQA such that overlapping approaches can bring their Powered by IBM POWER7 processor technology, Watson is an strengths to bear and contribute to improvements in accuracy, example of the complex analytics workloads that are becoming confidence, or speed. increasingly common and critical to business success and competitiveness in today’s data-intensive environment.
  • 4. 4 Watson – A System Designed for Answers DeepQA is an architecture with an accompanying methodology, of Apache UIMA, enables the scale-out of UIMA applications but it is not specific to the Jeopardy! Challenge. IBM has begun using asynchronous messaging. Watson uses UIMA-AS to scale adapting it to different business applications and additional out across 2,880 POWER7 cores in a cluster of 90 IBM Power® exploratory challenge problems including medicine, enterprise 750 servers. UIMA_AS manages all of the inter-process commu- search and gaming. nication using the open JMS standard. The UIMA-AS deploy- ment on POWER7 enabled Watson to deliver answers in one to The overarching principles in DeepQA are: six seconds. 1. Massive parallelism: Exploit massive parallelism in the con- Watson has roughly 200 million pages of natural language sideration of multiple interpretations and hypotheses. content (equivalent to reading 1 million books). Watson uses the 2. Many experts: Facilitate the integration, application and con- Apache Hadoop framework to facilitate preprocessing the large textual evaluation of a wide range of loosely coupled proba- volume of data in order to create in-memory datasets used at bilistic question and content analytics. run-time. Watson’s DeepQA UIMA annotators were deployed 3. Pervasive confidence estimation: No single component as mappers in the Hadoop map-reduce framework, which dis- commits to an answer; all components produce features and tributed them across processors in the cluster. Hadoop con- associated confidences, scoring different question and content tributes to optimal CPU utilization and also provides convenient interpretations. An underlying confidence processing substrate tools for deploying, managing, and monitoring the data learns how to stack and combine the scores. analysis process. 4. Integrate shallow and deep knowledge: Balance the use of strict semantics and shallow semantics, leveraging many Harnessing POWER7 loosely formed ontologies. Watson harnesses the massive parallel processing performance of its POWER7 processors to execute its thousands of Speed and scale-out DeepQA tasks simultaneously on individual processor cores. DeepQA is developed using Apache UIMA, a framework Each of Watson’s 90 clustered IBM Power 750 servers features implementation of the Unstructured Information Management 32 POWER7 cores running at 3.55 GHz. Running the Linux® Architecture. UIMA was designed to support interoperability operating system, the servers are housed in 10 racks along with and scale-out of text and multimodal analysis applications. All of associated I/O nodes and communications hubs. The system has the components in DeepQA are implemented as UIMA annota- a combined total of 16 Terabytes of memory and can operate at tors. These are components that analyze text and produce anno- over 80 Teraflops (trillions of operations per second). tations or assertions about the text. Over time Watson has evolved so that the system now has hundred of components. With its innovative, eight-core processor design, POWER7 is UIMA facilitated rapid component integration, testing and ideally suited for massively parallel processing of Watson’s evaluation. analytics algorithms. POWER7 also features 500 gigabytes of on-chip communications bandwidth, contributing to exceptional Early implementations of Watson ran on a single processor, efficiency of both memory and processor utilization. And since which required two hours to answer a single question. The each server packs 32 high performance POWER7 cores with up DeepQA computation is embarrassing parallel, however, and so to 512 GB of memory, the Power 750 makes an ideal platform it can be divided into a number of independent parts, each of for Watson’s processor and memory-hungry Java processes. which can be executed by a separate processor. UIMA-AS, part
  • 5. IBM Systems and Technology 5 Designing Watson on commercially available Power 750 servers was a deliberate choice to ensure more rapid adoption of opti- mized systems in industries such as healthcare and financial serv- ices. That goal was a fundamental difference between Watson and Deep Blue, which was a highly customized supercomputer. Deep Blue was based on an earlier generation of Power proces- sor technology, featuring a 30 node RS/6000 SP system, with each node containing a single 120 MHz POWER2 processor. But in addition to the regular POWER2 processors, Deep Blue’s performance was enhanced with 480 special purpose chess processor chips. The same Power 750 server used by Watson is already deployed today by thousands of organizations in optimized systems that provide for both complex analytics and transaction processing. Rice University in Houston, Texas, for example, uses IBM Power 750 systems to accelerate the understanding of the human-expert levels in terms of precision, confidence and speed. molecular basis of cancer through the application of genome The project has advanced the fields of unstructured data analysis technologies. POWER7 systems have given Rice more analytics, natural language processing, and the design of work- flexibility and efficiency, enabling them to pursue a broader load optimized systems. Beyond Jeopardy!, the technology range of research challenges on a single system than was possible behind Watson can be adapted to solve business and societal before. And GHY International, a customs brokerage firm in problems—for example, diagnosing disease, handling online Canada, migrated to a new Power 750 running Power AIX®, technical support questions, and parsing vast tracts of legal Power i and Power Linux to better support their clients’ documents—and to drive progress across industries. increased engagement in international trading. With PowerVM™ virtualization, GHY is now able to deploy new Watson’s ability to understand the meaning and context of capabilities in as little as five minutes to support their clients’ human language, and rapidly process information to find precise changing needs. answers to complex questions, holds enormous potential to transform how computers can help people accomplish tasks in A system designed for answers business and their personal lives. After four years of intense research and development by a team of IBM researchers, Watson has demonstrated its ability to compete on Jeopardy! against champion players, performing at
  • 6. For more information To learn more about Watson, POWER7 and workload opti- mized systems, please contact your IBM marketing representa- tive or IBM Business Partner, or visit the following websites: ● ibm.com/systems/power/advantages/watson ● ibm.com/systems/power © Copyright IBM Corporation 2011 IBM Systems and Technology Group Route 100 Somers, NY 10589 Produced in the United States of America February 2011 All Rights Reserved IBM, the IBM logo, ibm.com, Power, POWER7 and DEEP BLUE are trademarks of International Business Machines Corporation in the United States, other countries or both. If these and other IBM trademarked terms are marked on their first occurrence in this information with a trademark symbol (® or ™), these symbols indicate U.S. registered or common law trademarks owned by IBM at the time this information was published. Such trademarks may also be registered or common law trademarks in other countries. A current list of IBM trademarks is available on the web at “Copyright and trademark information” at ibm.com/legal/copytrade.shtml Other company, product or service names may be trademarks or service marks of others. Please Recycle POW03061-USEN-00