SlideShare a Scribd company logo
1 of 26
WATSON
COMPUTER
INTRODUCTION
WATSON is an artificially intelligent computer system capable of answering
questions posed in natural language, developed in IBM's DeepQA project.
QA technology takes a question expressed in natural language, seeks to
understand it in much greater detail, and returns a precise answer to the
question.
Watson applies advanced natural language processing, information retrieval,
knowledge representation, automated reasoning, and machine learning
technologies for this purpose.
 It incorporates a local corpus (database) of information.
1/23
JEOPARDY !
Initially developed to answer questions on Jeopardy !, a quiz show known for its tricky
questions.
Watson participated in 2011 against former champions Brad Rutter and Ken Jennings
and won over them.
2/23
REQUIREMENTS
• 90 x IBM Power 750 servers
• 2880 POWER7 cores
• POWER7 3.55 GHz chip
• 500 GB per sec on-chip bandwidth
• 10 Gb Ethernet network
• 16 Terabytes of memory
• 20 Terabytes of disk, clustered
• Can operate at 80 Teraflops
• Runs IBM DeepQA software
• Scales out with and searches vast amounts of unstructured
information with UIMA & Hadoop open source components
• Linux provides a scalable, open platform, optimized
to exploit POWER7 performance
• 10 racks include servers, networking, shared disk system,
cluster controllers
3/23
ALGORITHMS USED
1. SVM (Support Vector Machines) Classifier
• SVM is supervised learning model that analyzes data and recognizes patterns
• Given a set of training examples, each marked as belonging to one of two categories, it builds a
model that assigns new examples into one category or the other
• It is a non-probabilistic binary linear classifier.
2. Naïve Bayes’ Classifier
• It is a family of classifiers based on applying Bayes' theorem with strong
(naive) independence assumptions between the features.
• So it is a conditional probability model.
• Particularly suited when the dimensionality of the inputs is high.
4/23
ALGORITHMS USED
3. Word Sense Disambiguation
• It is an open problem of natural language
processing and ontology. WSD is identifying
which sense of a word (i.e. meaning) is used in
a sentence, when the word has multiple
meanings.
• It requires two strict things: a dictionary to
specify the senses and a corpus of
language data to be disambiguated. WordNet
is used as a dictionary in this context. For
example –
5/23
3. Word Sense Disambiguation (contd.)
• The sentence as well as the query forms an ordered set of words. We then compute the sense
network between every pair of words from query and sentence.
ALGORITHMS USED
6/23
PROCESS
The basic working of Watson computer is
based on four steps –
1. Question
Analysis
4. Answer
Extraction
(Result)
3. Hypotheses
Generation
2. Document
Retrieval
7/23
PROCESS
Step 1 – Determining answer type
•Uses machine learning techniques like SVM
(Support Vector Machine), Naïve Bayes
classifiers
•Above techniques apply on a tagged corpus
of information
Step 2 – Query formation
• Assume question is a valid IR query
• Remove stop words from question
Example:
In 1897 Swiss climber Matthias Zurbriggen
became the first to scale this Argentinean
peak.
1. Question Analysis
8/23
PROCESS
•The task of the document retrieval module is
to select a small set from the collection which
can be practically handled in the later stages.
•Using important terms from the question,
Watson performs a search over millions of
documents to find relevant passages.
• Data can be stored either in a local corpus or
can be accessed from the Internet.
2. Document Retrieval
9/23
PROCESS
• Extracts important entities – so called “candidate answers” – from the documents.
• WordNet is used as a sense/semantic dictionary.
• Obtain statistics of a particular word from a large corpus by assigning probabilities based on
occurrence of target concept.
• Hypotheses generation of example given above -
3. Hypotheses Generation
10/23
PROCESS
Step 1 – Answer Scoring
•Candidate answers are scored using a large number of answer scoring analytics running
parallel.
• Algorithms like Type Coercion scorer, temporal match etc. are used.
• Answer scoring of example given above -
4. Answer Extraction
11/23
PROCESS
Step 2 – Analysing Scores
•The scores are grouped into meaningful groups, or evidence dimensions.
•A plot of these yields the evidence profile for the candidate.
•Watson statistically combines the scores to produce a final confidence score.
4. Answer Extraction
Aconcagua
12/23
EXAMPLE
13/23
14/23
15/23
16/23
17/23
18/23
19/23
EXISTING CHALLENGES
Healthcare
Medical information doubles
every three years, physician’s
inability to be up-to-date,
complex decision making
Retail
Fulfilling customers’ high
expectation of satisfaction and
effectively analysing growing
mountain of data
Finance
Each day huge financial
information is generated, difficult
to harness
Public Sector
Efficient analysis of enormous
volumes of unstructured,
unverified data
20/23
APPLICATIONS
Health Care
Finance
Retail
Public Sector
Memorial Sloan
Kettering
Genesys
MD Anderson
DBS (Development
Bank of Singapore)
The North Face
Decision-making
Policy and
performance
Public security
21/23
Engaging shoppers !
Wellpoint
FUTURE SCOPE
 Recipe generating platform
 Pharmaceutical industry
 Publishing
 Biotechnology
 Research or inventions
22/23
 Requires a huge database of prior knowledge and information
 Has trouble responding to short clues
 Incapable of coming up with fresh ideas
 More than base knowledge, clues may require thought, an area where humans
still have an edge over Watson Computer
LIMITATIONS
23/23
BIBLIOGRAPHY
 https://researcher.ibm.com/researcher/viewpage.php?id=2121
 Science Behind an Answer http://www03.ibm.com/innovation/us/watson/what-is-watson/science-
behind-an-answer.html
 Jeopardy! IBM Watson Day 1 (Feb 14, 2011)
http://www.youtube.com/watch?v=seNkjYyG3gI&feature=related
 Tom M. Mitchell. 1997. Machine Learning. Computer Science Series. McGraw-Hill.
 Corpora for Question Answering Task, Cognitive Computation Group at the Department of
Computer Science, University of Illinois at Urbana-Champaign.
 Dell Zhang and Wee Sun Lee. 2003. Question Classification using Support Vector Machines. In
Proceedings of the 26th ACM International Conference on Research and Developement in
Information Retrieval (SIGIR’03), pages 26–32, Toronto, Canada.
 www.google.com
 www.wikipedia.com
 www.ibm.com
QUESTIONS ??

More Related Content

What's hot

Using Bioinformatics Data to inform Therapeutics discovery and development
Using Bioinformatics Data to inform Therapeutics discovery and developmentUsing Bioinformatics Data to inform Therapeutics discovery and development
Using Bioinformatics Data to inform Therapeutics discovery and developmentEleanor Howe
 
LearningAG.ppt
LearningAG.pptLearningAG.ppt
LearningAG.pptbutest
 
Intuidex - To be or not to be iid by William M. Pottenger (NYC Machine Learni...
Intuidex - To be or not to be iid by William M. Pottenger (NYC Machine Learni...Intuidex - To be or not to be iid by William M. Pottenger (NYC Machine Learni...
Intuidex - To be or not to be iid by William M. Pottenger (NYC Machine Learni...Hakka Labs
 
Machine learning
Machine learningMachine learning
Machine learningRohit Kumar
 
Data Science, Data & Dashboards Design
Data Science, Data & Dashboards DesignData Science, Data & Dashboards Design
Data Science, Data & Dashboards DesignKoo Ping Shung
 

What's hot (8)

Using Bioinformatics Data to inform Therapeutics discovery and development
Using Bioinformatics Data to inform Therapeutics discovery and developmentUsing Bioinformatics Data to inform Therapeutics discovery and development
Using Bioinformatics Data to inform Therapeutics discovery and development
 
LearningAG.ppt
LearningAG.pptLearningAG.ppt
LearningAG.ppt
 
Nvivo
NvivoNvivo
Nvivo
 
Intuidex - To be or not to be iid by William M. Pottenger (NYC Machine Learni...
Intuidex - To be or not to be iid by William M. Pottenger (NYC Machine Learni...Intuidex - To be or not to be iid by William M. Pottenger (NYC Machine Learni...
Intuidex - To be or not to be iid by William M. Pottenger (NYC Machine Learni...
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Learning
LearningLearning
Learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
Data Science, Data & Dashboards Design
Data Science, Data & Dashboards DesignData Science, Data & Dashboards Design
Data Science, Data & Dashboards Design
 

Viewers also liked

Modellazione Ecoidrologica della Tundra: Fusione del Permafrost, Termocarsism...
Modellazione Ecoidrologica della Tundra: Fusione del Permafrost, Termocarsism...Modellazione Ecoidrologica della Tundra: Fusione del Permafrost, Termocarsism...
Modellazione Ecoidrologica della Tundra: Fusione del Permafrost, Termocarsism...pietro richelli
 
Brain gate system document
Brain gate system documentBrain gate system document
Brain gate system documentSridhar Goud
 
Braingate technology
Braingate technologyBraingate technology
Braingate technologyPraneeth IPz
 

Viewers also liked (7)

BRAIN GATE
BRAIN GATEBRAIN GATE
BRAIN GATE
 
Brain gate
Brain gateBrain gate
Brain gate
 
Modellazione Ecoidrologica della Tundra: Fusione del Permafrost, Termocarsism...
Modellazione Ecoidrologica della Tundra: Fusione del Permafrost, Termocarsism...Modellazione Ecoidrologica della Tundra: Fusione del Permafrost, Termocarsism...
Modellazione Ecoidrologica della Tundra: Fusione del Permafrost, Termocarsism...
 
Brain gate system document
Brain gate system documentBrain gate system document
Brain gate system document
 
Google's project ara
Google's project araGoogle's project ara
Google's project ara
 
Brain gate
Brain gateBrain gate
Brain gate
 
Braingate technology
Braingate technologyBraingate technology
Braingate technology
 

Similar to Watson Computer

Watson - A new era of computing.
Watson - A new era of computing.Watson - A new era of computing.
Watson - A new era of computing.Cesar Maciel
 
Deep learning Tutorial - Part II
Deep learning Tutorial - Part IIDeep learning Tutorial - Part II
Deep learning Tutorial - Part IIQuantUniversity
 
IRJET- Factoid Question and Answering System
IRJET-  	  Factoid Question and Answering SystemIRJET-  	  Factoid Question and Answering System
IRJET- Factoid Question and Answering SystemIRJET Journal
 
ACRL 2011 Data-Driven Library Web Design
ACRL 2011 Data-Driven Library Web DesignACRL 2011 Data-Driven Library Web Design
ACRL 2011 Data-Driven Library Web DesignAmanda Dinscore
 
Week 11 12 chap11 c-2
Week 11 12 chap11 c-2Week 11 12 chap11 c-2
Week 11 12 chap11 c-2Zahir Reza
 
Large Scale Data Mining using Genetics-Based Machine Learning
Large Scale Data Mining using Genetics-Based Machine LearningLarge Scale Data Mining using Genetics-Based Machine Learning
Large Scale Data Mining using Genetics-Based Machine Learningjaumebp
 
Bioinformatics&Databases.ppt
Bioinformatics&Databases.pptBioinformatics&Databases.ppt
Bioinformatics&Databases.pptBlackHunt1
 
Decision CAMP 2014 - Benjamin Grosof Janine Bloomfield - Explanation-based E-...
Decision CAMP 2014 - Benjamin Grosof Janine Bloomfield - Explanation-based E-...Decision CAMP 2014 - Benjamin Grosof Janine Bloomfield - Explanation-based E-...
Decision CAMP 2014 - Benjamin Grosof Janine Bloomfield - Explanation-based E-...Decision CAMP
 
Software Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software EngineeringSoftware Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software EngineeringTao Xie
 
Multi task learning stepping away from narrow expert models 7.11.18
Multi task learning stepping away from narrow expert models 7.11.18Multi task learning stepping away from narrow expert models 7.11.18
Multi task learning stepping away from narrow expert models 7.11.18Cloudera, Inc.
 
Business intelligence and data warehousing
Business intelligence and data warehousingBusiness intelligence and data warehousing
Business intelligence and data warehousingVaishnavi
 
Fake news detection
Fake news detection Fake news detection
Fake news detection shalushamil
 
Watson DevCon 2016 - From Jeopardy! to the Future
Watson DevCon 2016 - From Jeopardy! to the FutureWatson DevCon 2016 - From Jeopardy! to the Future
Watson DevCon 2016 - From Jeopardy! to the FutureIBM Watson
 
Performance Issue? Machine Learning to the rescue!
Performance Issue? Machine Learning to the rescue!Performance Issue? Machine Learning to the rescue!
Performance Issue? Machine Learning to the rescue!Maarten Smeets
 
Brief Tour of Machine Learning
Brief Tour of Machine LearningBrief Tour of Machine Learning
Brief Tour of Machine Learningbutest
 
The Myths + Realities of Machine-Learning Cybersecurity
The Myths + Realities of Machine-Learning CybersecurityThe Myths + Realities of Machine-Learning Cybersecurity
The Myths + Realities of Machine-Learning CybersecurityInterset
 

Similar to Watson Computer (20)

Watson - A new era of computing.
Watson - A new era of computing.Watson - A new era of computing.
Watson - A new era of computing.
 
Deep learning Tutorial - Part II
Deep learning Tutorial - Part IIDeep learning Tutorial - Part II
Deep learning Tutorial - Part II
 
Deep learning for NLP
Deep learning for NLPDeep learning for NLP
Deep learning for NLP
 
IRJET- Factoid Question and Answering System
IRJET-  	  Factoid Question and Answering SystemIRJET-  	  Factoid Question and Answering System
IRJET- Factoid Question and Answering System
 
ACRL 2011 Data-Driven Library Web Design
ACRL 2011 Data-Driven Library Web DesignACRL 2011 Data-Driven Library Web Design
ACRL 2011 Data-Driven Library Web Design
 
Week 11 12 chap11 c-2
Week 11 12 chap11 c-2Week 11 12 chap11 c-2
Week 11 12 chap11 c-2
 
Large Scale Data Mining using Genetics-Based Machine Learning
Large Scale Data Mining using Genetics-Based Machine LearningLarge Scale Data Mining using Genetics-Based Machine Learning
Large Scale Data Mining using Genetics-Based Machine Learning
 
Expert systems
Expert systemsExpert systems
Expert systems
 
Bioinformatics&Databases.ppt
Bioinformatics&Databases.pptBioinformatics&Databases.ppt
Bioinformatics&Databases.ppt
 
Decision CAMP 2014 - Benjamin Grosof Janine Bloomfield - Explanation-based E-...
Decision CAMP 2014 - Benjamin Grosof Janine Bloomfield - Explanation-based E-...Decision CAMP 2014 - Benjamin Grosof Janine Bloomfield - Explanation-based E-...
Decision CAMP 2014 - Benjamin Grosof Janine Bloomfield - Explanation-based E-...
 
Software Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software EngineeringSoftware Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software Engineering
 
Multi task learning stepping away from narrow expert models 7.11.18
Multi task learning stepping away from narrow expert models 7.11.18Multi task learning stepping away from narrow expert models 7.11.18
Multi task learning stepping away from narrow expert models 7.11.18
 
Business intelligence and data warehousing
Business intelligence and data warehousingBusiness intelligence and data warehousing
Business intelligence and data warehousing
 
Fake news detection
Fake news detection Fake news detection
Fake news detection
 
Database part1-
Database part1-Database part1-
Database part1-
 
Introduction
IntroductionIntroduction
Introduction
 
Watson DevCon 2016 - From Jeopardy! to the Future
Watson DevCon 2016 - From Jeopardy! to the FutureWatson DevCon 2016 - From Jeopardy! to the Future
Watson DevCon 2016 - From Jeopardy! to the Future
 
Performance Issue? Machine Learning to the rescue!
Performance Issue? Machine Learning to the rescue!Performance Issue? Machine Learning to the rescue!
Performance Issue? Machine Learning to the rescue!
 
Brief Tour of Machine Learning
Brief Tour of Machine LearningBrief Tour of Machine Learning
Brief Tour of Machine Learning
 
The Myths + Realities of Machine-Learning Cybersecurity
The Myths + Realities of Machine-Learning CybersecurityThe Myths + Realities of Machine-Learning Cybersecurity
The Myths + Realities of Machine-Learning Cybersecurity
 

Recently uploaded

CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 

Recently uploaded (20)

CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 

Watson Computer

  • 2. INTRODUCTION WATSON is an artificially intelligent computer system capable of answering questions posed in natural language, developed in IBM's DeepQA project. QA technology takes a question expressed in natural language, seeks to understand it in much greater detail, and returns a precise answer to the question. Watson applies advanced natural language processing, information retrieval, knowledge representation, automated reasoning, and machine learning technologies for this purpose.  It incorporates a local corpus (database) of information. 1/23
  • 3. JEOPARDY ! Initially developed to answer questions on Jeopardy !, a quiz show known for its tricky questions. Watson participated in 2011 against former champions Brad Rutter and Ken Jennings and won over them. 2/23
  • 4. REQUIREMENTS • 90 x IBM Power 750 servers • 2880 POWER7 cores • POWER7 3.55 GHz chip • 500 GB per sec on-chip bandwidth • 10 Gb Ethernet network • 16 Terabytes of memory • 20 Terabytes of disk, clustered • Can operate at 80 Teraflops • Runs IBM DeepQA software • Scales out with and searches vast amounts of unstructured information with UIMA & Hadoop open source components • Linux provides a scalable, open platform, optimized to exploit POWER7 performance • 10 racks include servers, networking, shared disk system, cluster controllers 3/23
  • 5. ALGORITHMS USED 1. SVM (Support Vector Machines) Classifier • SVM is supervised learning model that analyzes data and recognizes patterns • Given a set of training examples, each marked as belonging to one of two categories, it builds a model that assigns new examples into one category or the other • It is a non-probabilistic binary linear classifier. 2. Naïve Bayes’ Classifier • It is a family of classifiers based on applying Bayes' theorem with strong (naive) independence assumptions between the features. • So it is a conditional probability model. • Particularly suited when the dimensionality of the inputs is high. 4/23
  • 6. ALGORITHMS USED 3. Word Sense Disambiguation • It is an open problem of natural language processing and ontology. WSD is identifying which sense of a word (i.e. meaning) is used in a sentence, when the word has multiple meanings. • It requires two strict things: a dictionary to specify the senses and a corpus of language data to be disambiguated. WordNet is used as a dictionary in this context. For example – 5/23
  • 7. 3. Word Sense Disambiguation (contd.) • The sentence as well as the query forms an ordered set of words. We then compute the sense network between every pair of words from query and sentence. ALGORITHMS USED 6/23
  • 8. PROCESS The basic working of Watson computer is based on four steps – 1. Question Analysis 4. Answer Extraction (Result) 3. Hypotheses Generation 2. Document Retrieval 7/23
  • 9. PROCESS Step 1 – Determining answer type •Uses machine learning techniques like SVM (Support Vector Machine), Naïve Bayes classifiers •Above techniques apply on a tagged corpus of information Step 2 – Query formation • Assume question is a valid IR query • Remove stop words from question Example: In 1897 Swiss climber Matthias Zurbriggen became the first to scale this Argentinean peak. 1. Question Analysis 8/23
  • 10. PROCESS •The task of the document retrieval module is to select a small set from the collection which can be practically handled in the later stages. •Using important terms from the question, Watson performs a search over millions of documents to find relevant passages. • Data can be stored either in a local corpus or can be accessed from the Internet. 2. Document Retrieval 9/23
  • 11. PROCESS • Extracts important entities – so called “candidate answers” – from the documents. • WordNet is used as a sense/semantic dictionary. • Obtain statistics of a particular word from a large corpus by assigning probabilities based on occurrence of target concept. • Hypotheses generation of example given above - 3. Hypotheses Generation 10/23
  • 12. PROCESS Step 1 – Answer Scoring •Candidate answers are scored using a large number of answer scoring analytics running parallel. • Algorithms like Type Coercion scorer, temporal match etc. are used. • Answer scoring of example given above - 4. Answer Extraction 11/23
  • 13. PROCESS Step 2 – Analysing Scores •The scores are grouped into meaningful groups, or evidence dimensions. •A plot of these yields the evidence profile for the candidate. •Watson statistically combines the scores to produce a final confidence score. 4. Answer Extraction Aconcagua 12/23
  • 15. 14/23
  • 16. 15/23
  • 17. 16/23
  • 18. 17/23
  • 19. 18/23
  • 20. 19/23
  • 21. EXISTING CHALLENGES Healthcare Medical information doubles every three years, physician’s inability to be up-to-date, complex decision making Retail Fulfilling customers’ high expectation of satisfaction and effectively analysing growing mountain of data Finance Each day huge financial information is generated, difficult to harness Public Sector Efficient analysis of enormous volumes of unstructured, unverified data 20/23
  • 22. APPLICATIONS Health Care Finance Retail Public Sector Memorial Sloan Kettering Genesys MD Anderson DBS (Development Bank of Singapore) The North Face Decision-making Policy and performance Public security 21/23 Engaging shoppers ! Wellpoint
  • 23. FUTURE SCOPE  Recipe generating platform  Pharmaceutical industry  Publishing  Biotechnology  Research or inventions 22/23
  • 24.  Requires a huge database of prior knowledge and information  Has trouble responding to short clues  Incapable of coming up with fresh ideas  More than base knowledge, clues may require thought, an area where humans still have an edge over Watson Computer LIMITATIONS 23/23
  • 25. BIBLIOGRAPHY  https://researcher.ibm.com/researcher/viewpage.php?id=2121  Science Behind an Answer http://www03.ibm.com/innovation/us/watson/what-is-watson/science- behind-an-answer.html  Jeopardy! IBM Watson Day 1 (Feb 14, 2011) http://www.youtube.com/watch?v=seNkjYyG3gI&feature=related  Tom M. Mitchell. 1997. Machine Learning. Computer Science Series. McGraw-Hill.  Corpora for Question Answering Task, Cognitive Computation Group at the Department of Computer Science, University of Illinois at Urbana-Champaign.  Dell Zhang and Wee Sun Lee. 2003. Question Classification using Support Vector Machines. In Proceedings of the 26th ACM International Conference on Research and Developement in Information Retrieval (SIGIR’03), pages 26–32, Toronto, Canada.  www.google.com  www.wikipedia.com  www.ibm.com