SlideShare a Scribd company logo
1 of 28
Download to read offline
© 2015 International Business Machines Corporation1
IBM
© 20015 IBM Corporation
January 2017
Frank Stein, Informs Certified Analytics Professional
Director of Analytics Solution Center
IBM
fstein@us.ibm.com
www.ibm.com/ascdc
Leveraging Information for Smarter Organizational Outcomes
Cognitive Computing:
At the Cross-Roads of Data
Science and Natural Language
Processing
© 2015 International Business Machines Corporation2
IBM
Agenda
Part 1 Cognitive Computing (Frank Stein)
 What is Cognitive Computing?
 Real-Life Examples
 Playing with Cognitive
Part 2 Statistical NLP (Mona Diab)
Q&A
© 2015 International Business Machines Corporation4
IBM
China
Almaden
Austin
Tokyo
Zurich
IndiaHaifa
IBM Research: The journey to Watson
Machine
Learning
Natural
Language
Processing
High
Performance
Computing
Knowledge
Representation
and
Reasoning
Question
Answering
Technology
Unstructured
Information
Management
Watson
4
Ireland
Australia
Brazil
Africa
Tokyo
© 2015 International Business Machines Corporation5
IBM
Businesses are “dying of thirst in an ocean of data”
80%
of the world’s data
today is
unstructured
90%
of the world’s data
was created in the
last two years
1 in 2
Business leaders
don’t have access
to data they need
© 2015 International Business Machines Corporation6
IBM
Data is growing exponentially – A Problem or Opportunity?
44 zettabytes
unstructured data
structured data
20202010
You are here
© 2015 International Business Machines Corporation7
IBM
Watson answers a grand challenge
Can we design a computing system that rivals a human’s ability to answer
questions posed in natural language, interpreting meaning and context and
retrieving, analyzing and understanding vast amounts of information in real-time?
© 2015 International Business Machines Corporation8
IBM
8IBM Confidential
video
© 2015 International Business Machines Corporation9
IBM
1900 1950 2011
Watson is ushering in a new era of computing . . .
With the goal to create a new partnership that enhances, scales
and accelerates human expertise.
© 2015 International Business Machines Corporation10
IBM
We Rely on Many Types of Analytics to Process Data
Descriptive
Predictive
Prescriptive
Cognitive
What happened; a single source of the truth
What will happen and implications
What should we do
The way we think
Three capabilities differentiate cognitive systems from
traditional programmed computing systems…
Reasoning
They reason. They understand
underlying ideas and concepts.
They form hypothesis. They
infer and extract concepts.
Learning
They never stop learning
getting more valuable with
time. Advancing with each
new piece of information,
interaction, and outcome.
They develop “expertise”.
Understanding
Cognitive systems
understand like humans
do.
…. allowing them to interact with humans.
Humans
excel at:
Dilemmas
Compassion
Dreaming
Abstraction
Imagination
Morals
Generalization
Cognitive
Systems
excel at:
Common Sense
Natural Language
Locating Knowledge
Pattern Identification
Machine Learning
Eliminate Bias
Endless Capacity
Cognitive systems forge a new partnership
between man and machine.
With the goal to create a new partnership that enhances, scales and
accelerates human expertise.
14©2016 IBM Corporation
of not knowing.The price
1
5
Examples include:
Analyst reports
tweets
Wire tap transcripts
Battlefield docs
E-mails
Texts
Forensic reports
Newspapers
Blogs
Wiki
Court rulings
International crime
database
Stolen vehicle data
Missing persons
data
Data, information, and expertise
create the foundation.
Cognitive systems rely on collections of
data and information:
© 2015 International Business Machines Corporation16
IBM
Unstructured Information Management Applications (UIMA)
Questions
Ingested Corpus
of
User Domain Info
Watson Advisor Cognitive Computing Pipeline Architecture
Answers
Scores & Evidence
Primary
Search
Candidate
Answers
Answer
Scoring
Contextual
Scoring
Trained
Models
Evidence
Retrieval
Question
Analysis
Hypothesis
Generation
Scoring
Final
Merging &
Ranking
© 2015 International Business Machines Corporation17
IBM
The main BlackEnergy executable
being dropped from the Excel
Spreadsheet (vba_macro.exe)
executes an additional two binaries
that it creates: FONTCACHE.DAT
and runndll32.exe
Malware BlackEnergy
Software executable
Threat_Action dropped
Software Excel Spreadsheet
Indicator vba_macro.exe
Software binaries
Indicator FONTCACHE.DAT
Indicator runndll32.exe
Annotate
Identify mentions and relations in unstructured text.
Watson Knowledge Studio
© 2015 International Business Machines Corporation18
IBM
Teaching Watson
Ingestion Pipeline
Q/AFactoid
Knowledge
Canvassing
Knowledge
Graph
Domain content
Watson Knowledge
Studio
Define/train
annotators
SIRE
Ground Truth Information
Q&A Training
Watson
Runtime
© 2015 International Business Machines Corporation21
IBM
Some Examples
© 2015 International Business Machines Corporation22
IBM
Where do you need a deeper bond with
your organization, client, constituent?
 E.g. Staples, Hilton (Pepper the Robot)
Where do you need to have everyone
perform as an expert? (Augmenting
Intelligence)
 Watson Oncology Advisor, Watson
Teacher Advisor
Embedded Cognition
 Whirlpool (kitchen appliances),
Medtronics (insulin pump), GM (cars)
Cognitive Business Processes
 Airbus Smarter Fleet Management
Discovery, Research
 Watson for Drug Discovery with
Ontario Brain Institute
What will you do with Cognitive?
© 2015 International Business Machines Corporation23
IBM
Watson Oncology Cognitive Assistant:
Helping oncologists treat cancer patients
Business problem:
Need better individualized cancer treatment plans
Solution:
• Suggestions to help inform oncologists’ decisions
based on 600K+ pieces of evidence and 2M pages of
text from 42 publications
• Analyzes patient data against thousands of historical
cases and trained through 5000+ Memorial Sloan-
Kettering MD and analyst hours
• Evolves with the fast-changing field
Attacking the cause of
one in four deaths
IBM Watson
Oncology
Built with Memorial Sloan Kettering
© 2015 International Business Machines Corporation27
IBM
 Grammy-winning music producer Alex Da
Kid used Watson’s technology to inspire his
new song about heartbreak, “Not Easy.”
 Watson analyzed the last five years of
culture and music data
 To identify the most pervasive themes,
Watson Alchemy Language API used to
read and understand Nobel Peace Prize
speeches, New York Times articles, etc.
 The Watson Tone Analyzer API then
ingested more than 2 million lines of related
social content to understand the emotional
sentiment
 Used Watson Beat - a cognitive technology
that understands music and lets artists
change the sound of a song based on the
mood they want to express
Cognitive Creativity
© 2015 International Business Machines Corporation34
IBM
…and more new Watson Services APIs continue to emerge frequently
Watson is available as a set of services delivered as APIs in the
Cloud bluemix.net
© 2015 International Business Machines Corporation35
IBM
AlchemyLanguage
Twelve APIs around text analysis service functions, each of which uses sophisticated natural language processing techniques to
analyze your content and add high-level semantic information
 Entity Extraction: what are the entities (people, places, organizations, etc.) in text
 Sentiment Analysis: how are people talking about the entities (positive, negative)
 Keyword Extraction: identify important topics in content
 Concept Tagging: high-level concepts in text (e.g. article is about monetary policy)
 Relation Extraction: subject / action relations between entities
 Taxonomy Classifier: hierarchical categorization (finance/personal finance/credit card)
 Author Extraction: who wrote the article
 Language Detection: what language is this written in
 Text Extraction: extract the important parts of text within an article
 Microformat Parsing: enhances webpage categorization and indexing and to perform content
discovery tasks
 Feed Detection: discover new content, including blog posts, news articles and comment streams.
 Linked Data Support: bring any content into the semantic web
© 2015 International Business Machines Corporation36
IBM
Watson Data Platform allows employees to work
together to gain insight from data.
 Enables collaboration of Data Scientists, Data Engineers,
Business Analysts and Developers
 Provides data cleansing, visualization and sharing
capabilities
 Support for analytic notebooks
 Supports R, python, Scala, Rstudio, Shiny, and sparklyr (R
interface to Spark), Java
Watson Machine Learning built on Apache Spark
automatically can build models on structured and
unstructured information
 Apache SparkML (also available from Bluemix.net )
 Cognitive Assistance for Data Science technology scores
machine learning algorithms against the data to
recommend best match
Watson Data Platform with Machine Learning (new 2017)
© 2015 International Business Machines Corporation37
IBM
Personality Insights
© 2015 International Business Machines Corporation38
IBM
MLK Speech Analyzed by Personality Insights
© 2015 International Business Machines Corporation39
IBM
Obama Farewell Speech Analyzed by Personality Insights
© 2015 International Business Machines Corporation41
IBM
Better Data = Better Outcomes, need to curate the ingested corpus, be careful
about the ontology
Significant Upfront work training the system – but it will pay off as the system
improves over time
Cognitive and Cloud Services go together
Domain adaptation requires domain expertise
Address user anxiety over AI
 Partnership on AI – established with Microsoft, Amazon, Google and Facebook
Will conduct and publish research in such areas as Ethics,
Fairness/inclusiveness, transparency, privacy; trustworthiness, reliability,
and robustness
What we’ve learned so far
© 2015 International Business Machines Corporation43
IBM

More Related Content

What's hot

introduction to data science
introduction to data scienceintroduction to data science
introduction to data sciencebhavesh lande
 
Data Science For Social Scientists Workshop
Data Science For Social Scientists WorkshopData Science For Social Scientists Workshop
Data Science For Social Scientists WorkshopIan Hopkinson
 
Introduction to data science club
Introduction to data science clubIntroduction to data science club
Introduction to data science clubData Science Club
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecasesSreenatha Reddy K R
 
Demystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine LearningDemystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine LearningJulian Bright
 
Data science presentation 2nd CI day
Data science presentation 2nd CI dayData science presentation 2nd CI day
Data science presentation 2nd CI dayMohammed Barakat
 
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...DATAVERSITY
 
The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration James Hendler
 
Keynote - An overview on Big Data & Data Science - Dr Gregory Piatetsky-Shapiro
Keynote -  An overview on Big Data & Data Science - Dr Gregory Piatetsky-ShapiroKeynote -  An overview on Big Data & Data Science - Dr Gregory Piatetsky-Shapiro
Keynote - An overview on Big Data & Data Science - Dr Gregory Piatetsky-ShapiroData ScienceTech Institute
 
Python for Data Science - TDC 2015
Python for Data Science - TDC 2015Python for Data Science - TDC 2015
Python for Data Science - TDC 2015Gabriel Moreira
 
A data view of the data science process
A data view of the data science processA data view of the data science process
A data view of the data science processMathieu d'Aquin
 
Data science as a professional career
Data science as a professional careerData science as a professional career
Data science as a professional careerDavid Rostcheck
 
Big Data in Education Sector
Big Data in Education SectorBig Data in Education Sector
Big Data in Education SectorKaran Sachdeva
 

What's hot (20)

Data Science and its impact on society
Data Science and its impact on societyData Science and its impact on society
Data Science and its impact on society
 
What is Data Science
What is Data ScienceWhat is Data Science
What is Data Science
 
Data Science 101
Data Science 101Data Science 101
Data Science 101
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data science
 
Data Science For Social Scientists Workshop
Data Science For Social Scientists WorkshopData Science For Social Scientists Workshop
Data Science For Social Scientists Workshop
 
Introduction to data science club
Introduction to data science clubIntroduction to data science club
Introduction to data science club
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecases
 
Demystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine LearningDemystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine Learning
 
Data Analytics
Data AnalyticsData Analytics
Data Analytics
 
Data science presentation 2nd CI day
Data science presentation 2nd CI dayData science presentation 2nd CI day
Data science presentation 2nd CI day
 
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
 
The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration
 
Keynote - An overview on Big Data & Data Science - Dr Gregory Piatetsky-Shapiro
Keynote -  An overview on Big Data & Data Science - Dr Gregory Piatetsky-ShapiroKeynote -  An overview on Big Data & Data Science - Dr Gregory Piatetsky-Shapiro
Keynote - An overview on Big Data & Data Science - Dr Gregory Piatetsky-Shapiro
 
Python for Data Science - TDC 2015
Python for Data Science - TDC 2015Python for Data Science - TDC 2015
Python for Data Science - TDC 2015
 
Lecture #01
Lecture #01Lecture #01
Lecture #01
 
Lecture #03
Lecture #03Lecture #03
Lecture #03
 
Intro to Data Science Concepts
Intro to Data Science ConceptsIntro to Data Science Concepts
Intro to Data Science Concepts
 
A data view of the data science process
A data view of the data science processA data view of the data science process
A data view of the data science process
 
Data science as a professional career
Data science as a professional careerData science as a professional career
Data science as a professional career
 
Big Data in Education Sector
Big Data in Education SectorBig Data in Education Sector
Big Data in Education Sector
 

Viewers also liked

Big data solutions in Azure
Big data solutions in AzureBig data solutions in Azure
Big data solutions in AzureMostafa
 
Build intelligent solutions using Azure
Build intelligent solutions using AzureBuild intelligent solutions using Azure
Build intelligent solutions using AzureMostafa
 
Extending Product Outreach with Outlook Connectors
Extending Product Outreach with Outlook ConnectorsExtending Product Outreach with Outlook Connectors
Extending Product Outreach with Outlook ConnectorsMostafa
 
Azure Machine Learning
Azure Machine LearningAzure Machine Learning
Azure Machine LearningMostafa
 
Introducing Power BI Embedded
Introducing Power BI EmbeddedIntroducing Power BI Embedded
Introducing Power BI EmbeddedMostafa
 
Big Data Solutions for Healthcare
Big Data Solutions for HealthcareBig Data Solutions for Healthcare
Big Data Solutions for HealthcareOdinot Stanislas
 

Viewers also liked (6)

Big data solutions in Azure
Big data solutions in AzureBig data solutions in Azure
Big data solutions in Azure
 
Build intelligent solutions using Azure
Build intelligent solutions using AzureBuild intelligent solutions using Azure
Build intelligent solutions using Azure
 
Extending Product Outreach with Outlook Connectors
Extending Product Outreach with Outlook ConnectorsExtending Product Outreach with Outlook Connectors
Extending Product Outreach with Outlook Connectors
 
Azure Machine Learning
Azure Machine LearningAzure Machine Learning
Azure Machine Learning
 
Introducing Power BI Embedded
Introducing Power BI EmbeddedIntroducing Power BI Embedded
Introducing Power BI Embedded
 
Big Data Solutions for Healthcare
Big Data Solutions for HealthcareBig Data Solutions for Healthcare
Big Data Solutions for Healthcare
 

Similar to NOVA Data Science Meetup 1/19/2017 - Presentation 1

Steve Mills - Your Cognitive Future
Steve Mills - Your Cognitive FutureSteve Mills - Your Cognitive Future
Steve Mills - Your Cognitive FutureSogetiLabs
 
IBM Cognitive platform: IBM Watson
IBM Cognitive platform: IBM WatsonIBM Cognitive platform: IBM Watson
IBM Cognitive platform: IBM WatsonDaniela Zuppini
 
Ai & ibm watson cookbook
Ai & ibm watson cookbookAi & ibm watson cookbook
Ai & ibm watson cookbookJerry O'Brien
 
1.0 nikos maniatis presentation
1.0 nikos maniatis presentation1.0 nikos maniatis presentation
1.0 nikos maniatis presentationtechnology_forum
 
Ai, IBM Watson External
Ai, IBM Watson ExternalAi, IBM Watson External
Ai, IBM Watson ExternalJerry O'Brien
 
IBM Watson Analytics Presentation
IBM Watson Analytics PresentationIBM Watson Analytics Presentation
IBM Watson Analytics PresentationIan Balina
 
Cognitive Computing
Cognitive ComputingCognitive Computing
Cognitive ComputingPietro Leo
 
Infrastructure Designed for Cognitive Workloads: Why is it Crucial? - Xavier ...
Infrastructure Designed for Cognitive Workloads: Why is it Crucial? - Xavier ...Infrastructure Designed for Cognitive Workloads: Why is it Crucial? - Xavier ...
Infrastructure Designed for Cognitive Workloads: Why is it Crucial? - Xavier ...WithTheBest
 
IBM Watson and natural language processing
IBM Watson and natural language processingIBM Watson and natural language processing
IBM Watson and natural language processingRoberto Villa
 
Data Science at Speed. At Scale.
Data Science at Speed. At Scale.Data Science at Speed. At Scale.
Data Science at Speed. At Scale.DataWorks Summit
 
Machine Learning and Power AI Workshop v4
Machine Learning and Power AI Workshop v4Machine Learning and Power AI Workshop v4
Machine Learning and Power AI Workshop v4LennartF
 
2106-04-30 - IBM - The Era of the Cognitive Home - for distribution
2106-04-30 - IBM - The Era of the Cognitive Home - for distribution2106-04-30 - IBM - The Era of the Cognitive Home - for distribution
2106-04-30 - IBM - The Era of the Cognitive Home - for distributionThorsten Schroeer
 
CWIN17 san francisco-ai implementation-pub
CWIN17 san francisco-ai implementation-pubCWIN17 san francisco-ai implementation-pub
CWIN17 san francisco-ai implementation-pubCapgemini
 
Watson and Cognitive Meetup April 2017
Watson and Cognitive Meetup   April 2017Watson and Cognitive Meetup   April 2017
Watson and Cognitive Meetup April 2017Rick Osowski
 
Preparing the next generation for the cognitive era - NFAIS Keynote
Preparing the next generation for the cognitive era - NFAIS KeynotePreparing the next generation for the cognitive era - NFAIS Keynote
Preparing the next generation for the cognitive era - NFAIS KeynoteSteven Miller
 
SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...
SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...
SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...DATAVERSITY
 

Similar to NOVA Data Science Meetup 1/19/2017 - Presentation 1 (20)

Steve Mills - Your Cognitive Future
Steve Mills - Your Cognitive FutureSteve Mills - Your Cognitive Future
Steve Mills - Your Cognitive Future
 
IBM Cognitive platform: IBM Watson
IBM Cognitive platform: IBM WatsonIBM Cognitive platform: IBM Watson
IBM Cognitive platform: IBM Watson
 
Ai & ibm watson cookbook
Ai & ibm watson cookbookAi & ibm watson cookbook
Ai & ibm watson cookbook
 
Bridging the Gap
Bridging the GapBridging the Gap
Bridging the Gap
 
1.0 nikos maniatis presentation
1.0 nikos maniatis presentation1.0 nikos maniatis presentation
1.0 nikos maniatis presentation
 
Ai, IBM Watson External
Ai, IBM Watson ExternalAi, IBM Watson External
Ai, IBM Watson External
 
Watson in 2016
Watson in 2016Watson in 2016
Watson in 2016
 
IBM Watson Analytics Presentation
IBM Watson Analytics PresentationIBM Watson Analytics Presentation
IBM Watson Analytics Presentation
 
Cognitive Computing
Cognitive ComputingCognitive Computing
Cognitive Computing
 
Infrastructure Designed for Cognitive Workloads: Why is it Crucial? - Xavier ...
Infrastructure Designed for Cognitive Workloads: Why is it Crucial? - Xavier ...Infrastructure Designed for Cognitive Workloads: Why is it Crucial? - Xavier ...
Infrastructure Designed for Cognitive Workloads: Why is it Crucial? - Xavier ...
 
IBM Watson and natural language processing
IBM Watson and natural language processingIBM Watson and natural language processing
IBM Watson and natural language processing
 
Future of AI
Future of AIFuture of AI
Future of AI
 
Data Science at Speed. At Scale.
Data Science at Speed. At Scale.Data Science at Speed. At Scale.
Data Science at Speed. At Scale.
 
Machine Learning and Power AI Workshop v4
Machine Learning and Power AI Workshop v4Machine Learning and Power AI Workshop v4
Machine Learning and Power AI Workshop v4
 
2106-04-30 - IBM - The Era of the Cognitive Home - for distribution
2106-04-30 - IBM - The Era of the Cognitive Home - for distribution2106-04-30 - IBM - The Era of the Cognitive Home - for distribution
2106-04-30 - IBM - The Era of the Cognitive Home - for distribution
 
CWIN17 san francisco-ai implementation-pub
CWIN17 san francisco-ai implementation-pubCWIN17 san francisco-ai implementation-pub
CWIN17 san francisco-ai implementation-pub
 
Watson and Cognitive Meetup April 2017
Watson and Cognitive Meetup   April 2017Watson and Cognitive Meetup   April 2017
Watson and Cognitive Meetup April 2017
 
Building Bots Using IBM Watson
Building Bots Using IBM WatsonBuilding Bots Using IBM Watson
Building Bots Using IBM Watson
 
Preparing the next generation for the cognitive era - NFAIS Keynote
Preparing the next generation for the cognitive era - NFAIS KeynotePreparing the next generation for the cognitive era - NFAIS Keynote
Preparing the next generation for the cognitive era - NFAIS Keynote
 
SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...
SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...
SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...
 

More from NOVA DATASCIENCE

NOVA Data Science Meetup 2-21-2018 Presentation Cloudera Data Science Workbench
NOVA Data Science Meetup 2-21-2018 Presentation Cloudera Data Science WorkbenchNOVA Data Science Meetup 2-21-2018 Presentation Cloudera Data Science Workbench
NOVA Data Science Meetup 2-21-2018 Presentation Cloudera Data Science WorkbenchNOVA DATASCIENCE
 
Nova Data Science Meetup 9-20-2017 Introduction
Nova Data Science Meetup 9-20-2017 IntroductionNova Data Science Meetup 9-20-2017 Introduction
Nova Data Science Meetup 9-20-2017 IntroductionNOVA DATASCIENCE
 
Nova Data Science Meetup 9-20-2017 Presentation How AI Powers the Comcast X1 ...
Nova Data Science Meetup 9-20-2017 Presentation How AI Powers the Comcast X1 ...Nova Data Science Meetup 9-20-2017 Presentation How AI Powers the Comcast X1 ...
Nova Data Science Meetup 9-20-2017 Presentation How AI Powers the Comcast X1 ...NOVA DATASCIENCE
 
NOVA Data Science Meetup 8-10-2017 Presentation - State of Data Science Educa...
NOVA Data Science Meetup 8-10-2017 Presentation - State of Data Science Educa...NOVA Data Science Meetup 8-10-2017 Presentation - State of Data Science Educa...
NOVA Data Science Meetup 8-10-2017 Presentation - State of Data Science Educa...NOVA DATASCIENCE
 
NOVA Data Science Meetup 5/10/2017 - Presentation Building a gigaword corpus
NOVA Data Science Meetup 5/10/2017 - Presentation Building a gigaword corpusNOVA Data Science Meetup 5/10/2017 - Presentation Building a gigaword corpus
NOVA Data Science Meetup 5/10/2017 - Presentation Building a gigaword corpusNOVA DATASCIENCE
 
NOVA Data Science Meetup 1/19/2017 - Presentation 2
NOVA Data Science Meetup 1/19/2017 - Presentation 2NOVA Data Science Meetup 1/19/2017 - Presentation 2
NOVA Data Science Meetup 1/19/2017 - Presentation 2NOVA DATASCIENCE
 

More from NOVA DATASCIENCE (6)

NOVA Data Science Meetup 2-21-2018 Presentation Cloudera Data Science Workbench
NOVA Data Science Meetup 2-21-2018 Presentation Cloudera Data Science WorkbenchNOVA Data Science Meetup 2-21-2018 Presentation Cloudera Data Science Workbench
NOVA Data Science Meetup 2-21-2018 Presentation Cloudera Data Science Workbench
 
Nova Data Science Meetup 9-20-2017 Introduction
Nova Data Science Meetup 9-20-2017 IntroductionNova Data Science Meetup 9-20-2017 Introduction
Nova Data Science Meetup 9-20-2017 Introduction
 
Nova Data Science Meetup 9-20-2017 Presentation How AI Powers the Comcast X1 ...
Nova Data Science Meetup 9-20-2017 Presentation How AI Powers the Comcast X1 ...Nova Data Science Meetup 9-20-2017 Presentation How AI Powers the Comcast X1 ...
Nova Data Science Meetup 9-20-2017 Presentation How AI Powers the Comcast X1 ...
 
NOVA Data Science Meetup 8-10-2017 Presentation - State of Data Science Educa...
NOVA Data Science Meetup 8-10-2017 Presentation - State of Data Science Educa...NOVA Data Science Meetup 8-10-2017 Presentation - State of Data Science Educa...
NOVA Data Science Meetup 8-10-2017 Presentation - State of Data Science Educa...
 
NOVA Data Science Meetup 5/10/2017 - Presentation Building a gigaword corpus
NOVA Data Science Meetup 5/10/2017 - Presentation Building a gigaword corpusNOVA Data Science Meetup 5/10/2017 - Presentation Building a gigaword corpus
NOVA Data Science Meetup 5/10/2017 - Presentation Building a gigaword corpus
 
NOVA Data Science Meetup 1/19/2017 - Presentation 2
NOVA Data Science Meetup 1/19/2017 - Presentation 2NOVA Data Science Meetup 1/19/2017 - Presentation 2
NOVA Data Science Meetup 1/19/2017 - Presentation 2
 

Recently uploaded

Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxTanveerAhmed817946
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 

Recently uploaded (20)

Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 

NOVA Data Science Meetup 1/19/2017 - Presentation 1

  • 1. © 2015 International Business Machines Corporation1 IBM © 20015 IBM Corporation January 2017 Frank Stein, Informs Certified Analytics Professional Director of Analytics Solution Center IBM fstein@us.ibm.com www.ibm.com/ascdc Leveraging Information for Smarter Organizational Outcomes Cognitive Computing: At the Cross-Roads of Data Science and Natural Language Processing
  • 2. © 2015 International Business Machines Corporation2 IBM Agenda Part 1 Cognitive Computing (Frank Stein)  What is Cognitive Computing?  Real-Life Examples  Playing with Cognitive Part 2 Statistical NLP (Mona Diab) Q&A
  • 3. © 2015 International Business Machines Corporation4 IBM China Almaden Austin Tokyo Zurich IndiaHaifa IBM Research: The journey to Watson Machine Learning Natural Language Processing High Performance Computing Knowledge Representation and Reasoning Question Answering Technology Unstructured Information Management Watson 4 Ireland Australia Brazil Africa Tokyo
  • 4. © 2015 International Business Machines Corporation5 IBM Businesses are “dying of thirst in an ocean of data” 80% of the world’s data today is unstructured 90% of the world’s data was created in the last two years 1 in 2 Business leaders don’t have access to data they need
  • 5. © 2015 International Business Machines Corporation6 IBM Data is growing exponentially – A Problem or Opportunity? 44 zettabytes unstructured data structured data 20202010 You are here
  • 6. © 2015 International Business Machines Corporation7 IBM Watson answers a grand challenge Can we design a computing system that rivals a human’s ability to answer questions posed in natural language, interpreting meaning and context and retrieving, analyzing and understanding vast amounts of information in real-time?
  • 7. © 2015 International Business Machines Corporation8 IBM 8IBM Confidential video
  • 8. © 2015 International Business Machines Corporation9 IBM 1900 1950 2011 Watson is ushering in a new era of computing . . . With the goal to create a new partnership that enhances, scales and accelerates human expertise.
  • 9. © 2015 International Business Machines Corporation10 IBM We Rely on Many Types of Analytics to Process Data Descriptive Predictive Prescriptive Cognitive What happened; a single source of the truth What will happen and implications What should we do The way we think
  • 10. Three capabilities differentiate cognitive systems from traditional programmed computing systems… Reasoning They reason. They understand underlying ideas and concepts. They form hypothesis. They infer and extract concepts. Learning They never stop learning getting more valuable with time. Advancing with each new piece of information, interaction, and outcome. They develop “expertise”. Understanding Cognitive systems understand like humans do. …. allowing them to interact with humans.
  • 11. Humans excel at: Dilemmas Compassion Dreaming Abstraction Imagination Morals Generalization Cognitive Systems excel at: Common Sense Natural Language Locating Knowledge Pattern Identification Machine Learning Eliminate Bias Endless Capacity Cognitive systems forge a new partnership between man and machine. With the goal to create a new partnership that enhances, scales and accelerates human expertise.
  • 12. 14©2016 IBM Corporation of not knowing.The price
  • 13. 1 5 Examples include: Analyst reports tweets Wire tap transcripts Battlefield docs E-mails Texts Forensic reports Newspapers Blogs Wiki Court rulings International crime database Stolen vehicle data Missing persons data Data, information, and expertise create the foundation. Cognitive systems rely on collections of data and information:
  • 14. © 2015 International Business Machines Corporation16 IBM Unstructured Information Management Applications (UIMA) Questions Ingested Corpus of User Domain Info Watson Advisor Cognitive Computing Pipeline Architecture Answers Scores & Evidence Primary Search Candidate Answers Answer Scoring Contextual Scoring Trained Models Evidence Retrieval Question Analysis Hypothesis Generation Scoring Final Merging & Ranking
  • 15. © 2015 International Business Machines Corporation17 IBM The main BlackEnergy executable being dropped from the Excel Spreadsheet (vba_macro.exe) executes an additional two binaries that it creates: FONTCACHE.DAT and runndll32.exe Malware BlackEnergy Software executable Threat_Action dropped Software Excel Spreadsheet Indicator vba_macro.exe Software binaries Indicator FONTCACHE.DAT Indicator runndll32.exe Annotate Identify mentions and relations in unstructured text. Watson Knowledge Studio
  • 16. © 2015 International Business Machines Corporation18 IBM Teaching Watson Ingestion Pipeline Q/AFactoid Knowledge Canvassing Knowledge Graph Domain content Watson Knowledge Studio Define/train annotators SIRE Ground Truth Information Q&A Training Watson Runtime
  • 17. © 2015 International Business Machines Corporation21 IBM Some Examples
  • 18. © 2015 International Business Machines Corporation22 IBM Where do you need a deeper bond with your organization, client, constituent?  E.g. Staples, Hilton (Pepper the Robot) Where do you need to have everyone perform as an expert? (Augmenting Intelligence)  Watson Oncology Advisor, Watson Teacher Advisor Embedded Cognition  Whirlpool (kitchen appliances), Medtronics (insulin pump), GM (cars) Cognitive Business Processes  Airbus Smarter Fleet Management Discovery, Research  Watson for Drug Discovery with Ontario Brain Institute What will you do with Cognitive?
  • 19. © 2015 International Business Machines Corporation23 IBM Watson Oncology Cognitive Assistant: Helping oncologists treat cancer patients Business problem: Need better individualized cancer treatment plans Solution: • Suggestions to help inform oncologists’ decisions based on 600K+ pieces of evidence and 2M pages of text from 42 publications • Analyzes patient data against thousands of historical cases and trained through 5000+ Memorial Sloan- Kettering MD and analyst hours • Evolves with the fast-changing field Attacking the cause of one in four deaths IBM Watson Oncology Built with Memorial Sloan Kettering
  • 20. © 2015 International Business Machines Corporation27 IBM  Grammy-winning music producer Alex Da Kid used Watson’s technology to inspire his new song about heartbreak, “Not Easy.”  Watson analyzed the last five years of culture and music data  To identify the most pervasive themes, Watson Alchemy Language API used to read and understand Nobel Peace Prize speeches, New York Times articles, etc.  The Watson Tone Analyzer API then ingested more than 2 million lines of related social content to understand the emotional sentiment  Used Watson Beat - a cognitive technology that understands music and lets artists change the sound of a song based on the mood they want to express Cognitive Creativity
  • 21. © 2015 International Business Machines Corporation34 IBM …and more new Watson Services APIs continue to emerge frequently Watson is available as a set of services delivered as APIs in the Cloud bluemix.net
  • 22. © 2015 International Business Machines Corporation35 IBM AlchemyLanguage Twelve APIs around text analysis service functions, each of which uses sophisticated natural language processing techniques to analyze your content and add high-level semantic information  Entity Extraction: what are the entities (people, places, organizations, etc.) in text  Sentiment Analysis: how are people talking about the entities (positive, negative)  Keyword Extraction: identify important topics in content  Concept Tagging: high-level concepts in text (e.g. article is about monetary policy)  Relation Extraction: subject / action relations between entities  Taxonomy Classifier: hierarchical categorization (finance/personal finance/credit card)  Author Extraction: who wrote the article  Language Detection: what language is this written in  Text Extraction: extract the important parts of text within an article  Microformat Parsing: enhances webpage categorization and indexing and to perform content discovery tasks  Feed Detection: discover new content, including blog posts, news articles and comment streams.  Linked Data Support: bring any content into the semantic web
  • 23. © 2015 International Business Machines Corporation36 IBM Watson Data Platform allows employees to work together to gain insight from data.  Enables collaboration of Data Scientists, Data Engineers, Business Analysts and Developers  Provides data cleansing, visualization and sharing capabilities  Support for analytic notebooks  Supports R, python, Scala, Rstudio, Shiny, and sparklyr (R interface to Spark), Java Watson Machine Learning built on Apache Spark automatically can build models on structured and unstructured information  Apache SparkML (also available from Bluemix.net )  Cognitive Assistance for Data Science technology scores machine learning algorithms against the data to recommend best match Watson Data Platform with Machine Learning (new 2017)
  • 24. © 2015 International Business Machines Corporation37 IBM Personality Insights
  • 25. © 2015 International Business Machines Corporation38 IBM MLK Speech Analyzed by Personality Insights
  • 26. © 2015 International Business Machines Corporation39 IBM Obama Farewell Speech Analyzed by Personality Insights
  • 27. © 2015 International Business Machines Corporation41 IBM Better Data = Better Outcomes, need to curate the ingested corpus, be careful about the ontology Significant Upfront work training the system – but it will pay off as the system improves over time Cognitive and Cloud Services go together Domain adaptation requires domain expertise Address user anxiety over AI  Partnership on AI – established with Microsoft, Amazon, Google and Facebook Will conduct and publish research in such areas as Ethics, Fairness/inclusiveness, transparency, privacy; trustworthiness, reliability, and robustness What we’ve learned so far
  • 28. © 2015 International Business Machines Corporation43 IBM