Data Science vs Artificial
Intelligence: a useful distinction
Dr. Christoforos Anagnostopoulos
Founder and Chief Data Scientist, Mentat Innovations
ex-Assoc. Professor, Imperial College London
Mentat
Credentials
PhD in Machine Learning at Imperial College
Research Fellow, Statistical Laboratory, Cambridge U.
ex-Lecturer in Statistical Modelling at Imperial College
Numerous consulting projects (defence, web, social media)
Data journalism (The Independent, The Guardian, BBC, …)
Founder and Chief Scientist of Mentat (AI for Cybersecurity)
Many ideas in this talk were the result of conversations with:
Prof. David Hand, OBE (Chairman of Advisory Board of Mentat)
Renowned statistician, twice president of the Royal Statistical Society
Machine Learning / AI
This talk
Data Science
Are the two technology trends different?
Does it matter?
Data Science vs AI: skill-sets
Courtesy of Cathy O’Neil and Rachel Schutt
Data Science: the origins
Many rediscoveries of data
analysis in the last 20 years
1970s: Peter Naur introduces “data science” as a
synonym for “computer science”
1997: Jeff Wu claims “statisticians” are “data scientists”.
2001: William Cleveland introduces data science as an
independent discipline, extending statistics.
2008: DJ Patil (LinkedIn) and Jeff
Hammerbacher (Facebook) describe their job
role as that of “Data Scientist”
Data Science: the origins
The term has been trending since 2008
38 years
Artificial Intelligence: the origins
1950: Turing Test (Turing)
1960s: Perceptron, logic programming (Minsky)
1970s: AI winter (Lighthill)
1980s: Rise and Fall of Expert Systems, Lisp
1997: Chess (Deep Blue)
2011: Jeopardy! (Watson)
2015: Big Data, Computational Stats, Bayes Revival, Machine Learning, Deep Learning, GPUs, Open Source
Big Data
Volume: SQL, HDFS
Velocity: complex event processing, Apache Storm, Apache Spark Streaming
Variety: structured, semi-structured, unstructured
social graphs, system logs, tweets/blogs, CCTV
many variables, sampling variability (e.g., spatiotemporal)
Volume
Velocity
Variety
Veracity
Value
Nobody wants data.
Everybody wants data-driven,
reliable, actionable insights.
Big Data
Big Data in Science
Models guided by theory
Well formulated questions
Big Data in the Commercial World
Little to no theory
“Needle in the haystack”
Often question is unclear (“fishing”)
Data quality low
The data value pyramid
Access
Analytics
Fusion
Artificial Intelligence
Machine Learning
Data Science
Value
Big Data query
learn
The data value pyramid
Access
Example: “Give me all transactions by this user”
Tech: DB, HDFS, Query Languages, APIs
Analytics
Example: “How many transactions per country?”
Tech: Anything from Excel to Apache Spark. Mostly
basic aggregations, strong visualisation component
Fusion
Example: “Give me the office building locations of all
employees who visited this website yesterday”
Tech: Break through silos. Data Lakes, Big Data stacks
Plus: Tremendous amount of value unlocked in this process
Minus: retrospective, user-driven, manual
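The three query levels above (Access, Analytics, Fusion) can be sketched with an in-memory SQLite database. This is only an illustration: the table names, column names and rows are hypothetical and do not come from the talk.

```python
import sqlite3

# Hypothetical toy schema and data, for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE transactions (user_id TEXT, country TEXT, amount REAL);
    CREATE TABLE web_visits (employee_id TEXT, site TEXT, day TEXT);
    CREATE TABLE employees (employee_id TEXT, office TEXT);
    INSERT INTO transactions VALUES
        ('alice', 'UK', 10.0), ('alice', 'FR', 25.0), ('bob', 'UK', 5.0);
    INSERT INTO web_visits VALUES
        ('e1', 'example.org', '2024-01-01'),
        ('e2', 'example.org', '2024-01-02');
    INSERT INTO employees VALUES ('e1', 'London'), ('e2', 'Athens');
""")

# Access: "Give me all transactions by this user"
access = conn.execute(
    "SELECT country, amount FROM transactions "
    "WHERE user_id = ? ORDER BY amount",
    ("alice",),
).fetchall()

# Analytics: "How many transactions per country?" (a basic aggregation)
analytics = conn.execute(
    "SELECT country, COUNT(*) FROM transactions "
    "GROUP BY country ORDER BY country"
).fetchall()

# Fusion: join two silos (web logs and HR records) to answer
# "office locations of employees that visited this website on a given day"
fusion = conn.execute(
    "SELECT e.office FROM web_visits v "
    "JOIN employees e ON e.employee_id = v.employee_id "
    "WHERE v.site = ? AND v.day = ?",
    ("example.org", "2024-01-02"),
).fetchall()

print(access, analytics, fusion)
```

All three answers are retrospective and user-driven: each only returns what was recorded, in response to a question somebody had to think of asking.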
The data value pyramid
Learning: forecasting
Example: “How many new users will I get next week?”
Tech: Predictive Analytics tools (ML/DS)
Learning: regression
Example: “How many emails should I expect a user of
these characteristics to receive per day?”
Tech: Regression tools (machine learning / statistics)
Learning: classification
Example: “Given the email header, the email body, and
the type of attachment, classify it as Spam or not.”
Tech: Classification tools (machine learning)
Learning: inference / anomaly detection
Example: “Why did we have a peak in traffic?”
Tech: Data Science
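Two of the learning tasks above can be sketched in a few lines of plain Python. The sketch assumes a straight-line trend for the forecast and a simple z-score rule for spotting the traffic peak; both choices, the function names and the numbers are illustrative assumptions, not methods taken from the talk.

```python
from statistics import mean, stdev

def fit_line(xs, ys):
    """Ordinary least squares fit of y = a + b * x."""
    mx, my = mean(xs), mean(ys)
    b = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
         / sum((x - mx) ** 2 for x in xs))
    a = my - b * mx
    return a, b

def flag_anomalies(values, threshold=2.0):
    """Indices of values more than `threshold` sample std devs from the mean."""
    m, s = mean(values), stdev(values)
    return [i for i, v in enumerate(values) if abs(v - m) > threshold * s]

# Forecasting: "How many new users will I get next week?"
weeks = [1, 2, 3, 4, 5]
new_users = [100, 110, 120, 130, 140]   # made-up history
a, b = fit_line(weeks, new_users)
forecast_week_6 = a + b * 6

# Anomaly detection: locate the peak in traffic (made-up counts)
traffic = [50, 52, 49, 51, 50, 48, 200, 51]
peaks = flag_anomalies(traffic)

print(forecast_week_6, peaks)
```

Flagging the peak is only the first step. Answering why it happened is the inference question, which is where the slide places data science rather than an off-the-shelf tool.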
What does success mean?
interact: in controlled/semi-controlled environments. Success: live trials / competitions
predict: the future / the class of new examples. Success: predictive accuracy
infer: unobserved/unobservable attributes. Success: model quality / causality
query: what has been recorded. Success: consistency
Why is Learning hard?




The Learning revolution in AI
learning by example
Hard to describe precisely how cats differ from dogs
Easy to provide examples
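Learning by example can be sketched with a 1-nearest-neighbour classifier: no rule for telling cats from dogs is ever written down, only labelled examples are supplied. The two features (weight and ear length) and every number below are hypothetical, chosen purely for illustration.

```python
import math

# Hypothetical labelled examples: (weight in kg, ear length in cm) -> label
training = [
    ((4.0, 6.5), "cat"), ((3.5, 7.0), "cat"), ((5.0, 6.0), "cat"),
    ((20.0, 12.0), "dog"), ((30.0, 14.0), "dog"), ((12.0, 10.0), "dog"),
]

def classify(features):
    """Return the label of the closest training example (1-nearest neighbour)."""
    _, label = min(training, key=lambda ex: math.dist(ex[0], features))
    return label

# Success for a predictive task is measured on examples not used for training.
held_out = [((4.5, 6.8), "cat"), ((24.0, 13.0), "dog")]
accuracy = sum(classify(x) == y for x, y in held_out) / len(held_out)

print(classify((4.5, 6.8)), classify((24.0, 13.0)), accuracy)
```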
Machine Learning and AI
tremendous success stories — game AI
Machine Learning and AI
success stories — computer vision
Machine Learning and AI
success stories — machine translation
Market is driving a “standardisation” of AI/ML APIs.
Tech Stack
If your problem fits one of these APIs, you’re 99% there.
If not, your data science pipeline might still use them.
Data Science vs AI: skill-sets
Courtesy of Cathy O’Neil and Rachel Schutt
Data Science
Infrastructure
Frameworks
Queries
Learning Theory and Algorithms
UI with Domain Expert or Business
ML Products
ML toolsets
Danger!
Research
Labelled Data
Artificial Intelligence
Labelled Data
Infrastructure
Frameworks
Queries
Learning Theory and Algorithms
UI with Domain Expert or Business
ML Products
ML toolsets
Danger!
Research
Pipeline
Heuristics
Visualisation / Reporting / UX
Actionability
Interpretation / Validation of Results
Big Data Stack
domain expertise
machine learning
data science
hacking
Data Cleaning
Data Moulding
Model Lifecycle Management
Bespoke Statistical Models
Machine Learning
Learning Stack
Futurology
Machines are outperforming
humans in an increasingly broad
array of everyday tasks.
Last time this happened was the
Industrial Revolution.
No more call centres, truck drivers, shop assistants.
No more doctors? Not yet. But less looking at X-rays.
stone iron steam electricity AI
Overconfident machines
If true wisdom is to know what you don't
know, machines are still pretty stupid.
“Complex Models fail in
Complicated Ways”
Learning by Example /
replicating human cognition is
not always a good idea.
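The overconfidence point can be illustrated with a toy one-feature classifier. The weights, the training range and the "elephant" input below are all assumed for the sake of the sketch; nothing is fitted and nothing comes from the talk. The model reports near-certainty for an input far outside anything it could have seen, and it takes a separate range check to notice that it does not know.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# Hypothetical logistic model: probability that an animal is a dog given
# its weight in kg. The weights are assumed, not learned from data.
W, B = 1.5, -12.0              # decision boundary at 8 kg
TRAIN_RANGE = (3.0, 30.0)      # assumed range of weights seen in training

def p_dog(weight_kg):
    """Model confidence that the animal is a dog."""
    return sigmoid(W * weight_kg + B)

def in_training_range(weight_kg):
    """Crude check for whether the model has seen comparable inputs."""
    lo, hi = TRAIN_RANGE
    return lo <= weight_kg <= hi

elephant = 4000.0              # far outside the training range
confidence = p_dog(elephant)

print(confidence, in_training_range(elephant))
```

The classifier has no way to answer "neither": its confidence grows with distance from the decision boundary, not with the amount of evidence near the input.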
By way of conclusion
Chris Anderson quoting Peter Norvig:
“All models are wrong, and increasingly you can
succeed without them.”
in “The End of Theory: The Data Deluge Makes the
Scientific Method Obsolete”
Peter Norvig: “That's a silly statement, I didn't
say it, and I disagree with it. […] Theory has not
ended, it is expanding into new forms.”
info@ment.at
@canagnos

