SlideShare a Scribd company logo
1 of 49
Download to read offline
HYBRID INTELLIGENCE:
COMBINING THE POWER OF HUMAN
COMPUTATION AND MACHINE LEARNING
Fabio Casati
University of Trento
1
DEFINITIONS
2
Crowdsourcing: The practice of obtaining information or
input into a task or project by enlisting the services of a large
number of people, either paid or unpaid, typically via the
Internet.
Human-based computation: a computer science technique
in which a machine performs its function by outsourcing
certain steps to humans, usually as microwork
MARIE-JEAN-ANTOINE-NICOLAS DE CARITAT,
MARQUIS DE CONDORCET (1743-1794)
- French philosopher of the Enlightenment
and advocate of public education and
women rights (among lots of other
things)
- Éléments du calcul des probabilités, et
son application aux jeux de hasard, à la
loterie et aux jugements des hommes.
- “Jury theorem”
!3
SIR FRANCIS GALTON
(1822-1911)
Expert in pretty much everything
- Statistician, sociologist, psychologist, anthropologist,
eugenicist, tropical explorer, geographer,
meteorologist, psychometrician, and cake-cutter
- Created the statistical concept of correlation.
- introduced the use of questionnaires and surveys for
collecting data on human communities
- As the initiator of scientific meteorology, devised the
first weather map
- (Was the first to apply statistical methods to the
study of human differences and inheritance of
intelligence)
!4
5
THE WISDOM OF CROWDS
6
Weight: 1197 pounds
Median: (used by Galton): 1208 pounds
Mean: 1197 pounds.
Ken Wallis. Revisiting Francis Galton’s Forecasting Competition.
Statistical Science, 2014.
(787 paying participants)
MORTADELLA
7
Wolfgang von Kempelen, 1770
8
Little et al, Hcomp20109
HOW WOULDYOU SOLVETHIS
10
Greg Little, Lydia B. Chilton, Robert C. Miller, and Max Goldman
TurKit: Tools for Iterative Tasks on Mechanical Turk
HComp 2009
11
Greg Little, Lydia B. Chilton, Robert C. Miller, and Max
Goldman
TurKit: Tools for Iterative Tasks on Mechanical Turk
HComp 2009
VIZWIZ: NEARLY REAL-TIME ANSWERSTOVISUAL
QUESTIONS
12
CITIZEN SCIENCE: ZOONIVERSE
13
REST API
to People
Create task
Run batch*
Monitor
Results
Pay
Platform for human computations. But: how to program it? How to limit
recourse to (expensive) humans? how do we make their work more efficient?
SYSTEMATIC LITERATURE REVIEW
Scientific and evidence-based approach to literature reviews
Systematic Literature reviews (SLR)
Process
Prevalence of antepartum hemorrhage in women with placenta previa: a systematic review and
meta-analysis. Dazhi Fan, Song Wu, Li Liu, Qing Xia,Wen Wang, Xiaoling Guo & Zhengping Liu.
Scientific Reports volume 7,Article number: 40320 (2017)
1. Study on adults 75 and older
2. Involves the use of interaction technology
3. Is an “intervention” (alternatively: RCT)
16
USEFUL BUT PAINFUL…
- Millions of papers published every year
- About half of them is never cited (not even by the authors)
- Incomplete (40-70% of missing papers!)
- From idea to submission: typically 9 to 36 months
- Query repeated multiple times (6-30 months apart), sometimes 60
- ~1/3 abandoned
17
Perrine Créquit, Ludovic Trinquart, Amélie Yavchitz, and Philippe Ravaud. 2016.
Wasted research when systematic reviews fail to provide a complete and up-to- date evidence
synthesis: the example of lung cancer. BMC Medicine 14, 1 (2016), 8.
Can we
crowdsource it?
19
Crowd-based Multi-Predicate Screening of Papers in Literature Reviews. WWW2018
!20
Trained ML models
CAN WE DO BETTER? CAN MACHINE LEARNING HELP?
• Help in screening (keep the same search+filter process but improve it)
• Help in finding (different process), or Live SLR
Crowdsourcing
Model training Trained ML models
ON RCT
21
Wallace et al
Identifying reports of randomized controlled trials (RCTs) via a hybrid machine learning and crowdsourcing approach
Jamia, 2017
predicted probability of being an
RCT of ≤0.1
specificity of 99.8% and an overall
recall of 98%
3 OPTIONS SO FAR
• Expert analysis: the typical approach today (painful, slow, and
expensive even if you don’t notice it)
• Crowdsourcing: Works well: speed, diversity, quality… but at a cost
• For scientists and experts. Hard to use it.
• Machine Learning and Classification: Label, Train, Classify
• Works great only in some cases: fairly “easy” problem, very large pool
!22
HYBRID INTELLIGENCE
!23
ML Algorithms
Budget
Goals and
ConstraintsDecisions
?
Assets Problem
Results
“Side” goal
Trained ML models
Hybrid
(meta-)Algorithms
and processes
Trained ML
models *
APPLICABILITY
• Finite pool, uniqueness of the problem: Not enough items to
train
• Can’t get ML to the precision we need
• Or, we can, but it takes time and in the meantime we initially
leverage crowd heavily, then progressively less (e.g. crisis
situations)
!24
ML, THEN CROWD WHEN IN DOUBT
ML AlgorithmGet training data
Train algorithms
Apply: machine first,
then (maybe) crowd
William Callaghan et al. MechanicalHeart: A Human-Machine Framework for the Classification of Phonocardiograms
CSCW 2018 !25
Trained ML
models
Trained ML
models
Works with weak algorithms for classification problems
(as long as confidence estimate is accurate)
26
when the crowd is more confident than the
machine in the classification of a given instance,
they are most often correct.
Works well only if we take machine input when it
is very confident
William Callaghan et al. MechanicalHeart: A Human-Machine Framework for the Classification of Phonocardiograms
CSCW 2018
A “sprinkle” of ML helps
ML AS ASSISTANT THAT BIASES OUR THINKING
ML AlgorithmGet training data
Train algorithms
!27
Trained ML
models
Trained ML
models
Apply: machine sets a
prior, crowd
Krivosheev et al. Combining Crowd and Machines for Multi-predicate Item Screening
CSCW 2018
P (class | votes) = P (votes | class) * p(class) / p(votes)
Impact on redundancy - always ask crowd
28
- Works with weak algorithms for classification problems
- “sprinkle of crowd” makes it right
EMBED CROWDS INSIDE MACHINE LEARNING
ARCHITECTURES
- Explore feature spaces that are largely unreachable by automatic
extraction,
- Train models that use human-understandable features
Cheng and Bernstein. Flock: Hybrid Crowd-Machine Learning Classifiers. CSCW2015
!29
30Cheng and Bernstein. Flock: Hybrid Crowd-Machine Learning Classifiers. CSCW2015
31
.1 improvement in ROC AUC
hybrid here used as features,
classification is automatic
Cheng and Bernstein. Flock: Hybrid Crowd-Machine Learning Classifiers. CSCW2015
Outliers are important
CROWD HELPS MACHINES HELP CROWD
• Bias the crowd to obtain better and faster (cheaper) responses
!32
34
Ramirez et al. Influencing workers: The case of human-machine collaboration
(in progress)
35
Ramirez et al. Influencing workers: The case of human-machine collaboration
(in progress)
Determinants of Primary
School Non- Enrollment
and Absenteeism
36
GENERAL FINITE POOL PROBLEM
• No clear idea on how well ML can do
• No clear idea on how well crowd can do (not to talk about task design)
• Limited items and limited budget: how to spend it?
• Kind of a meta-active learning problem, where in addition we have to learn how to learn
!37
SMALL STEPS: ACTIVE HYBRID LEARNING
• Given a set of hotel descriptions, find hotels that are kids-friendly
and that are near Macquire
• We have a ML algorithm given, and a crowd or hybrid classifier
• It is a learning vs exploitation trade-off.
!38
ACTIVE HYBRID LEARNING
Restricted version of the general problem
1. Mange trade-off between labelling items to learn vs labeling to classify
2. Actively learn if favour ML or crowd, and then perform active sampling
!39
MAB or RL problem
40
41Krivosheev et al. Active Hybrid Classification (under review)
(SEMI-) AUTOMATED PIPELINE GENERATION
DO CHECK FIX AGGREGATE
SUGGEST
FEATURESGROUPWORK
Model training
Active Learning
Hybrid strategy
TEMPLATES
(SEMI-) AUTOMATED PIPELINE GENERATION
PROCESS
Crowd Research: Open and Scalable University Laboratories,Vaish et al, UIST2017
44
PROCESS
- Open call
- Training materials (on SLRs in general, and SLRs on related topics)
- Screening task (acts also as selection filter)
- Paper assignment - full paper screening (also act as filter)
- Paper reading and “guided” paper summarization (with redundancy
and metadata extraction)
- Peer “grading” (positive, like-style)
- Definition of dimensions for analysis (separate subgroups)
- Selection of group leaders (also based on volunteering)
- Brainstorming in video call with PI and group leaders, each
presenting dimensions
- Second iteration
- Revisiting summaries of papers based on dimensions and filling of
tables
- Cross-check tables
45
ASSISTED TASK DESIGN
- How to define a task
- How to train
- How (much) to test
- Pricing
- Stopping
- Optimizing task assignment to workers
- Finding task design errors early
- => Assist in design for creative work
47
Task design Pipeline design
Conversational agent
SUMMING UP…
• Combining human and machine computation has incredible potential for solving a
variety of tasks
• Get results immediately, while improving ml
• Crisis situations
• Novel versions of old problems (from SLRs to fake news to criminal activities)
• Continuously check and improve areas where ML is weak, even with human-suggested
features
• Nothing of this is actually restricted to “crowd” - works with experts as well
• Move towards systems that do not require expertise, meaning, the average knowledge
worker can use it
!48
THANKS
fabio.casati@unitn.it
49

More Related Content

Similar to Hybrid Intelligence

Introduction to Artificial Intelligence
Introduction to Artificial IntelligenceIntroduction to Artificial Intelligence
Introduction to Artificial IntelligenceLuca Bianchi
 
An introduction to machine learning in biomedical research: Key concepts, pr...
An introduction to machine learning in biomedical research:  Key concepts, pr...An introduction to machine learning in biomedical research:  Key concepts, pr...
An introduction to machine learning in biomedical research: Key concepts, pr...FranciscoJAzuajeG
 
Augmented intelligence as a response to the crisis of artificial intelligence
Augmented intelligence as a response to the crisis of artificial intelligenceAugmented intelligence as a response to the crisis of artificial intelligence
Augmented intelligence as a response to the crisis of artificial intelligenceAlexander Ryzhov
 
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...Matthew Lease
 
algorithmic-decisions, fairness, machine learning, provenance, transparency
algorithmic-decisions, fairness, machine learning, provenance, transparencyalgorithmic-decisions, fairness, machine learning, provenance, transparency
algorithmic-decisions, fairness, machine learning, provenance, transparencyPaolo Missier
 
On Machine Learning and Data Mining
On Machine Learning and Data MiningOn Machine Learning and Data Mining
On Machine Learning and Data Miningbutest
 
Mark2Cure: a crowdsourcing platform for biomedical literature annotation
Mark2Cure: a crowdsourcing platform for biomedical literature annotationMark2Cure: a crowdsourcing platform for biomedical literature annotation
Mark2Cure: a crowdsourcing platform for biomedical literature annotationBenjamin Good
 
Introduction to Data and Computation: Essential capabilities for everyone in ...
Introduction to Data and Computation: Essential capabilities for everyone in ...Introduction to Data and Computation: Essential capabilities for everyone in ...
Introduction to Data and Computation: Essential capabilities for everyone in ...Kim Flintoff
 
The Philosophical Aspects of Data Modelling
The Philosophical Aspects of Data ModellingThe Philosophical Aspects of Data Modelling
The Philosophical Aspects of Data ModellingEmir Muñoz
 
SBQS 2013 Keynote: Cooperative Testing and Analysis
SBQS 2013 Keynote: Cooperative Testing and AnalysisSBQS 2013 Keynote: Cooperative Testing and Analysis
SBQS 2013 Keynote: Cooperative Testing and AnalysisTao Xie
 
Ch 1 Introduction to AI.pdf
Ch 1 Introduction to AI.pdfCh 1 Introduction to AI.pdf
Ch 1 Introduction to AI.pdfKrishnaMadala1
 
History of AI, Current Trends, Prospective Trajectories
History of AI, Current Trends, Prospective TrajectoriesHistory of AI, Current Trends, Prospective Trajectories
History of AI, Current Trends, Prospective TrajectoriesGiovanni Sileno
 
Introaied nancy2019 luengo
Introaied nancy2019 luengoIntroaied nancy2019 luengo
Introaied nancy2019 luengoVanda Luengo
 
Transparency in ML and AI (humble views from a concerned academic)
Transparency in ML and AI (humble views from a concerned academic)Transparency in ML and AI (humble views from a concerned academic)
Transparency in ML and AI (humble views from a concerned academic)Paolo Missier
 
Interacting with an Inferred World: the Challenge of Machine Learning for Hum...
Interacting with an Inferred World: the Challenge of Machine Learning for Hum...Interacting with an Inferred World: the Challenge of Machine Learning for Hum...
Interacting with an Inferred World: the Challenge of Machine Learning for Hum...Minjoon Kim
 
Machine Learning Meets Human Learning
Machine Learning Meets Human LearningMachine Learning Meets Human Learning
Machine Learning Meets Human Learningbutest
 
Agent-Based Modelling: Social Science Meets Computer Science?
Agent-Based Modelling: Social Science Meets Computer Science?Agent-Based Modelling: Social Science Meets Computer Science?
Agent-Based Modelling: Social Science Meets Computer Science?Edmund Chattoe-Brown
 

Similar to Hybrid Intelligence (20)

Introduction to Artificial Intelligence
Introduction to Artificial IntelligenceIntroduction to Artificial Intelligence
Introduction to Artificial Intelligence
 
An introduction to machine learning in biomedical research: Key concepts, pr...
An introduction to machine learning in biomedical research:  Key concepts, pr...An introduction to machine learning in biomedical research:  Key concepts, pr...
An introduction to machine learning in biomedical research: Key concepts, pr...
 
Augmented intelligence as a response to the crisis of artificial intelligence
Augmented intelligence as a response to the crisis of artificial intelligenceAugmented intelligence as a response to the crisis of artificial intelligence
Augmented intelligence as a response to the crisis of artificial intelligence
 
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
 
algorithmic-decisions, fairness, machine learning, provenance, transparency
algorithmic-decisions, fairness, machine learning, provenance, transparencyalgorithmic-decisions, fairness, machine learning, provenance, transparency
algorithmic-decisions, fairness, machine learning, provenance, transparency
 
On Machine Learning and Data Mining
On Machine Learning and Data MiningOn Machine Learning and Data Mining
On Machine Learning and Data Mining
 
Mark2Cure: a crowdsourcing platform for biomedical literature annotation
Mark2Cure: a crowdsourcing platform for biomedical literature annotationMark2Cure: a crowdsourcing platform for biomedical literature annotation
Mark2Cure: a crowdsourcing platform for biomedical literature annotation
 
Introduction to Data and Computation: Essential capabilities for everyone in ...
Introduction to Data and Computation: Essential capabilities for everyone in ...Introduction to Data and Computation: Essential capabilities for everyone in ...
Introduction to Data and Computation: Essential capabilities for everyone in ...
 
The Philosophical Aspects of Data Modelling
The Philosophical Aspects of Data ModellingThe Philosophical Aspects of Data Modelling
The Philosophical Aspects of Data Modelling
 
Kbe
KbeKbe
Kbe
 
SBQS 2013 Keynote: Cooperative Testing and Analysis
SBQS 2013 Keynote: Cooperative Testing and AnalysisSBQS 2013 Keynote: Cooperative Testing and Analysis
SBQS 2013 Keynote: Cooperative Testing and Analysis
 
Ch 1 Introduction to AI.pdf
Ch 1 Introduction to AI.pdfCh 1 Introduction to AI.pdf
Ch 1 Introduction to AI.pdf
 
n01.ppt
n01.pptn01.ppt
n01.ppt
 
History of AI, Current Trends, Prospective Trajectories
History of AI, Current Trends, Prospective TrajectoriesHistory of AI, Current Trends, Prospective Trajectories
History of AI, Current Trends, Prospective Trajectories
 
AI Presentation 1
AI Presentation 1AI Presentation 1
AI Presentation 1
 
Introaied nancy2019 luengo
Introaied nancy2019 luengoIntroaied nancy2019 luengo
Introaied nancy2019 luengo
 
Transparency in ML and AI (humble views from a concerned academic)
Transparency in ML and AI (humble views from a concerned academic)Transparency in ML and AI (humble views from a concerned academic)
Transparency in ML and AI (humble views from a concerned academic)
 
Interacting with an Inferred World: the Challenge of Machine Learning for Hum...
Interacting with an Inferred World: the Challenge of Machine Learning for Hum...Interacting with an Inferred World: the Challenge of Machine Learning for Hum...
Interacting with an Inferred World: the Challenge of Machine Learning for Hum...
 
Machine Learning Meets Human Learning
Machine Learning Meets Human LearningMachine Learning Meets Human Learning
Machine Learning Meets Human Learning
 
Agent-Based Modelling: Social Science Meets Computer Science?
Agent-Based Modelling: Social Science Meets Computer Science?Agent-Based Modelling: Social Science Meets Computer Science?
Agent-Based Modelling: Social Science Meets Computer Science?
 

Recently uploaded

Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 

Recently uploaded (20)

Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 

Hybrid Intelligence

  • 1. HYBRID INTELLIGENCE: COMBINING THE POWER OF HUMAN COMPUTATION AND MACHINE LEARNING Fabio Casati University of Trento 1
  • 2. DEFINITIONS 2 Crowdsourcing: The practice of obtaining information or input into a task or project by enlisting the services of a large number of people, either paid or unpaid, typically via the Internet. Human-based computation: a computer science technique in which a machine performs its function by outsourcing certain steps to humans, usually as microwork
  • 3. MARIE-JEAN-ANTOINE-NICOLAS DE CARITAT, MARQUIS DE CONDORCET (1743-1794) - French philosopher of the Enlightenment and advocate of public education and women rights (among lots of other things) - Éléments du calcul des probabilités, et son application aux jeux de hasard, à la loterie et aux jugements des hommes. - “Jury theorem” !3
  • 4. SIR FRANCIS GALTON (1822-1911) Expert in pretty much everything - Statistician, sociologist, psychologist, anthropologist, eugenicist, tropical explorer, geographer, meteorologist, psychometrician, and cake-cutter - Created the statistical concept of correlation. - introduced the use of questionnaires and surveys for collecting data on human communities - As the initiator of scientific meteorology, devised the first weather map - (Was the first to apply statistical methods to the study of human differences and inheritance of intelligence) !4
  • 5. 5
  • 6. THE WISDOM OF CROWDS 6 Weight: 1197 pounds Median: (used by Galton): 1208 pounds Mean: 1197 pounds. Ken Wallis. Revisiting Francis Galton’s Forecasting Competition. Statistical Science, 2014. (787 paying participants)
  • 9. Little et al, Hcomp20109
  • 10. HOW WOULDYOU SOLVETHIS 10 Greg Little, Lydia B. Chilton, Robert C. Miller, and Max Goldman TurKit: Tools for Iterative Tasks on Mechanical Turk HComp 2009
  • 11. 11 Greg Little, Lydia B. Chilton, Robert C. Miller, and Max Goldman TurKit: Tools for Iterative Tasks on Mechanical Turk HComp 2009
  • 12. VIZWIZ: NEARLY REAL-TIME ANSWERSTOVISUAL QUESTIONS 12
  • 14. REST API to People Create task Run batch* Monitor Results Pay Platform for human computations. But: how to program it? How to limit recourse to (expensive) humans? how do we make their work more efficient?
  • 15. SYSTEMATIC LITERATURE REVIEW Scientific and evidence-based approach to literature reviews
  • 16. Systematic Literature reviews (SLR) Process Prevalence of antepartum hemorrhage in women with placenta previa: a systematic review and meta-analysis. Dazhi Fan, Song Wu, Li Liu, Qing Xia,Wen Wang, Xiaoling Guo & Zhengping Liu. Scientific Reports volume 7,Article number: 40320 (2017) 1. Study on adults 75 and older 2. Involves the use of interaction technology 3. Is an “intervention” (alternatively: RCT) 16
  • 17. USEFUL BUT PAINFUL… - Millions of papers published every year - About half of them is never cited (not even by the authors) - Incomplete (40-70% of missing papers!) - From idea to submission: typically 9 to 36 months - Query repeated multiple times (6-30 months apart), sometimes 60 - ~1/3 abandoned 17 Perrine Créquit, Ludovic Trinquart, Amélie Yavchitz, and Philippe Ravaud. 2016. Wasted research when systematic reviews fail to provide a complete and up-to- date evidence synthesis: the example of lung cancer. BMC Medicine 14, 1 (2016), 8.
  • 19. 19 Crowd-based Multi-Predicate Screening of Papers in Literature Reviews. WWW2018
  • 20. !20 Trained ML models CAN WE DO BETTER? CAN MACHINE LEARNING HELP? • Help in screening (keep the same search+filter process but improve it) • Help in finding (different process), or Live SLR Crowdsourcing Model training Trained ML models
  • 21. ON RCT 21 Wallace et al Identifying reports of randomized controlled trials (RCTs) via a hybrid machine learning and crowdsourcing approach Jamia, 2017 predicted probability of being an RCT of ≤0.1 specificity of 99.8% and an overall recall of 98%
  • 22. 3 OPTIONS SO FAR • Expert analysis: the typical approach today (painful, slow, and expensive even if you don’t notice it) • Crowdsourcing: Works well: speed, diversity, quality… but at a cost • For scientists and experts. Hard to use it. • Machine Learning and Classification: Label, Train, Classify • Works great only in some cases: fairly “easy” problem, very large pool !22
  • 23. HYBRID INTELLIGENCE !23 ML Algorithms Budget Goals and ConstraintsDecisions ? Assets Problem Results “Side” goal Trained ML models Hybrid (meta-)Algorithms and processes Trained ML models *
  • 24. APPLICABILITY • Finite pool, uniqueness of the problem: Not enough items to train • Can’t get ML to the precision we need • Or, we can, but it takes time and in the meantime we initially leverage crowd heavily, then progressively less (e.g. crisis situations) !24
  • 25. ML, THEN CROWD WHEN IN DOUBT ML AlgorithmGet training data Train algorithms Apply: machine first, then (maybe) crowd William Callaghan et al. MechanicalHeart: A Human-Machine Framework for the Classification of Phonocardiograms CSCW 2018 !25 Trained ML models Trained ML models Works with weak algorithms for classification problems (as long as confidence estimate is accurate)
  • 26. 26 when the crowd is more confident than the machine in the classification of a given instance, they are most often correct. Works well only if we take machine input when it is very confident William Callaghan et al. MechanicalHeart: A Human-Machine Framework for the Classification of Phonocardiograms CSCW 2018 A “sprinkle” of ML helps
  • 27. ML AS ASSISTANT THAT BIASES OUR THINKING ML AlgorithmGet training data Train algorithms !27 Trained ML models Trained ML models Apply: machine sets a prior, crowd Krivosheev et al. Combining Crowd and Machines for Multi-predicate Item Screening CSCW 2018 P (class | votes) = P (votes | class) * p(class) / p(votes) Impact on redundancy - always ask crowd
  • 28. 28 - Works with weak algorithms for classification problems - “sprinkle of crowd” makes it right
  • 29. EMBED CROWDS INSIDE MACHINE LEARNING ARCHITECTURES - Explore feature spaces that are largely unreachable by automatic extraction, - Train models that use human-understandable features Cheng and Bernstein. Flock: Hybrid Crowd-Machine Learning Classifiers. CSCW2015 !29
  • 30. 30Cheng and Bernstein. Flock: Hybrid Crowd-Machine Learning Classifiers. CSCW2015
  • 31. 31 .1 improvement in ROC AUC hybrid here used as features, classification is automatic Cheng and Bernstein. Flock: Hybrid Crowd-Machine Learning Classifiers. CSCW2015 Outliers are important
  • 32. CROWD HELPS MACHINES HELP CROWD • Bias the crowd to obtain better and faster (cheaper) responses !32
  • 33.
  • 34. 34 Ramirez et al. Influencing workers: The case of human-machine collaboration (in progress)
  • 35. 35 Ramirez et al. Influencing workers: The case of human-machine collaboration (in progress)
  • 36. Determinants of Primary School Non- Enrollment and Absenteeism 36
  • 37. GENERAL FINITE POOL PROBLEM • No clear idea on how well ML can do • No clear idea on how well crowd can do (not to talk about task design) • Limited items and limited budget: how to spend it? • Kind of a meta-active learning problem, where in addition we have to learn how to learn !37
  • 38. SMALL STEPS: ACTIVE HYBRID LEARNING • Given a set of hotel descriptions, find hotels that are kids-friendly and that are near Macquire • We have a ML algorithm given, and a crowd or hybrid classifier • It is a learning vs exploitation trade-off. !38
  • 39. ACTIVE HYBRID LEARNING Restricted version of the general problem 1. Mange trade-off between labelling items to learn vs labeling to classify 2. Actively learn if favour ML or crowd, and then perform active sampling !39 MAB or RL problem
  • 40. 40
  • 41. 41Krivosheev et al. Active Hybrid Classification (under review)
  • 42. (SEMI-) AUTOMATED PIPELINE GENERATION DO CHECK FIX AGGREGATE SUGGEST FEATURESGROUPWORK Model training Active Learning Hybrid strategy TEMPLATES
  • 44. PROCESS Crowd Research: Open and Scalable University Laboratories,Vaish et al, UIST2017 44
  • 45. PROCESS - Open call - Training materials (on SLRs in general, and SLRs on related topics) - Screening task (acts also as selection filter) - Paper assignment - full paper screening (also act as filter) - Paper reading and “guided” paper summarization (with redundancy and metadata extraction) - Peer “grading” (positive, like-style) - Definition of dimensions for analysis (separate subgroups) - Selection of group leaders (also based on volunteering) - Brainstorming in video call with PI and group leaders, each presenting dimensions - Second iteration - Revisiting summaries of papers based on dimensions and filling of tables - Cross-check tables 45
  • 46. ASSISTED TASK DESIGN - How to define a task - How to train - How (much) to test - Pricing - Stopping - Optimizing task assignment to workers - Finding task design errors early - => Assist in design for creative work
  • 47. 47 Task design Pipeline design Conversational agent
  • 48. SUMMING UP… • Combining human and machine computation has incredible potential for solving a variety of tasks • Get results immediately, while improving ml • Crisis situations • Novel versions of old problems (from SLRs to fake news to criminal activities) • Continuously check and improve areas where ML is weak, even with human-suggested features • Nothing of this is actually restricted to “crowd” - works with experts as well • Move towards systems that do not require expertise, meaning, the average knowledge worker can use it !48