SlideShare a Scribd company logo
1 of 1
Isabelle Augenstein, Christina Lioma, Dongsheng Wang, Lucas Chaves Lima,
Casper Hansen, Christian Hansen, Jakob Grue Simonsen
University of Copenhagen
{augenstein | c.lioma | wang | lcl | c.hansen | chrh | simonsen}@di.ku.dk
MultiFC: A Real-World Multi-Domain Dataset for
Evidence-Based Fact Checking of Claims
Joint Veracity Prediction &
Evidence Ranking
Claims in MultiFCContributions
• Novel fact checking dataset
Largest such with naturally
occurring claims
34 918 claims from 26
English fact checking
portals
Rich additional meta-data
10 evidence pages per
claim
• Joint veracity prediction and
evidence ranking model
Treats claims from different
portals as different tasks /
domains
Encodes disparate labels
with label embeddings
Confusion MatrixEntities in Claims
Fact Checking Portals
Overall Results
Dataset Download
https://copenlu.github.io/publication/2019_emnlp_augenstein/
Error Analysis
• Meta-data: topic tags most important, entities least important
• Correctly predicting ‘true’ claims is much easier than ‘false’ ones
• Most confusions happen over close labels
• Longer claims are harder to classify correctly
• High token overlap of claims & evidence → high evidence ranking
• General topic tags frequently co-occur with incorrect predictions;
more specific tags often co-occur with correct predictions

More Related Content

More from Isabelle Augenstein

Learning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondLearning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyond
Isabelle Augenstein
 

More from Isabelle Augenstein (18)

What can typological knowledge bases and language representations tell us abo...
What can typological knowledge bases and language representations tell us abo...What can typological knowledge bases and language representations tell us abo...
What can typological knowledge bases and language representations tell us abo...
 
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
 
Learning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondLearning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyond
 
Learning to read for automated fact checking
Learning to read for automated fact checkingLearning to read for automated fact checking
Learning to read for automated fact checking
 
SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...
SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...
SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Sc...
 
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
 
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
 
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...
Machine Reading Using Neural Machines (talk at Microsoft Research Faculty Sum...
 
Weakly Supervised Machine Reading
Weakly Supervised Machine ReadingWeakly Supervised Machine Reading
Weakly Supervised Machine Reading
 
USFD at SemEval-2016 - Stance Detection on Twitter with Autoencoders
USFD at SemEval-2016 - Stance Detection on Twitter with AutoencodersUSFD at SemEval-2016 - Stance Detection on Twitter with Autoencoders
USFD at SemEval-2016 - Stance Detection on Twitter with Autoencoders
 
Distant Supervision with Imitation Learning
Distant Supervision with Imitation LearningDistant Supervision with Imitation Learning
Distant Supervision with Imitation Learning
 
Extracting Relations between Non-Standard Entities using Distant Supervision ...
Extracting Relations between Non-Standard Entities using Distant Supervision ...Extracting Relations between Non-Standard Entities using Distant Supervision ...
Extracting Relations between Non-Standard Entities using Distant Supervision ...
 
Information Extraction with Linked Data
Information Extraction with Linked DataInformation Extraction with Linked Data
Information Extraction with Linked Data
 
Lodifier: Generating Linked Data from Unstructured Text
Lodifier: Generating Linked Data from Unstructured TextLodifier: Generating Linked Data from Unstructured Text
Lodifier: Generating Linked Data from Unstructured Text
 
Relation Extraction from the Web using Distant Supervision
Relation Extraction from the Web using Distant SupervisionRelation Extraction from the Web using Distant Supervision
Relation Extraction from the Web using Distant Supervision
 
Natural Language Processing for the Semantic Web
Natural Language Processing for the Semantic WebNatural Language Processing for the Semantic Web
Natural Language Processing for the Semantic Web
 
Seed Selection for Distantly Supervised Web-Based Relation Extraction
Seed Selection for Distantly Supervised Web-Based Relation ExtractionSeed Selection for Distantly Supervised Web-Based Relation Extraction
Seed Selection for Distantly Supervised Web-Based Relation Extraction
 
Mapping Keywords to
Mapping Keywords to Mapping Keywords to
Mapping Keywords to
 

Recently uploaded

Recently uploaded (8)

Unlock Facebook Success with Sociocosmos
Unlock Facebook Success with SociocosmosUnlock Facebook Success with Sociocosmos
Unlock Facebook Success with Sociocosmos
 
GRACE ANDREWS the future of podcasting.pdf
GRACE ANDREWS the future of podcasting.pdfGRACE ANDREWS the future of podcasting.pdf
GRACE ANDREWS the future of podcasting.pdf
 
MalluProgrammers. Careeradvice.Webinarppt.pdf
MalluProgrammers. Careeradvice.Webinarppt.pdfMalluProgrammers. Careeradvice.Webinarppt.pdf
MalluProgrammers. Careeradvice.Webinarppt.pdf
 
Maximize Your Pinterest Reach....................
Maximize Your Pinterest Reach....................Maximize Your Pinterest Reach....................
Maximize Your Pinterest Reach....................
 
MalluProgrammers. Careeradvice.Webinarppt.pdf
MalluProgrammers. Careeradvice.Webinarppt.pdfMalluProgrammers. Careeradvice.Webinarppt.pdf
MalluProgrammers. Careeradvice.Webinarppt.pdf
 
Looking to Drive Traffic from Pinterest?
Looking to Drive Traffic from Pinterest?Looking to Drive Traffic from Pinterest?
Looking to Drive Traffic from Pinterest?
 
Want to Amplify Your Pinterest Content?...
Want to Amplify Your Pinterest Content?...Want to Amplify Your Pinterest Content?...
Want to Amplify Your Pinterest Content?...
 
The Colossal Studio Events Packages.pdf
The Colossal Studio  Events Packages.pdfThe Colossal Studio  Events Packages.pdf
The Colossal Studio Events Packages.pdf
 

MultiFC: A Real-World Multi-Domain Dataset for Evidence-Based Fact Checking of Claims

  • 1. Isabelle Augenstein, Christina Lioma, Dongsheng Wang, Lucas Chaves Lima, Casper Hansen, Christian Hansen, Jakob Grue Simonsen University of Copenhagen {augenstein | c.lioma | wang | lcl | c.hansen | chrh | simonsen}@di.ku.dk MultiFC: A Real-World Multi-Domain Dataset for Evidence-Based Fact Checking of Claims Joint Veracity Prediction & Evidence Ranking Claims in MultiFCContributions • Novel fact checking dataset Largest such with naturally occurring claims 34 918 claims from 26 English fact checking portals Rich additional meta-data 10 evidence pages per claim • Joint veracity prediction and evidence ranking model Treats claims from different portals as different tasks / domains Encodes disparate labels with label embeddings Confusion MatrixEntities in Claims Fact Checking Portals Overall Results Dataset Download https://copenlu.github.io/publication/2019_emnlp_augenstein/ Error Analysis • Meta-data: topic tags most important, entities least important • Correctly predicting ‘true’ claims is much easier than ‘false’ ones • Most confusions happen over close labels • Longer claims are harder to classify correctly • High token overlap of claims & evidence → high evidence ranking • General topic tags frequently co-occur with incorrect predictions; more specific tags often co-occur with correct predictions