SlideShare a Scribd company logo
1 of 15
Download to read offline
SemEval Task 10: ScienceIE –
Extracting Keyphrases and Relations
from Scientific Publications
Isabelle Augenstein*#, Mrinal Das$, Sebastian Riedel*,
Lakshmi Vikraman$, Andrew McCallum$
*University College London, #University of Copenhagen,
$University of Massachusetts Amherst
4 August 2017Supported by:
Motivation
Previous Tasks
SemEval 2010 Task 5 (Kim, Medelyan, Kan, Baldwin):
Automatic Keyphrase Extraction from Scientific Articles
Extract list of words/phrases
representing key topics from
scientific documents
-  context-independent
-  ranking evaluation
-  no relations
Main Title
Abstract
…........................................
…........................................
…........................................
…........................................
…........................................
…........................................
….................. ......................
….................. ......................
….................. ......................
….................. ......................
….................. ......................
….................. ......................
….................. ......................
….................. ......................
Keyphrases
1) ...........
2) ….......
3) ….......
4) ...... ....
5) .…......
Extracting Keyphrases and Relations from
Scientific Publications
Isabelle Augenstein, Mrinal Das, Sebastian Riedel, Lakshmi Vikraman, Andrew McCallum
Subtasks:
A) Mention-level keyphrase identification

B) Mention-level keyphrase classification:
•  PROCESS (e.g. methods, equipment)
•  TASK
•  MATERIAL (e.g. corpora, physical materials)
C) Mention-level semantic relation extraction:
•  HYPONYM-OF
•  SYNONYM-OF
… addresses the task of named entity recognition (NER), a subtask of information
extraction, using conditional random fields (CRF). Our method is evaluated on the
ConLL-2003 NER corpus.
Which papers present which
processes/tasks/materials?
How do they relate to one another?
Supported by:
Annotation & Dataset
-  brat (Stenetorp, Pyysalo, Topić,
Ohta, Ananiadou, Tsujii, 2012)
-  *.ann stand-off format
-  Hosted on AWS S3
-  Annotators work remotely
-  500 paragraphs from CS, Phys, MS: 350 train, 50 dev, 100 test
-  Sampling semi-automatically from keyphrase / relation-rich paragraphs
-  Full article text given to participants as well for context
Annotation & Dataset
-  13 paid student annotators, 8 completed annotation exercise
-  Double-annotated by expert annotator given student annotations
-  Up to 38 instances per annotator
Student Annotator IAA (Cohen’s kappa)
1 0.85
2 0.66
3 0.63
4 0.60
Dataset Statistics
Characteristic
Labels Material, Process, Task
Topics
Computer Science, Physics, Material
Science
Number all keyphrases
5730
Number unique keyphrases 1697
% singleton keyphrases 31%
% single-word mentions 18%
% mentions, word length >= 3 51%
% mentions, word length >= 5 22%
% mentions, noun phrases 93%
Most common keyphrases
‘Isogeometric analysis’, ‘samples’,
‘calibration process’, ‘Zirconium alloys’
Subtasks and Evaluation Scenarios
Subtasks
a)  Mention-level keyphrase identification
b)  Mention-level keyphrase classification (PROCESS, TASK,
MATERIAL)
c)  Mention-level semantic relation extraction between keyphrases with
the same keyphrase types (HYPONYM-OF, SYNONYM-OF)
Evaluation Scenarios
1)  Only plain text is given (Subtasks A, B, C)
2)  Plain text with manually annotated keyphrase boundaries are given
(Subtasks B, C)
3)  Plain text with manually annotated keyphrases and their types are
given (Subtask C)
Overall Participation
-  54 systems submitted in development phase
-  26 systems out of those participated in test phase
-  Wide variety of approaches
-  Neural networks
-  CRFs
-  Supervised approaches with careful feature engineering
-  Rule-based systems
-  Ensembles
Results Scenario 1
Teams Overall F1 A B C
s2 end2end
(Ammar et al., 2017)
0.43 0.55 0.44 0.28
TIAL UW 0.42 0.56 0.44
TTI COIN
(Tsujimura et al., 2017)
0.38 0.5 0.39 0.21
upper bound 0.84 0.85 0.85 0.77
random 0.00 0.03 0.01 0.00
17 participating systems
Results Scenario 2
Teams Overall F1 B C
MayoNLP
(Liu et al., 2017)
0.64 0.67 0.23
UKP/EELECTION
(Eger et al., 2017)
0.63 0.66
LABDA
(Segura-Bedmar et al.,
2017)
0.48 0.51
upper bound 0.84 0.85 0.77
random 0.15 0.23 0.01
4 participating systems
Results Scenario 3
Teams Overall F1 / C
MIT
(Lee et al., 2017a)
0.64
s2_rel
(Ammar et al., 2017)
0.54
NTNU-2
(Barik and Marsi, 2017)
0.5
upper bound 0.84
random 0.04
5 participating systems
Summary
-  Most successful systems use RNNs (+ CRFs)
-  However, best system for Scenario 1: SVM + well-engineered features
-  Identifying keyphrases is most challenging subtask
-  Dataset contains many long and infrequent keyphrases
-  Systems relying memorising lists of keyphrases do not perform well
-  Finding high-quality annotators for this task is hard – many student
annotators dropped out
-  Better recruitment, pilot annotation, pick only top annotators
-  Combining subtasks to evaluation scenarios caused confusion
-  Many teams’ systems did not tackle relation extraction subtask – even
though it hurt their overall F1
Relevant Papers at ACL
Isabelle Augenstein, Mrinal Das, Sebastian Riedel, Lakshmi Vikraman
and Andrew McCallum. SemEval 2017 Task 10: ScienceIE - Extracting
Keyphrases and Relations from Scientific Publications. SemEval 2017.
https://arxiv.org/abs/1704.02853
Isabelle Augenstein, Anders Søgaard. Multi-Task Learning of
Keyphrase Boundary Classification. ACL 2017 (short).
https://arxiv.org/abs/1704.00514
Ed Collins, Isabelle Augenstein, Sebastian Riedel. A Supervised
Approach to Extractive Summarisation of Scientific Papers. CoNLL 2017.
https://arxiv.org/abs/1706.03946
Thank you!
isabelleaugenstein.github.io
augenstein@di.ku.dk
@iaugenstein
github.com/isabelleaugenstein

More Related Content

What's hot

Lec1-Into
Lec1-IntoLec1-Into
Lec1-Into
butest
 
Mining Product Reputations On the Web
Mining Product Reputations On the WebMining Product Reputations On the Web
Mining Product Reputations On the Web
feiwin
 
EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...
EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...
EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...
Francesco Osborne
 
An Investigation of Keywords Extraction from Textual Documents using Word2Ve...
 An Investigation of Keywords Extraction from Textual Documents using Word2Ve... An Investigation of Keywords Extraction from Textual Documents using Word2Ve...
An Investigation of Keywords Extraction from Textual Documents using Word2Ve...
IJCSIS Research Publications
 

What's hot (20)

IRJET-Classifying Mined Online Discussion Data for Reflective Thinking based ...
IRJET-Classifying Mined Online Discussion Data for Reflective Thinking based ...IRJET-Classifying Mined Online Discussion Data for Reflective Thinking based ...
IRJET-Classifying Mined Online Discussion Data for Reflective Thinking based ...
 
IRJET- Finding the Original Writer of an Anonymous Text using Naïve Bayes Cla...
IRJET- Finding the Original Writer of an Anonymous Text using Naïve Bayes Cla...IRJET- Finding the Original Writer of an Anonymous Text using Naïve Bayes Cla...
IRJET- Finding the Original Writer of an Anonymous Text using Naïve Bayes Cla...
 
Lec1-Into
Lec1-IntoLec1-Into
Lec1-Into
 
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...Evaluating Machine Learning Algorithms for Materials Science using the Matben...
Evaluating Machine Learning Algorithms for Materials Science using the Matben...
 
Transfer learning-presentation
Transfer learning-presentationTransfer learning-presentation
Transfer learning-presentation
 
2-IJCSE-00536
2-IJCSE-005362-IJCSE-00536
2-IJCSE-00536
 
Mining Product Reputations On the Web
Mining Product Reputations On the WebMining Product Reputations On the Web
Mining Product Reputations On the Web
 
EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...
EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...
EKAW 2016 - Ontology Forecasting in Scientific Literature: Semantic Concepts ...
 
Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...Extracting and Making Use of Materials Data from Millions of Journal Articles...
Extracting and Making Use of Materials Data from Millions of Journal Articles...
 
Question Answering System using machine learning approach
Question Answering System using machine learning approachQuestion Answering System using machine learning approach
Question Answering System using machine learning approach
 
Applications of Natural Language Processing to Materials Design
Applications of Natural Language Processing to Materials DesignApplications of Natural Language Processing to Materials Design
Applications of Natural Language Processing to Materials Design
 
Document
DocumentDocument
Document
 
An Investigation of Keywords Extraction from Textual Documents using Word2Ve...
 An Investigation of Keywords Extraction from Textual Documents using Word2Ve... An Investigation of Keywords Extraction from Textual Documents using Word2Ve...
An Investigation of Keywords Extraction from Textual Documents using Word2Ve...
 
Genomics data analysis in Julia
Genomics data analysis in JuliaGenomics data analysis in Julia
Genomics data analysis in Julia
 
The Status of ML Algorithms for Structure-property Relationships Using Matb...
The Status of ML Algorithms for Structure-property Relationships Using Matb...The Status of ML Algorithms for Structure-property Relationships Using Matb...
The Status of ML Algorithms for Structure-property Relationships Using Matb...
 
Natural Language Processing for Materials Design - What Can We Extract From t...
Natural Language Processing for Materials Design - What Can We Extract From t...Natural Language Processing for Materials Design - What Can We Extract From t...
Natural Language Processing for Materials Design - What Can We Extract From t...
 
Zero-shot Image Recognition Using Relational Matching, Adaptation and Calibra...
Zero-shot Image Recognition Using Relational Matching, Adaptation and Calibra...Zero-shot Image Recognition Using Relational Matching, Adaptation and Calibra...
Zero-shot Image Recognition Using Relational Matching, Adaptation and Calibra...
 
PhD Defense Slides
PhD Defense SlidesPhD Defense Slides
PhD Defense Slides
 
Probabilistic Information Retrieval
Probabilistic Information RetrievalProbabilistic Information Retrieval
Probabilistic Information Retrieval
 
Sentiment Analysis and Classification of Tweets using Data Mining
Sentiment Analysis and Classification of Tweets using Data MiningSentiment Analysis and Classification of Tweets using Data Mining
Sentiment Analysis and Classification of Tweets using Data Mining
 

Similar to SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Scientific Publications

Topic_6
Topic_6Topic_6
Topic_6
butest
 
kantorNSF-NIJ-ISI-03-06-04.ppt
kantorNSF-NIJ-ISI-03-06-04.pptkantorNSF-NIJ-ISI-03-06-04.ppt
kantorNSF-NIJ-ISI-03-06-04.ppt
butest
 
Learning to Search Henry Kautz
Learning to Search Henry KautzLearning to Search Henry Kautz
Learning to Search Henry Kautz
butest
 
Learning to Search Henry Kautz
Learning to Search Henry KautzLearning to Search Henry Kautz
Learning to Search Henry Kautz
butest
 
32_Nov07_MachineLear..
32_Nov07_MachineLear..32_Nov07_MachineLear..
32_Nov07_MachineLear..
butest
 

Similar to SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Scientific Publications (20)

Understand students’ self-reflections through learning analytics
Understand students’ self-reflections through learning analyticsUnderstand students’ self-reflections through learning analytics
Understand students’ self-reflections through learning analytics
 
Kdd by Mr.Sameer Kumar Das
Kdd by Mr.Sameer Kumar DasKdd by Mr.Sameer Kumar Das
Kdd by Mr.Sameer Kumar Das
 
Automated Question Paper Generator And Answer Checker Using Information Retri...
Automated Question Paper Generator And Answer Checker Using Information Retri...Automated Question Paper Generator And Answer Checker Using Information Retri...
Automated Question Paper Generator And Answer Checker Using Information Retri...
 
Question Classification using Semantic, Syntactic and Lexical features
Question Classification using Semantic, Syntactic and Lexical featuresQuestion Classification using Semantic, Syntactic and Lexical features
Question Classification using Semantic, Syntactic and Lexical features
 
Question Classification using Semantic, Syntactic and Lexical features
Question Classification using Semantic, Syntactic and Lexical featuresQuestion Classification using Semantic, Syntactic and Lexical features
Question Classification using Semantic, Syntactic and Lexical features
 
Topic_6
Topic_6Topic_6
Topic_6
 
kantorNSF-NIJ-ISI-03-06-04.ppt
kantorNSF-NIJ-ISI-03-06-04.pptkantorNSF-NIJ-ISI-03-06-04.ppt
kantorNSF-NIJ-ISI-03-06-04.ppt
 
Comparison Intelligent Electronic Assessment with Traditional Assessment for ...
Comparison Intelligent Electronic Assessment with Traditional Assessment for ...Comparison Intelligent Electronic Assessment with Traditional Assessment for ...
Comparison Intelligent Electronic Assessment with Traditional Assessment for ...
 
COMPARISON INTELLIGENT ELECTRONIC ASSESSMENT WITH TRADITIONAL ASSESSMENT FOR ...
COMPARISON INTELLIGENT ELECTRONIC ASSESSMENT WITH TRADITIONAL ASSESSMENT FOR ...COMPARISON INTELLIGENT ELECTRONIC ASSESSMENT WITH TRADITIONAL ASSESSMENT FOR ...
COMPARISON INTELLIGENT ELECTRONIC ASSESSMENT WITH TRADITIONAL ASSESSMENT FOR ...
 
COMPARISON INTELLIGENT ELECTRONIC ASSESSMENT WITH TRADITIONAL ASSESSMENT FOR ...
COMPARISON INTELLIGENT ELECTRONIC ASSESSMENT WITH TRADITIONAL ASSESSMENT FOR ...COMPARISON INTELLIGENT ELECTRONIC ASSESSMENT WITH TRADITIONAL ASSESSMENT FOR ...
COMPARISON INTELLIGENT ELECTRONIC ASSESSMENT WITH TRADITIONAL ASSESSMENT FOR ...
 
COMPARISON INTELLIGENT ELECTRONIC ASSESSMENT WITH TRADITIONAL ASSESSMENT FOR ...
COMPARISON INTELLIGENT ELECTRONIC ASSESSMENT WITH TRADITIONAL ASSESSMENT FOR ...COMPARISON INTELLIGENT ELECTRONIC ASSESSMENT WITH TRADITIONAL ASSESSMENT FOR ...
COMPARISON INTELLIGENT ELECTRONIC ASSESSMENT WITH TRADITIONAL ASSESSMENT FOR ...
 
Automatic Essay Scoring A Review On The Feature Analysis Techniques
Automatic Essay Scoring  A Review On The Feature Analysis TechniquesAutomatic Essay Scoring  A Review On The Feature Analysis Techniques
Automatic Essay Scoring A Review On The Feature Analysis Techniques
 
INTELLIGENT ELECTRONIC ASSESSMENT FOR SUBJECTIVE EXAMS
INTELLIGENT ELECTRONIC ASSESSMENT FOR SUBJECTIVE EXAMS INTELLIGENT ELECTRONIC ASSESSMENT FOR SUBJECTIVE EXAMS
INTELLIGENT ELECTRONIC ASSESSMENT FOR SUBJECTIVE EXAMS
 
Learning to Search Henry Kautz
Learning to Search Henry KautzLearning to Search Henry Kautz
Learning to Search Henry Kautz
 
Learning to Search Henry Kautz
Learning to Search Henry KautzLearning to Search Henry Kautz
Learning to Search Henry Kautz
 
Using a keyword extraction pipeline to understand concepts in future work sec...
Using a keyword extraction pipeline to understand concepts in future work sec...Using a keyword extraction pipeline to understand concepts in future work sec...
Using a keyword extraction pipeline to understand concepts in future work sec...
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
 
IRJET - A Survey on Machine Learning Algorithms, Techniques and Applications
IRJET - A Survey on Machine Learning Algorithms, Techniques and ApplicationsIRJET - A Survey on Machine Learning Algorithms, Techniques and Applications
IRJET - A Survey on Machine Learning Algorithms, Techniques and Applications
 
32_Nov07_MachineLear..
32_Nov07_MachineLear..32_Nov07_MachineLear..
32_Nov07_MachineLear..
 
Evaluation of a hybrid method for constructing multiple SVM kernels
Evaluation of a hybrid method for constructing multiple SVM kernelsEvaluation of a hybrid method for constructing multiple SVM kernels
Evaluation of a hybrid method for constructing multiple SVM kernels
 

More from Isabelle Augenstein

Learning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondLearning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyond
Isabelle Augenstein
 

More from Isabelle Augenstein (20)

Beyond Fact Checking — Modelling Information Change in Scientific Communication
Beyond Fact Checking — Modelling Information Change in Scientific CommunicationBeyond Fact Checking — Modelling Information Change in Scientific Communication
Beyond Fact Checking — Modelling Information Change in Scientific Communication
 
Automatically Detecting Scientific Misinformation
Automatically Detecting Scientific MisinformationAutomatically Detecting Scientific Misinformation
Automatically Detecting Scientific Misinformation
 
Accountable and Robust Automatic Fact Checking
Accountable and Robust Automatic Fact CheckingAccountable and Robust Automatic Fact Checking
Accountable and Robust Automatic Fact Checking
 
Towards Explainable Fact Checking (DIKU Business Club presentation)
Towards Explainable Fact Checking (DIKU Business Club presentation)Towards Explainable Fact Checking (DIKU Business Club presentation)
Towards Explainable Fact Checking (DIKU Business Club presentation)
 
Explainability for NLP
Explainability for NLPExplainability for NLP
Explainability for NLP
 
Towards Explainable Fact Checking
Towards Explainable Fact CheckingTowards Explainable Fact Checking
Towards Explainable Fact Checking
 
Tracking False Information Online
Tracking False Information OnlineTracking False Information Online
Tracking False Information Online
 
What can typological knowledge bases and language representations tell us abo...
What can typological knowledge bases and language representations tell us abo...What can typological knowledge bases and language representations tell us abo...
What can typological knowledge bases and language representations tell us abo...
 
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
 
Learning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondLearning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyond
 
Learning to read for automated fact checking
Learning to read for automated fact checkingLearning to read for automated fact checking
Learning to read for automated fact checking
 
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
 
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
1st Workshop for Women and Underrepresented Minorities (WiNLP) at ACL 2017 - ...
 
Weakly Supervised Machine Reading
Weakly Supervised Machine ReadingWeakly Supervised Machine Reading
Weakly Supervised Machine Reading
 
USFD at SemEval-2016 - Stance Detection on Twitter with Autoencoders
USFD at SemEval-2016 - Stance Detection on Twitter with AutoencodersUSFD at SemEval-2016 - Stance Detection on Twitter with Autoencoders
USFD at SemEval-2016 - Stance Detection on Twitter with Autoencoders
 
Distant Supervision with Imitation Learning
Distant Supervision with Imitation LearningDistant Supervision with Imitation Learning
Distant Supervision with Imitation Learning
 
Extracting Relations between Non-Standard Entities using Distant Supervision ...
Extracting Relations between Non-Standard Entities using Distant Supervision ...Extracting Relations between Non-Standard Entities using Distant Supervision ...
Extracting Relations between Non-Standard Entities using Distant Supervision ...
 
Information Extraction with Linked Data
Information Extraction with Linked DataInformation Extraction with Linked Data
Information Extraction with Linked Data
 
Lodifier: Generating Linked Data from Unstructured Text
Lodifier: Generating Linked Data from Unstructured TextLodifier: Generating Linked Data from Unstructured Text
Lodifier: Generating Linked Data from Unstructured Text
 
Relation Extraction from the Web using Distant Supervision
Relation Extraction from the Web using Distant SupervisionRelation Extraction from the Web using Distant Supervision
Relation Extraction from the Web using Distant Supervision
 

Recently uploaded

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
MateoGardella
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
MateoGardella
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 

Recently uploaded (20)

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 

SemEval 2017 Task 10: ScienceIE – Extracting Keyphrases and Relations from Scientific Publications

  • 1. SemEval Task 10: ScienceIE – Extracting Keyphrases and Relations from Scientific Publications Isabelle Augenstein*#, Mrinal Das$, Sebastian Riedel*, Lakshmi Vikraman$, Andrew McCallum$ *University College London, #University of Copenhagen, $University of Massachusetts Amherst 4 August 2017Supported by:
  • 3. Previous Tasks SemEval 2010 Task 5 (Kim, Medelyan, Kan, Baldwin): Automatic Keyphrase Extraction from Scientific Articles Extract list of words/phrases representing key topics from scientific documents -  context-independent -  ranking evaluation -  no relations Main Title Abstract …........................................ …........................................ …........................................ …........................................ …........................................ …........................................ ….................. ...................... ….................. ...................... ….................. ...................... ….................. ...................... ….................. ...................... ….................. ...................... ….................. ...................... ….................. ...................... Keyphrases 1) ........... 2) …....... 3) …....... 4) ...... .... 5) .…......
  • 4. Extracting Keyphrases and Relations from Scientific Publications Isabelle Augenstein, Mrinal Das, Sebastian Riedel, Lakshmi Vikraman, Andrew McCallum Subtasks: A) Mention-level keyphrase identification
 B) Mention-level keyphrase classification: •  PROCESS (e.g. methods, equipment) •  TASK •  MATERIAL (e.g. corpora, physical materials) C) Mention-level semantic relation extraction: •  HYPONYM-OF •  SYNONYM-OF … addresses the task of named entity recognition (NER), a subtask of information extraction, using conditional random fields (CRF). Our method is evaluated on the ConLL-2003 NER corpus. Which papers present which processes/tasks/materials? How do they relate to one another? Supported by:
  • 5. Annotation & Dataset -  brat (Stenetorp, Pyysalo, Topić, Ohta, Ananiadou, Tsujii, 2012) -  *.ann stand-off format -  Hosted on AWS S3 -  Annotators work remotely -  500 paragraphs from CS, Phys, MS: 350 train, 50 dev, 100 test -  Sampling semi-automatically from keyphrase / relation-rich paragraphs -  Full article text given to participants as well for context
  • 6. Annotation & Dataset -  13 paid student annotators, 8 completed annotation exercise -  Double-annotated by expert annotator given student annotations -  Up to 38 instances per annotator Student Annotator IAA (Cohen’s kappa) 1 0.85 2 0.66 3 0.63 4 0.60
  • 7. Dataset Statistics Characteristic Labels Material, Process, Task Topics Computer Science, Physics, Material Science Number all keyphrases 5730 Number unique keyphrases 1697 % singleton keyphrases 31% % single-word mentions 18% % mentions, word length >= 3 51% % mentions, word length >= 5 22% % mentions, noun phrases 93% Most common keyphrases ‘Isogeometric analysis’, ‘samples’, ‘calibration process’, ‘Zirconium alloys’
  • 8. Subtasks and Evaluation Scenarios Subtasks a)  Mention-level keyphrase identification b)  Mention-level keyphrase classification (PROCESS, TASK, MATERIAL) c)  Mention-level semantic relation extraction between keyphrases with the same keyphrase types (HYPONYM-OF, SYNONYM-OF) Evaluation Scenarios 1)  Only plain text is given (Subtasks A, B, C) 2)  Plain text with manually annotated keyphrase boundaries are given (Subtasks B, C) 3)  Plain text with manually annotated keyphrases and their types are given (Subtask C)
  • 9. Overall Participation -  54 systems submitted in development phase -  26 systems out of those participated in test phase -  Wide variety of approaches -  Neural networks -  CRFs -  Supervised approaches with careful feature engineering -  Rule-based systems -  Ensembles
  • 10. Results Scenario 1 Teams Overall F1 A B C s2 end2end (Ammar et al., 2017) 0.43 0.55 0.44 0.28 TIAL UW 0.42 0.56 0.44 TTI COIN (Tsujimura et al., 2017) 0.38 0.5 0.39 0.21 upper bound 0.84 0.85 0.85 0.77 random 0.00 0.03 0.01 0.00 17 participating systems
  • 11. Results Scenario 2 Teams Overall F1 B C MayoNLP (Liu et al., 2017) 0.64 0.67 0.23 UKP/EELECTION (Eger et al., 2017) 0.63 0.66 LABDA (Segura-Bedmar et al., 2017) 0.48 0.51 upper bound 0.84 0.85 0.77 random 0.15 0.23 0.01 4 participating systems
  • 12. Results Scenario 3 Teams Overall F1 / C MIT (Lee et al., 2017a) 0.64 s2_rel (Ammar et al., 2017) 0.54 NTNU-2 (Barik and Marsi, 2017) 0.5 upper bound 0.84 random 0.04 5 participating systems
  • 13. Summary -  Most successful systems use RNNs (+ CRFs) -  However, best system for Scenario 1: SVM + well-engineered features -  Identifying keyphrases is most challenging subtask -  Dataset contains many long and infrequent keyphrases -  Systems relying memorising lists of keyphrases do not perform well -  Finding high-quality annotators for this task is hard – many student annotators dropped out -  Better recruitment, pilot annotation, pick only top annotators -  Combining subtasks to evaluation scenarios caused confusion -  Many teams’ systems did not tackle relation extraction subtask – even though it hurt their overall F1
  • 14. Relevant Papers at ACL Isabelle Augenstein, Mrinal Das, Sebastian Riedel, Lakshmi Vikraman and Andrew McCallum. SemEval 2017 Task 10: ScienceIE - Extracting Keyphrases and Relations from Scientific Publications. SemEval 2017. https://arxiv.org/abs/1704.02853 Isabelle Augenstein, Anders Søgaard. Multi-Task Learning of Keyphrase Boundary Classification. ACL 2017 (short). https://arxiv.org/abs/1704.00514 Ed Collins, Isabelle Augenstein, Sebastian Riedel. A Supervised Approach to Extractive Summarisation of Scientific Papers. CoNLL 2017. https://arxiv.org/abs/1706.03946