SlideShare a Scribd company logo
KOI at SemEval-2018 Task 5:
K O
I
Paramita Mirza Fariz Darari Rahmad Mahendra
paramita@mpi-inf.mpg.de. fariz@cs.ui.ac.id rahmad.mahendra@cs.ui.ac.id
1
SemEval-2018
International Workshop on Semantic Evaluation
New Orleans, LA, USA, June 5-6 2018
(1) “How many killing incidents happened in June 2016 in San Antonio, Texas?”
(2) “How many people were killed in June 2016 in San Antonio, Texas?”
2
Numerical answer:
(1) 2
(2) 2
Supporting documents:
Knowledge
Graph of
Incidents
News articles
KOI
System
Architecture
Evaluation
on
SemEval Task 5
3
(1) “How many killing incidents happened in June 2016 in San Antonio, Texas?”
(2) “How many people were killed in June 2016 in San Antonio, Texas?”
4
Document
Preprocessing
Event Extraction
and Coreference
Resolution
KG Construction
& Population
Knowledge
Graph of
Incidents
Question
Parsing
Query
Execution
News articles
Numerical answer:
(1) 2
(2) 2
Supporting documents:
Powered by:
5
Document
Preprocessing
Event Extraction
and Coreference
Resolution
KG Construction
& Population
News articles
Word sense disambiguation & entity linking
(Navigli & Ponzetto, 2012)
One man, 41-years-old John Doe, was fatally shot early Sunday morning in San Antonio, police said.
Powered by:
Word sense disambiguation & entity linking
(Navigli & Ponzetto, 2012)
6
Document
Preprocessing
Event Extraction
and Coreference
Resolution
KG Construction
& Population
News articles
Named entity recognition
(spacy.io)
HeidelTime
Time expression recognition
& normalization
(Strötgen & Gertz, 2013)
One man, 41-years-old John Doe, was fatally shot early Sunday morning in San Antonio, police said.
June 5, 2018
PERSON 03-06-2018
Semantic role labelling
(Collobert et al., 2011)
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
said A0: police
A1: One man , 41-years-old John Doe,
was fatally shot early Sunday Morning
in San Antonio
Document
Preprocessing
Event Extraction
and Coreference
Resolution
KG Construction
& Population
News articles
7
One man, 41-years-old John Doe, was fatally shot early Sunday morning in San Antonio, police said.One man, 41-years-old John Doe, was fatally shot early Sunday morning in San Antonio, police said.
June 5, 2018
PERSON 03-06-2018
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
said A0: police
A1: One man , 41-years-old John Doe,
was fatally shot early Sunday Morning
in San Antonio
said A0: police
A1: One man , 41-years-old John Doe,
was fatally shot early Sunday Morning
in San Antonio
Predicate-level event
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
• Identifying incident-related concepts  via path-
based WordNet similarity (Hirst et al., 1998) > 5.0
• Identifying event participants and their roles (e.g.,
victim)
• Identifying number of victims (except for suspect-
related predicates)
• Identifying and normalizing event time
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
Predicate-level event, type: killing
Document
Preprocessing
Event Extraction
and Coreference
Resolution
KG Construction
& Population
News articles
8
One man, 41-years-old John Doe, was fatally shot early Sunday morning in San Antonio, police said.One man, 41-years-old John Doe, was fatally shot early Sunday morning in San Antonio, police said.
June 5, 2018
PERSON 03-06-2018
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
said A0: police
A1: One man , 41-years-old John Doe,
was fatally shot early Sunday Morning
in San Antonio
said A0: police
A1: One man , 41-years-old John Doe,
was fatally shot early Sunday Morning
in San Antonio
Sentence-level event
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
killing
injuring
injuring
fire
Document-level event (incident)
• One incident per incident type per
document
• Determine incident location
• Aggregate event participants per
incident type
• Aggregate number of victims per
incident type
• Determine incident time per
incident type
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
shot A1: One man , 41-years-old John Doe
AM-MNR: fatally
AM-TMP: early Sunday Morning
AM-LOC: in San Antonio
killing
injuring
fire
Predicate-level event, type: killing
Document
Preprocessing
Event Extraction
and Coreference
Resolution
KG Construction
& Population
News articles
9
injuring
fire
Cross-document event coreference resolution
via document clustering
killing
injuring
fire
dct1: June 4, 2018 dct2: June 5, 2018
≈
|dct1-dct2|≤ 3 days
Cosine similarity of TF-IDF-based vectors of:
• BabelNet senses
• spaCy’s PERSON and GPE
KOI-KG  https://koi.cs.ui.ac.id/incidents
 Powered by
10
Document
Preprocessing
Event Extraction
and Coreference
Resolution
KG Construction
& Population
News articles
injuring
fire
killing
injuring
fire
dct1: June 4, 2018 dct2: June 5, 2018
killing injuring fire
supported by
John Doe
1 victim
03-06-2018 San Antonio
≈
|dct1-dct2|≤ 3 days
(1) “How many killing incidents happened in June 2016 in San Antonio, Texas?”
(2) “How many people were killed in June 2016 in San Antonio, Texas?”
11
Document
Preprocessing
Event Extraction
and Coreference
Resolution
KG Construction
& Population
Knowledge
Graph of
Incidents
Query
Execution
News articles
Numerical answer:
(1) 2
(2) 2
Supporting documents:
SPARQL query
Question
Parsing
SELECT ?event ?document
WHERE {
?event koi:eventType koi:killing .
?event koi:eventDate [
koi:month "06" ;
koi:year "2016" ] .
?event koi:location [
koi:city
<http://dbpedia.org/resource/San_Antonio> ;
koi:state
<http://dbpedia.org/resource/Texas> ] .
?document koi:event ?event .
}
KOI
System
Architecture
Evaluation
on
SemEval Task 5
12
System
Subtask S1 Subtask S2 Subtask S3
%Ans Doc-F1 %Ans Doc-F1 %Ans Doc-F1
Baseline 16.5 67.3 100.0 26.4 - -
KOI 44.2 83.0 67.5 55.2 66.6 69.6
NAI-SEA 100.0 78.3 100.0 50.5 100.0 63.6
FEUP 100.0 24.7 100.0 30.5 100.0 26.8
NewsReader 51.6 46.2 100.0 36.9 100.0 26.8
13
KOI v2 44.2 83.0 100.0 51.2 100.0 49.1
KOI v3 55.1 85.7 100.0 54.8 100.0 50.9
KOI v2: answer with 0 and empty set of answer documents, if cannot find an answer
KOI v3: bug (fire burning & job firing incident types) is fixed, improved incident time identification
KOI v3
Micro-averaged
P R F1
Answered questions 86.6 74.0 79.8
All questions 86.6 41.6 56.2
14
Killing 88.5 43.2 58.1
Injuring 82.8 37.4 51.5
Job firing 100.0 8.7 16.0
Fire burning 96.9 66.2 78.7
Participant 84.8 43.0 57.0
Location 89.1 39.4 54.6
Time 86.0 42.4 56.8
System
Subtask S2 Subtask S3
%Ans Accuracy RMSE %Ans Accuracy RMSE
Baseline 18.3 8.5 - -
KOI 67.5 20.4 6.2 66.6 19.3 7.9
FEUP 100.0 26.4 6.1 100.0 30.4 478.7
NewsReader 100.0 21.9 44.0 100.0 21.1 296.5
NAI-SEA 100.0 17.4 20.6 100.0 20.2 13.5
15
KOI v2 100.0 25.6 5.2 100.0 24.8 7.1
KOI v3 100.0 27.4 5.3 100.0 23.0 7.7
KOI v2: answer with 0 and empty set of answer documents, if cannot find an answer
KOI v3: bug (fire burning & job firing incident types) is fixed, improved incident time identification
16
(1) “How many killing incidents happened in June 2016 in San Antonio, Texas?” 2
(2) “How many people were killed in June 2016 in San Antonio, Texas?” 1
 KOI  Knowledge Graph of Incidents  to be used for efficiently answering
numerical questions about domain-specific events
 Fully unsupervised approach, utilizing already existing NLP tools
 Simple cross-document event coreference method via document clustering
 One main event/incident per document assumption  does not always hold
 Low performance on identifying the existence of event participants for specific
roles
 “Two boys and a girl were shot while…”
Thank you!
17
 When KOI yields perfect (non-empty) sets of answer documents:
 On answering number of incidents (cross-document event coreference)
 On answering number of victims (counting event participants)
 34.3% correct number
18
correct number
overestimate
underestimate
from only counting
from only numeral mentions
combination

More Related Content

More from Fariz Darari

Artificial Neural Networks: Pointers
Artificial Neural Networks: PointersArtificial Neural Networks: Pointers
Artificial Neural Networks: Pointers
Fariz Darari
 
Open Tridharma at ICACSIS 2019
Open Tridharma at ICACSIS 2019Open Tridharma at ICACSIS 2019
Open Tridharma at ICACSIS 2019
Fariz Darari
 
Defense Slides of Avicenna Wisesa - PROWD
Defense Slides of Avicenna Wisesa - PROWDDefense Slides of Avicenna Wisesa - PROWD
Defense Slides of Avicenna Wisesa - PROWD
Fariz Darari
 
Seminar Laporan Aktualisasi - Tridharma Terbuka - Fariz Darari
Seminar Laporan Aktualisasi - Tridharma Terbuka - Fariz DarariSeminar Laporan Aktualisasi - Tridharma Terbuka - Fariz Darari
Seminar Laporan Aktualisasi - Tridharma Terbuka - Fariz Darari
Fariz Darari
 
Foundations of Programming - Java OOP
Foundations of Programming - Java OOPFoundations of Programming - Java OOP
Foundations of Programming - Java OOP
Fariz Darari
 
Recursion in Python
Recursion in PythonRecursion in Python
Recursion in Python
Fariz Darari
 
[ISWC 2013] Completeness statements about RDF data sources and their use for ...
[ISWC 2013] Completeness statements about RDF data sources and their use for ...[ISWC 2013] Completeness statements about RDF data sources and their use for ...
[ISWC 2013] Completeness statements about RDF data sources and their use for ...
Fariz Darari
 
Testing in Python: doctest and unittest (Updated)
Testing in Python: doctest and unittest (Updated)Testing in Python: doctest and unittest (Updated)
Testing in Python: doctest and unittest (Updated)
Fariz Darari
 
Testing in Python: doctest and unittest
Testing in Python: doctest and unittestTesting in Python: doctest and unittest
Testing in Python: doctest and unittest
Fariz Darari
 
Dissertation Defense - Managing and Consuming Completeness Information for RD...
Dissertation Defense - Managing and Consuming Completeness Information for RD...Dissertation Defense - Managing and Consuming Completeness Information for RD...
Dissertation Defense - Managing and Consuming Completeness Information for RD...
Fariz Darari
 
Research Writing - 2018.07.18
Research Writing - 2018.07.18Research Writing - 2018.07.18
Research Writing - 2018.07.18
Fariz Darari
 
Comparing Index Structures for Completeness Reasoning
Comparing Index Structures for Completeness ReasoningComparing Index Structures for Completeness Reasoning
Comparing Index Structures for Completeness Reasoning
Fariz Darari
 
Python in 30 minutes!
Python in 30 minutes!Python in 30 minutes!
Python in 30 minutes!
Fariz Darari
 
Research Writing - Universitas Indonesia
Research Writing - Universitas IndonesiaResearch Writing - Universitas Indonesia
Research Writing - Universitas Indonesia
Fariz Darari
 
Otsuka Talk in Dec 2017
Otsuka Talk in Dec 2017Otsuka Talk in Dec 2017
Otsuka Talk in Dec 2017
Fariz Darari
 
Knowledge Technologies: Opportunities and Challenges
Knowledge Technologies: Opportunities and ChallengesKnowledge Technologies: Opportunities and Challenges
Knowledge Technologies: Opportunities and Challenges
Fariz Darari
 
COOL-WD: A Completeness Tool for Wikidata
COOL-WD: A Completeness Tool for WikidataCOOL-WD: A Completeness Tool for Wikidata
COOL-WD: A Completeness Tool for Wikidata
Fariz Darari
 
The DANGERS of Incomplete Data
The DANGERS of Incomplete DataThe DANGERS of Incomplete Data
The DANGERS of Incomplete Data
Fariz Darari
 
Cardinal Virtues - Extracting Relation Cardinalities from Text
Cardinal Virtues - Extracting Relation Cardinalities from TextCardinal Virtues - Extracting Relation Cardinalities from Text
Cardinal Virtues - Extracting Relation Cardinalities from Text
Fariz Darari
 
2017 UniBZ Winter Seminar Poster: Managing and Consuming Completeness Informa...
2017 UniBZ Winter Seminar Poster: Managing and Consuming Completeness Informa...2017 UniBZ Winter Seminar Poster: Managing and Consuming Completeness Informa...
2017 UniBZ Winter Seminar Poster: Managing and Consuming Completeness Informa...
Fariz Darari
 

More from Fariz Darari (20)

Artificial Neural Networks: Pointers
Artificial Neural Networks: PointersArtificial Neural Networks: Pointers
Artificial Neural Networks: Pointers
 
Open Tridharma at ICACSIS 2019
Open Tridharma at ICACSIS 2019Open Tridharma at ICACSIS 2019
Open Tridharma at ICACSIS 2019
 
Defense Slides of Avicenna Wisesa - PROWD
Defense Slides of Avicenna Wisesa - PROWDDefense Slides of Avicenna Wisesa - PROWD
Defense Slides of Avicenna Wisesa - PROWD
 
Seminar Laporan Aktualisasi - Tridharma Terbuka - Fariz Darari
Seminar Laporan Aktualisasi - Tridharma Terbuka - Fariz DarariSeminar Laporan Aktualisasi - Tridharma Terbuka - Fariz Darari
Seminar Laporan Aktualisasi - Tridharma Terbuka - Fariz Darari
 
Foundations of Programming - Java OOP
Foundations of Programming - Java OOPFoundations of Programming - Java OOP
Foundations of Programming - Java OOP
 
Recursion in Python
Recursion in PythonRecursion in Python
Recursion in Python
 
[ISWC 2013] Completeness statements about RDF data sources and their use for ...
[ISWC 2013] Completeness statements about RDF data sources and their use for ...[ISWC 2013] Completeness statements about RDF data sources and their use for ...
[ISWC 2013] Completeness statements about RDF data sources and their use for ...
 
Testing in Python: doctest and unittest (Updated)
Testing in Python: doctest and unittest (Updated)Testing in Python: doctest and unittest (Updated)
Testing in Python: doctest and unittest (Updated)
 
Testing in Python: doctest and unittest
Testing in Python: doctest and unittestTesting in Python: doctest and unittest
Testing in Python: doctest and unittest
 
Dissertation Defense - Managing and Consuming Completeness Information for RD...
Dissertation Defense - Managing and Consuming Completeness Information for RD...Dissertation Defense - Managing and Consuming Completeness Information for RD...
Dissertation Defense - Managing and Consuming Completeness Information for RD...
 
Research Writing - 2018.07.18
Research Writing - 2018.07.18Research Writing - 2018.07.18
Research Writing - 2018.07.18
 
Comparing Index Structures for Completeness Reasoning
Comparing Index Structures for Completeness ReasoningComparing Index Structures for Completeness Reasoning
Comparing Index Structures for Completeness Reasoning
 
Python in 30 minutes!
Python in 30 minutes!Python in 30 minutes!
Python in 30 minutes!
 
Research Writing - Universitas Indonesia
Research Writing - Universitas IndonesiaResearch Writing - Universitas Indonesia
Research Writing - Universitas Indonesia
 
Otsuka Talk in Dec 2017
Otsuka Talk in Dec 2017Otsuka Talk in Dec 2017
Otsuka Talk in Dec 2017
 
Knowledge Technologies: Opportunities and Challenges
Knowledge Technologies: Opportunities and ChallengesKnowledge Technologies: Opportunities and Challenges
Knowledge Technologies: Opportunities and Challenges
 
COOL-WD: A Completeness Tool for Wikidata
COOL-WD: A Completeness Tool for WikidataCOOL-WD: A Completeness Tool for Wikidata
COOL-WD: A Completeness Tool for Wikidata
 
The DANGERS of Incomplete Data
The DANGERS of Incomplete DataThe DANGERS of Incomplete Data
The DANGERS of Incomplete Data
 
Cardinal Virtues - Extracting Relation Cardinalities from Text
Cardinal Virtues - Extracting Relation Cardinalities from TextCardinal Virtues - Extracting Relation Cardinalities from Text
Cardinal Virtues - Extracting Relation Cardinalities from Text
 
2017 UniBZ Winter Seminar Poster: Managing and Consuming Completeness Informa...
2017 UniBZ Winter Seminar Poster: Managing and Consuming Completeness Informa...2017 UniBZ Winter Seminar Poster: Managing and Consuming Completeness Informa...
2017 UniBZ Winter Seminar Poster: Managing and Consuming Completeness Informa...
 

Recently uploaded

CAKE: Sharing Slices of Confidential Data on Blockchain
CAKE: Sharing Slices of Confidential Data on BlockchainCAKE: Sharing Slices of Confidential Data on Blockchain
CAKE: Sharing Slices of Confidential Data on Blockchain
Claudio Di Ciccio
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdfAI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf
Techgropse Pvt.Ltd.
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 

Recently uploaded (20)

CAKE: Sharing Slices of Confidential Data on Blockchain
CAKE: Sharing Slices of Confidential Data on BlockchainCAKE: Sharing Slices of Confidential Data on Blockchain
CAKE: Sharing Slices of Confidential Data on Blockchain
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdfAI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf
AI-Powered Food Delivery Transforming App Development in Saudi Arabia.pdf
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 

KOI - Knowledge Of Incidents - SemEval 2018

  • 1. KOI at SemEval-2018 Task 5: K O I Paramita Mirza Fariz Darari Rahmad Mahendra paramita@mpi-inf.mpg.de. fariz@cs.ui.ac.id rahmad.mahendra@cs.ui.ac.id 1 SemEval-2018 International Workshop on Semantic Evaluation New Orleans, LA, USA, June 5-6 2018
  • 2. (1) “How many killing incidents happened in June 2016 in San Antonio, Texas?” (2) “How many people were killed in June 2016 in San Antonio, Texas?” 2 Numerical answer: (1) 2 (2) 2 Supporting documents: Knowledge Graph of Incidents News articles
  • 4. (1) “How many killing incidents happened in June 2016 in San Antonio, Texas?” (2) “How many people were killed in June 2016 in San Antonio, Texas?” 4 Document Preprocessing Event Extraction and Coreference Resolution KG Construction & Population Knowledge Graph of Incidents Question Parsing Query Execution News articles Numerical answer: (1) 2 (2) 2 Supporting documents:
  • 5. Powered by: 5 Document Preprocessing Event Extraction and Coreference Resolution KG Construction & Population News articles Word sense disambiguation & entity linking (Navigli & Ponzetto, 2012)
  • 6. One man, 41-years-old John Doe, was fatally shot early Sunday morning in San Antonio, police said. Powered by: Word sense disambiguation & entity linking (Navigli & Ponzetto, 2012) 6 Document Preprocessing Event Extraction and Coreference Resolution KG Construction & Population News articles Named entity recognition (spacy.io) HeidelTime Time expression recognition & normalization (Strötgen & Gertz, 2013) One man, 41-years-old John Doe, was fatally shot early Sunday morning in San Antonio, police said. June 5, 2018 PERSON 03-06-2018 Semantic role labelling (Collobert et al., 2011) shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio said A0: police A1: One man , 41-years-old John Doe, was fatally shot early Sunday Morning in San Antonio
  • 7. Document Preprocessing Event Extraction and Coreference Resolution KG Construction & Population News articles 7 One man, 41-years-old John Doe, was fatally shot early Sunday morning in San Antonio, police said.One man, 41-years-old John Doe, was fatally shot early Sunday morning in San Antonio, police said. June 5, 2018 PERSON 03-06-2018 shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio said A0: police A1: One man , 41-years-old John Doe, was fatally shot early Sunday Morning in San Antonio said A0: police A1: One man , 41-years-old John Doe, was fatally shot early Sunday Morning in San Antonio Predicate-level event shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio • Identifying incident-related concepts  via path- based WordNet similarity (Hirst et al., 1998) > 5.0 • Identifying event participants and their roles (e.g., victim) • Identifying number of victims (except for suspect- related predicates) • Identifying and normalizing event time shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio Predicate-level event, type: killing
  • 8. Document Preprocessing Event Extraction and Coreference Resolution KG Construction & Population News articles 8 One man, 41-years-old John Doe, was fatally shot early Sunday morning in San Antonio, police said.One man, 41-years-old John Doe, was fatally shot early Sunday morning in San Antonio, police said. June 5, 2018 PERSON 03-06-2018 shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio said A0: police A1: One man , 41-years-old John Doe, was fatally shot early Sunday Morning in San Antonio said A0: police A1: One man , 41-years-old John Doe, was fatally shot early Sunday Morning in San Antonio Sentence-level event shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio killing injuring injuring fire Document-level event (incident) • One incident per incident type per document • Determine incident location • Aggregate event participants per incident type • Aggregate number of victims per incident type • Determine incident time per incident type shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio shot A1: One man , 41-years-old John Doe AM-MNR: fatally AM-TMP: early Sunday Morning AM-LOC: in San Antonio killing injuring fire Predicate-level event, type: killing
  • 9. Document Preprocessing Event Extraction and Coreference Resolution KG Construction & Population News articles 9 injuring fire Cross-document event coreference resolution via document clustering killing injuring fire dct1: June 4, 2018 dct2: June 5, 2018 ≈ |dct1-dct2|≤ 3 days Cosine similarity of TF-IDF-based vectors of: • BabelNet senses • spaCy’s PERSON and GPE
  • 10. KOI-KG  https://koi.cs.ui.ac.id/incidents  Powered by 10 Document Preprocessing Event Extraction and Coreference Resolution KG Construction & Population News articles injuring fire killing injuring fire dct1: June 4, 2018 dct2: June 5, 2018 killing injuring fire supported by John Doe 1 victim 03-06-2018 San Antonio ≈ |dct1-dct2|≤ 3 days
  • 11. (1) “How many killing incidents happened in June 2016 in San Antonio, Texas?” (2) “How many people were killed in June 2016 in San Antonio, Texas?” 11 Document Preprocessing Event Extraction and Coreference Resolution KG Construction & Population Knowledge Graph of Incidents Query Execution News articles Numerical answer: (1) 2 (2) 2 Supporting documents: SPARQL query Question Parsing SELECT ?event ?document WHERE { ?event koi:eventType koi:killing . ?event koi:eventDate [ koi:month "06" ; koi:year "2016" ] . ?event koi:location [ koi:city <http://dbpedia.org/resource/San_Antonio> ; koi:state <http://dbpedia.org/resource/Texas> ] . ?document koi:event ?event . }
  • 13. System Subtask S1 Subtask S2 Subtask S3 %Ans Doc-F1 %Ans Doc-F1 %Ans Doc-F1 Baseline 16.5 67.3 100.0 26.4 - - KOI 44.2 83.0 67.5 55.2 66.6 69.6 NAI-SEA 100.0 78.3 100.0 50.5 100.0 63.6 FEUP 100.0 24.7 100.0 30.5 100.0 26.8 NewsReader 51.6 46.2 100.0 36.9 100.0 26.8 13 KOI v2 44.2 83.0 100.0 51.2 100.0 49.1 KOI v3 55.1 85.7 100.0 54.8 100.0 50.9 KOI v2: answer with 0 and empty set of answer documents, if cannot find an answer KOI v3: bug (fire burning & job firing incident types) is fixed, improved incident time identification
  • 14. KOI v3 Micro-averaged P R F1 Answered questions 86.6 74.0 79.8 All questions 86.6 41.6 56.2 14 Killing 88.5 43.2 58.1 Injuring 82.8 37.4 51.5 Job firing 100.0 8.7 16.0 Fire burning 96.9 66.2 78.7 Participant 84.8 43.0 57.0 Location 89.1 39.4 54.6 Time 86.0 42.4 56.8
  • 15. System Subtask S2 Subtask S3 %Ans Accuracy RMSE %Ans Accuracy RMSE Baseline 18.3 8.5 - - KOI 67.5 20.4 6.2 66.6 19.3 7.9 FEUP 100.0 26.4 6.1 100.0 30.4 478.7 NewsReader 100.0 21.9 44.0 100.0 21.1 296.5 NAI-SEA 100.0 17.4 20.6 100.0 20.2 13.5 15 KOI v2 100.0 25.6 5.2 100.0 24.8 7.1 KOI v3 100.0 27.4 5.3 100.0 23.0 7.7 KOI v2: answer with 0 and empty set of answer documents, if cannot find an answer KOI v3: bug (fire burning & job firing incident types) is fixed, improved incident time identification
  • 16. 16 (1) “How many killing incidents happened in June 2016 in San Antonio, Texas?” 2 (2) “How many people were killed in June 2016 in San Antonio, Texas?” 1
  • 17.  KOI  Knowledge Graph of Incidents  to be used for efficiently answering numerical questions about domain-specific events  Fully unsupervised approach, utilizing already existing NLP tools  Simple cross-document event coreference method via document clustering  One main event/incident per document assumption  does not always hold  Low performance on identifying the existence of event participants for specific roles  “Two boys and a girl were shot while…” Thank you! 17
  • 18.  When KOI yields perfect (non-empty) sets of answer documents:  On answering number of incidents (cross-document event coreference)  On answering number of victims (counting event participants)  34.3% correct number 18 correct number overestimate underestimate from only counting from only numeral mentions combination

Editor's Notes

  1. Add exmamples for spaCy HeidelTime and Senna
  2. Add exmamples for spaCy HeidelTime and Senna