SlideShare a Scribd company logo
1 of 20
Page 1 / 20
Survey on Challenges of
Question Answering
in the Semantic Web
Semantic Web journal 2016
Höffner et al.
Leipzig University, Institute of Computer Science, AKSW Group
홍동균 (Saltlux Inc.)
2018. 11. 16
Page 2 / 20
Contents
1. Introduction
2. Methodology (to find SQA systems)
3. 7 Challenges
4. 7 Challenges in Adam QA
5. Conclusion
Page 3 / 20
Introduction
• Semantic question answering (SQA)
– Asking questions in natural language and receiving answers from a RDF
knowledge base.
• SQA systems
– Since natural language is complex and ambiguous, reliable SQA systems
require many different components.
– Instead of a shared effort, however, many essential components are
redeveloped, which is an inefficient use of researcher’s time and resources.
Page 4 / 20
Introduction
• Contributions
– Surveyed existing work with 72 publications about 62 systems developed
from 2010 to 2015.
– Identified challenges faced by those approaches and collected solutions for
them from the 72 publications.
– Made recommendations on how to develop future SQA systems.
Page 5 / 20
Methodology
• Inclusion criteria
– Candidate 1: First 300 publications of Google Scholar search results
 Query: “ ‘question answering’ AND (‘Semantic Web’ OR ‘data web’) “
– Candidate 2: All publications in the proceeding
 Target conference: ISWC, ESWC, WWW, NLDB, QALD challenge
• Exclusion Criteria
– Published before November 2010 or after July 2015
– Not related to SQA
• Result
– 72 publications describing 62 distinct SQA systems.
 (39 of them from candidate 1, 33 of them form candidate 2)
Page 6 / 20
7 Challenges
• Lexical Gap
• Ambiguity
• Multilingualism
• Complex Queries
• Distributed Knowledge
• Procedural, Temporal and Spatial Questions
• Templates
Number of publications per year
addressed challenge
Page 7 / 20
Lexical Gap
• The vocabulary used in a question is different from the one used in
the labels of the knowledge base. (linking problem)
– Different form of the same word
 (run <-> running, ran), (running <-> runnign, runing)
– Different form of the similar meaning
 Synonyms (run <-> sprint)
 hyper-hyponym pair (chemical process - photosynthesis)
– Different phrases of the same RDF property
 “What is the population of A”, “How many people are there in A?” -> ‘population’
Page 8 / 20
Lexical Gap - Different form of the same word
• String normalization
– Conversion to lower case or to base form
 Stemming, Lemmatizing (running, ran -> run)
• Similarity functions
– Quantifying similarity using a function and a threshold can be applied
 Jaro-Winkler distance
 Edit-distance
 Largest common substring
Page 9 / 20
Lexical Gap - Different form of the similar meaning
• Automatic Query Expansion
– Using additional labels from lexical databases such as WordNet
– Increase recall but lead to mismatches between related words and thus can
decrease the precision.
WordNet
Page 10 / 20
Lexical Gap - Different phrases of the same RDF property
• Pattern libraries
– BOA [Gerber et al.] generates patterns for RDF predicates from corpus and a
knowledge base
 E.g. (:writing, “X wrote Y”), (:writer, “X is written by Y”), (:population, “How many
people are there in X?”)
– PARALEX [Fader et al.]
PARALEX’s examples of paraphrase from the QA dataset
(Wikianswers)
PARALEX’s examples of lexical entries
Natural Language Question:
How big is nyc?
Formal query:
Population(?, new-york)
Learning
Page 11 / 20
Ambiguity
• The phenomenon of the same phrase having different meanings.
– Homonymy: same string refers to different concepts
 (money) bank vs. (river) bank
– Polysemy: same string refers to different but related concepts
 bank (as a company) vs. bank (as a building).
“이동국” in Adam KB
Page 12 / 20
Ambiguity - Disambiguation
• Resource-based methods
– Ranking the candidate RDF resources based of their properties and the
connections between them
– gAnswer [Huang et al.]
Q: Who was married to an actor that played in Philadelphia?
Subgraph matching
Page 13 / 20
Complex Queries
• Complex Queries
– Requiring multiple facts, certain restriction, aggregation, filtered results…
 E.g., Comparison, yes/no, quantifiers, superlatives
– PYTHIA [Unger et al.] constructs formal query even for complex query using
ontology-based grammar
Page 14 / 20
Templates
• (1) Template-based approach
– Map input questions to either manually or automatically created SPARQL
query templates
• (2) Template-free approach
– Build SPARQL queries based on the given syntactic structure of the input
question.
Template-based approach:
TBSL [Unger et al.]
Template-free approach:
Xser [Xu et al.]
Page 15 / 20
Others
• Multilingualism
– SQA systems that can handle multiple input languages, which may even
differ from the language used to encode the knowledge.
• Distributed Knowledge
– Some questions are only answerable with multiple knowledge bases
• Procedural Questions
– E.g. How question (step-by-step instructions)
• Temporal Question
– E.g. Temporal question on clinical narratives
• Spatial Questions
– E.g. Relationship of locations such as crossing, inclusion and nearness.
Page 16 / 20
7 Challenges in Adam QA
• Lexical Gap
– String normalization, similarity function, synonyms -> available
– Patterns for RDF predicates -> unavailable
 Current: string matching
• Ambiguity
– Ranking the candidate RDF resources -> Available (but naïve approach)
 Current: resources are ranked by the number of triples
Page 17 / 20
7 Challenges in Adam QA
• Complex Queries
– Comparisons, yes/no, superlatives, quantifiers -> partially available
• Templates
– Template-based approach -> available
– Template-free approach -> soon (GBQA?)
Page 18 / 20
7 Challenges in Adam QA
• Multilingualism
– Unavailable
• Distributed Knowledge
– Unavailable
• Procedural, Temporal and Spatial Questions
– Partially available
Page 19 / 20
Conclusion
• Analyzing 62 systems and their contributions to seven challenges for
SQA systems.
• Recommendation on future SQA system
– Modularization & Reusing existing parts
– Benchmarking single algorithmic modules instead of benchmarking a
system as a whole.
Page 20 / 20
Thank you.

More Related Content

What's hot

The Rise of Approximate Ontology Reasoning: Is It Mainstream Yet? --- Revisit...
The Rise of Approximate Ontology Reasoning: Is It Mainstream Yet? --- Revisit...The Rise of Approximate Ontology Reasoning: Is It Mainstream Yet? --- Revisit...
The Rise of Approximate Ontology Reasoning: Is It Mainstream Yet? --- Revisit...Jeff Z. Pan
 
Semantic Web: From Representations to Applications
Semantic Web: From Representations to ApplicationsSemantic Web: From Representations to Applications
Semantic Web: From Representations to ApplicationsGuus Schreiber
 
Query Translation for Ontology-extended Data Sources
Query Translation for Ontology-extended Data SourcesQuery Translation for Ontology-extended Data Sources
Query Translation for Ontology-extended Data SourcesJie Bao
 
Improving data quality at Europeana (SWIB 2016)
Improving data quality at Europeana (SWIB 2016)Improving data quality at Europeana (SWIB 2016)
Improving data quality at Europeana (SWIB 2016)Péter Király
 
Owl web ontology language
Owl  web ontology languageOwl  web ontology language
Owl web ontology languagehassco2011
 
Best Practices for Large Scale Text Mining Processing
Best Practices for Large Scale Text Mining ProcessingBest Practices for Large Scale Text Mining Processing
Best Practices for Large Scale Text Mining ProcessingOntotext
 
GTTS System for the Spoken Web Search Task at MediaEval 2012
GTTS System for the Spoken Web Search Task at MediaEval 2012GTTS System for the Spoken Web Search Task at MediaEval 2012
GTTS System for the Spoken Web Search Task at MediaEval 2012MediaEval2012
 
Publishing and Using Linked Open Data - Day 2
Publishing and Using Linked Open Data - Day 2Publishing and Using Linked Open Data - Day 2
Publishing and Using Linked Open Data - Day 2Richard Urban
 
OWL Web Ontology Language Overview
OWL Web Ontology Language OverviewOWL Web Ontology Language Overview
OWL Web Ontology Language OverviewIgor Myroshnichenko
 
Jarrar: OWL -Web Ontology Language
Jarrar: OWL -Web Ontology LanguageJarrar: OWL -Web Ontology Language
Jarrar: OWL -Web Ontology LanguageMustafa Jarrar
 
Jarrar: OWL (Web Ontology Language)
Jarrar: OWL (Web Ontology Language)Jarrar: OWL (Web Ontology Language)
Jarrar: OWL (Web Ontology Language)Mustafa Jarrar
 
The Standards Mosaic Opening the Way to New Technologies
The Standards Mosaic Opening the Way to New TechnologiesThe Standards Mosaic Opening the Way to New Technologies
The Standards Mosaic Opening the Way to New TechnologiesDave Lewis
 
RDA: thinking globally, acting globally
RDA: thinking globally, acting globallyRDA: thinking globally, acting globally
RDA: thinking globally, acting globallyGordon Dunsire
 
Introduction to question answering for linked data & big data
Introduction to question answering for linked data & big dataIntroduction to question answering for linked data & big data
Introduction to question answering for linked data & big dataAndre Freitas
 

What's hot (20)

The Rise of Approximate Ontology Reasoning: Is It Mainstream Yet? --- Revisit...
The Rise of Approximate Ontology Reasoning: Is It Mainstream Yet? --- Revisit...The Rise of Approximate Ontology Reasoning: Is It Mainstream Yet? --- Revisit...
The Rise of Approximate Ontology Reasoning: Is It Mainstream Yet? --- Revisit...
 
Semantic Web: From Representations to Applications
Semantic Web: From Representations to ApplicationsSemantic Web: From Representations to Applications
Semantic Web: From Representations to Applications
 
Oke
OkeOke
Oke
 
The Web Ontology Language
The Web Ontology LanguageThe Web Ontology Language
The Web Ontology Language
 
Query Translation for Ontology-extended Data Sources
Query Translation for Ontology-extended Data SourcesQuery Translation for Ontology-extended Data Sources
Query Translation for Ontology-extended Data Sources
 
Improving data quality at Europeana (SWIB 2016)
Improving data quality at Europeana (SWIB 2016)Improving data quality at Europeana (SWIB 2016)
Improving data quality at Europeana (SWIB 2016)
 
Owl web ontology language
Owl  web ontology languageOwl  web ontology language
Owl web ontology language
 
Best Practices for Large Scale Text Mining Processing
Best Practices for Large Scale Text Mining ProcessingBest Practices for Large Scale Text Mining Processing
Best Practices for Large Scale Text Mining Processing
 
GTTS System for the Spoken Web Search Task at MediaEval 2012
GTTS System for the Spoken Web Search Task at MediaEval 2012GTTS System for the Spoken Web Search Task at MediaEval 2012
GTTS System for the Spoken Web Search Task at MediaEval 2012
 
OWL briefing
OWL briefingOWL briefing
OWL briefing
 
Publishing and Using Linked Open Data - Day 2
Publishing and Using Linked Open Data - Day 2Publishing and Using Linked Open Data - Day 2
Publishing and Using Linked Open Data - Day 2
 
OWL Web Ontology Language Overview
OWL Web Ontology Language OverviewOWL Web Ontology Language Overview
OWL Web Ontology Language Overview
 
Jarrar: OWL -Web Ontology Language
Jarrar: OWL -Web Ontology LanguageJarrar: OWL -Web Ontology Language
Jarrar: OWL -Web Ontology Language
 
Jarrar: OWL (Web Ontology Language)
Jarrar: OWL (Web Ontology Language)Jarrar: OWL (Web Ontology Language)
Jarrar: OWL (Web Ontology Language)
 
The Standards Mosaic Opening the Way to New Technologies
The Standards Mosaic Opening the Way to New TechnologiesThe Standards Mosaic Opening the Way to New Technologies
The Standards Mosaic Opening the Way to New Technologies
 
Snac webinar v3
Snac webinar v3Snac webinar v3
Snac webinar v3
 
RDA: thinking globally, acting globally
RDA: thinking globally, acting globallyRDA: thinking globally, acting globally
RDA: thinking globally, acting globally
 
Introduction to question answering for linked data & big data
Introduction to question answering for linked data & big dataIntroduction to question answering for linked data & big data
Introduction to question answering for linked data & big data
 
Ontology
OntologyOntology
Ontology
 
Xml unit1
Xml unit1Xml unit1
Xml unit1
 

Similar to 20181106 survey on challenges of question answering in the semantic web saltlux

Using and learning phrases
Using and learning phrasesUsing and learning phrases
Using and learning phrasesCassandra Jacobs
 
Semantic Application for Healthcare
Semantic Application for HealthcareSemantic Application for Healthcare
Semantic Application for Healthcarescholten
 
Knowledge engineering and the Web
Knowledge engineering and the WebKnowledge engineering and the Web
Knowledge engineering and the WebGuus Schreiber
 
Language Models for Information Retrieval
Language Models for Information RetrievalLanguage Models for Information Retrieval
Language Models for Information RetrievalNik Spirin
 
Approach to leverage Websites to APIs through Semantics
Approach to leverage Websites to APIs through SemanticsApproach to leverage Websites to APIs through Semantics
Approach to leverage Websites to APIs through SemanticsIoannis Stavrakantonakis
 
Knowledge Representation, Semantic Web
Knowledge Representation, Semantic WebKnowledge Representation, Semantic Web
Knowledge Representation, Semantic WebSerendipity Seraph
 
A review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic WebA review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic WebSimon Price
 
Tutorial on Semantic Digital Libraries (WWW'2007)
Tutorial on Semantic Digital Libraries (WWW'2007)Tutorial on Semantic Digital Libraries (WWW'2007)
Tutorial on Semantic Digital Libraries (WWW'2007)Sebastian Ryszard Kruk
 
Innovative methods for data integration: Linked Data and NLP
Innovative methods for data integration: Linked Data and NLPInnovative methods for data integration: Linked Data and NLP
Innovative methods for data integration: Linked Data and NLPariadnenetwork
 
EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...
EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...
EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...Keith.May
 
Fri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineeringFri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineeringeswcsummerschool
 
Hide the Stack: Toward Usable Linked Data
Hide the Stack:Toward Usable Linked DataHide the Stack:Toward Usable Linked Data
Hide the Stack: Toward Usable Linked Dataaba-sah
 
JIST2015-Computing the Semantic Similarity of Resources in DBpedia for Recomm...
JIST2015-Computing the Semantic Similarity of Resources in DBpedia for Recomm...JIST2015-Computing the Semantic Similarity of Resources in DBpedia for Recomm...
JIST2015-Computing the Semantic Similarity of Resources in DBpedia for Recomm...GUANGYUAN PIAO
 
A Metadata Application Profile for KOS Vocabulary Registries (KOS-AP)
A Metadata Application Profile for KOS Vocabulary Registries (KOS-AP)A Metadata Application Profile for KOS Vocabulary Registries (KOS-AP)
A Metadata Application Profile for KOS Vocabulary Registries (KOS-AP)Marcia Zeng
 

Similar to 20181106 survey on challenges of question answering in the semantic web saltlux (20)

semantic web & natural language
semantic web & natural languagesemantic web & natural language
semantic web & natural language
 
Using and learning phrases
Using and learning phrasesUsing and learning phrases
Using and learning phrases
 
What is word2vec?
What is word2vec?What is word2vec?
What is word2vec?
 
Semantic Application for Healthcare
Semantic Application for HealthcareSemantic Application for Healthcare
Semantic Application for Healthcare
 
Knowledge engineering and the Web
Knowledge engineering and the WebKnowledge engineering and the Web
Knowledge engineering and the Web
 
Analysis on semantic web layer cake entities
Analysis on semantic web layer cake entitiesAnalysis on semantic web layer cake entities
Analysis on semantic web layer cake entities
 
Language Models for Information Retrieval
Language Models for Information RetrievalLanguage Models for Information Retrieval
Language Models for Information Retrieval
 
Approach to leverage Websites to APIs through Semantics
Approach to leverage Websites to APIs through SemanticsApproach to leverage Websites to APIs through Semantics
Approach to leverage Websites to APIs through Semantics
 
Knowledge Representation, Semantic Web
Knowledge Representation, Semantic WebKnowledge Representation, Semantic Web
Knowledge Representation, Semantic Web
 
A review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic WebA review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic Web
 
Tutorial on Semantic Digital Libraries (WWW'2007)
Tutorial on Semantic Digital Libraries (WWW'2007)Tutorial on Semantic Digital Libraries (WWW'2007)
Tutorial on Semantic Digital Libraries (WWW'2007)
 
Innovative methods for data integration: Linked Data and NLP
Innovative methods for data integration: Linked Data and NLPInnovative methods for data integration: Linked Data and NLP
Innovative methods for data integration: Linked Data and NLP
 
EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...
EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...
EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...
 
Knowledge mangement
Knowledge mangementKnowledge mangement
Knowledge mangement
 
Fri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineeringFri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineering
 
From ontology to wiki
From ontology to wikiFrom ontology to wiki
From ontology to wiki
 
Hide the Stack: Toward Usable Linked Data
Hide the Stack:Toward Usable Linked DataHide the Stack:Toward Usable Linked Data
Hide the Stack: Toward Usable Linked Data
 
NLP & DBpedia
 NLP & DBpedia NLP & DBpedia
NLP & DBpedia
 
JIST2015-Computing the Semantic Similarity of Resources in DBpedia for Recomm...
JIST2015-Computing the Semantic Similarity of Resources in DBpedia for Recomm...JIST2015-Computing the Semantic Similarity of Resources in DBpedia for Recomm...
JIST2015-Computing the Semantic Similarity of Resources in DBpedia for Recomm...
 
A Metadata Application Profile for KOS Vocabulary Registries (KOS-AP)
A Metadata Application Profile for KOS Vocabulary Registries (KOS-AP)A Metadata Application Profile for KOS Vocabulary Registries (KOS-AP)
A Metadata Application Profile for KOS Vocabulary Registries (KOS-AP)
 

More from DongGyun Hong

20170928 session basedrec_hyu_dake
20170928 session basedrec_hyu_dake20170928 session basedrec_hyu_dake
20170928 session basedrec_hyu_dakeDongGyun Hong
 
20170216 conv mf_hyu_dake
20170216 conv mf_hyu_dake20170216 conv mf_hyu_dake
20170216 conv mf_hyu_dakeDongGyun Hong
 
180212 normalization hyu_dake
180212 normalization hyu_dake180212 normalization hyu_dake
180212 normalization hyu_dakeDongGyun Hong
 
20190901 seq2 sparql_kips
20190901 seq2 sparql_kips20190901 seq2 sparql_kips
20190901 seq2 sparql_kipsDongGyun Hong
 
20181217 sac dong_gyun_hong
20181217 sac dong_gyun_hong20181217 sac dong_gyun_hong
20181217 sac dong_gyun_hongDongGyun Hong
 
20200923 open domain-qa_saltlux
20200923 open domain-qa_saltlux20200923 open domain-qa_saltlux
20200923 open domain-qa_saltluxDongGyun Hong
 

More from DongGyun Hong (7)

20170928 session basedrec_hyu_dake
20170928 session basedrec_hyu_dake20170928 session basedrec_hyu_dake
20170928 session basedrec_hyu_dake
 
20170216 conv mf_hyu_dake
20170216 conv mf_hyu_dake20170216 conv mf_hyu_dake
20170216 conv mf_hyu_dake
 
180212 normalization hyu_dake
180212 normalization hyu_dake180212 normalization hyu_dake
180212 normalization hyu_dake
 
20190901 seq2 sparql_kips
20190901 seq2 sparql_kips20190901 seq2 sparql_kips
20190901 seq2 sparql_kips
 
20181103 kbcqa kips
20181103 kbcqa kips20181103 kbcqa kips
20181103 kbcqa kips
 
20181217 sac dong_gyun_hong
20181217 sac dong_gyun_hong20181217 sac dong_gyun_hong
20181217 sac dong_gyun_hong
 
20200923 open domain-qa_saltlux
20200923 open domain-qa_saltlux20200923 open domain-qa_saltlux
20200923 open domain-qa_saltlux
 

Recently uploaded

Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...aditisharan08
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfPower Karaoke
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Andreas Granig
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningVitsRangannavar
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?Watsoo Telematics
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 
XpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsXpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsMehedi Hasan Shohan
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
buds n tech IT solutions
buds n  tech IT                solutionsbuds n  tech IT                solutions
buds n tech IT solutionsmonugehlot87
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationkaushalgiri8080
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样umasea
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptkotipi9215
 

Recently uploaded (20)

Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdf
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learning
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 
XpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsXpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software Solutions
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
buds n tech IT solutions
buds n  tech IT                solutionsbuds n  tech IT                solutions
buds n tech IT solutions
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanation
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.ppt
 

20181106 survey on challenges of question answering in the semantic web saltlux

  • 1. Page 1 / 20 Survey on Challenges of Question Answering in the Semantic Web Semantic Web journal 2016 Höffner et al. Leipzig University, Institute of Computer Science, AKSW Group 홍동균 (Saltlux Inc.) 2018. 11. 16
  • 2. Page 2 / 20 Contents 1. Introduction 2. Methodology (to find SQA systems) 3. 7 Challenges 4. 7 Challenges in Adam QA 5. Conclusion
  • 3. Page 3 / 20 Introduction • Semantic question answering (SQA) – Asking questions in natural language and receiving answers from a RDF knowledge base. • SQA systems – Since natural language is complex and ambiguous, reliable SQA systems require many different components. – Instead of a shared effort, however, many essential components are redeveloped, which is an inefficient use of researcher’s time and resources.
  • 4. Page 4 / 20 Introduction • Contributions – Surveyed existing work with 72 publications about 62 systems developed from 2010 to 2015. – Identified challenges faced by those approaches and collected solutions for them from the 72 publications. – Made recommendations on how to develop future SQA systems.
  • 5. Page 5 / 20 Methodology • Inclusion criteria – Candidate 1: First 300 publications of Google Scholar search results  Query: “ ‘question answering’ AND (‘Semantic Web’ OR ‘data web’) “ – Candidate 2: All publications in the proceeding  Target conference: ISWC, ESWC, WWW, NLDB, QALD challenge • Exclusion Criteria – Published before November 2010 or after July 2015 – Not related to SQA • Result – 72 publications describing 62 distinct SQA systems.  (39 of them from candidate 1, 33 of them form candidate 2)
  • 6. Page 6 / 20 7 Challenges • Lexical Gap • Ambiguity • Multilingualism • Complex Queries • Distributed Knowledge • Procedural, Temporal and Spatial Questions • Templates Number of publications per year addressed challenge
  • 7. Page 7 / 20 Lexical Gap • The vocabulary used in a question is different from the one used in the labels of the knowledge base. (linking problem) – Different form of the same word  (run <-> running, ran), (running <-> runnign, runing) – Different form of the similar meaning  Synonyms (run <-> sprint)  hyper-hyponym pair (chemical process - photosynthesis) – Different phrases of the same RDF property  “What is the population of A”, “How many people are there in A?” -> ‘population’
  • 8. Page 8 / 20 Lexical Gap - Different form of the same word • String normalization – Conversion to lower case or to base form  Stemming, Lemmatizing (running, ran -> run) • Similarity functions – Quantifying similarity using a function and a threshold can be applied  Jaro-Winkler distance  Edit-distance  Largest common substring
  • 9. Page 9 / 20 Lexical Gap - Different form of the similar meaning • Automatic Query Expansion – Using additional labels from lexical databases such as WordNet – Increase recall but lead to mismatches between related words and thus can decrease the precision. WordNet
  • 10. Page 10 / 20 Lexical Gap - Different phrases of the same RDF property • Pattern libraries – BOA [Gerber et al.] generates patterns for RDF predicates from corpus and a knowledge base  E.g. (:writing, “X wrote Y”), (:writer, “X is written by Y”), (:population, “How many people are there in X?”) – PARALEX [Fader et al.] PARALEX’s examples of paraphrase from the QA dataset (Wikianswers) PARALEX’s examples of lexical entries Natural Language Question: How big is nyc? Formal query: Population(?, new-york) Learning
  • 11. Page 11 / 20 Ambiguity • The phenomenon of the same phrase having different meanings. – Homonymy: same string refers to different concepts  (money) bank vs. (river) bank – Polysemy: same string refers to different but related concepts  bank (as a company) vs. bank (as a building). “이동국” in Adam KB
  • 12. Page 12 / 20 Ambiguity - Disambiguation • Resource-based methods – Ranking the candidate RDF resources based of their properties and the connections between them – gAnswer [Huang et al.] Q: Who was married to an actor that played in Philadelphia? Subgraph matching
  • 13. Page 13 / 20 Complex Queries • Complex Queries – Requiring multiple facts, certain restriction, aggregation, filtered results…  E.g., Comparison, yes/no, quantifiers, superlatives – PYTHIA [Unger et al.] constructs formal query even for complex query using ontology-based grammar
  • 14. Page 14 / 20 Templates • (1) Template-based approach – Map input questions to either manually or automatically created SPARQL query templates • (2) Template-free approach – Build SPARQL queries based on the given syntactic structure of the input question. Template-based approach: TBSL [Unger et al.] Template-free approach: Xser [Xu et al.]
  • 15. Page 15 / 20 Others • Multilingualism – SQA systems that can handle multiple input languages, which may even differ from the language used to encode the knowledge. • Distributed Knowledge – Some questions are only answerable with multiple knowledge bases • Procedural Questions – E.g. How question (step-by-step instructions) • Temporal Question – E.g. Temporal question on clinical narratives • Spatial Questions – E.g. Relationship of locations such as crossing, inclusion and nearness.
  • 16. Page 16 / 20 7 Challenges in Adam QA • Lexical Gap – String normalization, similarity function, synonyms -> available – Patterns for RDF predicates -> unavailable  Current: string matching • Ambiguity – Ranking the candidate RDF resources -> Available (but naïve approach)  Current: resources are ranked by the number of triples
  • 17. Page 17 / 20 7 Challenges in Adam QA • Complex Queries – Comparisons, yes/no, superlatives, quantifiers -> partially available • Templates – Template-based approach -> available – Template-free approach -> soon (GBQA?)
  • 18. Page 18 / 20 7 Challenges in Adam QA • Multilingualism – Unavailable • Distributed Knowledge – Unavailable • Procedural, Temporal and Spatial Questions – Partially available
  • 19. Page 19 / 20 Conclusion • Analyzing 62 systems and their contributions to seven challenges for SQA systems. • Recommendation on future SQA system – Modularization & Reusing existing parts – Benchmarking single algorithmic modules instead of benchmarking a system as a whole.
  • 20. Page 20 / 20 Thank you.