SlideShare a Scribd company logo
1 of 9
Download to read offline
A fully-automatic approach
 to answer geographic queries:
       GIRSA-WP at GikiP

   Johannes Leveling           Sven Hartrumpf

Intelligent Information and Communication Systems (IICS)
       University of Hagen (FernUniversität in Hagen)
                   58084 Hagen, Germany
       firstname.lastname@fernuni-hagen.de
GIRSA-WP

 J. Leveling,
S. Hartrumpf
                                                                      Main idea
Main idea

GIRSA-WP
                            InSicht (Hartrumpf, 2005)
Semantic filter                    • open-domain QA system
Experiments                       • based on matching semantic network representations
and Results                         of question and documents
Conclusions                       • supports question decomposition
References                          e.g. temporal or geographical constraints
                       + GIRSA (Leveling and Hartrumpf, 2008)
                                  • textual GIR system
                                  • supports methods to boost recall
                                    e.g. normalizing location indicators
                                  • supports methods to boost precision
                                    e.g. metonymy recognition
                       = GIRSA-WP (GIRSA for Wikipedia)
                                  • automatic combination of InSicht and GIRSA



      J. Leveling, S. Hartrumpf                            GIRSA-WP                      2/9
GIRSA-WP

 J. Leveling,
S. Hartrumpf
                                                                GIRSA-WP
Main idea

GIRSA-WP

Semantic filter

Experiments
and Results
                        • applies semantic filter on answer candidates
Conclusions             • merges results from InSicht and GIRSA by using the
References                  maximum score of documents
                        • returns list of Wikipedia article names
                        • simple multilingual approach:
                            follow German Wikipedia links to articles in English and
                            Portuguese




      J. Leveling, S. Hartrumpf                      GIRSA-WP                      3/9
GIRSA-WP

 J. Leveling,
S. Hartrumpf
                                                      Semantic filter (1/2)
Main idea

GIRSA-WP
                        • in QA: check expected answer type of answer
Semantic filter
                          candidates
Experiments
and Results             • for GIRSA-WP: check semantic answer types
Conclusions               (semantic sort and features, see Helbig (2006))
References                        • extract word representing the answer type from topic
                                    title and description (the first noun not a proper noun)
                                  • parse these words with WOCADI, a syntactico-semantic
                                    parser (includes a disambiguation of words) and find
                                    semantic features corresponding to the extracted words
                                  • parse the answer candidates (titles of Wikipedia
                                    articles) and determine their semantic features
                                  • test if unification of semantic features succeeds;
                                    discard answer candidate, otherwise



      J. Leveling, S. Hartrumpf                            GIRSA-WP                           4/9
GIRSA-WP

 J. Leveling,
S. Hartrumpf
                                                      Semantic filter (2/2)
Main idea

GIRSA-WP
                        • Which Swiss cantons border Germany?
Semantic filter

Experiments
                            → extracted word: cantons
and Results             • parse result: corresponding concept is canton
Conclusions
                            • artificial geographical entity or regional institution
References
                            • legal-person:+, movable:–, etc.
                        • answer candidate Cross-Border-Leasing:
                                  • prototypical-theoretical-concept
                                  • legal-person:–, movable:–
                              → semantic features not unifiable
                        • answer candidate Aargau:
                              → unifiable semantic features




      J. Leveling, S. Hartrumpf                            GIRSA-WP                   5/9
GIRSA-WP

 J. Leveling,
S. Hartrumpf
                                          Experiments and results
Main idea

GIRSA-WP

Semantic filter

Experiments             • six runs submitted:
and Results
                            three with threshold score of 0.01 and
Conclusions

References
                            varied settings for stemming, location name
                            normalization, and noun decompounding;
                            additional three experiments with threshold
                            score of 0.03
                        • 798 (372) answers found
                        • 79 correct answers in best run




      J. Leveling, S. Hartrumpf                     GIRSA-WP              6/9
GIRSA-WP

 J. Leveling,
S. Hartrumpf
                                                 Conclusions (1/2)
Main idea

GIRSA-WP

Semantic filter
                    GikiP topics
Experiments             • are at least as difficult as QA or GeoCLEF topics
and Results

Conclusions
                        • aim at a wider range of expected answer types
References              • include complex geographic relations
                            (GP2: outside, GP4: on the border ),
                            restrictions on measurable properties
                            (GP3: more than, GP13: longer than), and
                            temporal constraints
                            (GP9: Renaissance, GP15: between 1980 and 1990)
                      ⇒ new challenge for QA and GIR community




      J. Leveling, S. Hartrumpf                   GIRSA-WP                    7/9
GIRSA-WP

 J. Leveling,
S. Hartrumpf
                                                    Conclusions (2/2)
Main idea

GIRSA-WP
                        • GIRSA:
Semantic filter
                            • indexing single sentences was meant to ensure a high
Experiments
and Results                    precision (but did not work);
Conclusions                 • geographic entities have not been annotated at all in the
References                     Wikipedia documents
                        • InSicht:
                            • important information is given in tables (like inhabitant
                               numbers), but WOCADI ignores these
                            • the semantic matching approach is still too strict for the
                               IR oriented parts of GikiP queries (similarly for
                               GeoCLEF)
                      ⇒ tasks for future work



      J. Leveling, S. Hartrumpf                       GIRSA-WP                         8/9
GIRSA-WP

 J. Leveling,
S. Hartrumpf
                                                 Selected References
Main idea

GIRSA-WP            Hartrumpf, S. (2005). Question answering using sentence parsing and
Semantic filter         semantic network matching. In Multilingual Information Access for
Experiments            Text, Speech and Images: 5th Workshop of the Cross-Language
and Results            Evaluation Forum, CLEF 2004 (edited by Peters, C.; Clough, P.;
Conclusions            Gonzalo, J.; Jones, G. J. F.; Kluck, M.; and Magnini, B.), volume 3491
References             of LNCS, pp. 512–521. Berlin: Springer.
                    Helbig, H. (2006). Knowledge Representation and the Semantics of
                       Natural Language. Berlin: Springer.
                    Leveling, J. and Hartrumpf, S. (2008). Inferring location names for
                       geographic information retrieval. In Advances in Multilingual and
                       Multimodal Information Retrieval: 8th Workshop of the
                       Cross-Language Evaluation Forum, CLEF 2007 (edited by Peters, C.;
                       Jijkoun, V.; Mandl, T.; Müller, H.; Oard, D. W.; Peñas, A.; Petras, V.;
                       and Santos, D.), volume 5152 of LNCS, pp. 773–780. Berlin:
                       Springer.



      J. Leveling, S. Hartrumpf                         GIRSA-WP                             9/9

More Related Content

Recently uploaded

Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 

Recently uploaded (20)

Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 

Featured

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by HubspotMarius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 

Featured (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

A Fully-automatic Approach to Answer Geographic Queries: GIRSA-WP at GikiP

  • 1. A fully-automatic approach to answer geographic queries: GIRSA-WP at GikiP Johannes Leveling Sven Hartrumpf Intelligent Information and Communication Systems (IICS) University of Hagen (FernUniversität in Hagen) 58084 Hagen, Germany firstname.lastname@fernuni-hagen.de
  • 2. GIRSA-WP J. Leveling, S. Hartrumpf Main idea Main idea GIRSA-WP InSicht (Hartrumpf, 2005) Semantic filter • open-domain QA system Experiments • based on matching semantic network representations and Results of question and documents Conclusions • supports question decomposition References e.g. temporal or geographical constraints + GIRSA (Leveling and Hartrumpf, 2008) • textual GIR system • supports methods to boost recall e.g. normalizing location indicators • supports methods to boost precision e.g. metonymy recognition = GIRSA-WP (GIRSA for Wikipedia) • automatic combination of InSicht and GIRSA J. Leveling, S. Hartrumpf GIRSA-WP 2/9
  • 3. GIRSA-WP J. Leveling, S. Hartrumpf GIRSA-WP Main idea GIRSA-WP Semantic filter Experiments and Results • applies semantic filter on answer candidates Conclusions • merges results from InSicht and GIRSA by using the References maximum score of documents • returns list of Wikipedia article names • simple multilingual approach: follow German Wikipedia links to articles in English and Portuguese J. Leveling, S. Hartrumpf GIRSA-WP 3/9
  • 4. GIRSA-WP J. Leveling, S. Hartrumpf Semantic filter (1/2) Main idea GIRSA-WP • in QA: check expected answer type of answer Semantic filter candidates Experiments and Results • for GIRSA-WP: check semantic answer types Conclusions (semantic sort and features, see Helbig (2006)) References • extract word representing the answer type from topic title and description (the first noun not a proper noun) • parse these words with WOCADI, a syntactico-semantic parser (includes a disambiguation of words) and find semantic features corresponding to the extracted words • parse the answer candidates (titles of Wikipedia articles) and determine their semantic features • test if unification of semantic features succeeds; discard answer candidate, otherwise J. Leveling, S. Hartrumpf GIRSA-WP 4/9
  • 5. GIRSA-WP J. Leveling, S. Hartrumpf Semantic filter (2/2) Main idea GIRSA-WP • Which Swiss cantons border Germany? Semantic filter Experiments → extracted word: cantons and Results • parse result: corresponding concept is canton Conclusions • artificial geographical entity or regional institution References • legal-person:+, movable:–, etc. • answer candidate Cross-Border-Leasing: • prototypical-theoretical-concept • legal-person:–, movable:– → semantic features not unifiable • answer candidate Aargau: → unifiable semantic features J. Leveling, S. Hartrumpf GIRSA-WP 5/9
  • 6. GIRSA-WP J. Leveling, S. Hartrumpf Experiments and results Main idea GIRSA-WP Semantic filter Experiments • six runs submitted: and Results three with threshold score of 0.01 and Conclusions References varied settings for stemming, location name normalization, and noun decompounding; additional three experiments with threshold score of 0.03 • 798 (372) answers found • 79 correct answers in best run J. Leveling, S. Hartrumpf GIRSA-WP 6/9
  • 7. GIRSA-WP J. Leveling, S. Hartrumpf Conclusions (1/2) Main idea GIRSA-WP Semantic filter GikiP topics Experiments • are at least as difficult as QA or GeoCLEF topics and Results Conclusions • aim at a wider range of expected answer types References • include complex geographic relations (GP2: outside, GP4: on the border ), restrictions on measurable properties (GP3: more than, GP13: longer than), and temporal constraints (GP9: Renaissance, GP15: between 1980 and 1990) ⇒ new challenge for QA and GIR community J. Leveling, S. Hartrumpf GIRSA-WP 7/9
  • 8. GIRSA-WP J. Leveling, S. Hartrumpf Conclusions (2/2) Main idea GIRSA-WP • GIRSA: Semantic filter • indexing single sentences was meant to ensure a high Experiments and Results precision (but did not work); Conclusions • geographic entities have not been annotated at all in the References Wikipedia documents • InSicht: • important information is given in tables (like inhabitant numbers), but WOCADI ignores these • the semantic matching approach is still too strict for the IR oriented parts of GikiP queries (similarly for GeoCLEF) ⇒ tasks for future work J. Leveling, S. Hartrumpf GIRSA-WP 8/9
  • 9. GIRSA-WP J. Leveling, S. Hartrumpf Selected References Main idea GIRSA-WP Hartrumpf, S. (2005). Question answering using sentence parsing and Semantic filter semantic network matching. In Multilingual Information Access for Experiments Text, Speech and Images: 5th Workshop of the Cross-Language and Results Evaluation Forum, CLEF 2004 (edited by Peters, C.; Clough, P.; Conclusions Gonzalo, J.; Jones, G. J. F.; Kluck, M.; and Magnini, B.), volume 3491 References of LNCS, pp. 512–521. Berlin: Springer. Helbig, H. (2006). Knowledge Representation and the Semantics of Natural Language. Berlin: Springer. Leveling, J. and Hartrumpf, S. (2008). Inferring location names for geographic information retrieval. In Advances in Multilingual and Multimodal Information Retrieval: 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007 (edited by Peters, C.; Jijkoun, V.; Mandl, T.; Müller, H.; Oard, D. W.; Peñas, A.; Petras, V.; and Santos, D.), volume 5152 of LNCS, pp. 773–780. Berlin: Springer. J. Leveling, S. Hartrumpf GIRSA-WP 9/9