SlideShare a Scribd company logo
1 of 11
Nervo Verdezoto
             University of Trento
      nervo.verdezoto@studenti.unitn.it


  Prof. Laure Vieu and Prof. Alessandro Oltramari
                      Tutors
Application of formal ontology and semantic
 techniques to improve the coherence and
       usability of lexical resources


                                           Master HLTI 2009-2010
Outline

   Objectives and Tasks
      – Data
      – Ontological Principles
      – Experiments
      – Results
   Manual Analysis and discussion
   Summary


                                     Master HLTI 2009-2010
Objectives

•   Get familiar with Ontology-driven
    Conceptual Modeling
•   Develop semi-automatic methods to
    spot semantic/ontological problems in
    WordNet at lower levels
•   Get familiar with scientific reporting



                                    Master HLTI 2009-2010
Tasks


    Study WordNet semantic relations to spot ontological
    problems

    Applications:
    
         RTE
    
         Automatic detection of part-whole relations e.g.
        (atmospheric phenomenon, communication), (shape, artifact),
        (shape, physical phenomenon)




                                                            Master HLTI 2009-2010
The Data

      WordNet: 82115 synsets were examined to collect the initial data, 22187 were
      involved in meronyms and holonyms relations (50% meronyms – 50% holonyms)
    
      Semeval 2007: 89 pairs relations were extracted.

      Additionally, we eliminated the redundant pairs from initial data.


                                       MERONYMS
                      14000


                      12000


                      10000


                       8000                                # PAIRS –
                                                           MERONYMS
                       6000


                       4000


                       2000


                          0
                              MEMBER    PART   SUBSTANCE




                                                                       Master HLTI 2009-2010
Ontological Principles

•   Constraints: part and whole should be of a
    similar nature.
•   DOLCE-ontological distinctions between:
     –   endurants (ED) or physical entities (like a
         dog, a table, a cave, etc.)
     –   perdurants (PD) or eventualities (like a
         lecture, a sleep, a raining, etc.)
     –   abstract (AB, entities like a number, the
         content of a text, etc.).


                                             Master HLTI 2009-2010
Experiments – Tests
     [defining queries]

•   Semantic Constraints
     –    Test 0: Individual – Class pairs:
             •    (great_divide%1:15:00,continental_divide%1:15:00)
     –    Test 4: Meronymy – Member and Member–Collection
          pairs:
             •    (coronal%1:06:00, rose%1:20:00)

•   Ontological Constraints
     –    Test 1: ED–AB (test 1.1) or AB–ED (test 1.2)
             •    Test 1.1: physical entity 1:03:00 (but not process 1:03:00) / abstraction 1:03:00 (but not event
                  1:03:00 + state 1:03:00. (head%1:06:04::,coin%1:21:02::)
     –    Test 2: ED–PD (test 2.1) or PD–ED (test 2.2)
             •    Test 2.1 , physical entity 1:03:00 (but not process 1:03:00) / process 1:03:00 + event 1:03:00 +
                  state 1:03:00. ⟨air%1:27:00, wind%1:19:00⟩

     –    Test 3: PD–AB (test 3.1) or AB–PD (test 3.2)
             •    Test 3.1 , abstraction 1:03:00 – but not event 1:03:00 + state 1:03:00(first all and then without
                  group) / event 1:03:00 + state 1:03:00 + process 1:03:00. ⟨regulation time%1:28:00, athletic
                  game%1:04:00⟩
                                                                                                 Master HLTI 2009-2010
Results
                           Ontological Problems

  180
        163
  160


  140


  120                                             108

  100                                                            WORDNET

                                                                 SEMEVAL
   80


   60
                             45

   40


   20
                       2                   2
   0

              Test 1              Test 2                Test 3




                           Ontological Problems

  180
        163
  160

  140

  120                                             108

  100                                                            W ORDNET

                                                                 SEMEVAL
   80

   60                        45

   40

   20
                       2                   2
   0

              Test 1              Test 2                Test 3



                                                                            Master HLTI 2009-2010
Manual Analysis and discussion

General Errors
•       a synset is considered as a class but should be an individual
    –      Confusion between class and an instance of this class for which the term is used with a specific
           sense e.g., ⟨great_divide%1:15:00,continental_divide%1:15:00⟩
    –      Confusion between class and group e.g., new_testament%1:10:00
•       a synset is not attached to the right place in the taxonomy
    –      Confusion between a property and a physical entity having that property (shape, quantity or
           measure, location) or between a relation and a physical entity being an argument in that relation
           e.g., coin%1:21:02, hay_mow%1:23:00 - calyx%1:20:00, mothball%1:06:00
•       a synset mixes two senses, and the missing sense should be attached elsewhere in the
        taxonomy or this missing sense is an individual, not a class
    –      Confusion between 2 senses of a word, amounting to a missing sense e.g.
           ⟨ethiopian%1:18:00, ethiopia%1:15:00⟩
•     the meronymy relation is wrong
    –    Confusion between meronymy and other relations (location, participation, etc.):
           •   “is located in” - ⟨balkan_wars%1:04:00, balkan_peninsula%1:15:00⟩
           •   “participates in” - ⟨feminist%1:18:00,feminist_movement%1:04:00⟩




                                                                                             Master HLTI 2009-2010
Summary and future work

•       An automatic query system based on ontological
        principles and semantic constraints is effective to build
        semi-automatic methods to spot errors in WordNet
•       Increase the number and type of experiments
•       Exploit the results of this study to:
    –       Develop a semi-automatic tool for ”cleaning-up” WordNet
    –       Design and develop guidelines to help lexicographers
            (Christiane Fellbaum from Princeton WordNet Group) to
            prevent classical ontological mistakes
    –       Evaluation for NLP applications




                                                          Master HLTI 2009-2010
THANK YOU


Nervo Verdezoto D.
                     Master HLTI 2009-2010

More Related Content

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Recently uploaded (20)

MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 

Featured

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Featured (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Application of formal ontology and semantic techniques to improve the coherence and usability of lexical resources

  • 1. Nervo Verdezoto University of Trento nervo.verdezoto@studenti.unitn.it Prof. Laure Vieu and Prof. Alessandro Oltramari Tutors Application of formal ontology and semantic techniques to improve the coherence and usability of lexical resources Master HLTI 2009-2010
  • 2. Outline  Objectives and Tasks – Data – Ontological Principles – Experiments – Results  Manual Analysis and discussion  Summary Master HLTI 2009-2010
  • 3. Objectives • Get familiar with Ontology-driven Conceptual Modeling • Develop semi-automatic methods to spot semantic/ontological problems in WordNet at lower levels • Get familiar with scientific reporting Master HLTI 2009-2010
  • 4. Tasks  Study WordNet semantic relations to spot ontological problems  Applications:  RTE  Automatic detection of part-whole relations e.g. (atmospheric phenomenon, communication), (shape, artifact), (shape, physical phenomenon) Master HLTI 2009-2010
  • 5. The Data  WordNet: 82115 synsets were examined to collect the initial data, 22187 were involved in meronyms and holonyms relations (50% meronyms – 50% holonyms)  Semeval 2007: 89 pairs relations were extracted.  Additionally, we eliminated the redundant pairs from initial data. MERONYMS 14000 12000 10000 8000 # PAIRS – MERONYMS 6000 4000 2000 0 MEMBER PART SUBSTANCE Master HLTI 2009-2010
  • 6. Ontological Principles • Constraints: part and whole should be of a similar nature. • DOLCE-ontological distinctions between: – endurants (ED) or physical entities (like a dog, a table, a cave, etc.) – perdurants (PD) or eventualities (like a lecture, a sleep, a raining, etc.) – abstract (AB, entities like a number, the content of a text, etc.). Master HLTI 2009-2010
  • 7. Experiments – Tests [defining queries] • Semantic Constraints – Test 0: Individual – Class pairs: • (great_divide%1:15:00,continental_divide%1:15:00) – Test 4: Meronymy – Member and Member–Collection pairs: • (coronal%1:06:00, rose%1:20:00) • Ontological Constraints – Test 1: ED–AB (test 1.1) or AB–ED (test 1.2) • Test 1.1: physical entity 1:03:00 (but not process 1:03:00) / abstraction 1:03:00 (but not event 1:03:00 + state 1:03:00. (head%1:06:04::,coin%1:21:02::) – Test 2: ED–PD (test 2.1) or PD–ED (test 2.2) • Test 2.1 , physical entity 1:03:00 (but not process 1:03:00) / process 1:03:00 + event 1:03:00 + state 1:03:00. ⟨air%1:27:00, wind%1:19:00⟩ – Test 3: PD–AB (test 3.1) or AB–PD (test 3.2) • Test 3.1 , abstraction 1:03:00 – but not event 1:03:00 + state 1:03:00(first all and then without group) / event 1:03:00 + state 1:03:00 + process 1:03:00. ⟨regulation time%1:28:00, athletic game%1:04:00⟩ Master HLTI 2009-2010
  • 8. Results Ontological Problems 180 163 160 140 120 108 100 WORDNET SEMEVAL 80 60 45 40 20 2 2 0 Test 1 Test 2 Test 3 Ontological Problems 180 163 160 140 120 108 100 W ORDNET SEMEVAL 80 60 45 40 20 2 2 0 Test 1 Test 2 Test 3 Master HLTI 2009-2010
  • 9. Manual Analysis and discussion General Errors • a synset is considered as a class but should be an individual – Confusion between class and an instance of this class for which the term is used with a specific sense e.g., ⟨great_divide%1:15:00,continental_divide%1:15:00⟩ – Confusion between class and group e.g., new_testament%1:10:00 • a synset is not attached to the right place in the taxonomy – Confusion between a property and a physical entity having that property (shape, quantity or measure, location) or between a relation and a physical entity being an argument in that relation e.g., coin%1:21:02, hay_mow%1:23:00 - calyx%1:20:00, mothball%1:06:00 • a synset mixes two senses, and the missing sense should be attached elsewhere in the taxonomy or this missing sense is an individual, not a class – Confusion between 2 senses of a word, amounting to a missing sense e.g. ⟨ethiopian%1:18:00, ethiopia%1:15:00⟩ • the meronymy relation is wrong – Confusion between meronymy and other relations (location, participation, etc.): • “is located in” - ⟨balkan_wars%1:04:00, balkan_peninsula%1:15:00⟩ • “participates in” - ⟨feminist%1:18:00,feminist_movement%1:04:00⟩ Master HLTI 2009-2010
  • 10. Summary and future work • An automatic query system based on ontological principles and semantic constraints is effective to build semi-automatic methods to spot errors in WordNet • Increase the number and type of experiments • Exploit the results of this study to: – Develop a semi-automatic tool for ”cleaning-up” WordNet – Design and develop guidelines to help lexicographers (Christiane Fellbaum from Princeton WordNet Group) to prevent classical ontological mistakes – Evaluation for NLP applications Master HLTI 2009-2010
  • 11. THANK YOU Nervo Verdezoto D. Master HLTI 2009-2010