SlideShare a Scribd company logo
1 of 17
Download to read offline
Approximate subgraph matching
 for detection of topic variations

                Mitja Trampuš
                Dunja Mladenić
                AI Lab, Jožef Stefan Institute, Slovenia




                                        ailab.ijs.si
Mining Diversity
• Web content varies in many aspects, e.g.
  o   Topical
  o   Social (author, target audience, people written about)
  o   Geographical (publisher, places written about)
  o   Sentiment (positive/negative)
  o   Writing style (structure, vocabulary)
  o   Coverage bias


• This work: (micro-)topical diversity
  o Macroscopic = largely solved
  o Microscopic = challenge

                                            AI Lab, Jozef Stefan Institute
                                                               ailab.ijs.si
Task:
Given a collection of texts on a topic,
• identify a common template
• align texts to the template




                                 AI Lab, Jozef Stefan Institute
                                                    ailab.ijs.si
Template representation
• Syntactic
  o info1: X people were killed / killed X people / resulted in X
    casualties
  o info2: blew up Y / destroyed Y / attacked in a Y


• Semantic
  o kill(bomber, people); count(people, X)
  o destroy(bomber, Y)



             X   count
                              kill bomber destroy     Y
                     people
                                             AI Lab, Jozef Stefan Institute
                                                                ailab.ijs.si
car
                                             drive
                 12      count
                                      kill suicide destroychurch
                            believers
                                           bomber
                      exit run                wear vest
Semantic Graph

                                  treatment           execute             attack
Prerequisite:



                               receive                        withstand
                 100    count
                                         kill terroristdemolish
                                                              hospital
                            patients




                  2     count
                             police slaughter     blow uppolice
                                            bomber
                             officer                     station

                                                     AI Lab, Jozef Stefan Institute
                                                                        ailab.ijs.si
Mining Templates
 • Template := subgraph with frequent
               specializations
     o Specializations implied by background taxonomy
       (WordNet)
     o Threshold frequency manually defined

                 X      count
                                        kill bomberdestroy building
                            people




12                             tear
     count             suicide down              2
        believers kill              church           count
                                                          police slaughter           police
                                                                               blow up
                       bomber                                            bomber
                                                         officer                     station
                                                         AI Lab, Jozef Stefan Institute
                                                                            ailab.ijs.si
Semantic Graph Construction
1. Data: Google News crawl
2. HTML cleanup
3. Named entity tagging
4. Pronoun resolution (he/she/him/her)
5. Named entity consolidation (Barack Obama
   vs President Obama)
6. Parsing, triple/fact/assertion extraction
   (for now: subj-verb-obj only)
7. Ontology/taxonomy alignment
8. Merging triples into a graph
                             AI Lab, Jozef Stefan Institute
                                                ailab.ijs.si
Approximate subgraph matching
                       number count
                                              kill person destroy
                                                                location
                                  people




                                                                     FREQUENT
                                                                      SUBTREE MINING

         count
number
             people   kill person destroylocation      number count                            destroy
                                                                                kill person           location
                                                                  people




                                                                                         GENERALIZE


  12     count                     tear
                           suicide down                   2    count      slaughter      blow up
            believers kill              church                      police                     police
                           bomber                                                  bomber
                                                                   officer                     station
                                                                    AI Lab, Jozef Stefan Institute
                                                                                       ailab.ijs.si
Approximate subgraph matching
                   number count
                                        kill person destroy
                                                          location
                              people




                                                  SPECIALIZE


                   number count
                                        kill bomberdestroy
                                                         building
                              people




12                             tear
     count             suicide down                 2
        believers kill              church                count
                                                               police slaughter           police
                                                                                    blow up
                       bomber                                                 bomber
                                                              officer                     station
                                                              AI Lab, Jozef Stefan Institute
                                                                                 ailab.ijs.si
Preliminary results
• 5 test domains; for each:
  o ~10 graphs, ~10000 nodes
  o 10-60 seconds


• At min. support 30%
  o 20 maximal patterns, 9 manually judged as interesting




                                          AI Lab, Jozef Stefan Institute
                                                             ailab.ijs.si
AI Lab, Jozef Stefan Institute
                   ailab.ijs.si
Conclusion
• Future work:
  o Mapping text -> semantics
      • Other ontologies?
  o Interestingness measure for assertions and patterns
  o Evaluation (precision, recall; multiple domains)
  o Alternative approaches to generalizing subgraphs


• Template extraction is achievable,
  but not easy
• Human filtering of results hard to avoid
• Current approach reasonably fast
                                           AI Lab, Jozef Stefan Institute
                                                              ailab.ijs.si
Thank you.
AI Lab, Jozef Stefan Institute
                   ailab.ijs.si
Can we extract all relations?
                           Kind of …
Thousands of small quakes
resumed 18 months ago and
continue to rattle Mammoth
Lakes, June Lake and other Mono
County resort towns. The
temblors, most measuring 1 to 3
on the Richter scale, started
beneath Mammoth Mountain.

Subject – Verb – Object
       Triplets

                            Semantic Graph
AI Lab, Jozef Stefan Institute
                   ailab.ijs.si
Templates - why
• Interpret content
   o news archives: structure/annotate old texts, enable
     semantic search
   o wikipedia: suggestions for infobox entries


• Generate content
   o wikipedia: a starting point for new articles / a checklist of
     information to be included




• No normative definition of “good Jozef Stefanailab.ijs.si
                                AI Lab,
                                        template”
                                                Institute
Evaluation
• Qualitative
  o Usage-specific
  o Not useful for tuning algorithms


• Quantitative
  o Precision
  o Recall




                                       AI Lab, Jozef Stefan Institute
                                                          ailab.ijs.si

More Related Content

More from RENDER project

Internals Of An Aggregated Web News Feed
Internals Of An Aggregated Web News FeedInternals Of An Aggregated Web News Feed
Internals Of An Aggregated Web News FeedRENDER project
 
Unterstützungswerkzeuge für Wikipedia
Unterstützungswerkzeuge für WikipediaUnterstützungswerkzeuge für Wikipedia
Unterstützungswerkzeuge für WikipediaRENDER project
 
Render Review: Wikipedia Case Study, Year 1
Render Review: Wikipedia Case Study, Year 1Render Review: Wikipedia Case Study, Year 1
Render Review: Wikipedia Case Study, Year 1RENDER project
 
Wiki case study - Review year 1
Wiki case study  - Review year 1Wiki case study  - Review year 1
Wiki case study - Review year 1RENDER project
 
Towards a diversity-minded Wikipedia
Towards a diversity-minded WikipediaTowards a diversity-minded Wikipedia
Towards a diversity-minded WikipediaRENDER project
 
Diversiweb2011 02 Opening- Devika P. Madalli
Diversiweb2011 02 Opening- Devika P. MadalliDiversiweb2011 02 Opening- Devika P. Madalli
Diversiweb2011 02 Opening- Devika P. MadalliRENDER project
 
Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...
Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...
Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...RENDER project
 
Diversiweb2011 04 Expressing Opinion Diversity - Delia Rusu
Diversiweb2011 04 Expressing Opinion Diversity - Delia RusuDiversiweb2011 04 Expressing Opinion Diversity - Delia Rusu
Diversiweb2011 04 Expressing Opinion Diversity - Delia RusuRENDER project
 
Diversiweb2011 01 Opening - Elena Simperl
Diversiweb2011 01 Opening - Elena SimperlDiversiweb2011 01 Opening - Elena Simperl
Diversiweb2011 01 Opening - Elena SimperlRENDER project
 
Data Collection and Integration, Linked Data Management
Data Collection and Integration, Linked Data ManagementData Collection and Integration, Linked Data Management
Data Collection and Integration, Linked Data ManagementRENDER project
 

More from RENDER project (12)

Internals Of An Aggregated Web News Feed
Internals Of An Aggregated Web News FeedInternals Of An Aggregated Web News Feed
Internals Of An Aggregated Web News Feed
 
Unterstützungswerkzeuge für Wikipedia
Unterstützungswerkzeuge für WikipediaUnterstützungswerkzeuge für Wikipedia
Unterstützungswerkzeuge für Wikipedia
 
Render Review: Wikipedia Case Study, Year 1
Render Review: Wikipedia Case Study, Year 1Render Review: Wikipedia Case Study, Year 1
Render Review: Wikipedia Case Study, Year 1
 
Wiki case study - Review year 1
Wiki case study  - Review year 1Wiki case study  - Review year 1
Wiki case study - Review year 1
 
Towards a diversity-minded Wikipedia
Towards a diversity-minded WikipediaTowards a diversity-minded Wikipedia
Towards a diversity-minded Wikipedia
 
Diversiweb2011 02 Opening- Devika P. Madalli
Diversiweb2011 02 Opening- Devika P. MadalliDiversiweb2011 02 Opening- Devika P. Madalli
Diversiweb2011 02 Opening- Devika P. Madalli
 
Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...
Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...
Diversiweb2011 05 Scalable Detection of Sentiment-Based Contradictions - Mika...
 
Diversiweb2011 04 Expressing Opinion Diversity - Delia Rusu
Diversiweb2011 04 Expressing Opinion Diversity - Delia RusuDiversiweb2011 04 Expressing Opinion Diversity - Delia Rusu
Diversiweb2011 04 Expressing Opinion Diversity - Delia Rusu
 
Diversiweb2011 01 Opening - Elena Simperl
Diversiweb2011 01 Opening - Elena SimperlDiversiweb2011 01 Opening - Elena Simperl
Diversiweb2011 01 Opening - Elena Simperl
 
Data Collection and Integration, Linked Data Management
Data Collection and Integration, Linked Data ManagementData Collection and Integration, Linked Data Management
Data Collection and Integration, Linked Data Management
 
RENDER Telefonica
RENDER TelefonicaRENDER Telefonica
RENDER Telefonica
 
Defining Diversity
Defining DiversityDefining Diversity
Defining Diversity
 

Recently uploaded

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 

Recently uploaded (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 

Diversiweb2011 07 Approximate subgraph matching - Mitja Trampus

  • 1. Approximate subgraph matching for detection of topic variations Mitja Trampuš Dunja Mladenić AI Lab, Jožef Stefan Institute, Slovenia ailab.ijs.si
  • 2. Mining Diversity • Web content varies in many aspects, e.g. o Topical o Social (author, target audience, people written about) o Geographical (publisher, places written about) o Sentiment (positive/negative) o Writing style (structure, vocabulary) o Coverage bias • This work: (micro-)topical diversity o Macroscopic = largely solved o Microscopic = challenge AI Lab, Jozef Stefan Institute ailab.ijs.si
  • 3. Task: Given a collection of texts on a topic, • identify a common template • align texts to the template AI Lab, Jozef Stefan Institute ailab.ijs.si
  • 4. Template representation • Syntactic o info1: X people were killed / killed X people / resulted in X casualties o info2: blew up Y / destroyed Y / attacked in a Y • Semantic o kill(bomber, people); count(people, X) o destroy(bomber, Y) X count kill bomber destroy Y people AI Lab, Jozef Stefan Institute ailab.ijs.si
  • 5. car drive 12 count kill suicide destroychurch believers bomber exit run wear vest Semantic Graph treatment execute attack Prerequisite: receive withstand 100 count kill terroristdemolish hospital patients 2 count police slaughter blow uppolice bomber officer station AI Lab, Jozef Stefan Institute ailab.ijs.si
  • 6. Mining Templates • Template := subgraph with frequent specializations o Specializations implied by background taxonomy (WordNet) o Threshold frequency manually defined X count kill bomberdestroy building people 12 tear count suicide down 2 believers kill church count police slaughter police blow up bomber bomber officer station AI Lab, Jozef Stefan Institute ailab.ijs.si
  • 7. Semantic Graph Construction 1. Data: Google News crawl 2. HTML cleanup 3. Named entity tagging 4. Pronoun resolution (he/she/him/her) 5. Named entity consolidation (Barack Obama vs President Obama) 6. Parsing, triple/fact/assertion extraction (for now: subj-verb-obj only) 7. Ontology/taxonomy alignment 8. Merging triples into a graph AI Lab, Jozef Stefan Institute ailab.ijs.si
  • 8. Approximate subgraph matching number count kill person destroy location people FREQUENT SUBTREE MINING count number people kill person destroylocation number count destroy kill person location people GENERALIZE 12 count tear suicide down 2 count slaughter blow up believers kill church police police bomber bomber officer station AI Lab, Jozef Stefan Institute ailab.ijs.si
  • 9. Approximate subgraph matching number count kill person destroy location people SPECIALIZE number count kill bomberdestroy building people 12 tear count suicide down 2 believers kill church count police slaughter police blow up bomber bomber officer station AI Lab, Jozef Stefan Institute ailab.ijs.si
  • 10. Preliminary results • 5 test domains; for each: o ~10 graphs, ~10000 nodes o 10-60 seconds • At min. support 30% o 20 maximal patterns, 9 manually judged as interesting AI Lab, Jozef Stefan Institute ailab.ijs.si
  • 11. AI Lab, Jozef Stefan Institute ailab.ijs.si
  • 12. Conclusion • Future work: o Mapping text -> semantics • Other ontologies? o Interestingness measure for assertions and patterns o Evaluation (precision, recall; multiple domains) o Alternative approaches to generalizing subgraphs • Template extraction is achievable, but not easy • Human filtering of results hard to avoid • Current approach reasonably fast AI Lab, Jozef Stefan Institute ailab.ijs.si
  • 13. Thank you. AI Lab, Jozef Stefan Institute ailab.ijs.si
  • 14. Can we extract all relations? Kind of … Thousands of small quakes resumed 18 months ago and continue to rattle Mammoth Lakes, June Lake and other Mono County resort towns. The temblors, most measuring 1 to 3 on the Richter scale, started beneath Mammoth Mountain. Subject – Verb – Object Triplets Semantic Graph
  • 15. AI Lab, Jozef Stefan Institute ailab.ijs.si
  • 16. Templates - why • Interpret content o news archives: structure/annotate old texts, enable semantic search o wikipedia: suggestions for infobox entries • Generate content o wikipedia: a starting point for new articles / a checklist of information to be included • No normative definition of “good Jozef Stefanailab.ijs.si AI Lab, template” Institute
  • 17. Evaluation • Qualitative o Usage-specific o Not useful for tuning algorithms • Quantitative o Precision o Recall AI Lab, Jozef Stefan Institute ailab.ijs.si