SlideShare a Scribd company logo
1 of 30
Semantic Need
Guiding Metadata Annotations by Questions People #ask
Hans-Jörg Happel, FZI Karlsruhe, Germany
2010-11-09 @ 9th Int. Semantic Web Conference (ISWC 2010), Shanghai, China
Agenda
• Introduction
• Semantic Gap Heuristics
• Semantic MediaWiki Study
• Extension:Semantic Need
• Summary & Outlook
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 2
• SMW is a popular Semantic Web application that
allows to annotate Wiki pages semantically
• Semantic interpretation of the existing Wiki categories
• Syntax extension for [[Wiki links]]
– Relations to other pages: [[Capital::Abuja]]
– Literals: [[Inhabitants::182418]]
Semantic MediaWiki (SMW)
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 3
Structured Queries in SMW
• SMW also allows for structured queries
{{#ask:
[[Category:Country]] [[OnContinent::Africa]]
|?area
|?...
}}
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 4
 SMW resembles the Semantic Web in small
SMW Query Result
{{#ask:
[[Category:Country]] [[OnContinent::Africa]]
|?area
|?...
}}
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 5
???
…?
What happend to „Nigeria“?
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 6
Info might
be missing
…not
annotated
properly
Different
property
name
Distributed
data source
not available
Semantic Gaps
• Observation:
– „Semantic gap between supply and demand on the Semantic
Web” [Mik09]
– Due to the evolutionary nature of the (Semantic) Web (OWA)
• What is missing? – i.e.:
– KB: Axioms that are known (e.g. statements about Nigeria)
– XKB: Axioms not yet known but people would like to know
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 7
Towards Semantic Need
• Research questions
– How to identify „Semantic Gaps“?
– Do „Semantic Gaps“ exist?
– If yes, how to close these gaps?
• Research approach
– Propose heuristics
– Explorative: Analyze Public Semantic Web
– Constructive: Design and evaluate tools
88Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China
Agenda
• Introduction
• Semantic Gap Heuristics
• Semantic MediaWiki Study
• Extension:Semantic Need
• Summary & Outlook
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 9
Idea: Guide Annotation by Information Needs
• Means for deriving information needs
– (Structured) queries
– Information access/browsing
– Context
– …?
• We chose to focus on queries
– Explicit; can be captured easily
– Express a „demand“ [Mik09]
– Recur across time and different people (at least in
IR! [Smy05, Tee06, Zha09])
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 10
Identifying „Semantic Gaps“
• Focus on
– Conjuctive queries
– Retrieving instances and their properties
• Core elements
{{#ask: [[Category:Country]]
[[OfContinent::Africa]]
|?hasArea
|?population
|?hasCapital
|?Currency
}}
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 11
Printout
Statement
Printout
Statement
ConditionsConditions
Semantic Gap Heuristic #1: Near Matches
• Instance I ∈ KB is considered a near match of a
query q if:
– I is not in the result set of q in KB
– There is at least one conjunctive query atom of q for
which I is part of the result set
– I would be in the result set of q in KB ∪ XKB
• Correspondingly, we consider q to have an
incomplete result set if it has „near matches“
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 12
Semantic Gap Heuristic #1: Example
{{#ask:
[[Category:Country]] [[OnContinent::Africa]]
|?area
|?...
}}
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 13
Egypt 1.001.449 km2 83.082.869 Cairo Egyptian pound
Lacks annotation
[[OnContinent::Africa]]
„Near Match“
Lacks annotation
[[OnContinent::Africa]]
„Near Match“
Semantic Gap Heuristic #2: Missing Printout Values
• Instance I ∈ KB is considered to have missing
printout values for a query q if:
– I is part of the result set of q
– q contains a printout statement x for which no property
value of I exists in the KB
• Note: Technically, „missing printout values“ can be
considered equivalent to near matches
– SPARQL requires „OPTIONAL“ modifier to yield missing
printout values
– SMW-QL allows to set printout values required
• Correspondigly, we consider q to have an sparse
result set if it has at least one „missing printout
value“
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 14
Semantic Gap Heuristic #2: Example
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 15
„Missing Printout
Values“
„Missing Printout
Values“
Agenda
• Introduction
• Semantic Gap Heuristics
• Semantic MediaWiki Study
• Extension:Semantic Need
• Summary & Outlook
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 16
Public SMW Analysis: Design
• Goal
– Do „Semantic Gaps“ exist?
– Find out significance of missing result values and near
matches in real world queries
• Crawling public SMW installations
– Collected ~200 public SMW installations via overview lists and
search engines
– Selection of 8 SMW instances (filtered based on data and
technical reasons and random choice)
– Those have on average 1880 annotations and 35 inline
queries
• Checking for sparse & incomplete query results
– Analyzing 25 (out of 285) queries (only ASK-Queries, online
"Table"-output format, only queries with printout statements
resp. conjunctions)
– 17 of these queries were located on Template pages
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 17
Public SMW Analysis: Results
• Printout-Values
– In average, 16% of cells in a result set were empty due to
missing annotations (up to 63% for certain queries)
 Allows for identifying a total of 296 missing printout values
– Validation showed that 13 out of 15 manually investigated
empty cells could be considered missing information
• Near matches
– In average, 22% of all potential result pages of a query lack a
selective annotation (up to 94% for certain queries)
Allows for identifying a total of 147 potentially missing
annotations for “selective” properties
– Validation showed that 10 out of 15 manually investigated
near matches could be considered missing information
• Note: based on evaluation conditions, only around 9%
of the overall inline queries were analyzed
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 18
Agenda
• Introduction
• Semantic Gap Heuristics
• Semantic MediaWiki Study
• Extension:Semantic Need
• Summary & Outlook
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 19
Extension:Semantic Need: Idea
• Goal:
– How to close „Semantic Gaps“?
– Guide the creation of semantic annotations in SMW
• Design principles
– „Need-driven Knowledge Sharing“ [Hap09b]
– People are willing to contribute missing information, if
they recognize that there is concrete demand
– Derived from related work and supported by user studies
• Core features
– Capture and store needs (i.e. #ask-queries)
– Guide annotations by extending and modifying the SMW
user interface based on information need heuristics (i.e.
„near matches“ and „missing printout values“
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 2020
Screenshot: In-Page Annotation
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 22
HintHint Sources of needSources of need
Semantic Need Online Survey: Design
• 34 questions on SMW and Semantic Need
• Target group: SMW experts (via mailinglist,
invitation)
• Data collected in June/July 2010
• 30 complete answers (out of 58)
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 23
Semantic Need Online Survey: Semantic Need can help
• Problem patterns do occur
– Sparse result set: 12/30 considered
problematic
– Incomplete result set: 23/30 considered
problematic
• Stressed in free text
• Core issue: „invisibility“ of the issue
• Usage of SMW differs
– „Structured“ settings  focus on quality
– „Open“ settings  focus on guidance
– Semantic Need generally considered helpful by
both groups
24Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China
Maintenance practices: mostly ad hoc
• Methods & tools used to maintain semantic data
– (7: n.a.; due to given external data model)
– 12: none
– 5: „simple“
– 7: „advanced“ (e.g. scripts, documentation, team
decisions)
• How to find missing annotations for a given page
– 6: Compare similar pages („extensional“)
– 7: Check schema („intensional“)
– 4: Text analysis
– 10: Use query
25Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China
Agenda
• Introduction
• Semantic Gap Heuristics
• Semantic MediaWiki Study
• Extension:Semantic Need
• Summary & Outlook
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 26
Insights
• „Semantic Gaps“ do exist
– Information needs are a valuable source to
find them
– „Missing printout values“ and „near
matches“ seem to be useful heuristics
– Especially „incomplete result sets“ are
considered problematic
• No systematic guidance & gardening of
SMW knowledge bases
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 27
Design Implications
• Semantic Annotation
– Issue: Costly, often driven by a pre-defined ontology
structure
– Idea: Consider “incentives for annotation” [Han05]
• Semantic Search
– Issue: Decoupling of provision & access
– Idea: Consider information needs
• Need specification/ontology
• Maintain semantic query logs
• Data Quality/Gardening/Maturing
– Issue: The Semantic Web evolves continuously
– Idea: Allow for better data quality modeling
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 28
Summary & Outlook
• Main contributions
– How to identify „Semantic Gaps“  Heuristics based on
queries
– Do „Semantic Gaps“ exist?  Yes
– If yes, how to close these gaps?  Semantic Need
• Next steps
– Large scale analysis of „Semantic Gaps“ (more public SMW
instances)
– Provide stable implementation und gather feedback from
field usage of Semantic Need
• Further research opportunities
– Use needs to guide the sharing of semantic annotations
– Use needs to create schema-level mappings or for
class/property evolution
– Many more (Semantic query logs, UI, Incentives, …)
29Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China
References
• Extension:Semantic Need
– http://amazonas.fzi.de/semanticneed / (Demo Wiki)
– http://www.mediawiki.org/wiki/Extension:Semantic_Need
• Extension:Woogle4MediaWiki (for non SMW-Wikis)
– http://amazonas.fzi.de/wooglenative/ (Demo Wiki)
– http://www.mediawiki.org/wiki/Extension:Woogle4MediaWiki
• Literature
– [Han05] Handschuh, Siegfried: Creating ontology-based metadata by annotation for the semantic web,
Dissertation, 2005
– [Hap09b] Hans-Jörg Happel: Towards Need-driven Knowledge Sharing in Distributed Teams. In
Proceedings of the 9th International Conference on Knowledge Management (I-KNOW 2009)
– [Hap09c] Hans-Jörg Happel: Social Search and Need-driven Knowledge Sharing in Wikis with Woogle. In
Proceedings of the 5th international Symposium on Wikis and Open Collaboration (Orlando, Florida,
October 25 - 27, 2009). WikiSym '09. ACM, New York, NY, 1-10.
– [Mik09]: Mika, P., Meij, E., Zaragoza, H.: Investigating the semantic gap through query log analysis. In:
International Semantic Web Conference. Lecture Notes in Computer Science, vol. 5823, pp. 441–455.
Springer (2009)
– [Smy05] Smyth, Barry ; Balfe, Evelyn ; Freyne, Jill ; Briggs, Peter ; Coyle, Maurice ; Boydell, Oisin:
Exploiting Query Repetition and Regularity in an Adaptive Community-Based Web Search Engine. In:
User Modeling and User-Adapted Interaction 14 (2005), Nr. 5, S. 383–423.
– [Tee06] Teevan, Jaime ; Adar, Eytan ; Jones, Rosie ; Potts, Michael: History repeats itself: repeat
queries in Yahoo’s logs. In: SIGIR’06: Proceedings of the 29th annual international ACM SIGIR
conference on Research and development in information retrieval. New York, NY, USA : ACM, 2006, S.
703–704.
– [Zha09] Zhang, Dell ; Lu, Jinsong: What queries are likely to recur in web search? In: SIGIR ’09:
Proceedings of the 32nd international ACM SIGIR conference on Research and development in
information retrieval. New York, NY, USA : ACM, 2009, S. 827–828.
Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 30
The Semantic Web: Problems
• Lack of resources: you might not annotate everything
– Metadata creation is costly
– Access to metadata might be restricted to different spheres of sharing
(private, friends, world…)
– “..probably the most important [open question] for the Semantic Web. How
to create incentives for annotation?” (Handschuh 2005, p198) [12]
• Lack of guidance: you might annotate the wrong things
– „ Semantic gap between supply and demand on the Semantic Web” [Mik09]
– The two processes of metadata creation and metadata use are decoupled
concerning time and actors
– Existing annotation approaches drive the annotation process by the pre-
defined ontology structure
 No unified theory, why metadata is created and how it is shared
– Semantic Web Vision does not address the creator side of metadata –
it spends a lot of effort to convince people using the Semantic Web but not
contributing to it
31Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China

More Related Content

What's hot

Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosOCLC
 
IKHarvester - Informal Knowledge Harvester
IKHarvester - Informal Knowledge HarvesterIKHarvester - Informal Knowledge Harvester
IKHarvester - Informal Knowledge HarvesterJaroslaw Dobrzanski
 
Delivering Linked Data Training to Data Science Practitioners
Delivering Linked Data Training to Data Science PractitionersDelivering Linked Data Training to Data Science Practitioners
Delivering Linked Data Training to Data Science PractitionersMarin Dimitrov
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataSören Auer
 
Museum Data Exchange
Museum Data ExchangeMuseum Data Exchange
Museum Data ExchangeOCLC Research
 

What's hot (12)

Lauruhn-5-jun15
Lauruhn-5-jun15Lauruhn-5-jun15
Lauruhn-5-jun15
 
Thompson 6-jun15-final
Thompson 6-jun15-finalThompson 6-jun15-final
Thompson 6-jun15-final
 
Wacker-4-june15
Wacker-4-june15Wacker-4-june15
Wacker-4-june15
 
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
 
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early AdoptersApril 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
 
IKHarvester - Informal Knowledge Harvester
IKHarvester - Informal Knowledge HarvesterIKHarvester - Informal Knowledge Harvester
IKHarvester - Informal Knowledge Harvester
 
Open Science and Identifiers
Open Science and IdentifiersOpen Science and Identifiers
Open Science and Identifiers
 
Delivering Linked Data Training to Data Science Practitioners
Delivering Linked Data Training to Data Science PractitionersDelivering Linked Data Training to Data Science Practitioners
Delivering Linked Data Training to Data Science Practitioners
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
 
Museum Data Exchange
Museum Data ExchangeMuseum Data Exchange
Museum Data Exchange
 
NISO DCMI Webinar bibframe-20130123
NISO DCMI Webinar bibframe-20130123NISO DCMI Webinar bibframe-20130123
NISO DCMI Webinar bibframe-20130123
 

Similar to Semantic Need: Guiding Metadata Annotations by Questions People #ask

Tutorial on query auto-completion
Tutorial on query auto-completionTutorial on query auto-completion
Tutorial on query auto-completionYichen Feng
 
Tutorial on query auto completion
Tutorial on query auto completionTutorial on query auto completion
Tutorial on query auto completionYichen Feng
 
Ravi Mynampaty - developing findability standards
Ravi Mynampaty - developing findability standardsRavi Mynampaty - developing findability standards
Ravi Mynampaty - developing findability standardsStarTech Conference
 
Developing & Implementing Findability Standards
Developing & Implementing Findability StandardsDeveloping & Implementing Findability Standards
Developing & Implementing Findability StandardsRavi Mynampaty
 
Pathways Summary Brief 25 Aug2010
Pathways Summary Brief   25 Aug2010Pathways Summary Brief   25 Aug2010
Pathways Summary Brief 25 Aug2010jmorriso
 
Introduction to Information Architecture & Design - 6/25/16
Introduction to Information Architecture & Design - 6/25/16Introduction to Information Architecture & Design - 6/25/16
Introduction to Information Architecture & Design - 6/25/16Robert Stribley
 
Introduction to Information Architecture & Design - 2/13/16
Introduction to Information Architecture & Design - 2/13/16Introduction to Information Architecture & Design - 2/13/16
Introduction to Information Architecture & Design - 2/13/16Robert Stribley
 
Building Surveys in Qualtrics for Efficient Analytics
Building Surveys in Qualtrics for Efficient AnalyticsBuilding Surveys in Qualtrics for Efficient Analytics
Building Surveys in Qualtrics for Efficient AnalyticsShalin Hai-Jew
 
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...Exploring Generative Models of Tripartite Graphs for Recommendation in Social...
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...Charalampos Chelmis
 
Introduction to Information Architecture & Design - 2/14/15
Introduction to Information Architecture & Design - 2/14/15Introduction to Information Architecture & Design - 2/14/15
Introduction to Information Architecture & Design - 2/14/15Robert Stribley
 
Semtech bizsemanticsearchtutorial
Semtech bizsemanticsearchtutorialSemtech bizsemanticsearchtutorial
Semtech bizsemanticsearchtutorialBarbara Starr
 
Introduction to Information Architecture & Design - 3/19/16
Introduction to Information Architecture & Design - 3/19/16Introduction to Information Architecture & Design - 3/19/16
Introduction to Information Architecture & Design - 3/19/16Robert Stribley
 
Introduction to Information Architecture & Design - 6/24/17
Introduction to Information Architecture & Design - 6/24/17Introduction to Information Architecture & Design - 6/24/17
Introduction to Information Architecture & Design - 6/24/17Robert Stribley
 
User Experience from a Business Perspective
User Experience from a Business PerspectiveUser Experience from a Business Perspective
User Experience from a Business PerspectiveDanny Mittleman
 
Semantic Search
Semantic SearchSemantic Search
Semantic Searchsssw2012
 
Introduction to Information Architecture & Design - 3/21/15
Introduction to Information Architecture & Design - 3/21/15Introduction to Information Architecture & Design - 3/21/15
Introduction to Information Architecture & Design - 3/21/15Robert Stribley
 
How Oracle Uses CrowdFlower For Sentiment Analysis
How Oracle Uses CrowdFlower For Sentiment AnalysisHow Oracle Uses CrowdFlower For Sentiment Analysis
How Oracle Uses CrowdFlower For Sentiment AnalysisCrowdFlower
 
RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...
 RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning... RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...
RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...S. Diana Hu
 
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...Joaquin Delgado PhD.
 

Similar to Semantic Need: Guiding Metadata Annotations by Questions People #ask (20)

Tutorial on query auto-completion
Tutorial on query auto-completionTutorial on query auto-completion
Tutorial on query auto-completion
 
Tutorial on query auto completion
Tutorial on query auto completionTutorial on query auto completion
Tutorial on query auto completion
 
Ravi Mynampaty - developing findability standards
Ravi Mynampaty - developing findability standardsRavi Mynampaty - developing findability standards
Ravi Mynampaty - developing findability standards
 
Developing & Implementing Findability Standards
Developing & Implementing Findability StandardsDeveloping & Implementing Findability Standards
Developing & Implementing Findability Standards
 
Pathways Summary Brief 25 Aug2010
Pathways Summary Brief   25 Aug2010Pathways Summary Brief   25 Aug2010
Pathways Summary Brief 25 Aug2010
 
Introduction to Information Architecture & Design - 6/25/16
Introduction to Information Architecture & Design - 6/25/16Introduction to Information Architecture & Design - 6/25/16
Introduction to Information Architecture & Design - 6/25/16
 
Introduction to Information Architecture & Design - 2/13/16
Introduction to Information Architecture & Design - 2/13/16Introduction to Information Architecture & Design - 2/13/16
Introduction to Information Architecture & Design - 2/13/16
 
Building Surveys in Qualtrics for Efficient Analytics
Building Surveys in Qualtrics for Efficient AnalyticsBuilding Surveys in Qualtrics for Efficient Analytics
Building Surveys in Qualtrics for Efficient Analytics
 
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...Exploring Generative Models of Tripartite Graphs for Recommendation in Social...
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...
 
Introduction to Information Architecture & Design - 2/14/15
Introduction to Information Architecture & Design - 2/14/15Introduction to Information Architecture & Design - 2/14/15
Introduction to Information Architecture & Design - 2/14/15
 
Semtech bizsemanticsearchtutorial
Semtech bizsemanticsearchtutorialSemtech bizsemanticsearchtutorial
Semtech bizsemanticsearchtutorial
 
Introduction to Information Architecture & Design - 3/19/16
Introduction to Information Architecture & Design - 3/19/16Introduction to Information Architecture & Design - 3/19/16
Introduction to Information Architecture & Design - 3/19/16
 
Introduction to Information Architecture & Design - 6/24/17
Introduction to Information Architecture & Design - 6/24/17Introduction to Information Architecture & Design - 6/24/17
Introduction to Information Architecture & Design - 6/24/17
 
Dlf 2012
Dlf 2012Dlf 2012
Dlf 2012
 
User Experience from a Business Perspective
User Experience from a Business PerspectiveUser Experience from a Business Perspective
User Experience from a Business Perspective
 
Semantic Search
Semantic SearchSemantic Search
Semantic Search
 
Introduction to Information Architecture & Design - 3/21/15
Introduction to Information Architecture & Design - 3/21/15Introduction to Information Architecture & Design - 3/21/15
Introduction to Information Architecture & Design - 3/21/15
 
How Oracle Uses CrowdFlower For Sentiment Analysis
How Oracle Uses CrowdFlower For Sentiment AnalysisHow Oracle Uses CrowdFlower For Sentiment Analysis
How Oracle Uses CrowdFlower For Sentiment Analysis
 
RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...
 RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning... RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...
RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...
 
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
 

More from Hans-Joerg Happel

Woogle4MediaWiki: An Extension to Improve Search and to Guide Contributions
Woogle4MediaWiki: An Extension to Improve Search and to Guide ContributionsWoogle4MediaWiki: An Extension to Improve Search and to Guide Contributions
Woogle4MediaWiki: An Extension to Improve Search and to Guide ContributionsHans-Joerg Happel
 
Wissensmanagement 3.0 Bedarfsorientierter Wissensaustausch mit Social Softwar...
Wissensmanagement 3.0 Bedarfsorientierter Wissensaustausch mit Social Softwar...Wissensmanagement 3.0 Bedarfsorientierter Wissensaustausch mit Social Softwar...
Wissensmanagement 3.0 Bedarfsorientierter Wissensaustausch mit Social Softwar...Hans-Joerg Happel
 
What do you want to contribute today (INSEMTIVES)
What do you want to contribute today (INSEMTIVES)What do you want to contribute today (INSEMTIVES)
What do you want to contribute today (INSEMTIVES)Hans-Joerg Happel
 
Semantic Need : Semantics from the People!
Semantic Need: Semantics from the People!Semantic Need: Semantics from the People!
Semantic Need : Semantics from the People!Hans-Joerg Happel
 
Semantic Result Formats: Automatically Transforming Structured Data into usef...
Semantic Result Formats: Automatically Transforming Structured Data into usef...Semantic Result Formats: Automatically Transforming Structured Data into usef...
Semantic Result Formats: Automatically Transforming Structured Data into usef...Hans-Joerg Happel
 
AIFB Graduiertenkolloquium - Bedarfsgetriebener Wissensaustausch
AIFB Graduiertenkolloquium - Bedarfsgetriebener WissensaustauschAIFB Graduiertenkolloquium - Bedarfsgetriebener Wissensaustausch
AIFB Graduiertenkolloquium - Bedarfsgetriebener WissensaustauschHans-Joerg Happel
 
Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki
Woogle4MediaWiki - Social search and knowledge sharing for MediaWikiWoogle4MediaWiki - Social search and knowledge sharing for MediaWiki
Woogle4MediaWiki - Social search and knowledge sharing for MediaWikiHans-Joerg Happel
 
When Users Becom Collaborators: Towards Continuous and Context-Aware User Input
When Users Becom Collaborators: Towards Continuous and Context-Aware User InputWhen Users Becom Collaborators: Towards Continuous and Context-Aware User Input
When Users Becom Collaborators: Towards Continuous and Context-Aware User InputHans-Joerg Happel
 
Towards Need-driven Knowledge Sharing in Distributed Teams
Towards Need-driven Knowledge Sharing in Distributed TeamsTowards Need-driven Knowledge Sharing in Distributed Teams
Towards Need-driven Knowledge Sharing in Distributed TeamsHans-Joerg Happel
 
2009 06 18 Atlassian User Conference Duesseldorf Romberg Happel Mit Wiquila S...
2009 06 18 Atlassian User Conference Duesseldorf Romberg Happel Mit Wiquila S...2009 06 18 Atlassian User Conference Duesseldorf Romberg Happel Mit Wiquila S...
2009 06 18 Atlassian User Conference Duesseldorf Romberg Happel Mit Wiquila S...Hans-Joerg Happel
 
Woogle -- On Why and How to Marry Wikis with Enterprise Search
Woogle -- On Why and How to Marry Wikis with Enterprise SearchWoogle -- On Why and How to Marry Wikis with Enterprise Search
Woogle -- On Why and How to Marry Wikis with Enterprise SearchHans-Joerg Happel
 
Supporting Software Development Teams with a Semantic Process- and Artifact-o...
Supporting Software Development Teams with a Semantic Process- and Artifact-o...Supporting Software Development Teams with a Semantic Process- and Artifact-o...
Supporting Software Development Teams with a Semantic Process- and Artifact-o...Hans-Joerg Happel
 
Enabling Social Network Analysis in Distributed Collaborative Software Develo...
Enabling Social Network Analysis in Distributed Collaborative Software Develo...Enabling Social Network Analysis in Distributed Collaborative Software Develo...
Enabling Social Network Analysis in Distributed Collaborative Software Develo...Hans-Joerg Happel
 
Playful Cleverness Revisited: Open-source Game Development as a Method for Te...
Playful Cleverness Revisited: Open-source Game Development as a Method for Te...Playful Cleverness Revisited: Open-source Game Development as a Method for Te...
Playful Cleverness Revisited: Open-source Game Development as a Method for Te...Hans-Joerg Happel
 
Web 2.0 artifacts in SME-networks
Web 2.0 artifacts in SME-networksWeb 2.0 artifacts in SME-networks
Web 2.0 artifacts in SME-networksHans-Joerg Happel
 
Einführung Wissenschaftliches Arbeiten in der angewandten Informatik
Einführung Wissenschaftliches Arbeiten in der angewandten InformatikEinführung Wissenschaftliches Arbeiten in der angewandten Informatik
Einführung Wissenschaftliches Arbeiten in der angewandten InformatikHans-Joerg Happel
 

More from Hans-Joerg Happel (16)

Woogle4MediaWiki: An Extension to Improve Search and to Guide Contributions
Woogle4MediaWiki: An Extension to Improve Search and to Guide ContributionsWoogle4MediaWiki: An Extension to Improve Search and to Guide Contributions
Woogle4MediaWiki: An Extension to Improve Search and to Guide Contributions
 
Wissensmanagement 3.0 Bedarfsorientierter Wissensaustausch mit Social Softwar...
Wissensmanagement 3.0 Bedarfsorientierter Wissensaustausch mit Social Softwar...Wissensmanagement 3.0 Bedarfsorientierter Wissensaustausch mit Social Softwar...
Wissensmanagement 3.0 Bedarfsorientierter Wissensaustausch mit Social Softwar...
 
What do you want to contribute today (INSEMTIVES)
What do you want to contribute today (INSEMTIVES)What do you want to contribute today (INSEMTIVES)
What do you want to contribute today (INSEMTIVES)
 
Semantic Need : Semantics from the People!
Semantic Need: Semantics from the People!Semantic Need: Semantics from the People!
Semantic Need : Semantics from the People!
 
Semantic Result Formats: Automatically Transforming Structured Data into usef...
Semantic Result Formats: Automatically Transforming Structured Data into usef...Semantic Result Formats: Automatically Transforming Structured Data into usef...
Semantic Result Formats: Automatically Transforming Structured Data into usef...
 
AIFB Graduiertenkolloquium - Bedarfsgetriebener Wissensaustausch
AIFB Graduiertenkolloquium - Bedarfsgetriebener WissensaustauschAIFB Graduiertenkolloquium - Bedarfsgetriebener Wissensaustausch
AIFB Graduiertenkolloquium - Bedarfsgetriebener Wissensaustausch
 
Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki
Woogle4MediaWiki - Social search and knowledge sharing for MediaWikiWoogle4MediaWiki - Social search and knowledge sharing for MediaWiki
Woogle4MediaWiki - Social search and knowledge sharing for MediaWiki
 
When Users Becom Collaborators: Towards Continuous and Context-Aware User Input
When Users Becom Collaborators: Towards Continuous and Context-Aware User InputWhen Users Becom Collaborators: Towards Continuous and Context-Aware User Input
When Users Becom Collaborators: Towards Continuous and Context-Aware User Input
 
Towards Need-driven Knowledge Sharing in Distributed Teams
Towards Need-driven Knowledge Sharing in Distributed TeamsTowards Need-driven Knowledge Sharing in Distributed Teams
Towards Need-driven Knowledge Sharing in Distributed Teams
 
2009 06 18 Atlassian User Conference Duesseldorf Romberg Happel Mit Wiquila S...
2009 06 18 Atlassian User Conference Duesseldorf Romberg Happel Mit Wiquila S...2009 06 18 Atlassian User Conference Duesseldorf Romberg Happel Mit Wiquila S...
2009 06 18 Atlassian User Conference Duesseldorf Romberg Happel Mit Wiquila S...
 
Woogle -- On Why and How to Marry Wikis with Enterprise Search
Woogle -- On Why and How to Marry Wikis with Enterprise SearchWoogle -- On Why and How to Marry Wikis with Enterprise Search
Woogle -- On Why and How to Marry Wikis with Enterprise Search
 
Supporting Software Development Teams with a Semantic Process- and Artifact-o...
Supporting Software Development Teams with a Semantic Process- and Artifact-o...Supporting Software Development Teams with a Semantic Process- and Artifact-o...
Supporting Software Development Teams with a Semantic Process- and Artifact-o...
 
Enabling Social Network Analysis in Distributed Collaborative Software Develo...
Enabling Social Network Analysis in Distributed Collaborative Software Develo...Enabling Social Network Analysis in Distributed Collaborative Software Develo...
Enabling Social Network Analysis in Distributed Collaborative Software Develo...
 
Playful Cleverness Revisited: Open-source Game Development as a Method for Te...
Playful Cleverness Revisited: Open-source Game Development as a Method for Te...Playful Cleverness Revisited: Open-source Game Development as a Method for Te...
Playful Cleverness Revisited: Open-source Game Development as a Method for Te...
 
Web 2.0 artifacts in SME-networks
Web 2.0 artifacts in SME-networksWeb 2.0 artifacts in SME-networks
Web 2.0 artifacts in SME-networks
 
Einführung Wissenschaftliches Arbeiten in der angewandten Informatik
Einführung Wissenschaftliches Arbeiten in der angewandten InformatikEinführung Wissenschaftliches Arbeiten in der angewandten Informatik
Einführung Wissenschaftliches Arbeiten in der angewandten Informatik
 

Recently uploaded

CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 

Semantic Need: Guiding Metadata Annotations by Questions People #ask

  • 1. Semantic Need Guiding Metadata Annotations by Questions People #ask Hans-Jörg Happel, FZI Karlsruhe, Germany 2010-11-09 @ 9th Int. Semantic Web Conference (ISWC 2010), Shanghai, China
  • 2. Agenda • Introduction • Semantic Gap Heuristics • Semantic MediaWiki Study • Extension:Semantic Need • Summary & Outlook Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 2
  • 3. • SMW is a popular Semantic Web application that allows to annotate Wiki pages semantically • Semantic interpretation of the existing Wiki categories • Syntax extension for [[Wiki links]] – Relations to other pages: [[Capital::Abuja]] – Literals: [[Inhabitants::182418]] Semantic MediaWiki (SMW) Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 3
  • 4. Structured Queries in SMW • SMW also allows for structured queries {{#ask: [[Category:Country]] [[OnContinent::Africa]] |?area |?... }} Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 4  SMW resembles the Semantic Web in small
  • 5. SMW Query Result {{#ask: [[Category:Country]] [[OnContinent::Africa]] |?area |?... }} Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 5 ??? …?
  • 6. What happend to „Nigeria“? Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 6 Info might be missing …not annotated properly Different property name Distributed data source not available
  • 7. Semantic Gaps • Observation: – „Semantic gap between supply and demand on the Semantic Web” [Mik09] – Due to the evolutionary nature of the (Semantic) Web (OWA) • What is missing? – i.e.: – KB: Axioms that are known (e.g. statements about Nigeria) – XKB: Axioms not yet known but people would like to know Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 7
  • 8. Towards Semantic Need • Research questions – How to identify „Semantic Gaps“? – Do „Semantic Gaps“ exist? – If yes, how to close these gaps? • Research approach – Propose heuristics – Explorative: Analyze Public Semantic Web – Constructive: Design and evaluate tools 88Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China
  • 9. Agenda • Introduction • Semantic Gap Heuristics • Semantic MediaWiki Study • Extension:Semantic Need • Summary & Outlook Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 9
  • 10. Idea: Guide Annotation by Information Needs • Means for deriving information needs – (Structured) queries – Information access/browsing – Context – …? • We chose to focus on queries – Explicit; can be captured easily – Express a „demand“ [Mik09] – Recur across time and different people (at least in IR! [Smy05, Tee06, Zha09]) Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 10
  • 11. Identifying „Semantic Gaps“ • Focus on – Conjuctive queries – Retrieving instances and their properties • Core elements {{#ask: [[Category:Country]] [[OfContinent::Africa]] |?hasArea |?population |?hasCapital |?Currency }} Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 11 Printout Statement Printout Statement ConditionsConditions
  • 12. Semantic Gap Heuristic #1: Near Matches • Instance I ∈ KB is considered a near match of a query q if: – I is not in the result set of q in KB – There is at least one conjunctive query atom of q for which I is part of the result set – I would be in the result set of q in KB ∪ XKB • Correspondingly, we consider q to have an incomplete result set if it has „near matches“ Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 12
  • 13. Semantic Gap Heuristic #1: Example {{#ask: [[Category:Country]] [[OnContinent::Africa]] |?area |?... }} Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 13 Egypt 1.001.449 km2 83.082.869 Cairo Egyptian pound Lacks annotation [[OnContinent::Africa]] „Near Match“ Lacks annotation [[OnContinent::Africa]] „Near Match“
  • 14. Semantic Gap Heuristic #2: Missing Printout Values • Instance I ∈ KB is considered to have missing printout values for a query q if: – I is part of the result set of q – q contains a printout statement x for which no property value of I exists in the KB • Note: Technically, „missing printout values“ can be considered equivalent to near matches – SPARQL requires „OPTIONAL“ modifier to yield missing printout values – SMW-QL allows to set printout values required • Correspondigly, we consider q to have an sparse result set if it has at least one „missing printout value“ Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 14
  • 15. Semantic Gap Heuristic #2: Example Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 15 „Missing Printout Values“ „Missing Printout Values“
  • 16. Agenda • Introduction • Semantic Gap Heuristics • Semantic MediaWiki Study • Extension:Semantic Need • Summary & Outlook Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 16
  • 17. Public SMW Analysis: Design • Goal – Do „Semantic Gaps“ exist? – Find out significance of missing result values and near matches in real world queries • Crawling public SMW installations – Collected ~200 public SMW installations via overview lists and search engines – Selection of 8 SMW instances (filtered based on data and technical reasons and random choice) – Those have on average 1880 annotations and 35 inline queries • Checking for sparse & incomplete query results – Analyzing 25 (out of 285) queries (only ASK-Queries, online "Table"-output format, only queries with printout statements resp. conjunctions) – 17 of these queries were located on Template pages Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 17
  • 18. Public SMW Analysis: Results • Printout-Values – In average, 16% of cells in a result set were empty due to missing annotations (up to 63% for certain queries)  Allows for identifying a total of 296 missing printout values – Validation showed that 13 out of 15 manually investigated empty cells could be considered missing information • Near matches – In average, 22% of all potential result pages of a query lack a selective annotation (up to 94% for certain queries) Allows for identifying a total of 147 potentially missing annotations for “selective” properties – Validation showed that 10 out of 15 manually investigated near matches could be considered missing information • Note: based on evaluation conditions, only around 9% of the overall inline queries were analyzed Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 18
  • 19. Agenda • Introduction • Semantic Gap Heuristics • Semantic MediaWiki Study • Extension:Semantic Need • Summary & Outlook Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 19
  • 20. Extension:Semantic Need: Idea • Goal: – How to close „Semantic Gaps“? – Guide the creation of semantic annotations in SMW • Design principles – „Need-driven Knowledge Sharing“ [Hap09b] – People are willing to contribute missing information, if they recognize that there is concrete demand – Derived from related work and supported by user studies • Core features – Capture and store needs (i.e. #ask-queries) – Guide annotations by extending and modifying the SMW user interface based on information need heuristics (i.e. „near matches“ and „missing printout values“ Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 2020
  • 21. Screenshot: In-Page Annotation Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 22 HintHint Sources of needSources of need
  • 22. Semantic Need Online Survey: Design • 34 questions on SMW and Semantic Need • Target group: SMW experts (via mailinglist, invitation) • Data collected in June/July 2010 • 30 complete answers (out of 58) Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 23
  • 23. Semantic Need Online Survey: Semantic Need can help • Problem patterns do occur – Sparse result set: 12/30 considered problematic – Incomplete result set: 23/30 considered problematic • Stressed in free text • Core issue: „invisibility“ of the issue • Usage of SMW differs – „Structured“ settings  focus on quality – „Open“ settings  focus on guidance – Semantic Need generally considered helpful by both groups 24Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China
  • 24. Maintenance practices: mostly ad hoc • Methods & tools used to maintain semantic data – (7: n.a.; due to given external data model) – 12: none – 5: „simple“ – 7: „advanced“ (e.g. scripts, documentation, team decisions) • How to find missing annotations for a given page – 6: Compare similar pages („extensional“) – 7: Check schema („intensional“) – 4: Text analysis – 10: Use query 25Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China
  • 25. Agenda • Introduction • Semantic Gap Heuristics • Semantic MediaWiki Study • Extension:Semantic Need • Summary & Outlook Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 26
  • 26. Insights • „Semantic Gaps“ do exist – Information needs are a valuable source to find them – „Missing printout values“ and „near matches“ seem to be useful heuristics – Especially „incomplete result sets“ are considered problematic • No systematic guidance & gardening of SMW knowledge bases Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 27
  • 27. Design Implications • Semantic Annotation – Issue: Costly, often driven by a pre-defined ontology structure – Idea: Consider “incentives for annotation” [Han05] • Semantic Search – Issue: Decoupling of provision & access – Idea: Consider information needs • Need specification/ontology • Maintain semantic query logs • Data Quality/Gardening/Maturing – Issue: The Semantic Web evolves continuously – Idea: Allow for better data quality modeling Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 28
  • 28. Summary & Outlook • Main contributions – How to identify „Semantic Gaps“  Heuristics based on queries – Do „Semantic Gaps“ exist?  Yes – If yes, how to close these gaps?  Semantic Need • Next steps – Large scale analysis of „Semantic Gaps“ (more public SMW instances) – Provide stable implementation und gather feedback from field usage of Semantic Need • Further research opportunities – Use needs to guide the sharing of semantic annotations – Use needs to create schema-level mappings or for class/property evolution – Many more (Semantic query logs, UI, Incentives, …) 29Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China
  • 29. References • Extension:Semantic Need – http://amazonas.fzi.de/semanticneed / (Demo Wiki) – http://www.mediawiki.org/wiki/Extension:Semantic_Need • Extension:Woogle4MediaWiki (for non SMW-Wikis) – http://amazonas.fzi.de/wooglenative/ (Demo Wiki) – http://www.mediawiki.org/wiki/Extension:Woogle4MediaWiki • Literature – [Han05] Handschuh, Siegfried: Creating ontology-based metadata by annotation for the semantic web, Dissertation, 2005 – [Hap09b] Hans-Jörg Happel: Towards Need-driven Knowledge Sharing in Distributed Teams. In Proceedings of the 9th International Conference on Knowledge Management (I-KNOW 2009) – [Hap09c] Hans-Jörg Happel: Social Search and Need-driven Knowledge Sharing in Wikis with Woogle. In Proceedings of the 5th international Symposium on Wikis and Open Collaboration (Orlando, Florida, October 25 - 27, 2009). WikiSym '09. ACM, New York, NY, 1-10. – [Mik09]: Mika, P., Meij, E., Zaragoza, H.: Investigating the semantic gap through query log analysis. In: International Semantic Web Conference. Lecture Notes in Computer Science, vol. 5823, pp. 441–455. Springer (2009) – [Smy05] Smyth, Barry ; Balfe, Evelyn ; Freyne, Jill ; Briggs, Peter ; Coyle, Maurice ; Boydell, Oisin: Exploiting Query Repetition and Regularity in an Adaptive Community-Based Web Search Engine. In: User Modeling and User-Adapted Interaction 14 (2005), Nr. 5, S. 383–423. – [Tee06] Teevan, Jaime ; Adar, Eytan ; Jones, Rosie ; Potts, Michael: History repeats itself: repeat queries in Yahoo’s logs. In: SIGIR’06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval. New York, NY, USA : ACM, 2006, S. 703–704. – [Zha09] Zhang, Dell ; Lu, Jinsong: What queries are likely to recur in web search? In: SIGIR ’09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval. New York, NY, USA : ACM, 2009, S. 827–828. Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China 30
  • 30. The Semantic Web: Problems • Lack of resources: you might not annotate everything – Metadata creation is costly – Access to metadata might be restricted to different spheres of sharing (private, friends, world…) – “..probably the most important [open question] for the Semantic Web. How to create incentives for annotation?” (Handschuh 2005, p198) [12] • Lack of guidance: you might annotate the wrong things – „ Semantic gap between supply and demand on the Semantic Web” [Mik09] – The two processes of metadata creation and metadata use are decoupled concerning time and actors – Existing annotation approaches drive the annotation process by the pre- defined ontology structure  No unified theory, why metadata is created and how it is shared – Semantic Web Vision does not address the creator side of metadata – it spends a lot of effort to convince people using the Semantic Web but not contributing to it 31Semantic Need: Guiding Metadata Annotations by Questions People #ask - ISWC 2010; Shanghai, China