SlideShare a Scribd company logo
1 of 22
Download to read offline
A semantically enabled architecture for
 crowdsourced Linked Data management
 Elena Simperl,1 Maribel Acosta,1 Barry Norton2
 1Institute         AIFB, Karlsruhe Institute of Technology, Germany
 2Ontotext          AD, Bulgaria
 Institute of Applied Informatics and Formal Description Methods (AIFB)
Institute of Applied Informatics and Formal Description Methods (AIFB)




 KIT – University of the State of Baden-Wuerttemberg and
 National Research Center of the Helmholtz Association                    www.kit.edu
Background: What is Linked Data?
      Linked Data: set of best practices
     to publish and connect structured
     data on the Web.
            URIs to identify entities and
            concepts in the world
            HTTP to access and retrieve
            resources and descriptions of
            these resources
            RDF as generic graph-based data
            model to structure and link data
      Taken together Linked Data is
     said to form a ‘cloud’ of shared
     references and vocabularies.
       Query language: SPARQL.

    http://linkeddata.org/faq
2     07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                   Linked Data management                                                                     Beschreibungsverfahren (AIFB)
Background: Why Linked Data?
Data.gov & public sector information: more                                 BBC & media: added value of
transparency and accountability in                                         content through interlinking
governance




                                                                          Google, Yahoo, Bing & schema.org:
                                                                          enhanced search




3   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                 Linked Data management                                                                     Beschreibungsverfahren (AIFB)
Outline

                 1       • Motivation

                 2       • Our Approach

                 3       • Extensions to VoID and SPARQL

                 4       • Crowdsourced query processing tasks

                 5       • Advantages

                 6       • Challenges

4   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                 Linked Data management                                                                     Beschreibungsverfahren (AIFB)
1. Motivation
                        User Query: Give me the German names of all commercial
                        airports in Baden-Württemberg, ordered by their most
                        informative description.

    „Retrieve the labels in German of commercial airports located
    in Baden-Württemberg, ordered by the better human-readable
    description of the airport given in the comment“.


         This query cannot be optimally answered automatically:
                 Incorrect/missing classification of entities (e.g. classification as
                 airports instead of commercial airports).
                 Missing information in data sets (e.g. German labels).
                 It is not possible to optimally perform subjective operations (e.g.
                 comparisons of pictures or NL comments).
5   07.06.2012    CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
1. Motivation
    „Retrieve the labels in German of commercial airports
    located in Baden-Württemberg, ordered by the better human-
    readable description of the airport given in the comment“.


         In order to answer the query as intended:
                 Classification of airports as commercial airports.
                 Identity resolution of places (Baden-Württemberg).
                 Translation of the labels of the airports.
                 Ordering of the comments by a subjective comparison.




6   07.06.2012    CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
1. Motivation
    „Retrieve the labels in German of commercial airports
    located in Baden-Württemberg, ordered by the better human-
    readable description of the airport given in the comment“.

    SPARQL Query:
    SELECT ?label WHERE {                                                       Classification
                                                                            1
      ?x a metar:CommercialHubAirport;
          rdfs:label ?label;
          rdfs:comment ?comment .
      ?x geonames:parentFeature ?z .                                                                   Identity Resolution
                                                                                                                                         2
      ?z owl:sameAs <http://dbpedia.org/resource/Baden-Wuerttemberg> .
      FILTER (LANG(?label) = "de") 3 Missing Information
                                                           4 Ordering
    } ORDER BY CROWD(?comment, "Better description of %x")


7   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced         Institut für Angewandte Informatik und Formale
                 Linked Data management                                                                           Beschreibungsverfahren (AIFB)
1. Motivation: Our Aim
        SPARQL query engine, able to process queries using
       seamless combination of automatic query processing and
       crowdsourcing.
                        Query                     Results
                                                                                                        Mediator
                         SPARQL query engine                             Crowdsourced query processing
                              Query parsing                                 Task design        UI generation

                           Query optimization

                             Query execution


                     Wrapper       Wrapper         Wrapper                                 Wrapper




8   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced             Institut für Angewandte Informatik und Formale
                 Linked Data management                                                                               Beschreibungsverfahren (AIFB)
2. Our Approach

                                                Parser

     Query                    Results                Decomposes the input query.
       SPARQL query engine
                                                     Selects the data sets that should be
             Query parsing
                                                     accessed to produce answers.
          Query optimization
                                                     Rewrites the query into the internal
             Query execution                         structures.




9   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                 Linked Data management                                                                     Beschreibungsverfahren (AIFB)
2. Our Approach

                                                 Optimizer

      Query                    Results                DB statistics and crowdsourcing
        SPARQL query engine                           statistics: estimated time to completion,
               Query parsing
                                                      and other information about the
                                                      performance (quality, cost) of the crowd.
          Query optimization

                                                      Traditional data bases optimization
              Query execution
                                                      techniques are implemented.

                                                      Determines which parts of the query
                                                      should be solved by human input: VoID
                                                      and SPARQL extensions.

                                                      Generates logical and physical plans.
10   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
2. Our Approach

                                                 Executor

      Query                    Results                Implements physical operators.
        SPARQL query engine
                                                      Invokes crowdsourcing component:
              Query parsing

                                                              Creates tasks.
           Query optimization
                                                              Generates UI.
            Query execution
                                                              Infers facts automatically.

                                                      Executes query against Linked Data:
                                                      computational tasks.

                                                      Incorporates results from the human
                                                      input.
11   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
3. Extensions to VoID and SPARQL
         The RDF based schema to describe data sets is VoID
        (Vocabulary of Interlinked Datasets).



         Common VoID predicates: voidDataset,
        void:inDataset, void:Linkset, void:linkPredicate,
        void:target.
                                                               Automatic interlinking of datasets

          VoID extensions:                                     CrowdClass

                                                               CrowdProperty

13   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
3. Extensions to VoID and SPARQL
          Automatic interlinking of data sets

     Example - Specification of Data Sets:

     :METAR rdf:type void:Dataset .                                                         METAR
     :Genonames rdf:type void:Dataset .
                                                                                                                  owl:sameAs


     :METAR2Geonames rdf:type void:Linkset ;
          void:linkPredicate owl:sameAs ;
          void:target :METAR ;                                                                           Geonames
          void:target :Geonames .

14   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
3. Extensions to VoID and SPARQL

           CrowdClass
      - Specifies which entities of a data set could be crowdsourced.
      - All subclasses of the crowdClass are also defined (implicitly)
        as crowdsourced entities.

     Example:
     metar:Airport void:inDataset :METAR .
     metar:CommercialHubAirport void:inDataset :METAR;
            rdfs:subClass metar:Airport .
     metar:Airport rdf:type void:crowdClass .
     metar:CommercialHubAirport rdf:type void:crowdClass.


15    07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                   Linked Data management                                                                     Beschreibungsverfahren (AIFB)
3. Extensions to VoID and SPARQL
          RDF data can be queried using the language SPARQL.

         Common SPARQL operators: join, union, optional,
        filter, order by.


         Properties related to general ontology languages such as
        OWL are treated as extensions of SPARQL operators,
        and are modeled in our architecture as tasks.




16   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
4. Tasks

         Formal, declarative description of the data and
        tasks using SPARQL patterns as a basis for the
        automatic design of HITs.

                  Identity resolution

                  Missing information

                  Ontological classification

                  Ordering (new operator)


17   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
4.1. Ontological Classification
         It is not always possible to automatically infer classification
        from the properties.
         Example: Retrieve the names (labels) of METAR stations that
        correspond to commercial airports.

     SELECT ?label WHERE {
       ?station a metar:CommercialHubAirport;
         rdfs:label ?label .}

     Input:        {?station a metar:Station;
                      rdfs:label ?label;
                      wgs84:lat ?lat;
                      wgs84:long ?long}

     Output: {?station a ?type.
              ?type rdfs:subClassOf metar:Station}
18   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
4.2. Ordering
         Orderings defined via less straightforward built-ins; for
        instance, the ordering of pictorial representations of entities.
         SPARQL extension: ORDER BY CROWD
          Example: Retrieves all airports and their pictures, and the pictures should
        be ordered according to the more representative image of the given airport.

SELECT ?airport ?picture WHERE {
  ?airport a metar:Airport;
    foaf:depiction ?picture .
} ORDER BY CROWD(?picture,
"Most representative image for %airport")

 Input:       {?airport foaf:depiction ?x, ?y}

Output: {{(?x ?y) a rdf:List} UNION {(?y ?x) a rdf:List}}

19   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
4.3. Computational tasks expressed as
     SPARQL queries

          Transitive relations inferred automatically, without
         requiring human intervention.

           Implementation of restrictions in SPIN.

     Identity Resolution                            Classification                           Ordering
     CONSTRUCT {                                    CONSTRUCT {                              CONSTRUCT {
      ?a owl:sameAs ?c .                             ?a a ?b.                                 {(?a ?b) a rdf:List .}
     } WHERE {                                        ?b rdfs:subClassOf ?c.                 } WHERE {
      ?a owl:sameAs ?b .                            } WHERE {                                 (?a ?x) a rdf:List .
      ?b owl:sameAs ?c .                              ?a rdfs:subClassOf ?c.                  (?x ?b) a rdf:List .
     }                                                ?b rdfs:subClassOf ?b1.                }
                                                      ?b1 rdfs:subClassOf ?c.
                                                    }


20    07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced     Institut für Angewandte Informatik und Formale
                   Linked Data management                                                                       Beschreibungsverfahren (AIFB)
5. Advantages
         Declarative description of data allows to decompose the
        query.

          Generation of the UIs automatically.

         Generation of human tasks on-the-fly and adjustment of
        the design of the task.

         Automatic consistency check of results by reasoning
        against validating ontology.




21   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
6. Challenges
         Appropriate level of granularity for HITs design for specific
        SPARQL constructs.
          Caching
            Naively we can materialise HIT results into datasets.
                  How to deal with partial coverage and dynamic datasets.

          Optimal user interfaces of graph-like content.

          Pricing and workers’ assignment.




22   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
QUESTIONS



23   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)

More Related Content

What's hot

Hadoop - Now, Next and Beyond
Hadoop - Now, Next and BeyondHadoop - Now, Next and Beyond
Hadoop - Now, Next and BeyondTeradata Aster
 
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...SBGC
 
Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04nihshowandtell
 
AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS
AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS
AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS ijaia
 
Linked Open data: CNR
Linked Open data: CNRLinked Open data: CNR
Linked Open data: CNRDatiGovIT
 
Big Data and Data Standardization at LinkedIn
Big Data and Data Standardization at LinkedInBig Data and Data Standardization at LinkedIn
Big Data and Data Standardization at LinkedInAlexis Baird
 
Flexible querying of relational databases fuzzy set based approach 27-11
Flexible querying of relational databases fuzzy set based approach 27-11Flexible querying of relational databases fuzzy set based approach 27-11
Flexible querying of relational databases fuzzy set based approach 27-11Adel Sabour
 

What's hot (10)

Hadoop - Now, Next and Beyond
Hadoop - Now, Next and BeyondHadoop - Now, Next and Beyond
Hadoop - Now, Next and Beyond
 
Pacename
PacenamePacename
Pacename
 
02 Web Search
02 Web Search02 Web Search
02 Web Search
 
Treasure Data: Big Data Analytics on Heroku
Treasure Data: Big Data Analytics on HerokuTreasure Data: Big Data Analytics on Heroku
Treasure Data: Big Data Analytics on Heroku
 
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
 
Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04
 
AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS
AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS
AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS
 
Linked Open data: CNR
Linked Open data: CNRLinked Open data: CNR
Linked Open data: CNR
 
Big Data and Data Standardization at LinkedIn
Big Data and Data Standardization at LinkedInBig Data and Data Standardization at LinkedIn
Big Data and Data Standardization at LinkedIn
 
Flexible querying of relational databases fuzzy set based approach 27-11
Flexible querying of relational databases fuzzy set based approach 27-11Flexible querying of relational databases fuzzy set based approach 27-11
Flexible querying of relational databases fuzzy set based approach 27-11
 

Similar to Crowdsourcing-enabled Linked Data management architecture

Crowdsourcing tasks in Linked Data management
Crowdsourcing tasks in Linked Data managementCrowdsourcing tasks in Linked Data management
Crowdsourcing tasks in Linked Data managementBarry Norton
 
Linked data for Enterprise Data Integration
Linked data for Enterprise Data IntegrationLinked data for Enterprise Data Integration
Linked data for Enterprise Data IntegrationSören Auer
 
Semtech 2011 impressions
Semtech 2011 impressionsSemtech 2011 impressions
Semtech 2011 impressionsGeorge Roth
 
Big Data Real Time Applications
Big Data Real Time ApplicationsBig Data Real Time Applications
Big Data Real Time ApplicationsDataWorks Summit
 
“Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services” “Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services” diannepatricia
 
Session 0.0 poster minutes madness
Session 0.0   poster minutes madnessSession 0.0   poster minutes madness
Session 0.0 poster minutes madnesssemanticsconference
 
X api chinese cop monthly meeting feb.2016
X api chinese cop monthly meeting   feb.2016X api chinese cop monthly meeting   feb.2016
X api chinese cop monthly meeting feb.2016Jessie Chuang
 
Apache Spark and MongoDB - Turning Analytics into Real-Time Action
Apache Spark and MongoDB - Turning Analytics into Real-Time ActionApache Spark and MongoDB - Turning Analytics into Real-Time Action
Apache Spark and MongoDB - Turning Analytics into Real-Time ActionJoão Gabriel Lima
 
Applied Semantic Search with Microsoft SQL Server
Applied Semantic Search with Microsoft SQL ServerApplied Semantic Search with Microsoft SQL Server
Applied Semantic Search with Microsoft SQL ServerMark Tabladillo
 
MongoDB_Spark
MongoDB_SparkMongoDB_Spark
MongoDB_SparkMat Keep
 
How to Share and Reuse Learning Resources: the ARIADNE Experience
How to Share and Reuse Learning Resources: the ARIADNE ExperienceHow to Share and Reuse Learning Resources: the ARIADNE Experience
How to Share and Reuse Learning Resources: the ARIADNE ExperienceJoris Klerkx
 
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software ComponentsFIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software ComponentsFIWARE
 
Sem tech 2011 v8
Sem tech 2011 v8Sem tech 2011 v8
Sem tech 2011 v8dallemang
 
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012Calpont Corporation
 
ConceptClassifier for SharePoint Turbo Charging the Public Sector
ConceptClassifier for SharePoint Turbo Charging the Public SectorConceptClassifier for SharePoint Turbo Charging the Public Sector
ConceptClassifier for SharePoint Turbo Charging the Public Sectormartingarland
 

Similar to Crowdsourcing-enabled Linked Data management architecture (20)

Aaai2012
Aaai2012Aaai2012
Aaai2012
 
Crowdsourcing tasks in Linked Data management
Crowdsourcing tasks in Linked Data managementCrowdsourcing tasks in Linked Data management
Crowdsourcing tasks in Linked Data management
 
Linked data for Enterprise Data Integration
Linked data for Enterprise Data IntegrationLinked data for Enterprise Data Integration
Linked data for Enterprise Data Integration
 
Semtech 2011 impressions
Semtech 2011 impressionsSemtech 2011 impressions
Semtech 2011 impressions
 
Big Data Real Time Applications
Big Data Real Time ApplicationsBig Data Real Time Applications
Big Data Real Time Applications
 
“Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services” “Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services”
 
Session 0.0 poster minutes madness
Session 0.0   poster minutes madnessSession 0.0   poster minutes madness
Session 0.0 poster minutes madness
 
X api chinese cop monthly meeting feb.2016
X api chinese cop monthly meeting   feb.2016X api chinese cop monthly meeting   feb.2016
X api chinese cop monthly meeting feb.2016
 
Apache Spark and MongoDB - Turning Analytics into Real-Time Action
Apache Spark and MongoDB - Turning Analytics into Real-Time ActionApache Spark and MongoDB - Turning Analytics into Real-Time Action
Apache Spark and MongoDB - Turning Analytics into Real-Time Action
 
Applied Semantic Search with Microsoft SQL Server
Applied Semantic Search with Microsoft SQL ServerApplied Semantic Search with Microsoft SQL Server
Applied Semantic Search with Microsoft SQL Server
 
Sub1583
Sub1583Sub1583
Sub1583
 
MongoDB_Spark
MongoDB_SparkMongoDB_Spark
MongoDB_Spark
 
How to Share and Reuse Learning Resources: the ARIADNE Experience
How to Share and Reuse Learning Resources: the ARIADNE ExperienceHow to Share and Reuse Learning Resources: the ARIADNE Experience
How to Share and Reuse Learning Resources: the ARIADNE Experience
 
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software ComponentsFIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
 
Sem tech 2011 v8
Sem tech 2011 v8Sem tech 2011 v8
Sem tech 2011 v8
 
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
 
ConceptClassifier for SharePoint Turbo Charging the Public Sector
ConceptClassifier for SharePoint Turbo Charging the Public SectorConceptClassifier for SharePoint Turbo Charging the Public Sector
ConceptClassifier for SharePoint Turbo Charging the Public Sector
 
Role of Semantic Web in Health Informatics
Role of Semantic Web in Health InformaticsRole of Semantic Web in Health Informatics
Role of Semantic Web in Health Informatics
 
MarkAndrews
MarkAndrewsMarkAndrews
MarkAndrews
 
LRMI in Context, Brandt Redd
LRMI in Context, Brandt ReddLRMI in Context, Brandt Redd
LRMI in Context, Brandt Redd
 

More from Elena Simperl

This talk was not generated with ChatGPT: how AI is changing science
This talk was not generated with ChatGPT: how AI is changing scienceThis talk was not generated with ChatGPT: how AI is changing science
This talk was not generated with ChatGPT: how AI is changing scienceElena Simperl
 
Knowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generationKnowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generationElena Simperl
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backElena Simperl
 
The web of data: how are we doing so far
The web of data: how are we doing so farThe web of data: how are we doing so far
The web of data: how are we doing so farElena Simperl
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringElena Simperl
 
Open government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impactOpen government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impactElena Simperl
 
Ten myths about knowledge graphs.pdf
Ten myths about knowledge graphs.pdfTen myths about knowledge graphs.pdf
Ten myths about knowledge graphs.pdfElena Simperl
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringElena Simperl
 
Data commons and their role in fighting misinformation.pdf
Data commons and their role in fighting misinformation.pdfData commons and their role in fighting misinformation.pdf
Data commons and their role in fighting misinformation.pdfElena Simperl
 
Are our knowledge graphs trustworthy?
Are our knowledge graphs trustworthy?Are our knowledge graphs trustworthy?
Are our knowledge graphs trustworthy?Elena Simperl
 
The web of data: how are we doing so far?
The web of data: how are we doing so far?The web of data: how are we doing so far?
The web of data: how are we doing so far?Elena Simperl
 
Crowdsourcing and citizen engagement for people-centric smart cities
Crowdsourcing and citizen engagement for people-centric smart citiesCrowdsourcing and citizen engagement for people-centric smart cities
Crowdsourcing and citizen engagement for people-centric smart citiesElena Simperl
 
Pie chart or pizza: identifying chart types and their virality on Twitter
Pie chart or pizza: identifying chart types and their virality on TwitterPie chart or pizza: identifying chart types and their virality on Twitter
Pie chart or pizza: identifying chart types and their virality on TwitterElena Simperl
 
High-value datasets: from publication to impact
High-value datasets: from publication to impactHigh-value datasets: from publication to impact
High-value datasets: from publication to impactElena Simperl
 
The story of Data Stories
The story of Data StoriesThe story of Data Stories
The story of Data StoriesElena Simperl
 
The human face of AI: how collective and augmented intelligence can help sol...
The human face of AI:  how collective and augmented intelligence can help sol...The human face of AI:  how collective and augmented intelligence can help sol...
The human face of AI: how collective and augmented intelligence can help sol...Elena Simperl
 
Qrowd and the city: designing people-centric smart cities
Qrowd and the city: designing people-centric smart citiesQrowd and the city: designing people-centric smart cities
Qrowd and the city: designing people-centric smart citiesElena Simperl
 
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...Elena Simperl
 
Inclusive cities: a crowdsourcing approach
Inclusive cities: a crowdsourcing approachInclusive cities: a crowdsourcing approach
Inclusive cities: a crowdsourcing approachElena Simperl
 

More from Elena Simperl (20)

This talk was not generated with ChatGPT: how AI is changing science
This talk was not generated with ChatGPT: how AI is changing scienceThis talk was not generated with ChatGPT: how AI is changing science
This talk was not generated with ChatGPT: how AI is changing science
 
Knowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generationKnowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generation
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
The web of data: how are we doing so far
The web of data: how are we doing so farThe web of data: how are we doing so far
The web of data: how are we doing so far
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineering
 
Open government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impactOpen government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impact
 
Ten myths about knowledge graphs.pdf
Ten myths about knowledge graphs.pdfTen myths about knowledge graphs.pdf
Ten myths about knowledge graphs.pdf
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineering
 
Data commons and their role in fighting misinformation.pdf
Data commons and their role in fighting misinformation.pdfData commons and their role in fighting misinformation.pdf
Data commons and their role in fighting misinformation.pdf
 
Are our knowledge graphs trustworthy?
Are our knowledge graphs trustworthy?Are our knowledge graphs trustworthy?
Are our knowledge graphs trustworthy?
 
The web of data: how are we doing so far?
The web of data: how are we doing so far?The web of data: how are we doing so far?
The web of data: how are we doing so far?
 
Crowdsourcing and citizen engagement for people-centric smart cities
Crowdsourcing and citizen engagement for people-centric smart citiesCrowdsourcing and citizen engagement for people-centric smart cities
Crowdsourcing and citizen engagement for people-centric smart cities
 
Pie chart or pizza: identifying chart types and their virality on Twitter
Pie chart or pizza: identifying chart types and their virality on TwitterPie chart or pizza: identifying chart types and their virality on Twitter
Pie chart or pizza: identifying chart types and their virality on Twitter
 
High-value datasets: from publication to impact
High-value datasets: from publication to impactHigh-value datasets: from publication to impact
High-value datasets: from publication to impact
 
The story of Data Stories
The story of Data StoriesThe story of Data Stories
The story of Data Stories
 
The human face of AI: how collective and augmented intelligence can help sol...
The human face of AI:  how collective and augmented intelligence can help sol...The human face of AI:  how collective and augmented intelligence can help sol...
The human face of AI: how collective and augmented intelligence can help sol...
 
Qrowd and the city: designing people-centric smart cities
Qrowd and the city: designing people-centric smart citiesQrowd and the city: designing people-centric smart cities
Qrowd and the city: designing people-centric smart cities
 
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
 
Qrowd and the city
Qrowd and the cityQrowd and the city
Qrowd and the city
 
Inclusive cities: a crowdsourcing approach
Inclusive cities: a crowdsourcing approachInclusive cities: a crowdsourcing approach
Inclusive cities: a crowdsourcing approach
 

Recently uploaded

CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 

Recently uploaded (20)

CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 

Crowdsourcing-enabled Linked Data management architecture

  • 1. A semantically enabled architecture for crowdsourced Linked Data management Elena Simperl,1 Maribel Acosta,1 Barry Norton2 1Institute AIFB, Karlsruhe Institute of Technology, Germany 2Ontotext AD, Bulgaria Institute of Applied Informatics and Formal Description Methods (AIFB) Institute of Applied Informatics and Formal Description Methods (AIFB) KIT – University of the State of Baden-Wuerttemberg and National Research Center of the Helmholtz Association www.kit.edu
  • 2. Background: What is Linked Data? Linked Data: set of best practices to publish and connect structured data on the Web. URIs to identify entities and concepts in the world HTTP to access and retrieve resources and descriptions of these resources RDF as generic graph-based data model to structure and link data Taken together Linked Data is said to form a ‘cloud’ of shared references and vocabularies. Query language: SPARQL. http://linkeddata.org/faq 2 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 3. Background: Why Linked Data? Data.gov & public sector information: more BBC & media: added value of transparency and accountability in content through interlinking governance Google, Yahoo, Bing & schema.org: enhanced search 3 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 4. Outline 1 • Motivation 2 • Our Approach 3 • Extensions to VoID and SPARQL 4 • Crowdsourced query processing tasks 5 • Advantages 6 • Challenges 4 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 5. 1. Motivation User Query: Give me the German names of all commercial airports in Baden-Württemberg, ordered by their most informative description. „Retrieve the labels in German of commercial airports located in Baden-Württemberg, ordered by the better human-readable description of the airport given in the comment“. This query cannot be optimally answered automatically: Incorrect/missing classification of entities (e.g. classification as airports instead of commercial airports). Missing information in data sets (e.g. German labels). It is not possible to optimally perform subjective operations (e.g. comparisons of pictures or NL comments). 5 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 6. 1. Motivation „Retrieve the labels in German of commercial airports located in Baden-Württemberg, ordered by the better human- readable description of the airport given in the comment“. In order to answer the query as intended: Classification of airports as commercial airports. Identity resolution of places (Baden-Württemberg). Translation of the labels of the airports. Ordering of the comments by a subjective comparison. 6 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 7. 1. Motivation „Retrieve the labels in German of commercial airports located in Baden-Württemberg, ordered by the better human- readable description of the airport given in the comment“. SPARQL Query: SELECT ?label WHERE { Classification 1 ?x a metar:CommercialHubAirport; rdfs:label ?label; rdfs:comment ?comment . ?x geonames:parentFeature ?z . Identity Resolution 2 ?z owl:sameAs <http://dbpedia.org/resource/Baden-Wuerttemberg> . FILTER (LANG(?label) = "de") 3 Missing Information 4 Ordering } ORDER BY CROWD(?comment, "Better description of %x") 7 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 8. 1. Motivation: Our Aim SPARQL query engine, able to process queries using seamless combination of automatic query processing and crowdsourcing. Query Results Mediator SPARQL query engine Crowdsourced query processing Query parsing Task design UI generation Query optimization Query execution Wrapper Wrapper Wrapper Wrapper 8 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 9. 2. Our Approach Parser Query Results Decomposes the input query. SPARQL query engine Selects the data sets that should be Query parsing accessed to produce answers. Query optimization Rewrites the query into the internal Query execution structures. 9 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 10. 2. Our Approach Optimizer Query Results DB statistics and crowdsourcing SPARQL query engine statistics: estimated time to completion, Query parsing and other information about the performance (quality, cost) of the crowd. Query optimization Traditional data bases optimization Query execution techniques are implemented. Determines which parts of the query should be solved by human input: VoID and SPARQL extensions. Generates logical and physical plans. 10 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 11. 2. Our Approach Executor Query Results Implements physical operators. SPARQL query engine Invokes crowdsourcing component: Query parsing Creates tasks. Query optimization Generates UI. Query execution Infers facts automatically. Executes query against Linked Data: computational tasks. Incorporates results from the human input. 11 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 12. 3. Extensions to VoID and SPARQL The RDF based schema to describe data sets is VoID (Vocabulary of Interlinked Datasets). Common VoID predicates: voidDataset, void:inDataset, void:Linkset, void:linkPredicate, void:target. Automatic interlinking of datasets VoID extensions: CrowdClass CrowdProperty 13 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 13. 3. Extensions to VoID and SPARQL Automatic interlinking of data sets Example - Specification of Data Sets: :METAR rdf:type void:Dataset . METAR :Genonames rdf:type void:Dataset . owl:sameAs :METAR2Geonames rdf:type void:Linkset ; void:linkPredicate owl:sameAs ; void:target :METAR ; Geonames void:target :Geonames . 14 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 14. 3. Extensions to VoID and SPARQL CrowdClass - Specifies which entities of a data set could be crowdsourced. - All subclasses of the crowdClass are also defined (implicitly) as crowdsourced entities. Example: metar:Airport void:inDataset :METAR . metar:CommercialHubAirport void:inDataset :METAR; rdfs:subClass metar:Airport . metar:Airport rdf:type void:crowdClass . metar:CommercialHubAirport rdf:type void:crowdClass. 15 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 15. 3. Extensions to VoID and SPARQL RDF data can be queried using the language SPARQL. Common SPARQL operators: join, union, optional, filter, order by. Properties related to general ontology languages such as OWL are treated as extensions of SPARQL operators, and are modeled in our architecture as tasks. 16 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 16. 4. Tasks Formal, declarative description of the data and tasks using SPARQL patterns as a basis for the automatic design of HITs. Identity resolution Missing information Ontological classification Ordering (new operator) 17 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 17. 4.1. Ontological Classification It is not always possible to automatically infer classification from the properties. Example: Retrieve the names (labels) of METAR stations that correspond to commercial airports. SELECT ?label WHERE { ?station a metar:CommercialHubAirport; rdfs:label ?label .} Input: {?station a metar:Station; rdfs:label ?label; wgs84:lat ?lat; wgs84:long ?long} Output: {?station a ?type. ?type rdfs:subClassOf metar:Station} 18 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 18. 4.2. Ordering Orderings defined via less straightforward built-ins; for instance, the ordering of pictorial representations of entities. SPARQL extension: ORDER BY CROWD Example: Retrieves all airports and their pictures, and the pictures should be ordered according to the more representative image of the given airport. SELECT ?airport ?picture WHERE { ?airport a metar:Airport; foaf:depiction ?picture . } ORDER BY CROWD(?picture, "Most representative image for %airport") Input: {?airport foaf:depiction ?x, ?y} Output: {{(?x ?y) a rdf:List} UNION {(?y ?x) a rdf:List}} 19 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 19. 4.3. Computational tasks expressed as SPARQL queries Transitive relations inferred automatically, without requiring human intervention. Implementation of restrictions in SPIN. Identity Resolution Classification Ordering CONSTRUCT { CONSTRUCT { CONSTRUCT { ?a owl:sameAs ?c . ?a a ?b. {(?a ?b) a rdf:List .} } WHERE { ?b rdfs:subClassOf ?c. } WHERE { ?a owl:sameAs ?b . } WHERE { (?a ?x) a rdf:List . ?b owl:sameAs ?c . ?a rdfs:subClassOf ?c. (?x ?b) a rdf:List . } ?b rdfs:subClassOf ?b1. } ?b1 rdfs:subClassOf ?c. } 20 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 20. 5. Advantages Declarative description of data allows to decompose the query. Generation of the UIs automatically. Generation of human tasks on-the-fly and adjustment of the design of the task. Automatic consistency check of results by reasoning against validating ontology. 21 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 21. 6. Challenges Appropriate level of granularity for HITs design for specific SPARQL constructs. Caching Naively we can materialise HIT results into datasets. How to deal with partial coverage and dynamic datasets. Optimal user interfaces of graph-like content. Pricing and workers’ assignment. 22 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 22. QUESTIONS 23 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)