SlideShare a Scribd company logo
1 of 22
Download to read offline
A semantically enabled architecture for
 crowdsourced Linked Data management
 Elena Simperl,1 Maribel Acosta,1 Barry Norton2
 1Institute         AIFB, Karlsruhe Institute of Technology, Germany
 2Ontotext          AD, Bulgaria
 Institute of Applied Informatics and Formal Description Methods (AIFB)
Institute of Applied Informatics and Formal Description Methods (AIFB)




 KIT – University of the State of Baden-Wuerttemberg and
 National Research Center of the Helmholtz Association                    www.kit.edu
Background: What is Linked Data?
      Linked Data: set of best practices
     to publish and connect structured
     data on the Web.
            URIs to identify entities and
            concepts in the world
            HTTP to access and retrieve
            resources and descriptions of
            these resources
            RDF as generic graph-based data
            model to structure and link data
      Taken together Linked Data is
     said to form a ‘cloud’ of shared
     references and vocabularies.
       Query language: SPARQL.

    http://linkeddata.org/faq
2     07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                   Linked Data management                                                                     Beschreibungsverfahren (AIFB)
Background: Why Linked Data?
Data.gov & public sector information: more                                 BBC & media: added value of
transparency and accountability in                                         content through interlinking
governance




                                                                          Google, Yahoo, Bing & schema.org:
                                                                          enhanced search




3   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                 Linked Data management                                                                     Beschreibungsverfahren (AIFB)
Outline

                 1       • Motivation

                 2       • Our Approach

                 3       • Extensions to VoID and SPARQL

                 4       • Crowdsourced query processing tasks

                 5       • Advantages

                 6       • Challenges

4   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                 Linked Data management                                                                     Beschreibungsverfahren (AIFB)
1. Motivation
                        User Query: Give me the German names of all commercial
                        airports in Baden-Württemberg, ordered by their most
                        informative description.

    „Retrieve the labels in German of commercial airports located
    in Baden-Württemberg, ordered by the better human-readable
    description of the airport given in the comment“.


         This query cannot be optimally answered automatically:
                 Incorrect/missing classification of entities (e.g. classification as
                 airports instead of commercial airports).
                 Missing information in data sets (e.g. German labels).
                 It is not possible to optimally perform subjective operations (e.g.
                 comparisons of pictures or NL comments).
5   07.06.2012    CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
1. Motivation
    „Retrieve the labels in German of commercial airports
    located in Baden-Württemberg, ordered by the better human-
    readable description of the airport given in the comment“.


         In order to answer the query as intended:
                 Classification of airports as commercial airports.
                 Identity resolution of places (Baden-Württemberg).
                 Translation of the labels of the airports.
                 Ordering of the comments by a subjective comparison.




6   07.06.2012    CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
1. Motivation
    „Retrieve the labels in German of commercial airports
    located in Baden-Württemberg, ordered by the better human-
    readable description of the airport given in the comment“.

    SPARQL Query:
    SELECT ?label WHERE {                                                       Classification
                                                                            1
      ?x a metar:CommercialHubAirport;
          rdfs:label ?label;
          rdfs:comment ?comment .
      ?x geonames:parentFeature ?z .                                                                   Identity Resolution
                                                                                                                                         2
      ?z owl:sameAs <http://dbpedia.org/resource/Baden-Wuerttemberg> .
      FILTER (LANG(?label) = "de") 3 Missing Information
                                                           4 Ordering
    } ORDER BY CROWD(?comment, "Better description of %x")


7   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced         Institut für Angewandte Informatik und Formale
                 Linked Data management                                                                           Beschreibungsverfahren (AIFB)
1. Motivation: Our Aim
        SPARQL query engine, able to process queries using
       seamless combination of automatic query processing and
       crowdsourcing.
                        Query                     Results
                                                                                                        Mediator
                         SPARQL query engine                             Crowdsourced query processing
                              Query parsing                                 Task design        UI generation

                           Query optimization

                             Query execution


                     Wrapper       Wrapper         Wrapper                                 Wrapper




8   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced             Institut für Angewandte Informatik und Formale
                 Linked Data management                                                                               Beschreibungsverfahren (AIFB)
2. Our Approach

                                                Parser

     Query                    Results                Decomposes the input query.
       SPARQL query engine
                                                     Selects the data sets that should be
             Query parsing
                                                     accessed to produce answers.
          Query optimization
                                                     Rewrites the query into the internal
             Query execution                         structures.




9   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                 Linked Data management                                                                     Beschreibungsverfahren (AIFB)
2. Our Approach

                                                 Optimizer

      Query                    Results                DB statistics and crowdsourcing
        SPARQL query engine                           statistics: estimated time to completion,
               Query parsing
                                                      and other information about the
                                                      performance (quality, cost) of the crowd.
          Query optimization

                                                      Traditional data bases optimization
              Query execution
                                                      techniques are implemented.

                                                      Determines which parts of the query
                                                      should be solved by human input: VoID
                                                      and SPARQL extensions.

                                                      Generates logical and physical plans.
10   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
2. Our Approach

                                                 Executor

      Query                    Results                Implements physical operators.
        SPARQL query engine
                                                      Invokes crowdsourcing component:
              Query parsing

                                                              Creates tasks.
           Query optimization
                                                              Generates UI.
            Query execution
                                                              Infers facts automatically.

                                                      Executes query against Linked Data:
                                                      computational tasks.

                                                      Incorporates results from the human
                                                      input.
11   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
3. Extensions to VoID and SPARQL
         The RDF based schema to describe data sets is VoID
        (Vocabulary of Interlinked Datasets).



         Common VoID predicates: voidDataset,
        void:inDataset, void:Linkset, void:linkPredicate,
        void:target.
                                                               Automatic interlinking of datasets

          VoID extensions:                                     CrowdClass

                                                               CrowdProperty

13   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
3. Extensions to VoID and SPARQL
          Automatic interlinking of data sets

     Example - Specification of Data Sets:

     :METAR rdf:type void:Dataset .                                                         METAR
     :Genonames rdf:type void:Dataset .
                                                                                                                  owl:sameAs


     :METAR2Geonames rdf:type void:Linkset ;
          void:linkPredicate owl:sameAs ;
          void:target :METAR ;                                                                           Geonames
          void:target :Geonames .

14   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
3. Extensions to VoID and SPARQL

           CrowdClass
      - Specifies which entities of a data set could be crowdsourced.
      - All subclasses of the crowdClass are also defined (implicitly)
        as crowdsourced entities.

     Example:
     metar:Airport void:inDataset :METAR .
     metar:CommercialHubAirport void:inDataset :METAR;
            rdfs:subClass metar:Airport .
     metar:Airport rdf:type void:crowdClass .
     metar:CommercialHubAirport rdf:type void:crowdClass.


15    07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                   Linked Data management                                                                     Beschreibungsverfahren (AIFB)
3. Extensions to VoID and SPARQL
          RDF data can be queried using the language SPARQL.

         Common SPARQL operators: join, union, optional,
        filter, order by.


         Properties related to general ontology languages such as
        OWL are treated as extensions of SPARQL operators,
        and are modeled in our architecture as tasks.




16   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
4. Tasks

         Formal, declarative description of the data and
        tasks using SPARQL patterns as a basis for the
        automatic design of HITs.

                  Identity resolution

                  Missing information

                  Ontological classification

                  Ordering (new operator)


17   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
4.1. Ontological Classification
         It is not always possible to automatically infer classification
        from the properties.
         Example: Retrieve the names (labels) of METAR stations that
        correspond to commercial airports.

     SELECT ?label WHERE {
       ?station a metar:CommercialHubAirport;
         rdfs:label ?label .}

     Input:        {?station a metar:Station;
                      rdfs:label ?label;
                      wgs84:lat ?lat;
                      wgs84:long ?long}

     Output: {?station a ?type.
              ?type rdfs:subClassOf metar:Station}
18   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
4.2. Ordering
         Orderings defined via less straightforward built-ins; for
        instance, the ordering of pictorial representations of entities.
         SPARQL extension: ORDER BY CROWD
          Example: Retrieves all airports and their pictures, and the pictures should
        be ordered according to the more representative image of the given airport.

SELECT ?airport ?picture WHERE {
  ?airport a metar:Airport;
    foaf:depiction ?picture .
} ORDER BY CROWD(?picture,
"Most representative image for %airport")

 Input:       {?airport foaf:depiction ?x, ?y}

Output: {{(?x ?y) a rdf:List} UNION {(?y ?x) a rdf:List}}

19   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
4.3. Computational tasks expressed as
     SPARQL queries

          Transitive relations inferred automatically, without
         requiring human intervention.

           Implementation of restrictions in SPIN.

     Identity Resolution                            Classification                           Ordering
     CONSTRUCT {                                    CONSTRUCT {                              CONSTRUCT {
      ?a owl:sameAs ?c .                             ?a a ?b.                                 {(?a ?b) a rdf:List .}
     } WHERE {                                        ?b rdfs:subClassOf ?c.                 } WHERE {
      ?a owl:sameAs ?b .                            } WHERE {                                 (?a ?x) a rdf:List .
      ?b owl:sameAs ?c .                              ?a rdfs:subClassOf ?c.                  (?x ?b) a rdf:List .
     }                                                ?b rdfs:subClassOf ?b1.                }
                                                      ?b1 rdfs:subClassOf ?c.
                                                    }


20    07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced     Institut für Angewandte Informatik und Formale
                   Linked Data management                                                                       Beschreibungsverfahren (AIFB)
5. Advantages
         Declarative description of data allows to decompose the
        query.

          Generation of the UIs automatically.

         Generation of human tasks on-the-fly and adjustment of
        the design of the task.

         Automatic consistency check of results by reasoning
        against validating ontology.




21   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
6. Challenges
         Appropriate level of granularity for HITs design for specific
        SPARQL constructs.
          Caching
            Naively we can materialise HIT results into datasets.
                  How to deal with partial coverage and dynamic datasets.

          Optimal user interfaces of graph-like content.

          Pricing and workers’ assignment.




22   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)
QUESTIONS



23   07.06.2012   CrowdSearch 2012 - A semantically enabled architecture for crowdsourced   Institut für Angewandte Informatik und Formale
                  Linked Data management                                                                     Beschreibungsverfahren (AIFB)

More Related Content

What's hot

Hadoop - Now, Next and Beyond
Hadoop - Now, Next and BeyondHadoop - Now, Next and Beyond
Hadoop - Now, Next and BeyondTeradata Aster
 
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...SBGC
 
Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04nihshowandtell
 
AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS
AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS
AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS ijaia
 
Linked Open data: CNR
Linked Open data: CNRLinked Open data: CNR
Linked Open data: CNRDatiGovIT
 
Big Data and Data Standardization at LinkedIn
Big Data and Data Standardization at LinkedInBig Data and Data Standardization at LinkedIn
Big Data and Data Standardization at LinkedInAlexis Baird
 
Flexible querying of relational databases fuzzy set based approach 27-11
Flexible querying of relational databases fuzzy set based approach 27-11Flexible querying of relational databases fuzzy set based approach 27-11
Flexible querying of relational databases fuzzy set based approach 27-11Adel Sabour
 

What's hot (10)

Hadoop - Now, Next and Beyond
Hadoop - Now, Next and BeyondHadoop - Now, Next and Beyond
Hadoop - Now, Next and Beyond
 
Pacename
PacenamePacename
Pacename
 
02 Web Search
02 Web Search02 Web Search
02 Web Search
 
Treasure Data: Big Data Analytics on Heroku
Treasure Data: Big Data Analytics on HerokuTreasure Data: Big Data Analytics on Heroku
Treasure Data: Big Data Analytics on Heroku
 
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
J2EE ieee projects 2011 SBGC ( Trichy, Chennai, Tirupati, Nellore, Kadapa, Ku...
 
Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04
 
AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS
AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS
AUTOMATED TOOL FOR RESUME CLASSIFICATION USING SEMENTIC ANALYSIS
 
Linked Open data: CNR
Linked Open data: CNRLinked Open data: CNR
Linked Open data: CNR
 
Big Data and Data Standardization at LinkedIn
Big Data and Data Standardization at LinkedInBig Data and Data Standardization at LinkedIn
Big Data and Data Standardization at LinkedIn
 
Flexible querying of relational databases fuzzy set based approach 27-11
Flexible querying of relational databases fuzzy set based approach 27-11Flexible querying of relational databases fuzzy set based approach 27-11
Flexible querying of relational databases fuzzy set based approach 27-11
 

Similar to Crowdsourcing-enabled Linked Data management architecture

Crowdsourcing tasks in Linked Data management
Crowdsourcing tasks in Linked Data managementCrowdsourcing tasks in Linked Data management
Crowdsourcing tasks in Linked Data managementBarry Norton
 
Linked data for Enterprise Data Integration
Linked data for Enterprise Data IntegrationLinked data for Enterprise Data Integration
Linked data for Enterprise Data IntegrationSören Auer
 
Semtech 2011 impressions
Semtech 2011 impressionsSemtech 2011 impressions
Semtech 2011 impressionsGeorge Roth
 
Big Data Real Time Applications
Big Data Real Time ApplicationsBig Data Real Time Applications
Big Data Real Time ApplicationsDataWorks Summit
 
“Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services” “Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services” diannepatricia
 
Session 0.0 poster minutes madness
Session 0.0   poster minutes madnessSession 0.0   poster minutes madness
Session 0.0 poster minutes madnesssemanticsconference
 
X api chinese cop monthly meeting feb.2016
X api chinese cop monthly meeting   feb.2016X api chinese cop monthly meeting   feb.2016
X api chinese cop monthly meeting feb.2016Jessie Chuang
 
Apache Spark and MongoDB - Turning Analytics into Real-Time Action
Apache Spark and MongoDB - Turning Analytics into Real-Time ActionApache Spark and MongoDB - Turning Analytics into Real-Time Action
Apache Spark and MongoDB - Turning Analytics into Real-Time ActionJoão Gabriel Lima
 
Applied Semantic Search with Microsoft SQL Server
Applied Semantic Search with Microsoft SQL ServerApplied Semantic Search with Microsoft SQL Server
Applied Semantic Search with Microsoft SQL ServerMark Tabladillo
 
MongoDB_Spark
MongoDB_SparkMongoDB_Spark
MongoDB_SparkMat Keep
 
How to Share and Reuse Learning Resources: the ARIADNE Experience
How to Share and Reuse Learning Resources: the ARIADNE ExperienceHow to Share and Reuse Learning Resources: the ARIADNE Experience
How to Share and Reuse Learning Resources: the ARIADNE ExperienceJoris Klerkx
 
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software ComponentsFIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software ComponentsFIWARE
 
Sem tech 2011 v8
Sem tech 2011 v8Sem tech 2011 v8
Sem tech 2011 v8dallemang
 
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012Calpont Corporation
 
ConceptClassifier for SharePoint Turbo Charging the Public Sector
ConceptClassifier for SharePoint Turbo Charging the Public SectorConceptClassifier for SharePoint Turbo Charging the Public Sector
ConceptClassifier for SharePoint Turbo Charging the Public Sectormartingarland
 

Similar to Crowdsourcing-enabled Linked Data management architecture (20)

Aaai2012
Aaai2012Aaai2012
Aaai2012
 
Crowdsourcing tasks in Linked Data management
Crowdsourcing tasks in Linked Data managementCrowdsourcing tasks in Linked Data management
Crowdsourcing tasks in Linked Data management
 
Linked data for Enterprise Data Integration
Linked data for Enterprise Data IntegrationLinked data for Enterprise Data Integration
Linked data for Enterprise Data Integration
 
Semtech 2011 impressions
Semtech 2011 impressionsSemtech 2011 impressions
Semtech 2011 impressions
 
Big Data Real Time Applications
Big Data Real Time ApplicationsBig Data Real Time Applications
Big Data Real Time Applications
 
“Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services” “Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services”
 
Session 0.0 poster minutes madness
Session 0.0   poster minutes madnessSession 0.0   poster minutes madness
Session 0.0 poster minutes madness
 
X api chinese cop monthly meeting feb.2016
X api chinese cop monthly meeting   feb.2016X api chinese cop monthly meeting   feb.2016
X api chinese cop monthly meeting feb.2016
 
Apache Spark and MongoDB - Turning Analytics into Real-Time Action
Apache Spark and MongoDB - Turning Analytics into Real-Time ActionApache Spark and MongoDB - Turning Analytics into Real-Time Action
Apache Spark and MongoDB - Turning Analytics into Real-Time Action
 
Applied Semantic Search with Microsoft SQL Server
Applied Semantic Search with Microsoft SQL ServerApplied Semantic Search with Microsoft SQL Server
Applied Semantic Search with Microsoft SQL Server
 
Sub1583
Sub1583Sub1583
Sub1583
 
MongoDB_Spark
MongoDB_SparkMongoDB_Spark
MongoDB_Spark
 
How to Share and Reuse Learning Resources: the ARIADNE Experience
How to Share and Reuse Learning Resources: the ARIADNE ExperienceHow to Share and Reuse Learning Resources: the ARIADNE Experience
How to Share and Reuse Learning Resources: the ARIADNE Experience
 
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software ComponentsFIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
 
Sem tech 2011 v8
Sem tech 2011 v8Sem tech 2011 v8
Sem tech 2011 v8
 
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
 
ConceptClassifier for SharePoint Turbo Charging the Public Sector
ConceptClassifier for SharePoint Turbo Charging the Public SectorConceptClassifier for SharePoint Turbo Charging the Public Sector
ConceptClassifier for SharePoint Turbo Charging the Public Sector
 
Role of Semantic Web in Health Informatics
Role of Semantic Web in Health InformaticsRole of Semantic Web in Health Informatics
Role of Semantic Web in Health Informatics
 
MarkAndrews
MarkAndrewsMarkAndrews
MarkAndrews
 
LRMI in Context, Brandt Redd
LRMI in Context, Brandt ReddLRMI in Context, Brandt Redd
LRMI in Context, Brandt Redd
 

More from Elena Simperl

This talk was not generated with ChatGPT: how AI is changing science
This talk was not generated with ChatGPT: how AI is changing scienceThis talk was not generated with ChatGPT: how AI is changing science
This talk was not generated with ChatGPT: how AI is changing scienceElena Simperl
 
Knowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generationKnowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generationElena Simperl
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backElena Simperl
 
The web of data: how are we doing so far
The web of data: how are we doing so farThe web of data: how are we doing so far
The web of data: how are we doing so farElena Simperl
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringElena Simperl
 
Open government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impactOpen government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impactElena Simperl
 
Ten myths about knowledge graphs.pdf
Ten myths about knowledge graphs.pdfTen myths about knowledge graphs.pdf
Ten myths about knowledge graphs.pdfElena Simperl
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringElena Simperl
 
Data commons and their role in fighting misinformation.pdf
Data commons and their role in fighting misinformation.pdfData commons and their role in fighting misinformation.pdf
Data commons and their role in fighting misinformation.pdfElena Simperl
 
Are our knowledge graphs trustworthy?
Are our knowledge graphs trustworthy?Are our knowledge graphs trustworthy?
Are our knowledge graphs trustworthy?Elena Simperl
 
The web of data: how are we doing so far?
The web of data: how are we doing so far?The web of data: how are we doing so far?
The web of data: how are we doing so far?Elena Simperl
 
Crowdsourcing and citizen engagement for people-centric smart cities
Crowdsourcing and citizen engagement for people-centric smart citiesCrowdsourcing and citizen engagement for people-centric smart cities
Crowdsourcing and citizen engagement for people-centric smart citiesElena Simperl
 
Pie chart or pizza: identifying chart types and their virality on Twitter
Pie chart or pizza: identifying chart types and their virality on TwitterPie chart or pizza: identifying chart types and their virality on Twitter
Pie chart or pizza: identifying chart types and their virality on TwitterElena Simperl
 
High-value datasets: from publication to impact
High-value datasets: from publication to impactHigh-value datasets: from publication to impact
High-value datasets: from publication to impactElena Simperl
 
The story of Data Stories
The story of Data StoriesThe story of Data Stories
The story of Data StoriesElena Simperl
 
The human face of AI: how collective and augmented intelligence can help sol...
The human face of AI:  how collective and augmented intelligence can help sol...The human face of AI:  how collective and augmented intelligence can help sol...
The human face of AI: how collective and augmented intelligence can help sol...Elena Simperl
 
Qrowd and the city: designing people-centric smart cities
Qrowd and the city: designing people-centric smart citiesQrowd and the city: designing people-centric smart cities
Qrowd and the city: designing people-centric smart citiesElena Simperl
 
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...Elena Simperl
 
Inclusive cities: a crowdsourcing approach
Inclusive cities: a crowdsourcing approachInclusive cities: a crowdsourcing approach
Inclusive cities: a crowdsourcing approachElena Simperl
 

More from Elena Simperl (20)

This talk was not generated with ChatGPT: how AI is changing science
This talk was not generated with ChatGPT: how AI is changing scienceThis talk was not generated with ChatGPT: how AI is changing science
This talk was not generated with ChatGPT: how AI is changing science
 
Knowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generationKnowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generation
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
The web of data: how are we doing so far
The web of data: how are we doing so farThe web of data: how are we doing so far
The web of data: how are we doing so far
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineering
 
Open government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impactOpen government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impact
 
Ten myths about knowledge graphs.pdf
Ten myths about knowledge graphs.pdfTen myths about knowledge graphs.pdf
Ten myths about knowledge graphs.pdf
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineering
 
Data commons and their role in fighting misinformation.pdf
Data commons and their role in fighting misinformation.pdfData commons and their role in fighting misinformation.pdf
Data commons and their role in fighting misinformation.pdf
 
Are our knowledge graphs trustworthy?
Are our knowledge graphs trustworthy?Are our knowledge graphs trustworthy?
Are our knowledge graphs trustworthy?
 
The web of data: how are we doing so far?
The web of data: how are we doing so far?The web of data: how are we doing so far?
The web of data: how are we doing so far?
 
Crowdsourcing and citizen engagement for people-centric smart cities
Crowdsourcing and citizen engagement for people-centric smart citiesCrowdsourcing and citizen engagement for people-centric smart cities
Crowdsourcing and citizen engagement for people-centric smart cities
 
Pie chart or pizza: identifying chart types and their virality on Twitter
Pie chart or pizza: identifying chart types and their virality on TwitterPie chart or pizza: identifying chart types and their virality on Twitter
Pie chart or pizza: identifying chart types and their virality on Twitter
 
High-value datasets: from publication to impact
High-value datasets: from publication to impactHigh-value datasets: from publication to impact
High-value datasets: from publication to impact
 
The story of Data Stories
The story of Data StoriesThe story of Data Stories
The story of Data Stories
 
The human face of AI: how collective and augmented intelligence can help sol...
The human face of AI:  how collective and augmented intelligence can help sol...The human face of AI:  how collective and augmented intelligence can help sol...
The human face of AI: how collective and augmented intelligence can help sol...
 
Qrowd and the city: designing people-centric smart cities
Qrowd and the city: designing people-centric smart citiesQrowd and the city: designing people-centric smart cities
Qrowd and the city: designing people-centric smart cities
 
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
 
Qrowd and the city
Qrowd and the cityQrowd and the city
Qrowd and the city
 
Inclusive cities: a crowdsourcing approach
Inclusive cities: a crowdsourcing approachInclusive cities: a crowdsourcing approach
Inclusive cities: a crowdsourcing approach
 

Recently uploaded

Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfakmcokerachita
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfUmakantAnnand
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 

Recently uploaded (20)

Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdf
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.Compdf
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 

Crowdsourcing-enabled Linked Data management architecture

  • 1. A semantically enabled architecture for crowdsourced Linked Data management Elena Simperl,1 Maribel Acosta,1 Barry Norton2 1Institute AIFB, Karlsruhe Institute of Technology, Germany 2Ontotext AD, Bulgaria Institute of Applied Informatics and Formal Description Methods (AIFB) Institute of Applied Informatics and Formal Description Methods (AIFB) KIT – University of the State of Baden-Wuerttemberg and National Research Center of the Helmholtz Association www.kit.edu
  • 2. Background: What is Linked Data? Linked Data: set of best practices to publish and connect structured data on the Web. URIs to identify entities and concepts in the world HTTP to access and retrieve resources and descriptions of these resources RDF as generic graph-based data model to structure and link data Taken together Linked Data is said to form a ‘cloud’ of shared references and vocabularies. Query language: SPARQL. http://linkeddata.org/faq 2 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 3. Background: Why Linked Data? Data.gov & public sector information: more BBC & media: added value of transparency and accountability in content through interlinking governance Google, Yahoo, Bing & schema.org: enhanced search 3 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 4. Outline 1 • Motivation 2 • Our Approach 3 • Extensions to VoID and SPARQL 4 • Crowdsourced query processing tasks 5 • Advantages 6 • Challenges 4 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 5. 1. Motivation User Query: Give me the German names of all commercial airports in Baden-Württemberg, ordered by their most informative description. „Retrieve the labels in German of commercial airports located in Baden-Württemberg, ordered by the better human-readable description of the airport given in the comment“. This query cannot be optimally answered automatically: Incorrect/missing classification of entities (e.g. classification as airports instead of commercial airports). Missing information in data sets (e.g. German labels). It is not possible to optimally perform subjective operations (e.g. comparisons of pictures or NL comments). 5 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 6. 1. Motivation „Retrieve the labels in German of commercial airports located in Baden-Württemberg, ordered by the better human- readable description of the airport given in the comment“. In order to answer the query as intended: Classification of airports as commercial airports. Identity resolution of places (Baden-Württemberg). Translation of the labels of the airports. Ordering of the comments by a subjective comparison. 6 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 7. 1. Motivation „Retrieve the labels in German of commercial airports located in Baden-Württemberg, ordered by the better human- readable description of the airport given in the comment“. SPARQL Query: SELECT ?label WHERE { Classification 1 ?x a metar:CommercialHubAirport; rdfs:label ?label; rdfs:comment ?comment . ?x geonames:parentFeature ?z . Identity Resolution 2 ?z owl:sameAs <http://dbpedia.org/resource/Baden-Wuerttemberg> . FILTER (LANG(?label) = "de") 3 Missing Information 4 Ordering } ORDER BY CROWD(?comment, "Better description of %x") 7 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 8. 1. Motivation: Our Aim SPARQL query engine, able to process queries using seamless combination of automatic query processing and crowdsourcing. Query Results Mediator SPARQL query engine Crowdsourced query processing Query parsing Task design UI generation Query optimization Query execution Wrapper Wrapper Wrapper Wrapper 8 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 9. 2. Our Approach Parser Query Results Decomposes the input query. SPARQL query engine Selects the data sets that should be Query parsing accessed to produce answers. Query optimization Rewrites the query into the internal Query execution structures. 9 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 10. 2. Our Approach Optimizer Query Results DB statistics and crowdsourcing SPARQL query engine statistics: estimated time to completion, Query parsing and other information about the performance (quality, cost) of the crowd. Query optimization Traditional data bases optimization Query execution techniques are implemented. Determines which parts of the query should be solved by human input: VoID and SPARQL extensions. Generates logical and physical plans. 10 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 11. 2. Our Approach Executor Query Results Implements physical operators. SPARQL query engine Invokes crowdsourcing component: Query parsing Creates tasks. Query optimization Generates UI. Query execution Infers facts automatically. Executes query against Linked Data: computational tasks. Incorporates results from the human input. 11 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 12. 3. Extensions to VoID and SPARQL The RDF based schema to describe data sets is VoID (Vocabulary of Interlinked Datasets). Common VoID predicates: voidDataset, void:inDataset, void:Linkset, void:linkPredicate, void:target. Automatic interlinking of datasets VoID extensions: CrowdClass CrowdProperty 13 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 13. 3. Extensions to VoID and SPARQL Automatic interlinking of data sets Example - Specification of Data Sets: :METAR rdf:type void:Dataset . METAR :Genonames rdf:type void:Dataset . owl:sameAs :METAR2Geonames rdf:type void:Linkset ; void:linkPredicate owl:sameAs ; void:target :METAR ; Geonames void:target :Geonames . 14 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 14. 3. Extensions to VoID and SPARQL CrowdClass - Specifies which entities of a data set could be crowdsourced. - All subclasses of the crowdClass are also defined (implicitly) as crowdsourced entities. Example: metar:Airport void:inDataset :METAR . metar:CommercialHubAirport void:inDataset :METAR; rdfs:subClass metar:Airport . metar:Airport rdf:type void:crowdClass . metar:CommercialHubAirport rdf:type void:crowdClass. 15 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 15. 3. Extensions to VoID and SPARQL RDF data can be queried using the language SPARQL. Common SPARQL operators: join, union, optional, filter, order by. Properties related to general ontology languages such as OWL are treated as extensions of SPARQL operators, and are modeled in our architecture as tasks. 16 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 16. 4. Tasks Formal, declarative description of the data and tasks using SPARQL patterns as a basis for the automatic design of HITs. Identity resolution Missing information Ontological classification Ordering (new operator) 17 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 17. 4.1. Ontological Classification It is not always possible to automatically infer classification from the properties. Example: Retrieve the names (labels) of METAR stations that correspond to commercial airports. SELECT ?label WHERE { ?station a metar:CommercialHubAirport; rdfs:label ?label .} Input: {?station a metar:Station; rdfs:label ?label; wgs84:lat ?lat; wgs84:long ?long} Output: {?station a ?type. ?type rdfs:subClassOf metar:Station} 18 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 18. 4.2. Ordering Orderings defined via less straightforward built-ins; for instance, the ordering of pictorial representations of entities. SPARQL extension: ORDER BY CROWD Example: Retrieves all airports and their pictures, and the pictures should be ordered according to the more representative image of the given airport. SELECT ?airport ?picture WHERE { ?airport a metar:Airport; foaf:depiction ?picture . } ORDER BY CROWD(?picture, "Most representative image for %airport") Input: {?airport foaf:depiction ?x, ?y} Output: {{(?x ?y) a rdf:List} UNION {(?y ?x) a rdf:List}} 19 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 19. 4.3. Computational tasks expressed as SPARQL queries Transitive relations inferred automatically, without requiring human intervention. Implementation of restrictions in SPIN. Identity Resolution Classification Ordering CONSTRUCT { CONSTRUCT { CONSTRUCT { ?a owl:sameAs ?c . ?a a ?b. {(?a ?b) a rdf:List .} } WHERE { ?b rdfs:subClassOf ?c. } WHERE { ?a owl:sameAs ?b . } WHERE { (?a ?x) a rdf:List . ?b owl:sameAs ?c . ?a rdfs:subClassOf ?c. (?x ?b) a rdf:List . } ?b rdfs:subClassOf ?b1. } ?b1 rdfs:subClassOf ?c. } 20 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 20. 5. Advantages Declarative description of data allows to decompose the query. Generation of the UIs automatically. Generation of human tasks on-the-fly and adjustment of the design of the task. Automatic consistency check of results by reasoning against validating ontology. 21 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 21. 6. Challenges Appropriate level of granularity for HITs design for specific SPARQL constructs. Caching Naively we can materialise HIT results into datasets. How to deal with partial coverage and dynamic datasets. Optimal user interfaces of graph-like content. Pricing and workers’ assignment. 22 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)
  • 22. QUESTIONS 23 07.06.2012 CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Institut für Angewandte Informatik und Formale Linked Data management Beschreibungsverfahren (AIFB)