SlideShare a Scribd company logo
INSPIRE 2011, Edinburgh, June 29, 2011



Semantic Similarity
Assessment to Browse
Resources exposed as Linked
Data: an Application to
Habitat and Species Datasets
R. Albertoni, M. De Martino,
Institute for Applied Mathematics and Information Technologies
National Research Council (CNR), Italy
Outline
!   Linked data - Motivation

!   EUNIS Habitat and Species

!   Asymmetric and context dependent Semantic Similarity
    !     Two contexts
    !     Examples of assessments

!   Semantic similarity – Query refinement
    !     searching for geographical data set

!   Conclusion and remarks
Linked Data
Why Linked data ?

!     Data Portability across current Data Silos

!     HTTP based Open Database Connectivity

!     Platform Independent Data & Information Access Linked Data Spaces –

!     Serendipitous Discovery of relevant things via the Web

Examples of geographical related linked data datasets

EARTH, GEMET, EUNIS SPECIES & SITE, LINKED GEO DATA,
   GEONAMES …


               Items in “why Linked data” are borrowed from the Kingsley Idehen’s presentation
               “Creating_Deploying_Exploiting_Linked_Data2”
What can we do with linked
                data?
Applications already successful:

!     Improve/enrich the result returned by search engine (RDF/RDFa snippets)
      (Google, Yahoo)

!     Linked data driven mesh-ups considering data from different sources (LOD
      Graph,…)

What else we can do?

!     We want to push ahead with Serendipitous Discovery supporting decision
      making by analyzing Linked Data sources

!     Tools analyzing linked data: Context Dependent Instance Semantic
      Similarity
           !     Albertoni R., De Martino M., Asymmetric and context-dependent semantic
                 similarity among ontology instances, Journal on Data Semantics X, Springer Verlag,
                 pp 1-30, (2008).
EUNIS Species-Habitats
EUNIS Habitat and Species mapped in
 SKOS and published as Linked Data



                              skos:prefLabel


                                    URI:
               http://linkeddata.ge.imati.cnr.it:2020/…/B2.1




                             skos:description
Species and Habitats are instances of SKOS schema
                     skos:description “Beach and upper beach formations,
                     mostly of annuals of the low … ….. characteristic are
                     [Cakile edentula], [Polygonum norvegicum]
                     ([Polygonum oxyspermum ssp. raii]), [Atriplex longipes]
                     s.l., [Atriplex glabriuscula], [Mertensia maritima].




                                         Species are easily identifiable in
                                         the Habitat title and
                                         description !!!!

                                         We didn’t use SILK,
                                         We just developed an ad hoc
                                         interlinking procedure in JENA
Applying semantic
similarity on EUNIS
Species-Habitats

  Details among context formalization and mathematical formulas behind our semantic
  similarity are available in
  Albertoni R., De Martino M., Asymmetric and context-dependent semantic
  similarity among ontology instances, Journal on Data Semantics X,
   Springer Verlag, pp 1-30, (2008).
Definition of contexts- parameterizations
          of our instance similarity
Context 1:“habitat species-based similarity” habitats are compared
    according to the species that they host or vice versa

PREFIX skos: <http://www.w3.org/2004/02/skos/core#>

[skos:Concept]->{{},{(skos:relatedMatch, Inter)}

Context 2: “taxonomy-based similarity” habitats or species instances are
    compared with respect to their position in the taxonomy hierarchy

PREFIX skos: <http://www.w3.org/2004/02/skos/core#>

[skos:Concept]->{ {},{(skos:broader, Inter)}}

You can have contexts as complex as you want, for example
1)  considering different ontology schemas
2)  providing recursive similarity assessment
Context 1:“habitat species-based similarity” habitats are compared according to
    the species that they host or vice versa

 PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
    [skos:Concept]->{{},{(skos:relatedMatch, Inter)}




 SIM(B211,X)=SIM(X, B211)=0                SIM(B211,X)=2/4    SIM(X,B211)=1




SIM(B211,X)=1/3   SIM(X,B211)=1/2              SIM(B211,X)=SIM(X, B211)=1
Context 2: “taxonomy-based similarity” habitats or species instances
   are compared with respect to their position in the taxonomy
   hierarchy

PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
   [skos:Concept]->{ {},{(skos:broader, Inter)}}




                        JENA RULES to get skos:broader as transitive an reflexive relations
                        in order to compare nodes according to their ancestors

                        (?x skos:broader ?y) (?y skos:broader ?z)-> (?x skos:broader ?z)
                        (?y skos:broader ?z)-> (?y skos:broader ?y)
Our semantic similarity was adapted to work
            with Linked Data
(Here we have consider fairly “harmonized” linked data sets)

Semantic similarity design enhancements:

!     Direct access to linked data (No anymore centralized ontology driven
      repositories):
      !     (i) Follow your nose approach, (ii) RDF Dumps, (iii) SPARQL End Points

!     Increased independence from the ontology schema
      !     CONTEXTs can mix up different light weighted ontology schemas, since it is
            common practice in Linked data.

!     A reasoner to add simple RDF entailments

Quite challenging when we consider sources that are not “harmonized”
     !     non-authoritative resources, heterogeneous schema, non-consistently identified entities
     !     Riccardo Albertoni, Monica De Martino: Semantic Similarity and Selection of
           Resources Published According to Linked Data Best Practice. OTM Workshops 2010,
           LNCS vol. 6428/2010
Result considering Habitats and sub
     habitats of Coastal shingle (B2)




Context A
if SIM(X,Y)=1 and SIM(Y,X)=1 than Y contains the same species of X;
if SIM(X,Y)=1 and SIM(Y,X)<1 than Y contains the species of X but the
vice versa is not true;
SIM(X,Y) is proportional to the percentage of species in X that are
contained in Y out of the overall species of X.
Comparing species
according to habitats
they can be found in
HOW to USE IT
Example: Searching for data
    • you might want similarity to refine your keyword
    query
    •  habitats and species can be deployed as
    Thesaurus/controlled vocabulary

ADVANTAGES in our approach wrt other similarities
   • Different contexts  even more personalized
   suggestions
   • Asymmetry/Containment Highlighting  even
   more information when browsing the refinement
   alternatives
Conclusion
!     After publishing your data, let’s start to consume Linked Data not
      only for meshing up !!
      !     Assumed data is properly interlinked, we can consume data from
            different distributed sources and mixing up light weighted ontologies
            schemas.
      !     The more dataset are interlinked, the more are the potential contexts
            and similarity applications

!     Here we presented some very simple examples
      !     We can define more complex context considering instances’ relations
            and properties
      !     Our semantic similarity is a working prototype written in JAVA/JENA

!     Future work
      !     Further uses cases (Do you fancy trying our semantic similarity on your
            data? Let’s talk about it)
      !     Developments of a front end to define user-driven contexts
      !     Further reengineering of the prototype to scale up even more complex
            use cases

More Related Content

Similar to Semantic Similarity Assessment to Browse Resources exposed as Linked Data: an Application to Habitat and Species Datasets

Mining and Supporting Community Structures in Sensor Network Research
Mining and Supporting Community Structures in Sensor Network ResearchMining and Supporting Community Structures in Sensor Network Research
Mining and Supporting Community Structures in Sensor Network Research
Marko Rodriguez
 
Presentation at MTSR 2012
Presentation at MTSR 2012Presentation at MTSR 2012
Presentation at MTSR 2012
Riccardo Albertoni
 
Ontology engineering: Ontology alignment
Ontology engineering: Ontology alignmentOntology engineering: Ontology alignment
Ontology engineering: Ontology alignment
Guus Schreiber
 
Effective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP SystemsEffective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP Systems
Andre Freitas
 
Fueling the future with Semantic Web patterns - Keynote at WOP2014@ISWC
Fueling the future with Semantic Web patterns - Keynote at WOP2014@ISWCFueling the future with Semantic Web patterns - Keynote at WOP2014@ISWC
Fueling the future with Semantic Web patterns - Keynote at WOP2014@ISWC
Valentina Presutti
 
How To Make Linked Data More than Data
How To Make Linked Data More than DataHow To Make Linked Data More than Data
How To Make Linked Data More than Data
Artificial Intelligence Institute at UofSC
 
How To Make Linked Data More than Data
How To Make Linked Data More than DataHow To Make Linked Data More than Data
How To Make Linked Data More than Data
Amit Sheth
 
A semantic framework and software design to enable the transparent integratio...
A semantic framework and software design to enable the transparent integratio...A semantic framework and software design to enable the transparent integratio...
A semantic framework and software design to enable the transparent integratio...
Patricia Tavares Boralli
 
Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...
Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...
Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...
andrea huang
 
IDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATA
IDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATAIDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATA
IDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATA
ijistjournal
 
What is What, When?
What is What, When?What is What, When?
What is What, When?
Elizabeth McLean
 
Genealogical domain
Genealogical domainGenealogical domain
Genealogical domain
jcampany
 
Neno/Fhat: Semantic Network Programming Language and Virtual Machine Specific...
Neno/Fhat: Semantic Network Programming Language and Virtual Machine Specific...Neno/Fhat: Semantic Network Programming Language and Virtual Machine Specific...
Neno/Fhat: Semantic Network Programming Language and Virtual Machine Specific...
Marko Rodriguez
 
247th ACS Meeting: Experiment Markup Language (ExptML)
247th ACS Meeting: Experiment Markup Language (ExptML)247th ACS Meeting: Experiment Markup Language (ExptML)
247th ACS Meeting: Experiment Markup Language (ExptML)
Stuart Chalk
 
A Non-Technical, Example-Driven Introduction to Linked Data
A Non-Technical, Example-Driven Introduction to Linked DataA Non-Technical, Example-Driven Introduction to Linked Data
A Non-Technical, Example-Driven Introduction to Linked Data
kjanowicz
 
SKOS and Linked Data
SKOS and Linked DataSKOS and Linked Data
SKOS and Linked Data
Antoine Isaac
 
Cross-lingual event-mining using wordnet as a shared knowledge interface
Cross-lingual event-mining using wordnet as a shared knowledge interfaceCross-lingual event-mining using wordnet as a shared knowledge interface
Cross-lingual event-mining using wordnet as a shared knowledge interface
pathsproject
 
Eacl 2006 Pedersen
Eacl 2006 PedersenEacl 2006 Pedersen
Automatically converting tabular data to
Automatically converting tabular data toAutomatically converting tabular data to
Automatically converting tabular data to
IJwest
 
CL2009_ANNIS_pre
CL2009_ANNIS_preCL2009_ANNIS_pre
CL2009_ANNIS_pre
tutorialsruby
 

Similar to Semantic Similarity Assessment to Browse Resources exposed as Linked Data: an Application to Habitat and Species Datasets (20)

Mining and Supporting Community Structures in Sensor Network Research
Mining and Supporting Community Structures in Sensor Network ResearchMining and Supporting Community Structures in Sensor Network Research
Mining and Supporting Community Structures in Sensor Network Research
 
Presentation at MTSR 2012
Presentation at MTSR 2012Presentation at MTSR 2012
Presentation at MTSR 2012
 
Ontology engineering: Ontology alignment
Ontology engineering: Ontology alignmentOntology engineering: Ontology alignment
Ontology engineering: Ontology alignment
 
Effective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP SystemsEffective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP Systems
 
Fueling the future with Semantic Web patterns - Keynote at WOP2014@ISWC
Fueling the future with Semantic Web patterns - Keynote at WOP2014@ISWCFueling the future with Semantic Web patterns - Keynote at WOP2014@ISWC
Fueling the future with Semantic Web patterns - Keynote at WOP2014@ISWC
 
How To Make Linked Data More than Data
How To Make Linked Data More than DataHow To Make Linked Data More than Data
How To Make Linked Data More than Data
 
How To Make Linked Data More than Data
How To Make Linked Data More than DataHow To Make Linked Data More than Data
How To Make Linked Data More than Data
 
A semantic framework and software design to enable the transparent integratio...
A semantic framework and software design to enable the transparent integratio...A semantic framework and software design to enable the transparent integratio...
A semantic framework and software design to enable the transparent integratio...
 
Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...
Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...
Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...
 
IDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATA
IDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATAIDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATA
IDENTIFYING THE SEMANTIC RELATIONS ON UNSTRUCTURED DATA
 
What is What, When?
What is What, When?What is What, When?
What is What, When?
 
Genealogical domain
Genealogical domainGenealogical domain
Genealogical domain
 
Neno/Fhat: Semantic Network Programming Language and Virtual Machine Specific...
Neno/Fhat: Semantic Network Programming Language and Virtual Machine Specific...Neno/Fhat: Semantic Network Programming Language and Virtual Machine Specific...
Neno/Fhat: Semantic Network Programming Language and Virtual Machine Specific...
 
247th ACS Meeting: Experiment Markup Language (ExptML)
247th ACS Meeting: Experiment Markup Language (ExptML)247th ACS Meeting: Experiment Markup Language (ExptML)
247th ACS Meeting: Experiment Markup Language (ExptML)
 
A Non-Technical, Example-Driven Introduction to Linked Data
A Non-Technical, Example-Driven Introduction to Linked DataA Non-Technical, Example-Driven Introduction to Linked Data
A Non-Technical, Example-Driven Introduction to Linked Data
 
SKOS and Linked Data
SKOS and Linked DataSKOS and Linked Data
SKOS and Linked Data
 
Cross-lingual event-mining using wordnet as a shared knowledge interface
Cross-lingual event-mining using wordnet as a shared knowledge interfaceCross-lingual event-mining using wordnet as a shared knowledge interface
Cross-lingual event-mining using wordnet as a shared knowledge interface
 
Eacl 2006 Pedersen
Eacl 2006 PedersenEacl 2006 Pedersen
Eacl 2006 Pedersen
 
Automatically converting tabular data to
Automatically converting tabular data toAutomatically converting tabular data to
Automatically converting tabular data to
 
CL2009_ANNIS_pre
CL2009_ANNIS_preCL2009_ANNIS_pre
CL2009_ANNIS_pre
 

More from Riccardo Albertoni

Albertoni ldq workshop ESWC 2015
Albertoni ldq workshop ESWC 2015Albertoni ldq workshop ESWC 2015
Albertoni ldq workshop ESWC 2015
Riccardo Albertoni
 
Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)
Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)
Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)
Riccardo Albertoni
 
LusTRE: a Linked Thesaurus fRamework for Environment
LusTRE: a Linked Thesaurus fRamework for EnvironmentLusTRE: a Linked Thesaurus fRamework for Environment
LusTRE: a Linked Thesaurus fRamework for Environment
Riccardo Albertoni
 
Linkset quality (LWDM 2013)
Linkset quality (LWDM 2013)Linkset quality (LWDM 2013)
Linkset quality (LWDM 2013)
Riccardo Albertoni
 
Linkset quality
Linkset qualityLinkset quality
Linkset quality
Riccardo Albertoni
 
SSONDE: Semantic Similarity On liNked Data Entities
SSONDE: Semantic Similarity On liNked Data EntitiesSSONDE: Semantic Similarity On liNked Data Entities
SSONDE: Semantic Similarity On liNked Data Entities
Riccardo Albertoni
 
An ontology driven module for accessing chronic pathology literature- CHRONIO...
An ontology driven module for accessing chronic pathology literature- CHRONIO...An ontology driven module for accessing chronic pathology literature- CHRONIO...
An ontology driven module for accessing chronic pathology literature- CHRONIO...
Riccardo Albertoni
 
Semantic Similarity and Selection of Resources Published According to Linked ...
Semantic Similarity and Selection of Resources Published According to Linked ...Semantic Similarity and Selection of Resources Published According to Linked ...
Semantic Similarity and Selection of Resources Published According to Linked ...
Riccardo Albertoni
 
SKOS and semantic web best practice to access terminological resources: Natur...
SKOS and semantic web best practice to access terminological resources: Natur...SKOS and semantic web best practice to access terminological resources: Natur...
SKOS and semantic web best practice to access terminological resources: Natur...
Riccardo Albertoni
 

More from Riccardo Albertoni (9)

Albertoni ldq workshop ESWC 2015
Albertoni ldq workshop ESWC 2015Albertoni ldq workshop ESWC 2015
Albertoni ldq workshop ESWC 2015
 
Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)
Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)
Environmental Thesauri Under the Lens of Reusability (EGOVIS 2014)
 
LusTRE: a Linked Thesaurus fRamework for Environment
LusTRE: a Linked Thesaurus fRamework for EnvironmentLusTRE: a Linked Thesaurus fRamework for Environment
LusTRE: a Linked Thesaurus fRamework for Environment
 
Linkset quality (LWDM 2013)
Linkset quality (LWDM 2013)Linkset quality (LWDM 2013)
Linkset quality (LWDM 2013)
 
Linkset quality
Linkset qualityLinkset quality
Linkset quality
 
SSONDE: Semantic Similarity On liNked Data Entities
SSONDE: Semantic Similarity On liNked Data EntitiesSSONDE: Semantic Similarity On liNked Data Entities
SSONDE: Semantic Similarity On liNked Data Entities
 
An ontology driven module for accessing chronic pathology literature- CHRONIO...
An ontology driven module for accessing chronic pathology literature- CHRONIO...An ontology driven module for accessing chronic pathology literature- CHRONIO...
An ontology driven module for accessing chronic pathology literature- CHRONIO...
 
Semantic Similarity and Selection of Resources Published According to Linked ...
Semantic Similarity and Selection of Resources Published According to Linked ...Semantic Similarity and Selection of Resources Published According to Linked ...
Semantic Similarity and Selection of Resources Published According to Linked ...
 
SKOS and semantic web best practice to access terminological resources: Natur...
SKOS and semantic web best practice to access terminological resources: Natur...SKOS and semantic web best practice to access terminological resources: Natur...
SKOS and semantic web best practice to access terminological resources: Natur...
 

Recently uploaded

GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
Javier Junquera
 
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeckPoznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
FilipTomaszewski5
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
Fwdays
 
A Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's ArchitectureA Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's Architecture
ScyllaDB
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
Enterprise Knowledge
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
AstuteBusiness
 
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
DanBrown980551
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
Neo4j
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
Safe Software
 
Day 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio FundamentalsDay 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio Fundamentals
UiPathCommunity
 
AppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSFAppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSF
Ajin Abraham
 
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's TipsGetting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
ScyllaDB
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
DianaGray10
 
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin..."$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
Fwdays
 
Containers & AI - Beauty and the Beast!?!
Containers & AI - Beauty and the Beast!?!Containers & AI - Beauty and the Beast!?!
Containers & AI - Beauty and the Beast!?!
Tobias Schneck
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
c5vrf27qcz
 
What is an RPA CoE? Session 2 – CoE Roles
What is an RPA CoE?  Session 2 – CoE RolesWhat is an RPA CoE?  Session 2 – CoE Roles
What is an RPA CoE? Session 2 – CoE Roles
DianaGray10
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
operationspcvita
 

Recently uploaded (20)

GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
 
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeckPoznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
Poznań ACE event - 19.06.2024 Team 24 Wrapup slidedeck
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
 
A Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's ArchitectureA Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's Architecture
 
Demystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through StorytellingDemystifying Knowledge Management through Storytelling
Demystifying Knowledge Management through Storytelling
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
 
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
 
Day 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio FundamentalsDay 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio Fundamentals
 
AppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSFAppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSF
 
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's TipsGetting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
 
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin..."$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
 
Containers & AI - Beauty and the Beast!?!
Containers & AI - Beauty and the Beast!?!Containers & AI - Beauty and the Beast!?!
Containers & AI - Beauty and the Beast!?!
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
 
What is an RPA CoE? Session 2 – CoE Roles
What is an RPA CoE?  Session 2 – CoE RolesWhat is an RPA CoE?  Session 2 – CoE Roles
What is an RPA CoE? Session 2 – CoE Roles
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
 

Semantic Similarity Assessment to Browse Resources exposed as Linked Data: an Application to Habitat and Species Datasets

  • 1. INSPIRE 2011, Edinburgh, June 29, 2011 Semantic Similarity Assessment to Browse Resources exposed as Linked Data: an Application to Habitat and Species Datasets R. Albertoni, M. De Martino, Institute for Applied Mathematics and Information Technologies National Research Council (CNR), Italy
  • 2. Outline !   Linked data - Motivation !   EUNIS Habitat and Species !   Asymmetric and context dependent Semantic Similarity !   Two contexts !   Examples of assessments !   Semantic similarity – Query refinement !   searching for geographical data set !   Conclusion and remarks
  • 3. Linked Data Why Linked data ? !   Data Portability across current Data Silos !   HTTP based Open Database Connectivity !   Platform Independent Data & Information Access Linked Data Spaces – !   Serendipitous Discovery of relevant things via the Web Examples of geographical related linked data datasets EARTH, GEMET, EUNIS SPECIES & SITE, LINKED GEO DATA, GEONAMES … Items in “why Linked data” are borrowed from the Kingsley Idehen’s presentation “Creating_Deploying_Exploiting_Linked_Data2”
  • 4. What can we do with linked data? Applications already successful: !   Improve/enrich the result returned by search engine (RDF/RDFa snippets) (Google, Yahoo) !   Linked data driven mesh-ups considering data from different sources (LOD Graph,…) What else we can do? !   We want to push ahead with Serendipitous Discovery supporting decision making by analyzing Linked Data sources !   Tools analyzing linked data: Context Dependent Instance Semantic Similarity !   Albertoni R., De Martino M., Asymmetric and context-dependent semantic similarity among ontology instances, Journal on Data Semantics X, Springer Verlag, pp 1-30, (2008).
  • 6. EUNIS Habitat and Species mapped in SKOS and published as Linked Data skos:prefLabel URI: http://linkeddata.ge.imati.cnr.it:2020/…/B2.1 skos:description
  • 7. Species and Habitats are instances of SKOS schema skos:description “Beach and upper beach formations, mostly of annuals of the low … ….. characteristic are [Cakile edentula], [Polygonum norvegicum] ([Polygonum oxyspermum ssp. raii]), [Atriplex longipes] s.l., [Atriplex glabriuscula], [Mertensia maritima]. Species are easily identifiable in the Habitat title and description !!!! We didn’t use SILK, We just developed an ad hoc interlinking procedure in JENA
  • 8. Applying semantic similarity on EUNIS Species-Habitats Details among context formalization and mathematical formulas behind our semantic similarity are available in Albertoni R., De Martino M., Asymmetric and context-dependent semantic similarity among ontology instances, Journal on Data Semantics X, Springer Verlag, pp 1-30, (2008).
  • 9. Definition of contexts- parameterizations of our instance similarity Context 1:“habitat species-based similarity” habitats are compared according to the species that they host or vice versa PREFIX skos: <http://www.w3.org/2004/02/skos/core#> [skos:Concept]->{{},{(skos:relatedMatch, Inter)} Context 2: “taxonomy-based similarity” habitats or species instances are compared with respect to their position in the taxonomy hierarchy PREFIX skos: <http://www.w3.org/2004/02/skos/core#> [skos:Concept]->{ {},{(skos:broader, Inter)}} You can have contexts as complex as you want, for example 1)  considering different ontology schemas 2)  providing recursive similarity assessment
  • 10. Context 1:“habitat species-based similarity” habitats are compared according to the species that they host or vice versa PREFIX skos: <http://www.w3.org/2004/02/skos/core#> [skos:Concept]->{{},{(skos:relatedMatch, Inter)} SIM(B211,X)=SIM(X, B211)=0 SIM(B211,X)=2/4 SIM(X,B211)=1 SIM(B211,X)=1/3 SIM(X,B211)=1/2 SIM(B211,X)=SIM(X, B211)=1
  • 11. Context 2: “taxonomy-based similarity” habitats or species instances are compared with respect to their position in the taxonomy hierarchy PREFIX skos: <http://www.w3.org/2004/02/skos/core#> [skos:Concept]->{ {},{(skos:broader, Inter)}} JENA RULES to get skos:broader as transitive an reflexive relations in order to compare nodes according to their ancestors (?x skos:broader ?y) (?y skos:broader ?z)-> (?x skos:broader ?z) (?y skos:broader ?z)-> (?y skos:broader ?y)
  • 12. Our semantic similarity was adapted to work with Linked Data (Here we have consider fairly “harmonized” linked data sets) Semantic similarity design enhancements: !   Direct access to linked data (No anymore centralized ontology driven repositories): !   (i) Follow your nose approach, (ii) RDF Dumps, (iii) SPARQL End Points !   Increased independence from the ontology schema !   CONTEXTs can mix up different light weighted ontology schemas, since it is common practice in Linked data. !   A reasoner to add simple RDF entailments Quite challenging when we consider sources that are not “harmonized” !   non-authoritative resources, heterogeneous schema, non-consistently identified entities !   Riccardo Albertoni, Monica De Martino: Semantic Similarity and Selection of Resources Published According to Linked Data Best Practice. OTM Workshops 2010, LNCS vol. 6428/2010
  • 13. Result considering Habitats and sub habitats of Coastal shingle (B2) Context A if SIM(X,Y)=1 and SIM(Y,X)=1 than Y contains the same species of X; if SIM(X,Y)=1 and SIM(Y,X)<1 than Y contains the species of X but the vice versa is not true; SIM(X,Y) is proportional to the percentage of species in X that are contained in Y out of the overall species of X.
  • 14. Comparing species according to habitats they can be found in
  • 15. HOW to USE IT Example: Searching for data • you might want similarity to refine your keyword query •  habitats and species can be deployed as Thesaurus/controlled vocabulary ADVANTAGES in our approach wrt other similarities • Different contexts  even more personalized suggestions • Asymmetry/Containment Highlighting  even more information when browsing the refinement alternatives
  • 16. Conclusion !   After publishing your data, let’s start to consume Linked Data not only for meshing up !! !   Assumed data is properly interlinked, we can consume data from different distributed sources and mixing up light weighted ontologies schemas. !   The more dataset are interlinked, the more are the potential contexts and similarity applications !   Here we presented some very simple examples !   We can define more complex context considering instances’ relations and properties !   Our semantic similarity is a working prototype written in JAVA/JENA !   Future work !   Further uses cases (Do you fancy trying our semantic similarity on your data? Let’s talk about it) !   Developments of a front end to define user-driven contexts !   Further reengineering of the prototype to scale up even more complex use cases