SlideShare a Scribd company logo
1 of 19
Download to read offline
A SEMANTIC SCHEMA
FOR GEONAMES
Enzo Maltese
Feroz Farazi
Index
• Introduction
• GeoNames
• DERA Methodology
• Faceted Ontology
• Semantic Schema
• Result
• Conclusion
INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
Introduction
• With the proliferation of the Web, dataset publishing is
also getting a momentum
• With Linked Open Data (LOD) initiative
• Old datasets are published in machine readable format
• Many new datasets are made available on the Web
• As part of LOD initiative essentially we observe that
datasets are there with increased
• Machine readability
• Availability
• Ease of finding through links
INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
Introduction
• However, in the LOD initiative more emphasis is
given on the quantity of the data than its quality
• In this work we proposed a semantic schema that can
help improve the quality while enforcing it on the data
• Our approach is domain independent
• We performed a detailed case study using GeoNames
INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
GeoNames
• GeoNames is a geo-spatial database consists of various
locations of all countries
• It includes geographical data
• Place names in various languages
• Latitude
• Longitude
• Altitude
• Class
• Latitude and longitude coordinates are stored according
to the WGS84
INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
GeoNames
• It currently contains around 8 million geographic names
for around 7 million unique places
• The places are classified into 9 broader categories called
feature classes
• They are further divided into 667 classes, most of them
have a natural language description
• The data is available free of charge, can be used under a
creative commons attribution license
INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
DERA Methodology
• A methodology for developing ontology
• It allows building domain specific ontologies
• However, it can be used to construct ontology for any domain
• In DERA, a domain consists of three components namely entity
class, relation and attribute.
D=<E, R,A>
• E is a tuple <C, e>, where C represents set of concepts and e
represents set of entities
INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
DERA Methodology
• Methodology:
• Step 1: Identification of the relevant terms
• Step 2: Analysis
• Step 3: Synthesis
• Step 4: Standardization
• Following the above steps leads to the creation of a set of
facets
INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
Faceted Ontology
• Facet is a hierarchy of homogeneous concepts describing an aspect
of the domain (e.g., space), where each term in the hierarchy denotes
an atomic concept
• A facet is a distinctive property of a set of concepts that can help in
differentiating one group from another
• Faceted ontology is an ontology in which concepts are organized into
facets
• S. R. Ranganathan was the first to introduce the faceted approach
• GeoWordNet is an example of a faceted ontology consists of facets
such as body of water, geological formation and administrative
division
INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
Faceted Ontology
Body of water
•Flowing body of water
•Stream
•Brook
•River
•Still body of water
•Pond
•Lake
Populated place
•City
•Town
•Village
Landform
•Natural depression
•Oceanic depression
•Oceanic valley
•Oceanic trough
•Continental depression
•Trough
•Valley
•Natural elevation
•Oceanic elevation
•Seamount
•Submarine hill
•Continental elevation
•Hill
•Mountain
INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
Semantic Schema
• The usage of a geospatial ontology does not solve all the
problems
• In fact, GeoNames lacks sufficient constraints on the domain
and range of the attributes
• Moreover, it lacks corresponding mechanisms to enforce them
which can guarantee for an adequate quality of the data
• We define a semantic schema as a set of constrains on the
domain and range of
• the attributes (e.g. population)
• the relations (e.g. capital)
INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
Semantic Schema
• Such constraints should prevent the attribute population
to have a negative value
• While it is fine for cities to have such attribute, this should
be prevented for streams
• The schema is semantic-aware because
• the domain of attributes and relations and
• the range of relations are always a class and its more specific
classes taken from the geospatial ontology
• For instance, if we specify that the domain of the attribute
population is populated place
INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
Semantic Schema
• We assume it to apply also to city, town and village (more
specific classes in the ontology)
• The purpose of the schema is expressly to define what is
legal in terms of
• attributes
• relations
• corresponding values
• Enforcing the schema corresponds to verifying the
consistency of the dataset w.r.t. such constraints
INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
Semantic Schema
Attribute
Name
Definition Domain (main
class)
Range
Population the people who inhabit a
territory or
State
Populated Place Long > 0
Altitude elevation above sea level Location but
Undersea
Float in [-423,
8848]
Area the extent of a 2-
dimensional surface
enclosed within a
boundary
Location Float > 0
Capital A seat of government Geo-political
entity
Populated Place
INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
Semantic Schema
• The range of altitude was set by referring to the altitude of
• the Dead Sea (the lowest)
• the Mount Everest (the highest)
INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
Result
• In GeoNames the Dead Sea is represented with negative
altitude set to −405 m.
• Surprisingly, GeoNames contains other 45 locations with same
altitude of the Dead Sea
• two other locations are reported to be even lower than the Dead
Sea (Nahal Amazyahu and `Arvat Sedom)
• The domain of population includes several unexpected
classes such as airport, stream and garden
• We removed population from corresponding entities in the ontology
INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
Result
• We found several entities with elevation set to -9999 that
is used in GeoNames to encode an unknown value
• We removed elevation from corresponding entities in the ontology
• In the range of capital, 3 entities are encoded as cities
(e.g. Jerusalem) while all the others as capitals
• This is not wrong, but at least this is not homogeneous
• We represented them homogeneously
INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
Conclusion
• We have stressed the need for an integrated approach to
effectively support semantic interoperability between
different geospatial applications
• The proposed solution
• Consists in the usage of a geospatial faceted ontology providing
the terminology of the geospatial domain and
• A semantic schema that, by establishing precise constraints on the
domain and range of the attributes and the relations, guarantees a
higher level of data quality
INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
Thank you
Questions?

More Related Content

Viewers also liked

CUbRIK research on social aspects
CUbRIK research on social aspectsCUbRIK research on social aspects
CUbRIK research on social aspectsCUbRIK Project
 
Exploiting User Generated Content for Mountain Peak Detection
Exploiting User Generated Content for Mountain Peak DetectionExploiting User Generated Content for Mountain Peak Detection
Exploiting User Generated Content for Mountain Peak DetectionCUbRIK Project
 
CUbRIK Fashion Trend Analysis: a Business Intelligence Application
CUbRIK Fashion Trend Analysis: a Business Intelligence ApplicationCUbRIK Fashion Trend Analysis: a Business Intelligence Application
CUbRIK Fashion Trend Analysis: a Business Intelligence ApplicationCUbRIK Project
 
Prof. Fraternali about Human computation
Prof. Fraternali about Human computationProf. Fraternali about Human computation
Prof. Fraternali about Human computationCUbRIK Project
 
CUbRIK Tutorial at ICWE 2013: part 2 - Introduction to Games with a Purpose
CUbRIK Tutorial at ICWE 2013: part 2 - Introduction to Games with a PurposeCUbRIK Tutorial at ICWE 2013: part 2 - Introduction to Games with a Purpose
CUbRIK Tutorial at ICWE 2013: part 2 - Introduction to Games with a PurposeCUbRIK Project
 
histoGraph: a case study in Digital Humanities
histoGraph: a case study in Digital HumanitieshistoGraph: a case study in Digital Humanities
histoGraph: a case study in Digital HumanitiesCUbRIK Project
 
CUbRIK Social Graph Visual Interface
CUbRIK Social Graph Visual InterfaceCUbRIK Social Graph Visual Interface
CUbRIK Social Graph Visual InterfaceCUbRIK Project
 
Matching Game Mechanics and Human Computation Tasks in Games with a Purpose
Matching Game Mechanics and Human Computation Tasks in Games with a PurposeMatching Game Mechanics and Human Computation Tasks in Games with a Purpose
Matching Game Mechanics and Human Computation Tasks in Games with a PurposeCUbRIK Project
 
Building a social graph for the history of Europe: the CUbRIK histoGraph
Building a social graph for the history of Europe: the CUbRIK histoGraphBuilding a social graph for the history of Europe: the CUbRIK histoGraph
Building a social graph for the history of Europe: the CUbRIK histoGraphCUbRIK Project
 
CUbRIK tutorial at ICWE 2013: part 1 Introduction to Human Computation
CUbRIK tutorial at ICWE 2013: part 1 Introduction to Human ComputationCUbRIK tutorial at ICWE 2013: part 1 Introduction to Human Computation
CUbRIK tutorial at ICWE 2013: part 1 Introduction to Human ComputationCUbRIK Project
 
Castillejo Saray Cròniques Vampíriques
Castillejo Saray Cròniques VampíriquesCastillejo Saray Cròniques Vampíriques
Castillejo Saray Cròniques VampíriquesGemma Lluch
 
CUbRIK Research at CIKM 2012: Efficient Jaccard-based Diversity Analysis of L...
CUbRIK Research at CIKM 2012: Efficient Jaccard-based Diversity Analysis of L...CUbRIK Research at CIKM 2012: Efficient Jaccard-based Diversity Analysis of L...
CUbRIK Research at CIKM 2012: Efficient Jaccard-based Diversity Analysis of L...CUbRIK Project
 
histoGraph presented to MMSP 2013
histoGraph presented to MMSP 2013histoGraph presented to MMSP 2013
histoGraph presented to MMSP 2013CUbRIK Project
 

Viewers also liked (13)

CUbRIK research on social aspects
CUbRIK research on social aspectsCUbRIK research on social aspects
CUbRIK research on social aspects
 
Exploiting User Generated Content for Mountain Peak Detection
Exploiting User Generated Content for Mountain Peak DetectionExploiting User Generated Content for Mountain Peak Detection
Exploiting User Generated Content for Mountain Peak Detection
 
CUbRIK Fashion Trend Analysis: a Business Intelligence Application
CUbRIK Fashion Trend Analysis: a Business Intelligence ApplicationCUbRIK Fashion Trend Analysis: a Business Intelligence Application
CUbRIK Fashion Trend Analysis: a Business Intelligence Application
 
Prof. Fraternali about Human computation
Prof. Fraternali about Human computationProf. Fraternali about Human computation
Prof. Fraternali about Human computation
 
CUbRIK Tutorial at ICWE 2013: part 2 - Introduction to Games with a Purpose
CUbRIK Tutorial at ICWE 2013: part 2 - Introduction to Games with a PurposeCUbRIK Tutorial at ICWE 2013: part 2 - Introduction to Games with a Purpose
CUbRIK Tutorial at ICWE 2013: part 2 - Introduction to Games with a Purpose
 
histoGraph: a case study in Digital Humanities
histoGraph: a case study in Digital HumanitieshistoGraph: a case study in Digital Humanities
histoGraph: a case study in Digital Humanities
 
CUbRIK Social Graph Visual Interface
CUbRIK Social Graph Visual InterfaceCUbRIK Social Graph Visual Interface
CUbRIK Social Graph Visual Interface
 
Matching Game Mechanics and Human Computation Tasks in Games with a Purpose
Matching Game Mechanics and Human Computation Tasks in Games with a PurposeMatching Game Mechanics and Human Computation Tasks in Games with a Purpose
Matching Game Mechanics and Human Computation Tasks in Games with a Purpose
 
Building a social graph for the history of Europe: the CUbRIK histoGraph
Building a social graph for the history of Europe: the CUbRIK histoGraphBuilding a social graph for the history of Europe: the CUbRIK histoGraph
Building a social graph for the history of Europe: the CUbRIK histoGraph
 
CUbRIK tutorial at ICWE 2013: part 1 Introduction to Human Computation
CUbRIK tutorial at ICWE 2013: part 1 Introduction to Human ComputationCUbRIK tutorial at ICWE 2013: part 1 Introduction to Human Computation
CUbRIK tutorial at ICWE 2013: part 1 Introduction to Human Computation
 
Castillejo Saray Cròniques Vampíriques
Castillejo Saray Cròniques VampíriquesCastillejo Saray Cròniques Vampíriques
Castillejo Saray Cròniques Vampíriques
 
CUbRIK Research at CIKM 2012: Efficient Jaccard-based Diversity Analysis of L...
CUbRIK Research at CIKM 2012: Efficient Jaccard-based Diversity Analysis of L...CUbRIK Research at CIKM 2012: Efficient Jaccard-based Diversity Analysis of L...
CUbRIK Research at CIKM 2012: Efficient Jaccard-based Diversity Analysis of L...
 
histoGraph presented to MMSP 2013
histoGraph presented to MMSP 2013histoGraph presented to MMSP 2013
histoGraph presented to MMSP 2013
 

Similar to Semantic schema for geonames

Graph Mining, Graph Patterns, Social Network, Set & List Valued Attribute, Sp...
Graph Mining, Graph Patterns, Social Network, Set & List Valued Attribute, Sp...Graph Mining, Graph Patterns, Social Network, Set & List Valued Attribute, Sp...
Graph Mining, Graph Patterns, Social Network, Set & List Valued Attribute, Sp...Amar Myana
 
20130905_Feng_Chia_GIS_center_geospatial_ontology
20130905_Feng_Chia_GIS_center_geospatial_ontology20130905_Feng_Chia_GIS_center_geospatial_ontology
20130905_Feng_Chia_GIS_center_geospatial_ontologyDongpo Deng
 
Linked Open Data Approaches within the ARIADNE Project
Linked Open Data Approaches within the ARIADNE ProjectLinked Open Data Approaches within the ARIADNE Project
Linked Open Data Approaches within the ARIADNE Projectariadnenetwork
 
Free the Patterns! The Vital Challenge to the Pattern Community
Free the Patterns! The Vital Challenge to the Pattern CommunityFree the Patterns! The Vital Challenge to the Pattern Community
Free the Patterns! The Vital Challenge to the Pattern CommunityDouglas Schuler
 
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...TimelessFuture
 
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...Lauri Eloranta
 
Forty Years of the OTA
Forty Years of the OTAForty Years of the OTA
Forty Years of the OTAMartin Wynne
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingToine Bogers
 
Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mappingsamhati27
 
Ariadne: Interoperability
Ariadne: InteroperabilityAriadne: Interoperability
Ariadne: Interoperabilityariadnenetwork
 
DMTM 2015 - 04 Data Exploration
DMTM 2015 - 04 Data ExplorationDMTM 2015 - 04 Data Exploration
DMTM 2015 - 04 Data ExplorationPier Luca Lanzi
 
1220 7106026052 7106026051
1220 7106026052 71060260511220 7106026052 7106026051
1220 7106026052 7106026051adhisry
 
What is an archaeological research infrastructure and why do we need it? Aims...
What is an archaeological research infrastructure and why do we need it? Aims...What is an archaeological research infrastructure and why do we need it? Aims...
What is an archaeological research infrastructure and why do we need it? Aims...ariadnenetwork
 
Semantics as a service at EMBL-EBI
Semantics as a service at EMBL-EBISemantics as a service at EMBL-EBI
Semantics as a service at EMBL-EBISimon Jupp
 
Haw GIScience lost its interdisciplinary mojo?
Haw GIScience lost its interdisciplinary mojo?Haw GIScience lost its interdisciplinary mojo?
Haw GIScience lost its interdisciplinary mojo?Muki Haklay
 
Artificial Intelligence: Knowledge Acquisition
Artificial Intelligence: Knowledge AcquisitionArtificial Intelligence: Knowledge Acquisition
Artificial Intelligence: Knowledge AcquisitionThe Integral Worm
 
Exploiting Natural Language Definitions and (Legacy) Data for Facilitating Ag...
Exploiting Natural Language Definitions and (Legacy) Data for Facilitating Ag...Exploiting Natural Language Definitions and (Legacy) Data for Facilitating Ag...
Exploiting Natural Language Definitions and (Legacy) Data for Facilitating Ag...Christophe Debruyne
 
Semantic Archive Integration for Holocaust Research: the EHRI Research Infras...
Semantic Archive Integration for Holocaust Research: the EHRI Research Infras...Semantic Archive Integration for Holocaust Research: the EHRI Research Infras...
Semantic Archive Integration for Holocaust Research: the EHRI Research Infras...Vladimir Alexiev, PhD, PMP
 

Similar to Semantic schema for geonames (20)

Graph Mining, Graph Patterns, Social Network, Set & List Valued Attribute, Sp...
Graph Mining, Graph Patterns, Social Network, Set & List Valued Attribute, Sp...Graph Mining, Graph Patterns, Social Network, Set & List Valued Attribute, Sp...
Graph Mining, Graph Patterns, Social Network, Set & List Valued Attribute, Sp...
 
20130905_Feng_Chia_GIS_center_geospatial_ontology
20130905_Feng_Chia_GIS_center_geospatial_ontology20130905_Feng_Chia_GIS_center_geospatial_ontology
20130905_Feng_Chia_GIS_center_geospatial_ontology
 
Linked Open Data Approaches within the ARIADNE Project
Linked Open Data Approaches within the ARIADNE ProjectLinked Open Data Approaches within the ARIADNE Project
Linked Open Data Approaches within the ARIADNE Project
 
Free the Patterns! The Vital Challenge to the Pattern Community
Free the Patterns! The Vital Challenge to the Pattern CommunityFree the Patterns! The Vital Challenge to the Pattern Community
Free the Patterns! The Vital Challenge to the Pattern Community
 
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
Supporting the Interpretation of Enriched Audiovisual Sources through Tempora...
 
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
 
Forty Years of the OTA
Forty Years of the OTAForty Years of the OTA
Forty Years of the OTA
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mapping
 
Ariadne: Interoperability
Ariadne: InteroperabilityAriadne: Interoperability
Ariadne: Interoperability
 
DMTM 2015 - 04 Data Exploration
DMTM 2015 - 04 Data ExplorationDMTM 2015 - 04 Data Exploration
DMTM 2015 - 04 Data Exploration
 
1220 7106026052 7106026051
1220 7106026052 71060260511220 7106026052 7106026051
1220 7106026052 7106026051
 
The NeDiMAH workgroup on Space and Time
The NeDiMAH workgroup on Space and TimeThe NeDiMAH workgroup on Space and Time
The NeDiMAH workgroup on Space and Time
 
What is an archaeological research infrastructure and why do we need it? Aims...
What is an archaeological research infrastructure and why do we need it? Aims...What is an archaeological research infrastructure and why do we need it? Aims...
What is an archaeological research infrastructure and why do we need it? Aims...
 
Semantics as a service at EMBL-EBI
Semantics as a service at EMBL-EBISemantics as a service at EMBL-EBI
Semantics as a service at EMBL-EBI
 
Haw GIScience lost its interdisciplinary mojo?
Haw GIScience lost its interdisciplinary mojo?Haw GIScience lost its interdisciplinary mojo?
Haw GIScience lost its interdisciplinary mojo?
 
Artificial Intelligence: Knowledge Acquisition
Artificial Intelligence: Knowledge AcquisitionArtificial Intelligence: Knowledge Acquisition
Artificial Intelligence: Knowledge Acquisition
 
12952999
1295299912952999
12952999
 
Exploiting Natural Language Definitions and (Legacy) Data for Facilitating Ag...
Exploiting Natural Language Definitions and (Legacy) Data for Facilitating Ag...Exploiting Natural Language Definitions and (Legacy) Data for Facilitating Ag...
Exploiting Natural Language Definitions and (Legacy) Data for Facilitating Ag...
 
Semantic Archive Integration for Holocaust Research: the EHRI Research Infras...
Semantic Archive Integration for Holocaust Research: the EHRI Research Infras...Semantic Archive Integration for Holocaust Research: the EHRI Research Infras...
Semantic Archive Integration for Holocaust Research: the EHRI Research Infras...
 

More from CUbRIK Project

The CUbRIK histoGraph Factsheet
The CUbRIK histoGraph FactsheetThe CUbRIK histoGraph Factsheet
The CUbRIK histoGraph FactsheetCUbRIK Project
 
Mining Emotions in Short Films: User Comments or Crowdsourcing?
Mining Emotions in Short Films: User Comments or Crowdsourcing?Mining Emotions in Short Films: User Comments or Crowdsourcing?
Mining Emotions in Short Films: User Comments or Crowdsourcing?CUbRIK Project
 
CUbRIK: Open Box. Multimedia and Human Computation approach
CUbRIK: Open Box. Multimedia and Human Computation approachCUbRIK: Open Box. Multimedia and Human Computation approach
CUbRIK: Open Box. Multimedia and Human Computation approachCUbRIK Project
 
ICT 2013: Better Society: empowering Horizon 2020 with trustable social media
ICT 2013: Better Society: empowering Horizon 2020 with trustable social mediaICT 2013: Better Society: empowering Horizon 2020 with trustable social media
ICT 2013: Better Society: empowering Horizon 2020 with trustable social mediaCUbRIK Project
 
How Do We Deep-Link? Leveraging User-Contributed Time-Links for Non-Linear Vi...
How Do We Deep-Link? Leveraging User-Contributed Time-Links for Non-Linear Vi...How Do We Deep-Link? Leveraging User-Contributed Time-Links for Non-Linear Vi...
How Do We Deep-Link? Leveraging User-Contributed Time-Links for Non-Linear Vi...CUbRIK Project
 
CUbRIK research presented at SSMS 2012
CUbRIK research presented at SSMS 2012CUbRIK research presented at SSMS 2012
CUbRIK research presented at SSMS 2012CUbRIK Project
 
CUbRIK research at CIKM 2012: Pic Alert
CUbRIK research at CIKM 2012: Pic AlertCUbRIK research at CIKM 2012: Pic Alert
CUbRIK research at CIKM 2012: Pic AlertCUbRIK Project
 
CUbRIK research at CIKM 2012: Map to Humans and Reduce Error
CUbRIK research at CIKM 2012: Map to Humans and Reduce ErrorCUbRIK research at CIKM 2012: Map to Humans and Reduce Error
CUbRIK research at CIKM 2012: Map to Humans and Reduce ErrorCUbRIK Project
 
CUbRIK research at RecSys 2012
CUbRIK research at RecSys 2012CUbRIK research at RecSys 2012
CUbRIK research at RecSys 2012CUbRIK Project
 
CUbRIK research at SIGMOD 2012
CUbRIK research at SIGMOD 2012CUbRIK research at SIGMOD 2012
CUbRIK research at SIGMOD 2012CUbRIK Project
 
CUbRIK research at CAPS Infoday in PIsa
CUbRIK research at CAPS Infoday in PIsaCUbRIK research at CAPS Infoday in PIsa
CUbRIK research at CAPS Infoday in PIsaCUbRIK Project
 
ACMMM12 - LikeLines: Collecting Timecode-level Feedback for Web Videos throug...
ACMMM12 - LikeLines: Collecting Timecode-level Feedback for Web Videos throug...ACMMM12 - LikeLines: Collecting Timecode-level Feedback for Web Videos throug...
ACMMM12 - LikeLines: Collecting Timecode-level Feedback for Web Videos throug...CUbRIK Project
 

More from CUbRIK Project (12)

The CUbRIK histoGraph Factsheet
The CUbRIK histoGraph FactsheetThe CUbRIK histoGraph Factsheet
The CUbRIK histoGraph Factsheet
 
Mining Emotions in Short Films: User Comments or Crowdsourcing?
Mining Emotions in Short Films: User Comments or Crowdsourcing?Mining Emotions in Short Films: User Comments or Crowdsourcing?
Mining Emotions in Short Films: User Comments or Crowdsourcing?
 
CUbRIK: Open Box. Multimedia and Human Computation approach
CUbRIK: Open Box. Multimedia and Human Computation approachCUbRIK: Open Box. Multimedia and Human Computation approach
CUbRIK: Open Box. Multimedia and Human Computation approach
 
ICT 2013: Better Society: empowering Horizon 2020 with trustable social media
ICT 2013: Better Society: empowering Horizon 2020 with trustable social mediaICT 2013: Better Society: empowering Horizon 2020 with trustable social media
ICT 2013: Better Society: empowering Horizon 2020 with trustable social media
 
How Do We Deep-Link? Leveraging User-Contributed Time-Links for Non-Linear Vi...
How Do We Deep-Link? Leveraging User-Contributed Time-Links for Non-Linear Vi...How Do We Deep-Link? Leveraging User-Contributed Time-Links for Non-Linear Vi...
How Do We Deep-Link? Leveraging User-Contributed Time-Links for Non-Linear Vi...
 
CUbRIK research presented at SSMS 2012
CUbRIK research presented at SSMS 2012CUbRIK research presented at SSMS 2012
CUbRIK research presented at SSMS 2012
 
CUbRIK research at CIKM 2012: Pic Alert
CUbRIK research at CIKM 2012: Pic AlertCUbRIK research at CIKM 2012: Pic Alert
CUbRIK research at CIKM 2012: Pic Alert
 
CUbRIK research at CIKM 2012: Map to Humans and Reduce Error
CUbRIK research at CIKM 2012: Map to Humans and Reduce ErrorCUbRIK research at CIKM 2012: Map to Humans and Reduce Error
CUbRIK research at CIKM 2012: Map to Humans and Reduce Error
 
CUbRIK research at RecSys 2012
CUbRIK research at RecSys 2012CUbRIK research at RecSys 2012
CUbRIK research at RecSys 2012
 
CUbRIK research at SIGMOD 2012
CUbRIK research at SIGMOD 2012CUbRIK research at SIGMOD 2012
CUbRIK research at SIGMOD 2012
 
CUbRIK research at CAPS Infoday in PIsa
CUbRIK research at CAPS Infoday in PIsaCUbRIK research at CAPS Infoday in PIsa
CUbRIK research at CAPS Infoday in PIsa
 
ACMMM12 - LikeLines: Collecting Timecode-level Feedback for Web Videos throug...
ACMMM12 - LikeLines: Collecting Timecode-level Feedback for Web Videos throug...ACMMM12 - LikeLines: Collecting Timecode-level Feedback for Web Videos throug...
ACMMM12 - LikeLines: Collecting Timecode-level Feedback for Web Videos throug...
 

Recently uploaded

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetEnjoy Anytime
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 

Semantic schema for geonames

  • 1. A SEMANTIC SCHEMA FOR GEONAMES Enzo Maltese Feroz Farazi
  • 2. Index • Introduction • GeoNames • DERA Methodology • Faceted Ontology • Semantic Schema • Result • Conclusion INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
  • 3. Introduction • With the proliferation of the Web, dataset publishing is also getting a momentum • With Linked Open Data (LOD) initiative • Old datasets are published in machine readable format • Many new datasets are made available on the Web • As part of LOD initiative essentially we observe that datasets are there with increased • Machine readability • Availability • Ease of finding through links INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
  • 4. Introduction • However, in the LOD initiative more emphasis is given on the quantity of the data than its quality • In this work we proposed a semantic schema that can help improve the quality while enforcing it on the data • Our approach is domain independent • We performed a detailed case study using GeoNames INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
  • 5. GeoNames • GeoNames is a geo-spatial database consists of various locations of all countries • It includes geographical data • Place names in various languages • Latitude • Longitude • Altitude • Class • Latitude and longitude coordinates are stored according to the WGS84 INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
  • 6. GeoNames • It currently contains around 8 million geographic names for around 7 million unique places • The places are classified into 9 broader categories called feature classes • They are further divided into 667 classes, most of them have a natural language description • The data is available free of charge, can be used under a creative commons attribution license INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
  • 7. DERA Methodology • A methodology for developing ontology • It allows building domain specific ontologies • However, it can be used to construct ontology for any domain • In DERA, a domain consists of three components namely entity class, relation and attribute. D=<E, R,A> • E is a tuple <C, e>, where C represents set of concepts and e represents set of entities INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
  • 8. DERA Methodology • Methodology: • Step 1: Identification of the relevant terms • Step 2: Analysis • Step 3: Synthesis • Step 4: Standardization • Following the above steps leads to the creation of a set of facets INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
  • 9. Faceted Ontology • Facet is a hierarchy of homogeneous concepts describing an aspect of the domain (e.g., space), where each term in the hierarchy denotes an atomic concept • A facet is a distinctive property of a set of concepts that can help in differentiating one group from another • Faceted ontology is an ontology in which concepts are organized into facets • S. R. Ranganathan was the first to introduce the faceted approach • GeoWordNet is an example of a faceted ontology consists of facets such as body of water, geological formation and administrative division INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
  • 10. Faceted Ontology Body of water •Flowing body of water •Stream •Brook •River •Still body of water •Pond •Lake Populated place •City •Town •Village Landform •Natural depression •Oceanic depression •Oceanic valley •Oceanic trough •Continental depression •Trough •Valley •Natural elevation •Oceanic elevation •Seamount •Submarine hill •Continental elevation •Hill •Mountain INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
  • 11. Semantic Schema • The usage of a geospatial ontology does not solve all the problems • In fact, GeoNames lacks sufficient constraints on the domain and range of the attributes • Moreover, it lacks corresponding mechanisms to enforce them which can guarantee for an adequate quality of the data • We define a semantic schema as a set of constrains on the domain and range of • the attributes (e.g. population) • the relations (e.g. capital) INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
  • 12. Semantic Schema • Such constraints should prevent the attribute population to have a negative value • While it is fine for cities to have such attribute, this should be prevented for streams • The schema is semantic-aware because • the domain of attributes and relations and • the range of relations are always a class and its more specific classes taken from the geospatial ontology • For instance, if we specify that the domain of the attribute population is populated place INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
  • 13. Semantic Schema • We assume it to apply also to city, town and village (more specific classes in the ontology) • The purpose of the schema is expressly to define what is legal in terms of • attributes • relations • corresponding values • Enforcing the schema corresponds to verifying the consistency of the dataset w.r.t. such constraints INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
  • 14. Semantic Schema Attribute Name Definition Domain (main class) Range Population the people who inhabit a territory or State Populated Place Long > 0 Altitude elevation above sea level Location but Undersea Float in [-423, 8848] Area the extent of a 2- dimensional surface enclosed within a boundary Location Float > 0 Capital A seat of government Geo-political entity Populated Place INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
  • 15. Semantic Schema • The range of altitude was set by referring to the altitude of • the Dead Sea (the lowest) • the Mount Everest (the highest) INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
  • 16. Result • In GeoNames the Dead Sea is represented with negative altitude set to −405 m. • Surprisingly, GeoNames contains other 45 locations with same altitude of the Dead Sea • two other locations are reported to be even lower than the Dead Sea (Nahal Amazyahu and `Arvat Sedom) • The domain of population includes several unexpected classes such as airport, stream and garden • We removed population from corresponding entities in the ontology INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
  • 17. Result • We found several entities with elevation set to -9999 that is used in GeoNames to encode an unknown value • We removed elevation from corresponding entities in the ontology • In the range of capital, 3 entities are encoded as cities (e.g. Jerusalem) while all the others as capitals • This is not wrong, but at least this is not homogeneous • We represented them homogeneously INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento
  • 18. Conclusion • We have stressed the need for an integrated approach to effectively support semantic interoperability between different geospatial applications • The proposed solution • Consists in the usage of a geospatial faceted ontology providing the terminology of the geospatial domain and • A semantic schema that, by establishing precise constraints on the domain and range of the attributes and the relations, guarantees a higher level of data quality INSPIRE Conference – June 25th 2013 Feroz Farazi – University of Trento