Supporting Geo-Ontology Engineering through Spatial Data Analytics

Irene Celino
Irene CelinoResearch Manager in the area of Human Computation and Semantic Interoperability
Supporting Geo-Ontology Engineering
through Spatial Data Analytics
Gloria Re Calegari, Emanuela Carlino, Irene Celino and
Diego Peroni
CEFRIEL – Milano, Italy
Introduction
Notion of place as a “situated concept”
Dependent on physical, natural, social, cultural and cognitive processes
Entities within a specific spatio-temporal context
Geospatial ontologies
General-purpose vs. location-specific
Top-down vs. bottom-up
(Re)engineering: evolution, repair and specialization
Can spatial data analytics help in geo-ontology engineering? (data=instances)
Objectives
O1: Re-engineering Spatial Features
Identifying concepts that play a different role in different cities, thus
possibly indicating different cultural or pragmatic meanings
O2: Specifying Spatial Neighborhoods
Highlighting new potential concepts to characterize the urban
neighborhoods of different cities
Experimental settings
Spatial objects from OpenStreetMap/LinkedGeoData
Described by the LinkedGeoData ontology  general-purpose geo-ontology
Several spatial features considered (amenities, shops, etc.)
Data from two cities: Milano and London
Similarities as well as differences
Assumptions about the coverage and completeness of the data
Volunteered Geographic Information (VGI), reflecting the way in which the
environment is experienced
Objective 1:
Re-Engineering Spatial Features
Analysis of the pattern distribution of each spatial feature
If spatial objects “instances” of a spatial feature are condensed in a specific area, that
feature is a relevant element to characterize that area
Aim to verify whether the same concept plays a different role in different places
Two-step analysis:
1) Analysing Distribution of Spatial Objects
2) Clustering Spatial Objects
Objective1:Re-Engineering SpatialFeatures
Analysing Spatial Objects' Distribution (1/4)
Evaluate the tendency of each spatial feature to show spatial patterns and to create
spatial aggregations
Morishita index: statistical measure of dispersion based on quadrats count
1 random point patterns
>>1 spatial aggregation may exist
Moran index: measure of spatial auto-correlation
–1 (perfect dispersion)
+1 (perfect correlation)
0 (random spatial pattern)
Objective1:Re-Engineering SpatialFeatures
Analysing Spatial Objects' Distribution (2/4)
Common spatial
features like
restaurants show
almost the same
spatial pattern in
the two cities
The analysis shows
similar trends in
the Morishita plot
and similar Moran
index values
Objective1:Re-Engineering SpatialFeatures
Analysing Spatial Objects' Distribution (3/4)
Pubs are everywhere in
London, while they
aggregate much more in
Milano
This reveals an actual
“semantic” difference in
the meaning of this
spatial feature
Objective1:Re-Engineering SpatialFeatures
Analysing Spatial Objects' Distribution (4/4)
Theatres are spread
everywhere in Milano,
while they strongly
aggregate in London
In this case, the meaning
of the spatial feature is
the same but the
analysis reveals the
“semantics” of the
cultural district
Objective1:Re-Engineering SpatialFeatures
Clustering Spatial Objects (1/2)
Density based clustering to find agglomeration of
spatial objects in a specific region
OPTICS algorithm (hierarchical DBSCAN
extension) – for each category of spatial object
Latitude-longitude coordinates as input
At least 5 points in a radius of 150 meters
lgdo:Restaurant in Milano
LONDON
• 56 categories clustered at least in
one region
• lgdo:Hotel 40% points clustered
(innermost boroughs)
• lgdo:Telephone clusters in the
more touristic area
• lgdo:Pub widespread around the
city, only 7% clustered
MILANO
• 25 categories clustered at least in
one region
• lgdo:Hotel 14% points clustered
(central and peripheral areas)
• lgdo:Telephone no clusters
• lgdo:Pub concentrated in nightlife
areas only, 20% clustered
Emerging differences between the two cities
Objective1:Re-Engineering SpatialFeatures
Clustering Spatial Objects (2/2)
Objective1:Re-EngineeringSpatialFeatures
What do clusters and feature distributions
suggest to ontology engineers?
Evaluation of the validity/applicability of a geo-ontology in different locations
Hints:
Spatial feature with the same behaviour in different locations  ontology re-
engineering is not required
Spatial feature with different behaviour in different locations  ontology re-
engineering may be required with location-specific extensions to the geo-ontology
Note: a pure numerical analysis of the spatial objects is meaningful (lgdo:Convenience
are ‘’outliers’’ in Milano, while in London are numerous)
Objective 2:
Specifying Spatial Neighborhoods
Which are the spatial features that occur together?
Is it possible to semantically characterize a region according to its semantic features
co-occurrence?
3-step analysis:
1) Identifying Neighborhoods
2) Characterizing Neighborhoods
3) Semantically Querying Neighborhoods
Objective2:SpecifyingSpatialNeighborhoods
Identifying Neighborhoods (1/3)
For each cluster obtained with OPTICS,
definition of the convex hull polygon
containing all the points belonging to
that cluster
For each polygon, computation of its
centroid, which represent the
categories' hotspot locations (urban
areas with a high number of amenity of
a given type)
Objective2:SpecifyingSpatialNeighborhoods
Identifying Neighborhoods (2/3)
Clustering of these centroids to
divide the urban space in
neighborhoods, according to the
simultaneous presence of spatial
features’ hotspots
Agglomerative clustering technique (with
Ward minimum variance method)
Objective2:SpecifyingSpatialNeighborhoods
Identifying Neighborhoods (3/3)
Clusters do not necessarily
correspond to actual
administrative divisions (cf.
black boundaries)
Clusters reflect the emerging
“semantic” neighborhoods, i.e.
regions of the city
characterized by co-
occurrence of hotspot
locations of different spatial
features
Objective2:SpecifyingSpatialNeighborhoods
Characterizing Neighborhoods (1/3)
Which spatial features best characterize each city region?
Definition of a spatial variant of tf-idf score
n : set of clustered points in the neighborhood
f : set of all spatial points with that feature
|N| : total number of neighborhoods (24 in the experiments, 12 neighborhoods for each city)
: number of neighborhood in which the spatial feature is represented by at least a point
Spatial Object Frequency Inverse Neighborhood Frequency
Objective2:SpecifyingSpatialNeighborhoods
Characterizing Neighborhoods (2/3)
Top-3 Spatial Object Frequency-Inverse Neighborhood Frequency in each district
Comparison between London and Milano
sof-inf scores via cosine similarity
‘’Hampstead’’ and ‘’Milano East“:
suburban areas with schools, parking lots
and offices
‘’Camden town’’ and ‘’Chelsea’’: peculiar
spatial features not present in Milano
‘’Navigli’’: there is not a corresponding
area. In London pubs are widespread
around the city
Milano
London
Objective2:SpecifyingSpatialNeighborhoods
Characterizing Neighborhoods (3/3)
Objective2:SpecifyingSpatialNeighborhoods
Semantically Querying Neighborhoods (1/2)
Definition of neighborhood concepts to express the co-occurrence of different spatial
features in the same area
Specific “spatio-semantic queries” by combining spatial features:
Shopping: Clothes, Shoes, Chemist and BeautyShop
Nightlife: Restaurant, Pub, Bar and Theatre
Residential: School, Atm, Butcher, Greengrocer, Convenience and Parking
Culture: Theatre, ArtShop, BookShop, MusicalInstruments and College
Alternative: Erotic, Tattoo, Charity, CommunityCentre and ArtShop
London
Milano
Cosine similarity between the
sof-inf vectors of the spatio-
semantic query and each
neighborhood
Objective2:SpecifyingSpatialNeighborhoods
Semantically Querying Neighborhoods (2/2)
Objective2:SpecifyingSpatialNeighborhoods
What does Spatial Neighborhoods analysis
suggest to ontology engineers?
New set of concepts to represent the ‘’semantics’’ of urban districts
The sof-inf score is a useful tool to select the most prominent spatial features that
characterize a neighborhood  it helps in summarising spatial regions by their
distinctive categories
The spatio-semantic query approach supports the ontological specification of
neighborhood concepts:
To verify the actual “instantiation” of those concept
To test different hypotheses with different feature compositions
To select the most suitable level of abstraction for the geo-ontology
Conclusions
Ontology engineering is still largely an art that requires a deep domain knowledge
In the case of geo-ontologies, spatial data analytics can provide ontology engineers
with additional hints and suggestions
Location-specific differences between cities
Local characterization via a spatio-semantic queries
Spatial data analytics can help to
Verify existing hypotheses  highlight already-known characteristics
Discover unknown specificities  in this case, further exploration is needed
Future Work
Current main limitation: spatial analytics brings only supporting insights, it is not a
fully-automated technique for ontology (re)engineering
User study with ontology engineers
More tightly integration with ontology engineering process/tools
Weighted sof-inf score, by adding spatial features’ weights to the spatio-semantic
queries to specify complex neighborhood concepts
Extended experiments to further prove generality applicability:
Larger set of more heterogeneous cities
Different sets of spatial features, also within non-urban contexts
Supporting Geo-Ontology Engineering
through Spatial Data Analytics
Gloria Re Calegari, Emanuela Carlino, Irene Celino andDiego Peroni
Thank you!
Additional material can be found at http://swa.cefriel.it/geo/eswc2016.html
1 of 25

Recommended

C+P-and-partners_Working process by
C+P-and-partners_Working processC+P-and-partners_Working process
C+P-and-partners_Working processQusi Alqarqaz
246 views16 slides
BlueBRIDGE: Cloud infrastructure serving aquafarms and supporting models by
BlueBRIDGE: Cloud infrastructure serving aquafarms and supporting modelsBlueBRIDGE: Cloud infrastructure serving aquafarms and supporting models
BlueBRIDGE: Cloud infrastructure serving aquafarms and supporting modelsBlue BRIDGE
667 views72 slides
Optimizing location-based apps with open data by
Optimizing location-based apps with open dataOptimizing location-based apps with open data
Optimizing location-based apps with open dataRaj Singh
356 views21 slides
Automation of Glass fiber Deployments in The Netherlands by
Automation of Glass fiber Deployments in The NetherlandsAutomation of Glass fiber Deployments in The Netherlands
Automation of Glass fiber Deployments in The NetherlandsKiran Solipuram. DEP, CFHP
1.2K views15 slides
Geo-analytics Architecture - Technologies by
Geo-analytics Architecture - TechnologiesGeo-analytics Architecture - Technologies
Geo-analytics Architecture - TechnologiesBlue BRIDGE
767 views10 slides
From Spark to Ignition: Fueling Your Business on Real-Time Analytics by
From Spark to Ignition: Fueling Your Business on Real-Time AnalyticsFrom Spark to Ignition: Fueling Your Business on Real-Time Analytics
From Spark to Ignition: Fueling Your Business on Real-Time AnalyticsSingleStore
2.8K views27 slides

More Related Content

Similar to Supporting Geo-Ontology Engineering through Spatial Data Analytics

Producir conocimiento espacial - Workshop WP4 by
Producir conocimiento espacial - Workshop WP4Producir conocimiento espacial - Workshop WP4
Producir conocimiento espacial - Workshop WP4Instituto Universitario de Urbanística
315 views57 slides
Iccsa2008cutinibreve by
Iccsa2008cutinibreveIccsa2008cutinibreve
Iccsa2008cutinibreveBeniamino Murgante
481 views36 slides
Data Consistency in OpenStreetMap by
Data Consistency in OpenStreetMapData Consistency in OpenStreetMap
Data Consistency in OpenStreetMapAlfonso Crisci
1K views35 slides
A4 - Woloszyn et Leduc. Caractérisation du potentiel paysager par le dimensio... by
A4 - Woloszyn et Leduc. Caractérisation du potentiel paysager par le dimensio...A4 - Woloszyn et Leduc. Caractérisation du potentiel paysager par le dimensio...
A4 - Woloszyn et Leduc. Caractérisation du potentiel paysager par le dimensio...IT-Gatineau2011
534 views16 slides
Producir conocimiento espacial - Sesión teórica 1 by
Producir conocimiento espacial - Sesión teórica 1Producir conocimiento espacial - Sesión teórica 1
Producir conocimiento espacial - Sesión teórica 1Instituto Universitario de Urbanística
265 views49 slides
Big Data Analytics for Smart Cities by
Big Data Analytics for Smart CitiesBig Data Analytics for Smart Cities
Big Data Analytics for Smart CitiesUniversity of Salerno
58 views12 slides

Similar to Supporting Geo-Ontology Engineering through Spatial Data Analytics(20)

Data Consistency in OpenStreetMap by Alfonso Crisci
Data Consistency in OpenStreetMapData Consistency in OpenStreetMap
Data Consistency in OpenStreetMap
Alfonso Crisci1K views
A4 - Woloszyn et Leduc. Caractérisation du potentiel paysager par le dimensio... by IT-Gatineau2011
A4 - Woloszyn et Leduc. Caractérisation du potentiel paysager par le dimensio...A4 - Woloszyn et Leduc. Caractérisation du potentiel paysager par le dimensio...
A4 - Woloszyn et Leduc. Caractérisation du potentiel paysager par le dimensio...
IT-Gatineau2011534 views
MSLab - A paradigmatic model to shape the metropolitan growth by MSLab Polimi
MSLab - A paradigmatic model to shape the metropolitan growthMSLab - A paradigmatic model to shape the metropolitan growth
MSLab - A paradigmatic model to shape the metropolitan growth
MSLab Polimi550 views
Hierarchical clustering through spatial interaction data. The case of commuti... by Beniamino Murgante
Hierarchical clustering through spatial interaction data. The case of commuti...Hierarchical clustering through spatial interaction data. The case of commuti...
Hierarchical clustering through spatial interaction data. The case of commuti...
Beniamino Murgante775 views
Extented abstract Quentin Silva by Quentin Silva
Extented abstract Quentin SilvaExtented abstract Quentin Silva
Extented abstract Quentin Silva
Quentin Silva242 views
An assessment-based process for modifying the built fabric of historic centre... by Beniamino Murgante
An assessment-based process for modifying the built fabric of historic centre...An assessment-based process for modifying the built fabric of historic centre...
An assessment-based process for modifying the built fabric of historic centre...
Beniamino Murgante609 views
Extending the knowledge level of cognitive architectures with Conceptual Spac... by Antonio Lieto
Extending the knowledge level of cognitive architectures with Conceptual Spac...Extending the knowledge level of cognitive architectures with Conceptual Spac...
Extending the knowledge level of cognitive architectures with Conceptual Spac...
Antonio Lieto1.4K views
Francesca Froy "What is the role of spatial configuration and urban morpholog... by HannahParr3
Francesca Froy "What is the role of spatial configuration and urban morpholog...Francesca Froy "What is the role of spatial configuration and urban morpholog...
Francesca Froy "What is the role of spatial configuration and urban morpholog...
HannahParr396 views
1 s2.0-s0198971512001093-main by LENIN Quintero
1 s2.0-s0198971512001093-main1 s2.0-s0198971512001093-main
1 s2.0-s0198971512001093-main
LENIN Quintero335 views
Geographical Map Annotation With Social Metadata In a Surveillance Environment by Elena Roglia
Geographical Map Annotation With Social Metadata In a Surveillance EnvironmentGeographical Map Annotation With Social Metadata In a Surveillance Environment
Geographical Map Annotation With Social Metadata In a Surveillance Environment
Elena Roglia358 views
Coates: topological approximations for spatial representation by ArchiLab 7
Coates: topological approximations for spatial representationCoates: topological approximations for spatial representation
Coates: topological approximations for spatial representation
ArchiLab 7375 views

More from Irene Celino

Knowledge Technologies group at Cefriel by
Knowledge Technologies group at CefrielKnowledge Technologies group at Cefriel
Knowledge Technologies group at CefrielIrene Celino
153 views25 slides
Human-in-the-loop @ ISWS 2019 by
Human-in-the-loop @ ISWS 2019Human-in-the-loop @ ISWS 2019
Human-in-the-loop @ ISWS 2019Irene Celino
886 views29 slides
Human computation @ Data Semantics by
Human computation @ Data SemanticsHuman computation @ Data Semantics
Human computation @ Data SemanticsIrene Celino
194 views30 slides
Interplay of Game Incentives, Player Profiles and Task Difficulty in Games with ... by
Interplay of Game Incentives, Player Profiles and Task Difficulty in Games with ...Interplay of Game Incentives, Player Profiles and Task Difficulty in Games with ...
Interplay of Game Incentives, Player Profiles and Task Difficulty in Games with ...Irene Celino
381 views13 slides
A Framework to build Games with a Purpose for Linked Data Refinement by
A Framework to build Games with a Purpose  for Linked Data RefinementA Framework to build Games with a Purpose  for Linked Data Refinement
A Framework to build Games with a Purpose for Linked Data RefinementIrene Celino
372 views14 slides
Involving people in Citizen Science through game incentives: the case of the ... by
Involving people in Citizen Science through game incentives: the case of the ...Involving people in Citizen Science through game incentives: the case of the ...
Involving people in Citizen Science through game incentives: the case of the ...Irene Celino
343 views12 slides

More from Irene Celino(20)

Knowledge Technologies group at Cefriel by Irene Celino
Knowledge Technologies group at CefrielKnowledge Technologies group at Cefriel
Knowledge Technologies group at Cefriel
Irene Celino153 views
Human-in-the-loop @ ISWS 2019 by Irene Celino
Human-in-the-loop @ ISWS 2019Human-in-the-loop @ ISWS 2019
Human-in-the-loop @ ISWS 2019
Irene Celino886 views
Human computation @ Data Semantics by Irene Celino
Human computation @ Data SemanticsHuman computation @ Data Semantics
Human computation @ Data Semantics
Irene Celino194 views
Interplay of Game Incentives, Player Profiles and Task Difficulty in Games with ... by Irene Celino
Interplay of Game Incentives, Player Profiles and Task Difficulty in Games with ...Interplay of Game Incentives, Player Profiles and Task Difficulty in Games with ...
Interplay of Game Incentives, Player Profiles and Task Difficulty in Games with ...
Irene Celino381 views
A Framework to build Games with a Purpose for Linked Data Refinement by Irene Celino
A Framework to build Games with a Purpose  for Linked Data RefinementA Framework to build Games with a Purpose  for Linked Data Refinement
A Framework to build Games with a Purpose for Linked Data Refinement
Irene Celino372 views
Involving people in Citizen Science through game incentives: the case of the ... by Irene Celino
Involving people in Citizen Science through game incentives: the case of the ...Involving people in Citizen Science through game incentives: the case of the ...
Involving people in Citizen Science through game incentives: the case of the ...
Irene Celino343 views
Ninja Riders: sensibilizzare i giovani a una mobilità più sicura attraverso i... by Irene Celino
Ninja Riders: sensibilizzare i giovani a una mobilità più sicura attraverso i...Ninja Riders: sensibilizzare i giovani a una mobilità più sicura attraverso i...
Ninja Riders: sensibilizzare i giovani a una mobilità più sicura attraverso i...
Irene Celino831 views
Human Computation for VGI Management by Irene Celino
Human Computation for VGI ManagementHuman Computation for VGI Management
Human Computation for VGI Management
Irene Celino1.5K views
Ninja Riders - Youth and Road Safety: Discovering Urban Mobility Behaviours by Irene Celino
Ninja Riders - Youth and Road Safety: Discovering Urban Mobility BehavioursNinja Riders - Youth and Road Safety: Discovering Urban Mobility Behaviours
Ninja Riders - Youth and Road Safety: Discovering Urban Mobility Behaviours
Irene Celino481 views
BotDCAT-AP: An Extension of the DCAT Application Profile for Describing Datas... by Irene Celino
BotDCAT-AP: An Extension of the DCAT Application Profile for Describing Datas...BotDCAT-AP: An Extension of the DCAT Application Profile for Describing Datas...
BotDCAT-AP: An Extension of the DCAT Application Profile for Describing Datas...
Irene Celino500 views
Give and Take in Citizen Science by Irene Celino
Give and Take in Citizen ScienceGive and Take in Citizen Science
Give and Take in Citizen Science
Irene Celino190 views
Ninja Riders @ Human Factory Day 2017 by Irene Celino
Ninja Riders @ Human Factory Day 2017Ninja Riders @ Human Factory Day 2017
Ninja Riders @ Human Factory Day 2017
Irene Celino203 views
Night Knights: exploiting games to engage people in a citizen science campaign by Irene Celino
Night Knights: exploiting games to engage people in a citizen science campaignNight Knights: exploiting games to engage people in a citizen science campaign
Night Knights: exploiting games to engage people in a citizen science campaign
Irene Celino437 views
STARS4ALL-CAPSSI-Workshop by Irene Celino
STARS4ALL-CAPSSI-WorkshopSTARS4ALL-CAPSSI-Workshop
STARS4ALL-CAPSSI-Workshop
Irene Celino434 views
Towards Talkin'Piazza: Engaging Citizens through Playful Interaction with Urb... by Irene Celino
Towards Talkin'Piazza: Engaging Citizens through Playful Interaction with Urb...Towards Talkin'Piazza: Engaging Citizens through Playful Interaction with Urb...
Towards Talkin'Piazza: Engaging Citizens through Playful Interaction with Urb...
Irene Celino585 views
SSSW 2016 Cognition Tutorial by Irene Celino
SSSW 2016 Cognition TutorialSSSW 2016 Cognition Tutorial
SSSW 2016 Cognition Tutorial
Irene Celino1.1K views
Analysis of a Cultural Heritage Game with a Purpose with an Educational Incen... by Irene Celino
Analysis of a Cultural Heritage Game with a Purpose with an Educational Incen...Analysis of a Cultural Heritage Game with a Purpose with an Educational Incen...
Analysis of a Cultural Heritage Game with a Purpose with an Educational Incen...
Irene Celino884 views
Smart City Semantics - Data Analytics and Human Computation to understand the... by Irene Celino
Smart City Semantics - Data Analytics and Human Computation to understand the...Smart City Semantics - Data Analytics and Human Computation to understand the...
Smart City Semantics - Data Analytics and Human Computation to understand the...
Irene Celino1.3K views
Towards a Semantic City Service Ecosystem by Irene Celino
Towards a Semantic City Service EcosystemTowards a Semantic City Service Ecosystem
Towards a Semantic City Service Ecosystem
Irene Celino1.1K views

Recently uploaded

Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P... by
Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P...Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P...
Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P...ShapeBlue
154 views62 slides
State of the Union - Rohit Yadav - Apache CloudStack by
State of the Union - Rohit Yadav - Apache CloudStackState of the Union - Rohit Yadav - Apache CloudStack
State of the Union - Rohit Yadav - Apache CloudStackShapeBlue
253 views53 slides
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online by
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineKVM Security Groups Under the Hood - Wido den Hollander - Your.Online
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineShapeBlue
181 views19 slides
Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ... by
Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ...Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ...
Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ...ShapeBlue
146 views15 slides
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f... by
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...TrustArc
160 views29 slides
Igniting Next Level Productivity with AI-Infused Data Integration Workflows by
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Safe Software
385 views86 slides

Recently uploaded(20)

Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P... by ShapeBlue
Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P...Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P...
Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P...
ShapeBlue154 views
State of the Union - Rohit Yadav - Apache CloudStack by ShapeBlue
State of the Union - Rohit Yadav - Apache CloudStackState of the Union - Rohit Yadav - Apache CloudStack
State of the Union - Rohit Yadav - Apache CloudStack
ShapeBlue253 views
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online by ShapeBlue
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineKVM Security Groups Under the Hood - Wido den Hollander - Your.Online
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online
ShapeBlue181 views
Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ... by ShapeBlue
Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ...Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ...
Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ...
ShapeBlue146 views
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f... by TrustArc
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc160 views
Igniting Next Level Productivity with AI-Infused Data Integration Workflows by Safe Software
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Safe Software385 views
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit... by ShapeBlue
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...
ShapeBlue117 views
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ... by ShapeBlue
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...
ShapeBlue79 views
Centralized Logging Feature in CloudStack using ELK and Grafana - Kiran Chava... by ShapeBlue
Centralized Logging Feature in CloudStack using ELK and Grafana - Kiran Chava...Centralized Logging Feature in CloudStack using ELK and Grafana - Kiran Chava...
Centralized Logging Feature in CloudStack using ELK and Grafana - Kiran Chava...
ShapeBlue101 views
The Power of Heat Decarbonisation Plans in the Built Environment by IES VE
The Power of Heat Decarbonisation Plans in the Built EnvironmentThe Power of Heat Decarbonisation Plans in the Built Environment
The Power of Heat Decarbonisation Plans in the Built Environment
IES VE69 views
How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ... by ShapeBlue
How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ...How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ...
How to Re-use Old Hardware with CloudStack. Saving Money and the Environment ...
ShapeBlue123 views
Digital Personal Data Protection (DPDP) Practical Approach For CISOs by Priyanka Aash
Digital Personal Data Protection (DPDP) Practical Approach For CISOsDigital Personal Data Protection (DPDP) Practical Approach For CISOs
Digital Personal Data Protection (DPDP) Practical Approach For CISOs
Priyanka Aash153 views
Data Integrity for Banking and Financial Services by Precisely
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial Services
Precisely78 views
NTGapps NTG LowCode Platform by Mustafa Kuğu
NTGapps NTG LowCode Platform NTGapps NTG LowCode Platform
NTGapps NTG LowCode Platform
Mustafa Kuğu365 views
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive by Network Automation Forum
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveAutomating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Future of AR - Facebook Presentation by Rob McCarty
Future of AR - Facebook PresentationFuture of AR - Facebook Presentation
Future of AR - Facebook Presentation
Rob McCarty62 views
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue by ShapeBlue
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlueElevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue
ShapeBlue179 views
Keynote Talk: Open Source is Not Dead - Charles Schulz - Vates by ShapeBlue
Keynote Talk: Open Source is Not Dead - Charles Schulz - VatesKeynote Talk: Open Source is Not Dead - Charles Schulz - Vates
Keynote Talk: Open Source is Not Dead - Charles Schulz - Vates
ShapeBlue210 views
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or... by ShapeBlue
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
ShapeBlue158 views

Supporting Geo-Ontology Engineering through Spatial Data Analytics

  • 1. Supporting Geo-Ontology Engineering through Spatial Data Analytics Gloria Re Calegari, Emanuela Carlino, Irene Celino and Diego Peroni CEFRIEL – Milano, Italy
  • 2. Introduction Notion of place as a “situated concept” Dependent on physical, natural, social, cultural and cognitive processes Entities within a specific spatio-temporal context Geospatial ontologies General-purpose vs. location-specific Top-down vs. bottom-up (Re)engineering: evolution, repair and specialization Can spatial data analytics help in geo-ontology engineering? (data=instances)
  • 3. Objectives O1: Re-engineering Spatial Features Identifying concepts that play a different role in different cities, thus possibly indicating different cultural or pragmatic meanings O2: Specifying Spatial Neighborhoods Highlighting new potential concepts to characterize the urban neighborhoods of different cities
  • 4. Experimental settings Spatial objects from OpenStreetMap/LinkedGeoData Described by the LinkedGeoData ontology  general-purpose geo-ontology Several spatial features considered (amenities, shops, etc.) Data from two cities: Milano and London Similarities as well as differences Assumptions about the coverage and completeness of the data Volunteered Geographic Information (VGI), reflecting the way in which the environment is experienced
  • 5. Objective 1: Re-Engineering Spatial Features Analysis of the pattern distribution of each spatial feature If spatial objects “instances” of a spatial feature are condensed in a specific area, that feature is a relevant element to characterize that area Aim to verify whether the same concept plays a different role in different places Two-step analysis: 1) Analysing Distribution of Spatial Objects 2) Clustering Spatial Objects
  • 6. Objective1:Re-Engineering SpatialFeatures Analysing Spatial Objects' Distribution (1/4) Evaluate the tendency of each spatial feature to show spatial patterns and to create spatial aggregations Morishita index: statistical measure of dispersion based on quadrats count 1 random point patterns >>1 spatial aggregation may exist Moran index: measure of spatial auto-correlation –1 (perfect dispersion) +1 (perfect correlation) 0 (random spatial pattern)
  • 7. Objective1:Re-Engineering SpatialFeatures Analysing Spatial Objects' Distribution (2/4) Common spatial features like restaurants show almost the same spatial pattern in the two cities The analysis shows similar trends in the Morishita plot and similar Moran index values
  • 8. Objective1:Re-Engineering SpatialFeatures Analysing Spatial Objects' Distribution (3/4) Pubs are everywhere in London, while they aggregate much more in Milano This reveals an actual “semantic” difference in the meaning of this spatial feature
  • 9. Objective1:Re-Engineering SpatialFeatures Analysing Spatial Objects' Distribution (4/4) Theatres are spread everywhere in Milano, while they strongly aggregate in London In this case, the meaning of the spatial feature is the same but the analysis reveals the “semantics” of the cultural district
  • 10. Objective1:Re-Engineering SpatialFeatures Clustering Spatial Objects (1/2) Density based clustering to find agglomeration of spatial objects in a specific region OPTICS algorithm (hierarchical DBSCAN extension) – for each category of spatial object Latitude-longitude coordinates as input At least 5 points in a radius of 150 meters lgdo:Restaurant in Milano
  • 11. LONDON • 56 categories clustered at least in one region • lgdo:Hotel 40% points clustered (innermost boroughs) • lgdo:Telephone clusters in the more touristic area • lgdo:Pub widespread around the city, only 7% clustered MILANO • 25 categories clustered at least in one region • lgdo:Hotel 14% points clustered (central and peripheral areas) • lgdo:Telephone no clusters • lgdo:Pub concentrated in nightlife areas only, 20% clustered Emerging differences between the two cities Objective1:Re-Engineering SpatialFeatures Clustering Spatial Objects (2/2)
  • 12. Objective1:Re-EngineeringSpatialFeatures What do clusters and feature distributions suggest to ontology engineers? Evaluation of the validity/applicability of a geo-ontology in different locations Hints: Spatial feature with the same behaviour in different locations  ontology re- engineering is not required Spatial feature with different behaviour in different locations  ontology re- engineering may be required with location-specific extensions to the geo-ontology Note: a pure numerical analysis of the spatial objects is meaningful (lgdo:Convenience are ‘’outliers’’ in Milano, while in London are numerous)
  • 13. Objective 2: Specifying Spatial Neighborhoods Which are the spatial features that occur together? Is it possible to semantically characterize a region according to its semantic features co-occurrence? 3-step analysis: 1) Identifying Neighborhoods 2) Characterizing Neighborhoods 3) Semantically Querying Neighborhoods
  • 14. Objective2:SpecifyingSpatialNeighborhoods Identifying Neighborhoods (1/3) For each cluster obtained with OPTICS, definition of the convex hull polygon containing all the points belonging to that cluster For each polygon, computation of its centroid, which represent the categories' hotspot locations (urban areas with a high number of amenity of a given type)
  • 15. Objective2:SpecifyingSpatialNeighborhoods Identifying Neighborhoods (2/3) Clustering of these centroids to divide the urban space in neighborhoods, according to the simultaneous presence of spatial features’ hotspots Agglomerative clustering technique (with Ward minimum variance method)
  • 16. Objective2:SpecifyingSpatialNeighborhoods Identifying Neighborhoods (3/3) Clusters do not necessarily correspond to actual administrative divisions (cf. black boundaries) Clusters reflect the emerging “semantic” neighborhoods, i.e. regions of the city characterized by co- occurrence of hotspot locations of different spatial features
  • 17. Objective2:SpecifyingSpatialNeighborhoods Characterizing Neighborhoods (1/3) Which spatial features best characterize each city region? Definition of a spatial variant of tf-idf score n : set of clustered points in the neighborhood f : set of all spatial points with that feature |N| : total number of neighborhoods (24 in the experiments, 12 neighborhoods for each city) : number of neighborhood in which the spatial feature is represented by at least a point Spatial Object Frequency Inverse Neighborhood Frequency
  • 18. Objective2:SpecifyingSpatialNeighborhoods Characterizing Neighborhoods (2/3) Top-3 Spatial Object Frequency-Inverse Neighborhood Frequency in each district
  • 19. Comparison between London and Milano sof-inf scores via cosine similarity ‘’Hampstead’’ and ‘’Milano East“: suburban areas with schools, parking lots and offices ‘’Camden town’’ and ‘’Chelsea’’: peculiar spatial features not present in Milano ‘’Navigli’’: there is not a corresponding area. In London pubs are widespread around the city Milano London Objective2:SpecifyingSpatialNeighborhoods Characterizing Neighborhoods (3/3)
  • 20. Objective2:SpecifyingSpatialNeighborhoods Semantically Querying Neighborhoods (1/2) Definition of neighborhood concepts to express the co-occurrence of different spatial features in the same area Specific “spatio-semantic queries” by combining spatial features: Shopping: Clothes, Shoes, Chemist and BeautyShop Nightlife: Restaurant, Pub, Bar and Theatre Residential: School, Atm, Butcher, Greengrocer, Convenience and Parking Culture: Theatre, ArtShop, BookShop, MusicalInstruments and College Alternative: Erotic, Tattoo, Charity, CommunityCentre and ArtShop
  • 21. London Milano Cosine similarity between the sof-inf vectors of the spatio- semantic query and each neighborhood Objective2:SpecifyingSpatialNeighborhoods Semantically Querying Neighborhoods (2/2)
  • 22. Objective2:SpecifyingSpatialNeighborhoods What does Spatial Neighborhoods analysis suggest to ontology engineers? New set of concepts to represent the ‘’semantics’’ of urban districts The sof-inf score is a useful tool to select the most prominent spatial features that characterize a neighborhood  it helps in summarising spatial regions by their distinctive categories The spatio-semantic query approach supports the ontological specification of neighborhood concepts: To verify the actual “instantiation” of those concept To test different hypotheses with different feature compositions To select the most suitable level of abstraction for the geo-ontology
  • 23. Conclusions Ontology engineering is still largely an art that requires a deep domain knowledge In the case of geo-ontologies, spatial data analytics can provide ontology engineers with additional hints and suggestions Location-specific differences between cities Local characterization via a spatio-semantic queries Spatial data analytics can help to Verify existing hypotheses  highlight already-known characteristics Discover unknown specificities  in this case, further exploration is needed
  • 24. Future Work Current main limitation: spatial analytics brings only supporting insights, it is not a fully-automated technique for ontology (re)engineering User study with ontology engineers More tightly integration with ontology engineering process/tools Weighted sof-inf score, by adding spatial features’ weights to the spatio-semantic queries to specify complex neighborhood concepts Extended experiments to further prove generality applicability: Larger set of more heterogeneous cities Different sets of spatial features, also within non-urban contexts
  • 25. Supporting Geo-Ontology Engineering through Spatial Data Analytics Gloria Re Calegari, Emanuela Carlino, Irene Celino andDiego Peroni Thank you! Additional material can be found at http://swa.cefriel.it/geo/eswc2016.html