SlideShare a Scribd company logo
The ACM 10th International Conference on Web Intelligence, Mining and Semantics
(WIMS ’20), June 30 – July 3, 2020, Biarritz, France
The ICARUS ONTOLOGY:
A general aviation ontology developed using
a multi-layer approach
Dimosthenis Stefanidis, Chrysovalantis Christodoulou, Moysis Symeonidis, George Pallis, Marios D.
Dikaiakos, Loukas Pouis, Kalia Orphanou, Fenareti Lampathaki, Dimitrios Alexandrou
dstefa02@cs.ucy.ac.cy
1
Aviation Industry
2
Airlines
Aviation Data
Airports
More than 98
million terabytes
of data by 20261
4.1 billion passengers
and 56.1 million
tones of freight were
carried in 20172
1 www.flightglobal.com/news/articles/insight-from-flightglobal-the-big-data-landscape-446681
2 https://www.icao.int/annual-report-2017/Pages/the-world-of-air-transport-in-2017.aspx
Aviation Data
 Complex
 Derive from heterogeneous data sources
 Lack of standardization
 Data integration and linking challenge
3
“One of the biggest challenges is to integrate the different data silos, for example weather data, live airspace usage or data from the
airports. There is really no standard and that complicates things. Even internally we are still merging different data sets from different areas in
the company. Insights emerge when you put different departmental data sets together, but at the moment, it is not a smooth process.”
By Rey-Villaverde (Head of Data Science, EasyJet)
Varying data formats
A significant bottleneck with huge cost
Data Integration Problem
 With no common standard, aviation data models can vary along
various dimensions!
 Data providers can use different formats to encode aviation data
e.g. an airline carrier ID field could be stored as a three (IATA) or four-letter code (ICAO)
 Field names assigned to values can be misleading
e.g. a provider may use the name "AT" while another may use "arrTime" for the field "aircraft arrival time"
 Even if two data fields are identical, that doesn't ensure that the data represents the same information
e.g. the "aircraft arrival time“ may correspond to a scheduled or an actual arrival time.
 Data values may be recorded at different temporal frequencies (e.g. once per hour) or spatial regions (e.g.
airspace sectors, geographic regions)
 Measurement units are often omitted in the data storage schemes and can lead to problems when different
units are employed across different systems (e.g. metric vs imperial, feet vs flight level)
4
ICARUS Platform
5
 A novel Big Data platform to deal with
the data integration challenges in the
aviation
 Allows exploration, sharing, trading,
curation, integration and deep analysis
in a trusted manner
 Original and derivative data, characterized
by different volume, velocity and variety
www.icarus2020.aero
The ICARUS Ontology
6
The ICARUS Ontology
 Represents meaningfully entities of the ICARUS Platform
e.g. datasets, algorithms, services (a combination of data and ML algorithms), usage statistics,
registered experts etc.
 Captures structural and semantic characteristics of entities by using semantic annotation of
datasets
 Extracts metadata from ICARUS Platform operations to construct the ICARUS knowledge-base
 Supports continuous integration of new datasets, services, and users into the platform
 Supports search, query and linking over multiple data sources and information assets
 Feeds the ICARUS recommendation engine with useful information
7
ICARUS Ontology - A Multi-layer Approach
8
Meta Contexts
and attributes
(top-level ontology
related to metadata
of entities)
Domain-specific
context and attributes
(domain ontologies
related to aviation)
C
C1
C2
C3
C4
Weather Airport Aircraft
FlightPassenger Health
Top-level ontology for the
metadata of entities
…
ICARUS Ontology - Design Process
9
Expand
the domain-level
ontologies
Capture important
concepts,
relationships and
data fields from
aviation
stakeholders’ data
Integrate
existing aviation
domain ontology
(e.g. NASA
ontology1)
Create a top-level
ontology for
describing
platform’s
concepts
Ontology Coding
Based on a formal language (OWL) using Protégé.
1 Keller, R. M. (2016, September). Ontologies for aviation data management. In 2016
IEEE/AIAA 35th Digital Avionics Systems Conference (DASC) (pp. 1-9). IEEE.
Class Hierarchy
10
Class Hierarchy
11
Class Hierarchy
12
Class Hierarchy
13
Class Hierarchy
14
Possible competencies questions that ICARUS Ontology
can answer
15
 ‘‘Which datasets contain columns about flight delay time?’’
 ‘‘Which is the airport departure terminal for a specific flight?’’
 ‘‘How many were the occupied seats on a specific flight?’’
Use Cases
16
Twitter Recommendation System Covid-19
Scenario: Twitter
 Based on Twitter data (e.g. travelers' tweets) that are related to the aviation,
the popularity (e.g. sentiment score) of airlines and airports can be found
and extracted.
 Providing such kind of statistics (e.g. popularity) could help airlines to find:
 the most common problems in case of bad flight
e.g. late flight, long lines, lost luggage, customer service, etc.
 popularity of one airline versus competitors
17
Scenario: Twitter
1. Ontology Extension: expand the ICARUS ontology based on the
concepts and entities related to Twitter e.g. twitter user account, the
number of followers, tweets, etc.
2. Data Collection: retrieve tweets and airlines accounts via the Twitter
Streaming API
3. Data Pre-processing: apply data cleaning and natural language
processing (NLP) techniques to the retrieved tweets
18
Scenario: Twitter
4. Emotions Extraction: perform sentiment analysis (e.g. VADER) on a
set of retrieved tweets, by including emotion categories in the
ontology
5. Storage: store to the ICARUS ontology (knowledge base)
6. Query KB: Use SPARQL queries to answer possible questions of
ICARUS users
e.g. "Which airline has the lowest popularity?" (searching for the entity airline with the
most negative sentiment based on the stored aggregated statistics)
19
Scenario: Recommendations
20
 Provide high-quality recommendations of datasets and services to the
ICARUS users by utilizing the ICARUS ontology.
 We can recommend assets that are connected indirectly to the user’s
preferences and needs by:
 capturing structural and semantic characteristics of the various
ICARUS entities
 inferring relationships (e.g. inheritance) between users and assets that
were hidden
Scenario: Recommendations
21
1. Data Collection: retrieve data related to users and assets of the
ICARUS platform (e.g. preferences, interactions like purchases with
datasets and services, metadata of datasets, etc.).
2. Storage: store to the ICARUS ontology (knowledge base)
3. Reasoning: apply a reasoning algorithm (Pellet) to reveal hidden
relationships (e.g. inheritance)
Scenario: Recommendations
22
4. Recommender: Use a weighted-based hybrid recommendation
system approach to provide recommendation of datasets and
services to each user
 Content-Based: Use SPARQL queries to retrieve users’ preferences, geolocation
and organization types with the respective information of the given datasets and
services
 Collaborative Filtering: Use SPARQL queries to retrieve the interplay between
users and assets to construct the interaction matrix
Scenario: COVID-19
 Current challenges of health organizations:
locate, collect, explore and integrate reliable data about airline and
human mobility, with a sufficient geographical coverage and
resolution.
 Improving such level of detail would result:
 in more accurate epidemic predictions and
 a possible estimation of relative revenue losses to be expected in different
pandemic scenarios
23
Scenario: COVID-19
 The ICARUS ontology and the relationships between each entity
can be utilized to combine epidemics data with other aviation-
related data for data analytics and epidemic forecasts.
1. Ontology Extension: expand the ICARUS ontology based on the
concepts and entities related to COVID e.g. mortality rate, cases per
city/country, etc.
2. Data Collection: retrieve open data related to COVID-19 and aviation
data
24
Scenario: COVID-19
3. Storage: store to the ICARUS ontology (knowledge base)
4. Query KB: Use SPARQL queries, aviation-related data and COVID-
19 data to answer possible questions of ICARUS users
e.g. "Which datasets can help me predict a virus transmission from incoming flights?"
(Find datasets that are related to incoming flights and virus and they are utilized by
forecasting services for virus transmission)
25
Conclusion
 We presented the ICARUS ontology, an aviation domain ontology
designed using a multi-layer approach for enabling:
 the integration and reasoning over multiple sources of heterogeneous aviation-
related data
 the semantic description of metadata produced by the ICARUS platform
 Main strengths of the proposed ontology:
 extendibility and interoperability due to the multi-layer design
 ease of use on multiple aviation data sources of different format and structure
26
Thank You!
The ontology is available here (open source):
https://github.com/UCY-LINC-LAB/icarus-ontology
Co-funded by the European
Commission Horizon 2020
- Grant # 780792
27

More Related Content

Similar to ICARUS @WIMS 2020 (June 2020, virtual)

Advances in aeronautical informatics ( pdf drive.com )
Advances in aeronautical informatics ( pdf drive.com )Advances in aeronautical informatics ( pdf drive.com )
Advances in aeronautical informatics ( pdf drive.com )
BatDeegii
 
Quo vadis, provenancer?  Cui prodest?  our own trajectory: provenance of data...
Quo vadis, provenancer? Cui prodest? our own trajectory: provenance of data...Quo vadis, provenancer? Cui prodest? our own trajectory: provenance of data...
Quo vadis, provenancer?  Cui prodest?  our own trajectory: provenance of data...
Paolo Missier
 
A Brighter Future for the Black Box
A Brighter Future for the Black BoxA Brighter Future for the Black Box
A Brighter Future for the Black BoxJLLARMOR
 
Describe and Publish data sets on the web: vocabularies, catalogues, data por...
Describe and Publish data sets on the web: vocabularies, catalogues, data por...Describe and Publish data sets on the web: vocabularies, catalogues, data por...
Describe and Publish data sets on the web: vocabularies, catalogues, data por...
Franck Michel
 
Isncc2020
Isncc2020Isncc2020
Isncc2020
Rim Moussa
 
Advanced UAV Trajectory Generation Planning And Guidance
Advanced UAV Trajectory Generation  Planning And GuidanceAdvanced UAV Trajectory Generation  Planning And Guidance
Advanced UAV Trajectory Generation Planning And Guidance
Stephen Faucher
 
4 th International Conference on Data Science and Machine Learning (DSML 2023)
4 th International Conference on Data Science and Machine Learning (DSML 2023)4 th International Conference on Data Science and Machine Learning (DSML 2023)
4 th International Conference on Data Science and Machine Learning (DSML 2023)
gerogepatton
 
2.2 Episecc project Overview
2.2   Episecc project Overview2.2   Episecc project Overview
2.2 Episecc project Overview
Fraunhofer FOKUS
 
4th International Conference on Data Science and Machine Learning (DSML 2023)
4th International Conference on Data Science and Machine Learning (DSML 2023) 4th International Conference on Data Science and Machine Learning (DSML 2023)
4th International Conference on Data Science and Machine Learning (DSML 2023)
gerogepatton
 
Data Provenance for Data Science
Data Provenance for Data ScienceData Provenance for Data Science
Data Provenance for Data Science
Paolo Missier
 
ExpertSystemTechForMilitary
ExpertSystemTechForMilitaryExpertSystemTechForMilitary
ExpertSystemTechForMilitaryCora Carmody
 
MACHINE LEARNING TECHNIQUES FOR ANALYSIS OF EGYPTIAN FLIGHT DELAY
MACHINE LEARNING TECHNIQUES FOR ANALYSIS OF EGYPTIAN FLIGHT DELAYMACHINE LEARNING TECHNIQUES FOR ANALYSIS OF EGYPTIAN FLIGHT DELAY
MACHINE LEARNING TECHNIQUES FOR ANALYSIS OF EGYPTIAN FLIGHT DELAY
IJDKP
 
Call for Papers- Special Session: Contemporary Innovations in Data Sciences, ...
Call for Papers- Special Session: Contemporary Innovations in Data Sciences, ...Call for Papers- Special Session: Contemporary Innovations in Data Sciences, ...
Call for Papers- Special Session: Contemporary Innovations in Data Sciences, ...
Christo Ananth
 
Getting insights from IoT data with Apache Spark and Apache Bahir
Getting insights from IoT data with Apache Spark and Apache BahirGetting insights from IoT data with Apache Spark and Apache Bahir
Getting insights from IoT data with Apache Spark and Apache Bahir
Luciano Resende
 
How Bluemix Helps NASA Innovate
How Bluemix Helps NASA InnovateHow Bluemix Helps NASA Innovate
How Bluemix Helps NASA Innovate
IBM
 
Preprint-CSAE,China,21-23 October 2022.pdf
Preprint-CSAE,China,21-23 October 2022.pdfPreprint-CSAE,China,21-23 October 2022.pdf
Preprint-CSAE,China,21-23 October 2022.pdf
Christo Ananth
 
Standards Show
Standards ShowStandards Show
Standards Showmcusack
 
2006-03-14 WG on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed...
2006-03-14 WG on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed...2006-03-14 WG on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed...
2006-03-14 WG on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed...Rudolf Husar
 
060314 Ispra Htap Presentations Husar 060314 Ispra
060314 Ispra Htap Presentations Husar 060314 Ispra060314 Ispra Htap Presentations Husar 060314 Ispra
060314 Ispra Htap Presentations Husar 060314 Ispra
Rudolf Husar
 
Shceduling iot application on cloud computing
Shceduling iot application on cloud computingShceduling iot application on cloud computing
Shceduling iot application on cloud computing
Eman Ahmed
 

Similar to ICARUS @WIMS 2020 (June 2020, virtual) (20)

Advances in aeronautical informatics ( pdf drive.com )
Advances in aeronautical informatics ( pdf drive.com )Advances in aeronautical informatics ( pdf drive.com )
Advances in aeronautical informatics ( pdf drive.com )
 
Quo vadis, provenancer?  Cui prodest?  our own trajectory: provenance of data...
Quo vadis, provenancer? Cui prodest? our own trajectory: provenance of data...Quo vadis, provenancer? Cui prodest? our own trajectory: provenance of data...
Quo vadis, provenancer?  Cui prodest?  our own trajectory: provenance of data...
 
A Brighter Future for the Black Box
A Brighter Future for the Black BoxA Brighter Future for the Black Box
A Brighter Future for the Black Box
 
Describe and Publish data sets on the web: vocabularies, catalogues, data por...
Describe and Publish data sets on the web: vocabularies, catalogues, data por...Describe and Publish data sets on the web: vocabularies, catalogues, data por...
Describe and Publish data sets on the web: vocabularies, catalogues, data por...
 
Isncc2020
Isncc2020Isncc2020
Isncc2020
 
Advanced UAV Trajectory Generation Planning And Guidance
Advanced UAV Trajectory Generation  Planning And GuidanceAdvanced UAV Trajectory Generation  Planning And Guidance
Advanced UAV Trajectory Generation Planning And Guidance
 
4 th International Conference on Data Science and Machine Learning (DSML 2023)
4 th International Conference on Data Science and Machine Learning (DSML 2023)4 th International Conference on Data Science and Machine Learning (DSML 2023)
4 th International Conference on Data Science and Machine Learning (DSML 2023)
 
2.2 Episecc project Overview
2.2   Episecc project Overview2.2   Episecc project Overview
2.2 Episecc project Overview
 
4th International Conference on Data Science and Machine Learning (DSML 2023)
4th International Conference on Data Science and Machine Learning (DSML 2023) 4th International Conference on Data Science and Machine Learning (DSML 2023)
4th International Conference on Data Science and Machine Learning (DSML 2023)
 
Data Provenance for Data Science
Data Provenance for Data ScienceData Provenance for Data Science
Data Provenance for Data Science
 
ExpertSystemTechForMilitary
ExpertSystemTechForMilitaryExpertSystemTechForMilitary
ExpertSystemTechForMilitary
 
MACHINE LEARNING TECHNIQUES FOR ANALYSIS OF EGYPTIAN FLIGHT DELAY
MACHINE LEARNING TECHNIQUES FOR ANALYSIS OF EGYPTIAN FLIGHT DELAYMACHINE LEARNING TECHNIQUES FOR ANALYSIS OF EGYPTIAN FLIGHT DELAY
MACHINE LEARNING TECHNIQUES FOR ANALYSIS OF EGYPTIAN FLIGHT DELAY
 
Call for Papers- Special Session: Contemporary Innovations in Data Sciences, ...
Call for Papers- Special Session: Contemporary Innovations in Data Sciences, ...Call for Papers- Special Session: Contemporary Innovations in Data Sciences, ...
Call for Papers- Special Session: Contemporary Innovations in Data Sciences, ...
 
Getting insights from IoT data with Apache Spark and Apache Bahir
Getting insights from IoT data with Apache Spark and Apache BahirGetting insights from IoT data with Apache Spark and Apache Bahir
Getting insights from IoT data with Apache Spark and Apache Bahir
 
How Bluemix Helps NASA Innovate
How Bluemix Helps NASA InnovateHow Bluemix Helps NASA Innovate
How Bluemix Helps NASA Innovate
 
Preprint-CSAE,China,21-23 October 2022.pdf
Preprint-CSAE,China,21-23 October 2022.pdfPreprint-CSAE,China,21-23 October 2022.pdf
Preprint-CSAE,China,21-23 October 2022.pdf
 
Standards Show
Standards ShowStandards Show
Standards Show
 
2006-03-14 WG on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed...
2006-03-14 WG on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed...2006-03-14 WG on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed...
2006-03-14 WG on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed...
 
060314 Ispra Htap Presentations Husar 060314 Ispra
060314 Ispra Htap Presentations Husar 060314 Ispra060314 Ispra Htap Presentations Husar 060314 Ispra
060314 Ispra Htap Presentations Husar 060314 Ispra
 
Shceduling iot application on cloud computing
Shceduling iot application on cloud computingShceduling iot application on cloud computing
Shceduling iot application on cloud computing
 

Recently uploaded

Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Subhajit Sahu
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
mzpolocfi
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 

Recently uploaded (20)

Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 

ICARUS @WIMS 2020 (June 2020, virtual)

  • 1. The ACM 10th International Conference on Web Intelligence, Mining and Semantics (WIMS ’20), June 30 – July 3, 2020, Biarritz, France The ICARUS ONTOLOGY: A general aviation ontology developed using a multi-layer approach Dimosthenis Stefanidis, Chrysovalantis Christodoulou, Moysis Symeonidis, George Pallis, Marios D. Dikaiakos, Loukas Pouis, Kalia Orphanou, Fenareti Lampathaki, Dimitrios Alexandrou dstefa02@cs.ucy.ac.cy 1
  • 2. Aviation Industry 2 Airlines Aviation Data Airports More than 98 million terabytes of data by 20261 4.1 billion passengers and 56.1 million tones of freight were carried in 20172 1 www.flightglobal.com/news/articles/insight-from-flightglobal-the-big-data-landscape-446681 2 https://www.icao.int/annual-report-2017/Pages/the-world-of-air-transport-in-2017.aspx
  • 3. Aviation Data  Complex  Derive from heterogeneous data sources  Lack of standardization  Data integration and linking challenge 3 “One of the biggest challenges is to integrate the different data silos, for example weather data, live airspace usage or data from the airports. There is really no standard and that complicates things. Even internally we are still merging different data sets from different areas in the company. Insights emerge when you put different departmental data sets together, but at the moment, it is not a smooth process.” By Rey-Villaverde (Head of Data Science, EasyJet) Varying data formats A significant bottleneck with huge cost
  • 4. Data Integration Problem  With no common standard, aviation data models can vary along various dimensions!  Data providers can use different formats to encode aviation data e.g. an airline carrier ID field could be stored as a three (IATA) or four-letter code (ICAO)  Field names assigned to values can be misleading e.g. a provider may use the name "AT" while another may use "arrTime" for the field "aircraft arrival time"  Even if two data fields are identical, that doesn't ensure that the data represents the same information e.g. the "aircraft arrival time“ may correspond to a scheduled or an actual arrival time.  Data values may be recorded at different temporal frequencies (e.g. once per hour) or spatial regions (e.g. airspace sectors, geographic regions)  Measurement units are often omitted in the data storage schemes and can lead to problems when different units are employed across different systems (e.g. metric vs imperial, feet vs flight level) 4
  • 5. ICARUS Platform 5  A novel Big Data platform to deal with the data integration challenges in the aviation  Allows exploration, sharing, trading, curation, integration and deep analysis in a trusted manner  Original and derivative data, characterized by different volume, velocity and variety www.icarus2020.aero
  • 7. The ICARUS Ontology  Represents meaningfully entities of the ICARUS Platform e.g. datasets, algorithms, services (a combination of data and ML algorithms), usage statistics, registered experts etc.  Captures structural and semantic characteristics of entities by using semantic annotation of datasets  Extracts metadata from ICARUS Platform operations to construct the ICARUS knowledge-base  Supports continuous integration of new datasets, services, and users into the platform  Supports search, query and linking over multiple data sources and information assets  Feeds the ICARUS recommendation engine with useful information 7
  • 8. ICARUS Ontology - A Multi-layer Approach 8 Meta Contexts and attributes (top-level ontology related to metadata of entities) Domain-specific context and attributes (domain ontologies related to aviation) C C1 C2 C3 C4 Weather Airport Aircraft FlightPassenger Health Top-level ontology for the metadata of entities …
  • 9. ICARUS Ontology - Design Process 9 Expand the domain-level ontologies Capture important concepts, relationships and data fields from aviation stakeholders’ data Integrate existing aviation domain ontology (e.g. NASA ontology1) Create a top-level ontology for describing platform’s concepts Ontology Coding Based on a formal language (OWL) using Protégé. 1 Keller, R. M. (2016, September). Ontologies for aviation data management. In 2016 IEEE/AIAA 35th Digital Avionics Systems Conference (DASC) (pp. 1-9). IEEE.
  • 15. Possible competencies questions that ICARUS Ontology can answer 15  ‘‘Which datasets contain columns about flight delay time?’’  ‘‘Which is the airport departure terminal for a specific flight?’’  ‘‘How many were the occupied seats on a specific flight?’’
  • 17. Scenario: Twitter  Based on Twitter data (e.g. travelers' tweets) that are related to the aviation, the popularity (e.g. sentiment score) of airlines and airports can be found and extracted.  Providing such kind of statistics (e.g. popularity) could help airlines to find:  the most common problems in case of bad flight e.g. late flight, long lines, lost luggage, customer service, etc.  popularity of one airline versus competitors 17
  • 18. Scenario: Twitter 1. Ontology Extension: expand the ICARUS ontology based on the concepts and entities related to Twitter e.g. twitter user account, the number of followers, tweets, etc. 2. Data Collection: retrieve tweets and airlines accounts via the Twitter Streaming API 3. Data Pre-processing: apply data cleaning and natural language processing (NLP) techniques to the retrieved tweets 18
  • 19. Scenario: Twitter 4. Emotions Extraction: perform sentiment analysis (e.g. VADER) on a set of retrieved tweets, by including emotion categories in the ontology 5. Storage: store to the ICARUS ontology (knowledge base) 6. Query KB: Use SPARQL queries to answer possible questions of ICARUS users e.g. "Which airline has the lowest popularity?" (searching for the entity airline with the most negative sentiment based on the stored aggregated statistics) 19
  • 20. Scenario: Recommendations 20  Provide high-quality recommendations of datasets and services to the ICARUS users by utilizing the ICARUS ontology.  We can recommend assets that are connected indirectly to the user’s preferences and needs by:  capturing structural and semantic characteristics of the various ICARUS entities  inferring relationships (e.g. inheritance) between users and assets that were hidden
  • 21. Scenario: Recommendations 21 1. Data Collection: retrieve data related to users and assets of the ICARUS platform (e.g. preferences, interactions like purchases with datasets and services, metadata of datasets, etc.). 2. Storage: store to the ICARUS ontology (knowledge base) 3. Reasoning: apply a reasoning algorithm (Pellet) to reveal hidden relationships (e.g. inheritance)
  • 22. Scenario: Recommendations 22 4. Recommender: Use a weighted-based hybrid recommendation system approach to provide recommendation of datasets and services to each user  Content-Based: Use SPARQL queries to retrieve users’ preferences, geolocation and organization types with the respective information of the given datasets and services  Collaborative Filtering: Use SPARQL queries to retrieve the interplay between users and assets to construct the interaction matrix
  • 23. Scenario: COVID-19  Current challenges of health organizations: locate, collect, explore and integrate reliable data about airline and human mobility, with a sufficient geographical coverage and resolution.  Improving such level of detail would result:  in more accurate epidemic predictions and  a possible estimation of relative revenue losses to be expected in different pandemic scenarios 23
  • 24. Scenario: COVID-19  The ICARUS ontology and the relationships between each entity can be utilized to combine epidemics data with other aviation- related data for data analytics and epidemic forecasts. 1. Ontology Extension: expand the ICARUS ontology based on the concepts and entities related to COVID e.g. mortality rate, cases per city/country, etc. 2. Data Collection: retrieve open data related to COVID-19 and aviation data 24
  • 25. Scenario: COVID-19 3. Storage: store to the ICARUS ontology (knowledge base) 4. Query KB: Use SPARQL queries, aviation-related data and COVID- 19 data to answer possible questions of ICARUS users e.g. "Which datasets can help me predict a virus transmission from incoming flights?" (Find datasets that are related to incoming flights and virus and they are utilized by forecasting services for virus transmission) 25
  • 26. Conclusion  We presented the ICARUS ontology, an aviation domain ontology designed using a multi-layer approach for enabling:  the integration and reasoning over multiple sources of heterogeneous aviation- related data  the semantic description of metadata produced by the ICARUS platform  Main strengths of the proposed ontology:  extendibility and interoperability due to the multi-layer design  ease of use on multiple aviation data sources of different format and structure 26
  • 27. Thank You! The ontology is available here (open source): https://github.com/UCY-LINC-LAB/icarus-ontology Co-funded by the European Commission Horizon 2020 - Grant # 780792 27