From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent Systems

Edward Curry
Edward CurryResearch Unit Leader at Digital Enterprise Research Institute (DERI)
From Data Platforms to Dataspaces:
Enabling Data Ecosystems for Intelligent Systems
Edward Curry,
Insight SFI Research Centre for Data Analytics
edward.curry@nuigalway.ie
LDAC2021 - 9th Linked Data in Architecture and Construction Workshop (11 - 13 October 2021)
Overview
• Part I: Data Ecosystems for Intelligent Systems
• Part II: Real-time Linked Dataspaces
• Part III: Final Thoughts on Research Directions and Data Policy
Contents
Part I: Fundamentals and Concepts
Part II: Data Support Services
Part III: Stream and Event Processing Services
Part IV: Intelligent Systems and Applications
Part V: Future Directions
Team
http://dataspaces.info
Web:
dataspaces.info
A Team Effort: Open Access Book
Part I: Data Ecosystems for Intelligent
Systems
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent Systems
First LDAC Meeting 2012
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent Systems
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent Systems
Emerging Smart
Environments….
Real World Digital World
Sensors Orient
Decide
Actuators Act
Observe
Physical Twin
(Asset-centric)
Digital Twin
(System-centric)
Digital
Twins
http://dataspaces.info 10
11
Data-driven Intelligence will be drive by industrial, personal and open data
Connected Intelligent Systems
Distributed and Decentralised Data Ecosystems
Key Barrier: Interoperability – Protocols and Semantics
12
Curry, E. and Sheth, A. (2018) ‘Next-Generation Smart Environments: From System of Systems to Data Ecosystems’,
IEEE Intelligent Systems, 33(3), pp. 69–76. doi: 10.1109/MIS.2018.033001418.
Ecosystem
community of organisms and their
environment interacting as a system
Tansley (1935) Lindeman (1942),…
Data
Ecosystem
socio-technical system
extracting value from data
value chains by interacting
organisations and individuals
oriented to business and
societal purposes
marketplace, competition,
collaboration
Curry, E. (2016) ‘The Big Data Value Chain: Definitions, Concepts,
and Theoretical Approaches’, in Cavanillas, J. M., Curry, E., and
Wahlster, W. (eds) New Horizons for a Data-Driven Economy..
http://dataspaces.info 15
The “gold mining” metaphor applied to data processing
Transforming Transport has
made use of a total of 164
terabytes of data from 160
different data sources
Maturity stages of data assets and related “sieves”
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent Systems
Traditional Approaches to Data Integration
Low
High
High
Frequency
of use
Cost of administration &
semantic integration using
traditional approaches
Popularity
/
Use
Number of data sources, entities, attributes
http://dataspaces.info
The Long Tail of Data
20
• Heterogeneous, complex and large-scale data
• Very-large and dynamic “schemas”
• Open Environments: distributed, decentralised
decoupled data sources, anonymous users, multi-
domain, lack of global order of information flow
• Multiple perspectives
(conceptualisations) of the reality.
• Ambiguity, vagueness, inconsistency.
Content Space: From Rigid
Schemas to Schema-less.....
...and Fundamental
Decentralisation
The Red
Queen
Hypothesis
“It takes all the running you can do, to keep in the
same place. If you want to get somewhere else,
you must run at least twice as fast as that!”
Lewis Carroll's Through the Looking-Glass
Part II: Real-time Linked Dataspaces
Data Platforms will Fuel AI-Driven Decision-Making
Data Generation and Analysis
(including IoT)
Data Platforms
(Access and Portability)
AI and Decision Platforms
IoT-Enablement
Layer 1 - Communication and Sensing
IPv6, Wi-Fi, RFID, CoAP, AVB, etc.
Layer 3 - Data
Schema, Entities, Catalog, Sharing, Access/Control, etc.
Layer 4 – Intelligent Apps, Analytics, and Users
Datasets
Things / Sensors
Contextual Data Sources
(including legacy systems)
Predictive
Analytics
Situation
Awareness
Decision
Support
Digital
Twin
Machine
Learning
Users
Layer 2 - Middleware
Peer-to-Peer, Events, Pub/Sub, SOA, SDN, etc.
A Data Sharing Layer is needed….
Adapted from: L. Atzori, A. Iera, and G. Morabito, “The
Internet of Things: A survey,” Comput. Networks, vol. 54,
no. 15, pp. 2787–2805, Oct. 2010.
http://dataspaces.info
Human Interactivity: Web Search
From Structure to Knowledge Graph
to Search
~1995
~100K Websites
Exact Results
Human Curated
~1998
~2.4M Websites
Approximate Results
Computed
~2012
~700M
Approximate Results + Exact
Computed + Crowd
25
Cost of Data Management Solutions
http://dataspaces.info
Administrative Proximity
– Close vs. Loose Coordination
– Assumptions concerning
guarantees such as data, access,
quality, and consistency,
Semantic Integration
– Degree to which data schemas are
matched up (types, attributes, and
names).
26
Halevy, A., Franklin, M. and Maier, D. 2006. Principles of dataspace
systems. 25th ACM SIGMOD-SIGACT-SIGART symposium on Principles of
database systems - PODS ’06 (New York, New York, USA, 2006), 1–9.
Approximate and Best Effort Approaches
Low
High
High
Frequency
of use Approximate &
best-effort
approaches
Cost of administration &
semantic integration using
traditional approaches
Popularity
/
Use
Number of data sources, entities, attributes
http://dataspaces.info
The Long Tail of Data
Dataspace
“Dataspaces are not a data integration approach; rather, they are
more of a data co-existence approach. The goal of dataspace
support is to provide base functionality over all data sources,
regardless of how integrated they are.”
(Halevy, A., Franklin, M. and Maier, D. 2006.)
Enabling platform for data management for intelligent
systems within smart environments
Combines the pay-as-you-go paradigm of dataspaces,
linked data, and knowledge graphs with entity-centric
real-time queries
Real-time Linked Dataspaces
29
Principles: (adapted from by Halevy et al.)
• Must deal with many different formats of streams
and events.
• Does not subsume the stream and event processing
engines; they still provide individual access via their
native interfaces.
• Queries in are provided on a best-effort and
approximate basis.
• Must provide pathways to improve the integration
among the data sources, including streams and
events, in a pay-as-you-go fashion.
Key Challenge
http://dataspaces.info
Investigate techniques to enable approximate
and best-effort support services for loose
administrative proximity and semantic
integration
Incremental support services
• Catalog
• entity management
• query and search
• data discovery
• human tasks
• quality of service
• complex event
processing
• streams dissemination
• approximate semantic
event matching
•
•
Sahlgren, 2013
Formal World Real World
Baroni et al. 2013
• Distributional hypothesis: the context surrounding a given word in a text provides
relevant information about its meaning.
– "a word is characterized by the company it keeps" was popularized by Firth in the 1950s
• Simplified semantic model: Associational and quantitative.
32
A wife is a female partner in a marriage. The term "wife" seems to
be a close term to bride, the latter is a female participant in a
wedding ceremony, while a wife is a married woman during her
marriage.
...
Distributional Semantic Model
32
c1
child
husband
spouse
cn
c2
function (number of times that the words occur in c1)
0.7
0.5
Distributional Semantic Model
Distributional
semantic model:
Semantic statistical
knowledge extracted
from large Web
corpora
Works as a semantic
ranking function
E.g. esa(room, building)= 0.099
E.g. esa(room, car)= 0.009
θ
Gabrilovich, E.; Markovitch, S.(2007). Computing semantic relatedness using Wikipedia-based
Explicit Semantic Analysis. Proc. 20th Int'l Joint Conf. on Artificial Intelligence (IJCAI).
33
Schema-Agnostic Natural Language Queries
NobelPrizeWinner
A
Semantic Gap
Marie Curie
:type
Possible Data Representations
Information Need: Who are the children of Marie Curie married to?
Marie Curie
2
B C
Marie Curie
Henry R. Labouisse
Ève Curie
Irène Joliot-Curie
:motherOf
:motherOf :wifeOf
:type
:numberOfKids
Frédéric Joliot-Curie
:wifeOf
Frédéric Joliot-Curie
Irène Joliot-Curie
:Spouse
:Child
Henry R. Labouisse
Ève Curie
:Spouse
:Child
Scientist
Freitas, A. and Curry, E. (2014) ‘Natural Language Queries over Heterogeneous Linked Data Graphs: A Distributional-Compositional
Semantics Approach’, in 18th International Conference on Intelligent User Interfaces (IUI’14): ACM
Marie Curie children married to Person
:Marie Curie
Query:
Linked
Data:
:Ève Curie
:motherOf
:Henry R. Labouisse
:wifeOf
Distributional Semantic Search
Information Need: Who are the children of Marie Curie married to?
Query Planner
Ƭ-Space
Large-scale
unstructured data
Commonsense
knowledge
Database
Distributional
semantics
Core semantic approximation
& composition operations
Query Analysis
Query Query Features
Query Plan
Treo: Question Answering over Linked Data
Challenges
• Heterogeneity in Event Semantics
(000s schema)
• Heterogeneity in processing Rules
(000s of rule tied to schema)
• Manually Implemented
Approximate Semantic Event Matcher
• Distributional Event Semantics
• Enables pay-as-you-go event
matching for data streams
• Replaced 48,000 exact rules with
100 approximate rules with around
85% accuracy
Approximate Semantic Matching of Streams
37
Hasan, S. and Curry, E. (2014) ‘Approximate Semantic Matching of Events
for the Internet of Things’, ACM Transactions on Internet Technology, 14(1).
Intelligent Systems and Applications
http://dataspaces.info
L
OCATION
Airport Office Home Mixed Use School
LINATE AIRPORT,
MILAN, ITALY
INSIGHT,
GALWAY, IRELAND
HOUSES,
THERMI, GREECE
ENGINEERING,
NUI GALWAY
COLÁISTE NA
COIRIBE, IRELAND
T
ARGET
U
SER
S
• Corporate users
• ~9.5 million
passengers
• Utilities
management
• Maintenance
staff
• Environmental
managers
• 130 staff
• Office consumers
• Operations
managers
• Utility providers
• Building
managers
• Domestic
consumers
(adults, young
adults and
children)
• Utility providers
• Mixed/Public
consumers
• Building
managers
• 100 staff
• 1000 students
(ages 18 to 24)
• Mixed/Public
consumers
• School
management
• Maintenance
staff
• 500 students
(ages 12 to 18)
• 40 teachers
I
NFRASTRUCTURE
• Safety critical
• 10 km water
network
• Multiple
buildings
• Water meters
• Energy meters
• Legacy systems
• 2190 m2 space
• 22 offices + 160
open plan spaces
• Conference room
• 4 meeting rooms
• 3 kitchens
• Data centre
• 30 person café
• Energy meters
• 10 households
• Typical variety of
domestic settings
including kitchen,
showers, baths,
living room,
bedrooms, and
garden
• Water meters
• Water meters
• Energy meters
• Rainwater
harvesting
• Café
• Weather station
• Wet labs
• Showers
• Water meters
• Energy meters
• Rainwater
harvesting
Smart Water
and Energy
Management
Pilots
Smart School
CnaC School in
Galway, Ireland
Mixed Use
Galway, Ireland
Building
Manager
University Students
Smart Airport
Milan Linate,
Italy
Corporate
Staff
Passengers
Smart Homes
Municipality of
Thermi, Greece
Smart Office
Galway, Ireland
Families
Operational
Staff
Researchers
Application
Developers
Teaching Staff School Students
Data
Scientist
Need to target different Target Users
IoT-enabled
Digital Twins
and
Intelligent
Applications
Real-time Linked Dataspace
Datasets
Things / Sensors
Entity Management Service
Catalog &
Access Control
Service
Personal Dashboard
Public Dashboards
Decision Analytics and
Machine Learning
Notifications Apps
Alerts
Orient Decide
Act
Search & Query
Service
Entity-Centric
Real-Time Query
Service
Complex Event
Processing Service
Digital Twin
CEP
D
Human Task Service
Human Task
Service
Observe
http://dataspaces.info
“OODA” Loop
Interactive Public Displays
Alerts and Notifications
Personalised Dashboards
Example
Applications
Pilot Impacts
Experiences and Lessons Learnt from Dataspaces
spaces.info
• Developer education need for stream processing and approximate
results
• Incremental data management can support agile software
development
• Build the business case for data-driven innovation
• Integration with legacy data is a significant cost in smart environments
• The 5 star pay-as-you-go model simplified communication with non-
technical users
• A secure canonical source for entity data simplifies application
development
• Data quality with things and sensors is challenging in an operational
environment
• Working with three pipelines adds overhead (LAMBDA + Entity Layer)
43
Part III: Final Thoughts on
Research Directions and Data
Policy
http://dataspaces.info 45
Large-scale Decentralised Support Services
• Enhanced Supported Services
• Scaling Entity Management
• Maintenance and Operation Cost
Multimedia/Knowledge-Intensive Event
Processing
• Support Services for Multimedia Data
• Placement of Multimedia Data and
Workloads
• Adaptive Training of Classifiers
• Complex Multimedia Event Processing
Trusted Data Sharing
• Trusted Platforms
• Usage Control
• Personal/ Industrial Dataspaces
Ecosystem Governance and Economic
Models
• Decentralised Data Governance
• Economic Models
Incremental Intelligent Systems
Engineering Cognitive Adaptability
• Pay-as-you-go Systems
• Cognitive Adaptability
Towards Human-centric Systems
• Explainable Artificial Intelligence
and Data Provenance
• Human-in-the-loop
Future Research Directions
Internet of Multimedia Things (IoMT)
Overview
Multimodal Event Processing
• Shift from Structure to Unstructured
• Enabling Intelligent Systems with Real-
time Multimodal Data
Multimodal Data is a game changer
for Smart Environments….
47
• Multimodal Data Streams
• Structured
• Video
• Audio
• Rich-Content Processing
• Larger data volumes
• Larger Content-space
• Content Extraction Costs
• Edge and Resources
• Computational Intensive
• Network Intensive
Person
Person
Vest
Vest
Hat
Hat
Temp
Wind
Speed
Lux
Site
Structured Sensor Streams Unstructured Sensor Streams
occupant
Left/right
wearing
wearing
wearing
wearing
occupant
has
has
has
Real-time Health and Safety Monitoring
Queries
§ Is everyone wearing
PPE/hardhat?
§ Are there any visitors?
§ Is it a safe working
temperature?
§ Is smoke detected?
§ Is the wind speed
safe?
§ Is there any unsafe
behaviour?
Neuro Symbolic
Gnosis: Neuro-Symbolic Event Processing
Camera
Sensor
Query 1
IoMT Sources IoMT Applications
Camera
Camera
Sensor
Sensor
…
…
Query 2
Query 3
Sound
Sound
Sound
Complex Event Matcher
Single Event Matcher
History Rules
Multimedia Flows
Structured Flows
Multimodal Event Processing Language
Yadav, P. et al. (2021) ‘Query-Driven Video Event Processing for
the Internet of Multimedia Things (Demo)’, Proceedings of the
VLDB Endowment, 14(12), pp. 2847–2850.
Data Policy
“The future is already here –
it’s just not evenly distributed.” William Gibson
(Open) Data is Key to AI
“The world’s most valuable resource is
no longer oil, but data. The data
economy demands a new approach to
antitrust rules”
The Economist
…startups and established firms that are
just beginning to use AI need access to
data in order to train their AI systems.
Difficulty in accessing the necessary data
can create a barrier to entry, potentially
reducing competition and innovation. -
Forbes
From Open Data to …….
Public Digital Infrastructures
Forward-thinking societies
will see the provision of
digital infrastructure
(including data platforms) as
a shared societal service in
the same way as water,
sanitation, and healthcare.
54
Over
100
million
A European strategy for data
European Strategy for Data
Data can flow within the
EU and across sectors
European rules and values
are fully respected
Rules for access and use of data are
fair, practical and clear & clear data
governance mechanisms are in place
A common European data space, a single market for data
Availability of high quality data
to create and innovate
Health
Industrial &
Manufacturing Agriculture Culture Mobility Green Deal Security
Cloud Federation, common European data spaces and AI
Public
Administration
• Driven by stakeholders
• Rich pool of data of varying degree of openness
• Sectoral data governance (contracts, licenses,
access rights, usage rights)
• Technical tools for data pooling and sharing
High Value
Datasets
From
public
sector
AI Testing and
Experimentation Facilities
AI on demand platform
IaaS (Infrastructure as a Service)
Servers, computing, OS, storage, network
PaaS (Platforms as a Service)
Smart Interoperability Middleware
SaaS (Software as a Service)
Software, ERP, CRM, data analytics
Edge
Infrastructure
& Services
High-
Performance
Computing
Federation of Cloud & HPC Infrastructure & Services
Cloud stack management and multi-cloud / hybrid cloud, cloud governance
Marketplace for Cloud to Edge based Services
Cloud services meeting high requirements for data protection, security, portability, interoperability, energy efficiency
Media
Boosting the Adoption
of AI in Europe
Towards a European-Governed
Data Sharing Space
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent Systems
http://dataspaces.info 62
The future is already here –
it’s just not
……..WE need to evenly distribute it
1 of 63

Recommended

Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ... by
Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...
Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...Edward Curry
7.3K views40 slides
Crowdsourcing Approaches to Big Data Curation - Rio Big Data Meetup by
Crowdsourcing Approaches to Big Data Curation - Rio Big Data MeetupCrowdsourcing Approaches to Big Data Curation - Rio Big Data Meetup
Crowdsourcing Approaches to Big Data Curation - Rio Big Data MeetupEdward Curry
4.3K views106 slides
The impact of Big Data on next generation of smart cities by
The impact of Big Data on next generation of smart citiesThe impact of Big Data on next generation of smart cities
The impact of Big Data on next generation of smart citiesPayamBarnaghi
1.7K views53 slides
ISWC 2016 Tutorial: Semantic Web of Things M3 framework & FIESTA-IoT EU project by
ISWC 2016 Tutorial: Semantic Web of Things  M3 framework & FIESTA-IoT EU projectISWC 2016 Tutorial: Semantic Web of Things  M3 framework & FIESTA-IoT EU project
ISWC 2016 Tutorial: Semantic Web of Things M3 framework & FIESTA-IoT EU projectFIESTA-IoT
518 views23 slides
Memory Connected by
Memory ConnectedMemory Connected
Memory ConnectedLi Ding
1.3K views11 slides
BD2K and the Commons : ELIXR All Hands by
BD2K and the Commons : ELIXR All Hands BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands Vivien Bonazzi
260 views60 slides

More Related Content

What's hot

Crowdsourcing Approaches for Smart City Open Data Management by
Crowdsourcing Approaches for Smart City Open Data ManagementCrowdsourcing Approaches for Smart City Open Data Management
Crowdsourcing Approaches for Smart City Open Data ManagementEdward Curry
4.3K views30 slides
Linked Open Government Data: What’s Next? by
Linked Open Government Data:  What’s Next?Linked Open Government Data:  What’s Next?
Linked Open Government Data: What’s Next?Li Ding
907 views34 slides
SLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing by
SLUA: Towards Semantic Linking of Users with Actions in CrowdsourcingSLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing
SLUA: Towards Semantic Linking of Users with Actions in CrowdsourcingEdward Curry
8.3K views22 slides
Querying Heterogeneous Datasets on the Linked Data Web by
Querying Heterogeneous Datasets on the Linked Data WebQuerying Heterogeneous Datasets on the Linked Data Web
Querying Heterogeneous Datasets on the Linked Data WebEdward Curry
6.9K views22 slides
Research issues in the big data and its Challenges by
Research issues in the big data and its ChallengesResearch issues in the big data and its Challenges
Research issues in the big data and its ChallengesKathirvel Ayyaswamy
1.4K views15 slides
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie... by
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...European Data Forum
1.1K views14 slides

What's hot(20)

Crowdsourcing Approaches for Smart City Open Data Management by Edward Curry
Crowdsourcing Approaches for Smart City Open Data ManagementCrowdsourcing Approaches for Smart City Open Data Management
Crowdsourcing Approaches for Smart City Open Data Management
Edward Curry4.3K views
Linked Open Government Data: What’s Next? by Li Ding
Linked Open Government Data:  What’s Next?Linked Open Government Data:  What’s Next?
Linked Open Government Data: What’s Next?
Li Ding907 views
SLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing by Edward Curry
SLUA: Towards Semantic Linking of Users with Actions in CrowdsourcingSLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing
SLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing
Edward Curry8.3K views
Querying Heterogeneous Datasets on the Linked Data Web by Edward Curry
Querying Heterogeneous Datasets on the Linked Data WebQuerying Heterogeneous Datasets on the Linked Data Web
Querying Heterogeneous Datasets on the Linked Data Web
Edward Curry6.9K views
Research issues in the big data and its Challenges by Kathirvel Ayyaswamy
Research issues in the big data and its ChallengesResearch issues in the big data and its Challenges
Research issues in the big data and its Challenges
Kathirvel Ayyaswamy1.4K views
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie... by European Data Forum
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...
European Data Forum1.1K views
Stanford DeepDive Framework by Ran Zhang
Stanford DeepDive FrameworkStanford DeepDive Framework
Stanford DeepDive Framework
Ran Zhang1K views
Graphs in Government by Neo4j
Graphs in GovernmentGraphs in Government
Graphs in Government
Neo4j309 views
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen... by Sirris
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris337 views
Key Technology Trends for Big Data in Europe by Edward Curry
Key Technology Trends for Big Data in EuropeKey Technology Trends for Big Data in Europe
Key Technology Trends for Big Data in Europe
Edward Curry4.6K views
Linked Water Data For Water Information Management by Edward Curry
Linked Water Data For Water Information ManagementLinked Water Data For Water Information Management
Linked Water Data For Water Information Management
Edward Curry4.3K views
Why should semantic technologies pay more attention to privacy... and vice-ve... by Mathieu d'Aquin
Why should semantic technologies pay more attention to privacy... and vice-ve...Why should semantic technologies pay more attention to privacy... and vice-ve...
Why should semantic technologies pay more attention to privacy... and vice-ve...
Mathieu d'Aquin1.2K views
Linked Building (Energy) Data by Edward Curry
Linked Building (Energy) DataLinked Building (Energy) Data
Linked Building (Energy) Data
Edward Curry8.8K views
EDF2013: Invited talk Florian Bauer: Unleashing climate and energy knowledge ... by European Data Forum
EDF2013: Invited talk Florian Bauer: Unleashing climate and energy knowledge ...EDF2013: Invited talk Florian Bauer: Unleashing climate and energy knowledge ...
EDF2013: Invited talk Florian Bauer: Unleashing climate and energy knowledge ...
European Data Forum1.2K views
Denver's Open Data Initiative by Allan Glen
Denver's Open Data InitiativeDenver's Open Data Initiative
Denver's Open Data Initiative
Allan Glen2.4K views
State of Florida Neo4J Graph Briefing - Keynote by Neo4j
State of Florida Neo4J Graph Briefing - KeynoteState of Florida Neo4J Graph Briefing - Keynote
State of Florida Neo4J Graph Briefing - Keynote
Neo4j93 views
GENI Engineering Conference -- Ian Foster by Ian Foster
GENI Engineering Conference -- Ian FosterGENI Engineering Conference -- Ian Foster
GENI Engineering Conference -- Ian Foster
Ian Foster1.4K views
Briefing on US EPA Open Data Strategy using a Linked Data Approach by 3 Round Stones
Briefing on US EPA Open Data Strategy using a Linked Data ApproachBriefing on US EPA Open Data Strategy using a Linked Data Approach
Briefing on US EPA Open Data Strategy using a Linked Data Approach
3 Round Stones1K views

Similar to From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent Systems

From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S... by
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...Edward Curry
1.4K views34 slides
Toward universal information access on the digital object cloud by
Toward universal information access on the digital object cloudToward universal information access on the digital object cloud
Toward universal information access on the digital object cloudNational Institute of Informatics
233 views19 slides
Dynamic Data Analytics for the Internet of Things: Challenges and Opportunities by
Dynamic Data Analytics for the Internet of Things: Challenges and OpportunitiesDynamic Data Analytics for the Internet of Things: Challenges and Opportunities
Dynamic Data Analytics for the Internet of Things: Challenges and OpportunitiesPayamBarnaghi
2.4K views23 slides
51 Use Cases and implications for HPC & Apache Big Data Stack by
51 Use Cases and implications for HPC & Apache Big Data Stack51 Use Cases and implications for HPC & Apache Big Data Stack
51 Use Cases and implications for HPC & Apache Big Data StackGeoffrey Fox
2.5K views8 slides
LIS 653 fall 2013 final project posters by
LIS 653 fall 2013 final project postersLIS 653 fall 2013 final project posters
LIS 653 fall 2013 final project postersPrattSILS
635 views6 slides
How to clean data less through Linked (Open Data) approach? by
How to clean data less through Linked (Open Data) approach?How to clean data less through Linked (Open Data) approach?
How to clean data less through Linked (Open Data) approach?andrea huang
1.8K views23 slides

Similar to From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent Systems(20)

From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S... by Edward Curry
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
Edward Curry1.4K views
Dynamic Data Analytics for the Internet of Things: Challenges and Opportunities by PayamBarnaghi
Dynamic Data Analytics for the Internet of Things: Challenges and OpportunitiesDynamic Data Analytics for the Internet of Things: Challenges and Opportunities
Dynamic Data Analytics for the Internet of Things: Challenges and Opportunities
PayamBarnaghi2.4K views
51 Use Cases and implications for HPC & Apache Big Data Stack by Geoffrey Fox
51 Use Cases and implications for HPC & Apache Big Data Stack51 Use Cases and implications for HPC & Apache Big Data Stack
51 Use Cases and implications for HPC & Apache Big Data Stack
Geoffrey Fox2.5K views
LIS 653 fall 2013 final project posters by PrattSILS
LIS 653 fall 2013 final project postersLIS 653 fall 2013 final project posters
LIS 653 fall 2013 final project posters
PrattSILS635 views
How to clean data less through Linked (Open Data) approach? by andrea huang
How to clean data less through Linked (Open Data) approach?How to clean data less through Linked (Open Data) approach?
How to clean data less through Linked (Open Data) approach?
andrea huang1.8K views
Relationship Web: Trailblazing, Analytics and Computing for Human Experience by Amit Sheth
Relationship Web: Trailblazing, Analytics and Computing for Human ExperienceRelationship Web: Trailblazing, Analytics and Computing for Human Experience
Relationship Web: Trailblazing, Analytics and Computing for Human Experience
Amit Sheth1.5K views
Data commons bonazzi bd2 k fundamentals of science feb 2017 by Vivien Bonazzi
Data commons bonazzi   bd2 k fundamentals of science feb 2017Data commons bonazzi   bd2 k fundamentals of science feb 2017
Data commons bonazzi bd2 k fundamentals of science feb 2017
Vivien Bonazzi155 views
Linked Data and Users in Library - Does the library communicate efficiently? by Hansung University
Linked Data and Users in Library - Does the library communicate efficiently?Linked Data and Users in Library - Does the library communicate efficiently?
Linked Data and Users in Library - Does the library communicate efficiently?
Hansung University971 views
Managing Metadata for Science and Technology Studies: the RISIS case by Rinke Hoekstra
Managing Metadata for Science and Technology Studies: the RISIS caseManaging Metadata for Science and Technology Studies: the RISIS case
Managing Metadata for Science and Technology Studies: the RISIS case
Rinke Hoekstra498 views
Real-World Data Challenges: Moving Towards Richer Data Ecosystems by Anita de Waard
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Anita de Waard593 views
EMBL Australian Bioinformatics Resource AHM - Data Commons by Vivien Bonazzi
EMBL Australian Bioinformatics Resource AHM   - Data CommonsEMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM - Data Commons
Vivien Bonazzi405 views
Session 1.4 a distributed network of heritage information by semanticsconference
Session 1.4   a distributed network of heritage informationSession 1.4   a distributed network of heritage information
Session 1.4 a distributed network of heritage information
A distributed network of digital heritage information - Semantics Amsterdam by Enno Meijers
A distributed network of digital heritage information - Semantics AmsterdamA distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics Amsterdam
Enno Meijers517 views
Thoughts on Knowledge Graphs & Deeper Provenance by Paul Groth
Thoughts on Knowledge Graphs  & Deeper ProvenanceThoughts on Knowledge Graphs  & Deeper Provenance
Thoughts on Knowledge Graphs & Deeper Provenance
Paul Groth575 views
A distributed network of digital heritage information - Unesco/NDL India by Enno Meijers
A distributed network of digital heritage information - Unesco/NDL IndiaA distributed network of digital heritage information - Unesco/NDL India
A distributed network of digital heritage information - Unesco/NDL India
Enno Meijers488 views

Recently uploaded

[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation by
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented GenerationDataScienceConferenc1
15 views29 slides
SAP-TCodes.pdf by
SAP-TCodes.pdfSAP-TCodes.pdf
SAP-TCodes.pdfmustafaghulam8181
10 views285 slides
UNEP FI CRS Climate Risk Results.pptx by
UNEP FI CRS Climate Risk Results.pptxUNEP FI CRS Climate Risk Results.pptx
UNEP FI CRS Climate Risk Results.pptxpekka28
11 views51 slides
Data Journeys Hard Talk workshop final.pptx by
Data Journeys Hard Talk workshop final.pptxData Journeys Hard Talk workshop final.pptx
Data Journeys Hard Talk workshop final.pptxinfo828217
10 views18 slides
[DSC Europe 23] Aleksandar Tomcic - Adversarial Attacks by
[DSC Europe 23] Aleksandar Tomcic - Adversarial Attacks[DSC Europe 23] Aleksandar Tomcic - Adversarial Attacks
[DSC Europe 23] Aleksandar Tomcic - Adversarial AttacksDataScienceConferenc1
5 views20 slides
CRIJ4385_Death Penalty_F23.pptx by
CRIJ4385_Death Penalty_F23.pptxCRIJ4385_Death Penalty_F23.pptx
CRIJ4385_Death Penalty_F23.pptxyvettemm100
7 views24 slides

Recently uploaded(20)

[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation by DataScienceConferenc1
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation
UNEP FI CRS Climate Risk Results.pptx by pekka28
UNEP FI CRS Climate Risk Results.pptxUNEP FI CRS Climate Risk Results.pptx
UNEP FI CRS Climate Risk Results.pptx
pekka2811 views
Data Journeys Hard Talk workshop final.pptx by info828217
Data Journeys Hard Talk workshop final.pptxData Journeys Hard Talk workshop final.pptx
Data Journeys Hard Talk workshop final.pptx
info82821710 views
CRIJ4385_Death Penalty_F23.pptx by yvettemm100
CRIJ4385_Death Penalty_F23.pptxCRIJ4385_Death Penalty_F23.pptx
CRIJ4385_Death Penalty_F23.pptx
yvettemm1007 views
[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ... by DataScienceConferenc1
[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ...[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ...
[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ...
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M... by DataScienceConferenc1
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
OECD-Persol Holdings Workshop on Advancing Employee Well-being in Business an... by StatsCommunications
OECD-Persol Holdings Workshop on Advancing Employee Well-being in Business an...OECD-Persol Holdings Workshop on Advancing Employee Well-being in Business an...
OECD-Persol Holdings Workshop on Advancing Employee Well-being in Business an...
SUPER STORE SQL PROJECT.pptx by khan888620
SUPER STORE SQL PROJECT.pptxSUPER STORE SQL PROJECT.pptx
SUPER STORE SQL PROJECT.pptx
khan88862013 views
Data about the sector workshop by info828217
Data about the sector workshopData about the sector workshop
Data about the sector workshop
info82821715 views
PRIVACY AWRE PERSONAL DATA STORAGE by antony420421
PRIVACY AWRE PERSONAL DATA STORAGEPRIVACY AWRE PERSONAL DATA STORAGE
PRIVACY AWRE PERSONAL DATA STORAGE
antony4204215 views
CRM stick or twist workshop by info828217
CRM stick or twist workshopCRM stick or twist workshop
CRM stick or twist workshop
info82821711 views
[DSC Europe 23][AI:CSI] Dragan Pleskonjic - AI Impact on Cybersecurity and P... by DataScienceConferenc1
[DSC Europe 23][AI:CSI]  Dragan Pleskonjic - AI Impact on Cybersecurity and P...[DSC Europe 23][AI:CSI]  Dragan Pleskonjic - AI Impact on Cybersecurity and P...
[DSC Europe 23][AI:CSI] Dragan Pleskonjic - AI Impact on Cybersecurity and P...
[DSC Europe 23] Stefan Mrsic_Goran Savic - Evolving Technology Excellence.pptx by DataScienceConferenc1
[DSC Europe 23] Stefan Mrsic_Goran Savic - Evolving Technology Excellence.pptx[DSC Europe 23] Stefan Mrsic_Goran Savic - Evolving Technology Excellence.pptx
[DSC Europe 23] Stefan Mrsic_Goran Savic - Evolving Technology Excellence.pptx

From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent Systems

  • 1. From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent Systems Edward Curry, Insight SFI Research Centre for Data Analytics edward.curry@nuigalway.ie LDAC2021 - 9th Linked Data in Architecture and Construction Workshop (11 - 13 October 2021)
  • 2. Overview • Part I: Data Ecosystems for Intelligent Systems • Part II: Real-time Linked Dataspaces • Part III: Final Thoughts on Research Directions and Data Policy
  • 3. Contents Part I: Fundamentals and Concepts Part II: Data Support Services Part III: Stream and Event Processing Services Part IV: Intelligent Systems and Applications Part V: Future Directions Team http://dataspaces.info Web: dataspaces.info A Team Effort: Open Access Book
  • 4. Part I: Data Ecosystems for Intelligent Systems
  • 10. Real World Digital World Sensors Orient Decide Actuators Act Observe Physical Twin (Asset-centric) Digital Twin (System-centric) Digital Twins http://dataspaces.info 10
  • 11. 11 Data-driven Intelligence will be drive by industrial, personal and open data Connected Intelligent Systems
  • 12. Distributed and Decentralised Data Ecosystems Key Barrier: Interoperability – Protocols and Semantics 12 Curry, E. and Sheth, A. (2018) ‘Next-Generation Smart Environments: From System of Systems to Data Ecosystems’, IEEE Intelligent Systems, 33(3), pp. 69–76. doi: 10.1109/MIS.2018.033001418.
  • 13. Ecosystem community of organisms and their environment interacting as a system Tansley (1935) Lindeman (1942),…
  • 14. Data Ecosystem socio-technical system extracting value from data value chains by interacting organisations and individuals oriented to business and societal purposes marketplace, competition, collaboration Curry, E. (2016) ‘The Big Data Value Chain: Definitions, Concepts, and Theoretical Approaches’, in Cavanillas, J. M., Curry, E., and Wahlster, W. (eds) New Horizons for a Data-Driven Economy..
  • 16. The “gold mining” metaphor applied to data processing Transforming Transport has made use of a total of 164 terabytes of data from 160 different data sources
  • 17. Maturity stages of data assets and related “sieves”
  • 19. Traditional Approaches to Data Integration Low High High Frequency of use Cost of administration & semantic integration using traditional approaches Popularity / Use Number of data sources, entities, attributes http://dataspaces.info The Long Tail of Data
  • 20. 20 • Heterogeneous, complex and large-scale data • Very-large and dynamic “schemas” • Open Environments: distributed, decentralised decoupled data sources, anonymous users, multi- domain, lack of global order of information flow • Multiple perspectives (conceptualisations) of the reality. • Ambiguity, vagueness, inconsistency. Content Space: From Rigid Schemas to Schema-less..... ...and Fundamental Decentralisation
  • 21. The Red Queen Hypothesis “It takes all the running you can do, to keep in the same place. If you want to get somewhere else, you must run at least twice as fast as that!” Lewis Carroll's Through the Looking-Glass
  • 22. Part II: Real-time Linked Dataspaces
  • 23. Data Platforms will Fuel AI-Driven Decision-Making Data Generation and Analysis (including IoT) Data Platforms (Access and Portability) AI and Decision Platforms
  • 24. IoT-Enablement Layer 1 - Communication and Sensing IPv6, Wi-Fi, RFID, CoAP, AVB, etc. Layer 3 - Data Schema, Entities, Catalog, Sharing, Access/Control, etc. Layer 4 – Intelligent Apps, Analytics, and Users Datasets Things / Sensors Contextual Data Sources (including legacy systems) Predictive Analytics Situation Awareness Decision Support Digital Twin Machine Learning Users Layer 2 - Middleware Peer-to-Peer, Events, Pub/Sub, SOA, SDN, etc. A Data Sharing Layer is needed…. Adapted from: L. Atzori, A. Iera, and G. Morabito, “The Internet of Things: A survey,” Comput. Networks, vol. 54, no. 15, pp. 2787–2805, Oct. 2010. http://dataspaces.info
  • 25. Human Interactivity: Web Search From Structure to Knowledge Graph to Search ~1995 ~100K Websites Exact Results Human Curated ~1998 ~2.4M Websites Approximate Results Computed ~2012 ~700M Approximate Results + Exact Computed + Crowd 25
  • 26. Cost of Data Management Solutions http://dataspaces.info Administrative Proximity – Close vs. Loose Coordination – Assumptions concerning guarantees such as data, access, quality, and consistency, Semantic Integration – Degree to which data schemas are matched up (types, attributes, and names). 26 Halevy, A., Franklin, M. and Maier, D. 2006. Principles of dataspace systems. 25th ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems - PODS ’06 (New York, New York, USA, 2006), 1–9.
  • 27. Approximate and Best Effort Approaches Low High High Frequency of use Approximate & best-effort approaches Cost of administration & semantic integration using traditional approaches Popularity / Use Number of data sources, entities, attributes http://dataspaces.info The Long Tail of Data
  • 28. Dataspace “Dataspaces are not a data integration approach; rather, they are more of a data co-existence approach. The goal of dataspace support is to provide base functionality over all data sources, regardless of how integrated they are.” (Halevy, A., Franklin, M. and Maier, D. 2006.)
  • 29. Enabling platform for data management for intelligent systems within smart environments Combines the pay-as-you-go paradigm of dataspaces, linked data, and knowledge graphs with entity-centric real-time queries Real-time Linked Dataspaces 29 Principles: (adapted from by Halevy et al.) • Must deal with many different formats of streams and events. • Does not subsume the stream and event processing engines; they still provide individual access via their native interfaces. • Queries in are provided on a best-effort and approximate basis. • Must provide pathways to improve the integration among the data sources, including streams and events, in a pay-as-you-go fashion.
  • 30. Key Challenge http://dataspaces.info Investigate techniques to enable approximate and best-effort support services for loose administrative proximity and semantic integration Incremental support services • Catalog • entity management • query and search • data discovery • human tasks • quality of service • complex event processing • streams dissemination • approximate semantic event matching
  • 31. • • Sahlgren, 2013 Formal World Real World Baroni et al. 2013
  • 32. • Distributional hypothesis: the context surrounding a given word in a text provides relevant information about its meaning. – "a word is characterized by the company it keeps" was popularized by Firth in the 1950s • Simplified semantic model: Associational and quantitative. 32 A wife is a female partner in a marriage. The term "wife" seems to be a close term to bride, the latter is a female participant in a wedding ceremony, while a wife is a married woman during her marriage. ... Distributional Semantic Model 32
  • 33. c1 child husband spouse cn c2 function (number of times that the words occur in c1) 0.7 0.5 Distributional Semantic Model Distributional semantic model: Semantic statistical knowledge extracted from large Web corpora Works as a semantic ranking function E.g. esa(room, building)= 0.099 E.g. esa(room, car)= 0.009 θ Gabrilovich, E.; Markovitch, S.(2007). Computing semantic relatedness using Wikipedia-based Explicit Semantic Analysis. Proc. 20th Int'l Joint Conf. on Artificial Intelligence (IJCAI). 33
  • 34. Schema-Agnostic Natural Language Queries NobelPrizeWinner A Semantic Gap Marie Curie :type Possible Data Representations Information Need: Who are the children of Marie Curie married to? Marie Curie 2 B C Marie Curie Henry R. Labouisse Ève Curie Irène Joliot-Curie :motherOf :motherOf :wifeOf :type :numberOfKids Frédéric Joliot-Curie :wifeOf Frédéric Joliot-Curie Irène Joliot-Curie :Spouse :Child Henry R. Labouisse Ève Curie :Spouse :Child Scientist Freitas, A. and Curry, E. (2014) ‘Natural Language Queries over Heterogeneous Linked Data Graphs: A Distributional-Compositional Semantics Approach’, in 18th International Conference on Intelligent User Interfaces (IUI’14): ACM
  • 35. Marie Curie children married to Person :Marie Curie Query: Linked Data: :Ève Curie :motherOf :Henry R. Labouisse :wifeOf Distributional Semantic Search Information Need: Who are the children of Marie Curie married to?
  • 36. Query Planner Ƭ-Space Large-scale unstructured data Commonsense knowledge Database Distributional semantics Core semantic approximation & composition operations Query Analysis Query Query Features Query Plan Treo: Question Answering over Linked Data
  • 37. Challenges • Heterogeneity in Event Semantics (000s schema) • Heterogeneity in processing Rules (000s of rule tied to schema) • Manually Implemented Approximate Semantic Event Matcher • Distributional Event Semantics • Enables pay-as-you-go event matching for data streams • Replaced 48,000 exact rules with 100 approximate rules with around 85% accuracy Approximate Semantic Matching of Streams 37 Hasan, S. and Curry, E. (2014) ‘Approximate Semantic Matching of Events for the Internet of Things’, ACM Transactions on Internet Technology, 14(1).
  • 38. Intelligent Systems and Applications http://dataspaces.info L OCATION Airport Office Home Mixed Use School LINATE AIRPORT, MILAN, ITALY INSIGHT, GALWAY, IRELAND HOUSES, THERMI, GREECE ENGINEERING, NUI GALWAY COLÁISTE NA COIRIBE, IRELAND T ARGET U SER S • Corporate users • ~9.5 million passengers • Utilities management • Maintenance staff • Environmental managers • 130 staff • Office consumers • Operations managers • Utility providers • Building managers • Domestic consumers (adults, young adults and children) • Utility providers • Mixed/Public consumers • Building managers • 100 staff • 1000 students (ages 18 to 24) • Mixed/Public consumers • School management • Maintenance staff • 500 students (ages 12 to 18) • 40 teachers I NFRASTRUCTURE • Safety critical • 10 km water network • Multiple buildings • Water meters • Energy meters • Legacy systems • 2190 m2 space • 22 offices + 160 open plan spaces • Conference room • 4 meeting rooms • 3 kitchens • Data centre • 30 person café • Energy meters • 10 households • Typical variety of domestic settings including kitchen, showers, baths, living room, bedrooms, and garden • Water meters • Water meters • Energy meters • Rainwater harvesting • Café • Weather station • Wet labs • Showers • Water meters • Energy meters • Rainwater harvesting Smart Water and Energy Management Pilots
  • 39. Smart School CnaC School in Galway, Ireland Mixed Use Galway, Ireland Building Manager University Students Smart Airport Milan Linate, Italy Corporate Staff Passengers Smart Homes Municipality of Thermi, Greece Smart Office Galway, Ireland Families Operational Staff Researchers Application Developers Teaching Staff School Students Data Scientist Need to target different Target Users
  • 40. IoT-enabled Digital Twins and Intelligent Applications Real-time Linked Dataspace Datasets Things / Sensors Entity Management Service Catalog & Access Control Service Personal Dashboard Public Dashboards Decision Analytics and Machine Learning Notifications Apps Alerts Orient Decide Act Search & Query Service Entity-Centric Real-Time Query Service Complex Event Processing Service Digital Twin CEP D Human Task Service Human Task Service Observe http://dataspaces.info “OODA” Loop
  • 41. Interactive Public Displays Alerts and Notifications Personalised Dashboards Example Applications
  • 43. Experiences and Lessons Learnt from Dataspaces spaces.info • Developer education need for stream processing and approximate results • Incremental data management can support agile software development • Build the business case for data-driven innovation • Integration with legacy data is a significant cost in smart environments • The 5 star pay-as-you-go model simplified communication with non- technical users • A secure canonical source for entity data simplifies application development • Data quality with things and sensors is challenging in an operational environment • Working with three pipelines adds overhead (LAMBDA + Entity Layer) 43
  • 44. Part III: Final Thoughts on Research Directions and Data Policy
  • 45. http://dataspaces.info 45 Large-scale Decentralised Support Services • Enhanced Supported Services • Scaling Entity Management • Maintenance and Operation Cost Multimedia/Knowledge-Intensive Event Processing • Support Services for Multimedia Data • Placement of Multimedia Data and Workloads • Adaptive Training of Classifiers • Complex Multimedia Event Processing Trusted Data Sharing • Trusted Platforms • Usage Control • Personal/ Industrial Dataspaces Ecosystem Governance and Economic Models • Decentralised Data Governance • Economic Models Incremental Intelligent Systems Engineering Cognitive Adaptability • Pay-as-you-go Systems • Cognitive Adaptability Towards Human-centric Systems • Explainable Artificial Intelligence and Data Provenance • Human-in-the-loop Future Research Directions
  • 46. Internet of Multimedia Things (IoMT)
  • 47. Overview Multimodal Event Processing • Shift from Structure to Unstructured • Enabling Intelligent Systems with Real- time Multimodal Data Multimodal Data is a game changer for Smart Environments…. 47 • Multimodal Data Streams • Structured • Video • Audio • Rich-Content Processing • Larger data volumes • Larger Content-space • Content Extraction Costs • Edge and Resources • Computational Intensive • Network Intensive
  • 48. Person Person Vest Vest Hat Hat Temp Wind Speed Lux Site Structured Sensor Streams Unstructured Sensor Streams occupant Left/right wearing wearing wearing wearing occupant has has has Real-time Health and Safety Monitoring Queries § Is everyone wearing PPE/hardhat? § Are there any visitors? § Is it a safe working temperature? § Is smoke detected? § Is the wind speed safe? § Is there any unsafe behaviour?
  • 49. Neuro Symbolic Gnosis: Neuro-Symbolic Event Processing Camera Sensor Query 1 IoMT Sources IoMT Applications Camera Camera Sensor Sensor … … Query 2 Query 3 Sound Sound Sound Complex Event Matcher Single Event Matcher History Rules Multimedia Flows Structured Flows
  • 50. Multimodal Event Processing Language Yadav, P. et al. (2021) ‘Query-Driven Video Event Processing for the Internet of Multimedia Things (Demo)’, Proceedings of the VLDB Endowment, 14(12), pp. 2847–2850.
  • 52. “The future is already here – it’s just not evenly distributed.” William Gibson
  • 53. (Open) Data is Key to AI “The world’s most valuable resource is no longer oil, but data. The data economy demands a new approach to antitrust rules” The Economist …startups and established firms that are just beginning to use AI need access to data in order to train their AI systems. Difficulty in accessing the necessary data can create a barrier to entry, potentially reducing competition and innovation. - Forbes
  • 54. From Open Data to ……. Public Digital Infrastructures Forward-thinking societies will see the provision of digital infrastructure (including data platforms) as a shared societal service in the same way as water, sanitation, and healthcare. 54
  • 57. European Strategy for Data Data can flow within the EU and across sectors European rules and values are fully respected Rules for access and use of data are fair, practical and clear & clear data governance mechanisms are in place A common European data space, a single market for data Availability of high quality data to create and innovate
  • 58. Health Industrial & Manufacturing Agriculture Culture Mobility Green Deal Security Cloud Federation, common European data spaces and AI Public Administration • Driven by stakeholders • Rich pool of data of varying degree of openness • Sectoral data governance (contracts, licenses, access rights, usage rights) • Technical tools for data pooling and sharing High Value Datasets From public sector AI Testing and Experimentation Facilities AI on demand platform IaaS (Infrastructure as a Service) Servers, computing, OS, storage, network PaaS (Platforms as a Service) Smart Interoperability Middleware SaaS (Software as a Service) Software, ERP, CRM, data analytics Edge Infrastructure & Services High- Performance Computing Federation of Cloud & HPC Infrastructure & Services Cloud stack management and multi-cloud / hybrid cloud, cloud governance Marketplace for Cloud to Edge based Services Cloud services meeting high requirements for data protection, security, portability, interoperability, energy efficiency Media
  • 59. Boosting the Adoption of AI in Europe
  • 63. The future is already here – it’s just not ……..WE need to evenly distribute it