STREAM REASONING
AN APPROACH TO BLEND AI AND STREAM PROCESSING
Emanuele Della Valle

Politecnico di Milano

http://emanueledellavalle.org

@manudellavalle
Milano - 17.10.2019
DATA ENGINEERING AND DATA SCIENCE TECHS
CAN TAME VOLUME
Emanuele Della Valle - http://emanueledellavalle.org - @manudellavalle
DATA ENGINEERING AND DATA SCIENCE TECHS
CAN TAME VELOCITY
DATA ENGINEERING AND DATA SCIENCE TECHS
CANNOT TAME VOLUME AND VELOCITY SIMULTANEOUSLY
[Figure: data size (KB to ZB) against time scale (months to ms)]
DATA ENGINEERING AND DATA SCIENCE TECHS
CAN TAME VARIETY USING AI
DATA ENGINEERING AND DATA SCIENCE TECHS
VARIETY MAKES PROBLEMS HARDER
[Figure: data size (KB to ZB) against time scale (months to ms), annotated with VARIETY]
STILL THERE ARE USERS WHOSE DECISIONS NEED TO TAME ALL Vs
STILL THERE ARE USERS WHOSE DECISIONS NEED TO TAME ALL Vs
OFF-SHORE OIL OPERATIONS
‣ When sensors on a drilling pipe in an oil-rig indicate that it is about to get
stuck, how long — according to historical records — can I keep drilling?
‣ 400,000 sensors from tens of different producers
‣ 10,000 observations per second, many outside operational ranges
STILL THERE ARE USERS WHOSE DECISIONS NEED TO TAME ALL Vs
REQUIREMENT ANALYSIS
A system able to answer those queries must be able to
▸ handle massive datasets x
▸ process data streams on the fly x
▸ cope with heterogeneous datasets x
▸ cope with incomplete data x x
▸ cope with noisy data x
▸ provide reactive answers x
(columns: Volume, Velocity, Variety, Veracity)
STILL THERE ARE USERS WHOSE DECISIONS NEED TO TAME ALL Vs
(PARTIAL) SOLUTIONS: STREAM PROCESSING
▸ A paradigmatic change!
[Figure labels: input streams, window, Registered Continuous Query, Dynamic System, streams of answers]
STILL THERE ARE USERS WHOSE DECISIONS NEED TO TAME ALL Vs
STREAM PROCESSING VS. REQUIREMENTS
Requirement SP
massive datasets
data streams
heterogeneous dataset
incomplete data
noisy data
reactive answers
STILL THERE ARE USERS WHOSE DECISIONS NEED TO TAME ALL Vs
AI VS. REQUIREMENTS
Requirement SP AI
massive datasets
data streams
heterogeneous dataset
incomplete data
noisy data
reactive answers
Is it possible to make sense in real time of multiple, heterogeneous, gigantic and inevitably noisy and incomplete data streams in order to support the decision processes of extremely large numbers of concurrent users?
E. Della Valle, S. Ceri, F. van Harmelen & H. Stuckenschmidt, 2010
STILL THERE ARE USERS WHOSE DECISIONS NEED TO TAME ALL Vs
STREAM REASONING RESEARCH QUESTION
[Figure: answer as four (colour, count) pairs with counts 13, 12, 8, 8]
DEDUCTIVE STREAM REASONING
STREAM PROCESSING
time
1-minute-wide window
Which are the top-4
most frequent colours
in the last minute?
Is there a [colour] followed by a [colour] in the last minute?
Yes, many
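The two continuous queries on this slide can be sketched in a few lines of Python. This is a minimal illustration, not the engine used in the talk: the class name `SlidingWindow`, its methods, and the representation of the stream as `(timestamp, colour)` pairs are all assumptions made for the example.

```python
from collections import Counter, deque


class SlidingWindow:
    """Hypothetical time-based sliding window over (timestamp, colour) events."""

    def __init__(self, width_seconds=60.0):
        self.width = width_seconds
        self.items = deque()  # oldest event first

    def push(self, timestamp, colour):
        """Add a new event and evict everything older than the window width."""
        self.items.append((timestamp, colour))
        while self.items and self.items[0][0] <= timestamp - self.width:
            self.items.popleft()

    def top_k(self, k):
        """Which are the top-k most frequent colours in the window?"""
        return Counter(colour for _, colour in self.items).most_common(k)

    def followed_by(self, first, second):
        """Is there a `first` event followed (not necessarily directly) by a `second`?"""
        seen_first = False
        for _, colour in self.items:
            if seen_first and colour == second:
                return True
            if colour == first:
                seen_first = True
        return False


window = SlidingWindow(width_seconds=60.0)
for ts, colour in [(0, "red"), (10, "blue"), (20, "blue"), (50, "green"), (70, "blue")]:
    window.push(ts, colour)

print(window.top_k(1))
print(window.followed_by("green", "blue"))
```

The query is registered once (here, as a method) and re-evaluated as the window slides, which is the paradigm change the earlier stream-processing slide refers to.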
DEDUCTIVE STREAM REASONING
STREAM PROCESSING + SYMBOLIC AI
time
1-minute-wide window
An ontology of colours
[Figure: answer as three (colour, count) pairs with counts 13, 8, 8]
DEDUCTIVE STREAM REASONING
DEDUCTIVE STREAM REASONING
time
1-minute-wide window
Which are the top-2 most
frequent cool colours in
the last minute?
Is there a primary cool
colour followed by a
secondary warm one?
Yes, [colour] followed by [colour].
An ontology of colours
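A rough sketch of how an ontology changes the queries: instead of matching concrete colours, the query names classes such as "cool" or "primary", and class membership is deduced from a subclass hierarchy. The toy ontology below (`SUBCLASS_OF`, including which colours count as warm or cool) and the function names are assumptions for illustration only; the window content is taken as an already-extracted list of colours in arrival order.

```python
from collections import Counter

# Hypothetical toy ontology: each key is a direct subclass of the listed classes.
SUBCLASS_OF = {
    "Blue": {"Primary", "Cool"},
    "Green": {"Secondary", "Cool"},
    "Violet": {"Secondary", "Cool"},
    "Red": {"Primary", "Warm"},
    "Yellow": {"Primary", "Warm"},
    "Orange": {"Secondary", "Warm"},
    "Primary": {"Colour"},
    "Secondary": {"Colour"},
}


def superclasses(cls):
    """Deduce all (transitive) superclasses of a class."""
    found, stack = set(), [cls]
    while stack:
        for parent in SUBCLASS_OF.get(stack.pop(), ()):
            if parent not in found:
                found.add(parent)
                stack.append(parent)
    return found


def is_a(colour, *classes):
    """True if the colour belongs to every one of the given classes."""
    return set(classes) <= superclasses(colour)


def top_k_of_class(window, cls, k):
    """Top-k most frequent colours of a given class in the window."""
    return Counter(c for c in window if is_a(c, cls)).most_common(k)


def followed_by(window, first_classes, second_classes):
    """Is a colour of `first_classes` followed by one of `second_classes`?"""
    seen_first = False
    for colour in window:
        if seen_first and is_a(colour, *second_classes):
            return True
        if is_a(colour, *first_classes):
            seen_first = True
    return False


window = ["Blue", "Red", "Green", "Blue", "Orange", "Blue", "Green"]
print(top_k_of_class(window, "Cool", 2))
print(followed_by(window, ("Primary", "Cool"), ("Secondary", "Warm")))
```

Note that the stream itself never mentions "cool" or "primary"; those answers only become available because the reasoner combines the streaming data with the background knowledge.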
INDUCTIVE STREAM REASONING
THE CONCEPT DRIFT PROBLEM
INDUCTIVE STREAM REASONING
AN EARLY ATTEMPT OF INDUCTIVE STREAM REASONING
D.F. Barbieri, D. Braga, S. Ceri, E. Della Valle, Y. Huang, V. Tresp, A. Rettinger, H. Wermser: Deductive and Inductive Stream Reasoning for Semantic Social Media Analytics. IEEE Intelligent Systems 25(6): 32-41 (2010)
• How can we determine the optimal size of the window?
INDUCTIVE STREAM REASONING
ADAPTIVE SLIDING WINDOW (ADWIN)
Bifet, A. and Gavaldà, R., 2009, August. Adaptive learning from evolving data streams. In International Symposium on Intelligent Data Analysis (pp. 249-260). Springer
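The idea of an adaptive window can be sketched as follows: keep growing the window while the data looks stationary, and drop the oldest items as soon as some split of the window into an older and a newer part shows means that differ by more than a statistical threshold. The code below is a deliberately simplified sketch and not the bucket-based algorithm from the paper: it stores every item and rechecks all splits, the class name `SimpleAdaptiveWindow` is hypothetical, inputs are assumed to lie in [0, 1], and the threshold is a Hoeffding-style bound with confidence parameter `delta` in the form commonly given for ADWIN.

```python
import math
from collections import deque


class SimpleAdaptiveWindow:
    """Simplified adaptive sliding window for values in [0, 1] (illustrative only)."""

    def __init__(self, delta=0.002):
        self.delta = delta
        self.window = deque()  # oldest value first

    @property
    def width(self):
        return len(self.window)

    @property
    def mean(self):
        return sum(self.window) / len(self.window) if self.window else 0.0

    def _has_change(self):
        """Check every split W = W0 . W1 for a significant difference in means."""
        n = len(self.window)
        if n < 2:
            return False
        total = sum(self.window)
        log_term = math.log(4.0 * n / self.delta)
        head_sum = 0.0
        for i, x in enumerate(self.window, start=1):
            if i == n:
                break
            head_sum += x
            n0, n1 = i, n - i
            m = 1.0 / (1.0 / n0 + 1.0 / n1)  # harmonic-mean-style sample size
            eps_cut = math.sqrt(log_term / (2.0 * m))
            if abs(head_sum / n0 - (total - head_sum) / n1) > eps_cut:
                return True
        return False

    def add(self, x):
        """Insert a value; drop old values while any split signals a change."""
        self.window.append(x)
        shrunk = False
        while self._has_change():
            self.window.popleft()
            shrunk = True
        return shrunk


detector = SimpleAdaptiveWindow()
for value in [0.0] * 300 + [1.0] * 300:  # abrupt drift halfway through
    detector.add(value)
print(detector.width, round(detector.mean, 2))
```

On a stationary stream the window simply keeps growing; after the drift it shrinks until it mostly contains post-drift data, so the window size is chosen by the data rather than fixed in advance.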
INDUCTIVE STREAM REASONING
CONCEPT DRIFT AND STREAMING MACHINE LEARNING
• Hoeffding Adaptive Tree
• Adaptive Random Forest
• Temporally Augmented Classifier
A. Bifet, R. Gavaldà, G. Holmes, B. Pfahringer: Machine Learning for Data Streams: with Practical Examples in MOA. The MIT Press (March 2, 2018)
https://moa.cms.waikato.ac.nz/
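As background for the first learner in the list: Hoeffding trees decide when to split a leaf using the Hoeffding bound, which says how far an observed mean over n examples can be from the true mean with probability 1 - δ. The sketch below only illustrates that bound and the resulting split test; the function names and the example numbers are assumptions, and this is not MOA's API.

```python
import math


def hoeffding_bound(value_range, delta, n):
    """epsilon = sqrt(R^2 * ln(1/delta) / (2n)) for a statistic with range R."""
    return math.sqrt(value_range ** 2 * math.log(1.0 / delta) / (2.0 * n))


def can_split(best_gain, second_gain, value_range, delta, n):
    """Split once the best attribute beats the runner-up by more than epsilon."""
    return best_gain - second_gain > hoeffding_bound(value_range, delta, n)


# Hypothetical gains observed at a leaf after 1000 examples.
print(round(hoeffding_bound(1.0, 1e-7, 1000), 4))
print(can_split(0.30, 0.15, 1.0, 1e-7, 1000))
print(can_split(0.30, 0.25, 1.0, 1e-7, 1000))
```

Because the bound shrinks as n grows, a leaf that cannot split yet may be able to after seeing more of the stream, which is what lets the tree be learned in a single pass.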
STREAM REASONING
DEDUCTIVE + INDUCTIVE STREAM REASONING
time
1-minute-wide window
A better ontology of colours continuously learned from the data
Which are the most
frequent sentiments in
the last minute?
Is there an impulsive,
irritating colour followed
by a happy one?

The better the ontology of colours we use,
the more expressive the queries we can register
STREAM REASONING
1000+ SCIENTIFIC PAPERS IN 12 YEARS
▸ It is possible to extend the Semantic Web stack in order
to represent heterogeneous data streams, continuous
queries, and continuous reasoning tasks
▸ It is possible to optimise continuous querying and
continuous reasoning so as to provide reactive answers
▸ Streaming Machine Learning is starting to show that it is
possible to continuously learn models
STREAM REASONING
STREAM REASONING VS. REQUIREMENTS
Requirement Stream Reasoning
massive datasets
data streams
heterogeneous dataset
incomplete data
noisy data
reactive answers
Legend: not specifically treated so far; treated but not resolved; universally addressed by all studies
STREAM REASONING
MASTER THESIS?
▸ Semantics of stream processing languages
▸ Scaling deductive Stream Reasoning
▸ Stream Reasoning on Kafka
▸ Stream Reasoning on Spark
▸ Advancing Inductive Stream Reasoning
▸ Streaming Graph Machine Learning
▸ Streaming Anomaly Detection
▸ Build the first working inductive and deductive Stream Reasoners
▸ Applications
STREAM REASONING
EVENTUALLY :-P
STREAM REASONING
THANK YOU!
ANY QUESTIONS?
Emanuele Della Valle

Politecnico di Milano

http://emanueledellavalle.org

@manudellavalle
Milano - 17.10.2019

Stream reasoning

  • 1. STREAM REASONING AN APPROACH TO BLEND 
 AI AND STREAM PROCESSING Emanuele Della Valle
 Politecnico di Milano
 http://emanueledellavalle.org
 @manudellavalle Milano - 17.10.2019
  • 2. DATA ENGINEERING AND DATA SCIENCE TECHS CAN TAME VOLUME
  • 3. DATA ENGINEERING AND DATA SCIENCE TECHS CAN TAME VELOCITY
  • 4. DATA ENGINEERING AND DATA SCIENCE TECHS CANNOT TAME VOLUME AND VELOCITY SIMULTANEOUSLY [Chart: size from KB to ZB, time from months to ms]
  • 5. DATA ENGINEERING AND DATA SCIENCE TECHS CAN TAME VARIETY USING AI
  • 6. DATA ENGINEERING AND DATA SCIENCE TECHS VARIETY MAKES PROBLEMS HARDER [Chart: size from KB to ZB, time from months to ms, plus VARIETY]
 STILL THERE ARE USERS WHOSE DECISIONS NEED TO TAME ALL Vs
  • 7. STILL THERE ARE USERS WHOSE DECISIONS NEED TO TAME ALL Vs OFF-SHORE OIL OPERATIONS
 ‣ When sensors on a drilling pipe in an oil rig indicate that it is about to get stuck, how long, according to historical records, can I keep drilling?
 ‣ 400,000 sensors from tens of different producers
 ‣ 10,000 observations per second, many outside operational ranges
  • 8. STILL THERE ARE USERS WHOSE DECISIONS NEED TO TAME ALL Vs REQUIREMENT ANALYSIS A system able to answer those queries must be able to
 ▸ handle massive datasets x
 ▸ process data streams on the fly x
 ▸ cope with heterogeneous datasets x
 ▸ cope with incomplete data x x
 ▸ cope with noisy data x
 ▸ provide reactive answers x
 [Table columns: Volume, Velocity, Variety, Veracity]
  • 9. STILL THERE ARE USERS WHOSE DECISIONS NEED TO TAME ALL Vs (PARTIAL) SOLUTIONS: STREAM PROCESSING
 ▸ A paradigmatic change!
 [Diagram labels: input streams, window, Registered Continuous Query, Dynamic System, streams of answers]
  • 10. STILL THERE ARE USERS WHOSE DECISIONS NEED TO TAME ALL Vs STREAM PROCESSING VS. REQUIREMENTS [Table: Requirement vs. SP. Rows: massive datasets, data streams, heterogeneous dataset, incomplete data, noisy data, reactive answers]
  • 11. STILL THERE ARE USERS WHOSE DECISIONS NEED TO TAME ALL Vs AI VS. REQUIREMENTS [Table: Requirement vs. SP and AI. Rows: massive datasets, data streams, heterogeneous dataset, incomplete data, noisy data, reactive answers]
  • 12. STILL THERE ARE USERS WHOSE DECISIONS NEED TO TAME ALL Vs STREAM REASONING RESEARCH QUESTION
 Is it possible to make sense in real time of multiple, heterogeneous, gigantic and inevitably noisy and incomplete data streams in order to support the decision processes of extremely large numbers of concurrent users?
 E. Della Valle, S. Ceri, F. van Harmelen & H. Stuckenschmidt, 2010
  • 13. DEDUCTIVE STREAM REASONING STREAM PROCESSING [Figure: time axis with a 1 minute wide window]
 Which are the top-4 most frequent colours in the last minute? ([colour], 13), ([colour], 12), ([colour], 8), ([colour], 8)
 Is there a [colour] followed by a [colour] in the last minute? Yes, many.
  • 14. DEDUCTIVE STREAM REASONING STREAM PROCESSING + SYMBOLIC AI [Figure: time axis with a 1 minute wide window; an ontology of colours]
  • 15. DEDUCTIVE STREAM REASONING DEDUCTIVE STREAM REASONING [Figure: time axis with a 1 minute wide window; an ontology of colours]
 Which are the top-2 most frequent cool colours in the last minute? ([colour], 13), ([colour], 8), ([colour], 8)
 Is there a primary cool colour followed by a secondary warm one? Yes, [colour] followed by [colour].
  • 16. INDUCTIVE STREAM REASONING THE CONCEPT DRIFT PROBLEM
  • 17. INDUCTIVE STREAM REASONING AN EARLY ATTEMPT AT INDUCTIVE STREAM REASONING
 D.F. Barbieri, D. Braga, S. Ceri, E. Della Valle, Y. Huang, V. Tresp, A. Rettinger, H. Wermser: Deductive and Inductive Stream Reasoning for Semantic Social Media Analytics. IEEE Intelligent Systems 25(6): 32-41 (2010)
 • How can we determine the optimal size of the window?
  • 18. INDUCTIVE STREAM REASONING ADAPTIVE SLIDING WINDOW (ADWIN)
 A. Bifet, R. Gavaldà: Adaptive Learning from Evolving Data Streams. In International Symposium on Intelligent Data Analysis (pp. 249-260). Springer (August 2009)
  • 19. INDUCTIVE STREAM REASONING CONCEPT DRIFT AND STREAMING MACHINE LEARNING
 • Hoeffding Adaptive Tree
 • Adaptive Random Forest
 • Temporally Augmented Classifier
 A. Bifet, R. Gavaldà, G. Holmes, B. Pfahringer: Machine Learning for Data Streams: with Practical Examples in MOA. The MIT Press (March 2, 2018)
 https://moa.cms.waikato.ac.nz/
  • 20. STREAM REASONING DEDUCTIVE + INDUCTIVE STREAM REASONING [Figure: time axis with a 1 minute wide window; a better ontology of colours, continuously learned from the data]
 Which are the most frequent sentiments in the last minute?
 Is there an impulsive, irritating colour followed by a happy one?
 The better the ontology of colours we use, the more expressive the queries we can register.
  • 21. STREAM REASONING 1000+ SCIENTIFIC PAPERS IN 12 YEARS
 ▸ It is possible to extend the Semantic Web stack in order to represent heterogeneous data streams, continuous queries, and continuous reasoning tasks
 ▸ It is possible to optimise continuous querying and continuous reasoning so as to provide reactive answers
 ▸ Streaming Machine Learning is starting to show that it is possible to continuously learn models
  • 22. STREAM REASONING STREAM REASONING VS. REQUIREMENTS [Table: Requirement vs. Stream Reasoning. Rows: massive datasets, data streams, heterogeneous dataset, incomplete data, noisy data, reactive answers. Legend: not specifically treated so far; treated but not resolved; universally addressed by all studies]
  • 23. STREAM REASONING MASTER THESIS?
 ▸ Semantics of stream processing languages
 ▸ Scaling deductive Stream Reasoning
 ▸ Stream Reasoning on Kafka
 ▸ Stream Reasoning on Spark
 ▸ Advancing Inductive Stream Reasoning
 ▸ Streaming Graph Machine Learning
 ▸ Streaming Anomaly Detection
 ▸ Build the first working inductive and deductive Stream Reasoners
 ▸ Applications
  • 25. STREAM REASONING THANK YOU! ANY QUESTIONS? Emanuele Della Valle
 Politecnico di Milano
 http://emanueledellavalle.org
 @manudellavalle Milano - 17.10.2019
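
The windowed queries on slides 13 and 15 can be sketched in a few lines of Python. This is only an illustrative sketch, not the system presented in the talk: the `SlidingWindow` class, its method names, and the `COLOUR_CLASSES` mapping are all hypothetical, and the toy ontology is an assumption because the slides do not spell out which colours are cool, warm, primary or secondary. It assumes events arrive in timestamp order.

```python
from collections import Counter, deque

# Hypothetical toy ontology: the slides do not list the colour classes,
# so this mapping is an assumption made only for illustration.
COLOUR_CLASSES = {
    "blue": {"cool", "primary"},
    "green": {"cool", "secondary"},
    "violet": {"cool", "secondary"},
    "red": {"warm", "primary"},
    "yellow": {"warm", "primary"},
    "orange": {"warm", "secondary"},
}


class SlidingWindow:
    """Time-based sliding window over (timestamp, colour) events."""

    def __init__(self, width_seconds=60):
        self.width = width_seconds
        self.events = deque()

    def push(self, timestamp, colour):
        # Assumes events arrive in non-decreasing timestamp order.
        self.events.append((timestamp, colour))
        # Evict events that are no longer inside the window.
        while self.events and self.events[0][0] <= timestamp - self.width:
            self.events.popleft()

    def top_k(self, k, required_class=None):
        """Most frequent colours in the window, optionally restricted
        to colours that the ontology places in required_class."""
        counts = Counter(
            colour
            for _, colour in self.events
            if required_class is None
            or required_class in COLOUR_CLASSES.get(colour, set())
        )
        return counts.most_common(k)

    def followed_by(self, first_classes, second_classes):
        """True if a colour belonging to all first_classes occurs strictly
        before a colour belonging to all second_classes in the window."""
        seen_first = False
        for _, colour in self.events:
            classes = COLOUR_CLASSES.get(colour, set())
            if seen_first and second_classes <= classes:
                return True
            if first_classes <= classes:
                seen_first = True
        return False


if __name__ == "__main__":
    window = SlidingWindow(width_seconds=60)
    for ts, colour in [(0, "red"), (10, "blue"), (20, "blue"), (30, "green"),
                       (40, "orange"), (50, "blue"), (55, "green")]:
        window.push(ts, colour)
    # Plain stream processing: top colours in the last minute.
    print(window.top_k(4))
    # Deductive stream reasoning: the ontology restricts the query to cool colours.
    print(window.top_k(2, required_class="cool"))
    print(window.followed_by({"cool", "primary"}, {"warm", "secondary"}))
```

Without the ontology, only queries over literal colour names are possible; with it, the same window answers queries phrased in terms of classes such as "cool" or "secondary warm", which is the step from slide 13 to slide 15.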