Ph. D. Final Dissertation SLides

SuNDroPS: Semantic and dyNamic Data in a
Pervasive System
Context-ADDICT Revisited
Doctoral Dissertation of:
Emanuele Panigati
Advisor: Prof. Letizia Tanca
Co-Advisors: Prof. Fabio A. Schreiber, Prof. G. Cugola
Politecnico di Milano { Dipartimento di Elettronica, Informazione e Bioingegneria
December 10th, 2014

Summary
Introduction and Motivation
The Green Move Project
The SuNDroPS Building Blocks
The SuNDroPS System
Real-World Testing
Conclusions & Future Works
Summary of the content
Introduction and Motivation Go
A motivating scenario: the Green Move project Go
The SuNDroPS system Go
The SuNDroPS legacy blocks Go
Context-ADDICT Go
PerLa & Tesla Go
Hystorical data analysis Go
The SuNDroPS new components Go
New features of PerLa Go
New features of Tesla/TRex: SemTRex Go
MR-Miner & MREClaT Go
Testing SuNDroPS in the Green Move scenario Go
Conclusions and

nal remarks Go
E. Panigati SuNDroPS: Semantic and dyNamic Data in a Pervasive System

Summary
The SuNDroPS System
Real-World Testing
Users are surrounded by a high quantity of heterogeneous data,
often in the form of data streams
Humans cannot fully exploit the whole richness of these data
without digital support for their analysis
Real-time, on-the-
y and historical data processing are equally
necessary to obtain useful knowledge

Summary
The SuNDroPS System
Real-World Testing
A Motivating Scenario: the Green Move Project
Green Move is a zero-emission vehicle-sharing system which
supports the users with additional digital services
The Green Move project has been used as a real-world on-

eld test,
using several SuNDroPS components
The user experience is

nely personalized based on the user context
and on contextual preferences, so the data management process
must consider a contextual tailoring of the data
Data coming as streams from dierent kinds of sensors (e.g.,
on-board vehicle status sensor, environmental sensor, . . . ) must be
processed on-the-
y in order to give to users an immediate feedback
and empower their experience.
Dierent Information Flow Processing systems have been considered
to perform this task (PerLa TRex)

Summary
The SuNDroPS System
Real-World Testing
Conclusions Future Works
Architecture Overview
Legacy: The Context-ADDICT system
Legacy: PerLa Tesla
Legacy: Hystorical Data Analysis
SuNDroPS Architecture Overview
SuNDroPS allows to manage the

ow of (possibly semantically
enriched) information contained
in data streams and to study
how to extract useful knowledge
from it and from the more
traditional, static datasets.

Summary
The SuNDroPS System
Real-World Testing
Legacy: PerLa Tesla
Allows to query dierent
and heterogeneous data
sources providing a
single entry-point for
queries
Automatically tailors the
query according to the
current context of the
user, and rewrites it,
integrating the results
coming from each
dierent data source

Summary
The SuNDroPS System
Real-World Testing
Legacy: PerLa Tesla
Legacy: PerLa Tesla
Two information
ow
processing systems
PerLa is based on a DSMS
paradigm
Tesla/TRex is based on a
Complex Event Processing
paradigm

Summary
The SuNDroPS System
Real-World Testing
Legacy: PerLa Tesla
More on Information Flow Processing
Two dierent approaches:
DSMSs, developed by the database community, consider a data
stream as a sequence of tuples, processing them using SQL-like
query languages
CEPs, developed by the distributed software engineering community,
consider the stream as a sequence of events and process them using
rule and/or logic based languages for temporal pattern detection
PerLa is an example of the

rst kind of systems while Tesla/TRex
belongs to the second category.

rst kind of systems while Tesla/TRex
belongs to the second category.
We next compare the features of the two approaches.

Summary
The SuNDroPS System
Real-World Testing
Legacy: PerLa Tesla
Check if a Vehicle is Being Stolen { PerLa Data
VehicleData
greenBox id Timestamp Speed
A300A 10/12/2014 14:00 0.0
B400B 10/12/2014 14:15 15.0
C100C 10/12/2014 14:01 50.0
TakenOrReleased
greenBox id Timestamp takenReleased
A300A 10/12/2014 7:00 TAKEN
A300A 10/12/2014 8:00 RELEASED
B400B 10/12/2014 10:00 TAKEN
B400B 10/12/2014 11:00 RELEASED
C100C 10/12/2014 12:00 TAKEN

Summary
The SuNDroPS System
Real-World Testing
Legacy: PerLa Tesla
Check if a Vehicle is Being Stolen { PerLa Queries
PerLa Low Level Query
CREATE SNAPSHOT MostRecentUse (greenBox id String, takenReleased Integer, date[3] Integer)
WITH DURATION 10
AS LOW:
EVERY 30 m
SELECT greenBox id, takenReleased, date[3]
HAVINGdate = MAX(date, 10)
UP TO 30m
SAMPLING ON EVENT takenInCharge Released
PerLa High Level Query
CREATE OUTPUT STREAM Theft (greenBox id String, recentUsage date)
AS HIGH:
EVERY 10 m
SELECT greenBox id, MAX(MostRecentUse.date) as mass
FROM MostRecentUse, TakenOrReleased, VehicleData
WHERE VehicleData.greenBox id = TakenOrReleased.greenBox id AND
TakenOrReleased.date = mass AND TakenOrReleased.takenReleased = 0 AND
VehicleData.speed 0

Summary
The SuNDroPS System
Real-World Testing
Legacy: PerLa Tesla
Check if a Vehicle is Being Stolen { PerLa Query Results
MostRecentUse
greenBox id takenReleased date
A300A 10/12/2014
8:00
RELEASED
B400B 10/12/2014
11:00
RELEASED
C100C 10/12/2014
12:00
TAKEN
Theft
greenBox id RecentUsage
B400B 10/12/2014 11:00

Summary
The SuNDroPS System
Real-World Testing
Legacy: PerLa Tesla
Check if a Vehicle is Being Stolen { Tesla Data Rule
Events
event id : 17; greenBox id : A300A; ts : 10=12=2014 14 : 00; speed : 0:0
event id : 17; greenBox id : B400B; ts : 10=12=2014 14 : 15; speed : 15:0
event id : 17; greenBox id : C100C; ts : 10=12=2014 14 : 01; speed : 50:0
event id : 112; greenBox id : A300A; ts : 10=12=2014 7 : 00
event id : 112; greenBox id : B400B; ts : 10=12=2014 10 : 00
event id : 112; greenBox id : C100C; ts : 10=12=2014 12 : 00

Summary
The SuNDroPS System
Real-World Testing
Legacy: PerLa Tesla
Events
Rule
DEFINE Theft (ID : String)
FROM VehicleData (greenBox id = $id AND speed 0) AND
LAST Release (greenBox id = $id) WITHIN 10day FROM VehicleData AND
NOT Taken (greenBox id = $id) BETWEEN Release AND VehicleData
WHERE ID = VehicleData.greenBox id

Summary
The SuNDroPS System
Real-World Testing
Legacy: PerLa Tesla
Events
Rule
DEFINE Theft (ID : String)
FROM VehicleData (greenBox id = $id AND speed 0) AND
LAST Release (greenBox id = $id) WITHIN 10day FROM VehicleData AND
NOT Taken (greenBox id = $id) BETWEEN Release AND VehicleData
WHERE ID = VehicleData.greenBox id
Results

Summary
The SuNDroPS System
Real-World Testing
Legacy: PerLa Tesla
Check User Driving Style w.r.t. Weather Conditions {
PerLa Data
VehicleData
greenBox id GPS Speed
A300A 45.1,15.1 30.0
B400B 45.1,20.2 15.0
C100C 42.2,15.1 50.0
Weather
GPS Climate Limit
45.1,15.1 Normal 130
45.1,20.2 Rain 90
42.2,15.1 Ice 30

Summary
The SuNDroPS System
Real-World Testing
Legacy: PerLa Tesla
PerLa Queries
PerLa Low Level Query
CREATE STREAM WeatherChange (position gps data, climate String)
AS LOW:
EVERY 10 m
SELECT position, climate
SAMPLING ON EVENT WeatherChanged
WHERE climate = Rain OR climate = Ice OR
climate = Snow OR climate = Fog
REFRESH EVERY 5 m
PerLa High Level Query
CREATE OUTPUT SNAPSHOT DangerousDriving (greenBox id String)
WITH DURATION 2 h
AS HIGH:
SELECT greenBox id
FROM VehicleData, WeatherChange, Weather
WHERE VehicleData.gps data = WeatherChange.position AND
WeatherChange.climate = Weather.climate AND
VehicleData.speed Weather.limits

Summary
The SuNDroPS System
Real-World Testing
Legacy: PerLa Tesla
PerLa Query Results
DangerousDriving
position climate
45.1,20.2 Rain
42.2,15.1 Ice
WeatherChange
greenBox id
C100C

Summary
The SuNDroPS System
Real-World Testing
Legacy: PerLa Tesla
Tesla Data
Events
event id : 17; greenBox id : A300A; pos : 45:1; 15:1; speed : 30:0
event id : 17; greenBox id : B400B; pos : 45:1; 20:2; speed : 15:0
event id : 17; greenBox id : C100C; pos : 42:2; 15:1; speed : 50:0
event id : 40; pos : 45:1; 15:1; climate : Normal; temp : 20
event id : 40; pos : 45:1; 20:2; climate : Rain; temp : 17
event id : 40; pos : 42:2; 15:1; climate : Ice; temp : 2

Summary
The SuNDroPS System
Real-World Testing
Legacy: PerLa Tesla
Tesla Rules Results
Rule (Rain)
DEFINE DangerousDrivingRain(ID : String) FROM VehicleData(speed90) AND LAST
Weather(VehicleData.pos-xposVehicleData.pos+x) WITHIN 1h FROM VehicleData AND Weather.climate=rain
WHERE DangerousDrivingRain.ID = VehicleData.greenBox id
Rule (Ice)
DEFINE DangerousDrivingIce(ID : String) FROM VehicleData(speed50) AND LAST
Weather(VehicleData.pos-xposVehicleData.pos+x and temp0) WITHIN 1h FROM VehicleData WHERE
DangerousDrivingRain.ID = VehicleData.greenBox id
Results
event id : 340; greenBox id : C100C

Summary
The SuNDroPS System
Real-World Testing
Legacy: PerLa Tesla
Data Mining allows to extract knowledge from the gathered data,
discovering previously unknown facts from them.
Frequent Itemset Mining

nds in the database all the sets of items
whose frequency is above a given support threshold
Several algorithms are available to perform this task:
A Priori
Partition
FP-Growth
EClaT

Summary
The SuNDroPS System
Real-World Testing
The SuNDroPS New Components
New features of PerLa
New features of Tesla/TRex
MapReduce-based Frequent Itemset Mining : MR-Miner MREClaT
New Components in the Big Data Era
SuNDroPS Adds new features to
Context-ADDICT:
Monitors the environment directly,
using sensors, also reasoning on the
gathered data
Automatically infers (part of) the
user context from the
environmental data that have been
sensed
Integrates historical data processing
with analysis operations,
introducing a new parallel Data
Mining algorithms

Summary
The SuNDroPS System
Real-World Testing
The PerLa middleware has been completely reengineered to include
asynchronous behaviors of sources (sensors, web services, . . . )
Distributed PerLa allows to exploit the sources (and network)
computation power
PerLa for Context explicitly integrates the context-aware approach of
Context-ADDICT with Context-Oriented Programming COP,
allowing sensors to behave dierently according to their current
context

Summary
The SuNDroPS System
Real-World Testing
New features of Tesla/TRex: SemTRex
Original TRex cannot interact with static data
SemTRex adds a RDF static data repository to TRex and new
operators in the Tesla language (IN)
Integrating RDF repositories allows reasoning on the data
IN allows to:
Enrich the events, including into them facts retrieved from the KB
Filter the events using facts included in the KB
Prefetching and Caching of data become necessary to keep
reasonable response times: basic cache, parametric basic cache,
frequent data cache, combined caching strategy.

Summary
The SuNDroPS System
Real-World Testing
MapReduce-based Frequent Itemset Mining
MR-Miner supports the mining
processes in SuNDroPS using
MREClaT, an EClaT-based
algorithm exploiting the MapReduce
programming paradigm, that allows
SuNDroPS to analyze the data load
typical of Big Data scenarios

Summary
The SuNDroPS System
Real-World Testing
MR-Miner MREClaT { Algorithm Details
First step: Mine 1-frequent itemsets

Summary
The SuNDroPS System
Real-World Testing
Second step: Mine 2-frequent itemsets

Summary
The SuNDroPS System
Real-World Testing
Third step: Mine k-frequent itemsets

Summary
The SuNDroPS System
Real-World Testing
Experiments

Summary
The SuNDroPS System
Real-World Testing
Pre

x Extension Experiments

Summary
The SuNDroPS System
Real-World Testing
Testing SuNDroPS in the Green Move Scenario
PerLa context-aware sensors
SemTRex Pervasive and context-aware information push
Context-aware vehicle assignment to user reservation, based on
contextual user preferences

Summary
The SuNDroPS System
Real-World Testing
Conclusions
Future Works
Conclusions
The SuNDroPS system helps users to deal with the high information load
that surronds them
Context inference based on the environmental sensor data
ows
Historical data mining using parallel MapReduce algorithms to speed
up processing
Semantic-enhanced complex event processing using cache to reduce
the performance degradation due to the disk bottle-neck
Reengineering of the PerLa middleware allowing its distribution on
the network components and the integration with other
context-oriented paradigms (e.g. Context Oriented Programming)

Ph. D. Final Dissertation SLides

Recommended

Recommended

More Related Content

What's hot

What's hot (15)

Viewers also liked

Viewers also liked (19)

Similar to Ph. D. Final Dissertation SLides

Similar to Ph. D. Final Dissertation SLides (20)

Recently uploaded

Recently uploaded (20)

Ph. D. Final Dissertation SLides