SlideShare a Scribd company logo
1 of 32
Download to read offline
 Copyright 2010 Digital Enterprise Research Institute. All rights reserved.
Digital Enterprise Research Institute www.deri.ie
On-The-Fly Generation of
Multidimensional Data Cubes for
Web of Things
Muntazir Mehdi
(DERI, TU Kaiserslautern)
Stefan.Decker@deri.org
http://www.StefanDecker.org/
Digital Enterprise Research Institute www.deri.ie
Agenda
 Motivation and Background
• Problem statement, Use case, Linked Data, WoT
 Processing Metadata for Cube Creation
• Capturing and Publishing Sensor data, Event Registration
 Cube Generation
• EDWH Agent, An example Scenario
 Other Potential Use Cases
 Results and Evaluation
 Conclusion
Digital Enterprise Research Institute www.deri.ie
Motivation & Background
 Enterprises producing huge amounts of data
making data management, exchange and decision
making complex.
 Use Case (Smart Buildings)
1. Rely on Sensor data for decision making
2. Heterogeneous and Big Data Management
3. Event Processing can be applied to sustain decision making
4. Limited support for decision making with event processing
techniques
5. Controlling supply / demand based on statistical data
6. Identify meaningful event and deal with them asap
Digital Enterprise Research Institute www.deri.ie
Motivation & Background (continued)
 Heterogeneous Data Management
1. Different Data generated from different applications within one or
more smart environments.
2. For example: A smart city relying on combined data from different
smart buildings.
3. Linked data: A set of best practices to represent data into RDF and
link, relate or connect to other RDF data.
4. Linked Open Data (LOD) Cloud: A huge openly available cloud of
linked data from different domains.
Digital Enterprise Research Institute www.deri.ie
Motivation & Background (continued)
 Big Data Management
1. A fast response to complex queries to support event processing.
2. Huge amounts of sensor data as RDF.
3. Generation of real-time multidimensional and contextual data cubes
to sustain fast responses to complex queries.
4. An event data-warehouse.
5. Multidimensional shape of data in data-warehouse = A data cube =
Structuring information into dimensions and facts or measures.
Digital Enterprise Research Institute www.deri.ie
 Why Data-warehouse for events?
1. Data characteristics:
• Logged once, never updated
• Flat data, no need to normalize
• Incoming data: temporal (based on time)
2. Objective characteristics:
• Reporting, Analysis, Prediction, Mining, Pattern Identification……
• To use a data model to speed up querying unlike transactional processing system
• To provide with a historical repository containing features as per interest
• Support Complex Event Processing
Motivation & Background (continued)
Digital Enterprise Research Institute www.deri.ie
Motivation & Background (continued)
 Web of Things
1. Extending the Web to easily blend real-world objects like electronic
appliances, sensors and embedded devices etc.
2. Even though we are limited to sensor data in our use case, the
approach can be easily extended.
3. CoAP (Constrained Application Protocol): A Web transfer protocol for
request/response model.
Digital Enterprise Research Institute www.deri.ie
Related Work
 Antoniades, Athos, et al. "Linked2Safety: A secure linked
data medical information space for semantically-
interconnecting EHRs advancing patients' safety in medical
research." Bioinformatics & Bioengineering (BIBE), 2012 IEEE
12th International Conference on. IEEE, 2012.
 Lefort, Laurent, et al. "A Linked Sensor Data Cube for a 100
Year Homogenised Daily Temperature Dataset." SSN. 2012.
 ENERGIE VISIBLE
(http://www.webofthings.org/energievisible/)
Digital Enterprise Research Institute www.deri.ie
Processing Metadata for Cube Generation
Involves two major steps:
1. Capturing and Publishing Sensor Data
2. Event Registration
Digital Enterprise Research Institute www.deri.ie
Capturing and Publishing Sensor Data: An
example Scenario
JMS
SERVER
& publish on JMS Server
RDF
Oh wait,
I see a way of converting them into RDF,
add relevant metadata,
SSN
Event Stream
Event Stream
Event Stream
Digital Enterprise Research Institute www.deri.ie
Capturing and Publishing Sensor Data:
Process
Filter
UDP Listeners
&
CoAP Clients
RDFizer
JMS Publisher Enricher
JMS Server Metadata
Knowledge Base
S1
S2
S3
Sn
Digital Enterprise Research Institute www.deri.ie
Event Registration: EDWH Ontology
NamedCubeGraph
Configuration
Dimension
Measure
Source Event
JMSSource
Digital Enterprise Research Institute www.deri.ie
Event Registration Process
Specify Event
Type
Specify Event
Source
Select
Measures
Select
Dimensions
Specify Graph
Details
EDWH Ontology Instance
Digital Enterprise Research Institute www.deri.ie
Dimension Selection: Example
Digital Enterprise Research Institute www.deri.ie
Measure Selection: Example
Digital Enterprise Research Institute www.deri.ie
Cube Generation
1. Requires an event to be registered into the system.
2. Current implementation generates cubes based on
time dimension only. However, it can be easily
extended to attain other dimensions.
3. Critical component: EDWH Agent
Digital Enterprise Research Institute www.deri.ie
Cube Generation: EDWH Agent Architecture
Digital Enterprise Research Institute www.deri.ie
Cubes Generation: An example Scenario
JMS
SERVER
RDF
RDF
CUBES AS
RDF MEETS Mr. CUBES
EDWH Ontology
CUBE
Store
Digital Enterprise Research Institute www.deri.ie
Cubes Generation: Our Use Case
Digital Enterprise Research Institute www.deri.ie
Use Cases
CUBE
Store
Digital Enterprise Research Institute www.deri.ie
Use Case: 1
The electricity usage at location X for duration Y for consumer Z
has been moderate as compared to previous duration W.
CUBE
Store
Digital Enterprise Research Institute www.deri.ie
Use Case: 2
Historical Data suggests that the weather is going to be windy and
Rainy in Galway even after the Easter.
CUBE
Store
Digital Enterprise Research Institute www.deri.ie
Use Case: 3
CUBE
Store
Some suspicious activity has been detected on your credit card!
Digital Enterprise Research Institute www.deri.ie
Use Case: 4
Linked
CUBE
Stores
Each of these things
can be achieved from
one place
Digital Enterprise Research Institute www.deri.ie
Evaluation
 We evaluated our system in terms of
1. Total number of cubes generated
2. Size of each cube
3. Accuracy of generated cubes
4. Impact of adding and removing dimensions on size of cube
5. Performance of the system to generate cubes
6. Query Execution Time (QET)
Digital Enterprise Research Institute www.deri.ie
Evaluation
0
2000
4000
6000
8000
10000
12000
14000
16000
Time(milliseconds)
Quarter Cube
Day Cube
Hour Cube
Digital Enterprise Research Institute www.deri.ie
Evaluation: Size
Digital Enterprise Research Institute www.deri.ie
Evaluation: Impact of dimensions
1 Dim
1 Dim
1 Dim
2 Dim
2 Dim
2 Dim
3 Dim
3 Dim
3 Dim
0
50
100
150
200
Quarter Hour Day
StorgaeSizeperCube(KB)
1 Dim 2 Dim 3 Dim
Digital Enterprise Research Institute www.deri.ie
Evaluation: Query Set for QET
Digital Enterprise Research Institute www.deri.ie
Evaluation: QET Comparison
Digital Enterprise Research Institute www.deri.ie
Conclusion
With the approach presented, we were able to enrich
events with necessary metadata, and process
enriched events to generate on-the-fly data cubes.
After looking at performance chart shown in previous
slides, it is safe to conclude that our approach
provides a good way of generating data cubes on-the-
fly in a real-time sensor network.
Digital Enterprise Research Institute www.deri.ie
Questions

More Related Content

What's hot

The Rise of Logical Data Architecture - Breaking the Data Gravity Notion (Mid...
The Rise of Logical Data Architecture - Breaking the Data Gravity Notion (Mid...The Rise of Logical Data Architecture - Breaking the Data Gravity Notion (Mid...
The Rise of Logical Data Architecture - Breaking the Data Gravity Notion (Mid...
Denodo
 
How Google Does Big Data - DevNexus 2014
How Google Does Big Data - DevNexus 2014How Google Does Big Data - DevNexus 2014
How Google Does Big Data - DevNexus 2014
James Chittenden
 

What's hot (20)

Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Empower Splunk and other SIEMs with the Databricks Lakehouse for Cybersecurity
Empower Splunk and other SIEMs with the Databricks Lakehouse for CybersecurityEmpower Splunk and other SIEMs with the Databricks Lakehouse for Cybersecurity
Empower Splunk and other SIEMs with the Databricks Lakehouse for Cybersecurity
 
Big Data Fundamentals
Big Data FundamentalsBig Data Fundamentals
Big Data Fundamentals
 
The Rise of Logical Data Architecture - Breaking the Data Gravity Notion (Mid...
The Rise of Logical Data Architecture - Breaking the Data Gravity Notion (Mid...The Rise of Logical Data Architecture - Breaking the Data Gravity Notion (Mid...
The Rise of Logical Data Architecture - Breaking the Data Gravity Notion (Mid...
 
Big Data Analytics for Real Time Systems
Big Data Analytics for Real Time SystemsBig Data Analytics for Real Time Systems
Big Data Analytics for Real Time Systems
 
Big Data Hadoop Training by Easylearning Guru
Big Data Hadoop Training by Easylearning GuruBig Data Hadoop Training by Easylearning Guru
Big Data Hadoop Training by Easylearning Guru
 
Big data management
Big data managementBig data management
Big data management
 
Big Data on Public Cloud
Big Data on Public CloudBig Data on Public Cloud
Big Data on Public Cloud
 
Big data and cloud computing 9 sep-2017
Big data and cloud computing 9 sep-2017Big data and cloud computing 9 sep-2017
Big data and cloud computing 9 sep-2017
 
Introduction to Cloud Computing and Big Data
Introduction to Cloud Computing and Big DataIntroduction to Cloud Computing and Big Data
Introduction to Cloud Computing and Big Data
 
How Google Does Big Data - DevNexus 2014
How Google Does Big Data - DevNexus 2014How Google Does Big Data - DevNexus 2014
How Google Does Big Data - DevNexus 2014
 
Digital Velocity 2014: "The Holy Grail of Digital Data Analytics"
Digital Velocity 2014: "The Holy Grail of Digital Data Analytics"Digital Velocity 2014: "The Holy Grail of Digital Data Analytics"
Digital Velocity 2014: "The Holy Grail of Digital Data Analytics"
 
Motivation for big data
Motivation for big dataMotivation for big data
Motivation for big data
 
Bigdata
Bigdata Bigdata
Bigdata
 
Big data privacy issues in public social media
Big data privacy issues in public social mediaBig data privacy issues in public social media
Big data privacy issues in public social media
 
Architectures for Data Commons (XLDB 15 Lightning Talk)
Architectures for Data Commons (XLDB 15 Lightning Talk)Architectures for Data Commons (XLDB 15 Lightning Talk)
Architectures for Data Commons (XLDB 15 Lightning Talk)
 
A Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data ScienceA Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data Science
 
Lesson 1 introduction to_big_data_and_hadoop.pptx
Lesson 1 introduction to_big_data_and_hadoop.pptxLesson 1 introduction to_big_data_and_hadoop.pptx
Lesson 1 introduction to_big_data_and_hadoop.pptx
 
10 Most Effective Big Data Technologies
10 Most Effective Big Data Technologies10 Most Effective Big Data Technologies
10 Most Effective Big Data Technologies
 
Introduction to Big Data Analytics: Batch, Real-Time, and the Best of Both Wo...
Introduction to Big Data Analytics: Batch, Real-Time, and the Best of Both Wo...Introduction to Big Data Analytics: Batch, Real-Time, and the Best of Both Wo...
Introduction to Big Data Analytics: Batch, Real-Time, and the Best of Both Wo...
 

Viewers also liked (8)

Imagenyforma
ImagenyformaImagenyforma
Imagenyforma
 
W.Q. "el símbolo en la imágen"
W.Q. "el símbolo en la imágen"W.Q. "el símbolo en la imágen"
W.Q. "el símbolo en la imágen"
 
Figura humana en el arte
Figura humana en el arteFigura humana en el arte
Figura humana en el arte
 
Signosysimbolos
SignosysimbolosSignosysimbolos
Signosysimbolos
 
La figura humana en el arte
La figura humana en el arteLa figura humana en el arte
La figura humana en el arte
 
ImáGenes Y SíMbolos
ImáGenes Y SíMbolosImáGenes Y SíMbolos
ImáGenes Y SíMbolos
 
La figura humana a través de la historia
La figura humana a través de la historiaLa figura humana a través de la historia
La figura humana a través de la historia
 
El cuerpo humano en las artes visuales
El cuerpo humano en las artes visualesEl cuerpo humano en las artes visuales
El cuerpo humano en las artes visuales
 

Similar to IDEAS 2013 Presentation

Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
ALTER WAY
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and Roadmap
Srinath Perera
 
So Long Computer Overlords
So Long Computer OverlordsSo Long Computer Overlords
So Long Computer Overlords
Ian Foster
 

Similar to IDEAS 2013 Presentation (20)

Introduction Big Data
Introduction Big DataIntroduction Big Data
Introduction Big Data
 
Big data4businessusers
Big data4businessusersBig data4businessusers
Big data4businessusers
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Big Data and Data Science: The Technologies Shaping Our Lives
Big Data and Data Science: The Technologies Shaping Our LivesBig Data and Data Science: The Technologies Shaping Our Lives
Big Data and Data Science: The Technologies Shaping Our Lives
 
A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)
 
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...
 
Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...
Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...
Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Big data.pptx
Big data.pptxBig data.pptx
Big data.pptx
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
SKILLWISE-BIGDATA ANALYSIS
SKILLWISE-BIGDATA ANALYSISSKILLWISE-BIGDATA ANALYSIS
SKILLWISE-BIGDATA ANALYSIS
 
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
 
Breed data scientists_ A Presentation.pptx
Breed data scientists_ A Presentation.pptxBreed data scientists_ A Presentation.pptx
Breed data scientists_ A Presentation.pptx
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and Roadmap
 
Myth Busters III: I’m Building a Data Lake, So I Don’t Need Data Virtualization
Myth Busters III: I’m Building a Data Lake, So I Don’t Need Data VirtualizationMyth Busters III: I’m Building a Data Lake, So I Don’t Need Data Virtualization
Myth Busters III: I’m Building a Data Lake, So I Don’t Need Data Virtualization
 
SQL Server 2008 R2 StreamInsight
SQL Server 2008 R2 StreamInsightSQL Server 2008 R2 StreamInsight
SQL Server 2008 R2 StreamInsight
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
So Long Computer Overlords
So Long Computer OverlordsSo Long Computer Overlords
So Long Computer Overlords
 
Internet Infrastructures for Big Data (Verisign's Distinguished Speaker Series)
Internet Infrastructures for Big Data (Verisign's Distinguished Speaker Series)Internet Infrastructures for Big Data (Verisign's Distinguished Speaker Series)
Internet Infrastructures for Big Data (Verisign's Distinguished Speaker Series)
 
Internet of Things
Internet of ThingsInternet of Things
Internet of Things
 

Recently uploaded

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 

Recently uploaded (20)

Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 

IDEAS 2013 Presentation

  • 1.  Copyright 2010 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute www.deri.ie On-The-Fly Generation of Multidimensional Data Cubes for Web of Things Muntazir Mehdi (DERI, TU Kaiserslautern) Stefan.Decker@deri.org http://www.StefanDecker.org/
  • 2. Digital Enterprise Research Institute www.deri.ie Agenda  Motivation and Background • Problem statement, Use case, Linked Data, WoT  Processing Metadata for Cube Creation • Capturing and Publishing Sensor data, Event Registration  Cube Generation • EDWH Agent, An example Scenario  Other Potential Use Cases  Results and Evaluation  Conclusion
  • 3. Digital Enterprise Research Institute www.deri.ie Motivation & Background  Enterprises producing huge amounts of data making data management, exchange and decision making complex.  Use Case (Smart Buildings) 1. Rely on Sensor data for decision making 2. Heterogeneous and Big Data Management 3. Event Processing can be applied to sustain decision making 4. Limited support for decision making with event processing techniques 5. Controlling supply / demand based on statistical data 6. Identify meaningful event and deal with them asap
  • 4. Digital Enterprise Research Institute www.deri.ie Motivation & Background (continued)  Heterogeneous Data Management 1. Different Data generated from different applications within one or more smart environments. 2. For example: A smart city relying on combined data from different smart buildings. 3. Linked data: A set of best practices to represent data into RDF and link, relate or connect to other RDF data. 4. Linked Open Data (LOD) Cloud: A huge openly available cloud of linked data from different domains.
  • 5. Digital Enterprise Research Institute www.deri.ie Motivation & Background (continued)  Big Data Management 1. A fast response to complex queries to support event processing. 2. Huge amounts of sensor data as RDF. 3. Generation of real-time multidimensional and contextual data cubes to sustain fast responses to complex queries. 4. An event data-warehouse. 5. Multidimensional shape of data in data-warehouse = A data cube = Structuring information into dimensions and facts or measures.
  • 6. Digital Enterprise Research Institute www.deri.ie  Why Data-warehouse for events? 1. Data characteristics: • Logged once, never updated • Flat data, no need to normalize • Incoming data: temporal (based on time) 2. Objective characteristics: • Reporting, Analysis, Prediction, Mining, Pattern Identification…… • To use a data model to speed up querying unlike transactional processing system • To provide with a historical repository containing features as per interest • Support Complex Event Processing Motivation & Background (continued)
  • 7. Digital Enterprise Research Institute www.deri.ie Motivation & Background (continued)  Web of Things 1. Extending the Web to easily blend real-world objects like electronic appliances, sensors and embedded devices etc. 2. Even though we are limited to sensor data in our use case, the approach can be easily extended. 3. CoAP (Constrained Application Protocol): A Web transfer protocol for request/response model.
  • 8. Digital Enterprise Research Institute www.deri.ie Related Work  Antoniades, Athos, et al. "Linked2Safety: A secure linked data medical information space for semantically- interconnecting EHRs advancing patients' safety in medical research." Bioinformatics & Bioengineering (BIBE), 2012 IEEE 12th International Conference on. IEEE, 2012.  Lefort, Laurent, et al. "A Linked Sensor Data Cube for a 100 Year Homogenised Daily Temperature Dataset." SSN. 2012.  ENERGIE VISIBLE (http://www.webofthings.org/energievisible/)
  • 9. Digital Enterprise Research Institute www.deri.ie Processing Metadata for Cube Generation Involves two major steps: 1. Capturing and Publishing Sensor Data 2. Event Registration
  • 10. Digital Enterprise Research Institute www.deri.ie Capturing and Publishing Sensor Data: An example Scenario JMS SERVER & publish on JMS Server RDF Oh wait, I see a way of converting them into RDF, add relevant metadata, SSN Event Stream Event Stream Event Stream
  • 11. Digital Enterprise Research Institute www.deri.ie Capturing and Publishing Sensor Data: Process Filter UDP Listeners & CoAP Clients RDFizer JMS Publisher Enricher JMS Server Metadata Knowledge Base S1 S2 S3 Sn
  • 12. Digital Enterprise Research Institute www.deri.ie Event Registration: EDWH Ontology NamedCubeGraph Configuration Dimension Measure Source Event JMSSource
  • 13. Digital Enterprise Research Institute www.deri.ie Event Registration Process Specify Event Type Specify Event Source Select Measures Select Dimensions Specify Graph Details EDWH Ontology Instance
  • 14. Digital Enterprise Research Institute www.deri.ie Dimension Selection: Example
  • 15. Digital Enterprise Research Institute www.deri.ie Measure Selection: Example
  • 16. Digital Enterprise Research Institute www.deri.ie Cube Generation 1. Requires an event to be registered into the system. 2. Current implementation generates cubes based on time dimension only. However, it can be easily extended to attain other dimensions. 3. Critical component: EDWH Agent
  • 17. Digital Enterprise Research Institute www.deri.ie Cube Generation: EDWH Agent Architecture
  • 18. Digital Enterprise Research Institute www.deri.ie Cubes Generation: An example Scenario JMS SERVER RDF RDF CUBES AS RDF MEETS Mr. CUBES EDWH Ontology CUBE Store
  • 19. Digital Enterprise Research Institute www.deri.ie Cubes Generation: Our Use Case
  • 20. Digital Enterprise Research Institute www.deri.ie Use Cases CUBE Store
  • 21. Digital Enterprise Research Institute www.deri.ie Use Case: 1 The electricity usage at location X for duration Y for consumer Z has been moderate as compared to previous duration W. CUBE Store
  • 22. Digital Enterprise Research Institute www.deri.ie Use Case: 2 Historical Data suggests that the weather is going to be windy and Rainy in Galway even after the Easter. CUBE Store
  • 23. Digital Enterprise Research Institute www.deri.ie Use Case: 3 CUBE Store Some suspicious activity has been detected on your credit card!
  • 24. Digital Enterprise Research Institute www.deri.ie Use Case: 4 Linked CUBE Stores Each of these things can be achieved from one place
  • 25. Digital Enterprise Research Institute www.deri.ie Evaluation  We evaluated our system in terms of 1. Total number of cubes generated 2. Size of each cube 3. Accuracy of generated cubes 4. Impact of adding and removing dimensions on size of cube 5. Performance of the system to generate cubes 6. Query Execution Time (QET)
  • 26. Digital Enterprise Research Institute www.deri.ie Evaluation 0 2000 4000 6000 8000 10000 12000 14000 16000 Time(milliseconds) Quarter Cube Day Cube Hour Cube
  • 27. Digital Enterprise Research Institute www.deri.ie Evaluation: Size
  • 28. Digital Enterprise Research Institute www.deri.ie Evaluation: Impact of dimensions 1 Dim 1 Dim 1 Dim 2 Dim 2 Dim 2 Dim 3 Dim 3 Dim 3 Dim 0 50 100 150 200 Quarter Hour Day StorgaeSizeperCube(KB) 1 Dim 2 Dim 3 Dim
  • 29. Digital Enterprise Research Institute www.deri.ie Evaluation: Query Set for QET
  • 30. Digital Enterprise Research Institute www.deri.ie Evaluation: QET Comparison
  • 31. Digital Enterprise Research Institute www.deri.ie Conclusion With the approach presented, we were able to enrich events with necessary metadata, and process enriched events to generate on-the-fly data cubes. After looking at performance chart shown in previous slides, it is safe to conclude that our approach provides a good way of generating data cubes on-the- fly in a real-time sensor network.
  • 32. Digital Enterprise Research Institute www.deri.ie Questions