SlideShare a Scribd company logo
An Autonomic Approach to Real-Time Predictive
Analytics using Open Data and Internet of Things
Wassim Derguech, Eanna Burke, Edward Curry
Insight Centre For Data Analytics
National University of Ireland, Galway
UIC 2014 - The 11th IEEE International Conference
on Ubiquitous Intelligence and Computing
December 9-12, 2014
Ayodya Resort, Bali, Indonesia
Motivation: Internet of Things (IOT)
Smart Homes, Grids, Cities…
by 2020 50 billion devices connected to mobile networks (OECD, 2012)
Today’s Internet of Things “behaviour”(Abbas M. Keynote Presentation at UiTM WSN Seminar 2012)
74%
Real-time
location-
based info.
71%
Payment
apps.
60%
Weather
apps.
60%
Want
connected
system in
car.
29%
Health apps.
51%
Maps/Naviga
tion/Search
Motivation: Open Data
• Open Data, not simply big data, will be driver for growth, ingenuity, and innovation
in the UK economy. (Deloitte Analytics, 2012)
• $1.5 Billion: US National Weather Service supporting a private weather industry
per year (CapGemini 2014)
• $32 Billion: Estimated direct impact of Open Data in 2010 on the EU27, annual
growth of 7% (Vickery 2011)
• $140 Billion: Estimated aggregate direct and indirect impact across EU27
(Vickery 2011)
• $3 Trillion: Estimated annual economic potential across seven domains.
(McKinsey 2013)
[Deirdre Lee, Presentation from The Open Group Conference in London, 22 October 2014]
Problem statement and contribution
• Open Data is becoming more and more available and valuable!
• Both public and private data can be used to drive decision making.
 Challenge: selection of the best Open Data and IoT source
to support predictive analytics.
• Our contribution:
(1) data management: collection, filtering, and warehousing
(2) data analytics: source selection and predictive analytics
• Our promise: An autonomic system
Self-Configuration Self-Optimization Self-Healing
Open Data
Weather Forecast
Web Services
Internet of Things
Building Power Prediction System
Sensor Data
Building Power consumption
Learning 
relationships 
between variables
Use the learned 
relationships for 
prediction
Use Case: predicting energy usage using sensor and weather data
Open Data Management
Any data source including:
sensors, web services, IoT, etc.
For each data source a collector
is required depending on their
communication protocol.
Receive data from collectors
and transform it using a
predefined format/RDF
schema/Ontology.
Persist the data into a local RDF
store.
Ontologies in use for our use case
• Ontologies constitute formal specifications for shared
conceptualisations  foster reuse of existing assets
• Criteria for choosing the right ontology for our use case:
•In use : proven to be good in practice and well documented for easy learning
•Relevant: describing surface readings rather than space or marine conditions
•Numerical: in order to be suitable for machine learning. Avoid vague terms such as
“hot” or “humid”
Sensor Data
Semantic Sensor
Network (SSN)
Weather Observation
AEMET Weather
Observation
Ontology
Weather Prediction
Meteo Ontology
Predictive Analytics
Source
Selector
• Evaluates sources and builds a
prediction model from the best
Open Data and IoT source
• Requirements for machine learning algorithm:
• Reasonable accuracy
• Quick prediction model generation
• Work well with little data
• Work well with nominal and numerical inputs
• Low configuration
• Give insights into the factors influencing predictions
Reselection
Controller
• Triggers the re-
selection of data
sources for building a
new prediction model
• Criteria for a re-
selection
Error Comparison
Module
• Uses the prediction model generated by the Source Selector (6) for
generating energy usage predictions
• Sends predicted data to the UI (7) to be displayed to the user
• Compares the predicted and current energy usage values and
generates the error rates produced (1)
Implementation: Data Sources
Weather Observations
Local Weather
Station within the
University
Open Weather Map
web service
Weather Forecast
Ham Weather
Forecast Web
Service
YR.no Weather
Forecast Web
Service
Sensor Data
Building Power
consumption
Implementation:
User Interface
• The User Interface is the
consumer of produced
predictions.
• Used only for displaying
results
• Not used for providing
any inputs or
configurations
Evaluation: Initiation of the System
Object: Evaluate how quick the system starts to
provide reasonable results
Very high error
rate at the
initiation of the
system
Better results
due to 1 day
historical data
First non-
working day
(Saturday)
Conclusion:
• Initial very high error rates, reduced over time depending on the length of
the historical data
• The system runs for 12 days to reach a steady error rate
Without introducing faults
Usage ratio between August 23
at 16:23 and 26th at 18:11
64%
36%
Evaluation: Partial Failure of a Weather Station
Object: Evaluate the autonomous aspect of the
system (self-healing and self-configuration)
Local Weather Station
within the University
Open Weather Map web
service
Close proximity of the weather
station provides better results
Introducing Faults
Temperature = 0.0
 4 hours to select an alternative station
 Possible improvement: update the reselection
criteria
Evaluation: Machine Learning Algorithms Testing
Object: What is the most suitable machine learning
algorithm for effective predictive analytics
Experimental Set-Up
• 1 week data as short term data set
• 5 weeks data as a long term data set
• Observation: building main incoming power
• Weather observation: local NUIG station
• Training set = 66% vs. test set = 34%
Four machine learning algorithms in WEKA
1. SMOReg – Support Vector Machine for Regression
2. 1 hidden layer back propagation ANN
3. 2 hidden layer back propagation ANN
4. Linear Regression
Evaluation: Machine Learning Algorithms Testing
DATASET SIZE = 672 – TRAINING DATA SIZE = 443
– TEST DATOA SET SIZE = 229
MA Error RMSE Time Corr. Coeff
SMOReg 5.3638 6.9759 0.851 0.6233
1 layer ANN 2.7606 3.6242 47.004 0.9158
2 layer ANN 3.0473 4.1506 50.842 0.8961
Linear Regression 4.8586 5.9283 0.759 0.7396
MA Error RMSE Time Corr. Coeff
SMOReg 4.6755 6.5965 32.4 0.8054
1 layer ANN 3.3332 4.5841 229.7 0.9162
2 layer ANN 3.7566 4.7279 247.0 0.9221
Linear Regression 4.7579 6.0173 2.4 0.8396
DATASET SIZE = 3298 – TRAINING DATA SIZE = 2176
– TEST DATOA SET SIZE = 1122
ANN more
accurate but
very slow
SMOReg and LR
took roughly the
same time but
LR more
accurate
Conclusion
Our contribution: an architecture for evaluating open data sources for
real-time predictive analytics comprising of:
(1) data management: collection, filtering, and warehousing
 Multiple data sources
 Reused existing vocabularies
(2) data analytics: source selection and predictive analytics
 Tested with short and long term data sets
 No configuration input required
 Results shown on a simple UI
• Our promise: An autonomic system:
 Experiment 1 = self-configuration and self-optimization
 Experiment 2 = self-healing and self-optimization
Future Work
• Fully autonomous system:
 discover data source autonomously
• ANN has high accuracy but very slow:
 investigate solutions for reducing the time for creating ANNs
• Using Open Data for better analytics
 working days vs. non-working days
Contact details
Wassim Derguech, Eanna Burke, Edward Curry
Insight Centre For Data Analytics -National University of Ireland, Galway
wassim.derguech@insight-centre.org ,
eannaburke1@gmail.com
edward.curry@insight-centre.org

More Related Content

What's hot

MobiDE’2012, Phoenix, AZ, United States, 20 May, 2012
MobiDE’2012, Phoenix, AZ, United States, 20 May, 2012MobiDE’2012, Phoenix, AZ, United States, 20 May, 2012
MobiDE’2012, Phoenix, AZ, United States, 20 May, 2012
Charith Perera
 
Enrich Machine-to-Machine Data with Semantic Web Technologies for Cross-Domai...
Enrich Machine-to-Machine Data with Semantic Web Technologies for Cross-Domai...Enrich Machine-to-Machine Data with Semantic Web Technologies for Cross-Domai...
Enrich Machine-to-Machine Data with Semantic Web Technologies for Cross-Domai...
Amélie Gyrard
 
IRJET- Optimization of Completion Time through Efficient Resource Allocation ...
IRJET- Optimization of Completion Time through Efficient Resource Allocation ...IRJET- Optimization of Completion Time through Efficient Resource Allocation ...
IRJET- Optimization of Completion Time through Efficient Resource Allocation ...
IRJET Journal
 
CONTEXT-AWARE DECISION MAKING SYSTEM FOR MOBILE CLOUD OFFLOADING
CONTEXT-AWARE DECISION MAKING SYSTEM FOR MOBILE CLOUD OFFLOADINGCONTEXT-AWARE DECISION MAKING SYSTEM FOR MOBILE CLOUD OFFLOADING
CONTEXT-AWARE DECISION MAKING SYSTEM FOR MOBILE CLOUD OFFLOADING
IJCNCJournal
 
PIMRC-2012, Sydney, Australia, 28 July, 2012
PIMRC-2012, Sydney, Australia, 28 July, 2012PIMRC-2012, Sydney, Australia, 28 July, 2012
PIMRC-2012, Sydney, Australia, 28 July, 2012
Charith Perera
 
IOT-2016 7-9 Septermber, 2016, Stuttgart, Germany
IOT-2016  7-9 Septermber, 2016, Stuttgart, GermanyIOT-2016  7-9 Septermber, 2016, Stuttgart, Germany
IOT-2016 7-9 Septermber, 2016, Stuttgart, Germany
Charith Perera
 
Designing Cross-Domain Semantic Web of Things Applications
Designing Cross-Domain Semantic Web of Things ApplicationsDesigning Cross-Domain Semantic Web of Things Applications
Designing Cross-Domain Semantic Web of Things Applications
Amélie Gyrard
 
K luo bera_poster
K luo bera_posterK luo bera_poster
K luo bera_poster
lkcozy
 
Use Machine Learning to Get the Most out of Your Big Data Clusters
Use Machine Learning to Get the Most out of Your Big Data ClustersUse Machine Learning to Get the Most out of Your Big Data Clusters
Use Machine Learning to Get the Most out of Your Big Data Clusters
Databricks
 
HICSS-2014-Big Island, Hawaii, United States, 08 January 2014
HICSS-2014-Big Island, Hawaii, United States, 08 January 2014HICSS-2014-Big Island, Hawaii, United States, 08 January 2014
HICSS-2014-Big Island, Hawaii, United States, 08 January 2014
Charith Perera
 
Open data & crowdsourcing of environmental observations in MMEA
Open data & crowdsourcing of environmental observations in MMEA Open data & crowdsourcing of environmental observations in MMEA
Open data & crowdsourcing of environmental observations in MMEA
CLIC Innovation Ltd
 
MongoDB and the Internet of Things
MongoDB and the Internet of ThingsMongoDB and the Internet of Things
MongoDB and the Internet of Things
MongoDB
 
SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...
SPARK USE CASE-  Distributed Reinforcement Learning for Electricity Market Bi...SPARK USE CASE-  Distributed Reinforcement Learning for Electricity Market Bi...
SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...
Impetus Technologies
 
Assisting IoT Projects and Developers in Designing Interoperable Semantic Web...
Assisting IoT Projects and Developers in Designing Interoperable Semantic Web...Assisting IoT Projects and Developers in Designing Interoperable Semantic Web...
Assisting IoT Projects and Developers in Designing Interoperable Semantic Web...
Amélie Gyrard
 

What's hot (15)

MobiDE’2012, Phoenix, AZ, United States, 20 May, 2012
MobiDE’2012, Phoenix, AZ, United States, 20 May, 2012MobiDE’2012, Phoenix, AZ, United States, 20 May, 2012
MobiDE’2012, Phoenix, AZ, United States, 20 May, 2012
 
Enrich Machine-to-Machine Data with Semantic Web Technologies for Cross-Domai...
Enrich Machine-to-Machine Data with Semantic Web Technologies for Cross-Domai...Enrich Machine-to-Machine Data with Semantic Web Technologies for Cross-Domai...
Enrich Machine-to-Machine Data with Semantic Web Technologies for Cross-Domai...
 
IRJET- Optimization of Completion Time through Efficient Resource Allocation ...
IRJET- Optimization of Completion Time through Efficient Resource Allocation ...IRJET- Optimization of Completion Time through Efficient Resource Allocation ...
IRJET- Optimization of Completion Time through Efficient Resource Allocation ...
 
CONTEXT-AWARE DECISION MAKING SYSTEM FOR MOBILE CLOUD OFFLOADING
CONTEXT-AWARE DECISION MAKING SYSTEM FOR MOBILE CLOUD OFFLOADINGCONTEXT-AWARE DECISION MAKING SYSTEM FOR MOBILE CLOUD OFFLOADING
CONTEXT-AWARE DECISION MAKING SYSTEM FOR MOBILE CLOUD OFFLOADING
 
PIMRC-2012, Sydney, Australia, 28 July, 2012
PIMRC-2012, Sydney, Australia, 28 July, 2012PIMRC-2012, Sydney, Australia, 28 July, 2012
PIMRC-2012, Sydney, Australia, 28 July, 2012
 
IOT-2016 7-9 Septermber, 2016, Stuttgart, Germany
IOT-2016  7-9 Septermber, 2016, Stuttgart, GermanyIOT-2016  7-9 Septermber, 2016, Stuttgart, Germany
IOT-2016 7-9 Septermber, 2016, Stuttgart, Germany
 
Designing Cross-Domain Semantic Web of Things Applications
Designing Cross-Domain Semantic Web of Things ApplicationsDesigning Cross-Domain Semantic Web of Things Applications
Designing Cross-Domain Semantic Web of Things Applications
 
eChallenges_e2011_JS
eChallenges_e2011_JSeChallenges_e2011_JS
eChallenges_e2011_JS
 
K luo bera_poster
K luo bera_posterK luo bera_poster
K luo bera_poster
 
Use Machine Learning to Get the Most out of Your Big Data Clusters
Use Machine Learning to Get the Most out of Your Big Data ClustersUse Machine Learning to Get the Most out of Your Big Data Clusters
Use Machine Learning to Get the Most out of Your Big Data Clusters
 
HICSS-2014-Big Island, Hawaii, United States, 08 January 2014
HICSS-2014-Big Island, Hawaii, United States, 08 January 2014HICSS-2014-Big Island, Hawaii, United States, 08 January 2014
HICSS-2014-Big Island, Hawaii, United States, 08 January 2014
 
Open data & crowdsourcing of environmental observations in MMEA
Open data & crowdsourcing of environmental observations in MMEA Open data & crowdsourcing of environmental observations in MMEA
Open data & crowdsourcing of environmental observations in MMEA
 
MongoDB and the Internet of Things
MongoDB and the Internet of ThingsMongoDB and the Internet of Things
MongoDB and the Internet of Things
 
SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...
SPARK USE CASE-  Distributed Reinforcement Learning for Electricity Market Bi...SPARK USE CASE-  Distributed Reinforcement Learning for Electricity Market Bi...
SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...
 
Assisting IoT Projects and Developers in Designing Interoperable Semantic Web...
Assisting IoT Projects and Developers in Designing Interoperable Semantic Web...Assisting IoT Projects and Developers in Designing Interoperable Semantic Web...
Assisting IoT Projects and Developers in Designing Interoperable Semantic Web...
 

Viewers also liked

Waternomics Results and Impact
Waternomics Results and ImpactWaternomics Results and Impact
Waternomics Results and Impact
Wassim Derguech
 
Waternomics - ICT For Water Management
Waternomics - ICT For Water ManagementWaternomics - ICT For Water Management
Waternomics - ICT For Water Management
Wassim Derguech
 
(Big) Data Analytics for Environmental Sustainability
(Big) Data Analytics for Environmental Sustainability(Big) Data Analytics for Environmental Sustainability
(Big) Data Analytics for Environmental Sustainability
Wassim Derguech
 
Modelling and Querying Sensor Services using Ontologies
Modelling and Querying Sensor Services using OntologiesModelling and Querying Sensor Services using Ontologies
Modelling and Querying Sensor Services using Ontologies
Wassim Derguech
 
Organizing Capabilities using Formal Concept Analysis
Organizing Capabilities using Formal Concept AnalysisOrganizing Capabilities using Formal Concept Analysis
Organizing Capabilities using Formal Concept Analysis
Wassim Derguech
 
An Automation Support for Creating Configurable Process Models
An Automation Support for Creating Configurable Process ModelsAn Automation Support for Creating Configurable Process Models
An Automation Support for Creating Configurable Process Models
Wassim Derguech
 
Business Capability-centric Management of Services and Business Process Models
Business Capability-centric Management of Services and Business Process ModelsBusiness Capability-centric Management of Services and Business Process Models
Business Capability-centric Management of Services and Business Process Models
Wassim Derguech
 
Web Service Capability Meta Model
Web Service Capability Meta ModelWeb Service Capability Meta Model
Web Service Capability Meta Model
Wassim Derguech
 
Waternomics: Making Sense of Data
Waternomics: Making Sense of DataWaternomics: Making Sense of Data
Waternomics: Making Sense of Data
Wassim Derguech
 
Ubiquitous Service Capability Modeling and Similarity Based Searching
Ubiquitous Service Capability Modeling and Similarity Based SearchingUbiquitous Service Capability Modeling and Similarity Based Searching
Ubiquitous Service Capability Modeling and Similarity Based SearchingWassim Derguech
 
ORGpedia: The Open Organizational Data Project
ORGpedia: The Open Organizational Data ProjectORGpedia: The Open Organizational Data Project
ORGpedia: The Open Organizational Data Project
3 Round Stones
 

Viewers also liked (11)

Waternomics Results and Impact
Waternomics Results and ImpactWaternomics Results and Impact
Waternomics Results and Impact
 
Waternomics - ICT For Water Management
Waternomics - ICT For Water ManagementWaternomics - ICT For Water Management
Waternomics - ICT For Water Management
 
(Big) Data Analytics for Environmental Sustainability
(Big) Data Analytics for Environmental Sustainability(Big) Data Analytics for Environmental Sustainability
(Big) Data Analytics for Environmental Sustainability
 
Modelling and Querying Sensor Services using Ontologies
Modelling and Querying Sensor Services using OntologiesModelling and Querying Sensor Services using Ontologies
Modelling and Querying Sensor Services using Ontologies
 
Organizing Capabilities using Formal Concept Analysis
Organizing Capabilities using Formal Concept AnalysisOrganizing Capabilities using Formal Concept Analysis
Organizing Capabilities using Formal Concept Analysis
 
An Automation Support for Creating Configurable Process Models
An Automation Support for Creating Configurable Process ModelsAn Automation Support for Creating Configurable Process Models
An Automation Support for Creating Configurable Process Models
 
Business Capability-centric Management of Services and Business Process Models
Business Capability-centric Management of Services and Business Process ModelsBusiness Capability-centric Management of Services and Business Process Models
Business Capability-centric Management of Services and Business Process Models
 
Web Service Capability Meta Model
Web Service Capability Meta ModelWeb Service Capability Meta Model
Web Service Capability Meta Model
 
Waternomics: Making Sense of Data
Waternomics: Making Sense of DataWaternomics: Making Sense of Data
Waternomics: Making Sense of Data
 
Ubiquitous Service Capability Modeling and Similarity Based Searching
Ubiquitous Service Capability Modeling and Similarity Based SearchingUbiquitous Service Capability Modeling and Similarity Based Searching
Ubiquitous Service Capability Modeling and Similarity Based Searching
 
ORGpedia: The Open Organizational Data Project
ORGpedia: The Open Organizational Data ProjectORGpedia: The Open Organizational Data Project
ORGpedia: The Open Organizational Data Project
 

Similar to An Autonomic Approach to Real-Time Predictive Analytics using Open Data and Internet of Things

Semantics in Sensor Networks
Semantics in Sensor NetworksSemantics in Sensor Networks
Semantics in Sensor Networks
Oscar Corcho
 
Weather Station Using IoT
Weather Station Using IoT Weather Station Using IoT
Weather Station Using IoT
fatinfaqihah5124
 
weatherstationusingiotfyp-190505230304 (1).pdf
weatherstationusingiotfyp-190505230304 (1).pdfweatherstationusingiotfyp-190505230304 (1).pdf
weatherstationusingiotfyp-190505230304 (1).pdf
TanishJadh
 
Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...
Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...
Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...
Luigi Vanfretti
 
Weather Station Data Publication at Irstea: an implementation Report.
Weather Station Data Publication at Irstea: an implementation Report.  Weather Station Data Publication at Irstea: an implementation Report.
Weather Station Data Publication at Irstea: an implementation Report.
catherine roussey
 
resume v 5.0
resume v 5.0resume v 5.0
resume v 5.0Ye Xu
 
GurminderBharani_Masters_Thesis
GurminderBharani_Masters_ThesisGurminderBharani_Masters_Thesis
GurminderBharani_Masters_Thesisbharanigurminder
 
Io t technologies_ppt-2
Io t technologies_ppt-2Io t technologies_ppt-2
Io t technologies_ppt-2
achakracu
 
IRJET - Intelligent Weather Forecasting using Machine Learning Techniques
IRJET -  	  Intelligent Weather Forecasting using Machine Learning TechniquesIRJET -  	  Intelligent Weather Forecasting using Machine Learning Techniques
IRJET - Intelligent Weather Forecasting using Machine Learning Techniques
IRJET Journal
 
Dynamic Integrations of Crop Data and Corresponding Meteorological Data based...
Dynamic Integrations of Crop Data and Corresponding Meteorological Data based...Dynamic Integrations of Crop Data and Corresponding Meteorological Data based...
Dynamic Integrations of Crop Data and Corresponding Meteorological Data based...
AIMS (Agricultural Information Management Standards)
 
Sensor Fault Detection in IoT System Using Machine Learning
Sensor Fault Detection in IoT System Using Machine LearningSensor Fault Detection in IoT System Using Machine Learning
Sensor Fault Detection in IoT System Using Machine Learning
IRJET Journal
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
inside-BigData.com
 
Journals analysis ppt
Journals analysis pptJournals analysis ppt
Journals analysis ppt
Muhammad Heikal
 
Integrating wireless sensor network into cloud services for real time data co...
Integrating wireless sensor network into cloud services for real time data co...Integrating wireless sensor network into cloud services for real time data co...
Integrating wireless sensor network into cloud services for real time data co...Rajeev Piyare
 
Fi cloudpresentationgyrardaugust2015 v2
Fi cloudpresentationgyrardaugust2015 v2Fi cloudpresentationgyrardaugust2015 v2
Fi cloudpresentationgyrardaugust2015 v2
Amélie Gyrard
 
Svm Classifier Algorithm for Data Stream Mining Using Hive and R
Svm Classifier Algorithm for Data Stream Mining Using Hive and RSvm Classifier Algorithm for Data Stream Mining Using Hive and R
Svm Classifier Algorithm for Data Stream Mining Using Hive and R
IRJET Journal
 
DATI, AI E ROBOTICA @POLITO
DATI, AI E ROBOTICA @POLITODATI, AI E ROBOTICA @POLITO
DATI, AI E ROBOTICA @POLITO
MarcoMellia
 

Similar to An Autonomic Approach to Real-Time Predictive Analytics using Open Data and Internet of Things (20)

Semantics in Sensor Networks
Semantics in Sensor NetworksSemantics in Sensor Networks
Semantics in Sensor Networks
 
Weather Station Using IoT
Weather Station Using IoT Weather Station Using IoT
Weather Station Using IoT
 
weatherstationusingiotfyp-190505230304 (1).pdf
weatherstationusingiotfyp-190505230304 (1).pdfweatherstationusingiotfyp-190505230304 (1).pdf
weatherstationusingiotfyp-190505230304 (1).pdf
 
finalDraftPoster
finalDraftPosterfinalDraftPoster
finalDraftPoster
 
Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...
Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...
Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...
 
Weather Station Data Publication at Irstea: an implementation Report.
Weather Station Data Publication at Irstea: an implementation Report.  Weather Station Data Publication at Irstea: an implementation Report.
Weather Station Data Publication at Irstea: an implementation Report.
 
resume v 5.0
resume v 5.0resume v 5.0
resume v 5.0
 
GurminderBharani_Masters_Thesis
GurminderBharani_Masters_ThesisGurminderBharani_Masters_Thesis
GurminderBharani_Masters_Thesis
 
Io t technologies_ppt-2
Io t technologies_ppt-2Io t technologies_ppt-2
Io t technologies_ppt-2
 
IRJET - Intelligent Weather Forecasting using Machine Learning Techniques
IRJET -  	  Intelligent Weather Forecasting using Machine Learning TechniquesIRJET -  	  Intelligent Weather Forecasting using Machine Learning Techniques
IRJET - Intelligent Weather Forecasting using Machine Learning Techniques
 
Dynamic integrations of crop data and corresponding meteorological data based...
Dynamic integrations of crop data and corresponding meteorological data based...Dynamic integrations of crop data and corresponding meteorological data based...
Dynamic integrations of crop data and corresponding meteorological data based...
 
Dynamic Integrations of Crop Data and Corresponding Meteorological Data based...
Dynamic Integrations of Crop Data and Corresponding Meteorological Data based...Dynamic Integrations of Crop Data and Corresponding Meteorological Data based...
Dynamic Integrations of Crop Data and Corresponding Meteorological Data based...
 
Sensor Fault Detection in IoT System Using Machine Learning
Sensor Fault Detection in IoT System Using Machine LearningSensor Fault Detection in IoT System Using Machine Learning
Sensor Fault Detection in IoT System Using Machine Learning
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
Journals analysis ppt
Journals analysis pptJournals analysis ppt
Journals analysis ppt
 
Integrating wireless sensor network into cloud services for real time data co...
Integrating wireless sensor network into cloud services for real time data co...Integrating wireless sensor network into cloud services for real time data co...
Integrating wireless sensor network into cloud services for real time data co...
 
Fi cloudpresentationgyrardaugust2015 v2
Fi cloudpresentationgyrardaugust2015 v2Fi cloudpresentationgyrardaugust2015 v2
Fi cloudpresentationgyrardaugust2015 v2
 
Svm Classifier Algorithm for Data Stream Mining Using Hive and R
Svm Classifier Algorithm for Data Stream Mining Using Hive and RSvm Classifier Algorithm for Data Stream Mining Using Hive and R
Svm Classifier Algorithm for Data Stream Mining Using Hive and R
 
DATI, AI E ROBOTICA @POLITO
DATI, AI E ROBOTICA @POLITODATI, AI E ROBOTICA @POLITO
DATI, AI E ROBOTICA @POLITO
 
rerngvit_phd_seminar
rerngvit_phd_seminarrerngvit_phd_seminar
rerngvit_phd_seminar
 

Recently uploaded

Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
StarCompliance.io
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
NABLAS株式会社
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
alex933524
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 

Recently uploaded (20)

Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 

An Autonomic Approach to Real-Time Predictive Analytics using Open Data and Internet of Things

  • 1. An Autonomic Approach to Real-Time Predictive Analytics using Open Data and Internet of Things Wassim Derguech, Eanna Burke, Edward Curry Insight Centre For Data Analytics National University of Ireland, Galway UIC 2014 - The 11th IEEE International Conference on Ubiquitous Intelligence and Computing December 9-12, 2014 Ayodya Resort, Bali, Indonesia
  • 2. Motivation: Internet of Things (IOT) Smart Homes, Grids, Cities… by 2020 50 billion devices connected to mobile networks (OECD, 2012) Today’s Internet of Things “behaviour”(Abbas M. Keynote Presentation at UiTM WSN Seminar 2012) 74% Real-time location- based info. 71% Payment apps. 60% Weather apps. 60% Want connected system in car. 29% Health apps. 51% Maps/Naviga tion/Search
  • 3. Motivation: Open Data • Open Data, not simply big data, will be driver for growth, ingenuity, and innovation in the UK economy. (Deloitte Analytics, 2012) • $1.5 Billion: US National Weather Service supporting a private weather industry per year (CapGemini 2014) • $32 Billion: Estimated direct impact of Open Data in 2010 on the EU27, annual growth of 7% (Vickery 2011) • $140 Billion: Estimated aggregate direct and indirect impact across EU27 (Vickery 2011) • $3 Trillion: Estimated annual economic potential across seven domains. (McKinsey 2013) [Deirdre Lee, Presentation from The Open Group Conference in London, 22 October 2014]
  • 4. Problem statement and contribution • Open Data is becoming more and more available and valuable! • Both public and private data can be used to drive decision making.  Challenge: selection of the best Open Data and IoT source to support predictive analytics. • Our contribution: (1) data management: collection, filtering, and warehousing (2) data analytics: source selection and predictive analytics • Our promise: An autonomic system Self-Configuration Self-Optimization Self-Healing
  • 5. Open Data Weather Forecast Web Services Internet of Things Building Power Prediction System Sensor Data Building Power consumption Learning  relationships  between variables Use the learned  relationships for  prediction Use Case: predicting energy usage using sensor and weather data
  • 6. Open Data Management Any data source including: sensors, web services, IoT, etc. For each data source a collector is required depending on their communication protocol. Receive data from collectors and transform it using a predefined format/RDF schema/Ontology. Persist the data into a local RDF store.
  • 7. Ontologies in use for our use case • Ontologies constitute formal specifications for shared conceptualisations  foster reuse of existing assets • Criteria for choosing the right ontology for our use case: •In use : proven to be good in practice and well documented for easy learning •Relevant: describing surface readings rather than space or marine conditions •Numerical: in order to be suitable for machine learning. Avoid vague terms such as “hot” or “humid” Sensor Data Semantic Sensor Network (SSN) Weather Observation AEMET Weather Observation Ontology Weather Prediction Meteo Ontology
  • 9. Source Selector • Evaluates sources and builds a prediction model from the best Open Data and IoT source • Requirements for machine learning algorithm: • Reasonable accuracy • Quick prediction model generation • Work well with little data • Work well with nominal and numerical inputs • Low configuration • Give insights into the factors influencing predictions
  • 10. Reselection Controller • Triggers the re- selection of data sources for building a new prediction model • Criteria for a re- selection
  • 11. Error Comparison Module • Uses the prediction model generated by the Source Selector (6) for generating energy usage predictions • Sends predicted data to the UI (7) to be displayed to the user • Compares the predicted and current energy usage values and generates the error rates produced (1)
  • 12. Implementation: Data Sources Weather Observations Local Weather Station within the University Open Weather Map web service Weather Forecast Ham Weather Forecast Web Service YR.no Weather Forecast Web Service Sensor Data Building Power consumption
  • 13. Implementation: User Interface • The User Interface is the consumer of produced predictions. • Used only for displaying results • Not used for providing any inputs or configurations
  • 14. Evaluation: Initiation of the System Object: Evaluate how quick the system starts to provide reasonable results Very high error rate at the initiation of the system Better results due to 1 day historical data First non- working day (Saturday) Conclusion: • Initial very high error rates, reduced over time depending on the length of the historical data • The system runs for 12 days to reach a steady error rate
  • 15. Without introducing faults Usage ratio between August 23 at 16:23 and 26th at 18:11 64% 36% Evaluation: Partial Failure of a Weather Station Object: Evaluate the autonomous aspect of the system (self-healing and self-configuration) Local Weather Station within the University Open Weather Map web service Close proximity of the weather station provides better results Introducing Faults Temperature = 0.0  4 hours to select an alternative station  Possible improvement: update the reselection criteria
  • 16. Evaluation: Machine Learning Algorithms Testing Object: What is the most suitable machine learning algorithm for effective predictive analytics Experimental Set-Up • 1 week data as short term data set • 5 weeks data as a long term data set • Observation: building main incoming power • Weather observation: local NUIG station • Training set = 66% vs. test set = 34% Four machine learning algorithms in WEKA 1. SMOReg – Support Vector Machine for Regression 2. 1 hidden layer back propagation ANN 3. 2 hidden layer back propagation ANN 4. Linear Regression
  • 17. Evaluation: Machine Learning Algorithms Testing DATASET SIZE = 672 – TRAINING DATA SIZE = 443 – TEST DATOA SET SIZE = 229 MA Error RMSE Time Corr. Coeff SMOReg 5.3638 6.9759 0.851 0.6233 1 layer ANN 2.7606 3.6242 47.004 0.9158 2 layer ANN 3.0473 4.1506 50.842 0.8961 Linear Regression 4.8586 5.9283 0.759 0.7396 MA Error RMSE Time Corr. Coeff SMOReg 4.6755 6.5965 32.4 0.8054 1 layer ANN 3.3332 4.5841 229.7 0.9162 2 layer ANN 3.7566 4.7279 247.0 0.9221 Linear Regression 4.7579 6.0173 2.4 0.8396 DATASET SIZE = 3298 – TRAINING DATA SIZE = 2176 – TEST DATOA SET SIZE = 1122 ANN more accurate but very slow SMOReg and LR took roughly the same time but LR more accurate
  • 18. Conclusion Our contribution: an architecture for evaluating open data sources for real-time predictive analytics comprising of: (1) data management: collection, filtering, and warehousing  Multiple data sources  Reused existing vocabularies (2) data analytics: source selection and predictive analytics  Tested with short and long term data sets  No configuration input required  Results shown on a simple UI • Our promise: An autonomic system:  Experiment 1 = self-configuration and self-optimization  Experiment 2 = self-healing and self-optimization
  • 19. Future Work • Fully autonomous system:  discover data source autonomously • ANN has high accuracy but very slow:  investigate solutions for reducing the time for creating ANNs • Using Open Data for better analytics  working days vs. non-working days Contact details Wassim Derguech, Eanna Burke, Edward Curry Insight Centre For Data Analytics -National University of Ireland, Galway wassim.derguech@insight-centre.org , eannaburke1@gmail.com edward.curry@insight-centre.org