SlideShare a Scribd company logo
Quality of Result-aware data analytics
Hong-Linh Truong
Distributed Systems Group,
Vienna University of Technology
truong@dsg.tuwien.ac.at
http://dsg.tuwien.ac.at/staff/truong
1ASE Summer 2015
Advanced Services Engineering,
Summer 2015
Advanced Services Engineering,
Summer 2015
Outline
 Data analytics: workflow structures and systems
 Enable quality of results for data analytics
 Quality of data in data analytics workflows
ASE Summer 2015 2
Data Analytics
Conceptual View
ASE Summer 2015 3
Data Processing FrameworksData Processing Frameworks
Streaming/Online
Data Processing
Batch Data
Processing
Hybrid Data
Processing
Static data(Near)
realtime data
Decision Data Analysis
Analytics,
Tools,
Processes &
Models
Data analytics workflows
ASE Summer 2015 4
Things
People
DaaSDaaS
Computation
Service
Computation
Service
We use the term „workflow“ in a
generic meaning!!!
Different views of (data analytics)
workflow systems
5
View
Domain
view
Business
Workflow
Scientific/E-
science
Workflow
Data/Computati
on view
Data
intensive
workflow
Computatio
n intensive
workflow
Human-
intensive
workflow
System
view
Grid
workflow
Enterprise
workflow
Cloud-
based
workflow
Execution
model
view
Service-
based
workflow
Batch job
workflow
Interactive
workflow
ASE Summer 2015
Pros and cons of (data analytics)
workflow systems
ASE Summer 2015 6
 Ian J. Taylor, Ewa Deelman, Dennis B. Gannon, and Matthew Shields. 2006. Workflows for E-Science: Scientific
Workflows for Grids. Springer-Verlag New York, Inc., Secaucus, NJ, USA.
 Bertram Ludäscher, Mathias Weske, Timothy M. McPhillips, Shawn Bowers: Scientific Workflows: Business as
Usual? BPM 2009: 31-47
 Mirko Sonntag, Dimka Karastoyanova, Frank Leymann: The Missing Features of Workflow Systems for Scientific
Computations. Software Engineering (Workshops) 2010: 209-216
 Lavanya Ramakrishnan and Beth Plale. 2010. A multi-dimensional classification model for scientific workflow
characteristics. In Proceedings of the 1st International Workshop on Workflow Approaches to New Data-centric
Science (Wands '10). ACM, New York, NY, USA, , Article 4 , 12 pages. DOI=10.1145/1833398.1833402
http://doi.acm.org/10.1145/1833398.1833402
 Jia Yu and Rajkumar Buyya. 2005. A taxonomy of scientific workflow systems for grid computing. SIGMOD Rec. 34,
3 (September 2005), 44-49. DOI=10.1145/1084805.1084814 http://doi.acm.org/10.1145/1084805.1084814
 Ian J. Taylor, Ewa Deelman, Dennis B. Gannon, and Matthew Shields. 2006. Workflows for E-Science: Scientific
Workflows for Grids. Springer-Verlag New York, Inc., Secaucus, NJ, USA.
 Bertram Ludäscher, Mathias Weske, Timothy M. McPhillips, Shawn Bowers: Scientific Workflows: Business as
Usual? BPM 2009: 31-47
 Mirko Sonntag, Dimka Karastoyanova, Frank Leymann: The Missing Features of Workflow Systems for Scientific
Computations. Software Engineering (Workshops) 2010: 209-216
 Lavanya Ramakrishnan and Beth Plale. 2010. A multi-dimensional classification model for scientific workflow
characteristics. In Proceedings of the 1st International Workshop on Workflow Approaches to New Data-centric
Science (Wands '10). ACM, New York, NY, USA, , Article 4 , 12 pages. DOI=10.1145/1833398.1833402
http://doi.acm.org/10.1145/1833398.1833402
 Jia Yu and Rajkumar Buyya. 2005. A taxonomy of scientific workflow systems for grid computing. SIGMOD Rec. 34,
3 (September 2005), 44-49. DOI=10.1145/1084805.1084814 http://doi.acm.org/10.1145/1084805.1084814
Hierarchical view of workflows (1)
mProject1Service.java
public void mProject1() {
}
mProject1Service.java
public void mProject1() {
}
WorkflowWorkflow
A();A();
<parallel>
</parallel>
<parallel>
</parallel>
Workflow Region nWorkflow Region n
Activity mActivity m
Invoked Application mInvoked Application m
Code
Region 1
Code
Region 1
Code
Region q
Code
Region q
Code
Region …
Code
Region …
<activity name="mProject1">
<executable name="mProject1"/>
</activity>
<activity name="mProject1">
<executable name="mProject1"/>
</activity>
<activity name="mProject2">
<executable name="mProject2"/>
</activity>
<activity name="mProject2">
<executable name="mProject2"/>
</activity>
while () {
...
}
while () {
...
}
Hong Linh Truong, Schahram Dustdar, Thomas Fahringer:
Performance metrics and ontologies for Grid workflows. Future
Generation Comp. Syst. 23(6): 760-772 (2007)
Hong Linh Truong, Schahram Dustdar, Thomas Fahringer:
Performance metrics and ontologies for Grid workflows. Future
Generation Comp. Syst. 23(6): 760-772 (2007)
ASE Summer 2015 7
Representing and programming
data analytics workflows
 Programming languages
 General- and specific-purpose programming
languages, such as Java, Python, Swift
 Programming models, such as MapReduce, Hadoop,
Complex event processing, Spark
 Descriptive languages
 BPEL and several languages designed for specific
workflow engines
 They can also be combined
8ASE Summer 2015
Data analytics workflow execution
models
ASE Summer 2015 9
Data analytics
workflows
Data analytics
workflows Execution EngineExecution Engine
Local SchedulerLocal Scheduler
jobjob jobjob jobjob jobjob
Web
serviceWeb
serviceWeb
service
Web
service
People
Data analytics workflow execution
models
ASE Summer 2015 10
Data analytics
workflows
Data analytics
workflows
Execution EngineExecution Engine
Service
unit
Local
input
data
Analytics
Results
Web service
MapReduce/Hadoop
Sub-Workflow
MPI
Other solutions
Servers/Cloud/Cluster
Examples of systems and
frameworks for data analytics
workflows
ASE Summer 2015 11
ASKALONASKALON
KEPLERKEPLER
TAVERNATAVERNA
TRIDENTTRIDENT
Apache ODE +
WS-BPEL
Apache ODE +
WS-BPEL
PegasusPegasus
JOperaJOperaADEPTADEPT
MapReduce/HadoopMapReduce/Hadoop
SwiftSwiftRR
Some examples (1)
ASE Summer 2015 12
Source: Gideon Juve, Ewa Deelman, G. Bruce Berriman, Benjamin P. Berman, Philip Maechling: An Evaluation of the
Cost and Performance of Scientific Workflows on Amazon EC2. J. Grid Comput. 10(1): 5-21 (2012)
Source: Gideon Juve, Ewa Deelman, G. Bruce Berriman, Benjamin P. Berman, Philip Maechling: An Evaluation of the
Cost and Performance of Scientific Workflows on Amazon EC2. J. Grid Comput. 10(1): 5-21 (2012)
Some examples (2)
ASE Summer 2015 13
Source: http://www.dps.uibk.ac.at/projects/brokerage/Source: http://www.dps.uibk.ac.at/projects/brokerage/
Some examples (3)
ASE Summer 2015 14
Source: Cesare Pautasso, Thomas Heinis, Gustavo Alonso: JOpera: Autonomic Service
Orchestration. IEEE Data Eng. Bull. 29(3): 32-39 (2006)
Source: Cesare Pautasso, Thomas Heinis, Gustavo Alonso: JOpera: Autonomic Service
Orchestration. IEEE Data Eng. Bull. 29(3): 32-39 (2006)
Some examples (4)
ASE Summer 2015 15
Source: Sudipto Das, Yannis Sismanis, Kevin S. Beyer, Rainer Gemulla, Peter J. Haas, and John McPherson. 2010.
Ricardo: integrating R and Hadoop. In Proceedings of the 2010 ACM SIGMOD International Conference on Management
of data (SIGMOD '10). ACM, New York, NY, USA, 987-998. DOI=10.1145/1807167.1807275
http://doi.acm.org/10.1145/1807167.1807275
Source: Sudipto Das, Yannis Sismanis, Kevin S. Beyer, Rainer Gemulla, Peter J. Haas, and John McPherson. 2010.
Ricardo: integrating R and Hadoop. In Proceedings of the 2010 ACM SIGMOD International Conference on Management
of data (SIGMOD '10). ACM, New York, NY, USA, 987-998. DOI=10.1145/1807167.1807275
http://doi.acm.org/10.1145/1807167.1807275
QUALITY OF RESULTS
ASE Summer 2015 16
Recall - QoR
 Characterize the results of analytics processes
 Different elements of QoR
 Performance
 Data quality
 Cost
 Form/data format of output results
 Etc.
 Customer: expects QoR
 Provider: offers QoR and enforces QoR
ASE Summer 2015 17
Performance and Data Quality
Aspects
18
Data Analytics
Data in
Data out
Executed on
Analytics
Processes
uses
Execution time?
Performance Overhead?
Memory Consumption?
Is the data good
enough?
How bad data
impacts on
performance?
Is the data good enough
to be stored and shared?
Data quality metrics and models are
strongly domain-specific
Data quality metrics and models are
strongly domain-specific
Which processes should
be used?
ASE Summer 2015 18
Model QoR and analytics processes
19ASE Summer 2015
Hong Linh Truong, Schahram Dustdar: Principles of Software-Defined Elastic Systems for Big Data Analytics. IC2E
2014: 562-567
Hong Linh Truong, Schahram Dustdar: Principles of Software-Defined Elastic Systems for Big Data Analytics. IC2E
2014: 562-567
SO HOW DO WE ENABLE
QOR-AWARE ANALYTICS?
ASE Summer 2015 20
Solutions
 Computational resources provisioning?
 Replication of analytics ?
 Performance and cost measurement and
optimization?
 Improve quality of input data ?
 Improve the quality of output data?
ASE Summer 2015 21
22
Hong Linh Truong, Peter Brunner, Vlad Nae, Thomas Fahringer: DIPAS: A distributed performance analysis service for
grid service-based workflows. Future Generation Comp. Syst. 25(4): 385-398 (2009)
Hong Linh Truong, Peter Brunner, Vlad Nae, Thomas Fahringer: DIPAS: A distributed performance analysis service for
grid service-based workflows. Future Generation Comp. Syst. 25(4): 385-398 (2009)
Well-addressed concerns --
performance
ASE Summer 2015 22
Well-addressed concerns –
performance/cost
ASE Summer 2015 23
Source: David Chiu, Sagar Deshpande, Gagan Agrawal, Rongxing Li: Cost and accuracy sensitive dynamic workflow
composition over grid environments. GRID 2008: 9-16
Source: David Chiu, Sagar Deshpande, Gagan Agrawal, Rongxing Li: Cost and accuracy sensitive dynamic workflow
composition over grid environments. GRID 2008: 9-16
QUALITY OF DATA IN DATA
ANALYTICS WORKFLOWS
ASE Summer 2015 24
Very little support
 Qurator workbench
 “Personal quality models” can be expressed and
embedded into query processors or workflows.
 Assume that quality evidence is presented
 Kepler
 A data quality monitor allows user to specify quality
thresholds.
 Expect that rules can be used to control the execution
based on quality.
ASE Summer 2015 25
P Missier, S M Embury, M Greenwood, A D Preece, & B Jin, Managing Information Quality in e-Science: the Qurator
Workbench, Proc ACM International Conference on Management of Data (SIGMOD 2007), ACM Press, pages 1150-
1152, 2007.
Aisa Na’im, Daniel Crawl,Maria Indrawan, Ilkay Altintas, and Shulei Sun. Monitoring data quality in kepler. In Salim Hariri
and Kate Keahey, editors, HPDC, pages 560–564. ACM, 2010.
P Missier, S M Embury, M Greenwood, A D Preece, & B Jin, Managing Information Quality in e-Science: the Qurator
Workbench, Proc ACM International Conference on Management of Data (SIGMOD 2007), ACM Press, pages 1150-
1152, 2007.
Aisa Na’im, Daniel Crawl,Maria Indrawan, Ilkay Altintas, and Shulei Sun. Monitoring data quality in kepler. In Salim Hariri
and Kate Keahey, editors, HPDC, pages 560–564. ACM, 2010.
Research questions
 What are main QoD metrics, what are the relationship between QoD
metrics and other service level objectives, and what are their roles
and possible trade-offs?
 How to support different domain-specific QoD models and link them
to workflow structures?
 How to model, evaluate and estimate QoD associated with data
movement into, within, and out to workflows? When and where
software or scientists can perform automatic or manual QoD
measurement and analysis
 How to optimize the workflow composition and execution based on
QoD specification?
 How does QoD impact on the provisioning of data services,
computational services and supporting services?
ASE Summer 2015 26
Approach
ASE Summer 2015 27
Core models, techniques and algorithms to allow
the modeling and evaluating QoD metrics
Core models, techniques and algorithms to allow
the modeling and evaluating QoD metrics
QoD-aware composition and executionQoD-aware composition and execution
QoD-aware service provisioning and
infrastructure optimization
QoD-aware service provisioning and
infrastructure optimization
Modeling and evaluating QoD
metrics for data analytics
workflows
ASE Summer 2015 28
QoD-aware optimization for data
analytics workflow composition
and execution
ASE Summer 2015 29
HOW TO INTEGRATE QOD
EVALUATORS? AND WHICH CONCERNS
NEED TO BE CONSIDERED?
ASE Summer 2015 30
QoD metrics evaluation
 Domain-specific metrics
 Need specific tools and expertise for determining
metrics
 Evaluation
 Cannot done by software only: humans are required
 Complex integration model
 Where to put QoD evaluators and why?
 How evaluators obtain the data to be evaluated?
 Impact of QoD evaluation on performance of
data analytics workflows
ASE Summer 2015 31
WHAT KIND OF OPTIMIZATION CAN BE
DONE?
ASE Summer 2015 32
QoD-aware optimization for data
analytics workflows
 Improving quality of results
 Reducing analytics costs and time
 Enabling early failure detection
 Enabling elasticitiy of services provisioning
 Enabling elastic data analytics support
 Etc.
ASE Summer 2015 33
EXAMPLE: QOD-AWARE
SIMULATION WORKFLOWS
ASE Summer 2015 34
35
QoD-aware simulation workflows
Michael Reiter, Hong Linh Truong, Schahram Dustdar, Dimka Karastoyanova, Robert Krause, Frank Leymann, Dieter
Pahr: On Analyzing Quality of Data Influences on Performance of Finite Elements Driven Computational Simulations.
Euro-Par 2012: 793-804
Michael Reiter, Uwe Breitenbücher, Schahram Dustdar, Dimka Karastoyanova, Frank Leymann, Hong Linh Truong: A
Novel Framework for Monitoring and Analyzing Quality of Data in Simulation Workflows. eScience 2011: 105-112
Michael Reiter, Hong Linh Truong, Schahram Dustdar, Dimka Karastoyanova, Robert Krause, Frank Leymann, Dieter
Pahr: On Analyzing Quality of Data Influences on Performance of Finite Elements Driven Computational Simulations.
Euro-Par 2012: 793-804
Michael Reiter, Uwe Breitenbücher, Schahram Dustdar, Dimka Karastoyanova, Frank Leymann, Hong Linh Truong: A
Novel Framework for Monitoring and Analyzing Quality of Data in Simulation Workflows. eScience 2011: 105-112
ASE Summer 2015
Hybrid resources needed for
quality evaluation
 Challenges:
 Subjective and objective evaluation
 Long running processes
 Our approach
 Different QoD measurements
 Human and software tasks
36ASE Summer 2015
37
Evaluating quality of data in
workflows
Michael Reiter, Uwe Breitenbücher, Schahram Dustdar, Dimka Karastoyanova, Frank Leymann, Hong Linh Truong: A
Novel Framework for Monitoring and Analyzing Quality of Data in Simulation Workflows. eScience 2011: 105-112
Michael Reiter, Uwe Breitenbücher, Schahram Dustdar, Dimka Karastoyanova, Frank Leymann, Hong Linh Truong: A
Novel Framework for Monitoring and Analyzing Quality of Data in Simulation Workflows. eScience 2011: 105-112
ASE Summer 2015
QoD Evaluator
 Software-based QoD evaluators
 Can be provided under libraries integrated into
invoked applications
 Web services-based evaluators
 Human-based QoD evaluators
 Built based on the concept human-based services
 Can be interfaces via Human-Task
 Simple mapping at the moment
 Human resources from clouds/crowds
ASE Summer 2015 38
Open issues: quality-of-result
(QoR) driven workflows
 Tradeoffs are challenging
 How to support QoR driven analytics?
 Some basic steps
 Conceptualize expected QoR
 Associate the expected QoR with workflow activities
 Use the expected QoR
 to match/select underlying services (e.g., data sources,
cloud IaaS, etc
 Utilize the expected QoR and the measured QoR
and apply elasticity principles for
 Refine the workflow structure
 Provision computation, network and data
ASE Summer 2015 39
Exercises
 Read mentioned papers
 Discuss pros and cons of descriptive languages
- and programming languages – based data
analytics workflows
 Examine how QoD evaluators can be integrated
into different programming models for QoR-
aware data analytics workflows
 Implement some QoD evaluators
 Develop techniques for determining places
where QoD evaluators can be performed
ASE Summer 2015 40
41
Thanks for
your attention
Hong-Linh Truong
Distributed Systems Group
Vienna University of Technology
truong@dsg.tuwien.ac.at
http://dsg.tuwien.ac.at/staff/truong
ASE Summer 2015

More Related Content

What's hot

Business Intelligence and Big Data in Cloud
Business Intelligence and Big Data in CloudBusiness Intelligence and Big Data in Cloud
Business Intelligence and Big Data in Cloud
Ding Li
 
Conceptual framework for entity integration from multiple data sources - Draz...
Conceptual framework for entity integration from multiple data sources - Draz...Conceptual framework for entity integration from multiple data sources - Draz...
Conceptual framework for entity integration from multiple data sources - Draz...
Institute of Contemporary Sciences
 
Automatic Spatio-temporal Indexing to Integrate and Analyze the Data of an Or...
Automatic Spatio-temporal Indexing to Integrate and Analyze the Data of an Or...Automatic Spatio-temporal Indexing to Integrate and Analyze the Data of an Or...
Automatic Spatio-temporal Indexing to Integrate and Analyze the Data of an Or...
Craig Knoblock
 
Data Imputation by Soft Computing
Data Imputation by Soft ComputingData Imputation by Soft Computing
Data Imputation by Soft Computing
ijtsrd
 
From Artwork to Cyber Attacks: Lessons Learned in Building Knowledge Graphs u...
From Artwork to Cyber Attacks: Lessons Learned in Building Knowledge Graphs u...From Artwork to Cyber Attacks: Lessons Learned in Building Knowledge Graphs u...
From Artwork to Cyber Attacks: Lessons Learned in Building Knowledge Graphs u...
Craig Knoblock
 
Leveraging NLP and Deep Learning for Document Recommendations in the Cloud
Leveraging NLP and Deep Learning for Document Recommendations in the CloudLeveraging NLP and Deep Learning for Document Recommendations in the Cloud
Leveraging NLP and Deep Learning for Document Recommendations in the Cloud
Databricks
 

What's hot (6)

Business Intelligence and Big Data in Cloud
Business Intelligence and Big Data in CloudBusiness Intelligence and Big Data in Cloud
Business Intelligence and Big Data in Cloud
 
Conceptual framework for entity integration from multiple data sources - Draz...
Conceptual framework for entity integration from multiple data sources - Draz...Conceptual framework for entity integration from multiple data sources - Draz...
Conceptual framework for entity integration from multiple data sources - Draz...
 
Automatic Spatio-temporal Indexing to Integrate and Analyze the Data of an Or...
Automatic Spatio-temporal Indexing to Integrate and Analyze the Data of an Or...Automatic Spatio-temporal Indexing to Integrate and Analyze the Data of an Or...
Automatic Spatio-temporal Indexing to Integrate and Analyze the Data of an Or...
 
Data Imputation by Soft Computing
Data Imputation by Soft ComputingData Imputation by Soft Computing
Data Imputation by Soft Computing
 
From Artwork to Cyber Attacks: Lessons Learned in Building Knowledge Graphs u...
From Artwork to Cyber Attacks: Lessons Learned in Building Knowledge Graphs u...From Artwork to Cyber Attacks: Lessons Learned in Building Knowledge Graphs u...
From Artwork to Cyber Attacks: Lessons Learned in Building Knowledge Graphs u...
 
Leveraging NLP and Deep Learning for Document Recommendations in the Cloud
Leveraging NLP and Deep Learning for Document Recommendations in the CloudLeveraging NLP and Deep Learning for Document Recommendations in the Cloud
Leveraging NLP and Deep Learning for Document Recommendations in the Cloud
 

Viewers also liked

Context-aware Programming for Hybrid and Diversity-aware Collective Adaptive ...
Context-aware Programming for Hybrid and Diversity-aware Collective Adaptive ...Context-aware Programming for Hybrid and Diversity-aware Collective Adaptive ...
Context-aware Programming for Hybrid and Diversity-aware Collective Adaptive ...
Hong-Linh Truong
 
TUWien - ASE Summer 2015: Engineering human-based services in elastic systems
TUWien - ASE Summer 2015: Engineering human-based services in elastic systemsTUWien - ASE Summer 2015: Engineering human-based services in elastic systems
TUWien - ASE Summer 2015: Engineering human-based services in elastic systemsHong-Linh Truong
 
Coordination-aware Elasticity
Coordination-aware ElasticityCoordination-aware Elasticity
Coordination-aware Elasticity
Hong-Linh Truong
 
Emerging Dynamic TUW-ASE Summer 2015 - Distributed Systems and Challenges for...
Emerging Dynamic TUW-ASE Summer 2015 - Distributed Systems and Challenges for...Emerging Dynamic TUW-ASE Summer 2015 - Distributed Systems and Challenges for...
Emerging Dynamic TUW-ASE Summer 2015 - Distributed Systems and Challenges for...
Hong-Linh Truong
 
Bridging Socially-Enhanced Virtual Communities
Bridging Socially-Enhanced Virtual CommunitiesBridging Socially-Enhanced Virtual Communities
Bridging Socially-Enhanced Virtual Communities
Daniel Schall
 
Towards Hybrid and Diversity-Aware Collective Adaptive Systems
Towards Hybrid and Diversity-Aware Collective Adaptive SystemsTowards Hybrid and Diversity-Aware Collective Adaptive Systems
Towards Hybrid and Diversity-Aware Collective Adaptive Systems
Smart-Society-Project
 
TUW-ASE-Summer 2015: Advanced Services Engineering - Introduction
TUW-ASE-Summer 2015: Advanced Services Engineering - IntroductionTUW-ASE-Summer 2015: Advanced Services Engineering - Introduction
TUW-ASE-Summer 2015: Advanced Services Engineering - IntroductionHong-Linh Truong
 
Programming Elasticity in the Cloud
Programming Elasticity in the CloudProgramming Elasticity in the Cloud
Programming Elasticity in the CloudHong-Linh Truong
 
SmartSociety – A Platform for Collaborative People-Machine Computation
SmartSociety – A Platform for Collaborative People-Machine ComputationSmartSociety – A Platform for Collaborative People-Machine Computation
SmartSociety – A Platform for Collaborative People-Machine Computation
Hong-Linh Truong
 
ICSOC 2015 Panel: Service Engineering Analytics in the IoT Cloud Systems
ICSOC 2015 Panel: Service Engineering Analytics in the IoT Cloud SystemsICSOC 2015 Panel: Service Engineering Analytics in the IoT Cloud Systems
ICSOC 2015 Panel: Service Engineering Analytics in the IoT Cloud Systems
Hong-Linh Truong
 
On Developing and Operating of Data Elasticity Management Process
On Developing and Operating of Data Elasticity Management ProcessOn Developing and Operating of Data Elasticity Management Process
On Developing and Operating of Data Elasticity Management Process
Hong-Linh Truong
 
Principles for Engineering Elastic IoT Cloud Systems
Principles for Engineering Elastic IoT Cloud SystemsPrinciples for Engineering Elastic IoT Cloud Systems
Principles for Engineering Elastic IoT Cloud SystemsHong-Linh Truong
 
Training Toolkit: Incentive Server - Example
Training Toolkit: Incentive Server - ExampleTraining Toolkit: Incentive Server - Example
Training Toolkit: Incentive Server - Example
Smart-Society-Project
 
TUW-ASE Summer 2015: Advanced service-based data analytics: Models, Elasticit...
TUW-ASE Summer 2015: Advanced service-based data analytics: Models, Elasticit...TUW-ASE Summer 2015: Advanced service-based data analytics: Models, Elasticit...
TUW-ASE Summer 2015: Advanced service-based data analytics: Models, Elasticit...
Hong-Linh Truong
 
Governing Elastic IoT Cloud Systems under Uncertainties
Governing Elastic IoT Cloud Systems under UncertaintiesGoverning Elastic IoT Cloud Systems under Uncertainties
Governing Elastic IoT Cloud Systems under Uncertainties
Hong-Linh Truong
 
TUW-ASE Summer 2015: Data marketplaces: core models and concepts
TUW-ASE Summer 2015: Data marketplaces:  core models and conceptsTUW-ASE Summer 2015: Data marketplaces:  core models and concepts
TUW-ASE Summer 2015: Data marketplaces: core models and concepts
Hong-Linh Truong
 
TUW-ASE Summer 2015: Data as a Service - Models and Data Concerns
TUW-ASE Summer 2015: Data as a Service - Models and Data ConcernsTUW-ASE Summer 2015: Data as a Service - Models and Data Concerns
TUW-ASE Summer 2015: Data as a Service - Models and Data ConcernsHong-Linh Truong
 
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
Hong-Linh Truong
 
On the Elasticity of Social Compute Units @ CAiSE2014
On the Elasticity of Social Compute Units @ CAiSE2014On the Elasticity of Social Compute Units @ CAiSE2014
On the Elasticity of Social Compute Units @ CAiSE2014Mirela Riveni
 
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
Hong-Linh Truong
 

Viewers also liked (20)

Context-aware Programming for Hybrid and Diversity-aware Collective Adaptive ...
Context-aware Programming for Hybrid and Diversity-aware Collective Adaptive ...Context-aware Programming for Hybrid and Diversity-aware Collective Adaptive ...
Context-aware Programming for Hybrid and Diversity-aware Collective Adaptive ...
 
TUWien - ASE Summer 2015: Engineering human-based services in elastic systems
TUWien - ASE Summer 2015: Engineering human-based services in elastic systemsTUWien - ASE Summer 2015: Engineering human-based services in elastic systems
TUWien - ASE Summer 2015: Engineering human-based services in elastic systems
 
Coordination-aware Elasticity
Coordination-aware ElasticityCoordination-aware Elasticity
Coordination-aware Elasticity
 
Emerging Dynamic TUW-ASE Summer 2015 - Distributed Systems and Challenges for...
Emerging Dynamic TUW-ASE Summer 2015 - Distributed Systems and Challenges for...Emerging Dynamic TUW-ASE Summer 2015 - Distributed Systems and Challenges for...
Emerging Dynamic TUW-ASE Summer 2015 - Distributed Systems and Challenges for...
 
Bridging Socially-Enhanced Virtual Communities
Bridging Socially-Enhanced Virtual CommunitiesBridging Socially-Enhanced Virtual Communities
Bridging Socially-Enhanced Virtual Communities
 
Towards Hybrid and Diversity-Aware Collective Adaptive Systems
Towards Hybrid and Diversity-Aware Collective Adaptive SystemsTowards Hybrid and Diversity-Aware Collective Adaptive Systems
Towards Hybrid and Diversity-Aware Collective Adaptive Systems
 
TUW-ASE-Summer 2015: Advanced Services Engineering - Introduction
TUW-ASE-Summer 2015: Advanced Services Engineering - IntroductionTUW-ASE-Summer 2015: Advanced Services Engineering - Introduction
TUW-ASE-Summer 2015: Advanced Services Engineering - Introduction
 
Programming Elasticity in the Cloud
Programming Elasticity in the CloudProgramming Elasticity in the Cloud
Programming Elasticity in the Cloud
 
SmartSociety – A Platform for Collaborative People-Machine Computation
SmartSociety – A Platform for Collaborative People-Machine ComputationSmartSociety – A Platform for Collaborative People-Machine Computation
SmartSociety – A Platform for Collaborative People-Machine Computation
 
ICSOC 2015 Panel: Service Engineering Analytics in the IoT Cloud Systems
ICSOC 2015 Panel: Service Engineering Analytics in the IoT Cloud SystemsICSOC 2015 Panel: Service Engineering Analytics in the IoT Cloud Systems
ICSOC 2015 Panel: Service Engineering Analytics in the IoT Cloud Systems
 
On Developing and Operating of Data Elasticity Management Process
On Developing and Operating of Data Elasticity Management ProcessOn Developing and Operating of Data Elasticity Management Process
On Developing and Operating of Data Elasticity Management Process
 
Principles for Engineering Elastic IoT Cloud Systems
Principles for Engineering Elastic IoT Cloud SystemsPrinciples for Engineering Elastic IoT Cloud Systems
Principles for Engineering Elastic IoT Cloud Systems
 
Training Toolkit: Incentive Server - Example
Training Toolkit: Incentive Server - ExampleTraining Toolkit: Incentive Server - Example
Training Toolkit: Incentive Server - Example
 
TUW-ASE Summer 2015: Advanced service-based data analytics: Models, Elasticit...
TUW-ASE Summer 2015: Advanced service-based data analytics: Models, Elasticit...TUW-ASE Summer 2015: Advanced service-based data analytics: Models, Elasticit...
TUW-ASE Summer 2015: Advanced service-based data analytics: Models, Elasticit...
 
Governing Elastic IoT Cloud Systems under Uncertainties
Governing Elastic IoT Cloud Systems under UncertaintiesGoverning Elastic IoT Cloud Systems under Uncertainties
Governing Elastic IoT Cloud Systems under Uncertainties
 
TUW-ASE Summer 2015: Data marketplaces: core models and concepts
TUW-ASE Summer 2015: Data marketplaces:  core models and conceptsTUW-ASE Summer 2015: Data marketplaces:  core models and concepts
TUW-ASE Summer 2015: Data marketplaces: core models and concepts
 
TUW-ASE Summer 2015: Data as a Service - Models and Data Concerns
TUW-ASE Summer 2015: Data as a Service - Models and Data ConcernsTUW-ASE Summer 2015: Data as a Service - Models and Data Concerns
TUW-ASE Summer 2015: Data as a Service - Models and Data Concerns
 
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
 
On the Elasticity of Social Compute Units @ CAiSE2014
On the Elasticity of Social Compute Units @ CAiSE2014On the Elasticity of Social Compute Units @ CAiSE2014
On the Elasticity of Social Compute Units @ CAiSE2014
 
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
 

Similar to TUW-ASE Summer 2015 - Quality of Result-aware data analytics

TUW - Quality of data-aware data analytics workflows
TUW - Quality of data-aware data analytics workflowsTUW - Quality of data-aware data analytics workflows
TUW - Quality of data-aware data analytics workflowsHong-Linh Truong
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data Science
Pouria Amirian
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data Science
Pouria Amirian
 
Analytics of Performance and Data Quality for Mobile Edge Cloud Applications
Analytics of Performance and Data Quality for Mobile Edge Cloud ApplicationsAnalytics of Performance and Data Quality for Mobile Edge Cloud Applications
Analytics of Performance and Data Quality for Mobile Edge Cloud Applications
Hong-Linh Truong
 
Comparing the performance of a business process: using Excel & Python
Comparing the performance of a business process: using Excel & PythonComparing the performance of a business process: using Excel & Python
Comparing the performance of a business process: using Excel & Python
IRJET Journal
 
TUW-ASE-Summer 2014: Emerging Dynamic Distributed Systems and Challenges for ...
TUW-ASE-Summer 2014: Emerging Dynamic Distributed Systems and Challenges for ...TUW-ASE-Summer 2014: Emerging Dynamic Distributed Systems and Challenges for ...
TUW-ASE-Summer 2014: Emerging Dynamic Distributed Systems and Challenges for ...
Hong-Linh Truong
 
IRJET - Conversion of Unsupervised Data to Supervised Data using Topic Mo...
IRJET -  	  Conversion of Unsupervised Data to Supervised Data using Topic Mo...IRJET -  	  Conversion of Unsupervised Data to Supervised Data using Topic Mo...
IRJET - Conversion of Unsupervised Data to Supervised Data using Topic Mo...
IRJET Journal
 
Benchmarking Techniques for Performance Analysis of Operating Systems and Pro...
Benchmarking Techniques for Performance Analysis of Operating Systems and Pro...Benchmarking Techniques for Performance Analysis of Operating Systems and Pro...
Benchmarking Techniques for Performance Analysis of Operating Systems and Pro...
IRJET Journal
 
حلقة تكنولوجية 11 بحث علمى بعنوان A Systematic Mapping Study for Big Data Str...
حلقة تكنولوجية 11 بحث علمى بعنوان A Systematic Mapping Study for Big Data Str...حلقة تكنولوجية 11 بحث علمى بعنوان A Systematic Mapping Study for Big Data Str...
حلقة تكنولوجية 11 بحث علمى بعنوان A Systematic Mapping Study for Big Data Str...
Adel Sabour
 
Development of Information Extraction for Data Analysis using NLP
Development of Information Extraction for Data Analysis using NLPDevelopment of Information Extraction for Data Analysis using NLP
Development of Information Extraction for Data Analysis using NLP
IRJET Journal
 
A modified k means algorithm for big data clustering
A modified k means algorithm for big data clusteringA modified k means algorithm for big data clustering
A modified k means algorithm for big data clustering
SK Ahammad Fahad
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
Carole Goble
 
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Hong-Linh Truong
 
Ppt for paper id 696 a review of hybrid data mining algorithm for big data mi...
Ppt for paper id 696 a review of hybrid data mining algorithm for big data mi...Ppt for paper id 696 a review of hybrid data mining algorithm for big data mi...
Ppt for paper id 696 a review of hybrid data mining algorithm for big data mi...
Prasanta Paul
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
Sanjay Padhi, Ph.D
 
Provinance in scientific workflows in e science
Provinance in scientific workflows in e scienceProvinance in scientific workflows in e science
Provinance in scientific workflows in e science
bdemchak
 
IRJET- E-MORES: Efficient Multiple Output Regression for Streaming Data
IRJET- E-MORES: Efficient Multiple Output Regression for Streaming DataIRJET- E-MORES: Efficient Multiple Output Regression for Streaming Data
IRJET- E-MORES: Efficient Multiple Output Regression for Streaming Data
IRJET Journal
 

Similar to TUW-ASE Summer 2015 - Quality of Result-aware data analytics (20)

TUW - Quality of data-aware data analytics workflows
TUW - Quality of data-aware data analytics workflowsTUW - Quality of data-aware data analytics workflows
TUW - Quality of data-aware data analytics workflows
 
Resume
ResumeResume
Resume
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data Science
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data Science
 
Analytics of Performance and Data Quality for Mobile Edge Cloud Applications
Analytics of Performance and Data Quality for Mobile Edge Cloud ApplicationsAnalytics of Performance and Data Quality for Mobile Edge Cloud Applications
Analytics of Performance and Data Quality for Mobile Edge Cloud Applications
 
Comparing the performance of a business process: using Excel & Python
Comparing the performance of a business process: using Excel & PythonComparing the performance of a business process: using Excel & Python
Comparing the performance of a business process: using Excel & Python
 
TUW-ASE-Summer 2014: Emerging Dynamic Distributed Systems and Challenges for ...
TUW-ASE-Summer 2014: Emerging Dynamic Distributed Systems and Challenges for ...TUW-ASE-Summer 2014: Emerging Dynamic Distributed Systems and Challenges for ...
TUW-ASE-Summer 2014: Emerging Dynamic Distributed Systems and Challenges for ...
 
IRJET - Conversion of Unsupervised Data to Supervised Data using Topic Mo...
IRJET -  	  Conversion of Unsupervised Data to Supervised Data using Topic Mo...IRJET -  	  Conversion of Unsupervised Data to Supervised Data using Topic Mo...
IRJET - Conversion of Unsupervised Data to Supervised Data using Topic Mo...
 
Benchmarking Techniques for Performance Analysis of Operating Systems and Pro...
Benchmarking Techniques for Performance Analysis of Operating Systems and Pro...Benchmarking Techniques for Performance Analysis of Operating Systems and Pro...
Benchmarking Techniques for Performance Analysis of Operating Systems and Pro...
 
حلقة تكنولوجية 11 بحث علمى بعنوان A Systematic Mapping Study for Big Data Str...
حلقة تكنولوجية 11 بحث علمى بعنوان A Systematic Mapping Study for Big Data Str...حلقة تكنولوجية 11 بحث علمى بعنوان A Systematic Mapping Study for Big Data Str...
حلقة تكنولوجية 11 بحث علمى بعنوان A Systematic Mapping Study for Big Data Str...
 
Development of Information Extraction for Data Analysis using NLP
Development of Information Extraction for Data Analysis using NLPDevelopment of Information Extraction for Data Analysis using NLP
Development of Information Extraction for Data Analysis using NLP
 
A modified k means algorithm for big data clustering
A modified k means algorithm for big data clusteringA modified k means algorithm for big data clustering
A modified k means algorithm for big data clustering
 
-linkedin
-linkedin-linkedin
-linkedin
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
 
Ppt for paper id 696 a review of hybrid data mining algorithm for big data mi...
Ppt for paper id 696 a review of hybrid data mining algorithm for big data mi...Ppt for paper id 696 a review of hybrid data mining algorithm for big data mi...
Ppt for paper id 696 a review of hybrid data mining algorithm for big data mi...
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
 
Provinance in scientific workflows in e science
Provinance in scientific workflows in e scienceProvinance in scientific workflows in e science
Provinance in scientific workflows in e science
 
IRJET- E-MORES: Efficient Multiple Output Regression for Streaming Data
IRJET- E-MORES: Efficient Multiple Output Regression for Streaming DataIRJET- E-MORES: Efficient Multiple Output Regression for Streaming Data
IRJET- E-MORES: Efficient Multiple Output Regression for Streaming Data
 
WorkExamples
WorkExamplesWorkExamples
WorkExamples
 

More from Hong-Linh Truong

QoA4ML – A Framework for Supporting Contracts in Machine Learning Services
QoA4ML – A Framework for Supporting Contracts in Machine Learning ServicesQoA4ML – A Framework for Supporting Contracts in Machine Learning Services
QoA4ML – A Framework for Supporting Contracts in Machine Learning Services
Hong-Linh Truong
 
Sharing Blockchain Performance Knowledge for Edge Service Development
Sharing Blockchain Performance Knowledge for Edge Service DevelopmentSharing Blockchain Performance Knowledge for Edge Service Development
Sharing Blockchain Performance Knowledge for Edge Service Development
Hong-Linh Truong
 
Measuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
Measuring, Quantifying, & Predicting the Cost-Accuracy TradeoffMeasuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
Measuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
Hong-Linh Truong
 
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
DevOps for Dynamic Interoperability of IoT, Edge and Cloud SystemsDevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
Hong-Linh Truong
 
Dynamic IoT data, protocol, and middleware interoperability with resource sli...
Dynamic IoT data, protocol, and middleware interoperability with resource sli...Dynamic IoT data, protocol, and middleware interoperability with resource sli...
Dynamic IoT data, protocol, and middleware interoperability with resource sli...
Hong-Linh Truong
 
Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Modeling and Provisioning IoT Cloud Systems for Testing UncertaintiesModeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Hong-Linh Truong
 
Characterizing Incidents in Cloud-based IoT Data Analytics
Characterizing Incidents in Cloud-based IoT Data AnalyticsCharacterizing Incidents in Cloud-based IoT Data Analytics
Characterizing Incidents in Cloud-based IoT Data Analytics
Hong-Linh Truong
 
Enabling Edge Analytics of IoT Data: The Case of LoRaWAN
Enabling Edge Analytics of IoT Data: The Case of LoRaWANEnabling Edge Analytics of IoT Data: The Case of LoRaWAN
Enabling Edge Analytics of IoT Data: The Case of LoRaWAN
Hong-Linh Truong
 
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
Hong-Linh Truong
 
Deep Context-Awareness: Context Coupling and New Types of Context Information...
Deep Context-Awareness: Context Coupling and New Types of Context Information...Deep Context-Awareness: Context Coupling and New Types of Context Information...
Deep Context-Awareness: Context Coupling and New Types of Context Information...
Hong-Linh Truong
 
Managing and Testing Ensembles of IoT, Network functions, and Clouds
Managing and Testing Ensembles of IoT, Network functions, and CloudsManaging and Testing Ensembles of IoT, Network functions, and Clouds
Managing and Testing Ensembles of IoT, Network functions, and Clouds
Hong-Linh Truong
 
Towards a Resource Slice Interoperability Hub for IoT
Towards a Resource Slice Interoperability Hub for IoTTowards a Resource Slice Interoperability Hub for IoT
Towards a Resource Slice Interoperability Hub for IoT
Hong-Linh Truong
 
On Supporting Contract-aware IoT Dataspace Services
On Supporting Contract-aware IoT Dataspace ServicesOn Supporting Contract-aware IoT Dataspace Services
On Supporting Contract-aware IoT Dataspace Services
Hong-Linh Truong
 
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
Hong-Linh Truong
 
On Engineering Analytics of Elastic IoT Cloud Systems
On Engineering Analytics of Elastic IoT Cloud SystemsOn Engineering Analytics of Elastic IoT Cloud Systems
On Engineering Analytics of Elastic IoT Cloud Systems
Hong-Linh Truong
 
TUW-ASE Summer 2015: IoT Cloud Systems
TUW-ASE Summer 2015:  IoT Cloud SystemsTUW-ASE Summer 2015:  IoT Cloud Systems
TUW-ASE Summer 2015: IoT Cloud Systems
Hong-Linh Truong
 

More from Hong-Linh Truong (16)

QoA4ML – A Framework for Supporting Contracts in Machine Learning Services
QoA4ML – A Framework for Supporting Contracts in Machine Learning ServicesQoA4ML – A Framework for Supporting Contracts in Machine Learning Services
QoA4ML – A Framework for Supporting Contracts in Machine Learning Services
 
Sharing Blockchain Performance Knowledge for Edge Service Development
Sharing Blockchain Performance Knowledge for Edge Service DevelopmentSharing Blockchain Performance Knowledge for Edge Service Development
Sharing Blockchain Performance Knowledge for Edge Service Development
 
Measuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
Measuring, Quantifying, & Predicting the Cost-Accuracy TradeoffMeasuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
Measuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
 
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
DevOps for Dynamic Interoperability of IoT, Edge and Cloud SystemsDevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
 
Dynamic IoT data, protocol, and middleware interoperability with resource sli...
Dynamic IoT data, protocol, and middleware interoperability with resource sli...Dynamic IoT data, protocol, and middleware interoperability with resource sli...
Dynamic IoT data, protocol, and middleware interoperability with resource sli...
 
Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Modeling and Provisioning IoT Cloud Systems for Testing UncertaintiesModeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
 
Characterizing Incidents in Cloud-based IoT Data Analytics
Characterizing Incidents in Cloud-based IoT Data AnalyticsCharacterizing Incidents in Cloud-based IoT Data Analytics
Characterizing Incidents in Cloud-based IoT Data Analytics
 
Enabling Edge Analytics of IoT Data: The Case of LoRaWAN
Enabling Edge Analytics of IoT Data: The Case of LoRaWANEnabling Edge Analytics of IoT Data: The Case of LoRaWAN
Enabling Edge Analytics of IoT Data: The Case of LoRaWAN
 
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
 
Deep Context-Awareness: Context Coupling and New Types of Context Information...
Deep Context-Awareness: Context Coupling and New Types of Context Information...Deep Context-Awareness: Context Coupling and New Types of Context Information...
Deep Context-Awareness: Context Coupling and New Types of Context Information...
 
Managing and Testing Ensembles of IoT, Network functions, and Clouds
Managing and Testing Ensembles of IoT, Network functions, and CloudsManaging and Testing Ensembles of IoT, Network functions, and Clouds
Managing and Testing Ensembles of IoT, Network functions, and Clouds
 
Towards a Resource Slice Interoperability Hub for IoT
Towards a Resource Slice Interoperability Hub for IoTTowards a Resource Slice Interoperability Hub for IoT
Towards a Resource Slice Interoperability Hub for IoT
 
On Supporting Contract-aware IoT Dataspace Services
On Supporting Contract-aware IoT Dataspace ServicesOn Supporting Contract-aware IoT Dataspace Services
On Supporting Contract-aware IoT Dataspace Services
 
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
 
On Engineering Analytics of Elastic IoT Cloud Systems
On Engineering Analytics of Elastic IoT Cloud SystemsOn Engineering Analytics of Elastic IoT Cloud Systems
On Engineering Analytics of Elastic IoT Cloud Systems
 
TUW-ASE Summer 2015: IoT Cloud Systems
TUW-ASE Summer 2015:  IoT Cloud SystemsTUW-ASE Summer 2015:  IoT Cloud Systems
TUW-ASE Summer 2015: IoT Cloud Systems
 

Recently uploaded

Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
GeoBlogs
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
heathfieldcps1
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
Jheel Barad
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
joachimlavalley1
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
Peter Windle
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
Levi Shapiro
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
Ashokrao Mane college of Pharmacy Peth-Vadgaon
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
beazzy04
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
Anna Sz.
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
Vikramjit Singh
 

Recently uploaded (20)

Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 

TUW-ASE Summer 2015 - Quality of Result-aware data analytics

  • 1. Quality of Result-aware data analytics Hong-Linh Truong Distributed Systems Group, Vienna University of Technology truong@dsg.tuwien.ac.at http://dsg.tuwien.ac.at/staff/truong 1ASE Summer 2015 Advanced Services Engineering, Summer 2015 Advanced Services Engineering, Summer 2015
  • 2. Outline  Data analytics: workflow structures and systems  Enable quality of results for data analytics  Quality of data in data analytics workflows ASE Summer 2015 2
  • 3. Data Analytics Conceptual View ASE Summer 2015 3 Data Processing FrameworksData Processing Frameworks Streaming/Online Data Processing Batch Data Processing Hybrid Data Processing Static data(Near) realtime data Decision Data Analysis Analytics, Tools, Processes & Models
  • 4. Data analytics workflows ASE Summer 2015 4 Things People DaaSDaaS Computation Service Computation Service We use the term „workflow“ in a generic meaning!!!
  • 5. Different views of (data analytics) workflow systems 5 View Domain view Business Workflow Scientific/E- science Workflow Data/Computati on view Data intensive workflow Computatio n intensive workflow Human- intensive workflow System view Grid workflow Enterprise workflow Cloud- based workflow Execution model view Service- based workflow Batch job workflow Interactive workflow ASE Summer 2015
  • 6. Pros and cons of (data analytics) workflow systems ASE Summer 2015 6  Ian J. Taylor, Ewa Deelman, Dennis B. Gannon, and Matthew Shields. 2006. Workflows for E-Science: Scientific Workflows for Grids. Springer-Verlag New York, Inc., Secaucus, NJ, USA.  Bertram Ludäscher, Mathias Weske, Timothy M. McPhillips, Shawn Bowers: Scientific Workflows: Business as Usual? BPM 2009: 31-47  Mirko Sonntag, Dimka Karastoyanova, Frank Leymann: The Missing Features of Workflow Systems for Scientific Computations. Software Engineering (Workshops) 2010: 209-216  Lavanya Ramakrishnan and Beth Plale. 2010. A multi-dimensional classification model for scientific workflow characteristics. In Proceedings of the 1st International Workshop on Workflow Approaches to New Data-centric Science (Wands '10). ACM, New York, NY, USA, , Article 4 , 12 pages. DOI=10.1145/1833398.1833402 http://doi.acm.org/10.1145/1833398.1833402  Jia Yu and Rajkumar Buyya. 2005. A taxonomy of scientific workflow systems for grid computing. SIGMOD Rec. 34, 3 (September 2005), 44-49. DOI=10.1145/1084805.1084814 http://doi.acm.org/10.1145/1084805.1084814  Ian J. Taylor, Ewa Deelman, Dennis B. Gannon, and Matthew Shields. 2006. Workflows for E-Science: Scientific Workflows for Grids. Springer-Verlag New York, Inc., Secaucus, NJ, USA.  Bertram Ludäscher, Mathias Weske, Timothy M. McPhillips, Shawn Bowers: Scientific Workflows: Business as Usual? BPM 2009: 31-47  Mirko Sonntag, Dimka Karastoyanova, Frank Leymann: The Missing Features of Workflow Systems for Scientific Computations. Software Engineering (Workshops) 2010: 209-216  Lavanya Ramakrishnan and Beth Plale. 2010. A multi-dimensional classification model for scientific workflow characteristics. In Proceedings of the 1st International Workshop on Workflow Approaches to New Data-centric Science (Wands '10). ACM, New York, NY, USA, , Article 4 , 12 pages. DOI=10.1145/1833398.1833402 http://doi.acm.org/10.1145/1833398.1833402  Jia Yu and Rajkumar Buyya. 2005. A taxonomy of scientific workflow systems for grid computing. SIGMOD Rec. 34, 3 (September 2005), 44-49. DOI=10.1145/1084805.1084814 http://doi.acm.org/10.1145/1084805.1084814
  • 7. Hierarchical view of workflows (1) mProject1Service.java public void mProject1() { } mProject1Service.java public void mProject1() { } WorkflowWorkflow A();A(); <parallel> </parallel> <parallel> </parallel> Workflow Region nWorkflow Region n Activity mActivity m Invoked Application mInvoked Application m Code Region 1 Code Region 1 Code Region q Code Region q Code Region … Code Region … <activity name="mProject1"> <executable name="mProject1"/> </activity> <activity name="mProject1"> <executable name="mProject1"/> </activity> <activity name="mProject2"> <executable name="mProject2"/> </activity> <activity name="mProject2"> <executable name="mProject2"/> </activity> while () { ... } while () { ... } Hong Linh Truong, Schahram Dustdar, Thomas Fahringer: Performance metrics and ontologies for Grid workflows. Future Generation Comp. Syst. 23(6): 760-772 (2007) Hong Linh Truong, Schahram Dustdar, Thomas Fahringer: Performance metrics and ontologies for Grid workflows. Future Generation Comp. Syst. 23(6): 760-772 (2007) ASE Summer 2015 7
  • 8. Representing and programming data analytics workflows  Programming languages  General- and specific-purpose programming languages, such as Java, Python, Swift  Programming models, such as MapReduce, Hadoop, Complex event processing, Spark  Descriptive languages  BPEL and several languages designed for specific workflow engines  They can also be combined 8ASE Summer 2015
  • 9. Data analytics workflow execution models ASE Summer 2015 9 Data analytics workflows Data analytics workflows Execution EngineExecution Engine Local SchedulerLocal Scheduler jobjob jobjob jobjob jobjob Web serviceWeb serviceWeb service Web service People
  • 10. Data analytics workflow execution models ASE Summer 2015 10 Data analytics workflows Data analytics workflows Execution EngineExecution Engine Service unit Local input data Analytics Results Web service MapReduce/Hadoop Sub-Workflow MPI Other solutions Servers/Cloud/Cluster
  • 11. Examples of systems and frameworks for data analytics workflows ASE Summer 2015 11 ASKALONASKALON KEPLERKEPLER TAVERNATAVERNA TRIDENTTRIDENT Apache ODE + WS-BPEL Apache ODE + WS-BPEL PegasusPegasus JOperaJOperaADEPTADEPT MapReduce/HadoopMapReduce/Hadoop SwiftSwiftRR
  • 12. Some examples (1) ASE Summer 2015 12 Source: Gideon Juve, Ewa Deelman, G. Bruce Berriman, Benjamin P. Berman, Philip Maechling: An Evaluation of the Cost and Performance of Scientific Workflows on Amazon EC2. J. Grid Comput. 10(1): 5-21 (2012) Source: Gideon Juve, Ewa Deelman, G. Bruce Berriman, Benjamin P. Berman, Philip Maechling: An Evaluation of the Cost and Performance of Scientific Workflows on Amazon EC2. J. Grid Comput. 10(1): 5-21 (2012)
  • 13. Some examples (2) ASE Summer 2015 13 Source: http://www.dps.uibk.ac.at/projects/brokerage/Source: http://www.dps.uibk.ac.at/projects/brokerage/
  • 14. Some examples (3) ASE Summer 2015 14 Source: Cesare Pautasso, Thomas Heinis, Gustavo Alonso: JOpera: Autonomic Service Orchestration. IEEE Data Eng. Bull. 29(3): 32-39 (2006) Source: Cesare Pautasso, Thomas Heinis, Gustavo Alonso: JOpera: Autonomic Service Orchestration. IEEE Data Eng. Bull. 29(3): 32-39 (2006)
  • 15. Some examples (4) ASE Summer 2015 15 Source: Sudipto Das, Yannis Sismanis, Kevin S. Beyer, Rainer Gemulla, Peter J. Haas, and John McPherson. 2010. Ricardo: integrating R and Hadoop. In Proceedings of the 2010 ACM SIGMOD International Conference on Management of data (SIGMOD '10). ACM, New York, NY, USA, 987-998. DOI=10.1145/1807167.1807275 http://doi.acm.org/10.1145/1807167.1807275 Source: Sudipto Das, Yannis Sismanis, Kevin S. Beyer, Rainer Gemulla, Peter J. Haas, and John McPherson. 2010. Ricardo: integrating R and Hadoop. In Proceedings of the 2010 ACM SIGMOD International Conference on Management of data (SIGMOD '10). ACM, New York, NY, USA, 987-998. DOI=10.1145/1807167.1807275 http://doi.acm.org/10.1145/1807167.1807275
  • 16. QUALITY OF RESULTS ASE Summer 2015 16
  • 17. Recall - QoR  Characterize the results of analytics processes  Different elements of QoR  Performance  Data quality  Cost  Form/data format of output results  Etc.  Customer: expects QoR  Provider: offers QoR and enforces QoR ASE Summer 2015 17
  • 18. Performance and Data Quality Aspects 18 Data Analytics Data in Data out Executed on Analytics Processes uses Execution time? Performance Overhead? Memory Consumption? Is the data good enough? How bad data impacts on performance? Is the data good enough to be stored and shared? Data quality metrics and models are strongly domain-specific Data quality metrics and models are strongly domain-specific Which processes should be used? ASE Summer 2015 18
  • 19. Model QoR and analytics processes 19ASE Summer 2015 Hong Linh Truong, Schahram Dustdar: Principles of Software-Defined Elastic Systems for Big Data Analytics. IC2E 2014: 562-567 Hong Linh Truong, Schahram Dustdar: Principles of Software-Defined Elastic Systems for Big Data Analytics. IC2E 2014: 562-567
  • 20. SO HOW DO WE ENABLE QOR-AWARE ANALYTICS? ASE Summer 2015 20
  • 21. Solutions  Computational resources provisioning?  Replication of analytics ?  Performance and cost measurement and optimization?  Improve quality of input data ?  Improve the quality of output data? ASE Summer 2015 21
  • 22. 22 Hong Linh Truong, Peter Brunner, Vlad Nae, Thomas Fahringer: DIPAS: A distributed performance analysis service for grid service-based workflows. Future Generation Comp. Syst. 25(4): 385-398 (2009) Hong Linh Truong, Peter Brunner, Vlad Nae, Thomas Fahringer: DIPAS: A distributed performance analysis service for grid service-based workflows. Future Generation Comp. Syst. 25(4): 385-398 (2009) Well-addressed concerns -- performance ASE Summer 2015 22
  • 23. Well-addressed concerns – performance/cost ASE Summer 2015 23 Source: David Chiu, Sagar Deshpande, Gagan Agrawal, Rongxing Li: Cost and accuracy sensitive dynamic workflow composition over grid environments. GRID 2008: 9-16 Source: David Chiu, Sagar Deshpande, Gagan Agrawal, Rongxing Li: Cost and accuracy sensitive dynamic workflow composition over grid environments. GRID 2008: 9-16
  • 24. QUALITY OF DATA IN DATA ANALYTICS WORKFLOWS ASE Summer 2015 24
  • 25. Very little support  Qurator workbench  “Personal quality models” can be expressed and embedded into query processors or workflows.  Assume that quality evidence is presented  Kepler  A data quality monitor allows user to specify quality thresholds.  Expect that rules can be used to control the execution based on quality. ASE Summer 2015 25 P Missier, S M Embury, M Greenwood, A D Preece, & B Jin, Managing Information Quality in e-Science: the Qurator Workbench, Proc ACM International Conference on Management of Data (SIGMOD 2007), ACM Press, pages 1150- 1152, 2007. Aisa Na’im, Daniel Crawl,Maria Indrawan, Ilkay Altintas, and Shulei Sun. Monitoring data quality in kepler. In Salim Hariri and Kate Keahey, editors, HPDC, pages 560–564. ACM, 2010. P Missier, S M Embury, M Greenwood, A D Preece, & B Jin, Managing Information Quality in e-Science: the Qurator Workbench, Proc ACM International Conference on Management of Data (SIGMOD 2007), ACM Press, pages 1150- 1152, 2007. Aisa Na’im, Daniel Crawl,Maria Indrawan, Ilkay Altintas, and Shulei Sun. Monitoring data quality in kepler. In Salim Hariri and Kate Keahey, editors, HPDC, pages 560–564. ACM, 2010.
  • 26. Research questions  What are main QoD metrics, what are the relationship between QoD metrics and other service level objectives, and what are their roles and possible trade-offs?  How to support different domain-specific QoD models and link them to workflow structures?  How to model, evaluate and estimate QoD associated with data movement into, within, and out to workflows? When and where software or scientists can perform automatic or manual QoD measurement and analysis  How to optimize the workflow composition and execution based on QoD specification?  How does QoD impact on the provisioning of data services, computational services and supporting services? ASE Summer 2015 26
  • 27. Approach ASE Summer 2015 27 Core models, techniques and algorithms to allow the modeling and evaluating QoD metrics Core models, techniques and algorithms to allow the modeling and evaluating QoD metrics QoD-aware composition and executionQoD-aware composition and execution QoD-aware service provisioning and infrastructure optimization QoD-aware service provisioning and infrastructure optimization
  • 28. Modeling and evaluating QoD metrics for data analytics workflows ASE Summer 2015 28
  • 29. QoD-aware optimization for data analytics workflow composition and execution ASE Summer 2015 29
  • 30. HOW TO INTEGRATE QOD EVALUATORS? AND WHICH CONCERNS NEED TO BE CONSIDERED? ASE Summer 2015 30
  • 31. QoD metrics evaluation  Domain-specific metrics  Need specific tools and expertise for determining metrics  Evaluation  Cannot done by software only: humans are required  Complex integration model  Where to put QoD evaluators and why?  How evaluators obtain the data to be evaluated?  Impact of QoD evaluation on performance of data analytics workflows ASE Summer 2015 31
  • 32. WHAT KIND OF OPTIMIZATION CAN BE DONE? ASE Summer 2015 32
  • 33. QoD-aware optimization for data analytics workflows  Improving quality of results  Reducing analytics costs and time  Enabling early failure detection  Enabling elasticitiy of services provisioning  Enabling elastic data analytics support  Etc. ASE Summer 2015 33
  • 35. 35 QoD-aware simulation workflows Michael Reiter, Hong Linh Truong, Schahram Dustdar, Dimka Karastoyanova, Robert Krause, Frank Leymann, Dieter Pahr: On Analyzing Quality of Data Influences on Performance of Finite Elements Driven Computational Simulations. Euro-Par 2012: 793-804 Michael Reiter, Uwe Breitenbücher, Schahram Dustdar, Dimka Karastoyanova, Frank Leymann, Hong Linh Truong: A Novel Framework for Monitoring and Analyzing Quality of Data in Simulation Workflows. eScience 2011: 105-112 Michael Reiter, Hong Linh Truong, Schahram Dustdar, Dimka Karastoyanova, Robert Krause, Frank Leymann, Dieter Pahr: On Analyzing Quality of Data Influences on Performance of Finite Elements Driven Computational Simulations. Euro-Par 2012: 793-804 Michael Reiter, Uwe Breitenbücher, Schahram Dustdar, Dimka Karastoyanova, Frank Leymann, Hong Linh Truong: A Novel Framework for Monitoring and Analyzing Quality of Data in Simulation Workflows. eScience 2011: 105-112 ASE Summer 2015
  • 36. Hybrid resources needed for quality evaluation  Challenges:  Subjective and objective evaluation  Long running processes  Our approach  Different QoD measurements  Human and software tasks 36ASE Summer 2015
  • 37. 37 Evaluating quality of data in workflows Michael Reiter, Uwe Breitenbücher, Schahram Dustdar, Dimka Karastoyanova, Frank Leymann, Hong Linh Truong: A Novel Framework for Monitoring and Analyzing Quality of Data in Simulation Workflows. eScience 2011: 105-112 Michael Reiter, Uwe Breitenbücher, Schahram Dustdar, Dimka Karastoyanova, Frank Leymann, Hong Linh Truong: A Novel Framework for Monitoring and Analyzing Quality of Data in Simulation Workflows. eScience 2011: 105-112 ASE Summer 2015
  • 38. QoD Evaluator  Software-based QoD evaluators  Can be provided under libraries integrated into invoked applications  Web services-based evaluators  Human-based QoD evaluators  Built based on the concept human-based services  Can be interfaces via Human-Task  Simple mapping at the moment  Human resources from clouds/crowds ASE Summer 2015 38
  • 39. Open issues: quality-of-result (QoR) driven workflows  Tradeoffs are challenging  How to support QoR driven analytics?  Some basic steps  Conceptualize expected QoR  Associate the expected QoR with workflow activities  Use the expected QoR  to match/select underlying services (e.g., data sources, cloud IaaS, etc  Utilize the expected QoR and the measured QoR and apply elasticity principles for  Refine the workflow structure  Provision computation, network and data ASE Summer 2015 39
  • 40. Exercises  Read mentioned papers  Discuss pros and cons of descriptive languages - and programming languages – based data analytics workflows  Examine how QoD evaluators can be integrated into different programming models for QoR- aware data analytics workflows  Implement some QoD evaluators  Develop techniques for determining places where QoD evaluators can be performed ASE Summer 2015 40
  • 41. 41 Thanks for your attention Hong-Linh Truong Distributed Systems Group Vienna University of Technology truong@dsg.tuwien.ac.at http://dsg.tuwien.ac.at/staff/truong ASE Summer 2015