SlideShare a Scribd company logo
Performance Analysis of Grid workflows
      in K-WfGrid and ASKALON
         Hong-Linh Truong and Thomas Fahringer
                    Distributed and Parallel Systems Group
            Institute of Computer Science, University of Innsbruck
                            {truong,tf}@dps.uibk.ac.at
                        http://dps.uibk.ac.at/projects/pma

    With contributions from:
    Bartosz Balis, Marian Bubak, Peter Brunner, Schahram Dustdar, Jakub
    Dziwisz, Andreas Hoheisel, Tilman Linden, Radu Prodan, Francesco
    Nerieri, Kuba Rozkwitalski, Robert Samborski




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.
Outline
       Motivation
       Objectives and approach
       Performance metrics and ontologies
       Architecture of Grid monitoring and analysis
       Conclusions




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   2
Performance Instrumentation, Monitoring, and Analysis
                     for the Grid
       Challenging task because of dynamic nature of
    the Grid and its applications
        Combination of fine-grained and coarse-grained
        models
        Multilingual applications
        Heterogeneous and dynamic systems
       Not just performance but also dependability
      The focus of the talk
        Our concepts, architecture, interfaces and
        integration

Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   3
ASKALON Toolkit
                  http://dps.uibk.ac.at/projects/askalon/




       Programming and execution environment for Grid workflows
         Integrated various services for scheduling, executing,
         monitoring and analyzing Grid workflows
      Target applications
        Workflows of scientific applications (C/Fortran)
         • Material science, flooding, astrophysics, movie
           rendering, etc.
Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   4
The K-WfGrid Project
     An FP6 EU project
          www.kwfgrid.net
                                                                    Execute workflow
     Focuses
          Automatic construction and
          reuse of workflows based on
                                                        Construct                         Monitor
          knowledge gathered through
          execution                                     workflow                      environment
     Target applications                                              K-WfGrid
          Workflows of Grid/Web services               Reuse                                 Analyze
          Grid/Web services may invoke                 knowledge                          information
          legacy applications
          Business (Coordinated Traffic
          Management, EPR) and
                                                                   Capture knowledge
          scientific (e.g., Food simulation)
          applications




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.          5
The K-WfGrid Project
     An FP6 EU project
          www.kwfgrid.net
                                                                    Execute workflow
     Focuses
          Automatic construction and
          reuse of workflows based on
                                                        Construct                         Monitor
          knowledge gathered through
          execution                                     workflow                      environment
                                        Many Grid users and
     Target applications                                 K-WfGrid
                                        developers we know
          Workflows of Grid/Web services       Reuse               Analyze
                                emphasize the interoperability, information
          Grid/Web services may invoke         knowledge
          legacy applications    integration and reliability, not
          Business (Coordinated Traffic
          Management, EPR) and
                                          just performance
                                                                   Capture knowledge
          scientific (e.g., Food simulation)
          applications




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.      6
Objectives, Requirements, Approaches
       Performance monitoring and analysis for Grid workflows of
       Web/Grid services and multilingual components

       Performance and dependability metrics
       Monitoring and performance analysis
         Service-oriented distributed architecture and peer-to-
         peer model
         Unified monitoring and performance analysis system
         covering infrastructure and applications
         Standardized data representations for monitoring data
         and events
         Adaptive and generic sensors, distributed analysis,
         performance bottleneck search

Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   7
Hierarchical View of Grid Workflows
                                       <parallel>
             Workflow
                                        <activity name="mProject1">
                                        <executable name="mProject1"/>
                                        </activity>
      Workflow Region n
                                        <activity name="mProject2">
                                        <executable name="mProject2"/>
                                         </activity>
             Activity m


                                       </parallel>
                                                          mProject1Service.java
      Invoked Application m
                                                                 public void mProject1() {
                                                                         A();

                                                                         while () {
  Code            Code           Code                                       ...
 Region 1       Region …        Region q                                     }

                                                                 }
Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.      8
Hierarchical View of Grid Workflows
                                       <parallel>
             Workflow
                                        <activity name="mProject1">
                                        <executable name="mProject1"/>
                                        </activity>
      Workflow Region n
                                        <activity name="mProject2">
                                        <executable name="mProject2"/>
                                         </activity>
             Activity m


                                       </parallel>
                                         Monitoring and
                                                     mProject1Service.java
      Invoked Application m            analysis at multiple
                                                          public void mProject1() {
                                      levels of abstraction
                                                                         A();

                                                                         while () {
  Code            Code           Code                                       ...
 Region 1       Region …        Region q                                     }

                                                                 }
Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   9
Workflow Execution Model (Simplified)




                                                                  Local scheduler

       Workflow execution
        Spanning multiple Grid sites
        Highly inter-organizational, inter-related and dynamic
       Multiple levels of job scheduling
        At workflow execution engine (part of WfMS)
        At Grid sites
Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   10
Performance Metrics of Grid Workflows
      Interesting performance metrics associated with multiple
    levels of abstraction
         Metrics can be used in workflow composition, for
         comparing different invoked applications of a single activity,
         adaptive scheduling, etc.
      Five levels of abstraction
         Code region, Invoked application, Activity ,Workflow
         Region, Workflow
         Topdown PMA: from a higher level to a lower one




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   11
Monitoring and Measuring Performance
                    Metrics
    Performance monitoring and analysis tools
       Operate at multiple levels
       Correlate performance metrics from multiple levels
   Middleware and application instrumentation
      Instrument execution engine
       • Execution engine can be distributed or centralized
      Instrument applications
       • Distributed, spanning multiple Grid sites
    Challenging problems: the complexity of performance tools and data
      Integrate multiple performance monitoring tools executed on multiple
      Grid sites
      Integrate performance data produced by various tools

      We need common concepts for performance data associated with
                          Grid workflows
Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   12
Utilizing Ontologies for Describing Performance
               Data of Grid workflows
      Metrics ontology
        Specifies which performance metrics a tool can provide
        Simplifies the access to performance metrics provided by various tools
     Performance data integration
        Performance data integration based on common concepts.
        High-level search and retrieval of performance data
     Knowledge base performance data of Grid workflows
        Utilized by high-level tools such as schedulers, workflow composition
        tools, etc.
        Used to re(discover) workflow patterns, interactions in workflows, to
        check correct execution, etc.
     Distributed performance analysis
         Performance analysis requests can be built based on ontologies

Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   13
Workflow Performance Ontology

     WfPerfOnto (Ontology describing Performance data of Grid
    Workflows)
       Specifies performance metrics
       Basic concepts
        • Concepts reflect the hierarchical structure of a workflow
        • Static and dynamic workflow performance data
       Relationships
        • Static and dynamic relationships among concepts




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   14
WfPerfOnto




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   15
Monitoring and Analysis Scenario




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   16
Components for Grid PMA
      Three main parts of a unified performance monitoring,
    instrumentation and analysis system
          Monitoring and instrumentation services
          Performance analysis services
          Performance service interfaces and data representations
           • We must reuse existing tools and techniques as much as
             possible

      Integration model
          Loosely coupled: among Grid sites/organizations
           • Utilizing SOA for performance tools
          Tightly coupled: within services deployed in a single Grid
          site/organization
           • Interfacing to existing (parallel) performance tools


Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   17
K-WfGrid Monitoring and Analysis Architecture




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   18
Self-Managing Sensor-Based Middleware
       Integrating diverse types of sensors into a single system
          Event-driven and demand-driven sensors for system and
          applications monitoring, rule-based monitoring
       Self-managing services
         Service-based operations and TCP-based stream data
         delivery
         Peer-to-peer Grid services for the monitoring middleware
       Query and subscription of monitoring data
        Data query and subscription
        Group-based data query and subscription, and notification




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   19
Self-Managing Sensor Based Monitoring
                      Middleware




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   20
Workflow Instrumentation
      Issues to be addressed
          Multiple levels of instrumentation
          Instrumentation of multilingual applications (C/Java/Fortran)
          Must be dynamic (enabled) instrumentation

      ASKALON and K-WfGrid approach
          Utilizing existing instrumentation techniques
           • OCM-G (Roland Wismueller and Marian Bubak)
                  dynamic enabled instrumentation C/Fortran
           • Dyninst (Barton Miller, Jeff Hollingsworth)
                  for binary code generated from C/Fortran
           • Source/dynamic instrumentation of Java (from Java 1.5)
          Using APART standardized intermediate representation (SIR)
          XML-based request for controlling instrumentation


Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   21
DIPAS: Distributed Performance Analysis
           DIPAS Portal                     DIPAS Portal


                  DIPAS                       DIPAS
                                                                               Knowledge
                 GateWay                     GateWay                         Builder Agent
                                                                                 GOM:
            Grid analysis                                                   Ontology Memory
                                            Grid analysis
               agent                           agent
                                                                           Monitoring Service
                            Grid analysis
                               agent                                            GWES:
                               DIPAS                                        Execution Service
  Grid analysis agent accepts WARL and returns performance metrics
  described in XML under a tree of metrics
     WARL (workflow analysis request langauge): based on concepts and
     properties in WfPerfOnto
Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   22
temporal overheads


Workflow Overhead                                                     unidentified overhead

                                                                      middleware
                                                                                           resource brokerage


Classification                                                                             performance prediction

                                                                                          scheduling

                                                                                                           optimization algorithm

                                                                                                           rescheduling


      Middleware                                                                          execution management

                                                                                                             control of parallelism


           Scheduler                                                                                                             fork construct

                                                                                                                                 barrier



           Resource Manager
                                                                                                             job preparation

                                                                                                                                  archive/compression of data

                                                                                                                                 extractdecompression/

           Execution management                                                                              job submission

                                                                                                                                 submission decision policy

            • Control of parallelism                                                                                             GRAM submission

                                                                                                                                 GRAM polling


            • Loss of parallelism                                                                                                queue

                                                                                                                                                  queue waiting

                                                                                                                                                  queue residency



      Loss of parallelism                                                                 restart

                                                                                                             job failure

                                                                                                           job cancel


           Load imbalance                                                                  service latency

                                                                                                               resource broker


           Serialization                                                                                     performance prediction

                                                                                                                  scheduler



           Replicated job                                                                 security
                                                                                                             execution management



                                                                          loss of parallelism



      Data transfer                                                                       load imbalance

                                                                                           serialisation

                                                                                          replicated job




      Activity
                                                                          data transfer

                                                                                          file staging

                                                                                                             stage in


           Parallel processing overheads                                                   access to database
                                                                                                             stage out




           External load                                                                   third party data transfer

                                                                                           input from user

                                                                                           data transfer imbalance

                                                                          activity


Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   parallel overheads

                                                                                           external load
                                                                                                                                                  23
Current status of the implementation
      SOA-based
          Monitoring, Instrumentation and Analysis services are GT4 based
          XML-based for performance data representations and requests
      Monitoring and Instrumentation Services
          gSOAP, GSI-based dynamic instrumentation service
          Java-based dynamic instrumentation
          OCM-G
          But we need to integrate them into a single framework for workflows
          and it is a non trivial task
      Analysis services
          GT4-based with distributed components
          Simple language for workflow analysis request (WARL), designed
          based on WfPerfOnto
          Metrics are described in XML


Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   24
Example: Dynamic Instrumentation




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   25
Snapshot: Online System Monitoring




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   26
Rule-based Monitoring
       Sensors use rules to analyze monitoring data
       Rules are based on IBM ABLE toolkit

       Example:
          A network path in the
          Austrian Grid, bandwidth <
          5MB/s (based on Iperf)

          Define a fuzzy variable with
          VERY LOW, LOW, MEDUM,
          HIGH, VERY HIGH

          Fuzzy rules: Events send
          when bandwidth VERY LOW
          or VERY HIGH
Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   27
Online Infrastructure Monitoring




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   28
DAG-based workflow monitoring and
                       analysis
       Online application
            profiling




  Monitoring data of
  workflow activity
     execution




                                                               Monitoring data of
                                                              computational node




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   29
K-WfGrid: Online Workflow Monitoring and
                       Analysis




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   30
Online Workflow Analysis




Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   31
Conclusions
    The architecture of monitoring and analysis service must tackle the
   dynamics and the diversity of the Grid
          Service-oriented, peer-to-peer model, adaptive sensors
     Integration and reuse are important issues
          Loosely coupled and tightly coupled
          Do not neglect data representations and service interfaces
     Performance metrics and ontology for Grid workflows
          What performance metrics are important and how to measure them
          Common and generic concepts and relationships monitored and alayzed
    Given well-defined service interfaces, data representation, performance
   metrics and ontology
          Simplify the integration among components
          Towards automatic, intelligent and distributed performance performance
          analysis

                        UIBK perf. work: http://dps.uibk.ac.at/projects/pma/
                       Papers: http://dps.uibk.ac.at/index.pl/publications


Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.   32

More Related Content

Similar to Performance Analysis of Grid Workflows in K-WfGrid and ASKALON

K-WfGrid Distributed Monitoring and Performance Analysis Services for Workflo...
K-WfGrid Distributed Monitoring and Performance Analysis Services for Workflo...K-WfGrid Distributed Monitoring and Performance Analysis Services for Workflo...
K-WfGrid Distributed Monitoring and Performance Analysis Services for Workflo...
Hong-Linh Truong
 
Performance Metrics and Ontology for Describing Performance Data of Grid Work...
Performance Metrics and Ontology for Describing Performance Data of Grid Work...Performance Metrics and Ontology for Describing Performance Data of Grid Work...
Performance Metrics and Ontology for Describing Performance Data of Grid Work...
Hong-Linh Truong
 
Session 46 - Principles of workflow management and execution
Session 46 - Principles of workflow management and execution Session 46 - Principles of workflow management and execution
Session 46 - Principles of workflow management and execution ISSGC Summer School
 
Webinar september 2013
Webinar september 2013Webinar september 2013
Webinar september 2013Marc Gille
 
LUXproject Functionality Overview R12.2
LUXproject Functionality Overview R12.2LUXproject Functionality Overview R12.2
LUXproject Functionality Overview R12.2Alexander Zagvozdin
 
Presentation for use
Presentation   for usePresentation   for use
Presentation for usebolu804
 
Zou Layered VO PDCAT2008 V0.5 Concise
Zou Layered VO PDCAT2008 V0.5 ConciseZou Layered VO PDCAT2008 V0.5 Concise
Zou Layered VO PDCAT2008 V0.5 Concise
yongqiangzou
 
Building resilient scheduling in distributed systems with Spring
Building resilient scheduling in distributed systems with SpringBuilding resilient scheduling in distributed systems with Spring
Building resilient scheduling in distributed systems with Spring
Marek Jeszka
 
Automation Failover in Openstack
Automation Failover in OpenstackAutomation Failover in Openstack
Automation Failover in Openstack
jannahyusoff1
 
20100512 Workflow Ramage
20100512 Workflow Ramage20100512 Workflow Ramage
20100512 Workflow Ramage
Steven Ramage
 
Presentation for slideshare
Presentation   for slidesharePresentation   for slideshare
Presentation for slidesharebolu804
 
Network Analyzer and Report Generation Tool for NS-2 using TCL Script
Network Analyzer and Report Generation Tool for NS-2 using TCL ScriptNetwork Analyzer and Report Generation Tool for NS-2 using TCL Script
Network Analyzer and Report Generation Tool for NS-2 using TCL Script
IRJET Journal
 
Live Coding 12 Factor App
Live Coding 12 Factor AppLive Coding 12 Factor App
Live Coding 12 Factor App
Emily Jiang
 
LUXproject functionality overview R11.8
LUXproject functionality overview R11.8LUXproject functionality overview R11.8
LUXproject functionality overview R11.8
Alexander Zagvozdin
 
HC-4022, Towards an Ecosystem for Heterogeneous Parallel Computing, by Wu Feng
HC-4022, Towards an Ecosystem for Heterogeneous Parallel Computing, by Wu FengHC-4022, Towards an Ecosystem for Heterogeneous Parallel Computing, by Wu Feng
HC-4022, Towards an Ecosystem for Heterogeneous Parallel Computing, by Wu Feng
AMD Developer Central
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
Carole Goble
 
Rejunevating software reengineering processes
Rejunevating software reengineering processesRejunevating software reengineering processes
Rejunevating software reengineering processes
manishthaper
 
Automation chapt 3
Automation chapt 3Automation chapt 3
Automation chapt 3
jannahyusoff1
 
Microservices and modularity with java
Microservices and modularity with javaMicroservices and modularity with java
Microservices and modularity with java
DPC Consulting Ltd
 
IPv4 to IPv6 network transformation
IPv4 to IPv6 network transformationIPv4 to IPv6 network transformation
IPv4 to IPv6 network transformation
Nikolay Milovanov
 

Similar to Performance Analysis of Grid Workflows in K-WfGrid and ASKALON (20)

K-WfGrid Distributed Monitoring and Performance Analysis Services for Workflo...
K-WfGrid Distributed Monitoring and Performance Analysis Services for Workflo...K-WfGrid Distributed Monitoring and Performance Analysis Services for Workflo...
K-WfGrid Distributed Monitoring and Performance Analysis Services for Workflo...
 
Performance Metrics and Ontology for Describing Performance Data of Grid Work...
Performance Metrics and Ontology for Describing Performance Data of Grid Work...Performance Metrics and Ontology for Describing Performance Data of Grid Work...
Performance Metrics and Ontology for Describing Performance Data of Grid Work...
 
Session 46 - Principles of workflow management and execution
Session 46 - Principles of workflow management and execution Session 46 - Principles of workflow management and execution
Session 46 - Principles of workflow management and execution
 
Webinar september 2013
Webinar september 2013Webinar september 2013
Webinar september 2013
 
LUXproject Functionality Overview R12.2
LUXproject Functionality Overview R12.2LUXproject Functionality Overview R12.2
LUXproject Functionality Overview R12.2
 
Presentation for use
Presentation   for usePresentation   for use
Presentation for use
 
Zou Layered VO PDCAT2008 V0.5 Concise
Zou Layered VO PDCAT2008 V0.5 ConciseZou Layered VO PDCAT2008 V0.5 Concise
Zou Layered VO PDCAT2008 V0.5 Concise
 
Building resilient scheduling in distributed systems with Spring
Building resilient scheduling in distributed systems with SpringBuilding resilient scheduling in distributed systems with Spring
Building resilient scheduling in distributed systems with Spring
 
Automation Failover in Openstack
Automation Failover in OpenstackAutomation Failover in Openstack
Automation Failover in Openstack
 
20100512 Workflow Ramage
20100512 Workflow Ramage20100512 Workflow Ramage
20100512 Workflow Ramage
 
Presentation for slideshare
Presentation   for slidesharePresentation   for slideshare
Presentation for slideshare
 
Network Analyzer and Report Generation Tool for NS-2 using TCL Script
Network Analyzer and Report Generation Tool for NS-2 using TCL ScriptNetwork Analyzer and Report Generation Tool for NS-2 using TCL Script
Network Analyzer and Report Generation Tool for NS-2 using TCL Script
 
Live Coding 12 Factor App
Live Coding 12 Factor AppLive Coding 12 Factor App
Live Coding 12 Factor App
 
LUXproject functionality overview R11.8
LUXproject functionality overview R11.8LUXproject functionality overview R11.8
LUXproject functionality overview R11.8
 
HC-4022, Towards an Ecosystem for Heterogeneous Parallel Computing, by Wu Feng
HC-4022, Towards an Ecosystem for Heterogeneous Parallel Computing, by Wu FengHC-4022, Towards an Ecosystem for Heterogeneous Parallel Computing, by Wu Feng
HC-4022, Towards an Ecosystem for Heterogeneous Parallel Computing, by Wu Feng
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Rejunevating software reengineering processes
Rejunevating software reengineering processesRejunevating software reengineering processes
Rejunevating software reengineering processes
 
Automation chapt 3
Automation chapt 3Automation chapt 3
Automation chapt 3
 
Microservices and modularity with java
Microservices and modularity with javaMicroservices and modularity with java
Microservices and modularity with java
 
IPv4 to IPv6 network transformation
IPv4 to IPv6 network transformationIPv4 to IPv6 network transformation
IPv4 to IPv6 network transformation
 

More from Hong-Linh Truong

QoA4ML – A Framework for Supporting Contracts in Machine Learning Services
QoA4ML – A Framework for Supporting Contracts in Machine Learning ServicesQoA4ML – A Framework for Supporting Contracts in Machine Learning Services
QoA4ML – A Framework for Supporting Contracts in Machine Learning Services
Hong-Linh Truong
 
Sharing Blockchain Performance Knowledge for Edge Service Development
Sharing Blockchain Performance Knowledge for Edge Service DevelopmentSharing Blockchain Performance Knowledge for Edge Service Development
Sharing Blockchain Performance Knowledge for Edge Service Development
Hong-Linh Truong
 
Measuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
Measuring, Quantifying, & Predicting the Cost-Accuracy TradeoffMeasuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
Measuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
Hong-Linh Truong
 
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
DevOps for Dynamic Interoperability of IoT, Edge and Cloud SystemsDevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
Hong-Linh Truong
 
Dynamic IoT data, protocol, and middleware interoperability with resource sli...
Dynamic IoT data, protocol, and middleware interoperability with resource sli...Dynamic IoT data, protocol, and middleware interoperability with resource sli...
Dynamic IoT data, protocol, and middleware interoperability with resource sli...
Hong-Linh Truong
 
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Hong-Linh Truong
 
Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Modeling and Provisioning IoT Cloud Systems for Testing UncertaintiesModeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Hong-Linh Truong
 
Characterizing Incidents in Cloud-based IoT Data Analytics
Characterizing Incidents in Cloud-based IoT Data AnalyticsCharacterizing Incidents in Cloud-based IoT Data Analytics
Characterizing Incidents in Cloud-based IoT Data Analytics
Hong-Linh Truong
 
Enabling Edge Analytics of IoT Data: The Case of LoRaWAN
Enabling Edge Analytics of IoT Data: The Case of LoRaWANEnabling Edge Analytics of IoT Data: The Case of LoRaWAN
Enabling Edge Analytics of IoT Data: The Case of LoRaWAN
Hong-Linh Truong
 
Analytics of Performance and Data Quality for Mobile Edge Cloud Applications
Analytics of Performance and Data Quality for Mobile Edge Cloud ApplicationsAnalytics of Performance and Data Quality for Mobile Edge Cloud Applications
Analytics of Performance and Data Quality for Mobile Edge Cloud Applications
Hong-Linh Truong
 
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
Hong-Linh Truong
 
Deep Context-Awareness: Context Coupling and New Types of Context Information...
Deep Context-Awareness: Context Coupling and New Types of Context Information...Deep Context-Awareness: Context Coupling and New Types of Context Information...
Deep Context-Awareness: Context Coupling and New Types of Context Information...
Hong-Linh Truong
 
Managing and Testing Ensembles of IoT, Network functions, and Clouds
Managing and Testing Ensembles of IoT, Network functions, and CloudsManaging and Testing Ensembles of IoT, Network functions, and Clouds
Managing and Testing Ensembles of IoT, Network functions, and Clouds
Hong-Linh Truong
 
Towards a Resource Slice Interoperability Hub for IoT
Towards a Resource Slice Interoperability Hub for IoTTowards a Resource Slice Interoperability Hub for IoT
Towards a Resource Slice Interoperability Hub for IoT
Hong-Linh Truong
 
On Supporting Contract-aware IoT Dataspace Services
On Supporting Contract-aware IoT Dataspace ServicesOn Supporting Contract-aware IoT Dataspace Services
On Supporting Contract-aware IoT Dataspace Services
Hong-Linh Truong
 
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
Hong-Linh Truong
 
On Engineering Analytics of Elastic IoT Cloud Systems
On Engineering Analytics of Elastic IoT Cloud SystemsOn Engineering Analytics of Elastic IoT Cloud Systems
On Engineering Analytics of Elastic IoT Cloud Systems
Hong-Linh Truong
 
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
Hong-Linh Truong
 
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
Hong-Linh Truong
 
Governing Elastic IoT Cloud Systems under Uncertainties
Governing Elastic IoT Cloud Systems under UncertaintiesGoverning Elastic IoT Cloud Systems under Uncertainties
Governing Elastic IoT Cloud Systems under Uncertainties
Hong-Linh Truong
 

More from Hong-Linh Truong (20)

QoA4ML – A Framework for Supporting Contracts in Machine Learning Services
QoA4ML – A Framework for Supporting Contracts in Machine Learning ServicesQoA4ML – A Framework for Supporting Contracts in Machine Learning Services
QoA4ML – A Framework for Supporting Contracts in Machine Learning Services
 
Sharing Blockchain Performance Knowledge for Edge Service Development
Sharing Blockchain Performance Knowledge for Edge Service DevelopmentSharing Blockchain Performance Knowledge for Edge Service Development
Sharing Blockchain Performance Knowledge for Edge Service Development
 
Measuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
Measuring, Quantifying, & Predicting the Cost-Accuracy TradeoffMeasuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
Measuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
 
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
DevOps for Dynamic Interoperability of IoT, Edge and Cloud SystemsDevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
 
Dynamic IoT data, protocol, and middleware interoperability with resource sli...
Dynamic IoT data, protocol, and middleware interoperability with resource sli...Dynamic IoT data, protocol, and middleware interoperability with resource sli...
Dynamic IoT data, protocol, and middleware interoperability with resource sli...
 
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
 
Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Modeling and Provisioning IoT Cloud Systems for Testing UncertaintiesModeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
 
Characterizing Incidents in Cloud-based IoT Data Analytics
Characterizing Incidents in Cloud-based IoT Data AnalyticsCharacterizing Incidents in Cloud-based IoT Data Analytics
Characterizing Incidents in Cloud-based IoT Data Analytics
 
Enabling Edge Analytics of IoT Data: The Case of LoRaWAN
Enabling Edge Analytics of IoT Data: The Case of LoRaWANEnabling Edge Analytics of IoT Data: The Case of LoRaWAN
Enabling Edge Analytics of IoT Data: The Case of LoRaWAN
 
Analytics of Performance and Data Quality for Mobile Edge Cloud Applications
Analytics of Performance and Data Quality for Mobile Edge Cloud ApplicationsAnalytics of Performance and Data Quality for Mobile Edge Cloud Applications
Analytics of Performance and Data Quality for Mobile Edge Cloud Applications
 
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
 
Deep Context-Awareness: Context Coupling and New Types of Context Information...
Deep Context-Awareness: Context Coupling and New Types of Context Information...Deep Context-Awareness: Context Coupling and New Types of Context Information...
Deep Context-Awareness: Context Coupling and New Types of Context Information...
 
Managing and Testing Ensembles of IoT, Network functions, and Clouds
Managing and Testing Ensembles of IoT, Network functions, and CloudsManaging and Testing Ensembles of IoT, Network functions, and Clouds
Managing and Testing Ensembles of IoT, Network functions, and Clouds
 
Towards a Resource Slice Interoperability Hub for IoT
Towards a Resource Slice Interoperability Hub for IoTTowards a Resource Slice Interoperability Hub for IoT
Towards a Resource Slice Interoperability Hub for IoT
 
On Supporting Contract-aware IoT Dataspace Services
On Supporting Contract-aware IoT Dataspace ServicesOn Supporting Contract-aware IoT Dataspace Services
On Supporting Contract-aware IoT Dataspace Services
 
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
 
On Engineering Analytics of Elastic IoT Cloud Systems
On Engineering Analytics of Elastic IoT Cloud SystemsOn Engineering Analytics of Elastic IoT Cloud Systems
On Engineering Analytics of Elastic IoT Cloud Systems
 
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
 
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
 
Governing Elastic IoT Cloud Systems under Uncertainties
Governing Elastic IoT Cloud Systems under UncertaintiesGoverning Elastic IoT Cloud Systems under Uncertainties
Governing Elastic IoT Cloud Systems under Uncertainties
 

Recently uploaded

Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 

Recently uploaded (20)

Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 

Performance Analysis of Grid Workflows in K-WfGrid and ASKALON

  • 1. Performance Analysis of Grid workflows in K-WfGrid and ASKALON Hong-Linh Truong and Thomas Fahringer Distributed and Parallel Systems Group Institute of Computer Science, University of Innsbruck {truong,tf}@dps.uibk.ac.at http://dps.uibk.ac.at/projects/pma With contributions from: Bartosz Balis, Marian Bubak, Peter Brunner, Schahram Dustdar, Jakub Dziwisz, Andreas Hoheisel, Tilman Linden, Radu Prodan, Francesco Nerieri, Kuba Rozkwitalski, Robert Samborski Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005.
  • 2. Outline Motivation Objectives and approach Performance metrics and ontologies Architecture of Grid monitoring and analysis Conclusions Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 2
  • 3. Performance Instrumentation, Monitoring, and Analysis for the Grid Challenging task because of dynamic nature of the Grid and its applications Combination of fine-grained and coarse-grained models Multilingual applications Heterogeneous and dynamic systems Not just performance but also dependability The focus of the talk Our concepts, architecture, interfaces and integration Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 3
  • 4. ASKALON Toolkit http://dps.uibk.ac.at/projects/askalon/ Programming and execution environment for Grid workflows Integrated various services for scheduling, executing, monitoring and analyzing Grid workflows Target applications Workflows of scientific applications (C/Fortran) • Material science, flooding, astrophysics, movie rendering, etc. Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 4
  • 5. The K-WfGrid Project An FP6 EU project www.kwfgrid.net Execute workflow Focuses Automatic construction and reuse of workflows based on Construct Monitor knowledge gathered through execution workflow environment Target applications K-WfGrid Workflows of Grid/Web services Reuse Analyze Grid/Web services may invoke knowledge information legacy applications Business (Coordinated Traffic Management, EPR) and Capture knowledge scientific (e.g., Food simulation) applications Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 5
  • 6. The K-WfGrid Project An FP6 EU project www.kwfgrid.net Execute workflow Focuses Automatic construction and reuse of workflows based on Construct Monitor knowledge gathered through execution workflow environment Many Grid users and Target applications K-WfGrid developers we know Workflows of Grid/Web services Reuse Analyze emphasize the interoperability, information Grid/Web services may invoke knowledge legacy applications integration and reliability, not Business (Coordinated Traffic Management, EPR) and just performance Capture knowledge scientific (e.g., Food simulation) applications Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 6
  • 7. Objectives, Requirements, Approaches Performance monitoring and analysis for Grid workflows of Web/Grid services and multilingual components Performance and dependability metrics Monitoring and performance analysis Service-oriented distributed architecture and peer-to- peer model Unified monitoring and performance analysis system covering infrastructure and applications Standardized data representations for monitoring data and events Adaptive and generic sensors, distributed analysis, performance bottleneck search Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 7
  • 8. Hierarchical View of Grid Workflows <parallel> Workflow <activity name="mProject1"> <executable name="mProject1"/> </activity> Workflow Region n <activity name="mProject2"> <executable name="mProject2"/> </activity> Activity m </parallel> mProject1Service.java Invoked Application m public void mProject1() { A(); while () { Code Code Code ... Region 1 Region … Region q } } Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 8
  • 9. Hierarchical View of Grid Workflows <parallel> Workflow <activity name="mProject1"> <executable name="mProject1"/> </activity> Workflow Region n <activity name="mProject2"> <executable name="mProject2"/> </activity> Activity m </parallel> Monitoring and mProject1Service.java Invoked Application m analysis at multiple public void mProject1() { levels of abstraction A(); while () { Code Code Code ... Region 1 Region … Region q } } Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 9
  • 10. Workflow Execution Model (Simplified) Local scheduler Workflow execution Spanning multiple Grid sites Highly inter-organizational, inter-related and dynamic Multiple levels of job scheduling At workflow execution engine (part of WfMS) At Grid sites Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 10
  • 11. Performance Metrics of Grid Workflows Interesting performance metrics associated with multiple levels of abstraction Metrics can be used in workflow composition, for comparing different invoked applications of a single activity, adaptive scheduling, etc. Five levels of abstraction Code region, Invoked application, Activity ,Workflow Region, Workflow Topdown PMA: from a higher level to a lower one Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 11
  • 12. Monitoring and Measuring Performance Metrics Performance monitoring and analysis tools Operate at multiple levels Correlate performance metrics from multiple levels Middleware and application instrumentation Instrument execution engine • Execution engine can be distributed or centralized Instrument applications • Distributed, spanning multiple Grid sites Challenging problems: the complexity of performance tools and data Integrate multiple performance monitoring tools executed on multiple Grid sites Integrate performance data produced by various tools We need common concepts for performance data associated with Grid workflows Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 12
  • 13. Utilizing Ontologies for Describing Performance Data of Grid workflows Metrics ontology Specifies which performance metrics a tool can provide Simplifies the access to performance metrics provided by various tools Performance data integration Performance data integration based on common concepts. High-level search and retrieval of performance data Knowledge base performance data of Grid workflows Utilized by high-level tools such as schedulers, workflow composition tools, etc. Used to re(discover) workflow patterns, interactions in workflows, to check correct execution, etc. Distributed performance analysis Performance analysis requests can be built based on ontologies Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 13
  • 14. Workflow Performance Ontology WfPerfOnto (Ontology describing Performance data of Grid Workflows) Specifies performance metrics Basic concepts • Concepts reflect the hierarchical structure of a workflow • Static and dynamic workflow performance data Relationships • Static and dynamic relationships among concepts Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 14
  • 15. WfPerfOnto Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 15
  • 16. Monitoring and Analysis Scenario Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 16
  • 17. Components for Grid PMA Three main parts of a unified performance monitoring, instrumentation and analysis system Monitoring and instrumentation services Performance analysis services Performance service interfaces and data representations • We must reuse existing tools and techniques as much as possible Integration model Loosely coupled: among Grid sites/organizations • Utilizing SOA for performance tools Tightly coupled: within services deployed in a single Grid site/organization • Interfacing to existing (parallel) performance tools Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 17
  • 18. K-WfGrid Monitoring and Analysis Architecture Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 18
  • 19. Self-Managing Sensor-Based Middleware Integrating diverse types of sensors into a single system Event-driven and demand-driven sensors for system and applications monitoring, rule-based monitoring Self-managing services Service-based operations and TCP-based stream data delivery Peer-to-peer Grid services for the monitoring middleware Query and subscription of monitoring data Data query and subscription Group-based data query and subscription, and notification Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 19
  • 20. Self-Managing Sensor Based Monitoring Middleware Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 20
  • 21. Workflow Instrumentation Issues to be addressed Multiple levels of instrumentation Instrumentation of multilingual applications (C/Java/Fortran) Must be dynamic (enabled) instrumentation ASKALON and K-WfGrid approach Utilizing existing instrumentation techniques • OCM-G (Roland Wismueller and Marian Bubak) dynamic enabled instrumentation C/Fortran • Dyninst (Barton Miller, Jeff Hollingsworth) for binary code generated from C/Fortran • Source/dynamic instrumentation of Java (from Java 1.5) Using APART standardized intermediate representation (SIR) XML-based request for controlling instrumentation Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 21
  • 22. DIPAS: Distributed Performance Analysis DIPAS Portal DIPAS Portal DIPAS DIPAS Knowledge GateWay GateWay Builder Agent GOM: Grid analysis Ontology Memory Grid analysis agent agent Monitoring Service Grid analysis agent GWES: DIPAS Execution Service Grid analysis agent accepts WARL and returns performance metrics described in XML under a tree of metrics WARL (workflow analysis request langauge): based on concepts and properties in WfPerfOnto Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 22
  • 23. temporal overheads Workflow Overhead unidentified overhead middleware resource brokerage Classification performance prediction scheduling optimization algorithm rescheduling Middleware execution management control of parallelism Scheduler fork construct barrier Resource Manager job preparation archive/compression of data extractdecompression/ Execution management job submission submission decision policy • Control of parallelism GRAM submission GRAM polling • Loss of parallelism queue queue waiting queue residency Loss of parallelism restart job failure job cancel Load imbalance service latency resource broker Serialization performance prediction scheduler Replicated job security execution management loss of parallelism Data transfer load imbalance serialisation replicated job Activity data transfer file staging stage in Parallel processing overheads access to database stage out External load third party data transfer input from user data transfer imbalance activity Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. parallel overheads external load 23
  • 24. Current status of the implementation SOA-based Monitoring, Instrumentation and Analysis services are GT4 based XML-based for performance data representations and requests Monitoring and Instrumentation Services gSOAP, GSI-based dynamic instrumentation service Java-based dynamic instrumentation OCM-G But we need to integrate them into a single framework for workflows and it is a non trivial task Analysis services GT4-based with distributed components Simple language for workflow analysis request (WARL), designed based on WfPerfOnto Metrics are described in XML Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 24
  • 25. Example: Dynamic Instrumentation Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 25
  • 26. Snapshot: Online System Monitoring Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 26
  • 27. Rule-based Monitoring Sensors use rules to analyze monitoring data Rules are based on IBM ABLE toolkit Example: A network path in the Austrian Grid, bandwidth < 5MB/s (based on Iperf) Define a fuzzy variable with VERY LOW, LOW, MEDUM, HIGH, VERY HIGH Fuzzy rules: Events send when bandwidth VERY LOW or VERY HIGH Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 27
  • 28. Online Infrastructure Monitoring Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 28
  • 29. DAG-based workflow monitoring and analysis Online application profiling Monitoring data of workflow activity execution Monitoring data of computational node Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 29
  • 30. K-WfGrid: Online Workflow Monitoring and Analysis Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 30
  • 31. Online Workflow Analysis Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 31
  • 32. Conclusions The architecture of monitoring and analysis service must tackle the dynamics and the diversity of the Grid Service-oriented, peer-to-peer model, adaptive sensors Integration and reuse are important issues Loosely coupled and tightly coupled Do not neglect data representations and service interfaces Performance metrics and ontology for Grid workflows What performance metrics are important and how to measure them Common and generic concepts and relationships monitored and alayzed Given well-defined service interfaces, data representation, performance metrics and ontology Simplify the integration among components Towards automatic, intelligent and distributed performance performance analysis UIBK perf. work: http://dps.uibk.ac.at/projects/pma/ Papers: http://dps.uibk.ac.at/index.pl/publications Hong-Linh Truong, Dagstuhl seminar on Automatic Performance Analysis, 15 December 2005. 32