SlideShare a Scribd company logo
1 of 27
Scheduling In Distributed Systems
          Candidacy exam


                              Andrii Vozniuk
                              EPFL
                              July 4, 2012
Big Data
       Data explosion
       Processing gets more complicated




          Generates: 25 TB/day       Generates: 40 TB/day
          Stores:    10 PB/year      Stores:    20 PB/year

            Resources of many computers should be used
    2
Typical Data Processing Pipeline


                     Log              Sensor
                     data              data


ETL-like batch      Clean            Analyze        Using resources of
 processing         data              data          many organizations

                                  Particle found!
Efficient query     Query
  execution         data


                  User model

           No one-size-fits-all system currently exists
 3
Outline
    Ɣ Gamma - parallel database
        MapReduce - data-intensive system

        Condor - compute-intensive system

 Conclusions
 Future Research




4
Scheduling In Distributed Systems
       Scheduling
           Policy: setting an ordering of tasks                            task
                                                       task
           Assigning resources to tasks
                                                       task
                                                       task


                                        How to match resources and tasks?




              Scheduling is challenging in distributed systems
    5
Matching Tasks With Resources
       Perspectives
           Data model
           Execution model


             System/Perspecti   Data model      Execution model
             ve
             Gamma              Relational      Multioperator
             MapReduce          Unconstrained   MapReduce
             Condor             Unconstrained   Unconstrained




            How scheduling is influenced by data and execution
    6                             models?
Gamma                                                Ɣ
       Pioneering parallel database
       Data model: constrained
           Relational data model
           Relations are horizontally partitioned
       Execution model: constrained
           Multioperator queries
           Operators employ hash-based algorithms




    7
Gamma: Scheduler                                                         Ɣ
SELECT r FROM R      Query                                   Host
WHERE r < ‘k’ query Manager          Catalog
                                                             Machine

                                                             Gamma
       Optimizes query                                Schedules
                                  Scheduler                   Database
       Compiles plan                                  operators
                                   Process


                          Operator            Operator
               Node 1     Process              Process        Node 2
         Execution on
         relevant nodes     a-m                 n-z



          Scheduling is done at the operator level
 8
Gamma: Batch Scheduling                                           Ɣ
       Exploit sharing by scheduling in a batch
       Example of selection sharing


                σ1      σ2            σ1       σ2
                                                    Shared scan

                A       A                  A



       Reads of A can be shared applying predicates in turn
       Shared relation A is scanned only once


              Batch scheduling trades latency for throughput
    9
Gamma: Batch Scheduling Joins                                           Ɣ
    Several hash-joins in a batch of queries
    Hash table for the same relation can be shared
    Example assumes 100% selectivity of σ
                                                      Shared hash-table for A


             ⋈            ⋈                   ⋈        ⋈

         σ       σ    σ       σ           σ       σ     σ

         A       Β    A       C           B       A     C


    Sharing reduces I/O and memory usage

             Sharing among joins reduces total execution time
    10
Limitations Of Gamma                                           Ɣ
    Gamma offers
        Efficient query execution
        Sharing in a batch of queries
    Gamma operates on structured data
    Gamma is not suitable for
        Unstructured data processing
        ETL type of workload
        Running on large scale




             A different system for ETL processing is needed
    11
MapReduce
    System for data-intensive applications
    Execution model: constrained
        Job is a set of map and reduce tasks
        Tasks are independent
    Data model: unconstrained
        Arbitrary data format
        Files are partitioned into chunks
        Each chunk is replicated several times




    12
MapReduce: Scheduling
                                    Map
                                    Reduc             Map
                                     1e                2
          Example:
                          Chunk1            Chunk2
         MapReduce job
                          Result1
                          Temp1             Temp2
         4 Map tasks

         2 Reduce task              Map               Reduc
                                                      Map
                                     3                 4e
                          Chunk3            Chunk4
                          Temp3             Result2
                                            Temp4
    Tasks are scheduled close to data
    Execution is scalable and fault-tolerant
    Execution is elastic
           Fine grain scheduling improves fault tolerance and
    13                          elasticity
MapReduce: Speculative Execution
    Nodes may become slow
    Speculative execution minimizes job’s response time
    Launch if progress is 20% less than average
                                        backup
          Normal node


                            straggler

Temporary slow node




         Speculative execution works well in homogeneous
    14                     environment
Emerging Heterogeneous Infrastructures
    Replacement of failed components
    Extending existing cluster with new machines
    Virtualized data centers of cloud providers
        CPU and RAM are isolated
        Contention for disk and network
              IO Performance per




                                   60
                  VM (MB/s)




                                   40

                                   20

                                   0
                                        1   2     3      4      5      6   7
                                                VMs on Physical Host

In many real-life cases the infrastructure is heterogeneous
    15
MapReduce: Heterogeneous Cluster
    Fast node




Slow node



    Performance degrades on heterogeneous cluster
        Slow nodes are wasted
        Backup tasks on slow nodes
        All straggling tasks are treated equally
        Thrashing due to excessive speculative execution

     Speculative execution should be improved for heterogeneous
    16                         cluster
MapReduce: LATE Scheduler
    Idea: back up the task with the largest estimated finish
     time (Longest Approximate Time to End)
                                          progress score
                      progress rate =
                                          execution time

                                         1 – progress score
                estimated time left =
                                           progress rate
    Thresholds
        Limit the number of backup tasks
        Launch backup tasks on fast nodes
        Backup only sufficiently slow tasks
         LATE looks forward to prioritize tasks to speculate
    17
MapReduce: LATE Example
   Back up the task with Longest Approximate Time to End
                                   2 min

1                                                 Estimated time left:
                                                  (1-0.66) / (1/3) = 1
     1 task/min

2                 Progress = 66%
                                                  Estimated time left:
                                                  (1-0.05) / (1/1.9) = 1.8
     3x slower
                            Progress = 5.3%
3
    1.9x slower


                               Time (min)     improvement

LATE correctly identifies task which hurts the response time the
18                             most
Limitations Of MapReduce
    MapReduce offers
        High scalability
        Good fault tolerance
        Handling of unstructured data
    MapReduce is not suitable for
        Running on multi organization infrastructure
        Harvesting idle resources in organization




     A different system for multi organization infrastructure is
    19                       needed
Condor
    Compute-intensive system harvesting idle resources
    Data model: arbitrary
    Execution model: arbitrary
                           How to increase utilization
                           and respect the owners?




                                          job

                                          job
                                                              job
                                          job
       Increase resources utilization by scheduling jobs on idle
    20                         machines
Condor Scheduler: Centralized?
                         Scheduler




                                     job

                                     job
                                                       job
                                     job
     Efficient but not reliable, possible bottleneck
21
Condor Scheduler: Distributed?
                                            Scheduler


     Scheduler




                                            Scheduler

                       Scheduler



                                   job

                                   job
                                                 job
                                   job
                 Reliable but inefficient
22
Condor Scheduler: Hybrid!

Information about tasks            Matchmaker           Information about nodes

      Scheduler           1
                              3                          1
                                          1
                                                    2
                                              3                     Scheduler

                              Scheduler


                              4
                                                  job

                                                  job
                                                                          job
                                                  job
            Hybrid approach has the best of both worlds
 23
ClassAds: Describing Jobs and Resources
          Job Description          Machine Description

          [MyType=“Job”            [MyType=“Machine“
          TargetType = “Machine“   TargetType=“Job“
          Department=“CompSci“     Machine=“nostos.cs.wisc.edu“
          Requirements =           OpSys=“LINUX“
          (other.OpSys==LINUX &&   Disk=3076077
          other.Disk > 10000000)   Requirement = (LoadAvg <= 0.3) &&
          Rank=Memory]             (KeyboardIdle > (15*60))
                                   Rank =
                                   other.Department==self.Department]
    Requirements should be satisfied
    Candidate with the highest rank is returned
         Matchmaker is suitable for heterogeneous shared clusters
    24
Conclusions
    Scheduling done at different levels
        Gamma: operator level scheduling enables sharing
        MR and Condor: arbitrary code => sharing is hard
        Condor: matchmaking gives control on job placement

    Hybrid approaches are promising for big data processing
    Scheduling in heterogeneous deployments is challenging




    25
Thank you for your attention!

        Feedback & Question?
        Andrii.Vozniuk@epfl.ch




26
References
    Matchmaking: Distributed Resource Management for
     High Throughput Computing by Rajesh Raman, Miron
     Livny and Marvin Solomon.
    Batch Scheduling in Parallel Database Systems by Manish
     Mehta, Valery Soloviev and David J. DeWitt.
    Improving MapReduce performance in heterogeneous
     environments by Matei Zaharia, Andy Konwinski, Anthony
     D. Joseph, Randy Katz and Ion Stoica
    Slides 14 and 18 exploit presentation ideas from the LATE
     slides for OSDI 2008 by Matei Zaharia


    27

More Related Content

What's hot

File models and file accessing models
File models and file accessing modelsFile models and file accessing models
File models and file accessing modelsishmecse13
 
Lecture 3 parallel programming platforms
Lecture 3   parallel programming platformsLecture 3   parallel programming platforms
Lecture 3 parallel programming platformsVajira Thambawita
 
Lecture 1 introduction to parallel and distributed computing
Lecture 1   introduction to parallel and distributed computingLecture 1   introduction to parallel and distributed computing
Lecture 1 introduction to parallel and distributed computingVajira Thambawita
 
Flynns classification
Flynns classificationFlynns classification
Flynns classificationYasir Khan
 
Fault tolerance in distributed systems
Fault tolerance in distributed systemsFault tolerance in distributed systems
Fault tolerance in distributed systemssumitjain2013
 
Synchronization in distributed computing
Synchronization in distributed computingSynchronization in distributed computing
Synchronization in distributed computingSVijaylakshmi
 
Paging and Segmentation in Operating System
Paging and Segmentation in Operating SystemPaging and Segmentation in Operating System
Paging and Segmentation in Operating SystemRaj Mohan
 
Database , 8 Query Optimization
Database , 8 Query OptimizationDatabase , 8 Query Optimization
Database , 8 Query OptimizationAli Usman
 
8 memory management strategies
8 memory management strategies8 memory management strategies
8 memory management strategiesDr. Loganathan R
 

What's hot (20)

11. dfs
11. dfs11. dfs
11. dfs
 
Memory Hierarchy
Memory HierarchyMemory Hierarchy
Memory Hierarchy
 
File models and file accessing models
File models and file accessing modelsFile models and file accessing models
File models and file accessing models
 
6.distributed shared memory
6.distributed shared memory6.distributed shared memory
6.distributed shared memory
 
Naming in Distributed System
Naming in Distributed SystemNaming in Distributed System
Naming in Distributed System
 
Lecture 3 parallel programming platforms
Lecture 3   parallel programming platformsLecture 3   parallel programming platforms
Lecture 3 parallel programming platforms
 
Lecture 1 introduction to parallel and distributed computing
Lecture 1   introduction to parallel and distributed computingLecture 1   introduction to parallel and distributed computing
Lecture 1 introduction to parallel and distributed computing
 
Mainframe systems
Mainframe systemsMainframe systems
Mainframe systems
 
Flynns classification
Flynns classificationFlynns classification
Flynns classification
 
Fault tolerance in distributed systems
Fault tolerance in distributed systemsFault tolerance in distributed systems
Fault tolerance in distributed systems
 
Synchronization in distributed computing
Synchronization in distributed computingSynchronization in distributed computing
Synchronization in distributed computing
 
Paging and Segmentation in Operating System
Paging and Segmentation in Operating SystemPaging and Segmentation in Operating System
Paging and Segmentation in Operating System
 
Database , 8 Query Optimization
Database , 8 Query OptimizationDatabase , 8 Query Optimization
Database , 8 Query Optimization
 
Linux Memory Management
Linux Memory ManagementLinux Memory Management
Linux Memory Management
 
SCHEDULING ALGORITHMS
SCHEDULING ALGORITHMSSCHEDULING ALGORITHMS
SCHEDULING ALGORITHMS
 
8 memory management strategies
8 memory management strategies8 memory management strategies
8 memory management strategies
 
Memory management
Memory managementMemory management
Memory management
 
Process Scheduling
Process SchedulingProcess Scheduling
Process Scheduling
 
memory hierarchy
memory hierarchymemory hierarchy
memory hierarchy
 
Lec 7 query processing
Lec 7 query processingLec 7 query processing
Lec 7 query processing
 

Similar to Scheduling in distributed systems - Andrii Vozniuk

Hadoop Network Performance profile
Hadoop Network Performance profileHadoop Network Performance profile
Hadoop Network Performance profilepramodbiligiri
 
The Performance of MapReduce: An In-depth Study
The Performance of MapReduce: An In-depth StudyThe Performance of MapReduce: An In-depth Study
The Performance of MapReduce: An In-depth StudyKevin Tong
 
Next generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph labNext generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph labImpetus Technologies
 
Architecting and productionising data science applications at scale
Architecting and productionising data science applications at scaleArchitecting and productionising data science applications at scale
Architecting and productionising data science applications at scalesamthemonad
 
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...Reynold Xin
 
Hanborq Optimizations on Hadoop MapReduce
Hanborq Optimizations on Hadoop MapReduceHanborq Optimizations on Hadoop MapReduce
Hanborq Optimizations on Hadoop MapReduceHanborq Inc.
 
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...Yahoo Developer Network
 
Hanborq optimizations on hadoop map reduce 20120221a
Hanborq optimizations on hadoop map reduce 20120221aHanborq optimizations on hadoop map reduce 20120221a
Hanborq optimizations on hadoop map reduce 20120221aSchubert Zhang
 
MapReduce Paradigm
MapReduce ParadigmMapReduce Paradigm
MapReduce ParadigmDilip Reddy
 
MapReduce Paradigm
MapReduce ParadigmMapReduce Paradigm
MapReduce ParadigmDilip Reddy
 
Взгляд на облака с точки зрения HPC
Взгляд на облака с точки зрения HPCВзгляд на облака с точки зрения HPC
Взгляд на облака с точки зрения HPCOlga Lavrentieva
 
Strata + Hadoop World 2012: Knitting Boar
Strata + Hadoop World 2012: Knitting BoarStrata + Hadoop World 2012: Knitting Boar
Strata + Hadoop World 2012: Knitting BoarCloudera, Inc.
 
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARNHadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARNJosh Patterson
 
High Performance Computing - Cloud Point of View
High Performance Computing - Cloud Point of ViewHigh Performance Computing - Cloud Point of View
High Performance Computing - Cloud Point of Viewaragozin
 
Parallel Data Processing with MapReduce: A Survey
Parallel Data Processing with MapReduce: A SurveyParallel Data Processing with MapReduce: A Survey
Parallel Data Processing with MapReduce: A SurveyKyong-Ha Lee
 
Ling liu part 02:big graph processing
Ling liu part 02:big graph processingLing liu part 02:big graph processing
Ling liu part 02:big graph processingjins0618
 
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...areej qasrawi
 
Seminar Presentation Hadoop
Seminar Presentation HadoopSeminar Presentation Hadoop
Seminar Presentation HadoopVarun Narang
 

Similar to Scheduling in distributed systems - Andrii Vozniuk (20)

Hadoop Network Performance profile
Hadoop Network Performance profileHadoop Network Performance profile
Hadoop Network Performance profile
 
The Performance of MapReduce: An In-depth Study
The Performance of MapReduce: An In-depth StudyThe Performance of MapReduce: An In-depth Study
The Performance of MapReduce: An In-depth Study
 
Spark
SparkSpark
Spark
 
Next generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph labNext generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph lab
 
Architecting and productionising data science applications at scale
Architecting and productionising data science applications at scaleArchitecting and productionising data science applications at scale
Architecting and productionising data science applications at scale
 
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
 
Hanborq Optimizations on Hadoop MapReduce
Hanborq Optimizations on Hadoop MapReduceHanborq Optimizations on Hadoop MapReduce
Hanborq Optimizations on Hadoop MapReduce
 
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
 
Hanborq optimizations on hadoop map reduce 20120221a
Hanborq optimizations on hadoop map reduce 20120221aHanborq optimizations on hadoop map reduce 20120221a
Hanborq optimizations on hadoop map reduce 20120221a
 
MapReduce Paradigm
MapReduce ParadigmMapReduce Paradigm
MapReduce Paradigm
 
MapReduce Paradigm
MapReduce ParadigmMapReduce Paradigm
MapReduce Paradigm
 
Взгляд на облака с точки зрения HPC
Взгляд на облака с точки зрения HPCВзгляд на облака с точки зрения HPC
Взгляд на облака с точки зрения HPC
 
Hadoop at JavaZone 2010
Hadoop at JavaZone 2010Hadoop at JavaZone 2010
Hadoop at JavaZone 2010
 
Strata + Hadoop World 2012: Knitting Boar
Strata + Hadoop World 2012: Knitting BoarStrata + Hadoop World 2012: Knitting Boar
Strata + Hadoop World 2012: Knitting Boar
 
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARNHadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
Hadoop Summit EU 2013: Parallel Linear Regression, IterativeReduce, and YARN
 
High Performance Computing - Cloud Point of View
High Performance Computing - Cloud Point of ViewHigh Performance Computing - Cloud Point of View
High Performance Computing - Cloud Point of View
 
Parallel Data Processing with MapReduce: A Survey
Parallel Data Processing with MapReduce: A SurveyParallel Data Processing with MapReduce: A Survey
Parallel Data Processing with MapReduce: A Survey
 
Ling liu part 02:big graph processing
Ling liu part 02:big graph processingLing liu part 02:big graph processing
Ling liu part 02:big graph processing
 
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...
 
Seminar Presentation Hadoop
Seminar Presentation HadoopSeminar Presentation Hadoop
Seminar Presentation Hadoop
 

More from Andrii Vozniuk

Enhancing Social Media Platforms for Educational and Humanitarian Knowledge S...
Enhancing Social Media Platforms for Educational and Humanitarian Knowledge S...Enhancing Social Media Platforms for Educational and Humanitarian Knowledge S...
Enhancing Social Media Platforms for Educational and Humanitarian Knowledge S...Andrii Vozniuk
 
Embedded interactive learning analytics dashboards with Elasticsearch and Kib...
Embedded interactive learning analytics dashboards with Elasticsearch and Kib...Embedded interactive learning analytics dashboards with Elasticsearch and Kib...
Embedded interactive learning analytics dashboards with Elasticsearch and Kib...Andrii Vozniuk
 
Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...
Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...
Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...Andrii Vozniuk
 
Combining content analytics and activity tracking to mine user interests and ...
Combining content analytics and activity tracking to mine user interests and ...Combining content analytics and activity tracking to mine user interests and ...
Combining content analytics and activity tracking to mine user interests and ...Andrii Vozniuk
 
TPC-DS performance evaluation for JAQL and PIG queries - Andrii Vozniuk, Serg...
TPC-DS performance evaluation for JAQL and PIG queries - Andrii Vozniuk, Serg...TPC-DS performance evaluation for JAQL and PIG queries - Andrii Vozniuk, Serg...
TPC-DS performance evaluation for JAQL and PIG queries - Andrii Vozniuk, Serg...Andrii Vozniuk
 
Contextual learning analytics apps to create awareness in blended inquiry lea...
Contextual learning analytics apps to create awareness in blended inquiry lea...Contextual learning analytics apps to create awareness in blended inquiry lea...
Contextual learning analytics apps to create awareness in blended inquiry lea...Andrii Vozniuk
 
Graspeo: a Social Media Platform for Knowledge Management in NGOs - Andrii Vo...
Graspeo: a Social Media Platform for Knowledge Management in NGOs - Andrii Vo...Graspeo: a Social Media Platform for Knowledge Management in NGOs - Andrii Vo...
Graspeo: a Social Media Platform for Knowledge Management in NGOs - Andrii Vo...Andrii Vozniuk
 
Towards portable learning analytics dashboards - Andrii Vozniuk, Sten Govaert...
Towards portable learning analytics dashboards - Andrii Vozniuk, Sten Govaert...Towards portable learning analytics dashboards - Andrii Vozniuk, Sten Govaert...
Towards portable learning analytics dashboards - Andrii Vozniuk, Sten Govaert...Andrii Vozniuk
 
AngeLA: Putting the teacher in control of student privacy in the online class...
AngeLA: Putting the teacher in control of student privacy in the online class...AngeLA: Putting the teacher in control of student privacy in the online class...
AngeLA: Putting the teacher in control of student privacy in the online class...Andrii Vozniuk
 
Cloud infrastructure. Google File System and MapReduce - Andrii Vozniuk
Cloud infrastructure. Google File System and MapReduce - Andrii VozniukCloud infrastructure. Google File System and MapReduce - Andrii Vozniuk
Cloud infrastructure. Google File System and MapReduce - Andrii VozniukAndrii Vozniuk
 
Symbolic Reasoning and Concrete Execution - Andrii Vozniuk
Symbolic Reasoning and Concrete Execution - Andrii Vozniuk Symbolic Reasoning and Concrete Execution - Andrii Vozniuk
Symbolic Reasoning and Concrete Execution - Andrii Vozniuk Andrii Vozniuk
 

More from Andrii Vozniuk (11)

Enhancing Social Media Platforms for Educational and Humanitarian Knowledge S...
Enhancing Social Media Platforms for Educational and Humanitarian Knowledge S...Enhancing Social Media Platforms for Educational and Humanitarian Knowledge S...
Enhancing Social Media Platforms for Educational and Humanitarian Knowledge S...
 
Embedded interactive learning analytics dashboards with Elasticsearch and Kib...
Embedded interactive learning analytics dashboards with Elasticsearch and Kib...Embedded interactive learning analytics dashboards with Elasticsearch and Kib...
Embedded interactive learning analytics dashboards with Elasticsearch and Kib...
 
Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...
Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...
Interactive learning analytics dashboards with ELK (Elasticsearch Logstash Ki...
 
Combining content analytics and activity tracking to mine user interests and ...
Combining content analytics and activity tracking to mine user interests and ...Combining content analytics and activity tracking to mine user interests and ...
Combining content analytics and activity tracking to mine user interests and ...
 
TPC-DS performance evaluation for JAQL and PIG queries - Andrii Vozniuk, Serg...
TPC-DS performance evaluation for JAQL and PIG queries - Andrii Vozniuk, Serg...TPC-DS performance evaluation for JAQL and PIG queries - Andrii Vozniuk, Serg...
TPC-DS performance evaluation for JAQL and PIG queries - Andrii Vozniuk, Serg...
 
Contextual learning analytics apps to create awareness in blended inquiry lea...
Contextual learning analytics apps to create awareness in blended inquiry lea...Contextual learning analytics apps to create awareness in blended inquiry lea...
Contextual learning analytics apps to create awareness in blended inquiry lea...
 
Graspeo: a Social Media Platform for Knowledge Management in NGOs - Andrii Vo...
Graspeo: a Social Media Platform for Knowledge Management in NGOs - Andrii Vo...Graspeo: a Social Media Platform for Knowledge Management in NGOs - Andrii Vo...
Graspeo: a Social Media Platform for Knowledge Management in NGOs - Andrii Vo...
 
Towards portable learning analytics dashboards - Andrii Vozniuk, Sten Govaert...
Towards portable learning analytics dashboards - Andrii Vozniuk, Sten Govaert...Towards portable learning analytics dashboards - Andrii Vozniuk, Sten Govaert...
Towards portable learning analytics dashboards - Andrii Vozniuk, Sten Govaert...
 
AngeLA: Putting the teacher in control of student privacy in the online class...
AngeLA: Putting the teacher in control of student privacy in the online class...AngeLA: Putting the teacher in control of student privacy in the online class...
AngeLA: Putting the teacher in control of student privacy in the online class...
 
Cloud infrastructure. Google File System and MapReduce - Andrii Vozniuk
Cloud infrastructure. Google File System and MapReduce - Andrii VozniukCloud infrastructure. Google File System and MapReduce - Andrii Vozniuk
Cloud infrastructure. Google File System and MapReduce - Andrii Vozniuk
 
Symbolic Reasoning and Concrete Execution - Andrii Vozniuk
Symbolic Reasoning and Concrete Execution - Andrii Vozniuk Symbolic Reasoning and Concrete Execution - Andrii Vozniuk
Symbolic Reasoning and Concrete Execution - Andrii Vozniuk
 

Recently uploaded

Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 

Recently uploaded (20)

Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 

Scheduling in distributed systems - Andrii Vozniuk

  • 1. Scheduling In Distributed Systems Candidacy exam  Andrii Vozniuk  EPFL  July 4, 2012
  • 2. Big Data  Data explosion  Processing gets more complicated Generates: 25 TB/day Generates: 40 TB/day Stores: 10 PB/year Stores: 20 PB/year Resources of many computers should be used 2
  • 3. Typical Data Processing Pipeline Log Sensor data data ETL-like batch Clean Analyze Using resources of processing data data many organizations Particle found! Efficient query Query execution data User model No one-size-fits-all system currently exists 3
  • 4. Outline Ɣ Gamma - parallel database MapReduce - data-intensive system Condor - compute-intensive system Conclusions Future Research 4
  • 5. Scheduling In Distributed Systems  Scheduling  Policy: setting an ordering of tasks task task  Assigning resources to tasks task task How to match resources and tasks? Scheduling is challenging in distributed systems 5
  • 6. Matching Tasks With Resources  Perspectives  Data model  Execution model System/Perspecti Data model Execution model ve Gamma Relational Multioperator MapReduce Unconstrained MapReduce Condor Unconstrained Unconstrained How scheduling is influenced by data and execution 6 models?
  • 7. Gamma Ɣ  Pioneering parallel database  Data model: constrained  Relational data model  Relations are horizontally partitioned  Execution model: constrained  Multioperator queries  Operators employ hash-based algorithms 7
  • 8. Gamma: Scheduler Ɣ SELECT r FROM R Query Host WHERE r < ‘k’ query Manager Catalog Machine Gamma Optimizes query Schedules Scheduler Database Compiles plan operators Process Operator Operator Node 1 Process Process Node 2 Execution on relevant nodes a-m n-z Scheduling is done at the operator level 8
  • 9. Gamma: Batch Scheduling Ɣ  Exploit sharing by scheduling in a batch  Example of selection sharing σ1 σ2 σ1 σ2 Shared scan A A A  Reads of A can be shared applying predicates in turn  Shared relation A is scanned only once Batch scheduling trades latency for throughput 9
  • 10. Gamma: Batch Scheduling Joins Ɣ  Several hash-joins in a batch of queries  Hash table for the same relation can be shared  Example assumes 100% selectivity of σ Shared hash-table for A ⋈ ⋈ ⋈ ⋈ σ σ σ σ σ σ σ A Β A C B A C  Sharing reduces I/O and memory usage Sharing among joins reduces total execution time 10
  • 11. Limitations Of Gamma Ɣ  Gamma offers  Efficient query execution  Sharing in a batch of queries  Gamma operates on structured data  Gamma is not suitable for  Unstructured data processing  ETL type of workload  Running on large scale A different system for ETL processing is needed 11
  • 12. MapReduce  System for data-intensive applications  Execution model: constrained  Job is a set of map and reduce tasks  Tasks are independent  Data model: unconstrained  Arbitrary data format  Files are partitioned into chunks  Each chunk is replicated several times 12
  • 13. MapReduce: Scheduling Map Reduc Map 1e 2 Example: Chunk1 Chunk2 MapReduce job Result1 Temp1 Temp2 4 Map tasks 2 Reduce task Map Reduc Map 3 4e Chunk3 Chunk4 Temp3 Result2 Temp4  Tasks are scheduled close to data  Execution is scalable and fault-tolerant  Execution is elastic Fine grain scheduling improves fault tolerance and 13 elasticity
  • 14. MapReduce: Speculative Execution  Nodes may become slow  Speculative execution minimizes job’s response time  Launch if progress is 20% less than average backup Normal node straggler Temporary slow node Speculative execution works well in homogeneous 14 environment
  • 15. Emerging Heterogeneous Infrastructures  Replacement of failed components  Extending existing cluster with new machines  Virtualized data centers of cloud providers  CPU and RAM are isolated  Contention for disk and network IO Performance per 60 VM (MB/s) 40 20 0 1 2 3 4 5 6 7 VMs on Physical Host In many real-life cases the infrastructure is heterogeneous 15
  • 16. MapReduce: Heterogeneous Cluster Fast node Slow node  Performance degrades on heterogeneous cluster  Slow nodes are wasted  Backup tasks on slow nodes  All straggling tasks are treated equally  Thrashing due to excessive speculative execution Speculative execution should be improved for heterogeneous 16 cluster
  • 17. MapReduce: LATE Scheduler  Idea: back up the task with the largest estimated finish time (Longest Approximate Time to End) progress score progress rate = execution time 1 – progress score estimated time left = progress rate  Thresholds  Limit the number of backup tasks  Launch backup tasks on fast nodes  Backup only sufficiently slow tasks LATE looks forward to prioritize tasks to speculate 17
  • 18. MapReduce: LATE Example  Back up the task with Longest Approximate Time to End 2 min 1 Estimated time left: (1-0.66) / (1/3) = 1 1 task/min 2 Progress = 66% Estimated time left: (1-0.05) / (1/1.9) = 1.8 3x slower Progress = 5.3% 3 1.9x slower Time (min) improvement LATE correctly identifies task which hurts the response time the 18 most
  • 19. Limitations Of MapReduce  MapReduce offers  High scalability  Good fault tolerance  Handling of unstructured data  MapReduce is not suitable for  Running on multi organization infrastructure  Harvesting idle resources in organization A different system for multi organization infrastructure is 19 needed
  • 20. Condor  Compute-intensive system harvesting idle resources  Data model: arbitrary  Execution model: arbitrary How to increase utilization and respect the owners? job job job job Increase resources utilization by scheduling jobs on idle 20 machines
  • 21. Condor Scheduler: Centralized? Scheduler job job job job Efficient but not reliable, possible bottleneck 21
  • 22. Condor Scheduler: Distributed? Scheduler Scheduler Scheduler Scheduler job job job job Reliable but inefficient 22
  • 23. Condor Scheduler: Hybrid! Information about tasks Matchmaker Information about nodes Scheduler 1 3 1 1 2 3 Scheduler Scheduler 4 job job job job Hybrid approach has the best of both worlds 23
  • 24. ClassAds: Describing Jobs and Resources Job Description Machine Description [MyType=“Job” [MyType=“Machine“ TargetType = “Machine“ TargetType=“Job“ Department=“CompSci“ Machine=“nostos.cs.wisc.edu“ Requirements = OpSys=“LINUX“ (other.OpSys==LINUX && Disk=3076077 other.Disk > 10000000) Requirement = (LoadAvg <= 0.3) && Rank=Memory] (KeyboardIdle > (15*60)) Rank = other.Department==self.Department]  Requirements should be satisfied  Candidate with the highest rank is returned Matchmaker is suitable for heterogeneous shared clusters 24
  • 25. Conclusions  Scheduling done at different levels  Gamma: operator level scheduling enables sharing  MR and Condor: arbitrary code => sharing is hard  Condor: matchmaking gives control on job placement  Hybrid approaches are promising for big data processing  Scheduling in heterogeneous deployments is challenging 25
  • 26. Thank you for your attention! Feedback & Question? Andrii.Vozniuk@epfl.ch 26
  • 27. References  Matchmaking: Distributed Resource Management for High Throughput Computing by Rajesh Raman, Miron Livny and Marvin Solomon.  Batch Scheduling in Parallel Database Systems by Manish Mehta, Valery Soloviev and David J. DeWitt.  Improving MapReduce performance in heterogeneous environments by Matei Zaharia, Andy Konwinski, Anthony D. Joseph, Randy Katz and Ion Stoica  Slides 14 and 18 exploit presentation ideas from the LATE slides for OSDI 2008 by Matei Zaharia 27