SlideShare a Scribd company logo
1 of 57
Energy Efficient Data
    Storage Systems

         Xiao Qin
Department of Computer Science
     and Software Engineering
         Auburn University
http://www.eng.auburn.edu/~xqin
        xqin@auburn.edu
Investigators



Ziliang Zong             Adam Manzanares




Xiaojun Ruan                  Shu Yin
                    2
Data-Intensive Applications



  Stream Multimedia        Bioinformatic




     3D Graphic           Weather Forecast


                      3
Cluster Computing in
    Data Centers




       Data Centers

            4
Computing and Storage
           Nodes in a Cluster

                                             Storage Node
                       Head            (or Storage Area Network)
           Internet




                       Node
Client




                      Network switch




         Computing Nodes
Clusters in Our Lab at Auburn




              6
Energy Consumption was
        Growing




EPA Report to Congress on Server and Data Center Energy Efficiency, 2007



                                   7
2020 Projections

   Data Center:
    increases by 200%



       Clients:
number – increases by 800%
Power – increases by 300%


       Network:
    Increases by 300%
Energy Efficiency of Data
            Centers
Data Centers consume 110 Billion kWh per Year
Average cost: ¢9.46 per kWh


                 Storage
                   37%
      Other, 6             Dell’s Texas Data Center
        3%



                               Storage system may
                              cost 2.8 Billion Dollars!
Build Energy-Efficient Data
         Centers
Energy Conservation
    Techniques
Energy Efficient Devices
Multiple Design Goals

    Performance             Energy Efficiency




                       High-
                   Performance
                    Computing
                     Platforms




     Reliability                 Security
DVS – Dynamic Voltage Scaling




       Trade performance for
          energy efficiency
               14
Energy-Aware Scheduling for Clusters




Z.-L. Zong, X.-J. Ruan, A. Manzanares, and X. Qin, “EAD and PEBD: Two Energy-
Aware Duplication Scheduling Algorithms for Parallel Tasks on Homogeneous
Clusters,” IEEE Transactions on Computers, vol. 60, no. 3, pp. 360- 374, March 2011.
Parallel Applications
                                                      Entry
                                                      Task

                                 3        1
                             3                                3
                                                  3
                                                                       2
                3
                    2                4        3                    4

        3
                                 3                                     2
                                                  4


1                                                             7    20
    5                   10       6

                                                              20
    1                   10
                                         10


                    8                             9           5
            7


                    7                                 5

                                              Exit Task
                                 10

                                     8
Energy-Aware Scheduling:
  Motivational Example
Motivational Example (cont.)
The EAD and PEBD Algorithms
                                                     Generate the DAG of given task sets
 Calculate energy increase                                                                                                                 Calculate energy increase
     and time decrease
                                                        Find all the critical paths in DAG

Ratio= energy increase/ time
         decrease                         Generate scheduling queue based on the level (ascending)                                    more_energy<=Threshold?
                                                                                                                       No


                                 No   select the task (has not been scheduled yet) with the lowest level as                                     Yes
    Ratio<=Threshold?
                                                                   starting task




                                                                                                                   meet entry task
                                                                                                                                       Duplicate this task and select
          Yes                                                                                                                           the next task in the same
                                                                                                                                               critical path

Duplicate this task and select                              For each task which is in the
 the next task in the same                            same critical path with starting task, check
        critical path                                          if it is already scheduled                     No

                                                                                                  allocate it to the same
                                                   No                Yes                      processor with the tasks in the
                                                                                                    same critical path

                                                                Save time if duplicate
                                                                      this task?


                                                                    Yes



                             PEBD                                                                                                    EAD
                                                                                                                                                             19
Energy Dissipation in Processors




          http://www.xbitlabs.com
                                    20
Parallel Scientific Applications
                                        T1
                  T1


                                        T2   T3       T4     T5      T6

       T2                    T3

                                             T7



                                             T8       T9     T10     T11
T4          T5         T6         T7


                                                     T12



T8          T9         T10        T11                T13     T14     T15




                                                             T16



T12         T13        T14        T15                        T17     T18




       Fast Fourier Transform                 Gaussian Elimination
                                                                     21
Large-Scale Parallel Applications




      Robot Control                    Sparse Matrix Solver

2013-2-13       http://www.kasahara.elec.waseda.ac.           22

                jp/schedule/
Impact of CPU Power Dissipation
                                          19.4% 3.7%
                         Total Energy Consumption                                                      Total Energy Consumption
                                                          Athlon 4600+                                                                  Athlon 4600+
                40000                                     85W                            40000                                          85W

                35000                                                                    35000

                30000                                                                    30000
                                                          Athlon 4600+                                                                  Athlon 4600+
                                                          65W                                                                           65W
Energy (Joul)




                                                                         Energy (Joul)
                25000                                                                    25000

                20000                                                                    20000

                15000                                     Athlon 3800+                   15000                                          Athlon 3800+
                                                          35W                                                                           35W
                10000                                                                    10000

                 5000                                                                    5000

                    0                                     Intel Core2                        0                                          Intel Core2
                                                          Duo E6300                                                                     Duo E6300
                        EAD        CPU Type
                                PEBD    TDS           Power (busy)
                                                    MCP                                  Power (idle)
                                                                                                EAD           PEBD   GapTDS       MCP



                                                          104w                                   15w                 89w
                Energy consumption for different                                           Energy consumption for different
                                                          75w                                14w           61w
                processors (Gaussian, CCR=0.4)                                               processors (FFT, CCR=0.4)
                                                          47w                                    11w                 36w

                                                          44w                                    26w                 18w

                                  Observation: CPUs with large gap between CPU_busy and                                                 23
                                  CPU_idle can obtain greater energy savings
Performance

                              Schedule Length                                                                  Schedule Length
                  160                                                                             200
                  140                                        TDS                                  180                                    TDS
                  120                                                                             160
                                                                                                  140
  Time Unit (S)




                  100                                        EAD




                                                                                  Time Unit (S)
                                                                                                  120                                    EAD
                   80
                                                                                                  100
                   60                                        PEBD                                  80
                                                                                                   60                                    PEBD
                   40
                   20                                        MCP                                   40
                                                                                                   20
                    0                                                                                                                    MCP
                                                                                                    0
                        0.1    0.5     1      5    10
                                                                                                        0.1      0.5     1     5    10




Schedule length of Gaussian Elimination                                        Schedule length of Sparse Matrix Solver

                                           Application               EAD Performance                          PEBD Performance
                                                                    Degradation (: TDS)                       Degradation (: TDS)
                                      Gaussian Elimination                 5.7%                                        2.2%

                                      Sparse Matrix Solver                2.92%                                        2.02%

                                     Observation: it is worth trading a marginal degradation in schedule                                   24
                                     length for a significant energy savings for cluster systems.
Energy Consumption of Disks




            2/13/2013
Power States of Disks
 Active State: high
 energy consumption




     Active                         Standby
              State transition
                penalty
                             Standby State: low
                             energy consumption

26
A Hard Disk Drive




     A10000RPM Hard Drive may
     take 10.9 seconds to wake up!
27
Parallel Disks




Performance   Energy Efficiency
Put It All Together:
                               Buffer Disk Architecture
Energy-Related Reliability Model                           Prefetching                 Data Partitioning


                                                                               Security Model
      Disk Requests                RAM Buffer         Buffer Disk Controller
                                                                                       Load Balancing


                                                                                     Power Management




                                     m buffer disks                                   n data disks
IBM Ultrastar 36Z15
Transfer Rate       55 MB/s Spin Down Time: TD        1.5 s

Active Power: PA       13.5 W Spin Up Time: TU       10.9 s

Idle Power: PI         10.2 W Spin Down Energy: ED    13 J

Standby Power: PA       2.5 W Spin Up Energy: EU     135 J

Break-Even Time: TBE                                 15.2 S
Prefetching
Buffer Disk

   Disk 1

   Disk 2

   Disk 3
Energy Saving Principles
    Energy Saving Principle One
     ◦ Increase the length and number of idle
       periods larger than the disk break-even
       time TBE

    Energy Saving Principle Two
     ◦ Reduce the number of power-state
       transitions

A. Manzanares, X. Qin, X.-J. Ruan, and S. Yin, “PRE-BUD: Prefetching for Energy-
Efficient Parallel I/O Systems with Buffer Disks,” ACM Transactions on Storage, vol.
7, no. 1, Article 3 June 2011.
Energy Savings Hit Rate 85%




           33
             2/13/2013
State Transitions
Heat-Based Dynamic Data Caching


        buffer   buffer   buffer
        disk     disk     disk

Requests Queue




      35
Heat-Based Dynamic Data Caching
Requests Queue




        buffer   buffer   buffer
        disk     disk     disk




      36
Energy Consumption Results
  Large Reads: average 84.4%
improvement (64MB)

  Small Reads: average 78.77%
improvement (64KB)
                                Energy consumption for large reads




     2/13/2013
                                Energy consumption for small reads
                                                               37
Load Balancing Comparison




            Load balancing comparison for three mapping strategies
2/13/2013                                                            38
Energy Efficient Virtual File
          System
EEVFS Process Flow
Energy Savings
Improving Performance of EEVFS
            Parallel Striping Groups

File 1 Group 1 File 3      File 2   Group 2   File 4

Buffer                     Buffer
         Disk 1   Disk 2            Disk 5    Disk 6
 Disk                       Disk
   Storage Node 1             Storage Node 3


Buffer                     Buffer
         Disk 3   Disk 4            Disk 7    Disk 8
 Disk                       Disk
   Storage Node 2             Storage Node 4
Striping Within a Group

Buffer Disk 2
          1     Disk 1   3 5 7 9      Disk 2   4 6 8   10

                     Storage Node 1



Buffer Disk 2
          1     Disk 3   3 5 7 9      Disk 4   4 6 8   10

                    Storage Node 2


   1 1
   File 1                Group 1         File 2
                                         2
Measured Results




                   2/13/2013
A Parallel Disk System with a
      Write Buffer Disk
Under High Workload
    Conditions

        Data Disks can serve requests
        without buffer disks when
        workload is high
Wakeup Data Disks
 Requests Queue



                  Buffer Disk




47
Energy Savings
Low Workload, UltraStar
Energy Conservation Techniques

Software-Directed Power Management
Dynamic Power Management
Redundancy Technique
Multi- speed Setting


How Reliable Are They?


                49
Tradeoff between Energy
 Efficiency and Reliability




   Example: Disk Spin Up and Down




                 50
MINT
     (MATHEMATICAL RELIABILITY MODELS FOR ENERGY-EFFICIENT PARALLEL DISK SYSTEMS)




                                   Energy Conservation
                                       Techniques




                               Single Disk Reliability Model




                              System-Level Reliability Model




S. Yin et al. “Reliability Analysis for an Energy-Aware RAID System,”
Proc. the 30th IEEE International Performance Computing and
Communications Conference (IPCCC), Nov. 2011.
MINT                          (Single Disk)



                                 Disk Age        Temperature




Frequency     Utilization




            Single Disk Reliability Model



                        Reliability of Single
                                Disk




                             52
MINT
(MATHEMATICAL RELIABILITY MODELS FOR ENERGY-EFFICIENT PARALLEL DISK SYSTEMS)


                                 Access Pattern



                        Energy Conservation Techniques




                           Single Disk Reliability Model




                          System Level Reliability Model



                             Reliability of A Parallel
                                  Disk System
Preliminary Result
Comparison Between PDC and MAID




             AFR Comparison of PDC and MAID
          Access Rate(*104) Impacts on AFR (T=35°C)




                             54
Summary
• Energy-Aware Scheduling

• BUD - Buffer Disk Architecture

• Energy-Efficient File Systems

• Reliability Models for Energy-Efficient
 Storage Systems
Download the presentation slides
 http://www.slideshare.net/xqin74




                    Google: slideshare Xiao Qin
Questions

More Related Content

Viewers also liked

Project 2 how to modify OS/161
Project 2 how to modify OS/161Project 2 how to modify OS/161
Project 2 how to modify OS/161Xiao Qin
 
IPCCC 2012 Conference Program Overview
IPCCC 2012 Conference Program OverviewIPCCC 2012 Conference Program Overview
IPCCC 2012 Conference Program OverviewXiao Qin
 
Reliability Analysis for an Energy-Aware RAID System
Reliability Analysis for an Energy-Aware RAID SystemReliability Analysis for an Energy-Aware RAID System
Reliability Analysis for an Energy-Aware RAID SystemXiao Qin
 
Why Major in Computer Science and Software Engineering at Auburn University?
Why Major in Computer Science and Software Engineering at Auburn University?Why Major in Computer Science and Software Engineering at Auburn University?
Why Major in Computer Science and Software Engineering at Auburn University?Xiao Qin
 
Thermal modeling and management of cluster storage systems xunfei jiang 2014
Thermal modeling and management of cluster storage systems xunfei jiang 2014Thermal modeling and management of cluster storage systems xunfei jiang 2014
Thermal modeling and management of cluster storage systems xunfei jiang 2014Xiao Qin
 
Common grammar mistakes
Common grammar mistakesCommon grammar mistakes
Common grammar mistakesXiao Qin
 
Surviving a group project
Surviving a group projectSurviving a group project
Surviving a group projectXiao Qin
 
Project 2 how to install and compile os161
Project 2 how to install and compile os161Project 2 how to install and compile os161
Project 2 how to install and compile os161Xiao Qin
 
Project 2 How to modify os161: A Manual
Project 2 How to modify os161: A ManualProject 2 How to modify os161: A Manual
Project 2 How to modify os161: A ManualXiao Qin
 
How to add system calls to OS/161
How to add system calls to OS/161How to add system calls to OS/161
How to add system calls to OS/161Xiao Qin
 
COMP2710: Software Construction - Linked list exercises
COMP2710: Software Construction - Linked list exercisesCOMP2710: Software Construction - Linked list exercises
COMP2710: Software Construction - Linked list exercisesXiao Qin
 
Data center specific thermal and energy saving techniques
Data center specific thermal and energy saving techniquesData center specific thermal and energy saving techniques
Data center specific thermal and energy saving techniquesXiao Qin
 
Understanding what our customer wants-slideshare
Understanding what our customer wants-slideshareUnderstanding what our customer wants-slideshare
Understanding what our customer wants-slideshareXiao Qin
 
Performance Evaluation of Traditional Caching Policies on a Large System with...
Performance Evaluation of Traditional Caching Policies on a Large System with...Performance Evaluation of Traditional Caching Policies on a Large System with...
Performance Evaluation of Traditional Caching Policies on a Large System with...Xiao Qin
 
HDFS-HC2: Analysis of Data Placement Strategy based on Computing Power of Nod...
HDFS-HC2: Analysis of Data Placement Strategy based on Computing Power of Nod...HDFS-HC2: Analysis of Data Placement Strategy based on Computing Power of Nod...
HDFS-HC2: Analysis of Data Placement Strategy based on Computing Power of Nod...Xiao Qin
 

Viewers also liked (15)

Project 2 how to modify OS/161
Project 2 how to modify OS/161Project 2 how to modify OS/161
Project 2 how to modify OS/161
 
IPCCC 2012 Conference Program Overview
IPCCC 2012 Conference Program OverviewIPCCC 2012 Conference Program Overview
IPCCC 2012 Conference Program Overview
 
Reliability Analysis for an Energy-Aware RAID System
Reliability Analysis for an Energy-Aware RAID SystemReliability Analysis for an Energy-Aware RAID System
Reliability Analysis for an Energy-Aware RAID System
 
Why Major in Computer Science and Software Engineering at Auburn University?
Why Major in Computer Science and Software Engineering at Auburn University?Why Major in Computer Science and Software Engineering at Auburn University?
Why Major in Computer Science and Software Engineering at Auburn University?
 
Thermal modeling and management of cluster storage systems xunfei jiang 2014
Thermal modeling and management of cluster storage systems xunfei jiang 2014Thermal modeling and management of cluster storage systems xunfei jiang 2014
Thermal modeling and management of cluster storage systems xunfei jiang 2014
 
Common grammar mistakes
Common grammar mistakesCommon grammar mistakes
Common grammar mistakes
 
Surviving a group project
Surviving a group projectSurviving a group project
Surviving a group project
 
Project 2 how to install and compile os161
Project 2 how to install and compile os161Project 2 how to install and compile os161
Project 2 how to install and compile os161
 
Project 2 How to modify os161: A Manual
Project 2 How to modify os161: A ManualProject 2 How to modify os161: A Manual
Project 2 How to modify os161: A Manual
 
How to add system calls to OS/161
How to add system calls to OS/161How to add system calls to OS/161
How to add system calls to OS/161
 
COMP2710: Software Construction - Linked list exercises
COMP2710: Software Construction - Linked list exercisesCOMP2710: Software Construction - Linked list exercises
COMP2710: Software Construction - Linked list exercises
 
Data center specific thermal and energy saving techniques
Data center specific thermal and energy saving techniquesData center specific thermal and energy saving techniques
Data center specific thermal and energy saving techniques
 
Understanding what our customer wants-slideshare
Understanding what our customer wants-slideshareUnderstanding what our customer wants-slideshare
Understanding what our customer wants-slideshare
 
Performance Evaluation of Traditional Caching Policies on a Large System with...
Performance Evaluation of Traditional Caching Policies on a Large System with...Performance Evaluation of Traditional Caching Policies on a Large System with...
Performance Evaluation of Traditional Caching Policies on a Large System with...
 
HDFS-HC2: Analysis of Data Placement Strategy based on Computing Power of Nod...
HDFS-HC2: Analysis of Data Placement Strategy based on Computing Power of Nod...HDFS-HC2: Analysis of Data Placement Strategy based on Computing Power of Nod...
HDFS-HC2: Analysis of Data Placement Strategy based on Computing Power of Nod...
 

Similar to Energy Efficient Data Storage Systems

Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...Yahoo Developer Network
 
Workflow Allocations and Scheduling on IaaS Platforms, from Theory to Practice
Workflow Allocations and Scheduling on IaaS Platforms, from Theory to PracticeWorkflow Allocations and Scheduling on IaaS Platforms, from Theory to Practice
Workflow Allocations and Scheduling on IaaS Platforms, from Theory to PracticeFrederic Desprez
 
22). smlevel energy eff-dynamictaskschedng
22). smlevel energy eff-dynamictaskschedng22). smlevel energy eff-dynamictaskschedng
22). smlevel energy eff-dynamictaskschedngPoornima_Rajanna
 
Deep Implicit Layers: Learning Structured Problems with Neural Networks
Deep Implicit Layers: Learning Structured Problems with Neural NetworksDeep Implicit Layers: Learning Structured Problems with Neural Networks
Deep Implicit Layers: Learning Structured Problems with Neural NetworksSangwoo Mo
 
Workflowsim escience12
Workflowsim escience12Workflowsim escience12
Workflowsim escience12Weiwei Chen
 
IndexedRDD: Efficeint Fine-Grained Updates for RDD's-(Ankur Dave, UC Berkeley)
IndexedRDD: Efficeint Fine-Grained Updates for RDD's-(Ankur Dave, UC Berkeley)IndexedRDD: Efficeint Fine-Grained Updates for RDD's-(Ankur Dave, UC Berkeley)
IndexedRDD: Efficeint Fine-Grained Updates for RDD's-(Ankur Dave, UC Berkeley)Spark Summit
 
Scalable and Order-robust Continual Learning with Additive Parameter Decompos...
Scalable and Order-robust Continual Learning with Additive Parameter Decompos...Scalable and Order-robust Continual Learning with Additive Parameter Decompos...
Scalable and Order-robust Continual Learning with Additive Parameter Decompos...MLAI2
 
Data decomposition techniques
Data decomposition techniquesData decomposition techniques
Data decomposition techniquesMohamed Ramadan
 
U-SQL Query Execution and Performance Basics (SQLBits 2016)
U-SQL Query Execution and Performance Basics (SQLBits 2016)U-SQL Query Execution and Performance Basics (SQLBits 2016)
U-SQL Query Execution and Performance Basics (SQLBits 2016)Michael Rys
 
226 team project-report-manjula kollipara
226 team project-report-manjula kollipara226 team project-report-manjula kollipara
226 team project-report-manjula kolliparaManjula Kollipara
 
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingBERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingMinh Pham
 
ICPE 2022 - Data Challenge
ICPE 2022 - Data ChallengeICPE 2022 - Data Challenge
ICPE 2022 - Data ChallengeLuc Lesoil
 
ICSE2013
ICSE2013ICSE2013
ICSE2013swy351
 
Energy efficient resource management for high-performance clusters
Energy efficient resource management for high-performance clustersEnergy efficient resource management for high-performance clusters
Energy efficient resource management for high-performance clustersXiao Qin
 
Hadoop & MapReduce
Hadoop & MapReduceHadoop & MapReduce
Hadoop & MapReduceNewvewm
 
Android Bootstrap
Android BootstrapAndroid Bootstrap
Android Bootstrapdonnfelker
 
International Refereed Journal of Engineering and Science (IRJES)
International Refereed Journal of Engineering and Science (IRJES)International Refereed Journal of Engineering and Science (IRJES)
International Refereed Journal of Engineering and Science (IRJES)irjes
 
Generalized Pipeline Parallelism for DNN Training
Generalized Pipeline Parallelism for DNN TrainingGeneralized Pipeline Parallelism for DNN Training
Generalized Pipeline Parallelism for DNN TrainingDatabricks
 

Similar to Energy Efficient Data Storage Systems (20)

Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
 
Workflow Allocations and Scheduling on IaaS Platforms, from Theory to Practice
Workflow Allocations and Scheduling on IaaS Platforms, from Theory to PracticeWorkflow Allocations and Scheduling on IaaS Platforms, from Theory to Practice
Workflow Allocations and Scheduling on IaaS Platforms, from Theory to Practice
 
22). smlevel energy eff-dynamictaskschedng
22). smlevel energy eff-dynamictaskschedng22). smlevel energy eff-dynamictaskschedng
22). smlevel energy eff-dynamictaskschedng
 
Deep Implicit Layers: Learning Structured Problems with Neural Networks
Deep Implicit Layers: Learning Structured Problems with Neural NetworksDeep Implicit Layers: Learning Structured Problems with Neural Networks
Deep Implicit Layers: Learning Structured Problems with Neural Networks
 
Parallel Processing Concepts
Parallel Processing Concepts Parallel Processing Concepts
Parallel Processing Concepts
 
Workflowsim escience12
Workflowsim escience12Workflowsim escience12
Workflowsim escience12
 
IndexedRDD: Efficeint Fine-Grained Updates for RDD's-(Ankur Dave, UC Berkeley)
IndexedRDD: Efficeint Fine-Grained Updates for RDD's-(Ankur Dave, UC Berkeley)IndexedRDD: Efficeint Fine-Grained Updates for RDD's-(Ankur Dave, UC Berkeley)
IndexedRDD: Efficeint Fine-Grained Updates for RDD's-(Ankur Dave, UC Berkeley)
 
Scalable and Order-robust Continual Learning with Additive Parameter Decompos...
Scalable and Order-robust Continual Learning with Additive Parameter Decompos...Scalable and Order-robust Continual Learning with Additive Parameter Decompos...
Scalable and Order-robust Continual Learning with Additive Parameter Decompos...
 
Data decomposition techniques
Data decomposition techniquesData decomposition techniques
Data decomposition techniques
 
U-SQL Query Execution and Performance Basics (SQLBits 2016)
U-SQL Query Execution and Performance Basics (SQLBits 2016)U-SQL Query Execution and Performance Basics (SQLBits 2016)
U-SQL Query Execution and Performance Basics (SQLBits 2016)
 
226 team project-report-manjula kollipara
226 team project-report-manjula kollipara226 team project-report-manjula kollipara
226 team project-report-manjula kollipara
 
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingBERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
 
ICPE 2022 - Data Challenge
ICPE 2022 - Data ChallengeICPE 2022 - Data Challenge
ICPE 2022 - Data Challenge
 
ICSE2013
ICSE2013ICSE2013
ICSE2013
 
Energy efficient resource management for high-performance clusters
Energy efficient resource management for high-performance clustersEnergy efficient resource management for high-performance clusters
Energy efficient resource management for high-performance clusters
 
Hadoop & MapReduce
Hadoop & MapReduceHadoop & MapReduce
Hadoop & MapReduce
 
Android Bootstrap
Android BootstrapAndroid Bootstrap
Android Bootstrap
 
International Refereed Journal of Engineering and Science (IRJES)
International Refereed Journal of Engineering and Science (IRJES)International Refereed Journal of Engineering and Science (IRJES)
International Refereed Journal of Engineering and Science (IRJES)
 
Generalized Pipeline Parallelism for DNN Training
Generalized Pipeline Parallelism for DNN TrainingGeneralized Pipeline Parallelism for DNN Training
Generalized Pipeline Parallelism for DNN Training
 
2021 06-02-tabnet
2021 06-02-tabnet2021 06-02-tabnet
2021 06-02-tabnet
 

More from Xiao Qin

How to apply for internship positions?
How to apply for internship positions?How to apply for internship positions?
How to apply for internship positions?Xiao Qin
 
How to write research papers? Version 5.0
How to write research papers? Version 5.0How to write research papers? Version 5.0
How to write research papers? Version 5.0Xiao Qin
 
Making a competitive nsf career proposal: Part 2 Worksheet
Making a competitive nsf career proposal: Part 2 WorksheetMaking a competitive nsf career proposal: Part 2 Worksheet
Making a competitive nsf career proposal: Part 2 WorksheetXiao Qin
 
Making a competitive nsf career proposal: Part 1 Tips
Making a competitive nsf career proposal: Part 1 TipsMaking a competitive nsf career proposal: Part 1 Tips
Making a competitive nsf career proposal: Part 1 TipsXiao Qin
 
Auburn csse faculty orientation
Auburn csse faculty orientationAuburn csse faculty orientation
Auburn csse faculty orientationXiao Qin
 
Auburn CSSE graduate student orientation
Auburn CSSE graduate student orientationAuburn CSSE graduate student orientation
Auburn CSSE graduate student orientationXiao Qin
 
CSSE Graduate Programs Committee: Progress Report
CSSE Graduate Programs Committee: Progress ReportCSSE Graduate Programs Committee: Progress Report
CSSE Graduate Programs Committee: Progress ReportXiao Qin
 
P#1 stream of praise
P#1 stream of praiseP#1 stream of praise
P#1 stream of praiseXiao Qin
 
Reliability Modeling and Analysis of Energy-Efficient Storage Systems
Reliability Modeling and Analysis of Energy-Efficient Storage SystemsReliability Modeling and Analysis of Energy-Efficient Storage Systems
Reliability Modeling and Analysis of Energy-Efficient Storage SystemsXiao Qin
 

More from Xiao Qin (9)

How to apply for internship positions?
How to apply for internship positions?How to apply for internship positions?
How to apply for internship positions?
 
How to write research papers? Version 5.0
How to write research papers? Version 5.0How to write research papers? Version 5.0
How to write research papers? Version 5.0
 
Making a competitive nsf career proposal: Part 2 Worksheet
Making a competitive nsf career proposal: Part 2 WorksheetMaking a competitive nsf career proposal: Part 2 Worksheet
Making a competitive nsf career proposal: Part 2 Worksheet
 
Making a competitive nsf career proposal: Part 1 Tips
Making a competitive nsf career proposal: Part 1 TipsMaking a competitive nsf career proposal: Part 1 Tips
Making a competitive nsf career proposal: Part 1 Tips
 
Auburn csse faculty orientation
Auburn csse faculty orientationAuburn csse faculty orientation
Auburn csse faculty orientation
 
Auburn CSSE graduate student orientation
Auburn CSSE graduate student orientationAuburn CSSE graduate student orientation
Auburn CSSE graduate student orientation
 
CSSE Graduate Programs Committee: Progress Report
CSSE Graduate Programs Committee: Progress ReportCSSE Graduate Programs Committee: Progress Report
CSSE Graduate Programs Committee: Progress Report
 
P#1 stream of praise
P#1 stream of praiseP#1 stream of praise
P#1 stream of praise
 
Reliability Modeling and Analysis of Energy-Efficient Storage Systems
Reliability Modeling and Analysis of Energy-Efficient Storage SystemsReliability Modeling and Analysis of Energy-Efficient Storage Systems
Reliability Modeling and Analysis of Energy-Efficient Storage Systems
 

Recently uploaded

Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 

Recently uploaded (20)

Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 

Energy Efficient Data Storage Systems

  • 1. Energy Efficient Data Storage Systems Xiao Qin Department of Computer Science and Software Engineering Auburn University http://www.eng.auburn.edu/~xqin xqin@auburn.edu
  • 2. Investigators Ziliang Zong Adam Manzanares Xiaojun Ruan Shu Yin 2
  • 3. Data-Intensive Applications Stream Multimedia Bioinformatic 3D Graphic Weather Forecast 3
  • 4. Cluster Computing in Data Centers Data Centers 4
  • 5. Computing and Storage Nodes in a Cluster Storage Node Head (or Storage Area Network) Internet Node Client Network switch Computing Nodes
  • 6. Clusters in Our Lab at Auburn 6
  • 7. Energy Consumption was Growing EPA Report to Congress on Server and Data Center Energy Efficiency, 2007 7
  • 8. 2020 Projections Data Center: increases by 200% Clients: number – increases by 800% Power – increases by 300% Network: Increases by 300%
  • 9. Energy Efficiency of Data Centers Data Centers consume 110 Billion kWh per Year Average cost: ¢9.46 per kWh Storage 37% Other, 6 Dell’s Texas Data Center 3% Storage system may cost 2.8 Billion Dollars!
  • 11. Energy Conservation Techniques
  • 13. Multiple Design Goals Performance Energy Efficiency High- Performance Computing Platforms Reliability Security
  • 14. DVS – Dynamic Voltage Scaling Trade performance for energy efficiency 14
  • 15. Energy-Aware Scheduling for Clusters Z.-L. Zong, X.-J. Ruan, A. Manzanares, and X. Qin, “EAD and PEBD: Two Energy- Aware Duplication Scheduling Algorithms for Parallel Tasks on Homogeneous Clusters,” IEEE Transactions on Computers, vol. 60, no. 3, pp. 360- 374, March 2011.
  • 16. Parallel Applications Entry Task 3 1 3 3 3 2 3 2 4 3 4 3 3 2 4 1 7 20 5 10 6 20 1 10 10 8 9 5 7 7 5 Exit Task 10 8
  • 17. Energy-Aware Scheduling: Motivational Example
  • 19. The EAD and PEBD Algorithms Generate the DAG of given task sets Calculate energy increase Calculate energy increase and time decrease Find all the critical paths in DAG Ratio= energy increase/ time decrease Generate scheduling queue based on the level (ascending) more_energy<=Threshold? No No select the task (has not been scheduled yet) with the lowest level as Yes Ratio<=Threshold? starting task meet entry task Duplicate this task and select Yes the next task in the same critical path Duplicate this task and select For each task which is in the the next task in the same same critical path with starting task, check critical path if it is already scheduled No allocate it to the same No Yes processor with the tasks in the same critical path Save time if duplicate this task? Yes PEBD EAD 19
  • 20. Energy Dissipation in Processors http://www.xbitlabs.com 20
  • 21. Parallel Scientific Applications T1 T1 T2 T3 T4 T5 T6 T2 T3 T7 T8 T9 T10 T11 T4 T5 T6 T7 T12 T8 T9 T10 T11 T13 T14 T15 T16 T12 T13 T14 T15 T17 T18 Fast Fourier Transform Gaussian Elimination 21
  • 22. Large-Scale Parallel Applications Robot Control Sparse Matrix Solver 2013-2-13 http://www.kasahara.elec.waseda.ac. 22 jp/schedule/
  • 23. Impact of CPU Power Dissipation 19.4% 3.7% Total Energy Consumption Total Energy Consumption Athlon 4600+ Athlon 4600+ 40000 85W 40000 85W 35000 35000 30000 30000 Athlon 4600+ Athlon 4600+ 65W 65W Energy (Joul) Energy (Joul) 25000 25000 20000 20000 15000 Athlon 3800+ 15000 Athlon 3800+ 35W 35W 10000 10000 5000 5000 0 Intel Core2 0 Intel Core2 Duo E6300 Duo E6300 EAD CPU Type PEBD TDS Power (busy) MCP Power (idle) EAD PEBD GapTDS MCP 104w 15w 89w Energy consumption for different Energy consumption for different 75w 14w 61w processors (Gaussian, CCR=0.4) processors (FFT, CCR=0.4) 47w 11w 36w 44w 26w 18w Observation: CPUs with large gap between CPU_busy and 23 CPU_idle can obtain greater energy savings
  • 24. Performance Schedule Length Schedule Length 160 200 140 TDS 180 TDS 120 160 140 Time Unit (S) 100 EAD Time Unit (S) 120 EAD 80 100 60 PEBD 80 60 PEBD 40 20 MCP 40 20 0 MCP 0 0.1 0.5 1 5 10 0.1 0.5 1 5 10 Schedule length of Gaussian Elimination Schedule length of Sparse Matrix Solver Application EAD Performance PEBD Performance Degradation (: TDS) Degradation (: TDS) Gaussian Elimination 5.7% 2.2% Sparse Matrix Solver 2.92% 2.02% Observation: it is worth trading a marginal degradation in schedule 24 length for a significant energy savings for cluster systems.
  • 25. Energy Consumption of Disks 2/13/2013
  • 26. Power States of Disks Active State: high energy consumption Active Standby State transition penalty Standby State: low energy consumption 26
  • 27. A Hard Disk Drive A10000RPM Hard Drive may take 10.9 seconds to wake up! 27
  • 28. Parallel Disks Performance Energy Efficiency
  • 29. Put It All Together: Buffer Disk Architecture Energy-Related Reliability Model Prefetching Data Partitioning Security Model Disk Requests RAM Buffer Buffer Disk Controller Load Balancing Power Management m buffer disks n data disks
  • 30. IBM Ultrastar 36Z15 Transfer Rate 55 MB/s Spin Down Time: TD 1.5 s Active Power: PA 13.5 W Spin Up Time: TU 10.9 s Idle Power: PI 10.2 W Spin Down Energy: ED 13 J Standby Power: PA 2.5 W Spin Up Energy: EU 135 J Break-Even Time: TBE 15.2 S
  • 31. Prefetching Buffer Disk Disk 1 Disk 2 Disk 3
  • 32. Energy Saving Principles  Energy Saving Principle One ◦ Increase the length and number of idle periods larger than the disk break-even time TBE  Energy Saving Principle Two ◦ Reduce the number of power-state transitions A. Manzanares, X. Qin, X.-J. Ruan, and S. Yin, “PRE-BUD: Prefetching for Energy- Efficient Parallel I/O Systems with Buffer Disks,” ACM Transactions on Storage, vol. 7, no. 1, Article 3 June 2011.
  • 33. Energy Savings Hit Rate 85% 33 2/13/2013
  • 35. Heat-Based Dynamic Data Caching buffer buffer buffer disk disk disk Requests Queue 35
  • 36. Heat-Based Dynamic Data Caching Requests Queue buffer buffer buffer disk disk disk 36
  • 37. Energy Consumption Results Large Reads: average 84.4% improvement (64MB) Small Reads: average 78.77% improvement (64KB) Energy consumption for large reads 2/13/2013 Energy consumption for small reads 37
  • 38. Load Balancing Comparison Load balancing comparison for three mapping strategies 2/13/2013 38
  • 42. Improving Performance of EEVFS Parallel Striping Groups File 1 Group 1 File 3 File 2 Group 2 File 4 Buffer Buffer Disk 1 Disk 2 Disk 5 Disk 6 Disk Disk Storage Node 1 Storage Node 3 Buffer Buffer Disk 3 Disk 4 Disk 7 Disk 8 Disk Disk Storage Node 2 Storage Node 4
  • 43. Striping Within a Group Buffer Disk 2 1 Disk 1 3 5 7 9 Disk 2 4 6 8 10 Storage Node 1 Buffer Disk 2 1 Disk 3 3 5 7 9 Disk 4 4 6 8 10 Storage Node 2 1 1 File 1 Group 1 File 2 2
  • 44. Measured Results 2/13/2013
  • 45. A Parallel Disk System with a Write Buffer Disk
  • 46. Under High Workload Conditions Data Disks can serve requests without buffer disks when workload is high
  • 47. Wakeup Data Disks Requests Queue Buffer Disk 47
  • 49. Energy Conservation Techniques Software-Directed Power Management Dynamic Power Management Redundancy Technique Multi- speed Setting How Reliable Are They? 49
  • 50. Tradeoff between Energy Efficiency and Reliability Example: Disk Spin Up and Down 50
  • 51. MINT (MATHEMATICAL RELIABILITY MODELS FOR ENERGY-EFFICIENT PARALLEL DISK SYSTEMS) Energy Conservation Techniques Single Disk Reliability Model System-Level Reliability Model S. Yin et al. “Reliability Analysis for an Energy-Aware RAID System,” Proc. the 30th IEEE International Performance Computing and Communications Conference (IPCCC), Nov. 2011.
  • 52. MINT (Single Disk) Disk Age Temperature Frequency Utilization Single Disk Reliability Model Reliability of Single Disk 52
  • 53. MINT (MATHEMATICAL RELIABILITY MODELS FOR ENERGY-EFFICIENT PARALLEL DISK SYSTEMS) Access Pattern Energy Conservation Techniques Single Disk Reliability Model System Level Reliability Model Reliability of A Parallel Disk System
  • 54. Preliminary Result Comparison Between PDC and MAID AFR Comparison of PDC and MAID Access Rate(*104) Impacts on AFR (T=35°C) 54
  • 55. Summary • Energy-Aware Scheduling • BUD - Buffer Disk Architecture • Energy-Efficient File Systems • Reliability Models for Energy-Efficient Storage Systems
  • 56. Download the presentation slides http://www.slideshare.net/xqin74 Google: slideshare Xiao Qin

Editor's Notes

  1. Dell * 8Intel XEON X34302GB DDR3(PC-10600)Ubuntu 9.04 32-bitGigabit LANHP * 12Intel Core 2 Q94002GB DDR3(PC-10600)Ubuntu 9.04 32-bitGigabit LAN
  2. This slide shows a typical high-performance computing platform, which was built by Google in the Oregon state. There is no doubt that they have significantly changed our lives and we all benefit from the great services provided by these super computing platforms . However, these giant machines consume a huge amount of energy.
  3. Architecture – Multiple LayersIn our architecture, we have four layers: application layer, middleware layer, resource layer, network layer. In each layer, we can incorporate energy-aware techniques. For example, in the application layer, we can reduce the unnecessary access to hardware when writing the code. In the middleware layer, we can schedule parallel tasks in more energy-efficient ways. In the resource and network layers, we can do energy-aware resource management.
  4. This slide shows some typical hardware in the resource and network layers like CPU, main board, storage disk, network adapter, switch and router.
  5. One thing I would like to emphasize here is that any energy-oriented research should not scarify other important characters like performance, reliability or security. Although there must be some tradeoff once we introduce energy-aware techniques, we do not want to see significant degradation in other characters. In other words, we would like to make our research compatible with existing techniques. For my research, I mainly focus on the tradeoff between performance and energy.
  6. Before we talk about the algorithms, let’s see the cluster systems first. In a cluster, we have the master node and slave nodes. The master node is responsible to schedule tasks and allocate them to slave nodes for parallel execution. All slave nodes are connected by high speed interconnections and they communicate with each other through message passing.
  7. The parallel tasks running on clusters are represented using Directed Acyclic Graph , or DAG for short. Usually, a dag has one entry task and one or multiple exit tasks. Dag shows the task number and the execution time of each task. It also shows the dependence and communication time among tasks. Explain a little bit…
  8. Weakness 1: Do not consider energy conservation in memoryWeakness 2: Energy can’t be conserved even then network interconnects are idleIn order to improve performance, we use duplication strategy. This slide shows why duplication can improve performance. Here we have 4 tasks represented by the DAG in the left side. If we use linear scheduling, all four tasks will be allocated in 1 CPU and the execution time will be 39s. However, we noticed that we can schedule task 2 to the 2nd CPU so that we do not need to wait the completion of task 3. In that way, the total time will shortened to 32s. We also noticed that 6s are wasted in the 2nd CPU because task 2 has to wait the message from task 1. If duplicate task 1 in the 2nd CPU, we can further shorted the schedule length to 29s. Obviously, the duplication could improve performance.
  9. However, if we calculate the energy, we will find that duplication may consume more power. For example, if we set the energy consumption for CPU and network 6w and 1w, the total energy consumption of duplication will be 42J more than NDS and 50J more than linear schedule. That is mainly because task 1 are executed twice. Here I would like to mention that I will use NDS(MCP) to represent no duplication schedule and use TDS to represent task duplication schedule. You will see a lot of them in the simulation results.
  10. This diagram summarize the steps we just talked about. I will just skip it.
  11. Now we are going to discuss the simulation results. We implement our own simulator using C language under Linux system. The CPU power consumption parameters come from the xbitlabs. We simulate 4 different CPUs, 3 of them are AMD and one is Intel.
  12. This slide shows the structure of two small task set. The left one is Fast Fourier Transform and the right one is Gaussian Elimination.
  13. The slide shows the DAG structure of two real-world applications. The left one is Robot Control and the right one is Sparse Matrix Solver.
  14. This slide shows the impact of CPU types. Recall that I simulate 4 different CPUs, which are represented in 4 different colors. We found that the CPU with blue color can save more energy compares with other 3 CPUs. For example, we can save 19.4% energy using blue CPU while we only can save 3.7% for the purple CPU. The indication behind is that these 4 CPUs have different gaps between CPU_busy and CPU_idle. This table summarize the difference. The gap for the blue CPU is 89w but the gap for the purple CPU is only 18w. So our observation is…
  15. This group of simulation results show the impact to performance. The left one is for Gaussian and the right one is for Sparse. This table summarize that the overall performance degradation of EAD and PEBD is 5.7% and 2.2% compared with TDS for Gaussian. For Sparse, the number is 2.92% and 2.02%. Our observation is …
  16. 16%
  17. 12 disk
  18. In the traditional disk arrays, requests are served directly by data disks. To maximize the performance, the traditional disk arrays tend to keep the disks in active state, even they do not produce any useful work. This will waste a lot of energy. Actually, some disks has very low utilization rate in real world traces, probably less than 20% or even 10%, we can save a lot of energy if we turn them into sleep state. Please note that the golden disks are active disks and gray disks are sleeping disks. However, the utilization of some disks might be extremely high because they stored very hot or popular blocks. It is almost impossible to make them sleep. Our idea is to add one more layer, called the buffer disk layer, on the top of the data disk layer. The requests will be served by buffer disks first rather than by data disks directly. We can cache all those hot blocks to the buffer disks and sleep as many data disks as possible.
  19. Cont..
  20. motivation.Results: x-axis, y-axix, red bars, green barsObservationindication
  21. SRB is small, transfer data from buffer disk to data disk for too many times. Small SRB, more spin up and spin down times.