Energy Efficient Data Storage Systems

Energy Efficient Data
Storage Systems

Xiao Qin
Department of Computer Science
and Software Engineering
Auburn University
http://www.eng.auburn.edu/~xqin
xqin@auburn.edu

Investigators

Ziliang Zong Adam Manzanares

Xiaojun Ruan Shu Yin
2

Data-Intensive Applications

Stream Multimedia Bioinformatic

3D Graphic Weather Forecast

3

Cluster Computing in
Data Centers

Data Centers

4

Computing and Storage
Nodes in a Cluster

Storage Node
Head (or Storage Area Network)
Internet

Node
Client

Network switch

Computing Nodes

Clusters in Our Lab at Auburn

6

Energy Consumption was
Growing

EPA Report to Congress on Server and Data Center Energy Efficiency, 2007

7

2020 Projections

Data Center:
increases by 200%

Clients:
number – increases by 800%
Power – increases by 300%

Network:
Increases by 300%

Energy Efficiency of Data
Centers
Data Centers consume 110 Billion kWh per Year
Average cost: ¢9.46 per kWh

Storage
37%
Other, 6 Dell’s Texas Data Center
3%

Storage system may
cost 2.8 Billion Dollars!

Build Energy-Efficient Data
Centers

Energy Conservation
Techniques

Multiple Design Goals

Performance Energy Efficiency

High-
Performance
Computing
Platforms

Reliability Security

DVS – Dynamic Voltage Scaling

Trade performance for
energy efficiency
14

Energy-Aware Scheduling for Clusters

Z.-L. Zong, X.-J. Ruan, A. Manzanares, and X. Qin, “EAD and PEBD: Two Energy-
Aware Duplication Scheduling Algorithms for Parallel Tasks on Homogeneous
Clusters,” IEEE Transactions on Computers, vol. 60, no. 3, pp. 360- 374, March 2011.

Parallel Applications
Entry
Task

3 1
3 3
3
2
3
2 4 3 4

3
3 2
4

1 7 20
5 10 6

20
1 10
10

8 9 5
7

7 5

Exit Task
10

8

Energy-Aware Scheduling:
Motivational Example

The EAD and PEBD Algorithms
Generate the DAG of given task sets
Calculate energy increase Calculate energy increase
and time decrease
Find all the critical paths in DAG

Ratio= energy increase/ time
decrease Generate scheduling queue based on the level (ascending) more_energy<=Threshold?
No

No select the task (has not been scheduled yet) with the lowest level as Yes
Ratio<=Threshold?
starting task

meet entry task
Duplicate this task and select
Yes the next task in the same
critical path

Duplicate this task and select For each task which is in the
the next task in the same same critical path with starting task, check
critical path if it is already scheduled No

allocate it to the same
No Yes processor with the tasks in the
same critical path

Save time if duplicate
this task?

Yes

PEBD EAD
19

Energy Dissipation in Processors

http://www.xbitlabs.com
20

Parallel Scientific Applications
T1
T1

T2 T3 T4 T5 T6

T2 T3

T7

T8 T9 T10 T11
T4 T5 T6 T7

T12

T8 T9 T10 T11 T13 T14 T15

T16

T12 T13 T14 T15 T17 T18

Fast Fourier Transform Gaussian Elimination
21

Large-Scale Parallel Applications

Robot Control Sparse Matrix Solver

2013-2-13 http://www.kasahara.elec.waseda.ac. 22

jp/schedule/

Impact of CPU Power Dissipation
19.4% 3.7%
Total Energy Consumption Total Energy Consumption
Athlon 4600+ Athlon 4600+
40000 85W 40000 85W

35000 35000

30000 30000
Athlon 4600+ Athlon 4600+
65W 65W
Energy (Joul)

Energy (Joul)
25000 25000

20000 20000

15000 Athlon 3800+ 15000 Athlon 3800+
35W 35W
10000 10000

5000 5000

0 Intel Core2 0 Intel Core2
Duo E6300 Duo E6300
EAD CPU Type
PEBD TDS Power (busy)
MCP Power (idle)
EAD PEBD GapTDS MCP

104w 15w 89w
Energy consumption for different Energy consumption for different
75w 14w 61w
processors (Gaussian, CCR=0.4) processors (FFT, CCR=0.4)
47w 11w 36w

44w 26w 18w

Observation: CPUs with large gap between CPU_busy and 23
CPU_idle can obtain greater energy savings

Performance

Schedule Length Schedule Length
160 200
140 TDS 180 TDS
120 160
140
Time Unit (S)

100 EAD

Time Unit (S)
120 EAD
80
100
60 PEBD 80
60 PEBD
40
20 MCP 40
20
0 MCP
0
0.1 0.5 1 5 10
0.1 0.5 1 5 10

Schedule length of Gaussian Elimination Schedule length of Sparse Matrix Solver

Application EAD Performance PEBD Performance
Degradation (: TDS) Degradation (: TDS)
Gaussian Elimination 5.7% 2.2%

Sparse Matrix Solver 2.92% 2.02%

Observation: it is worth trading a marginal degradation in schedule 24
length for a significant energy savings for cluster systems.

Energy Consumption of Disks

2/13/2013

Power States of Disks
Active State: high
energy consumption

Active Standby
State transition
penalty
Standby State: low
energy consumption

26

A Hard Disk Drive

A10000RPM Hard Drive may
take 10.9 seconds to wake up!
27

Parallel Disks

Performance Energy Efficiency

Put It All Together:
Buffer Disk Architecture
Energy-Related Reliability Model Prefetching Data Partitioning

Security Model
Disk Requests RAM Buffer Buffer Disk Controller
Load Balancing

Power Management

m buffer disks n data disks

IBM Ultrastar 36Z15
Transfer Rate 55 MB/s Spin Down Time: TD 1.5 s

Active Power: PA 13.5 W Spin Up Time: TU 10.9 s

Idle Power: PI 10.2 W Spin Down Energy: ED 13 J

Standby Power: PA 2.5 W Spin Up Energy: EU 135 J

Break-Even Time: TBE 15.2 S

Prefetching
Buffer Disk

Disk 1

Disk 2

Disk 3

Energy Saving Principles
 Energy Saving Principle One
◦ Increase the length and number of idle
periods larger than the disk break-even
time TBE

 Energy Saving Principle Two
◦ Reduce the number of power-state
transitions

A. Manzanares, X. Qin, X.-J. Ruan, and S. Yin, “PRE-BUD: Prefetching for Energy-
Efficient Parallel I/O Systems with Buffer Disks,” ACM Transactions on Storage, vol.
7, no. 1, Article 3 June 2011.

Energy Savings Hit Rate 85%

33
2/13/2013

Heat-Based Dynamic Data Caching

buffer buffer buffer
disk disk disk

Requests Queue

35

Heat-Based Dynamic Data Caching
Requests Queue

buffer buffer buffer
disk disk disk

36

Energy Consumption Results
Large Reads: average 84.4%
improvement (64MB)

Small Reads: average 78.77%
improvement (64KB)
Energy consumption for large reads

2/13/2013
Energy consumption for small reads
37

Load Balancing Comparison

Load balancing comparison for three mapping strategies
2/13/2013 38

Energy Efficient Virtual File
System

Improving Performance of EEVFS
Parallel Striping Groups

File 1 Group 1 File 3 File 2 Group 2 File 4

Buffer Buffer
Disk 1 Disk 2 Disk 5 Disk 6
Disk Disk
Storage Node 1 Storage Node 3

Buffer Buffer
Disk 3 Disk 4 Disk 7 Disk 8
Disk Disk
Storage Node 2 Storage Node 4

Striping Within a Group

Buffer Disk 2
1 Disk 1 3 5 7 9 Disk 2 4 6 8 10

Storage Node 1

Buffer Disk 2
1 Disk 3 3 5 7 9 Disk 4 4 6 8 10

Storage Node 2

1 1
File 1 Group 1 File 2
2

Measured Results

2/13/2013

A Parallel Disk System with a
Write Buffer Disk

Under High Workload
Conditions

Data Disks can serve requests
without buffer disks when
workload is high

Wakeup Data Disks
Requests Queue

Buffer Disk

47

Energy Savings
Low Workload, UltraStar

Energy Conservation Techniques

Software-Directed Power Management
Dynamic Power Management
Redundancy Technique
Multi- speed Setting

How Reliable Are They?

49

Tradeoff between Energy
Efficiency and Reliability

Example: Disk Spin Up and Down

50

MINT
(MATHEMATICAL RELIABILITY MODELS FOR ENERGY-EFFICIENT PARALLEL DISK SYSTEMS)

Energy Conservation
Techniques

Single Disk Reliability Model

System-Level Reliability Model

S. Yin et al. “Reliability Analysis for an Energy-Aware RAID System,”
Proc. the 30th IEEE International Performance Computing and
Communications Conference (IPCCC), Nov. 2011.

MINT (Single Disk)

Disk Age Temperature

Frequency Utilization


Reliability of Single
Disk

52

MINT
(MATHEMATICAL RELIABILITY MODELS FOR ENERGY-EFFICIENT PARALLEL DISK SYSTEMS)

Access Pattern

Energy Conservation Techniques


System Level Reliability Model

Reliability of A Parallel
Disk System

Preliminary Result
Comparison Between PDC and MAID

AFR Comparison of PDC and MAID
Access Rate(*104) Impacts on AFR (T=35°C)

54

Summary
• Energy-Aware Scheduling

• BUD - Buffer Disk Architecture

• Energy-Efficient File Systems

• Reliability Models for Energy-Efficient
Storage Systems

Download the presentation slides
http://www.slideshare.net/xqin74

Google: slideshare Xiao Qin

Energy Efficient Data Storage Systems

More Related Content

Viewers also liked

Similar to Energy Efficient Data Storage Systems

More from Xiao Qin

Recently uploaded

Energy Efficient Data Storage Systems

Editor's Notes