Statistical Power Consumption
Analysis and Modeling for GPU
Based Computing
By
Xiaohan Ma, Mian Dong, Lin Zhong
and Zhigang Deng
9/25/2013 1
Content
1. GPU
2. Statistical Power Consumption Analyzing
3. Statistical GPU Power Model
4. Evaluation and Validation
5. Discussion
6. References
1. GPU
• Graphics Processing Unit
• Accelerates graphics, scientific, and engineering applications (example: 3D gaming)
Fig 1: NVIDIA GeForce 8800 GT
Why?
• More integrated transistors
• Rising power consumption
• Heat dissipation, complex cooling solutions, noisier fans
• Challenge of developing energy-efficient code
• Analyzing and modeling the power consumption of GPUs at runtime
2. Statistical Power Consumption
Analyzing
• High-level methodology for modeling GPU power consumption
• First work that applies statistical analysis to model the power consumption of a GPU
• Exploits the coupling among power consumption characteristics, runtime performance, and dynamic workloads
2.1 How?
• Record power consumption, runtime workload signals, and performance data
• Build a statistical regression model
o Able to estimate the power consumption of the GPU dynamically
o Bridges the dynamic workload of runtime GPUs and their estimated power consumption
• Uses an NVIDIA GeForce 8800 GT graphics card
2.2 Data Acquisition
• Power consumption data
• GPU Workload Signal Recording
2.2.1 Power Consumption Data
Acquisition
• Test computer: runs programs designed to test the GPU
o NVIDIA GeForce 8800 GT graphics card with a 200 W power specification
o AMD Athlon 64 X2 3.0 GHz dual-core processor
o 2 GB memory
o Corsair TX 750W power supply
• Host computer: specialized data recording software and a power acquisition system (FLUKE 2680A)
2.2.2 GPU Workload Signal Processing
• Recorded simultaneously using the NVIDIA PerfKit performance analysis tool
- Capable of dynamically extracting 39 GPU workload variables
• Choose 5 major variables
- Represent the runtime utilization of the major pipeline stages on the GPU
• Record GPU workload signals
• Resample GPU workload signals
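The resampling step can be sketched as follows. The slides do not say which interpolation was used, so linear interpolation onto the power-sample timestamps is an assumption here, and the function name `resample_linear` and all sample values are hypothetical.

```python
from bisect import bisect_right

def resample_linear(times, values, target_times):
    """Linearly interpolate the signal (times, values) at each target time.

    `times` must be strictly increasing. Targets outside the recorded
    range are clamped to the first or last recorded value.
    """
    if len(times) != len(values) or not times:
        raise ValueError("times and values must be non-empty and equal length")
    out = []
    for t in target_times:
        if t <= times[0]:
            out.append(values[0])
        elif t >= times[-1]:
            out.append(values[-1])
        else:
            i = bisect_right(times, t)  # times[i - 1] <= t < times[i]
            t0, t1 = times[i - 1], times[i]
            v0, v1 = values[i - 1], values[i]
            out.append(v0 + (v1 - v0) * (t - t0) / (t1 - t0))
    return out

# Hypothetical data: a workload counter sampled irregularly (seconds, percent)
# and power samples taken once per second.
workload_t = [0.0, 0.4, 1.1, 2.3, 3.0]
pixel_shader_busy = [10.0, 30.0, 65.0, 41.0, 20.0]
power_t = [0.0, 1.0, 2.0, 3.0]

# One workload value per power sample, so the two series share a time index.
aligned = resample_linear(workload_t, pixel_shader_busy, power_t)
```

After this step every workload variable has exactly one value per power measurement, which is what a regression over paired samples needs.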
Fig 2: Recorded and Corresponding Resampled Data
Five major variables
1. vertex_shader_busy (the percentage of time when the vertex shader is busy)
2. pixel_shader_busy (the percentage of time when the pixel shader is busy)
3. texture_busy (the percentage of time when the texture unit is busy)
4. geom_busy (the percentage of time when the geometry shader is busy)
5. rop_busy (the percentage of time when the ROP unit is active)
3. Statistical GPU Power Model
• Assume:
– Processed power consumption data is $Y = \{Y_{t_1}, Y_{t_2}, \ldots, Y_{t_n}\}$ ($t_i$ denotes the time index)
– Aligned GPU workload data is $X^j = \{X^j_{t_1}, X^j_{t_2}, \ldots, X^j_{t_n}\}$ ($1 \le j \le N$; $X^j$ represents the $j$th GPU workload variable)
Constructed a statistical multivariable function (model)
$Y_t = F(X^1_t, X^2_t, \ldots, X^N_t)$
that can robustly and accurately predict the GPU power consumption $Y_t$, given any GPU workload variables $(X^1_t, X^2_t, \ldots, X^N_t)$.
3.1 Methodology
• Used the 5 major GPU workload variables as model inputs.
• Split the data set into a training subset and a cross-validation subset (test data).
• Used the training subset to learn a Support Vector Regression (SVR) model using LIBSVM.
• Compared the cross-validation results of the chosen SVR model with a simple least-squares-based linear regression (SLR) model.
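The baseline side of this comparison can be sketched in a few lines. This is a minimal illustration only: it fits the least-squares linear model and scores it by sum squared error on a held-out subset. It does not reproduce the SVR step done with LIBSVM. The data is synthetic, and the 75/25 split, the coefficient values, and all function names are assumptions made for the example.

```python
import random

def fit_least_squares(X, y):
    """Fit y ~ b0 + b1*x1 + ... by solving the normal equations (A^T A) b = A^T y."""
    A = [[1.0] + list(row) for row in X]  # prepend an intercept column
    k = len(A[0])
    # Augmented matrix [A^T A | A^T y].
    M = [[sum(r[i] * r[j] for r in A) for j in range(k)]
         + [sum(r[i] * yi for r, yi in zip(A, y))] for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(k):
        p = max(range(c, k), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(k):
            if r != c:
                f = M[r][c] / M[c][c]
                M[r] = [a - f * b for a, b in zip(M[r], M[c])]
    return [M[i][k] / M[i][i] for i in range(k)]

def predict(coef, X):
    """Evaluate the linear model (intercept first) on each row of X."""
    return [coef[0] + sum(c * x for c, x in zip(coef[1:], row)) for row in X]

def sum_squared_error(y_true, y_pred):
    return sum((a - b) ** 2 for a, b in zip(y_true, y_pred))

# Synthetic stand-in data: 5 utilization percentages mapped to power in watts.
rng = random.Random(0)
true_coef = [40.0, 0.20, 0.35, 0.15, 0.10, 0.25]  # made-up values
X = [[rng.uniform(0, 100) for _ in range(5)] for _ in range(200)]
y = [v + rng.gauss(0, 1.0) for v in predict(true_coef, X)]

# Hold out the last 25% of the samples as the cross-validation subset.
split = int(0.75 * len(X))
coef = fit_least_squares(X[:split], y[:split])
sse_test = sum_squared_error(y[split:], predict(coef, X[split:]))
```

An SVR model trained on the same training subset would be scored with the same `sum_squared_error` call on the same held-out rows, which keeps the two numbers directly comparable.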
Fig 3: Cross Validation Comparison Result (panels: graphics program; GPGPU Jorik benchmark)
Table 1: Sum Square Error Comparison Between SLR and SVR for Cross Validation Data

Model | OpenGL Geometry Benchmark 1.0 (Graphic Program) | GPGPU Jorik Benchmark
SLR | 656.83 | 44.523
SVR | 589.73 | 39.427
It's clear:
Regardless of whether graphics computing or GPGPU applications are used, the chosen SVR model performed measurably better than traditional SLR on the cross-validation (test) data set.
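To put a number on "measurably better", the Table 1 values can be turned into a relative error reduction. The figures below are copied from Table 1; the helper name `relative_reduction` is introduced only for this illustration.

```python
# Sum squared errors on the cross-validation data, copied from Table 1.
sse = {
    "OpenGL Geometry Benchmark 1.0": {"SLR": 656.83, "SVR": 589.73},
    "GPGPU Jorik Benchmark": {"SLR": 44.523, "SVR": 39.427},
}

def relative_reduction(baseline, improved):
    """Fractional drop in error when moving from baseline to improved."""
    return (baseline - improved) / baseline

reductions = {
    name: relative_reduction(v["SLR"], v["SVR"]) for name, v in sse.items()
}
for name, r in reductions.items():
    print(f"{name}: SVR lowers SSE by {r:.1%}")
```

This works out to roughly a 10.2% lower sum squared error on the graphics benchmark and 11.4% on the GPGPU benchmark.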
4. Evaluation and Validation
• How accurate and robust is the proposed statistical model when the GPU runs non-benchmark programs?
Eight test programs (4 graphics programs and 4 GPGPU computing applications) were selected for testing.
Graphics programs – Nexuiz, Xmas Tree, HDR, Dual Depth Peeling
Each program ran for 100 seconds.
GPGPU programs – GNN, N-body simulation, Option Pricing, Fast Walsh Transform
The N-body simulation ran for 20 seconds; the other three stopped automatically once they generated their outputs.
Table 2 : Summary of Power Prediction Errors as a percentage of mean GPU Power
Consumption
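One way to express a prediction error as a percentage of mean GPU power is sketched below. The slides do not state the exact error definition behind Table 2, so the use of mean absolute error is an assumption, and the function name and sample values are hypothetical.

```python
def error_percent_of_mean(measured, predicted):
    """Mean absolute prediction error as a percentage of mean measured power."""
    if len(measured) != len(predicted) or not measured:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_power = sum(measured) / len(measured)
    mae = sum(abs(m - p) for m, p in zip(measured, predicted)) / len(measured)
    return 100.0 * mae / mean_power

# Hypothetical power traces in watts (ground truth vs. model output).
measured = [100.0, 110.0, 90.0, 100.0]
predicted = [98.0, 113.0, 91.0, 104.0]
pct = error_percent_of_mean(measured, predicted)
```

Normalizing by mean power makes errors comparable across programs that draw very different amounts of power.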
4.1 Results
Fig 4 : Comparison between the ground truth (blue) and the predicted GPU
power consumption data (red) for the chosen four graphics programs
Fig 5 : Comparison between the ground truth (blue) and the predicted GPU
power consumption data (red) for the chosen four GPGPU programs
5. Discussion
1. This research studied the correlation between power consumption and performance of graphics applications using NVIDIA PerfKit.
2. NVIDIA PerfKit is designed to identify how conventional graphics applications use GPU components.
3. It cannot identify GPGPU-specific events such as global memory access, which is the largest factor in GPU power consumption.
4. The model depends entirely on the recorded workload signals of the runtime GPU, and these signals sometimes fail to indicate the power consumption of the underlying GPU.
5. It cannot accurately model power consumption peaks (due to other factors such as bus communication or memory access).
6. It is hard to predict in advance how much training data will be sufficient.
5.1 Related Work
• Statistical power modeling of GPU kernels using performance counters – 2010
• Quantifying the impact of GPUs on performance and energy efficiency in HPC clusters – 2010
• Performance and Power Analysis of ATI GPU: A Statistical Approach – 2011
• Tree Structured Analysis on GPU Power Study – 2011
6. References
• H. Nagasaka, N. Maruyama, A. Nukada, S. Matsuoka, T. Endo, Statistical Power Modeling of GPU Kernels Using Performance Counters, In Proc. of International Conference on Green Computing, p. 115-122, 2010
• J. Chen, B. Li, Y. Shang, L. Peng, J. Pier, Tree Structured Power Analysis on GPU Power Study, In Proc. of 29th International Conference on Computer Design, p. 57-64, 2011
• Y. Zhang, Y. Hu, B. Li, L. Pen, Performance and Power Analysis of ATI GPU: Statistical Approach, In Proc. of 6th IEEE International Conference on NAS, p. 149-158, 2011
• R. Suda, D.Q. Ren, Accurate Measurements and Precise Modeling of Power Dissipation of CUDA Kernels toward Power Optimized High Performance CPU-GPU Computing, In Proc. of International Conference on Parallel and Distributed Computing, Application and Development, p. 432-438, 2009
Thank You

Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 

Statistical power consumption analysis and modeling

  • 1. Statistical Power Consumption Analysis and Modeling for GPU-Based Computing, by Xiaohan Ma, Mian Dong, Lin Zhong and Zhigang Deng. 9/25/2013
  • 2. Content: 1. GPU; 2. Statistical Power Consumption Analysis; 3. Statistical GPU Power Model; 4. Evaluation and Validation; 5. Discussion; 6. References
  • 3. 1. GPU • Graphics Processing Unit • Accelerates scientific and engineering applications (example: 3D gaming) Fig. 1: NVIDIA GeForce 8800 GT
  • 4. Why? • More integrated transistors • Rising power consumption • Heat dissipation, complex cooling solutions, noisier fans • Challenge of developing energy-efficient code • Need to analyze and model the power consumption of GPUs at run time
  • 5. 2. Statistical Power Consumption Analysis • High-level methodology for modeling GPU power consumption • First work that applies statistical analysis to model the power consumption of a GPU • Uses the coupling among power consumption characteristics, runtime performance, and dynamic workloads
  • 6. 2.1 How? • Record power consumption, runtime workload signals, and performance data • Build a statistical regression model o Able to estimate the power consumption of the GPU dynamically o Bridges the dynamic workload of the running GPU and its estimated power consumption • Uses an NVIDIA GeForce 8800 GT graphics card
  • 7. 2.2 Data Acquisition • Power consumption data • GPU workload signal recording
  • 8. 2.2.1 Power Consumption Data Acquisition • Test computer: runs the programs designed to test the GPU o NVIDIA GeForce 8800 GT graphics card with a 200 W power specification o AMD Athlon 64 X2 3.0 GHz dual-core processor o 2 GB memory o Corsair TX 750W power supply • Host computer: specialized data recording software and power acquisition system (FLUKE 2680A)
  • 9. 2.2.2 GPU Workload Signal Processing • Workload signals are recorded simultaneously with the power data using the NVIDIA PerfKit performance analysis tool - Capable of dynamically extracting 39 GPU workload variables • Choose 5 major variables - They represent the runtime utilization of the major pipeline stages on the GPU • Record GPU workload signals • Resample GPU workload signals
  • 10. Fig. 2: Recorded and corresponding resampled data. Five major variables: 1. vertex_shader_busy (the percentage of time when the vertex shader is busy) 2. pixel_shader_busy (the percentage of time when the pixel shader is busy) 3. texture_busy (the percentage of time when the texture unit is busy) 4. geom_busy (the percentage of time when the geometry shader is busy) 5. rop_busy (the percentage of time when the ROP unit is active)
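The resampling step on slides 9 and 10 can be sketched as follows. The slides do not say which resampling method was used, so linear interpolation is an assumption here, and the function name, timestamps, and sample values are all hypothetical.

```python
from bisect import bisect_left


def resample_linear(times, values, target_times):
    """Linearly interpolate (times, values) onto target_times.

    times must be strictly increasing. Targets outside the recorded
    range are clamped to the nearest recorded value.
    """
    out = []
    for t in target_times:
        if t <= times[0]:
            out.append(values[0])
        elif t >= times[-1]:
            out.append(values[-1])
        else:
            i = bisect_left(times, t)  # first index with times[i] >= t
            t0, t1 = times[i - 1], times[i]
            v0, v1 = values[i - 1], values[i]
            out.append(v0 + (v1 - v0) * (t - t0) / (t1 - t0))
    return out


# Hypothetical pixel_shader_busy samples (percent) at irregular times (seconds).
workload_t = [0.0, 0.4, 1.0, 1.6]
workload_v = [10.0, 30.0, 60.0, 30.0]

# Hypothetical power-meter timestamps that the workload signal is aligned to.
power_t = [0.0, 0.5, 1.0, 1.5]

aligned = resample_linear(workload_t, workload_v, power_t)
```

After this step each workload variable has one value per power sample, which is what the aligned data in the model definition requires.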
  • 11. 3. Statistical GPU Power Model • Assume: – The processed power consumption data is $Y = \{Y_{t_1}, Y_{t_2}, \ldots, Y_{t_n}\}$ ($t_i$ denotes the time index) – The aligned GPU workload data is $X^j = \{X^j_{t_1}, X^j_{t_2}, \ldots, X^j_{t_n}\}$ ($1 \le j \le N$; $X^j$ represents the $j$th GPU workload variable)
  • 12. Construct a statistical multivariable function (model) $Y_t = F(X^1_t, X^2_t, \ldots, X^N_t)$ that can robustly and accurately predict the GPU power consumption $Y_t$ given the GPU workload variables $(X^1_t, X^2_t, \ldots, X^N_t)$ at time $t$.
  • 13. 3.1 Methodology • Use the 5 major GPU workload variables. • Split the data set into a training subset and a cross-validation subset (test data). • Use the training subset to learn a Support Vector Regression (SVR) model with LIBSVM. • Compare the cross-validation results of the chosen SVR model with those of a simple least-squares linear regression (SLR) model.
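As a minimal sketch of the split-then-fit procedure above, the following code fits only the SLR baseline by solving the normal equations in plain Python. The SVR step with LIBSVM is not reproduced here. The function names are hypothetical, and the data is made up and noise-free (two workload variables instead of five) purely to exercise the code; it is not measurement data from the study.

```python
def fit_least_squares(X, y):
    """Fit y ~ b0 + b1*x1 + ... by solving the normal equations (A^T A) b = A^T y.

    Assumes the design matrix has full column rank; otherwise a pivot is zero.
    """
    A = [[1.0] + list(row) for row in X]  # prepend the intercept column
    n = len(A[0])
    # Augmented matrix [A^T A | A^T y].
    M = [
        [sum(r[i] * r[j] for r in A) for j in range(n)]
        + [sum(r[i] * yi for r, yi in zip(A, y))]
        for i in range(n)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(n):
            if r != c:
                f = M[r][c] / M[c][c]
                M[r] = [a - f * b for a, b in zip(M[r], M[c])]
    return [M[i][n] / M[i][i] for i in range(n)]


def predict(coef, row):
    """Evaluate the fitted linear model on one workload sample."""
    return coef[0] + sum(c * x for c, x in zip(coef[1:], row))


# Made-up samples: two "busy" percentages per row, power = 40 + 0.5*x1 + 0.2*x2.
X = [[10, 5], [20, 40], [35, 15], [50, 60], [65, 30], [80, 90], [90, 20], [30, 70]]
y = [40.0 + 0.5 * a + 0.2 * b for a, b in X]

# Training subset and cross-validation (test) subset.
X_train, y_train = X[:6], y[:6]
X_test, y_test = X[6:], y[6:]

coef = fit_least_squares(X_train, y_train)
test_pred = [predict(coef, row) for row in X_test]
```

Because the synthetic data is exactly linear, the fit recovers the generating coefficients; real power traces would leave residual error, which is what the comparison with SVR measures.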
  • 14. Fig. 3: Cross-validation comparison results (panels: graphics program; GPGPU Jorik benchmark)
  • 15. Table 1: Sum of squared errors of SLR and SVR on the cross-validation data
        Model | OpenGL Geometry Benchmark 1.0 (graphics program) | GPGPU Jorik benchmark
        SLR   | 656.83                                           | 44.523
        SVR   | 589.73                                           | 39.427
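The metric in Table 1 is the sum of squared errors between measured and predicted power. A minimal sketch follows; the function name and the three sample values are hypothetical, while the four table values are taken from Table 1.

```python
def sum_squared_error(actual, predicted):
    """Sum of squared differences between measured and predicted values."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    return sum((a - p) ** 2 for a, p in zip(actual, predicted))


# Hypothetical measured vs. predicted power samples (watts).
sse_example = sum_squared_error([100.0, 110.0, 120.0], [98.0, 111.0, 123.0])

# Relative reduction in SSE from SLR to SVR, using the values in Table 1.
graphics_reduction = (656.83 - 589.73) / 656.83
gpgpu_reduction = (44.523 - 39.427) / 44.523
```

With the Table 1 values, the reduction works out to roughly 10% for the graphics benchmark and roughly 11% for the GPGPU benchmark.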
  • 16. The result is clear: regardless of whether graphics or GPGPU applications are used, the chosen SVR model performed better than the traditional SLR model on the cross-validation (test) data set, reducing the sum of squared errors by roughly 10% on the graphics benchmark and roughly 11% on the GPGPU benchmark (Table 1).
  • 17. 4. Evaluation and Validation • How accurate and robust is the proposed statistical model when the GPU runs non-benchmark programs?
  • 18. Eight test programs (4 graphics programs and 4 GPGPU computing applications) were selected for testing. Graphics programs: Nexuiz, Xmas Tree, HDR, Dual Depth Peeling. Each program ran for 100 seconds.
  • 19. GPGPU programs: GNN, N-body simulation, Option Pricing, Fast Walsh Transform. The N-body simulation ran for 20 seconds; the other three stopped automatically once they had generated their outputs.
  • 20. Table 2: Summary of power prediction errors as a percentage of mean GPU power consumption
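The values of Table 2 did not survive in this transcript, but the normalization it describes can be sketched. The slides do not state which error statistic is normalized, so the mean absolute error used below is an assumption, and the function name and sample values are hypothetical.

```python
def error_percent_of_mean(actual, predicted):
    """Mean absolute prediction error as a percentage of mean measured power.

    Assumption: the error statistic is the mean absolute error; the slides
    only say the error is reported relative to mean GPU power consumption.
    """
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_power = sum(actual) / len(actual)
    mean_abs_error = sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)
    return 100.0 * mean_abs_error / mean_power


# Hypothetical measured vs. predicted power samples (watts).
pct = error_percent_of_mean([100.0, 110.0, 120.0], [98.0, 111.0, 123.0])
```

Normalizing by mean power makes errors comparable across programs that draw very different amounts of power.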
  • 21. 4.1 Results. Fig. 4: Comparison between the ground truth (blue) and the predicted GPU power consumption (red) for the four chosen graphics programs
  • 22. Fig. 5: Comparison between the ground truth (blue) and the predicted GPU power consumption (red) for the four chosen GPGPU programs
  • 23. 5. Discussion 1. This work studied the correlation between power consumption and the performance of graphics applications using NVIDIA PerfKit. 2. NVIDIA PerfKit is designed to identify the usage of GPU components by conventional graphics applications. 3. It cannot identify GPGPU-specific events such as global memory access, which is the largest factor in GPU power consumption.
  • 24. 4. The model depends entirely on the recorded workload signals of the running GPU, and these signals sometimes fail to indicate the power consumption of the underlying GPU. 5. It cannot accurately model power consumption peaks (these are due to other factors such as bus communication or memory access). 6. It is hard to predict in advance how much training data will be sufficient.
  • 25. 5.1 Related Work • Statistical Power Modeling of GPU Kernels Using Performance Counters (2010) • Quantifying the Impact of GPUs on Performance and Energy Efficiency in HPC Clusters (2010) • Performance and Power Analysis of ATI GPU: A Statistical Approach (2011) • Tree Structured Analysis on GPU Power Study (2011)
  • 26. 6. References • H. Nagasaka, N. Maruyama, A. Nukada, S. Matsuoka, T. Endo, Statistical Power Modeling of GPU Kernels Using Performance Counters, In Proc. of International Conference on Green Computing, p. 115-122, 2010 • J. Chen, B. Li, Y. Shang, L. Peng, J. Pier, Tree Structured Analysis on GPU Power Study, In Proc. of 29th International Conference on Computer Design, p. 57-64, 2011 • Y. Zhang, Y. Hu, B. Li, L. Peng, Performance and Power Analysis of ATI GPU: A Statistical Approach, In Proc. of 6th IEEE International Conference on NAS, p. 149-158, 2011 • R. Suda, D.Q. Ren, Accurate Measurements and Precise Modeling of Power Dissipation of CUDA Kernels toward Power Optimized High Performance CPU-GPU Computing, In Proc. of International Conference on Parallel and Distributed Computing, Application and Development, p. 432-438, 2009

Editor's Notes

  1. Vertex shader busy, pixel shader busy, texture busy, geom busy, ROP busy
  2. Sum of square error is used as a metric to measure the prediction quality.