SlideShare a Scribd company logo
1 of 4
Analysis of Parallel Algorithms for Energy
Conservation in Scalable Multicore Architecture
Problem statement
An important concern for computing, as for society, is conserving energy. Moreover, energy
conservation is critical in mobile devices for practical reasons. The relation between parallel
applications and their energy requirements on scalable multicore processors are examined. It
believe this sort of analysis can provide programmers with intuitions about the energy required by
the parallel algorithms they are using–thus guiding the choice of algorithm, architecture and the
number of cores used for a particular application. Researchers have studied performance scalability
of parallel algorithms for some time [10]. However, it is important to note that performance
scalability of a parallel algorithm is not the same as its energy scalability. This difference between
performance scalability and energy scalability is due to two important factors. There is a nonlinear
relationship between power and frequency at which the cores operate in multicore processors. In
fact, the power consumed by a core is (typically) proportional to the cube of its frequency.
Executing parallel algorithms typically involves communication (or shared memory accesses) as well
as computation. The power and performance characteristics of communication and computation
may be different. For example, in many algorithms, communication time may be masked by
overlapping communication and computation (e.g., [1]). However, the power required for
communication would be unaffected whether communication overlaps with the computation or not.
Objective
(a) To analyse energy characteristics of parallel algorithms executed on scalable multicore
processors.
(b) To study the sensitivity of the analysis to changes in parameters such as the ratio of power
required for computation versus power required for communication.
(c) To determine how many cores to use in order to minimize energy consumption.
Methodology
The methodology is to evaluate energy scalability under iso-performance of parallel applications.
Partitioning Step
Find the critical path of the parallel algorithm. The critical path is the longest path through the task
dependency graph (where edges represents task serialization) of the parallel algorithm. Note that
the critical path length gives a lower bound on execution time of the parallel algorithm. In the
parallel algorithm, the critical path is easy to find. It is the execution path of the core that has the
sum of all numbers at the end. Half of the cores send the sum they compute to the other half so that
no core receives a sum from more than one core. The receiving core then add the number the local
sum they have computed. We perform the same step recursively until there is only one core left. At
the end of computation, one core will store the sum of all N numbers.
Communication and computation steps
Partition the critical path into communication and computation steps. We can see that there are log
(M) communication steps and ((N=M) � 1 + log (M)) computation steps. Consider a simple parallel
algorithm to add N numbers using M cores. Initially all N numbers are equally distributed among the
M cores; at the end of the computation, one of the cores stores their sum. Without loss of
generality, assume that the number of cores available is some power of two. The algorithm runs in
log (M) steps.
Adding N numbers using 4 actors; Left most
line represents the critical path; embarrassingly
parallel application but represents a broad
class of tree algorithms
Agglomeration step
Scale computation steps of the critical path so that the parallel performance matches the
performance requirement. Scaling the computation time of the critical path to the difference of the
required performance and the communication time of the critical path, and obtain the new reduced
frequency at which all M cores should run. Reduced frequency obtained at which all M cores should
run to complete in time T.
Where β represents number of cycles required per addition. In order to achieve energy savings, we
require 0 < X0 < F. This restriction provides a lower bound on the input size as a function of M and Kc.
Evaluate the message complexity (total number of messages processed) of the parallel algorithm.
The example algorithms discuss before show that the message complexity of some parallel
algorithms may depend only on the number of cores, while for others it depends on both the input
size and the number of cores used. It is trivial to see that number of message transfers for this
parallel algorithm when running on M cores is (M-1). In the algorithm, the message complexity is
only dependent on M and not on the input size N.
Evaluate the total idle time at all the cores assuming the frequency obtained, running at new
frequency X’. Scaling the parallel algorithm critical path may lead to an increase in idle time in other
paths at other cores. Total idle time is:
Where the first term represents the total idle time spent by idle cores while other cores are busy
computing and second term represents the total idle time spent by idle cores while other cores are
involved in message communication.
Step 6 Frame an expression for energy consumption of the parallel algorithm using the energy
model. The energy expression is the sum of the energy consumed by Computation, Ecomp,
Communication, Ecomm and Idling (static power), Eidle
Ecomp is lower if the cores run at a lower frequency, while Eidle may increase as the busy cores take
longer to finish. Ecomm may increase as more cores are used since the computation is more
distributed. Observe that (N-1) is the total number of computation steps. The energy consumed for
computation, communication and idling while the algorithm is running on M cores at reduced
frequency X’ is:
Mapping step
Analyse the equation to obtain the number of cores required for minimum energy consumption as a
function of input size. The energy expression is dependent on many variables such as N (Input Size),
M (Number of cores), (Number of instruction per addition), Kc (no of cycles executed at maximum
frequency for single message communication time), Em (energy consumed for single message
communication between cores), Ps (static power) and the maximum frequency of a core. In most
architectures, the number of cycles involved per addition is just one, so we assume = 1. Set idle
energy consumed per cycle as (Ps=F) = 1, where the cycle is at the maximum frequency F. Express all
energy values with respect to this normalized energy value.
Figure 1 plot for any input size N, initially energy decreases with increasing M and later on increases
with increasing M. Energy for computation decreases with an increase in number of cores running at
reduced frequencies, and energy for communication increases with increasing cores. We can see
that increasing the input size leads to an increase in the optimal number of cores required for
minimum energy consumption. We now consider the sensitivity of this analysis with respect to the
ratio k. Figure 2 plots the optimal number of cores required for minimum energy consumption by
varying input size and k.
Figure 2 shows that for a fixed input size, the optimal number of cores required for minimum energy
consumption decreases with increasing k. Moreover, it is evident that with increasing input size, the
trend remains the same (approximating a negative exponential curve with negative coefficient).

More Related Content

What's hot

Enriched Firefly Algorithm for Solving Reactive Power Problem
Enriched Firefly Algorithm for Solving Reactive Power ProblemEnriched Firefly Algorithm for Solving Reactive Power Problem
Enriched Firefly Algorithm for Solving Reactive Power Problemijeei-iaes
 
Lut optimization for memory based computation
Lut optimization for memory based computationLut optimization for memory based computation
Lut optimization for memory based computationJohn Williams
 
Energy Efficient Geographical Forwarding Algorithm For Wireless Ad Hoc And Se...
Energy Efficient Geographical Forwarding Algorithm For Wireless Ad Hoc And Se...Energy Efficient Geographical Forwarding Algorithm For Wireless Ad Hoc And Se...
Energy Efficient Geographical Forwarding Algorithm For Wireless Ad Hoc And Se...zhendong
 
Presentation: Wind Speed Prediction using Radial Basis Function Neural Network
Presentation: Wind Speed Prediction using Radial Basis Function Neural NetworkPresentation: Wind Speed Prediction using Radial Basis Function Neural Network
Presentation: Wind Speed Prediction using Radial Basis Function Neural NetworkArzam Muzaffar Kotriwala
 
Kalman Filter Algorithm for Mitigation of Power System Harmonics
Kalman Filter Algorithm for Mitigation of Power System Harmonics  Kalman Filter Algorithm for Mitigation of Power System Harmonics
Kalman Filter Algorithm for Mitigation of Power System Harmonics IJECEIAES
 
A Threshold Enhancement Technique for Chaotic On-Off Keying Scheme
A Threshold Enhancement Technique for Chaotic On-Off Keying SchemeA Threshold Enhancement Technique for Chaotic On-Off Keying Scheme
A Threshold Enhancement Technique for Chaotic On-Off Keying SchemeCSCJournals
 
Economic/Emission Load Dispatch Using Artificial Bee Colony Algorithm
Economic/Emission Load Dispatch Using Artificial Bee Colony AlgorithmEconomic/Emission Load Dispatch Using Artificial Bee Colony Algorithm
Economic/Emission Load Dispatch Using Artificial Bee Colony AlgorithmIDES Editor
 
Design & Implementation of LUT Based Multiplier Using APCOMS Technique
Design & Implementation of LUT Based Multiplier Using APCOMS TechniqueDesign & Implementation of LUT Based Multiplier Using APCOMS Technique
Design & Implementation of LUT Based Multiplier Using APCOMS Techniqueijsrd.com
 
A study of dipole antennas using mat lab
A study of dipole antennas using mat labA study of dipole antennas using mat lab
A study of dipole antennas using mat labLe Professionist
 
A reduced complexity and an efficient channel
A reduced complexity and an efficient channelA reduced complexity and an efficient channel
A reduced complexity and an efficient channeleSAT Publishing House
 
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...ijgca
 
Reduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
Reduction of Active Power Loss byUsing Adaptive Cat Swarm OptimizationReduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
Reduction of Active Power Loss byUsing Adaptive Cat Swarm Optimizationijeei-iaes
 
Combination of Immune Genetic Particle Swarm Optimization algorithm with BP a...
Combination of Immune Genetic Particle Swarm Optimization algorithm with BP a...Combination of Immune Genetic Particle Swarm Optimization algorithm with BP a...
Combination of Immune Genetic Particle Swarm Optimization algorithm with BP a...paperpublications3
 
Performance of CBR Traffic on Node Overutilization in MANETs
Performance of CBR Traffic on Node Overutilization in MANETsPerformance of CBR Traffic on Node Overutilization in MANETs
Performance of CBR Traffic on Node Overutilization in MANETsRSIS International
 
Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...
Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...
Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...IDES Editor
 
Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...
Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...
Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...graphhoc
 

What's hot (19)

Enriched Firefly Algorithm for Solving Reactive Power Problem
Enriched Firefly Algorithm for Solving Reactive Power ProblemEnriched Firefly Algorithm for Solving Reactive Power Problem
Enriched Firefly Algorithm for Solving Reactive Power Problem
 
Lut optimization for memory based computation
Lut optimization for memory based computationLut optimization for memory based computation
Lut optimization for memory based computation
 
Energy Efficient Geographical Forwarding Algorithm For Wireless Ad Hoc And Se...
Energy Efficient Geographical Forwarding Algorithm For Wireless Ad Hoc And Se...Energy Efficient Geographical Forwarding Algorithm For Wireless Ad Hoc And Se...
Energy Efficient Geographical Forwarding Algorithm For Wireless Ad Hoc And Se...
 
Presentation: Wind Speed Prediction using Radial Basis Function Neural Network
Presentation: Wind Speed Prediction using Radial Basis Function Neural NetworkPresentation: Wind Speed Prediction using Radial Basis Function Neural Network
Presentation: Wind Speed Prediction using Radial Basis Function Neural Network
 
Kalman Filter Algorithm for Mitigation of Power System Harmonics
Kalman Filter Algorithm for Mitigation of Power System Harmonics  Kalman Filter Algorithm for Mitigation of Power System Harmonics
Kalman Filter Algorithm for Mitigation of Power System Harmonics
 
A Threshold Enhancement Technique for Chaotic On-Off Keying Scheme
A Threshold Enhancement Technique for Chaotic On-Off Keying SchemeA Threshold Enhancement Technique for Chaotic On-Off Keying Scheme
A Threshold Enhancement Technique for Chaotic On-Off Keying Scheme
 
Economic/Emission Load Dispatch Using Artificial Bee Colony Algorithm
Economic/Emission Load Dispatch Using Artificial Bee Colony AlgorithmEconomic/Emission Load Dispatch Using Artificial Bee Colony Algorithm
Economic/Emission Load Dispatch Using Artificial Bee Colony Algorithm
 
Design & Implementation of LUT Based Multiplier Using APCOMS Technique
Design & Implementation of LUT Based Multiplier Using APCOMS TechniqueDesign & Implementation of LUT Based Multiplier Using APCOMS Technique
Design & Implementation of LUT Based Multiplier Using APCOMS Technique
 
A study of dipole antennas using mat lab
A study of dipole antennas using mat labA study of dipole antennas using mat lab
A study of dipole antennas using mat lab
 
E0812730
E0812730E0812730
E0812730
 
A reduced complexity and an efficient channel
A reduced complexity and an efficient channelA reduced complexity and an efficient channel
A reduced complexity and an efficient channel
 
Economic load dispatch problem solving using "Cuckoo Search"
Economic load dispatch problem solving using "Cuckoo Search"Economic load dispatch problem solving using "Cuckoo Search"
Economic load dispatch problem solving using "Cuckoo Search"
 
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...
 
Reduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
Reduction of Active Power Loss byUsing Adaptive Cat Swarm OptimizationReduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
Reduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
 
Wireless Sensor Network Security Analytics
Wireless Sensor Network Security AnalyticsWireless Sensor Network Security Analytics
Wireless Sensor Network Security Analytics
 
Combination of Immune Genetic Particle Swarm Optimization algorithm with BP a...
Combination of Immune Genetic Particle Swarm Optimization algorithm with BP a...Combination of Immune Genetic Particle Swarm Optimization algorithm with BP a...
Combination of Immune Genetic Particle Swarm Optimization algorithm with BP a...
 
Performance of CBR Traffic on Node Overutilization in MANETs
Performance of CBR Traffic on Node Overutilization in MANETsPerformance of CBR Traffic on Node Overutilization in MANETs
Performance of CBR Traffic on Node Overutilization in MANETs
 
Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...
Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...
Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...
 
Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...
Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...
Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...
 

Similar to Analysis of parallel algorithms for energy consumption

OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKSOPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKSZac Darcy
 
Edd clustering algorithm for
Edd clustering algorithm forEdd clustering algorithm for
Edd clustering algorithm forcsandit
 
EDD CLUSTERING ALGORITHM FOR WIRELESS SENSOR NETWORKS
EDD CLUSTERING ALGORITHM FOR WIRELESS SENSOR NETWORKSEDD CLUSTERING ALGORITHM FOR WIRELESS SENSOR NETWORKS
EDD CLUSTERING ALGORITHM FOR WIRELESS SENSOR NETWORKScscpconf
 
Energy consumption of vlsi decoders
Energy consumption of vlsi decodersEnergy consumption of vlsi decoders
Energy consumption of vlsi decodersjpstudcorner
 
Parallel Algorithms: Sort & Merge, Image Processing, Fault Tolerance
Parallel Algorithms: Sort & Merge, Image Processing, Fault ToleranceParallel Algorithms: Sort & Merge, Image Processing, Fault Tolerance
Parallel Algorithms: Sort & Merge, Image Processing, Fault ToleranceUniversity of Technology - Iraq
 
A CLUSTER BASED STABLE ROUTING PROTOCOL USING BINARY PARTICLE SWARM OPTIMIZAT...
A CLUSTER BASED STABLE ROUTING PROTOCOL USING BINARY PARTICLE SWARM OPTIMIZAT...A CLUSTER BASED STABLE ROUTING PROTOCOL USING BINARY PARTICLE SWARM OPTIMIZAT...
A CLUSTER BASED STABLE ROUTING PROTOCOL USING BINARY PARTICLE SWARM OPTIMIZAT...ijmnct
 
Design Optimization of Energy and Delay Efficient Wireless Sensor Network wit...
Design Optimization of Energy and Delay Efficient Wireless Sensor Network wit...Design Optimization of Energy and Delay Efficient Wireless Sensor Network wit...
Design Optimization of Energy and Delay Efficient Wireless Sensor Network wit...IOSR Journals
 
Cluster head election using imperialist competitive algorithm (chei) for wire...
Cluster head election using imperialist competitive algorithm (chei) for wire...Cluster head election using imperialist competitive algorithm (chei) for wire...
Cluster head election using imperialist competitive algorithm (chei) for wire...ijmnct
 
E NERGY D EGREE D ISTANCE C LUSTERING A LGORITHM FOR W Sns
E NERGY D EGREE D ISTANCE C LUSTERING A LGORITHM FOR W SnsE NERGY D EGREE D ISTANCE C LUSTERING A LGORITHM FOR W Sns
E NERGY D EGREE D ISTANCE C LUSTERING A LGORITHM FOR W Snsijwmn
 
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...IJCNCJournal
 
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...IJCNCJournal
 
Energy-Efficient Hybrid K-Means Algorithm for Clustered Wireless Sensor Netw...
Energy-Efficient Hybrid K-Means Algorithm for Clustered  Wireless Sensor Netw...Energy-Efficient Hybrid K-Means Algorithm for Clustered  Wireless Sensor Netw...
Energy-Efficient Hybrid K-Means Algorithm for Clustered Wireless Sensor Netw...IJECEIAES
 
Efficient use of Energy in WSN using H-LEACH
Efficient use of Energy in WSN using H-LEACHEfficient use of Energy in WSN using H-LEACH
Efficient use of Energy in WSN using H-LEACHdbpublications
 
Comprehensive Performance Evaluation on Multiplication of Matrices using MPI
Comprehensive Performance Evaluation on Multiplication of Matrices using MPIComprehensive Performance Evaluation on Multiplication of Matrices using MPI
Comprehensive Performance Evaluation on Multiplication of Matrices using MPIijtsrd
 
PERFORMANCE AND COMPLEXITY ANALYSIS OF A REDUCED ITERATIONS LLL ALGORITHM
PERFORMANCE AND COMPLEXITY ANALYSIS OF A REDUCED ITERATIONS LLL ALGORITHMPERFORMANCE AND COMPLEXITY ANALYSIS OF A REDUCED ITERATIONS LLL ALGORITHM
PERFORMANCE AND COMPLEXITY ANALYSIS OF A REDUCED ITERATIONS LLL ALGORITHMIJCNCJournal
 
Approximated computing for low power neural networks
Approximated computing for low power neural networksApproximated computing for low power neural networks
Approximated computing for low power neural networksTELKOMNIKA JOURNAL
 
Performance evaluation of various routing techniques in wireless multimedia
Performance evaluation of various routing techniques in wireless multimediaPerformance evaluation of various routing techniques in wireless multimedia
Performance evaluation of various routing techniques in wireless multimediaIAEME Publication
 
Optimum Network Reconfiguration using Grey Wolf Optimizer
Optimum Network Reconfiguration using Grey Wolf OptimizerOptimum Network Reconfiguration using Grey Wolf Optimizer
Optimum Network Reconfiguration using Grey Wolf OptimizerTELKOMNIKA JOURNAL
 

Similar to Analysis of parallel algorithms for energy consumption (20)

OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKSOPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
 
Edd clustering algorithm for
Edd clustering algorithm forEdd clustering algorithm for
Edd clustering algorithm for
 
EDD CLUSTERING ALGORITHM FOR WIRELESS SENSOR NETWORKS
EDD CLUSTERING ALGORITHM FOR WIRELESS SENSOR NETWORKSEDD CLUSTERING ALGORITHM FOR WIRELESS SENSOR NETWORKS
EDD CLUSTERING ALGORITHM FOR WIRELESS SENSOR NETWORKS
 
Energy consumption of vlsi decoders
Energy consumption of vlsi decodersEnergy consumption of vlsi decoders
Energy consumption of vlsi decoders
 
Parallel Algorithms: Sort & Merge, Image Processing, Fault Tolerance
Parallel Algorithms: Sort & Merge, Image Processing, Fault ToleranceParallel Algorithms: Sort & Merge, Image Processing, Fault Tolerance
Parallel Algorithms: Sort & Merge, Image Processing, Fault Tolerance
 
A CLUSTER BASED STABLE ROUTING PROTOCOL USING BINARY PARTICLE SWARM OPTIMIZAT...
A CLUSTER BASED STABLE ROUTING PROTOCOL USING BINARY PARTICLE SWARM OPTIMIZAT...A CLUSTER BASED STABLE ROUTING PROTOCOL USING BINARY PARTICLE SWARM OPTIMIZAT...
A CLUSTER BASED STABLE ROUTING PROTOCOL USING BINARY PARTICLE SWARM OPTIMIZAT...
 
Design Optimization of Energy and Delay Efficient Wireless Sensor Network wit...
Design Optimization of Energy and Delay Efficient Wireless Sensor Network wit...Design Optimization of Energy and Delay Efficient Wireless Sensor Network wit...
Design Optimization of Energy and Delay Efficient Wireless Sensor Network wit...
 
Ijecet 06 09_003
Ijecet 06 09_003Ijecet 06 09_003
Ijecet 06 09_003
 
Cluster head election using imperialist competitive algorithm (chei) for wire...
Cluster head election using imperialist competitive algorithm (chei) for wire...Cluster head election using imperialist competitive algorithm (chei) for wire...
Cluster head election using imperialist competitive algorithm (chei) for wire...
 
E NERGY D EGREE D ISTANCE C LUSTERING A LGORITHM FOR W Sns
E NERGY D EGREE D ISTANCE C LUSTERING A LGORITHM FOR W SnsE NERGY D EGREE D ISTANCE C LUSTERING A LGORITHM FOR W Sns
E NERGY D EGREE D ISTANCE C LUSTERING A LGORITHM FOR W Sns
 
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
 
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
 
Energy-Efficient Hybrid K-Means Algorithm for Clustered Wireless Sensor Netw...
Energy-Efficient Hybrid K-Means Algorithm for Clustered  Wireless Sensor Netw...Energy-Efficient Hybrid K-Means Algorithm for Clustered  Wireless Sensor Netw...
Energy-Efficient Hybrid K-Means Algorithm for Clustered Wireless Sensor Netw...
 
Efficient use of Energy in WSN using H-LEACH
Efficient use of Energy in WSN using H-LEACHEfficient use of Energy in WSN using H-LEACH
Efficient use of Energy in WSN using H-LEACH
 
Comprehensive Performance Evaluation on Multiplication of Matrices using MPI
Comprehensive Performance Evaluation on Multiplication of Matrices using MPIComprehensive Performance Evaluation on Multiplication of Matrices using MPI
Comprehensive Performance Evaluation on Multiplication of Matrices using MPI
 
PERFORMANCE AND COMPLEXITY ANALYSIS OF A REDUCED ITERATIONS LLL ALGORITHM
PERFORMANCE AND COMPLEXITY ANALYSIS OF A REDUCED ITERATIONS LLL ALGORITHMPERFORMANCE AND COMPLEXITY ANALYSIS OF A REDUCED ITERATIONS LLL ALGORITHM
PERFORMANCE AND COMPLEXITY ANALYSIS OF A REDUCED ITERATIONS LLL ALGORITHM
 
Fo2510211029
Fo2510211029Fo2510211029
Fo2510211029
 
Approximated computing for low power neural networks
Approximated computing for low power neural networksApproximated computing for low power neural networks
Approximated computing for low power neural networks
 
Performance evaluation of various routing techniques in wireless multimedia
Performance evaluation of various routing techniques in wireless multimediaPerformance evaluation of various routing techniques in wireless multimedia
Performance evaluation of various routing techniques in wireless multimedia
 
Optimum Network Reconfiguration using Grey Wolf Optimizer
Optimum Network Reconfiguration using Grey Wolf OptimizerOptimum Network Reconfiguration using Grey Wolf Optimizer
Optimum Network Reconfiguration using Grey Wolf Optimizer
 

More from are you

Business plan (air umbrella)
Business plan (air umbrella)Business plan (air umbrella)
Business plan (air umbrella)are you
 
A* algorithm
A* algorithmA* algorithm
A* algorithmare you
 
Dijkstra algorithm
Dijkstra algorithmDijkstra algorithm
Dijkstra algorithmare you
 
Application of least square method
Application of least square methodApplication of least square method
Application of least square methodare you
 
Comparative study of fish harvesting
Comparative study of fish harvestingComparative study of fish harvesting
Comparative study of fish harvestingare you
 
Business idea
Business ideaBusiness idea
Business ideaare you
 
Effective determinants of corporate nano
Effective determinants of corporate nanoEffective determinants of corporate nano
Effective determinants of corporate nanoare you
 
Govida chocolate
Govida chocolateGovida chocolate
Govida chocolateare you
 
Yourprezi
YourpreziYourprezi
Yourpreziare you
 
Skrip Arabic (الكتابة العربية)
Skrip Arabic (الكتابة العربية)Skrip Arabic (الكتابة العربية)
Skrip Arabic (الكتابة العربية)are you
 
Report ent600
Report  ent600Report  ent600
Report ent600are you
 
Mgt project
Mgt projectMgt project
Mgt projectare you
 
MyBride Presentation
MyBride PresentationMyBride Presentation
MyBride Presentationare you
 
Levi's jeans presentation
Levi's jeans presentationLevi's jeans presentation
Levi's jeans presentationare you
 
Portable Printer
Portable PrinterPortable Printer
Portable Printerare you
 
Air craft control system of parallel processing
Air craft control system of parallel processingAir craft control system of parallel processing
Air craft control system of parallel processingare you
 
Application of Parallel Processing
Application of Parallel ProcessingApplication of Parallel Processing
Application of Parallel Processingare you
 
Mybride gallery
Mybride galleryMybride gallery
Mybride galleryare you
 

More from are you (20)

Business plan (air umbrella)
Business plan (air umbrella)Business plan (air umbrella)
Business plan (air umbrella)
 
A* algorithm
A* algorithmA* algorithm
A* algorithm
 
Poster
PosterPoster
Poster
 
Dijkstra algorithm
Dijkstra algorithmDijkstra algorithm
Dijkstra algorithm
 
Application of least square method
Application of least square methodApplication of least square method
Application of least square method
 
Comparative study of fish harvesting
Comparative study of fish harvestingComparative study of fish harvesting
Comparative study of fish harvesting
 
Business idea
Business ideaBusiness idea
Business idea
 
Effective determinants of corporate nano
Effective determinants of corporate nanoEffective determinants of corporate nano
Effective determinants of corporate nano
 
Govida chocolate
Govida chocolateGovida chocolate
Govida chocolate
 
Yourprezi
YourpreziYourprezi
Yourprezi
 
Skrip Arabic (الكتابة العربية)
Skrip Arabic (الكتابة العربية)Skrip Arabic (الكتابة العربية)
Skrip Arabic (الكتابة العربية)
 
Report ent600
Report  ent600Report  ent600
Report ent600
 
Mgt project
Mgt projectMgt project
Mgt project
 
MyBride Presentation
MyBride PresentationMyBride Presentation
MyBride Presentation
 
Prezi
PreziPrezi
Prezi
 
Levi's jeans presentation
Levi's jeans presentationLevi's jeans presentation
Levi's jeans presentation
 
Portable Printer
Portable PrinterPortable Printer
Portable Printer
 
Air craft control system of parallel processing
Air craft control system of parallel processingAir craft control system of parallel processing
Air craft control system of parallel processing
 
Application of Parallel Processing
Application of Parallel ProcessingApplication of Parallel Processing
Application of Parallel Processing
 
Mybride gallery
Mybride galleryMybride gallery
Mybride gallery
 

Recently uploaded

ROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint PresentationROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint PresentationAadityaSharma884161
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Atmosphere science 7 quarter 4 .........
Atmosphere science 7 quarter 4 .........Atmosphere science 7 quarter 4 .........
Atmosphere science 7 quarter 4 .........LeaCamillePacle
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxChelloAnnAsuncion2
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementmkooblal
 
Romantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxRomantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxsqpmdrvczh
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.arsicmarija21
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfUjwalaBharambe
 

Recently uploaded (20)

ROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint PresentationROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint Presentation
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
Atmosphere science 7 quarter 4 .........
Atmosphere science 7 quarter 4 .........Atmosphere science 7 quarter 4 .........
Atmosphere science 7 quarter 4 .........
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of management
 
Romantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxRomantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptx
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
 

Analysis of parallel algorithms for energy consumption

  • 1. Analysis of Parallel Algorithms for Energy Conservation in Scalable Multicore Architecture Problem statement An important concern for computing, as for society, is conserving energy. Moreover, energy conservation is critical in mobile devices for practical reasons. The relation between parallel applications and their energy requirements on scalable multicore processors are examined. It believe this sort of analysis can provide programmers with intuitions about the energy required by the parallel algorithms they are using–thus guiding the choice of algorithm, architecture and the number of cores used for a particular application. Researchers have studied performance scalability of parallel algorithms for some time [10]. However, it is important to note that performance scalability of a parallel algorithm is not the same as its energy scalability. This difference between performance scalability and energy scalability is due to two important factors. There is a nonlinear relationship between power and frequency at which the cores operate in multicore processors. In fact, the power consumed by a core is (typically) proportional to the cube of its frequency. Executing parallel algorithms typically involves communication (or shared memory accesses) as well as computation. The power and performance characteristics of communication and computation may be different. For example, in many algorithms, communication time may be masked by overlapping communication and computation (e.g., [1]). However, the power required for communication would be unaffected whether communication overlaps with the computation or not. Objective (a) To analyse energy characteristics of parallel algorithms executed on scalable multicore processors. (b) To study the sensitivity of the analysis to changes in parameters such as the ratio of power required for computation versus power required for communication. (c) To determine how many cores to use in order to minimize energy consumption.
  • 2. Methodology The methodology is to evaluate energy scalability under iso-performance of parallel applications. Partitioning Step Find the critical path of the parallel algorithm. The critical path is the longest path through the task dependency graph (where edges represents task serialization) of the parallel algorithm. Note that the critical path length gives a lower bound on execution time of the parallel algorithm. In the parallel algorithm, the critical path is easy to find. It is the execution path of the core that has the sum of all numbers at the end. Half of the cores send the sum they compute to the other half so that no core receives a sum from more than one core. The receiving core then add the number the local sum they have computed. We perform the same step recursively until there is only one core left. At the end of computation, one core will store the sum of all N numbers. Communication and computation steps Partition the critical path into communication and computation steps. We can see that there are log (M) communication steps and ((N=M) � 1 + log (M)) computation steps. Consider a simple parallel algorithm to add N numbers using M cores. Initially all N numbers are equally distributed among the M cores; at the end of the computation, one of the cores stores their sum. Without loss of generality, assume that the number of cores available is some power of two. The algorithm runs in log (M) steps. Adding N numbers using 4 actors; Left most line represents the critical path; embarrassingly parallel application but represents a broad class of tree algorithms
  • 3. Agglomeration step Scale computation steps of the critical path so that the parallel performance matches the performance requirement. Scaling the computation time of the critical path to the difference of the required performance and the communication time of the critical path, and obtain the new reduced frequency at which all M cores should run. Reduced frequency obtained at which all M cores should run to complete in time T. Where β represents number of cycles required per addition. In order to achieve energy savings, we require 0 < X0 < F. This restriction provides a lower bound on the input size as a function of M and Kc. Evaluate the message complexity (total number of messages processed) of the parallel algorithm. The example algorithms discuss before show that the message complexity of some parallel algorithms may depend only on the number of cores, while for others it depends on both the input size and the number of cores used. It is trivial to see that number of message transfers for this parallel algorithm when running on M cores is (M-1). In the algorithm, the message complexity is only dependent on M and not on the input size N. Evaluate the total idle time at all the cores assuming the frequency obtained, running at new frequency X’. Scaling the parallel algorithm critical path may lead to an increase in idle time in other paths at other cores. Total idle time is: Where the first term represents the total idle time spent by idle cores while other cores are busy computing and second term represents the total idle time spent by idle cores while other cores are involved in message communication. Step 6 Frame an expression for energy consumption of the parallel algorithm using the energy model. The energy expression is the sum of the energy consumed by Computation, Ecomp, Communication, Ecomm and Idling (static power), Eidle Ecomp is lower if the cores run at a lower frequency, while Eidle may increase as the busy cores take longer to finish. Ecomm may increase as more cores are used since the computation is more distributed. Observe that (N-1) is the total number of computation steps. The energy consumed for computation, communication and idling while the algorithm is running on M cores at reduced frequency X’ is:
  • 4. Mapping step Analyse the equation to obtain the number of cores required for minimum energy consumption as a function of input size. The energy expression is dependent on many variables such as N (Input Size), M (Number of cores), (Number of instruction per addition), Kc (no of cycles executed at maximum frequency for single message communication time), Em (energy consumed for single message communication between cores), Ps (static power) and the maximum frequency of a core. In most architectures, the number of cycles involved per addition is just one, so we assume = 1. Set idle energy consumed per cycle as (Ps=F) = 1, where the cycle is at the maximum frequency F. Express all energy values with respect to this normalized energy value. Figure 1 plot for any input size N, initially energy decreases with increasing M and later on increases with increasing M. Energy for computation decreases with an increase in number of cores running at reduced frequencies, and energy for communication increases with increasing cores. We can see that increasing the input size leads to an increase in the optimal number of cores required for minimum energy consumption. We now consider the sensitivity of this analysis with respect to the ratio k. Figure 2 plots the optimal number of cores required for minimum energy consumption by varying input size and k. Figure 2 shows that for a fixed input size, the optimal number of cores required for minimum energy consumption decreases with increasing k. Moreover, it is evident that with increasing input size, the trend remains the same (approximating a negative exponential curve with negative coefficient).