SlideShare a Scribd company logo
Instruction Level Power Analysis




                                   1
Layout
   Introduction
   Components of Power Consumption
   Power Characterization
   Instruction Level Power Analysis for RISC
    processors
   Extensions for VLIW/EPIC processors
   Register Files
   Caches



                                                2
Introduction
   Why power of nano-electronics became so
    important?
       Because of Moore’s law still holds true through
        complex applications
       Mobile systems – battery “bottleneck”
       High performance computation – heat
        extraction
       Operating cost and reliability
          Data warehouse of ISP with 8000 servers

           needs 2 MW



                                                          3
Introduction
   Power or Energy? Aren’t they go hand-in-hand?
       Power varies significantly with time!
       A given battery has fixed amount of energy
       Average power consumption = Energy/Execution-
        time
            Decides average chip and junction temperature
            Decides battery life (if peak current < rated
             current)
       Peak power and current
            Voltage drops, hot spots, rate of battery discharge
   Power-efficient, Energy-efficient, Battery-efficient
    design paradigms do exist!



                                                                   4
Components of Power
Consumption
   System = hardware platform + software (sys. & app.)
       Software impacts hardware power consumption
   Static power
       Sub-threshold leakage & reverse biased junction leakage
       Quiescent biasing power (in case of non-CMOS circuits)
   Dynamic power
       Charging and discharging of capacitance (switching
        activity)
       Short circuit power during transition (rate of change,
        delay)
       Alternative grouping (used at component/cell level)
            Switching power at the boundaries of cells
            Internal cell power
                  Short circuit power
                  Switching power at internal nodes




                                                                  5
System Abstractions - Power
                                  Functional Specifications and Constraints




                                                                              Accuracy of power characterization
Opportunities for optimization



                                            System Level Netlist




                                                                                                                   Time complexity
                                    Register Transfer Level (RTL) Netlist

                                        Component/Cell Level Netlist

                                        Layout or Configuration-bits

                                                    Chip


                                                                                                                          6
Power Characterization
   Measurement (Chip/Board Level)
       Most accurate
       Perhaps the fastest, if setup and tools
        exist
       Too late to change hardware details
       Software/Load control is still possible
       Typically used for software
        optimizations


                                              7
Power Characterization (cont…)
   Transistor Level (estimation)
       Spice simulation of transistor level netlist
       Most accurate in the simulation world
       Requires complete implementation details
       Unmanageable time complexity even for
        simpler designs
       Typically used for cell/component
        characterization
       Synopsys PowerMill (said to provide spice-
        like accuracy)


                                                       8
Power Characterization (cont…)
   Cell Level (estimation)
       After logic synthesis
       Requires RTL implementation
       Simulation to capture switching activity
            Requires delay simulation if glitches need to be accounted
       Characterized cells – empirical formulas or table look-up
       Interconnect power
            Either unaccounted or
            Using estimated wire load models (typically based on
             experience) or
            Extracted layout (if done after physical synthesis)
       Still unmanageable time complexity especially to use in
        design space exploration
       Synopsys PrimePower
            Netlist, interconnect capacitance, VCD traces, cell power
             library




                                                                          9
Power Characterization (cont…)
   Register Transfer Level (estimation)
       Requires conceptual RTL description (detailed
        micro-architecture)
       Data-path is modeled as netlist of macro cells,
        which are characterized offline
       Control path and glue logic
            Either unaccounted or estimated based on I/O
       Simulation to capture switching activity
            Typically glitches are not considered but methods do
             exist
       Interconnect power
            Typically unaccounted but possible to estimate
             through floor-planning
       Typically used in DSE mostly using in-house tools


                                                                    10
System Level Power Estimation
   For Design Space Exploration
   Least accurate but uncertainty of exploration results
    can be reduced if models have good fidelity
   Purpose, target architecture and available system
    details govern the system-level estimation models
       Selecting algorithm or designing hardware for given
        algorithm?
       ASIC based or processor based?
       Is ISA fixed or extensible?
   Typically system-level power estimation models are
    macro-architecture template specific
   Major constituents of power consumption
       Computation, communication, storage units & peripherals




                                                              11
Power Estimation Models
   Activity Based Models
   Instruction Level Energy Models




                                      12
Activity Based Models
   Fixed Activity Model
   N-Transition Model
   Dual Bit Model




                           13
Fixed Activity Model

                   P = ∑ i kiGifi
Where:
ki = PFA proportionality constant extracted
   empirically from past designs
Gi = Measure of hardware complexity
fi = Activation frequency

   Disadvantage: Do not model the influence of data
    activity on power consumption



                                                   14
N-Transition Model

               P = Pconst + n.Pchange

   Disadvantage:
    It does not differentiate between transitions on
     different inputs.




                                                       15
Dual Bit Type Model
   Drawback in previous
    approaches:
       Less Accurate
       Characterizes the
        module on basis of
        Uniform White Noise
        (UWN) input
       Leads to high error if
        the input dynamic
        range does not fully
        occupy the word
        length



                                 16
Dual Bit Type Model
The Approach

   Combines reduced complexity of the
    architecture level with the accuracy of
    gate and circuit level
   Black box model of capacitance switched
    in each module for various types of inputs
   Easy to parameterize capacitance models
    to take into account size , etc.




                                             17
Dual Bit Type Model
Modeling Complexity
   Power consumed by a module is a
    function of its complexity as large
    modules contain more circuitry
   Examples:
       Capacitance of N-bit ripple carry subtracter:
               CT = Ceff * N
       Not restricted to linear models, but can be
        used to specify even more complex models




                                                        18
Dual Bit Type Model
Capacitive Data Coefficients

   Describe the average amount of
    capacitance switched within a module
    during an input transition
       LSB regions suffer random transitions and
        hence can be characterized by a single
        capacitive coefficient CUU
       MSB region experiences sign transitions and so
        is characterized by capacitive sign coefficients
        C+-,C++, etc.




                                                      19
Instruction Level Power Estimation
     First introduced to characterize
      processor power consumption to drive
      software optimizations
     Each instruction is associated with
      some current
     Inter instruction effects for better
      accuracy




                                         20
Instruction Level Power Estimation
     E = Σ(Bi x Ni) + Σ(O(i,j) x N(I,j)) +
      ΣEk
        Bi: Base Energy Cost
        Oi.j: Inter-instruction effect Energy Cost

        Ek: additional energy penalties due to

         resource constraints
     Require cost associated with every pair
      of instructions: O(N2), where N =
      number of instructions in ISA


                                                      21
JouleTrack
   Experiments on StrongARM by Amit Sinha &
    A.P.Chandran
       Current/instruction ~ 0.2A (averaged over all
        instructions)
       Min-max variation of 38% of average current
       Address mode and data dependent variation is
        smaller
       But, max current variation across benchmarks is
        < 8% !
       Concluded that first order energy model of a
        given processor is, E = V I(V, f) T
       Second order effects can be significant for data-
        path dominated processors such as DSP, VLIW


                                                        22
Instruction Level Power Estimation
   Impractical for CISC processors with
    very large instruction set
       Higher Average Instruction Energy
       Low Energy Per Instruction Variance
       Do not consider inter instruction effects
       Cluster Similar Instructions as a single
        class
   Exponential Storage Problem for VLIW
    architectures
       No. of Long Instructions = N operations
        into a K-wide VLIW = N(2k)


                                                    23
Modified Energy Model for VLIW
   Assume Independent Energy dissipation for
    different Execution slots
   Consider nop as the base energy
      E(W) = ΣU(wn|wn-1) + mxpxS + lxqxM

      U(wn|wn-1) = U(0|0) + Σv(wnk,wn-1k)

            Wnk = operation issued on lane k by instruction wn
       Example
            Wn = [ ALU NOP NOP NOP], Wn-1 = [ LS NOP ALU
             NOP]
            U(wn|wn-1) = U(0|0) + v(ALU|LS) + v(NOP|ALU)
   Memory Requirement
       O(K*N2)

                                                             24
Modified Energy Model for VLIW
   Cluster Similar Instructions based on cost
       Θ = {e1, e2, …, et}
            et = energy consumption of instruction t
       Partition Θ into K clusters (C1, C2, …, Ck) s.t.
            ΣΣ (xi,j –cj)2 = minimum
   Large number of clusters
       Good Accuracy
       Huge no. of experiments
   Small number of clusters
      Small number of experiments
      High Variance between clusters
      Reduced Accuracy
   Memory Requirement
       O(C*N2)



                                                           25
Limitations of ILPA
   Does not provide any insight on the
    causes of power consumption within the
    processor core
   Does not account for the power consumed
    in the memory system, which is often
    dominant
   To address the second limitation, power
    estimation frameworks which integrate
    processor and memory models are built
    around instruction set simulators


                                          26
MicroArchitecture ILPA
   Pipeline Aware Instruction Level Energy Model
   Divide the design into smaller architectural blocks
       Usually Processor’s Pipeline Stages
            Fetch, Decode, RF, Execute, WB
       E(wn|wn-1) = Σ As(wn|wn-1) + I(wn|wn-1)
            As = Energy Consumed Per stage s when executing
             wn after wn-1
            I(wn|wn-1) = Interstage connections energy
             (PipeLine Registers + Buses)
       Provides better insight for power bottlenecks
       Smoother Energy Behaviour than Blackbox model
       Require a Pipeline Structure Aware ISS



                                                          27
Energy Models for Register File
   Assume Linear Power Behaviour for
    access across different ports
       PRF = Pi + 1/T Σ (Er,n + Ew,n)
       Er,n = Σ H(RRi,n, RRi,n-1) *Erb
       Ew,n = Σ H(RWi,n, oldi,n) * Ewb




                                          28
Energy Model for Caches
   Power consumption depends on mode of
    operation (read, write, idle)
   Energy consumed in a given clock cycle is
    function of node transition between
    previous and current cycle.
   Characterize energy as function of state
    transitions(read-read, read-write, etc).
   For a given transition, dependence upon
    transition on address lines.


                                            29
Thank You


            30

More Related Content

What's hot

Design and Implementation of Low Power DSP Core with Programmable Truncated V...
Design and Implementation of Low Power DSP Core with Programmable Truncated V...Design and Implementation of Low Power DSP Core with Programmable Truncated V...
Design and Implementation of Low Power DSP Core with Programmable Truncated V...
ijsrd.com
 
ISCA2021 Tutorial-Methods for Characterization and Analysis of Voltage Margin...
ISCA2021 Tutorial-Methods for Characterization and Analysis of Voltage Margin...ISCA2021 Tutorial-Methods for Characterization and Analysis of Voltage Margin...
ISCA2021 Tutorial-Methods for Characterization and Analysis of Voltage Margin...
Behzad Salami
 
International Journal of Computational Engineering Research (IJCER)
International Journal of Computational Engineering Research (IJCER) International Journal of Computational Engineering Research (IJCER)
International Journal of Computational Engineering Research (IJCER)
ijceronline
 
Ijecet 06 08_003
Ijecet 06 08_003Ijecet 06 08_003
Ijecet 06 08_003
IAEME Publication
 
Low power design-ver_26_mar08
Low power design-ver_26_mar08Low power design-ver_26_mar08
Low power design-ver_26_mar08Obsidian Software
 
Vlsi Design of Low Transition Low Power Test Pattern Generator Using Fault Co...
Vlsi Design of Low Transition Low Power Test Pattern Generator Using Fault Co...Vlsi Design of Low Transition Low Power Test Pattern Generator Using Fault Co...
Vlsi Design of Low Transition Low Power Test Pattern Generator Using Fault Co...
iosrjce
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER) International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER) ijceronline
 
Low power embedded system design
Low power embedded system designLow power embedded system design
Low power embedded system design
Sri Manakula Vinayagar Engineering College
 
Low Power System on chip based design methodology
Low Power System on chip based design methodologyLow Power System on chip based design methodology
Low Power System on chip based design methodologyAakash Patel
 
Implementation of resource sharing strategy for power optimization in embedde...
Implementation of resource sharing strategy for power optimization in embedde...Implementation of resource sharing strategy for power optimization in embedde...
Implementation of resource sharing strategy for power optimization in embedde...Alexander Decker
 
Comparative Performance Analysis of Low Power Full Adder Design in Different ...
Comparative Performance Analysis of Low Power Full Adder Design in Different ...Comparative Performance Analysis of Low Power Full Adder Design in Different ...
Comparative Performance Analysis of Low Power Full Adder Design in Different ...
ijcisjournal
 
Low power electronic design
Low power electronic designLow power electronic design
Low power electronic design
Mahesh Dananjaya
 
Design of a Low Power and Area Efficient Flip Flop With Embedded Logic Module
Design of a Low Power and Area Efficient Flip Flop With Embedded Logic ModuleDesign of a Low Power and Area Efficient Flip Flop With Embedded Logic Module
Design of a Low Power and Area Efficient Flip Flop With Embedded Logic Module
iosrjce
 
HIGH PERFORMANCE ETHERNET PACKET PROCESSOR CORE FOR NEXT GENERATION NETWORKS
HIGH PERFORMANCE ETHERNET PACKET PROCESSOR CORE FOR NEXT GENERATION NETWORKSHIGH PERFORMANCE ETHERNET PACKET PROCESSOR CORE FOR NEXT GENERATION NETWORKS
HIGH PERFORMANCE ETHERNET PACKET PROCESSOR CORE FOR NEXT GENERATION NETWORKS
ijngnjournal
 
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGALOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA
VLSICS Design
 
Energy Efficient and Process Tolerant Full Adder in Technologies beyond CMOS
Energy Efficient and Process Tolerant Full Adder in Technologies beyond CMOSEnergy Efficient and Process Tolerant Full Adder in Technologies beyond CMOS
Energy Efficient and Process Tolerant Full Adder in Technologies beyond CMOS
IDES Editor
 

What's hot (16)

Design and Implementation of Low Power DSP Core with Programmable Truncated V...
Design and Implementation of Low Power DSP Core with Programmable Truncated V...Design and Implementation of Low Power DSP Core with Programmable Truncated V...
Design and Implementation of Low Power DSP Core with Programmable Truncated V...
 
ISCA2021 Tutorial-Methods for Characterization and Analysis of Voltage Margin...
ISCA2021 Tutorial-Methods for Characterization and Analysis of Voltage Margin...ISCA2021 Tutorial-Methods for Characterization and Analysis of Voltage Margin...
ISCA2021 Tutorial-Methods for Characterization and Analysis of Voltage Margin...
 
International Journal of Computational Engineering Research (IJCER)
International Journal of Computational Engineering Research (IJCER) International Journal of Computational Engineering Research (IJCER)
International Journal of Computational Engineering Research (IJCER)
 
Ijecet 06 08_003
Ijecet 06 08_003Ijecet 06 08_003
Ijecet 06 08_003
 
Low power design-ver_26_mar08
Low power design-ver_26_mar08Low power design-ver_26_mar08
Low power design-ver_26_mar08
 
Vlsi Design of Low Transition Low Power Test Pattern Generator Using Fault Co...
Vlsi Design of Low Transition Low Power Test Pattern Generator Using Fault Co...Vlsi Design of Low Transition Low Power Test Pattern Generator Using Fault Co...
Vlsi Design of Low Transition Low Power Test Pattern Generator Using Fault Co...
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER) International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
Low power embedded system design
Low power embedded system designLow power embedded system design
Low power embedded system design
 
Low Power System on chip based design methodology
Low Power System on chip based design methodologyLow Power System on chip based design methodology
Low Power System on chip based design methodology
 
Implementation of resource sharing strategy for power optimization in embedde...
Implementation of resource sharing strategy for power optimization in embedde...Implementation of resource sharing strategy for power optimization in embedde...
Implementation of resource sharing strategy for power optimization in embedde...
 
Comparative Performance Analysis of Low Power Full Adder Design in Different ...
Comparative Performance Analysis of Low Power Full Adder Design in Different ...Comparative Performance Analysis of Low Power Full Adder Design in Different ...
Comparative Performance Analysis of Low Power Full Adder Design in Different ...
 
Low power electronic design
Low power electronic designLow power electronic design
Low power electronic design
 
Design of a Low Power and Area Efficient Flip Flop With Embedded Logic Module
Design of a Low Power and Area Efficient Flip Flop With Embedded Logic ModuleDesign of a Low Power and Area Efficient Flip Flop With Embedded Logic Module
Design of a Low Power and Area Efficient Flip Flop With Embedded Logic Module
 
HIGH PERFORMANCE ETHERNET PACKET PROCESSOR CORE FOR NEXT GENERATION NETWORKS
HIGH PERFORMANCE ETHERNET PACKET PROCESSOR CORE FOR NEXT GENERATION NETWORKSHIGH PERFORMANCE ETHERNET PACKET PROCESSOR CORE FOR NEXT GENERATION NETWORKS
HIGH PERFORMANCE ETHERNET PACKET PROCESSOR CORE FOR NEXT GENERATION NETWORKS
 
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGALOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA
 
Energy Efficient and Process Tolerant Full Adder in Technologies beyond CMOS
Energy Efficient and Process Tolerant Full Adder in Technologies beyond CMOSEnergy Efficient and Process Tolerant Full Adder in Technologies beyond CMOS
Energy Efficient and Process Tolerant Full Adder in Technologies beyond CMOS
 

Viewers also liked

Payback timevalue of money and iir
Payback timevalue of money and iirPayback timevalue of money and iir
Payback timevalue of money and iir
Subhash Rohit
 
Lightning Protection
Lightning ProtectionLightning Protection
Lightning Protectionsuresh5
 
Payback period time value of money ad internal rate ofreturn
Payback period time value of money ad internal rate ofreturn Payback period time value of money ad internal rate ofreturn
Payback period time value of money ad internal rate ofreturn
Subhash Rohit
 
Payback Analysis
Payback AnalysisPayback Analysis
Payback Analysis
Eric de Diesbach
 
Pay back period ..chapter solution to problems...
Pay back period ..chapter solution to problems...Pay back period ..chapter solution to problems...
Pay back period ..chapter solution to problems...
mianmohsinmumtazshb
 
Life cycle costing
Life cycle costingLife cycle costing
Life cycle costing
Jessy Chong
 
Net Present Value A
Net Present Value ANet Present Value A
Net Present Value A
Craig Brown
 
Lightning protection 1 by ambuj mishra
Lightning protection 1 by ambuj mishraLightning protection 1 by ambuj mishra
Lightning protection 1 by ambuj mishraAmbuj Mishra
 
Net Present Value - NPV
Net Present Value - NPVNet Present Value - NPV
Net Present Value - NPV
ASAD ALI
 
Lightning protection
Lightning protectionLightning protection
Lightning protection
Vincent Wedelich, PE MBA
 
Capital Budgeting
Capital BudgetingCapital Budgeting
Capital BudgetingDayasagar S
 
life cycle costing
life cycle costing life cycle costing
life cycle costing Mohd Hisyam
 

Viewers also liked (13)

Payback timevalue of money and iir
Payback timevalue of money and iirPayback timevalue of money and iir
Payback timevalue of money and iir
 
Lightning Protection
Lightning ProtectionLightning Protection
Lightning Protection
 
Payback period time value of money ad internal rate ofreturn
Payback period time value of money ad internal rate ofreturn Payback period time value of money ad internal rate ofreturn
Payback period time value of money ad internal rate ofreturn
 
Payback Analysis
Payback AnalysisPayback Analysis
Payback Analysis
 
Pay back period ..chapter solution to problems...
Pay back period ..chapter solution to problems...Pay back period ..chapter solution to problems...
Pay back period ..chapter solution to problems...
 
Life cycle costing
Life cycle costingLife cycle costing
Life cycle costing
 
Net Present Value A
Net Present Value ANet Present Value A
Net Present Value A
 
Lightning protection 1 by ambuj mishra
Lightning protection 1 by ambuj mishraLightning protection 1 by ambuj mishra
Lightning protection 1 by ambuj mishra
 
Net Present Value - NPV
Net Present Value - NPVNet Present Value - NPV
Net Present Value - NPV
 
Lightning protection
Lightning protectionLightning protection
Lightning protection
 
Depreciation
DepreciationDepreciation
Depreciation
 
Capital Budgeting
Capital BudgetingCapital Budgeting
Capital Budgeting
 
life cycle costing
life cycle costing life cycle costing
life cycle costing
 

Similar to Instruction level power analysis

How to achieve 95%+ Accurate power measurement during architecture exploration?
How to achieve 95%+ Accurate power measurement during architecture exploration? How to achieve 95%+ Accurate power measurement during architecture exploration?
How to achieve 95%+ Accurate power measurement during architecture exploration?
Deepak Shankar
 
Run-time power management in cloud and containerized environments
Run-time power management in cloud and containerized environmentsRun-time power management in cloud and containerized environments
Run-time power management in cloud and containerized environments
NECST Lab @ Politecnico di Milano
 
Empirically Derived Abstractions in Uncore Power Modeling for a Server-Class...
Empirically Derived Abstractions in Uncore Power Modeling for a  Server-Class...Empirically Derived Abstractions in Uncore Power Modeling for a  Server-Class...
Empirically Derived Abstractions in Uncore Power Modeling for a Server-Class...
Arun Joseph
 
Low-Power Design and Verification
Low-Power Design and VerificationLow-Power Design and Verification
Low-Power Design and VerificationDVClub
 
OPAL-RT RT13 Conference: Rapid control prototyping solutions for power electr...
OPAL-RT RT13 Conference: Rapid control prototyping solutions for power electr...OPAL-RT RT13 Conference: Rapid control prototyping solutions for power electr...
OPAL-RT RT13 Conference: Rapid control prototyping solutions for power electr...
OPAL-RT TECHNOLOGIES
 
International Journal of Computational Engineering Research(IJCER)
 International Journal of Computational Engineering Research(IJCER)  International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
ijceronline
 
Per domain power analysis
Per domain power analysisPer domain power analysis
Per domain power analysis
Arun Joseph
 
Implementation of Low Power Test Pattern Generator Using LFSR
Implementation of Low Power Test Pattern Generator Using LFSRImplementation of Low Power Test Pattern Generator Using LFSR
Implementation of Low Power Test Pattern Generator Using LFSR
International Journal of Science and Research (IJSR)
 
A Hybrid Approach to Standard Cell Power Characterization based on PVT Indepe...
A Hybrid Approach to Standard Cell Power Characterization based on PVT Indepe...A Hybrid Approach to Standard Cell Power Characterization based on PVT Indepe...
A Hybrid Approach to Standard Cell Power Characterization based on PVT Indepe...
Arun Joseph
 
Implementation of Area Effective Carry Select Adders
Implementation of Area Effective Carry Select AddersImplementation of Area Effective Carry Select Adders
Implementation of Area Effective Carry Select Adders
Kumar Goud
 
Mobile computing edited
Mobile computing editedMobile computing edited
Mobile computing edited
m_hughes
 
Tossim
Tossim Tossim
Tossim
crew1274
 
WP79 Online vs Line-interactive
WP79   Online vs Line-interactiveWP79   Online vs Line-interactive
WP79 Online vs Line-interactiveSE_NAM_Training
 
Wireless Sensor Network
Wireless Sensor NetworkWireless Sensor Network
Wireless Sensor Network
Shubhi Singh chauhan
 
IEC61850 tutorial
IEC61850 tutorialIEC61850 tutorial
IEC61850 tutorial
Alidu Abubakari
 
H010613642
H010613642H010613642
H010613642
IOSR Journals
 
DEEP-mon: Dynamic and Energy Efficient Power monitoring for container-based i...
DEEP-mon: Dynamic and Energy Efficient Power monitoring for container-based i...DEEP-mon: Dynamic and Energy Efficient Power monitoring for container-based i...
DEEP-mon: Dynamic and Energy Efficient Power monitoring for container-based i...
NECST Lab @ Politecnico di Milano
 
Roy aeroVerifying Power Domains in AeroFONE
Roy aeroVerifying Power Domains in AeroFONERoy aeroVerifying Power Domains in AeroFONE
Roy aeroVerifying Power Domains in AeroFONEDVClub
 

Similar to Instruction level power analysis (20)

How to achieve 95%+ Accurate power measurement during architecture exploration?
How to achieve 95%+ Accurate power measurement during architecture exploration? How to achieve 95%+ Accurate power measurement during architecture exploration?
How to achieve 95%+ Accurate power measurement during architecture exploration?
 
Run-time power management in cloud and containerized environments
Run-time power management in cloud and containerized environmentsRun-time power management in cloud and containerized environments
Run-time power management in cloud and containerized environments
 
Empirically Derived Abstractions in Uncore Power Modeling for a Server-Class...
Empirically Derived Abstractions in Uncore Power Modeling for a  Server-Class...Empirically Derived Abstractions in Uncore Power Modeling for a  Server-Class...
Empirically Derived Abstractions in Uncore Power Modeling for a Server-Class...
 
Low-Power Design and Verification
Low-Power Design and VerificationLow-Power Design and Verification
Low-Power Design and Verification
 
OPAL-RT RT13 Conference: Rapid control prototyping solutions for power electr...
OPAL-RT RT13 Conference: Rapid control prototyping solutions for power electr...OPAL-RT RT13 Conference: Rapid control prototyping solutions for power electr...
OPAL-RT RT13 Conference: Rapid control prototyping solutions for power electr...
 
International Journal of Computational Engineering Research(IJCER)
 International Journal of Computational Engineering Research(IJCER)  International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
Per domain power analysis
Per domain power analysisPer domain power analysis
Per domain power analysis
 
Implementation of Low Power Test Pattern Generator Using LFSR
Implementation of Low Power Test Pattern Generator Using LFSRImplementation of Low Power Test Pattern Generator Using LFSR
Implementation of Low Power Test Pattern Generator Using LFSR
 
A Hybrid Approach to Standard Cell Power Characterization based on PVT Indepe...
A Hybrid Approach to Standard Cell Power Characterization based on PVT Indepe...A Hybrid Approach to Standard Cell Power Characterization based on PVT Indepe...
A Hybrid Approach to Standard Cell Power Characterization based on PVT Indepe...
 
E3 s binghamton
E3 s binghamtonE3 s binghamton
E3 s binghamton
 
Implementation of Area Effective Carry Select Adders
Implementation of Area Effective Carry Select AddersImplementation of Area Effective Carry Select Adders
Implementation of Area Effective Carry Select Adders
 
Mobile computing edited
Mobile computing editedMobile computing edited
Mobile computing edited
 
Tossim
Tossim Tossim
Tossim
 
WP79 Online vs Line-interactive
WP79   Online vs Line-interactiveWP79   Online vs Line-interactive
WP79 Online vs Line-interactive
 
Wireless Sensor Network
Wireless Sensor NetworkWireless Sensor Network
Wireless Sensor Network
 
Altera trcak g
Altera  trcak gAltera  trcak g
Altera trcak g
 
IEC61850 tutorial
IEC61850 tutorialIEC61850 tutorial
IEC61850 tutorial
 
H010613642
H010613642H010613642
H010613642
 
DEEP-mon: Dynamic and Energy Efficient Power monitoring for container-based i...
DEEP-mon: Dynamic and Energy Efficient Power monitoring for container-based i...DEEP-mon: Dynamic and Energy Efficient Power monitoring for container-based i...
DEEP-mon: Dynamic and Energy Efficient Power monitoring for container-based i...
 
Roy aeroVerifying Power Domains in AeroFONE
Roy aeroVerifying Power Domains in AeroFONERoy aeroVerifying Power Domains in AeroFONE
Roy aeroVerifying Power Domains in AeroFONE
 

Recently uploaded

Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
Fwdays
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Jeffrey Haguewood
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
CatarinaPereira64715
 

Recently uploaded (20)

Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 

Instruction level power analysis

  • 2. Layout  Introduction  Components of Power Consumption  Power Characterization  Instruction Level Power Analysis for RISC processors  Extensions for VLIW/EPIC processors  Register Files  Caches 2
  • 3. Introduction  Why power of nano-electronics became so important?  Because of Moore’s law still holds true through complex applications  Mobile systems – battery “bottleneck”  High performance computation – heat extraction  Operating cost and reliability  Data warehouse of ISP with 8000 servers needs 2 MW 3
  • 4. Introduction  Power or Energy? Aren’t they go hand-in-hand?  Power varies significantly with time!  A given battery has fixed amount of energy  Average power consumption = Energy/Execution- time  Decides average chip and junction temperature  Decides battery life (if peak current < rated current)  Peak power and current  Voltage drops, hot spots, rate of battery discharge  Power-efficient, Energy-efficient, Battery-efficient design paradigms do exist! 4
  • 5. Components of Power Consumption  System = hardware platform + software (sys. & app.)  Software impacts hardware power consumption  Static power  Sub-threshold leakage & reverse biased junction leakage  Quiescent biasing power (in case of non-CMOS circuits)  Dynamic power  Charging and discharging of capacitance (switching activity)  Short circuit power during transition (rate of change, delay)  Alternative grouping (used at component/cell level)  Switching power at the boundaries of cells  Internal cell power  Short circuit power  Switching power at internal nodes 5
  • 6. System Abstractions - Power Functional Specifications and Constraints Accuracy of power characterization Opportunities for optimization System Level Netlist Time complexity Register Transfer Level (RTL) Netlist Component/Cell Level Netlist Layout or Configuration-bits Chip 6
  • 7. Power Characterization  Measurement (Chip/Board Level)  Most accurate  Perhaps the fastest, if setup and tools exist  Too late to change hardware details  Software/Load control is still possible  Typically used for software optimizations 7
  • 8. Power Characterization (cont…)  Transistor Level (estimation)  Spice simulation of transistor level netlist  Most accurate in the simulation world  Requires complete implementation details  Unmanageable time complexity even for simpler designs  Typically used for cell/component characterization  Synopsys PowerMill (said to provide spice- like accuracy) 8
  • 9. Power Characterization (cont…)  Cell Level (estimation)  After logic synthesis  Requires RTL implementation  Simulation to capture switching activity  Requires delay simulation if glitches need to be accounted  Characterized cells – empirical formulas or table look-up  Interconnect power  Either unaccounted or  Using estimated wire load models (typically based on experience) or  Extracted layout (if done after physical synthesis)  Still unmanageable time complexity especially to use in design space exploration  Synopsys PrimePower  Netlist, interconnect capacitance, VCD traces, cell power library 9
  • 10. Power Characterization (cont…)  Register Transfer Level (estimation)  Requires conceptual RTL description (detailed micro-architecture)  Data-path is modeled as netlist of macro cells, which are characterized offline  Control path and glue logic  Either unaccounted or estimated based on I/O  Simulation to capture switching activity  Typically glitches are not considered but methods do exist  Interconnect power  Typically unaccounted but possible to estimate through floor-planning  Typically used in DSE mostly using in-house tools 10
  • 11. System Level Power Estimation  For Design Space Exploration  Least accurate but uncertainty of exploration results can be reduced if models have good fidelity  Purpose, target architecture and available system details govern the system-level estimation models  Selecting algorithm or designing hardware for given algorithm?  ASIC based or processor based?  Is ISA fixed or extensible?  Typically system-level power estimation models are macro-architecture template specific  Major constituents of power consumption  Computation, communication, storage units & peripherals 11
  • 12. Power Estimation Models  Activity Based Models  Instruction Level Energy Models 12
  • 13. Activity Based Models  Fixed Activity Model  N-Transition Model  Dual Bit Model 13
  • 14. Fixed Activity Model P = ∑ i kiGifi Where: ki = PFA proportionality constant extracted empirically from past designs Gi = Measure of hardware complexity fi = Activation frequency  Disadvantage: Do not model the influence of data activity on power consumption 14
  • 15. N-Transition Model P = Pconst + n.Pchange  Disadvantage: It does not differentiate between transitions on different inputs. 15
  • 16. Dual Bit Type Model  Drawback in previous approaches:  Less Accurate  Characterizes the module on basis of Uniform White Noise (UWN) input  Leads to high error if the input dynamic range does not fully occupy the word length 16
  • 17. Dual Bit Type Model The Approach  Combines reduced complexity of the architecture level with the accuracy of gate and circuit level  Black box model of capacitance switched in each module for various types of inputs  Easy to parameterize capacitance models to take into account size , etc. 17
  • 18. Dual Bit Type Model Modeling Complexity  Power consumed by a module is a function of its complexity as large modules contain more circuitry  Examples:  Capacitance of N-bit ripple carry subtracter: CT = Ceff * N  Not restricted to linear models, but can be used to specify even more complex models 18
  • 19. Dual Bit Type Model Capacitive Data Coefficients  Describe the average amount of capacitance switched within a module during an input transition  LSB regions suffer random transitions and hence can be characterized by a single capacitive coefficient CUU  MSB region experiences sign transitions and so is characterized by capacitive sign coefficients C+-,C++, etc. 19
  • 20. Instruction Level Power Estimation  First introduced to characterize processor power consumption to drive software optimizations  Each instruction is associated with some current  Inter instruction effects for better accuracy 20
  • 21. Instruction Level Power Estimation  E = Σ(Bi x Ni) + Σ(O(i,j) x N(I,j)) + ΣEk  Bi: Base Energy Cost  Oi.j: Inter-instruction effect Energy Cost  Ek: additional energy penalties due to resource constraints  Require cost associated with every pair of instructions: O(N2), where N = number of instructions in ISA 21
  • 22. JouleTrack  Experiments on StrongARM by Amit Sinha & A.P.Chandran  Current/instruction ~ 0.2A (averaged over all instructions)  Min-max variation of 38% of average current  Address mode and data dependent variation is smaller  But, max current variation across benchmarks is < 8% !  Concluded that first order energy model of a given processor is, E = V I(V, f) T  Second order effects can be significant for data- path dominated processors such as DSP, VLIW 22
  • 23. Instruction Level Power Estimation  Impractical for CISC processors with very large instruction set  Higher Average Instruction Energy  Low Energy Per Instruction Variance  Do not consider inter instruction effects  Cluster Similar Instructions as a single class  Exponential Storage Problem for VLIW architectures  No. of Long Instructions = N operations into a K-wide VLIW = N(2k) 23
  • 24. Modified Energy Model for VLIW  Assume Independent Energy dissipation for different Execution slots  Consider nop as the base energy  E(W) = ΣU(wn|wn-1) + mxpxS + lxqxM  U(wn|wn-1) = U(0|0) + Σv(wnk,wn-1k)  Wnk = operation issued on lane k by instruction wn  Example  Wn = [ ALU NOP NOP NOP], Wn-1 = [ LS NOP ALU NOP]  U(wn|wn-1) = U(0|0) + v(ALU|LS) + v(NOP|ALU)  Memory Requirement  O(K*N2) 24
  • 25. Modified Energy Model for VLIW  Cluster Similar Instructions based on cost  Θ = {e1, e2, …, et}  et = energy consumption of instruction t  Partition Θ into K clusters (C1, C2, …, Ck) s.t.  ΣΣ (xi,j –cj)2 = minimum  Large number of clusters  Good Accuracy  Huge no. of experiments  Small number of clusters  Small number of experiments  High Variance between clusters  Reduced Accuracy  Memory Requirement  O(C*N2) 25
  • 26. Limitations of ILPA  Does not provide any insight on the causes of power consumption within the processor core  Does not account for the power consumed in the memory system, which is often dominant  To address the second limitation, power estimation frameworks which integrate processor and memory models are built around instruction set simulators 26
  • 27. MicroArchitecture ILPA  Pipeline Aware Instruction Level Energy Model  Divide the design into smaller architectural blocks  Usually Processor’s Pipeline Stages  Fetch, Decode, RF, Execute, WB  E(wn|wn-1) = Σ As(wn|wn-1) + I(wn|wn-1)  As = Energy Consumed Per stage s when executing wn after wn-1  I(wn|wn-1) = Interstage connections energy (PipeLine Registers + Buses)  Provides better insight for power bottlenecks  Smoother Energy Behaviour than Blackbox model  Require a Pipeline Structure Aware ISS 27
  • 28. Energy Models for Register File  Assume Linear Power Behaviour for access across different ports  PRF = Pi + 1/T Σ (Er,n + Ew,n)  Er,n = Σ H(RRi,n, RRi,n-1) *Erb  Ew,n = Σ H(RWi,n, oldi,n) * Ewb 28
  • 29. Energy Model for Caches  Power consumption depends on mode of operation (read, write, idle)  Energy consumed in a given clock cycle is function of node transition between previous and current cycle.  Characterize energy as function of state transitions(read-read, read-write, etc).  For a given transition, dependence upon transition on address lines. 29
  • 30. Thank You 30