SlideShare a Scribd company logo
1 of 40
Download to read offline
NATIONAL INSTITUTE OF TECHNOLOGY HAMIRPUR
Electronics and Communication Engineering Department
Presented By
Dr. Gargi Khanna
Associate Professor
E&CED, NIT Hamirpur HP.
LOW POWER VLSI DESIGN
Architectural Level Techniques
What will we learn ?
Dynamic Voltage Scaling
Vth-Hopping Scheme
Dynamic Vth Scheme
Processer Modes
Adaptive Filtering
Switching activity reduction
Guarded Evaluation
Bus Multiplexing
Parallel and Pipelines Techniques
Dynamic Voltage Scaling
• Power Management technique in computer architecture, where the voltage used in a
component is increased or decreased, depending upon circumstances.
• Dynamic voltage scaling to increase voltage is known as overvolting.
• Dynamic voltage scaling to decrease voltage is known as undervolting.
Undervolting is done in order to conserve power in portable devices, where energy
comes from a battery and thus is limited, or in rare cases, to increase reliability.
Overvolting is done in order to increase computer performance.
Dynamic Vth Designs
• Dynamic threshold voltage scaling is a technique for active leakage
power reduction during active mode of the circuit.
• This scheme utilizes dynamic adjustment of frequency through back-
gate bias control depending on the workload of a system.
• When the workload decreases, less power is consumed by increasing
Vth. Two varieties of dynamic Vth scaling have been proposed as
described below.
Vth-Hopping Scheme
6
NIT HAMIRPUR
Dynamic Vth Scaling (DVTS) Scheme
• Clock speed scheduler ( embedded in the operating system) determines the clock frequency at run time.
• The DVTS controller adjusts the PMOS and NMOS body bias so that the oscillator frequency of the voltage-
controlled oscillator tracks the given reference clock frequency.
• The error signal, which is the difference between the reference clock frequency and the oscillator frequency,
is fed into the feedback controller.
• The continuous feedback loop also compensates for variation in temperature and
supply voltage.
7
NIT HAMIRPUR
Dynamic Voltage Scaling
8
NIT HAMIRPUR
Power and Performance Management
 Manage the performance and throughput of a system based on its computation
needs to achieve low power goals
 Do not waste power by designing hardware that has more performance than
necessary.
 The throughput adaptability of the hardware is also an issue.
 A good low power system should be able to consume less power when the
throughput requirements are low.
9
NIT HAMIRPUR
Microprocessor Sleep Modes
 Deactivate some functional units when no computation is required.
 E.g. the clock supply to a floating point unit can be disabled if not used
 This can be done at different hierarchies and at different levels of the design:
subsystems, modules, buses, functional units, state machines, etc.
 The trade-off is to justify the additional hardware and design complexity in
managing the various functional units.
 At architecture design, this is a particularly attractive technique because little
hardware and design complexity is needed to achieve substantial power
saving.
10
NIT HAMIRPUR
 In an advanced processor design, using hardware logic to decide which
functional units to be disabled based on the instruction and data stream.
 This technique is effective when the processor is running at full speed.
 In another technique, the processor is consciously instructed to
reduce its performance by the application software.
 It may also be halted at a certain state to conserve power while waiting
for other events to resume computation.
11
NIT HAMIRPUR
 The CPU has three primary power saver modes called DOZE, NAP and
SLEEP controlled by software.
 DOZE mode, most functional units of the processor are stopped except
the on-chip cache memory to maintain cache coherency.
 NAP mode, the cache is also turned off to conserve power and the
processor wakes up after a fixed amount of time or upon an external
interrupt.
 SLEEP mode, the entire processor clock may be halted and only an
external reset or interrupt can resume its operation.
 The intent is to put more functional units in idle when the processor goes
into a deeper sleep mode.
More latency to resume computation from a deeper sleep mode.
Motorola's PowerPC 603 processor design
12
NIT HAMIRPUR
 Allows the software to choose the proper power saver mode based on the
computation requirements of the processor.
 As computation requirements become less, the processor can be put into
deeper sleep modes without any perceivable loss in performance.
 eg, in a portable computer application, when an application program is
runningand the processor is waiting for a keystroke, it could be put into
the DOZE mode.
 If the processor is waiting for a disk access, it may go into the NAP
mode.
 If the processor has been idle for a long time and the power saver is
activated, the processorcan be put into the SLEEP mode.
Motorola's PowerPC 603 processor design
Motorola's PowerPC 603 processor design
 In PowerPC 603, the sleep modes require less than 10 system clock cycles to
resume operation if the system clock and phase-locked-loop (PLL) circuits are
not turned off.
 If the system clock and PLLs are turned off, the chip does not consume any
dynamic power but it may take up to 100us to wake up
 The Dynamic Power Management mode refers to the application of clock
gating at the macro module level in which unused functional units are not
clocked.
Motorola's PowerPC 603 processor design
Intel Processor
• CPU/Package sleep states
• C0 - Active: CPU is on and operating.
• C1 - Auto Halt: Core clock is off. The processor is not executing instructions, but
can return to an executing state almost instantaneously. Some processors also
support an enhanced C1 state (C1E) for lower power consumption.
• C2 - Stop Clock: Core and bus clocks are off. The processor maintains all software-
visible state, but can take longer to wake up.
• C3 - Deep Sleep: Clock generator is off. The processor does not need to keep its
cache coherent, but maintains other states. Some processors have variations of
the C3 state (Deep Sleep, Deeper Sleep) that differ by how long it takes to wake
the processor.
• C4 - Deeper Sleep: Reduced VCC
• DC4 - Deeper C4 Sleep: Further reduced VCC
Source ------https://www.intel.in/content/www/in/en/support/articles/000006619/processors/intel-
core-processors.html
Adaptive Filtering
 A filter is a basic signal processing unit in an electronic system.
 It allows certain frequency component (pass-band) of the input signal to pass
through while rejecting the other signal component (stop-band).
 For analog signals, filters can be implemented by LC resonance circuits, operational
amplifiers, or other electronic devices.
 A digital filter operates much like an analog filter except that the input and output
signals are represented by periodic digital samples of the analog signal. The inputs
and outputs are merely streams of numeric values.
 The digital filtering process is achieved by some arithmetic operations (additions,
subtractions and multiplications) of the input samples to produce the output
samples.
Digital Filter
Digital filter ::: Finite impulse response (FIR) filter and Infinite
impulse response (llR) filter
Finite Response Filter
The length of the delay chain of the filter is called the order
of the filter.
output of the filters
consists of the sum of
products of successively
delayed input x(n) and
output samples y(n).
Digital Filter
A higher order filter achieves a higher
"quality" of filtering in the sense that the
attenuation between the pass-band and
the stop-band is larger and the slope is
steeper.
This means that the unwanted signal
component is better suppressed and the
transition region from the pass-band to the
stop-band is narrower.
Infinite impulse response (llR) filter
Digital Filter
 A lower order filter requires less power to operate because the amount of
computation is less.
 Adjust the filter's order length depending on the noise characteristics
of the input signal.
• When the input signal is less noisy, the filter order is reduced to conserve power.
• When the input noise increases, the filter order is lengthened accordingly.
• The output SNR of the filter will be maintained above the minimum threshold
specified by the design thus achieving the desired output signal quality.
Adaptive Filtering
• To adjust the filter order, we need to estimate the output SNR or equivalently the noise energy of
the filter's output.
• The difference between the input energy and the output energy of the filter is a very good
estimate of the input noise energy.
• The energy difference is given by
L is the filter length
21
NIT HAMIRPUR
Adaptive Filtering
• The actual noise energy of the output can be estimated by
ESB [Order(n)] is the stop-band energy of the filter and α is a constant.
The energy ESB [Order(n)] can be obtained by taking the integration of the filter transfer
function at the stop-band and stored in a lookup table.
Q(n) represents the noise energy measured at a particular
output sample n. The filter order update algorithm is
22
NIT HAMIRPUR
 Switching activities are the biggest cause of power dissipation in most CMOS
digital systems.
 some switching activities do not contribute to the actual computation and should
be eliminated.
Such switching activities are results of undefined or unspecified behavior
of a system.
Switching Activity Reduction
 The suppression of switching activities always involves some trade-off decisions.
 hardware logic is required to suppress unwanted switching activities and the
additional logic itself consumes power.
Therefore, it is important to estimate how many switching activities can be
eliminated by a particular technique so that it can be justified.
23
NIT HAMIRPUR
reduce switching activities by adding latches or blocking gates at the inputs of a
combinational module if the outputs are not used.
Guarded Evaluation
24
NIT HAMIRPUR
 One common way to reduce hardware resources in a digital system is to share long data busses
with time multiplexing.
 At even clock cycles source SI uses the shared bus to send data to destination DI while at odd
cycles source S2 sends data to D2
Bus Multiplexing
Switching activities of a positively correlated data stream.
Parallel Architecture with Voltage Reduction
• Parallelism has been traditionally used to improve the computation
throughput of high performance digital systems.
• It has been successfully used in advanced microprocessors operating
at the frequency limited by the process technology.
• Hardware Replication
• N identical processing element, each implementing the logic function in parallel
• Lowers the operating frequency while maintaining the same system throughput.
• Parallelism essentially trades area off for a lower operating frequency or higher
throughput.
Conti
….
• Each input register is clocked at a lower frequency fclk/N
• Time allowed to compute the function of each input vector is increased by a
factor of N
• This implies the power supply can be reduced untill critical path delay equals New
clock period NTclk
Parallel Architecture
Pipelined Architecture
P1
P2
P3
2 nS
3 nS
2 nS
Throughput : 1 data / 7 nS
3 Ns
Faster
P I P E L I N I N G
OR
StillThroughput : 1 data / 7nS soWe can
reduce the supply voltage
P1
P2
P3
2.5 nS
2.5 nS
2.5 nS
Throughput : 1 data / 2.5 nS
P I P E L I N I N G
SPEEDUP = 3
P1
P2
P3
2 nS
3 nS
2 nS
Throughput : 1 data / 3 nS
P I P E L I N I N G
SPEEDUP = 2.33
P1a
2 nS
P2a
3 nS
P3a
2 nS
Throughput : 1 data / 1 nS
PIPELINED + PARALEL
P1b
2 nS
P2b
3 nS
P3b
2 nS
P1c
2 nS
P2c
3 nS
P3c
2 nS
66.67 %
66.67 %
100 %
Utilization
P1a
P2a
P3a
2 nS
3 nS
2 nS
Throughput : 1 data / 1 nS
P1b
P2b P2c
P3b
Combining Parallel and
Pipelined Structures
Flow Graph Transformation
• Computation intensive data path operations with simple control structures.
• The system architecture can be represented by a control data flow graph.
• The graph consists of control nodes and data nodes connected by directed
edges.
• Control nodes change the flow of data that pass through it.
Examples::: multiplexors, condition selectors, branch and delay elements etc.
Data nodes provide computation operators for the input data streams such
as addition, multiplication, shift, etc. The graph
• edges represent the data streams of the system.
40
NIT HAMIRPUR
Your Limitations-----
Its only your Imagination
Thank You !

More Related Content

What's hot

Fpga(field programmable gate array)
Fpga(field programmable gate array) Fpga(field programmable gate array)
Fpga(field programmable gate array)
Iffat Anjum
 

What's hot (20)

Actel fpga
Actel fpgaActel fpga
Actel fpga
 
Combinational Logic Circuit
Combinational Logic CircuitCombinational Logic Circuit
Combinational Logic Circuit
 
M ary psk and m ary qam ppt
M ary psk and m ary qam pptM ary psk and m ary qam ppt
M ary psk and m ary qam ppt
 
Flipflop
FlipflopFlipflop
Flipflop
 
Latch & Flip-Flop Design.pptx
Latch & Flip-Flop Design.pptxLatch & Flip-Flop Design.pptx
Latch & Flip-Flop Design.pptx
 
PLDs
PLDsPLDs
PLDs
 
Lecture21
Lecture21Lecture21
Lecture21
 
Matched filter
Matched filterMatched filter
Matched filter
 
Digital transmission
Digital transmissionDigital transmission
Digital transmission
 
Fpga(field programmable gate array)
Fpga(field programmable gate array) Fpga(field programmable gate array)
Fpga(field programmable gate array)
 
Dcs lec03 - z-analysis of discrete time control systems
Dcs   lec03 - z-analysis of discrete time control systemsDcs   lec03 - z-analysis of discrete time control systems
Dcs lec03 - z-analysis of discrete time control systems
 
Pwm(Pulse Width Modulation) presentation
Pwm(Pulse Width Modulation) presentationPwm(Pulse Width Modulation) presentation
Pwm(Pulse Width Modulation) presentation
 
DOMINO LOGIC CIRCUIT (VLSI)
DOMINO LOGIC CIRCUIT (VLSI)DOMINO LOGIC CIRCUIT (VLSI)
DOMINO LOGIC CIRCUIT (VLSI)
 
Vlsi power estimation
Vlsi power estimationVlsi power estimation
Vlsi power estimation
 
Pulse code modulation and Demodulation
Pulse code modulation and DemodulationPulse code modulation and Demodulation
Pulse code modulation and Demodulation
 
JTAG Interface (Intro)
JTAG Interface (Intro)JTAG Interface (Intro)
JTAG Interface (Intro)
 
Phase Shift Keying & π/4 -Quadrature Phase Shift Keying
Phase Shift Keying & π/4 -Quadrature Phase Shift KeyingPhase Shift Keying & π/4 -Quadrature Phase Shift Keying
Phase Shift Keying & π/4 -Quadrature Phase Shift Keying
 
3.3 modulation formats msk and gmsk
3.3 modulation formats   msk and gmsk3.3 modulation formats   msk and gmsk
3.3 modulation formats msk and gmsk
 
Synchronization
SynchronizationSynchronization
Synchronization
 
ASK FSK PSK Introduction
ASK FSK PSK IntroductionASK FSK PSK Introduction
ASK FSK PSK Introduction
 

Similar to Architectural Level Techniques

4 Way traffic controll new
4 Way traffic controll new4 Way traffic controll new
4 Way traffic controll new
Mainak Nandi
 
Temp based fan speed control
Temp based fan speed controlTemp based fan speed control
Temp based fan speed control
Sai Malleswar
 
Plc example presentation
Plc example presentationPlc example presentation
Plc example presentation
Roshit Kadiru
 
Track d more performance less power - freescale final
Track d   more performance less power  - freescale finalTrack d   more performance less power  - freescale final
Track d more performance less power - freescale final
chiportal
 

Similar to Architectural Level Techniques (20)

Low power
Low powerLow power
Low power
 
4 Way traffic controll new
4 Way traffic controll new4 Way traffic controll new
4 Way traffic controll new
 
Low Power Flexis™ Series of QE Family MCUs
Low Power Flexis™ Series of QE Family MCUs Low Power Flexis™ Series of QE Family MCUs
Low Power Flexis™ Series of QE Family MCUs
 
Reduce system energy by smart CPU management
Reduce system energy by smart CPU managementReduce system energy by smart CPU management
Reduce system energy by smart CPU management
 
Temp based fan speed control
Temp based fan speed controlTemp based fan speed control
Temp based fan speed control
 
Micro computer control of dc drive
Micro computer control of dc driveMicro computer control of dc drive
Micro computer control of dc drive
 
Plc example presentation
Plc example presentationPlc example presentation
Plc example presentation
 
Design & implementation of 16 bit low power ALU with clock gating
Design & implementation of 16 bit low power ALU with clock gatingDesign & implementation of 16 bit low power ALU with clock gating
Design & implementation of 16 bit low power ALU with clock gating
 
Track d more performance less power - freescale final
Track d   more performance less power  - freescale finalTrack d   more performance less power  - freescale final
Track d more performance less power - freescale final
 
Lecture on PIC-1.pptx
Lecture on PIC-1.pptxLecture on PIC-1.pptx
Lecture on PIC-1.pptx
 
Computer Architecture and Organization
Computer Architecture and OrganizationComputer Architecture and Organization
Computer Architecture and Organization
 
Design and Implementation of Submicron Level 10T Full Adder in ALU Using Cell...
Design and Implementation of Submicron Level 10T Full Adder in ALU Using Cell...Design and Implementation of Submicron Level 10T Full Adder in ALU Using Cell...
Design and Implementation of Submicron Level 10T Full Adder in ALU Using Cell...
 
Arm Processor Based Speed Control Of BLDC Motor
Arm Processor Based Speed Control Of BLDC MotorArm Processor Based Speed Control Of BLDC Motor
Arm Processor Based Speed Control Of BLDC Motor
 
Arm7 architecture
Arm7 architectureArm7 architecture
Arm7 architecture
 
Design and Analysis of Sequential Circuit for Leakage Power Reduction using S...
Design and Analysis of Sequential Circuit for Leakage Power Reduction using S...Design and Analysis of Sequential Circuit for Leakage Power Reduction using S...
Design and Analysis of Sequential Circuit for Leakage Power Reduction using S...
 
Wireless energy meter monitoring with automated tariff calculation
Wireless energy meter monitoring with automated tariff calculationWireless energy meter monitoring with automated tariff calculation
Wireless energy meter monitoring with automated tariff calculation
 
pic_1.pdf
pic_1.pdfpic_1.pdf
pic_1.pdf
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
 
Altera trcak g
Altera  trcak gAltera  trcak g
Altera trcak g
 
Automatic power factor correction
Automatic power factor correction Automatic power factor correction
Automatic power factor correction
 

More from GargiKhanna1 (10)

MOS logic family
MOS logic familyMOS logic family
MOS logic family
 
Digital Counter Design
 Digital Counter Design Digital Counter Design
Digital Counter Design
 
Multiplexers and Demultiplexers
Multiplexers and DemultiplexersMultiplexers and Demultiplexers
Multiplexers and Demultiplexers
 
Error detection and correction codes
Error detection and correction codesError detection and correction codes
Error detection and correction codes
 
Number system
Number system Number system
Number system
 
Boolean Function SOP & POS
Boolean Function SOP &  POSBoolean Function SOP &  POS
Boolean Function SOP & POS
 
Logic gate
Logic gateLogic gate
Logic gate
 
Logic Gates
Logic GatesLogic Gates
Logic Gates
 
Binary codes
Binary codesBinary codes
Binary codes
 
Binary Arithmetic
Binary ArithmeticBinary Arithmetic
Binary Arithmetic
 

Recently uploaded

Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPSSpellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
AnaAcapella
 

Recently uploaded (20)

Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
Play hard learn harder: The Serious Business of Play
Play hard learn harder:  The Serious Business of PlayPlay hard learn harder:  The Serious Business of Play
Play hard learn harder: The Serious Business of Play
 
Our Environment Class 10 Science Notes pdf
Our Environment Class 10 Science Notes pdfOur Environment Class 10 Science Notes pdf
Our Environment Class 10 Science Notes pdf
 
Simple, Complex, and Compound Sentences Exercises.pdf
Simple, Complex, and Compound Sentences Exercises.pdfSimple, Complex, and Compound Sentences Exercises.pdf
Simple, Complex, and Compound Sentences Exercises.pdf
 
How to Manage Call for Tendor in Odoo 17
How to Manage Call for Tendor in Odoo 17How to Manage Call for Tendor in Odoo 17
How to Manage Call for Tendor in Odoo 17
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Economic Importance Of Fungi In Food Additives
Economic Importance Of Fungi In Food AdditivesEconomic Importance Of Fungi In Food Additives
Economic Importance Of Fungi In Food Additives
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17
 
21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptx21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptx
 
Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPSSpellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 

Architectural Level Techniques

  • 1. NATIONAL INSTITUTE OF TECHNOLOGY HAMIRPUR Electronics and Communication Engineering Department Presented By Dr. Gargi Khanna Associate Professor E&CED, NIT Hamirpur HP. LOW POWER VLSI DESIGN Architectural Level Techniques
  • 2. What will we learn ? Dynamic Voltage Scaling Vth-Hopping Scheme Dynamic Vth Scheme Processer Modes Adaptive Filtering Switching activity reduction Guarded Evaluation Bus Multiplexing Parallel and Pipelines Techniques
  • 3. Dynamic Voltage Scaling • Power Management technique in computer architecture, where the voltage used in a component is increased or decreased, depending upon circumstances. • Dynamic voltage scaling to increase voltage is known as overvolting. • Dynamic voltage scaling to decrease voltage is known as undervolting. Undervolting is done in order to conserve power in portable devices, where energy comes from a battery and thus is limited, or in rare cases, to increase reliability. Overvolting is done in order to increase computer performance.
  • 4. Dynamic Vth Designs • Dynamic threshold voltage scaling is a technique for active leakage power reduction during active mode of the circuit. • This scheme utilizes dynamic adjustment of frequency through back- gate bias control depending on the workload of a system. • When the workload decreases, less power is consumed by increasing Vth. Two varieties of dynamic Vth scaling have been proposed as described below.
  • 6. 6 NIT HAMIRPUR Dynamic Vth Scaling (DVTS) Scheme • Clock speed scheduler ( embedded in the operating system) determines the clock frequency at run time. • The DVTS controller adjusts the PMOS and NMOS body bias so that the oscillator frequency of the voltage- controlled oscillator tracks the given reference clock frequency. • The error signal, which is the difference between the reference clock frequency and the oscillator frequency, is fed into the feedback controller. • The continuous feedback loop also compensates for variation in temperature and supply voltage.
  • 8. 8 NIT HAMIRPUR Power and Performance Management  Manage the performance and throughput of a system based on its computation needs to achieve low power goals  Do not waste power by designing hardware that has more performance than necessary.  The throughput adaptability of the hardware is also an issue.  A good low power system should be able to consume less power when the throughput requirements are low.
  • 9. 9 NIT HAMIRPUR Microprocessor Sleep Modes  Deactivate some functional units when no computation is required.  E.g. the clock supply to a floating point unit can be disabled if not used  This can be done at different hierarchies and at different levels of the design: subsystems, modules, buses, functional units, state machines, etc.  The trade-off is to justify the additional hardware and design complexity in managing the various functional units.  At architecture design, this is a particularly attractive technique because little hardware and design complexity is needed to achieve substantial power saving.
  • 10. 10 NIT HAMIRPUR  In an advanced processor design, using hardware logic to decide which functional units to be disabled based on the instruction and data stream.  This technique is effective when the processor is running at full speed.  In another technique, the processor is consciously instructed to reduce its performance by the application software.  It may also be halted at a certain state to conserve power while waiting for other events to resume computation.
  • 11. 11 NIT HAMIRPUR  The CPU has three primary power saver modes called DOZE, NAP and SLEEP controlled by software.  DOZE mode, most functional units of the processor are stopped except the on-chip cache memory to maintain cache coherency.  NAP mode, the cache is also turned off to conserve power and the processor wakes up after a fixed amount of time or upon an external interrupt.  SLEEP mode, the entire processor clock may be halted and only an external reset or interrupt can resume its operation.  The intent is to put more functional units in idle when the processor goes into a deeper sleep mode. More latency to resume computation from a deeper sleep mode. Motorola's PowerPC 603 processor design
  • 12. 12 NIT HAMIRPUR  Allows the software to choose the proper power saver mode based on the computation requirements of the processor.  As computation requirements become less, the processor can be put into deeper sleep modes without any perceivable loss in performance.  eg, in a portable computer application, when an application program is runningand the processor is waiting for a keystroke, it could be put into the DOZE mode.  If the processor is waiting for a disk access, it may go into the NAP mode.  If the processor has been idle for a long time and the power saver is activated, the processorcan be put into the SLEEP mode. Motorola's PowerPC 603 processor design
  • 13. Motorola's PowerPC 603 processor design  In PowerPC 603, the sleep modes require less than 10 system clock cycles to resume operation if the system clock and phase-locked-loop (PLL) circuits are not turned off.  If the system clock and PLLs are turned off, the chip does not consume any dynamic power but it may take up to 100us to wake up  The Dynamic Power Management mode refers to the application of clock gating at the macro module level in which unused functional units are not clocked.
  • 14. Motorola's PowerPC 603 processor design
  • 15. Intel Processor • CPU/Package sleep states • C0 - Active: CPU is on and operating. • C1 - Auto Halt: Core clock is off. The processor is not executing instructions, but can return to an executing state almost instantaneously. Some processors also support an enhanced C1 state (C1E) for lower power consumption. • C2 - Stop Clock: Core and bus clocks are off. The processor maintains all software- visible state, but can take longer to wake up. • C3 - Deep Sleep: Clock generator is off. The processor does not need to keep its cache coherent, but maintains other states. Some processors have variations of the C3 state (Deep Sleep, Deeper Sleep) that differ by how long it takes to wake the processor. • C4 - Deeper Sleep: Reduced VCC • DC4 - Deeper C4 Sleep: Further reduced VCC Source ------https://www.intel.in/content/www/in/en/support/articles/000006619/processors/intel- core-processors.html
  • 16. Adaptive Filtering  A filter is a basic signal processing unit in an electronic system.  It allows certain frequency component (pass-band) of the input signal to pass through while rejecting the other signal component (stop-band).  For analog signals, filters can be implemented by LC resonance circuits, operational amplifiers, or other electronic devices.  A digital filter operates much like an analog filter except that the input and output signals are represented by periodic digital samples of the analog signal. The inputs and outputs are merely streams of numeric values.  The digital filtering process is achieved by some arithmetic operations (additions, subtractions and multiplications) of the input samples to produce the output samples.
  • 17. Digital Filter Digital filter ::: Finite impulse response (FIR) filter and Infinite impulse response (llR) filter Finite Response Filter The length of the delay chain of the filter is called the order of the filter. output of the filters consists of the sum of products of successively delayed input x(n) and output samples y(n).
  • 18. Digital Filter A higher order filter achieves a higher "quality" of filtering in the sense that the attenuation between the pass-band and the stop-band is larger and the slope is steeper. This means that the unwanted signal component is better suppressed and the transition region from the pass-band to the stop-band is narrower. Infinite impulse response (llR) filter
  • 19. Digital Filter  A lower order filter requires less power to operate because the amount of computation is less.  Adjust the filter's order length depending on the noise characteristics of the input signal. • When the input signal is less noisy, the filter order is reduced to conserve power. • When the input noise increases, the filter order is lengthened accordingly. • The output SNR of the filter will be maintained above the minimum threshold specified by the design thus achieving the desired output signal quality.
  • 20. Adaptive Filtering • To adjust the filter order, we need to estimate the output SNR or equivalently the noise energy of the filter's output. • The difference between the input energy and the output energy of the filter is a very good estimate of the input noise energy. • The energy difference is given by L is the filter length
  • 21. 21 NIT HAMIRPUR Adaptive Filtering • The actual noise energy of the output can be estimated by ESB [Order(n)] is the stop-band energy of the filter and α is a constant. The energy ESB [Order(n)] can be obtained by taking the integration of the filter transfer function at the stop-band and stored in a lookup table. Q(n) represents the noise energy measured at a particular output sample n. The filter order update algorithm is
  • 22. 22 NIT HAMIRPUR  Switching activities are the biggest cause of power dissipation in most CMOS digital systems.  some switching activities do not contribute to the actual computation and should be eliminated. Such switching activities are results of undefined or unspecified behavior of a system. Switching Activity Reduction  The suppression of switching activities always involves some trade-off decisions.  hardware logic is required to suppress unwanted switching activities and the additional logic itself consumes power. Therefore, it is important to estimate how many switching activities can be eliminated by a particular technique so that it can be justified.
  • 23. 23 NIT HAMIRPUR reduce switching activities by adding latches or blocking gates at the inputs of a combinational module if the outputs are not used. Guarded Evaluation
  • 24. 24 NIT HAMIRPUR  One common way to reduce hardware resources in a digital system is to share long data busses with time multiplexing.  At even clock cycles source SI uses the shared bus to send data to destination DI while at odd cycles source S2 sends data to D2 Bus Multiplexing
  • 25. Switching activities of a positively correlated data stream.
  • 26. Parallel Architecture with Voltage Reduction • Parallelism has been traditionally used to improve the computation throughput of high performance digital systems. • It has been successfully used in advanced microprocessors operating at the frequency limited by the process technology. • Hardware Replication
  • 27. • N identical processing element, each implementing the logic function in parallel • Lowers the operating frequency while maintaining the same system throughput. • Parallelism essentially trades area off for a lower operating frequency or higher throughput.
  • 28. Conti …. • Each input register is clocked at a lower frequency fclk/N • Time allowed to compute the function of each input vector is increased by a factor of N • This implies the power supply can be reduced untill critical path delay equals New clock period NTclk
  • 31.
  • 32. P1 P2 P3 2 nS 3 nS 2 nS Throughput : 1 data / 7 nS 3 Ns Faster P I P E L I N I N G OR StillThroughput : 1 data / 7nS soWe can reduce the supply voltage
  • 33.
  • 34. P1 P2 P3 2.5 nS 2.5 nS 2.5 nS Throughput : 1 data / 2.5 nS P I P E L I N I N G SPEEDUP = 3
  • 35. P1 P2 P3 2 nS 3 nS 2 nS Throughput : 1 data / 3 nS P I P E L I N I N G SPEEDUP = 2.33
  • 36.
  • 37. P1a 2 nS P2a 3 nS P3a 2 nS Throughput : 1 data / 1 nS PIPELINED + PARALEL P1b 2 nS P2b 3 nS P3b 2 nS P1c 2 nS P2c 3 nS P3c 2 nS 66.67 % 66.67 % 100 % Utilization
  • 38. P1a P2a P3a 2 nS 3 nS 2 nS Throughput : 1 data / 1 nS P1b P2b P2c P3b Combining Parallel and Pipelined Structures
  • 39. Flow Graph Transformation • Computation intensive data path operations with simple control structures. • The system architecture can be represented by a control data flow graph. • The graph consists of control nodes and data nodes connected by directed edges. • Control nodes change the flow of data that pass through it. Examples::: multiplexors, condition selectors, branch and delay elements etc. Data nodes provide computation operators for the input data streams such as addition, multiplication, shift, etc. The graph • edges represent the data streams of the system.
  • 40. 40 NIT HAMIRPUR Your Limitations----- Its only your Imagination Thank You !