SlideShare a Scribd company logo
CSE539: Advanced Computer Architecture

Chapter 6

Pipelining and Superscalar
Techniques
Book: “Advanced Computer Architecture – Parallelism, Scalability, Programmability”, Hwang & Jotwani

Sumit Mittu
Assistant Professor, CSE/IT
Lovely Professional University
sumit.12735@lpu.co.in
In this chapter…
•
•
•
•
•

Linear Pipeline Processors
Non-linear Pipeline Processors
Instruction Pipeline Design
Arithmetic Pipeline Design
Superscalar Pipeline Design

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

2
LINEAR PIPELINE PROCESSORS
• Linear Pipeline Processor
o (Definition)

• Models of Linear Pipeline
o Synchronous Model
o Asynchronous Model
o (Corresponding reservation tables)

• Clocking and Timing Control
o
o
o
o
o

Clock Cycle
Pipeline Frequency
Clock skewing
Flow-through delay
Speedup, Efficiency and Throughput

• Optimal number of Stages and Performance-Cost Ratio (PCR)
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

3
LINEAR PIPELINE PROCESSORS

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

4
LINEAR PIPELINE PROCESSORS

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

5
NON-LINEAR PIPELINE PROCESSORS
• Dynamic Pipeline
o Static v/s Dynamic Pipeline
o Streamline connection, feed-forward connection and feedback connection

• Reservation and Latency Analysis
o Reservation tables
o Evaluation time

• Latency Analysis
o
o
o
o

Latency
Collision
Forbidden latencies
Latency Sequence, Latency Cycle and Average Latency

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

6
NON-LINEAR PIPELINE PROCESSORS

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

7
NON-LINEAR PIPELINE PROCESSORS

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

8
NON-LINEAR PIPELINE PROCESSORS

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

9
INSTRUCTION PIPELINE DESIGN
• Instruction Execution Phases
o E.g. Fetch, Decode, Issue, Execute, Write-back
o In-order Instruction issuing and Reordered Instruction issuing
• E.g. X = Y + Z , A = B x C

• Mechanisms/Design Issues for Instruction Pipelining
o
o
o
o

Pre-fetch Buffers
Multiple Functional Units
Internal Data Forwarding
Hazard Avoidance

• Dynamic Scheduling
• Branch Handling Techniques
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

10
INSTRUCTION PIPELINE DESIGN

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

11
INSTRUCTION PIPELINE DESIGN

•
•
•
•
•

Fetch: fetches instructions from memory; ideally one per cycle
Decode: reveals instruction operations to be performed and identifies the resources needed
Issue: reserves the resources and reads the operands from registers
Execute: actual processing of operations as indicated by instruction
Write Back: writing results into the registers
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

12
INSTRUCTION PIPELINE DESIGN

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

13
INSTRUCTION PIPELINE DESIGN

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

14
INSTRUCTION PIPELINE DESIGN
Mechanisms/Design Issues of Instruction Pipeline
• Pre-fetch Buffers
o Sequential Buffers
o Target Buffers
o Loop Buffers

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

15
INSTRUCTION PIPELINE DESIGN
Mechanisms/Design Issues of Instruction Pipeline
• Multiple Functional Units
o Reservation Station and Tags
o Slow-station as Bottleneck stage
• Subdivision of Pipeline Bottleneck stage
• Replication of Pipeline Bottleneck stage
• (Example to be discussed)

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

16
INSTRUCTION PIPELINE DESIGN
Mechanisms/Design Issues of Instruction Pipeline

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

17
INSTRUCTION PIPELINE DESIGN
Mechanisms/Design Issues of Instruction Pipeline
• Internal Forwarding and Register Tagging
o Internal Forwarding:
• A “short-circuit” technique to replace unnecessary memory accesses by register-register
transfers in a sequence of fetch-arithmetic-store operations
o Register Tagging:
• Use of tagged registers , buffers and reservation stations, for exploiting concurrent activities
among multiple arithmetic units
o Store-Fetch Forwarding
• (M  R1, R2  M) replaced by (M  R1, R2  R1)
o Fetch-Fetch Forwarding
• (R1  M, R2  M) replaced by (R1  M, R2  R1)
o Store-Store Overwriting
• (M  R1, M  R2) replaced by (M  R2)

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

18
INSTRUCTION PIPELINE DESIGN
Mechanisms/Design Issues of Instruction Pipeline
• Internal Forwarding and Register Tagging

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

19
INSTRUCTION PIPELINE DESIGN
Mechanisms/Design Issues of Instruction Pipeline
• Internal Forwarding and Register Tagging

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

20
INSTRUCTION PIPELINE DESIGN
Mechanisms/Design Issues of Instruction Pipeline
• Hazard Detection and Avoidance
o
o
o
o

Domain or Input Set of an instruction
Range or Output Set of an instruction
Data Hazards: RAW, WAR and WAW
Resolution using Register Renaming approach

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

21
INSTRUCTION PIPELINE DESIGN
Dynamic Instruction Scheduling
• Idea of Static Scheduling
o Compiler based scheduling strategy to resolve Interlocking among instructions

• Dynamic Scheduling
o Tomasulo’s Algorithm (Register-Tagging Scheme)
• Hardware based dependence-resolution
o Scoreboarding Technique
• Scoreboard: the centralized control unit
• A kind of data-driven mechanism

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

22
INSTRUCTION PIPELINE DESIGN
Dynamic Instruction Scheduling

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

23
INSTRUCTION PIPELINE DESIGN
Dynamic Instruction Scheduling

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

24
INSTRUCTION PIPELINE DESIGN
Branch Handling Techniques
• Branch Taken, Branch Target, Delay Slot
• Effect of Branching
o Parameters:
• k:
No. of stages in the pipeline
• n:
Total no. of instructions or tasks
• p:
Percentage of Brach instructions over n
• q:
Percentage of successful branch instructions (branch taken) over p.
• b:
Delay Slot
• τ:
Pipeline Cycle Time
o Branch Penalty = q of (p of n) * bτ = pqnbτ
o Effective Execution Time:
• Teff = [k + (n-1)] τ + pqnbτ = [k + (n-1) + pqnb]τ

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

25
INSTRUCTION PIPELINE DESIGN
Branch Handling Techniques
• Effect of Branching
o Effective Throughput:
• Heff = n/Teff
• Heff = n / {[k + (n-1) + pqnb]τ} = nf / [k + (n-1) + pqnb]
• As nInfinity and b = k-1
o H*eff = f / [pq(k-1)+1]
• If p=0 and q=0 (no branching occurs)
o H**eff = f = 1/τ
o Performance Degradation Factor
• D = 1 – H*eff / f = pq(k-1) / [pq(k-1)+1]

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

26
INSTRUCTION PIPELINE DESIGN
Branch Handling Techniques

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

27
INSTRUCTION PIPELINE DESIGN
Branch Handling Techniques
• Branch Prediction
o Static Branch Prediction: based on branch code types
o Dynamic Branch prediction: based on recent branch history
• Strategy 1: Predict the branch direction based on information found at decode stage.
• Strategy 2: Use a cache to store target addresses at effective address calculation stage.
• Strategy 3: Use a cache to store target instructions at fetch stage
o Brach Target Buffer Organization

• Delayed Branches
o A delayed branch of d cycles allows at most d-1 useful instructions to be executed following the
branch taken.
o Execution of these instructions should be independent of branch instruction to achieve a zero
branch penalty
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

28
INSTRUCTION PIPELINE DESIGN
Branch Handling Techniques

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

29
INSTRUCTION PIPELINE DESIGN
Branch Handling Techniques

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

30
ARITHMETIC PIPELINE DESIGN
Computer Arithmetic Operations
• Finite-precision arithmetic
• Overflow and Underflow
• Fixed-Point operations
o Notations:
• Signed-magnitude, one’s complement and two-complement notation
o Operations:
• Addition:
(n bit, n bit)  (n bit) Sum, 1 bit output carry
• Subtraction:
(n bit, n bit)  (n bit) difference
• Multiplication:
(n bit, n bit)  (2n bit) product
• Division:
(2n bit, n bit)  (n bit) quotient, (n bit) remainder

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

31
ARITHMETIC PIPELINE DESIGN
Computer Arithmetic Operations
• Floating-Point Numbers
o X = (m, e) representation
• m: mantissa or fraction
• e: exponent with an implied base or radix r.
• Actual Value X = m * r e
o Operations on numbers X = (mx, ex) and Y = (my, ey)
• Addition:
(mx * rex-ey + my, ey)
• Subtraction:
(mx * rex-ey – my, ey)
• Multiplication:
(mx * my, ex+ey)
• Division:
(mx / my, ex – ey)

• Elementary Functions
o Transcendental functions like: Trigonometric, Exponential, Logarithmic, etc.

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

32
ARITHMETIC PIPELINE DESIGN
Static Arithmetic Pipelines
•
•
•
•

Separate units for fixed point operations and floating point operations
Scalar and Vector Arithmetic Pipelines
Uni-functional or Static Pipelines
Arithmetic Pipeline Stages
o Majorly involve hardware to perform: Add and Shift micro-operations
o Addition using: Carry Propagation Adder (CPA) and Carry Save Adder (CSA)
o Shift using: Shift Registers

• Multiplication Pipeline Design
o E.g. To multiply two 8-bit numbers that yield a 16-bit product using CSA and CPA Wallace Tree.

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

33
ARITHMETIC PIPELINE DESIGN
Static Arithmetic Pipelines

0

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

34
ARITHMETIC PIPELINE DESIGN
Static Arithmetic Pipelines

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

35
ARITHMETIC PIPELINE DESIGN
Multifunctional Arithmetic Pipelines
• Multifunctional Pipeline:
o Static multifunctional pipeline
o Dynamic multifunctional pipeline

• Case Study: T1/ASC static multifunctional pipeline architecture

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

36
ARITHMETIC PIPELINE DESIGN
Multifunctional Arithmetic Pipelines

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

37
ARITHMETIC PIPELINE DESIGN
Multifunctional Arithmetic Pipelines

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

38
SUPERSCALAR PIPELINE DESIGN
• Pipeline Design Parameters
o Pipeline cycle, Base cycle, Instruction issue rate, Instruction issue Latency, Simple Operation Latency
o ILP to fully utilize the pipeline

•
•
•
•

Superscalar Pipeline Structure
Data and Resource Dependencies
Pipeline Stalling
Superscalar Pipeline Scheduling
o In-order Issue and in-order completion
o In-order Issue and out-of-order completion
o Out-of-order Issue and out-of-order completion

• Superscalar Performance
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

39
SUPERSCALAR PIPELINE DESIGN
Parameter

Base Scalar Processor

Super Scalar Processor
(degree = K)

Pipeline Cycle

1 (base cycle)

K

Instruction Issue Rate

1

K

Instruction Issue Latency

1

1

Simple Operation Latency

1

1

ILP to fully utilize pipeline

1

K

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

40
SUPERSCALAR PIPELINE DESIGN

4,

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

41
SUPERSCALAR PIPELINE DESIGN

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

42
SUPERSCALAR PIPELINE DESIGN

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

43
SUPERSCALAR PIPELINE DESIGN

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

44
SUPERSCALAR PIPELINE DESIGN

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

45
SUPERSCALAR PIPELINE DESIGN

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

46
SUPERSCALAR PIPELINE DESIGN

4,

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

47
SUPERSCALAR PIPELINE DESIGN

Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

48
SUPERSCALAR PIPELINE DESIGN
• Time required by base scalar machine:
o T(1,1) = K + N – 1

• The ideal execution time required by m-issue superscalar machine:
o T(m,1) = K + (N – m)/m
o Where,
• K is the time required to execute first m instructions through m pipelines of k-stages
simultaneously
• Second term corresponds to time required to execute remaining N-m instructions , m per
cycle through m pipelines

• The ideal speedup of superscalar machine
o S(m,1) = T(1,1)/T(m,1) = m(N + k – 1)/[N+ m(k – 1)]

• As n  infinity, S(m,1) m
Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University

49

More Related Content

What's hot

11 instruction sets addressing modes
11  instruction sets addressing modes 11  instruction sets addressing modes
11 instruction sets addressing modes
Sher Shah Merkhel
 
Pipelining powerpoint presentation
Pipelining powerpoint presentationPipelining powerpoint presentation
Pipelining powerpoint presentation
bhavanadonthi
 
Assembly Language
Assembly LanguageAssembly Language
Assembly Language
Ibrahimcommunication Al Ani
 
Single instruction multiple data
Single instruction multiple dataSingle instruction multiple data
Single instruction multiple data
Syed Zaid Irshad
 
Spline representations
Spline representationsSpline representations
Spline representations
Nikhil krishnan
 
Dichotomy of parallel computing platforms
Dichotomy of parallel computing platformsDichotomy of parallel computing platforms
Dichotomy of parallel computing platforms
Syed Zaid Irshad
 
IO buffering
IO bufferingIO buffering
IO buffering
Gunjan Mandaliya
 
Linkers And Loaders
Linkers And LoadersLinkers And Loaders
Linkers And Loaders
Satpal Parmar
 
2.8 normal forms gnf & problems
2.8 normal forms   gnf & problems2.8 normal forms   gnf & problems
2.8 normal forms gnf & problems
Sampath Kumar S
 
Instruction Set Architecture
Instruction Set ArchitectureInstruction Set Architecture
Instruction Set Architecture
Dilum Bandara
 
mano.ppt
mano.pptmano.ppt
mano.ppt
prathamgunj
 
Code generation
Code generationCode generation
Code generation
Aparna Nayak
 
Bus and memory transfer
Bus and memory transferBus and memory transfer
Bus and memory transfer
Anil Pokhrel
 
Interleaved memory
Interleaved memoryInterleaved memory
Interleaved memory
ashishgy
 
Graphic hardware and software
Graphic hardware and softwareGraphic hardware and software
Graphic hardware and software
rafhat
 
Piplining
PipliningPiplining
Piplining
Sonu Mamman
 
Memory Addressing
Memory AddressingMemory Addressing
Memory Addressing
chauhankapil
 
Superscalar & superpipeline processor
Superscalar & superpipeline processorSuperscalar & superpipeline processor
Superscalar & superpipeline processor
Muhammad Ishaq
 
Computer Organization and Design
Computer Organization and DesignComputer Organization and Design
Computer Organization and Design
Ra'Fat Al-Msie'deen
 
Line Drawing Algorithms - Computer Graphics - Notes
Line Drawing Algorithms - Computer Graphics - NotesLine Drawing Algorithms - Computer Graphics - Notes
Line Drawing Algorithms - Computer Graphics - Notes
Omprakash Chauhan
 

What's hot (20)

11 instruction sets addressing modes
11  instruction sets addressing modes 11  instruction sets addressing modes
11 instruction sets addressing modes
 
Pipelining powerpoint presentation
Pipelining powerpoint presentationPipelining powerpoint presentation
Pipelining powerpoint presentation
 
Assembly Language
Assembly LanguageAssembly Language
Assembly Language
 
Single instruction multiple data
Single instruction multiple dataSingle instruction multiple data
Single instruction multiple data
 
Spline representations
Spline representationsSpline representations
Spline representations
 
Dichotomy of parallel computing platforms
Dichotomy of parallel computing platformsDichotomy of parallel computing platforms
Dichotomy of parallel computing platforms
 
IO buffering
IO bufferingIO buffering
IO buffering
 
Linkers And Loaders
Linkers And LoadersLinkers And Loaders
Linkers And Loaders
 
2.8 normal forms gnf & problems
2.8 normal forms   gnf & problems2.8 normal forms   gnf & problems
2.8 normal forms gnf & problems
 
Instruction Set Architecture
Instruction Set ArchitectureInstruction Set Architecture
Instruction Set Architecture
 
mano.ppt
mano.pptmano.ppt
mano.ppt
 
Code generation
Code generationCode generation
Code generation
 
Bus and memory transfer
Bus and memory transferBus and memory transfer
Bus and memory transfer
 
Interleaved memory
Interleaved memoryInterleaved memory
Interleaved memory
 
Graphic hardware and software
Graphic hardware and softwareGraphic hardware and software
Graphic hardware and software
 
Piplining
PipliningPiplining
Piplining
 
Memory Addressing
Memory AddressingMemory Addressing
Memory Addressing
 
Superscalar & superpipeline processor
Superscalar & superpipeline processorSuperscalar & superpipeline processor
Superscalar & superpipeline processor
 
Computer Organization and Design
Computer Organization and DesignComputer Organization and Design
Computer Organization and Design
 
Line Drawing Algorithms - Computer Graphics - Notes
Line Drawing Algorithms - Computer Graphics - NotesLine Drawing Algorithms - Computer Graphics - Notes
Line Drawing Algorithms - Computer Graphics - Notes
 

Viewers also liked

Pipeline
PipelinePipeline
Pipeline
krish198825
 
Advanced computer architecture
Advanced computer architectureAdvanced computer architecture
Advanced computer architecture
Md. Mahedi Mahfuj
 
pipelining
pipeliningpipelining
pipelining
sudhir saurav
 
Instruction pipeline: Computer Architecture
Instruction pipeline: Computer ArchitectureInstruction pipeline: Computer Architecture
Instruction pipeline: Computer Architecture
InteX Research Lab
 
Instruction pipelining
Instruction pipeliningInstruction pipelining
Instruction pipelining
Tech_MX
 
A fast graphic api for non-linear machine learning
A fast graphic api for non-linear machine learningA fast graphic api for non-linear machine learning
A fast graphic api for non-linear machine learning
Peng Cheng
 
Arithmatic sequence Slide
Arithmatic sequence SlideArithmatic sequence Slide
Arithmatic sequence Slide
puteriaprilianti
 
Improving Test Team Throughput via Architecture by Dustin Williams
Improving Test Team Throughput via Architecture by Dustin WilliamsImproving Test Team Throughput via Architecture by Dustin Williams
Improving Test Team Throughput via Architecture by Dustin Williams
QA or the Highway
 
Lec Jan29 2009
Lec Jan29 2009Lec Jan29 2009
Lec Jan29 2009
Ravi Soni
 
Reyes and Shader Pipeline
Reyes and Shader PipelineReyes and Shader Pipeline
Reyes and Shader Pipeline
Shuen-Huei Guan
 
L4 speeding-up-execution
L4 speeding-up-executionL4 speeding-up-execution
L4 speeding-up-execution
rsamurti
 
High Throughput, High Content Screening - Automating the Pipeline
High Throughput, High Content Screening - Automating the PipelineHigh Throughput, High Content Screening - Automating the Pipeline
High Throughput, High Content Screening - Automating the Pipeline
Rajarshi Guha
 
Aca2 08 new
Aca2 08 newAca2 08 new
Aca2 08 new
Sumit Mittu
 
Pipelining Coputing
Pipelining CoputingPipelining Coputing
Pipelining Coputing
m.assayony
 
Tomasulo Algorithm
Tomasulo AlgorithmTomasulo Algorithm
Tomasulo Algorithm
Farwa Ansari
 
3 Pipelining
3 Pipelining3 Pipelining
3 Pipelining
fika sweety
 
Instruction pipelining (ii)
Instruction pipelining (ii)Instruction pipelining (ii)
Instruction pipelining (ii)
Sanjivani Sontakke
 
Chapter 04 the processor
Chapter 04   the processorChapter 04   the processor
Chapter 04 the processor
Bảo Hoang
 
Instruction pipelining
Instruction pipeliningInstruction pipelining
Instruction pipelining
Shoaib Commando
 
Pipelinig hazardous
Pipelinig hazardousPipelinig hazardous
Pipelinig hazardous
jasscheema
 

Viewers also liked (20)

Pipeline
PipelinePipeline
Pipeline
 
Advanced computer architecture
Advanced computer architectureAdvanced computer architecture
Advanced computer architecture
 
pipelining
pipeliningpipelining
pipelining
 
Instruction pipeline: Computer Architecture
Instruction pipeline: Computer ArchitectureInstruction pipeline: Computer Architecture
Instruction pipeline: Computer Architecture
 
Instruction pipelining
Instruction pipeliningInstruction pipelining
Instruction pipelining
 
A fast graphic api for non-linear machine learning
A fast graphic api for non-linear machine learningA fast graphic api for non-linear machine learning
A fast graphic api for non-linear machine learning
 
Arithmatic sequence Slide
Arithmatic sequence SlideArithmatic sequence Slide
Arithmatic sequence Slide
 
Improving Test Team Throughput via Architecture by Dustin Williams
Improving Test Team Throughput via Architecture by Dustin WilliamsImproving Test Team Throughput via Architecture by Dustin Williams
Improving Test Team Throughput via Architecture by Dustin Williams
 
Lec Jan29 2009
Lec Jan29 2009Lec Jan29 2009
Lec Jan29 2009
 
Reyes and Shader Pipeline
Reyes and Shader PipelineReyes and Shader Pipeline
Reyes and Shader Pipeline
 
L4 speeding-up-execution
L4 speeding-up-executionL4 speeding-up-execution
L4 speeding-up-execution
 
High Throughput, High Content Screening - Automating the Pipeline
High Throughput, High Content Screening - Automating the PipelineHigh Throughput, High Content Screening - Automating the Pipeline
High Throughput, High Content Screening - Automating the Pipeline
 
Aca2 08 new
Aca2 08 newAca2 08 new
Aca2 08 new
 
Pipelining Coputing
Pipelining CoputingPipelining Coputing
Pipelining Coputing
 
Tomasulo Algorithm
Tomasulo AlgorithmTomasulo Algorithm
Tomasulo Algorithm
 
3 Pipelining
3 Pipelining3 Pipelining
3 Pipelining
 
Instruction pipelining (ii)
Instruction pipelining (ii)Instruction pipelining (ii)
Instruction pipelining (ii)
 
Chapter 04 the processor
Chapter 04   the processorChapter 04   the processor
Chapter 04 the processor
 
Instruction pipelining
Instruction pipeliningInstruction pipelining
Instruction pipelining
 
Pipelinig hazardous
Pipelinig hazardousPipelinig hazardous
Pipelinig hazardous
 

Similar to Aca2 06 new

Aca2 01 new
Aca2 01 newAca2 01 new
Aca2 01 new
Sumit Mittu
 
6 data envelopment_analysis
6 data envelopment_analysis6 data envelopment_analysis
6 data envelopment_analysis
FEG
 
.pptx
.pptx.pptx
PPCE 1324.pptx process planning and cost estimation
PPCE 1324.pptx process planning and cost estimationPPCE 1324.pptx process planning and cost estimation
PPCE 1324.pptx process planning and cost estimation
AakashR49
 
Project Management Fundamental
Project Management FundamentalProject Management Fundamental
Project Management Fundamental
Andy Pham, PMP
 
Cpmprt
CpmprtCpmprt
Lec1 Intro to Computer Engineering by Hsien-Hsin Sean Lee Georgia Tech -- Intro
Lec1 Intro to Computer Engineering by Hsien-Hsin Sean Lee Georgia Tech -- IntroLec1 Intro to Computer Engineering by Hsien-Hsin Sean Lee Georgia Tech -- Intro
Lec1 Intro to Computer Engineering by Hsien-Hsin Sean Lee Georgia Tech -- Intro
Hsien-Hsin Sean Lee, Ph.D.
 
01-Introduction_to_Optimization-v2021.2-Sept23-2021.pptx
01-Introduction_to_Optimization-v2021.2-Sept23-2021.pptx01-Introduction_to_Optimization-v2021.2-Sept23-2021.pptx
01-Introduction_to_Optimization-v2021.2-Sept23-2021.pptx
Tran273185
 
NEW-Workshop Discovery Thammasat 2022.pptx
NEW-Workshop Discovery Thammasat 2022.pptxNEW-Workshop Discovery Thammasat 2022.pptx
NEW-Workshop Discovery Thammasat 2022.pptx
SnunkhaemEcharoj
 
Week 8: Programming for Data Analysis
Week 8: Programming for Data AnalysisWeek 8: Programming for Data Analysis
Week 8: Programming for Data Analysis
Ferdin Joe John Joseph PhD
 
Reducing computational complexity of Mathematical functions using FPGA
Reducing computational complexity of Mathematical functions using FPGAReducing computational complexity of Mathematical functions using FPGA
Reducing computational complexity of Mathematical functions using FPGA
nehagaur339
 
Computer aided process design and simulation (Cheg.pptx
Computer aided process design and simulation (Cheg.pptxComputer aided process design and simulation (Cheg.pptx
Computer aided process design and simulation (Cheg.pptx
PaulosMekuria
 
Lec3 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech -- Performance
Lec3 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech -- PerformanceLec3 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech -- Performance
Lec3 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech -- Performance
Hsien-Hsin Sean Lee, Ph.D.
 
Relational algebra
Relational algebraRelational algebra
Relational algebra
Dr. C.V. Suresh Babu
 
CIM UNIT 1.ppt
CIM UNIT 1.pptCIM UNIT 1.ppt
CIM UNIT 1.ppt
Srirajarajan Paramasivam
 
lec01.pdf
lec01.pdflec01.pdf
lec01.pdf
BeiYu6
 
Chapter 10_Project Scheduling Using Critical Path Method (CPM).pdf
Chapter 10_Project Scheduling Using Critical Path Method (CPM).pdfChapter 10_Project Scheduling Using Critical Path Method (CPM).pdf
Chapter 10_Project Scheduling Using Critical Path Method (CPM).pdf
ssuser54fc7a
 
Automated University Timetabling
Automated University TimetablingAutomated University Timetabling
Automated University Timetabling
Alexandre Pinto
 
solve 6 , quiz 2link of book httpwww.irccyn.ec-nantes.fr~mart.pdf
solve 6 , quiz 2link of book httpwww.irccyn.ec-nantes.fr~mart.pdfsolve 6 , quiz 2link of book httpwww.irccyn.ec-nantes.fr~mart.pdf
solve 6 , quiz 2link of book httpwww.irccyn.ec-nantes.fr~mart.pdf
arihantcomputersddn
 
ECESLU Microprocessors Lecture 3
ECESLU Microprocessors Lecture 3ECESLU Microprocessors Lecture 3
ECESLU Microprocessors Lecture 3
Jeffrey Des Binwag
 

Similar to Aca2 06 new (20)

Aca2 01 new
Aca2 01 newAca2 01 new
Aca2 01 new
 
6 data envelopment_analysis
6 data envelopment_analysis6 data envelopment_analysis
6 data envelopment_analysis
 
.pptx
.pptx.pptx
.pptx
 
PPCE 1324.pptx process planning and cost estimation
PPCE 1324.pptx process planning and cost estimationPPCE 1324.pptx process planning and cost estimation
PPCE 1324.pptx process planning and cost estimation
 
Project Management Fundamental
Project Management FundamentalProject Management Fundamental
Project Management Fundamental
 
Cpmprt
CpmprtCpmprt
Cpmprt
 
Lec1 Intro to Computer Engineering by Hsien-Hsin Sean Lee Georgia Tech -- Intro
Lec1 Intro to Computer Engineering by Hsien-Hsin Sean Lee Georgia Tech -- IntroLec1 Intro to Computer Engineering by Hsien-Hsin Sean Lee Georgia Tech -- Intro
Lec1 Intro to Computer Engineering by Hsien-Hsin Sean Lee Georgia Tech -- Intro
 
01-Introduction_to_Optimization-v2021.2-Sept23-2021.pptx
01-Introduction_to_Optimization-v2021.2-Sept23-2021.pptx01-Introduction_to_Optimization-v2021.2-Sept23-2021.pptx
01-Introduction_to_Optimization-v2021.2-Sept23-2021.pptx
 
NEW-Workshop Discovery Thammasat 2022.pptx
NEW-Workshop Discovery Thammasat 2022.pptxNEW-Workshop Discovery Thammasat 2022.pptx
NEW-Workshop Discovery Thammasat 2022.pptx
 
Week 8: Programming for Data Analysis
Week 8: Programming for Data AnalysisWeek 8: Programming for Data Analysis
Week 8: Programming for Data Analysis
 
Reducing computational complexity of Mathematical functions using FPGA
Reducing computational complexity of Mathematical functions using FPGAReducing computational complexity of Mathematical functions using FPGA
Reducing computational complexity of Mathematical functions using FPGA
 
Computer aided process design and simulation (Cheg.pptx
Computer aided process design and simulation (Cheg.pptxComputer aided process design and simulation (Cheg.pptx
Computer aided process design and simulation (Cheg.pptx
 
Lec3 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech -- Performance
Lec3 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech -- PerformanceLec3 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech -- Performance
Lec3 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech -- Performance
 
Relational algebra
Relational algebraRelational algebra
Relational algebra
 
CIM UNIT 1.ppt
CIM UNIT 1.pptCIM UNIT 1.ppt
CIM UNIT 1.ppt
 
lec01.pdf
lec01.pdflec01.pdf
lec01.pdf
 
Chapter 10_Project Scheduling Using Critical Path Method (CPM).pdf
Chapter 10_Project Scheduling Using Critical Path Method (CPM).pdfChapter 10_Project Scheduling Using Critical Path Method (CPM).pdf
Chapter 10_Project Scheduling Using Critical Path Method (CPM).pdf
 
Automated University Timetabling
Automated University TimetablingAutomated University Timetabling
Automated University Timetabling
 
solve 6 , quiz 2link of book httpwww.irccyn.ec-nantes.fr~mart.pdf
solve 6 , quiz 2link of book httpwww.irccyn.ec-nantes.fr~mart.pdfsolve 6 , quiz 2link of book httpwww.irccyn.ec-nantes.fr~mart.pdf
solve 6 , quiz 2link of book httpwww.irccyn.ec-nantes.fr~mart.pdf
 
ECESLU Microprocessors Lecture 3
ECESLU Microprocessors Lecture 3ECESLU Microprocessors Lecture 3
ECESLU Microprocessors Lecture 3
 

More from Sumit Mittu

Int306 03
Int306 03Int306 03
Int306 03
Sumit Mittu
 
Int306 02
Int306 02Int306 02
Int306 02
Sumit Mittu
 
Int306 01
Int306 01Int306 01
Int306 01
Sumit Mittu
 
Int306 00
Int306 00Int306 00
Int306 00
Sumit Mittu
 
Int306 04
Int306 04Int306 04
Int306 04
Sumit Mittu
 
Aca2 10 11
Aca2 10 11Aca2 10 11
Aca2 10 11
Sumit Mittu
 
Aca2 09 new
Aca2 09 newAca2 09 new
Aca2 09 new
Sumit Mittu
 
Aca2 07 new
Aca2 07 newAca2 07 new
Aca2 07 new
Sumit Mittu
 
Aca11 bk2 ch9
Aca11 bk2 ch9Aca11 bk2 ch9
Aca11 bk2 ch9
Sumit Mittu
 

More from Sumit Mittu (9)

Int306 03
Int306 03Int306 03
Int306 03
 
Int306 02
Int306 02Int306 02
Int306 02
 
Int306 01
Int306 01Int306 01
Int306 01
 
Int306 00
Int306 00Int306 00
Int306 00
 
Int306 04
Int306 04Int306 04
Int306 04
 
Aca2 10 11
Aca2 10 11Aca2 10 11
Aca2 10 11
 
Aca2 09 new
Aca2 09 newAca2 09 new
Aca2 09 new
 
Aca2 07 new
Aca2 07 newAca2 07 new
Aca2 07 new
 
Aca11 bk2 ch9
Aca11 bk2 ch9Aca11 bk2 ch9
Aca11 bk2 ch9
 

Recently uploaded

Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.
Dr. Shivangi Singh Parihar
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
Priyankaranawat4
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
thanhdowork
 
Top five deadliest dog breeds in America
Top five deadliest dog breeds in AmericaTop five deadliest dog breeds in America
Top five deadliest dog breeds in America
Bisnar Chase Personal Injury Attorneys
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
heathfieldcps1
 
The Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collectionThe Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collection
Israel Genealogy Research Association
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Excellence Foundation for South Sudan
 
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
History of Stoke Newington
 
Group Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana BuscigliopptxGroup Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana Buscigliopptx
ArianaBusciglio
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
heathfieldcps1
 
How to Build a Module in Odoo 17 Using the Scaffold Method
How to Build a Module in Odoo 17 Using the Scaffold MethodHow to Build a Module in Odoo 17 Using the Scaffold Method
How to Build a Module in Odoo 17 Using the Scaffold Method
Celine George
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
Assignment_4_ArianaBusciglio Marvel(1).docx
Assignment_4_ArianaBusciglio Marvel(1).docxAssignment_4_ArianaBusciglio Marvel(1).docx
Assignment_4_ArianaBusciglio Marvel(1).docx
ArianaBusciglio
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
TechSoup
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
tarandeep35
 
MATATAG CURRICULUM: ASSESSING THE READINESS OF ELEM. PUBLIC SCHOOL TEACHERS I...
MATATAG CURRICULUM: ASSESSING THE READINESS OF ELEM. PUBLIC SCHOOL TEACHERS I...MATATAG CURRICULUM: ASSESSING THE READINESS OF ELEM. PUBLIC SCHOOL TEACHERS I...
MATATAG CURRICULUM: ASSESSING THE READINESS OF ELEM. PUBLIC SCHOOL TEACHERS I...
NelTorrente
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
Nicholas Montgomery
 
Digital Artifact 1 - 10VCD Environments Unit
Digital Artifact 1 - 10VCD Environments UnitDigital Artifact 1 - 10VCD Environments Unit
Digital Artifact 1 - 10VCD Environments Unit
chanes7
 

Recently uploaded (20)

Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
 
Top five deadliest dog breeds in America
Top five deadliest dog breeds in AmericaTop five deadliest dog breeds in America
Top five deadliest dog breeds in America
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
 
The Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collectionThe Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collection
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
 
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
 
The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
 
Group Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana BuscigliopptxGroup Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana Buscigliopptx
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
 
How to Build a Module in Odoo 17 Using the Scaffold Method
How to Build a Module in Odoo 17 Using the Scaffold MethodHow to Build a Module in Odoo 17 Using the Scaffold Method
How to Build a Module in Odoo 17 Using the Scaffold Method
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
Assignment_4_ArianaBusciglio Marvel(1).docx
Assignment_4_ArianaBusciglio Marvel(1).docxAssignment_4_ArianaBusciglio Marvel(1).docx
Assignment_4_ArianaBusciglio Marvel(1).docx
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
 
MATATAG CURRICULUM: ASSESSING THE READINESS OF ELEM. PUBLIC SCHOOL TEACHERS I...
MATATAG CURRICULUM: ASSESSING THE READINESS OF ELEM. PUBLIC SCHOOL TEACHERS I...MATATAG CURRICULUM: ASSESSING THE READINESS OF ELEM. PUBLIC SCHOOL TEACHERS I...
MATATAG CURRICULUM: ASSESSING THE READINESS OF ELEM. PUBLIC SCHOOL TEACHERS I...
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
 
Digital Artifact 1 - 10VCD Environments Unit
Digital Artifact 1 - 10VCD Environments UnitDigital Artifact 1 - 10VCD Environments Unit
Digital Artifact 1 - 10VCD Environments Unit
 

Aca2 06 new

  • 1. CSE539: Advanced Computer Architecture Chapter 6 Pipelining and Superscalar Techniques Book: “Advanced Computer Architecture – Parallelism, Scalability, Programmability”, Hwang & Jotwani Sumit Mittu Assistant Professor, CSE/IT Lovely Professional University sumit.12735@lpu.co.in
  • 2. In this chapter… • • • • • Linear Pipeline Processors Non-linear Pipeline Processors Instruction Pipeline Design Arithmetic Pipeline Design Superscalar Pipeline Design Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 2
  • 3. LINEAR PIPELINE PROCESSORS • Linear Pipeline Processor o (Definition) • Models of Linear Pipeline o Synchronous Model o Asynchronous Model o (Corresponding reservation tables) • Clocking and Timing Control o o o o o Clock Cycle Pipeline Frequency Clock skewing Flow-through delay Speedup, Efficiency and Throughput • Optimal number of Stages and Performance-Cost Ratio (PCR) Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 3
  • 4. LINEAR PIPELINE PROCESSORS Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 4
  • 5. LINEAR PIPELINE PROCESSORS Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 5
  • 6. NON-LINEAR PIPELINE PROCESSORS • Dynamic Pipeline o Static v/s Dynamic Pipeline o Streamline connection, feed-forward connection and feedback connection • Reservation and Latency Analysis o Reservation tables o Evaluation time • Latency Analysis o o o o Latency Collision Forbidden latencies Latency Sequence, Latency Cycle and Average Latency Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 6
  • 7. NON-LINEAR PIPELINE PROCESSORS Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 7
  • 8. NON-LINEAR PIPELINE PROCESSORS Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 8
  • 9. NON-LINEAR PIPELINE PROCESSORS Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 9
  • 10. INSTRUCTION PIPELINE DESIGN • Instruction Execution Phases o E.g. Fetch, Decode, Issue, Execute, Write-back o In-order Instruction issuing and Reordered Instruction issuing • E.g. X = Y + Z , A = B x C • Mechanisms/Design Issues for Instruction Pipelining o o o o Pre-fetch Buffers Multiple Functional Units Internal Data Forwarding Hazard Avoidance • Dynamic Scheduling • Branch Handling Techniques Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 10
  • 11. INSTRUCTION PIPELINE DESIGN Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 11
  • 12. INSTRUCTION PIPELINE DESIGN • • • • • Fetch: fetches instructions from memory; ideally one per cycle Decode: reveals instruction operations to be performed and identifies the resources needed Issue: reserves the resources and reads the operands from registers Execute: actual processing of operations as indicated by instruction Write Back: writing results into the registers Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 12
  • 13. INSTRUCTION PIPELINE DESIGN Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 13
  • 14. INSTRUCTION PIPELINE DESIGN Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 14
  • 15. INSTRUCTION PIPELINE DESIGN Mechanisms/Design Issues of Instruction Pipeline • Pre-fetch Buffers o Sequential Buffers o Target Buffers o Loop Buffers Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 15
  • 16. INSTRUCTION PIPELINE DESIGN Mechanisms/Design Issues of Instruction Pipeline • Multiple Functional Units o Reservation Station and Tags o Slow-station as Bottleneck stage • Subdivision of Pipeline Bottleneck stage • Replication of Pipeline Bottleneck stage • (Example to be discussed) Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 16
  • 17. INSTRUCTION PIPELINE DESIGN Mechanisms/Design Issues of Instruction Pipeline Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 17
  • 18. INSTRUCTION PIPELINE DESIGN Mechanisms/Design Issues of Instruction Pipeline • Internal Forwarding and Register Tagging o Internal Forwarding: • A “short-circuit” technique to replace unnecessary memory accesses by register-register transfers in a sequence of fetch-arithmetic-store operations o Register Tagging: • Use of tagged registers , buffers and reservation stations, for exploiting concurrent activities among multiple arithmetic units o Store-Fetch Forwarding • (M  R1, R2  M) replaced by (M  R1, R2  R1) o Fetch-Fetch Forwarding • (R1  M, R2  M) replaced by (R1  M, R2  R1) o Store-Store Overwriting • (M  R1, M  R2) replaced by (M  R2) Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 18
  • 19. INSTRUCTION PIPELINE DESIGN Mechanisms/Design Issues of Instruction Pipeline • Internal Forwarding and Register Tagging Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 19
  • 20. INSTRUCTION PIPELINE DESIGN Mechanisms/Design Issues of Instruction Pipeline • Internal Forwarding and Register Tagging Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 20
  • 21. INSTRUCTION PIPELINE DESIGN Mechanisms/Design Issues of Instruction Pipeline • Hazard Detection and Avoidance o o o o Domain or Input Set of an instruction Range or Output Set of an instruction Data Hazards: RAW, WAR and WAW Resolution using Register Renaming approach Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 21
  • 22. INSTRUCTION PIPELINE DESIGN Dynamic Instruction Scheduling • Idea of Static Scheduling o Compiler based scheduling strategy to resolve Interlocking among instructions • Dynamic Scheduling o Tomasulo’s Algorithm (Register-Tagging Scheme) • Hardware based dependence-resolution o Scoreboarding Technique • Scoreboard: the centralized control unit • A kind of data-driven mechanism Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 22
  • 23. INSTRUCTION PIPELINE DESIGN Dynamic Instruction Scheduling Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 23
  • 24. INSTRUCTION PIPELINE DESIGN Dynamic Instruction Scheduling Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 24
  • 25. INSTRUCTION PIPELINE DESIGN Branch Handling Techniques • Branch Taken, Branch Target, Delay Slot • Effect of Branching o Parameters: • k: No. of stages in the pipeline • n: Total no. of instructions or tasks • p: Percentage of Brach instructions over n • q: Percentage of successful branch instructions (branch taken) over p. • b: Delay Slot • τ: Pipeline Cycle Time o Branch Penalty = q of (p of n) * bτ = pqnbτ o Effective Execution Time: • Teff = [k + (n-1)] τ + pqnbτ = [k + (n-1) + pqnb]τ Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 25
  • 26. INSTRUCTION PIPELINE DESIGN Branch Handling Techniques • Effect of Branching o Effective Throughput: • Heff = n/Teff • Heff = n / {[k + (n-1) + pqnb]τ} = nf / [k + (n-1) + pqnb] • As nInfinity and b = k-1 o H*eff = f / [pq(k-1)+1] • If p=0 and q=0 (no branching occurs) o H**eff = f = 1/τ o Performance Degradation Factor • D = 1 – H*eff / f = pq(k-1) / [pq(k-1)+1] Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 26
  • 27. INSTRUCTION PIPELINE DESIGN Branch Handling Techniques Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 27
  • 28. INSTRUCTION PIPELINE DESIGN Branch Handling Techniques • Branch Prediction o Static Branch Prediction: based on branch code types o Dynamic Branch prediction: based on recent branch history • Strategy 1: Predict the branch direction based on information found at decode stage. • Strategy 2: Use a cache to store target addresses at effective address calculation stage. • Strategy 3: Use a cache to store target instructions at fetch stage o Brach Target Buffer Organization • Delayed Branches o A delayed branch of d cycles allows at most d-1 useful instructions to be executed following the branch taken. o Execution of these instructions should be independent of branch instruction to achieve a zero branch penalty Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 28
  • 29. INSTRUCTION PIPELINE DESIGN Branch Handling Techniques Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 29
  • 30. INSTRUCTION PIPELINE DESIGN Branch Handling Techniques Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 30
  • 31. ARITHMETIC PIPELINE DESIGN Computer Arithmetic Operations • Finite-precision arithmetic • Overflow and Underflow • Fixed-Point operations o Notations: • Signed-magnitude, one’s complement and two-complement notation o Operations: • Addition: (n bit, n bit)  (n bit) Sum, 1 bit output carry • Subtraction: (n bit, n bit)  (n bit) difference • Multiplication: (n bit, n bit)  (2n bit) product • Division: (2n bit, n bit)  (n bit) quotient, (n bit) remainder Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 31
  • 32. ARITHMETIC PIPELINE DESIGN Computer Arithmetic Operations • Floating-Point Numbers o X = (m, e) representation • m: mantissa or fraction • e: exponent with an implied base or radix r. • Actual Value X = m * r e o Operations on numbers X = (mx, ex) and Y = (my, ey) • Addition: (mx * rex-ey + my, ey) • Subtraction: (mx * rex-ey – my, ey) • Multiplication: (mx * my, ex+ey) • Division: (mx / my, ex – ey) • Elementary Functions o Transcendental functions like: Trigonometric, Exponential, Logarithmic, etc. Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 32
  • 33. ARITHMETIC PIPELINE DESIGN Static Arithmetic Pipelines • • • • Separate units for fixed point operations and floating point operations Scalar and Vector Arithmetic Pipelines Uni-functional or Static Pipelines Arithmetic Pipeline Stages o Majorly involve hardware to perform: Add and Shift micro-operations o Addition using: Carry Propagation Adder (CPA) and Carry Save Adder (CSA) o Shift using: Shift Registers • Multiplication Pipeline Design o E.g. To multiply two 8-bit numbers that yield a 16-bit product using CSA and CPA Wallace Tree. Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 33
  • 34. ARITHMETIC PIPELINE DESIGN Static Arithmetic Pipelines 0 Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 34
  • 35. ARITHMETIC PIPELINE DESIGN Static Arithmetic Pipelines Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 35
  • 36. ARITHMETIC PIPELINE DESIGN Multifunctional Arithmetic Pipelines • Multifunctional Pipeline: o Static multifunctional pipeline o Dynamic multifunctional pipeline • Case Study: T1/ASC static multifunctional pipeline architecture Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 36
  • 37. ARITHMETIC PIPELINE DESIGN Multifunctional Arithmetic Pipelines Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 37
  • 38. ARITHMETIC PIPELINE DESIGN Multifunctional Arithmetic Pipelines Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 38
  • 39. SUPERSCALAR PIPELINE DESIGN • Pipeline Design Parameters o Pipeline cycle, Base cycle, Instruction issue rate, Instruction issue Latency, Simple Operation Latency o ILP to fully utilize the pipeline • • • • Superscalar Pipeline Structure Data and Resource Dependencies Pipeline Stalling Superscalar Pipeline Scheduling o In-order Issue and in-order completion o In-order Issue and out-of-order completion o Out-of-order Issue and out-of-order completion • Superscalar Performance Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 39
  • 40. SUPERSCALAR PIPELINE DESIGN Parameter Base Scalar Processor Super Scalar Processor (degree = K) Pipeline Cycle 1 (base cycle) K Instruction Issue Rate 1 K Instruction Issue Latency 1 1 Simple Operation Latency 1 1 ILP to fully utilize pipeline 1 K Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 40
  • 41. SUPERSCALAR PIPELINE DESIGN 4, Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 41
  • 42. SUPERSCALAR PIPELINE DESIGN Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 42
  • 43. SUPERSCALAR PIPELINE DESIGN Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 43
  • 44. SUPERSCALAR PIPELINE DESIGN Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 44
  • 45. SUPERSCALAR PIPELINE DESIGN Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 45
  • 46. SUPERSCALAR PIPELINE DESIGN Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 46
  • 47. SUPERSCALAR PIPELINE DESIGN 4, Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 47
  • 48. SUPERSCALAR PIPELINE DESIGN Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 48
  • 49. SUPERSCALAR PIPELINE DESIGN • Time required by base scalar machine: o T(1,1) = K + N – 1 • The ideal execution time required by m-issue superscalar machine: o T(m,1) = K + (N – m)/m o Where, • K is the time required to execute first m instructions through m pipelines of k-stages simultaneously • Second term corresponds to time required to execute remaining N-m instructions , m per cycle through m pipelines • The ideal speedup of superscalar machine o S(m,1) = T(1,1)/T(m,1) = m(N + k – 1)/[N+ m(k – 1)] • As n  infinity, S(m,1) m Sumit Mittu, Assistant Professor, CSE/IT, Lovely Professional University 49