SlideShare a Scribd company logo
Pipeline and vector processors
often require
s i m u l t a n e o u s
access to mema
from two or more
sources. An instruction pipeline may require the fetching of
an
instruction and an operand at the same time from two ditferent segments
Similarly, an
arithmetic pipeline usually requires
two or more operands to0
enter the pipeline at the same time. Instead of using two memory buses for
simultaneous access, the memory
can be partitioned into a number of mod-
ules connected to a common memory
address and data buses. A memory
module is a memory array together with its own address and data registers
Figure 9-13 shows a memory unit with four modules. Each memory array has
its own address register AR and data register DR. The address registers receive
information from a common address bus and the data registers communicate
with a bidirectional data bus. The two least significant bits of the address can
be used to distinguish between the four modules. The modular system permls
one module to initiate a memory access while other modules are in
u
process of reading or writing a word and each module can honor a memoly
request independent of the state of the other modules.
nory
Memory Interleaving
Figure 9-13 Multiple module memory organization.
Address bus
AR AR AR AR
Memory Memory
array Memory Memory
array array array
DR
DR
DR DR
Data bus
327
ECTION 9-6 Vector l'rocessing
The advantage of a modular memory is thatit allows the use o! a tech
nique caled interleaving. In an interleaved memory, different sets of addresses
are
assigned to ditferent memory modules. For example, in a two-modille
memory system, the even addresses may be in one module and the ad
addresses in the other. When the number of modules is a
power of 2. the least
signilicant bits of the address select a
memory module and the remaining bits
designate the specilic location to be accessed within the selected module
A modular memory is useful in systems with pipeline and vector pro-
cessing. A vector
processor that uses an n-way interleaved memory can tetch
n
operands from n different modules. By staggering the memory access, the
effective memory cycle time can be reduced by a factor close to the number
of modules. A CPU with instruction pipeline can take advantage of muliple
memory modules so that each segment in the pipeline can access
memory
independent of memory access from other segments.
Superscalar Processors
A
superscalar processor architecture has a form of parallelism on a
single
chip allowing the system as a whole to run much faster than it would other-
wise be able to at a
given clock speed. A
superscalar architecture fetches, exe-
cutes, and returns results from more than one instruction during a single
pipeline stage. A scalar processor processes one data item at a time. In a vec-
tor processor, by contrast, a
single instruction operates simultaneously on
multiple data items. The difference is
analogous to the difference between
scalar and vector arithmetic. A
superscalar processor is sort of a mixture of
the two. Each instruction processes one data item, but there are
multiple pro-
cessing units so that multiple instructions can be
processing separate data
items at the same time.
A superscalar processor normally has an execution rate in excess of one
instruction per machine cycle. But just processing multiple instructions at the
same time does not make an architecture superscalar. Simple pipelining,
where a
processor may be loading an instruction while doing arithmetic for
the previous one and storing the results from the one before that (thus exe-
cuting three instructions at the. same time) is not
superscalar processing. In a
superscalar processor, there are several unctional units of the same type,
along with additional circuitry to
dispatch instructions to the units. For
instance, most superscalar designs include more than one arithmetic and logic
unit. The dispatcher reads instructions trom memory and decides which ones
can be run in parallel, dispatching them to the two units. Seymour Crav's
CDC 6600 from 1965 is often mentioned as the first
superscalar design. The
Intel i96OCA (1988) and the AMD 29000-series 29050 (1990) microprocessors
were the first commercial single-chiP superscalar microprocessors. The
RS6000 from IBM was released in 1990 and was the world's first superscalar
RISC microprocessor. Intel followed in 1993 with the Pentium, which with its
two ALUs brought the x86 world into the superscalar era.

More Related Content

What's hot

INTRODUCTION TO PARALLEL PROCESSING
INTRODUCTION TO PARALLEL PROCESSINGINTRODUCTION TO PARALLEL PROCESSING
INTRODUCTION TO PARALLEL PROCESSING
GS Kosta
 
multiprocessors and multicomputers
 multiprocessors and multicomputers multiprocessors and multicomputers
multiprocessors and multicomputers
Pankaj Kumar Jain
 
Parallel processing
Parallel processingParallel processing
Parallel processing
Praveen Kumar
 
Data flow architecture
Data flow architectureData flow architecture
Data flow architecture
Sourav Routh
 
Difference Between CISC RISC, Harward & Von-neuman
Difference Between CISC RISC, Harward & Von-neumanDifference Between CISC RISC, Harward & Von-neuman
Difference Between CISC RISC, Harward & Von-neuman
Kailas Kharse
 
RISC Vs CISC, Harvard v/s Van Neumann
RISC Vs CISC, Harvard v/s Van NeumannRISC Vs CISC, Harvard v/s Van Neumann
RISC Vs CISC, Harvard v/s Van Neumann
Ravikumar Tiwari
 
Parallel Processing Concepts
Parallel Processing Concepts Parallel Processing Concepts
Parallel Processing Concepts
Dr Shashikant Athawale
 
Feng’s classification
Feng’s classificationFeng’s classification
Feng’s classification
Narayan Kandel
 
Pipelining and vector processing
Pipelining and vector processingPipelining and vector processing
Pipelining and vector processing
Kamal Acharya
 
Lecture02 types
Lecture02 typesLecture02 types
Lecture02 types
Ganesh Chavan
 
ADVANCED COMPUTER ARCHITECTURE AND PARALLEL PROCESSING
ADVANCED COMPUTER ARCHITECTUREAND PARALLEL PROCESSINGADVANCED COMPUTER ARCHITECTUREAND PARALLEL PROCESSING
ADVANCED COMPUTER ARCHITECTURE AND PARALLEL PROCESSING
Zena Abo-Altaheen
 
Parallel processing (simd and mimd)
Parallel processing (simd and mimd)Parallel processing (simd and mimd)
Parallel processing (simd and mimd)
Bhavik Vashi
 
Evaluation of morden computer & system attributes in ACA
Evaluation of morden computer &  system attributes in ACAEvaluation of morden computer &  system attributes in ACA
Evaluation of morden computer & system attributes in ACA
Pankaj Kumar Jain
 
Hardware and Software parallelism
Hardware and Software parallelismHardware and Software parallelism
Hardware and Software parallelismprashantdahake
 
Advanced computer architecture
Advanced computer architectureAdvanced computer architecture
Advanced computer architecture
krishnaviswambharan
 
Computer arithmetic in computer architecture
Computer arithmetic in computer architectureComputer arithmetic in computer architecture
Computer arithmetic in computer architecture
ishapadhy
 
Harvard architecture
Harvard architectureHarvard architecture
Harvard architectureGichelle Amon
 
Aca2 08 new
Aca2 08 newAca2 08 new
Aca2 08 new
Sumit Mittu
 

What's hot (20)

INTRODUCTION TO PARALLEL PROCESSING
INTRODUCTION TO PARALLEL PROCESSINGINTRODUCTION TO PARALLEL PROCESSING
INTRODUCTION TO PARALLEL PROCESSING
 
multiprocessors and multicomputers
 multiprocessors and multicomputers multiprocessors and multicomputers
multiprocessors and multicomputers
 
Parallel processing Concepts
Parallel processing ConceptsParallel processing Concepts
Parallel processing Concepts
 
Aca 2
Aca 2Aca 2
Aca 2
 
Parallel processing
Parallel processingParallel processing
Parallel processing
 
Data flow architecture
Data flow architectureData flow architecture
Data flow architecture
 
Difference Between CISC RISC, Harward & Von-neuman
Difference Between CISC RISC, Harward & Von-neumanDifference Between CISC RISC, Harward & Von-neuman
Difference Between CISC RISC, Harward & Von-neuman
 
RISC Vs CISC, Harvard v/s Van Neumann
RISC Vs CISC, Harvard v/s Van NeumannRISC Vs CISC, Harvard v/s Van Neumann
RISC Vs CISC, Harvard v/s Van Neumann
 
Parallel Processing Concepts
Parallel Processing Concepts Parallel Processing Concepts
Parallel Processing Concepts
 
Feng’s classification
Feng’s classificationFeng’s classification
Feng’s classification
 
Pipelining and vector processing
Pipelining and vector processingPipelining and vector processing
Pipelining and vector processing
 
Lecture02 types
Lecture02 typesLecture02 types
Lecture02 types
 
ADVANCED COMPUTER ARCHITECTURE AND PARALLEL PROCESSING
ADVANCED COMPUTER ARCHITECTUREAND PARALLEL PROCESSINGADVANCED COMPUTER ARCHITECTUREAND PARALLEL PROCESSING
ADVANCED COMPUTER ARCHITECTURE AND PARALLEL PROCESSING
 
Parallel processing (simd and mimd)
Parallel processing (simd and mimd)Parallel processing (simd and mimd)
Parallel processing (simd and mimd)
 
Evaluation of morden computer & system attributes in ACA
Evaluation of morden computer &  system attributes in ACAEvaluation of morden computer &  system attributes in ACA
Evaluation of morden computer & system attributes in ACA
 
Hardware and Software parallelism
Hardware and Software parallelismHardware and Software parallelism
Hardware and Software parallelism
 
Advanced computer architecture
Advanced computer architectureAdvanced computer architecture
Advanced computer architecture
 
Computer arithmetic in computer architecture
Computer arithmetic in computer architectureComputer arithmetic in computer architecture
Computer arithmetic in computer architecture
 
Harvard architecture
Harvard architectureHarvard architecture
Harvard architecture
 
Aca2 08 new
Aca2 08 newAca2 08 new
Aca2 08 new
 

Similar to Memory interleaving and superscalar processor

Multiprocessor structures
Multiprocessor structuresMultiprocessor structures
Multiprocessor structures
Shareb Ismaeel
 
ICT III - MPMC - Answer Key.pdf
ICT III - MPMC - Answer Key.pdfICT III - MPMC - Answer Key.pdf
ICT III - MPMC - Answer Key.pdf
GowriShankar881783
 
22CS201 COA
22CS201 COA22CS201 COA
22CS201 COA
Kathirvel Ayyaswamy
 
Distributed system lectures
Distributed system lecturesDistributed system lectures
Distributed system lectures
marwaeng
 
Compiler design
Compiler designCompiler design
Compiler design
renukarenuka9
 
Multiprocessor
MultiprocessorMultiprocessor
Multiprocessor
Kamal Acharya
 
Computer architecture multi processor
Computer architecture multi processorComputer architecture multi processor
Computer architecture multi processor
Mazin Alwaaly
 
Dosass2
Dosass2Dosass2
Dosass2
ShivaliKundra
 
Computer System Architecture
Computer System ArchitectureComputer System Architecture
Computer System Architecture
Brenda Debra
 
IS 139 Lecture 5
IS 139 Lecture 5IS 139 Lecture 5
IS 139 Lecture 5
wajanga
 
Cache performance-x86-2009
Cache performance-x86-2009Cache performance-x86-2009
Cache performance-x86-2009Léia de Sousa
 
Multiple processor (ppt 2010)
Multiple processor (ppt 2010)Multiple processor (ppt 2010)
Multiple processor (ppt 2010)
Arth Ramada
 
Chapter 10
Chapter 10Chapter 10
Memory consistency models
Memory consistency modelsMemory consistency models
Memory consistency models
palani kumar
 
Bus Based Multiprocessors v2
Bus Based Multiprocessors v2Bus Based Multiprocessors v2
Bus Based Multiprocessors v2
Mustafa Yumurtacı
 
Hcs Topic 2 Computer Structure V2
Hcs Topic 2  Computer Structure V2Hcs Topic 2  Computer Structure V2
Hcs Topic 2 Computer Structure V2Naruin
 
Hcs Topic 2 Computer Structure V2
Hcs Topic 2  Computer Structure V2Hcs Topic 2  Computer Structure V2
Hcs Topic 2 Computer Structure V2Kyle
 

Similar to Memory interleaving and superscalar processor (20)

Multiprocessor structures
Multiprocessor structuresMultiprocessor structures
Multiprocessor structures
 
ICT III - MPMC - Answer Key.pdf
ICT III - MPMC - Answer Key.pdfICT III - MPMC - Answer Key.pdf
ICT III - MPMC - Answer Key.pdf
 
Cache memory
Cache memoryCache memory
Cache memory
 
22CS201 COA
22CS201 COA22CS201 COA
22CS201 COA
 
shashank_spdp1993_00395543
shashank_spdp1993_00395543shashank_spdp1993_00395543
shashank_spdp1993_00395543
 
Distributed system lectures
Distributed system lecturesDistributed system lectures
Distributed system lectures
 
Compiler design
Compiler designCompiler design
Compiler design
 
Multiprocessor
MultiprocessorMultiprocessor
Multiprocessor
 
Computer architecture multi processor
Computer architecture multi processorComputer architecture multi processor
Computer architecture multi processor
 
Dosass2
Dosass2Dosass2
Dosass2
 
Computer System Architecture
Computer System ArchitectureComputer System Architecture
Computer System Architecture
 
IS 139 Lecture 5
IS 139 Lecture 5IS 139 Lecture 5
IS 139 Lecture 5
 
No sql
No sqlNo sql
No sql
 
Cache performance-x86-2009
Cache performance-x86-2009Cache performance-x86-2009
Cache performance-x86-2009
 
Multiple processor (ppt 2010)
Multiple processor (ppt 2010)Multiple processor (ppt 2010)
Multiple processor (ppt 2010)
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
 
Memory consistency models
Memory consistency modelsMemory consistency models
Memory consistency models
 
Bus Based Multiprocessors v2
Bus Based Multiprocessors v2Bus Based Multiprocessors v2
Bus Based Multiprocessors v2
 
Hcs Topic 2 Computer Structure V2
Hcs Topic 2  Computer Structure V2Hcs Topic 2  Computer Structure V2
Hcs Topic 2 Computer Structure V2
 
Hcs Topic 2 Computer Structure V2
Hcs Topic 2  Computer Structure V2Hcs Topic 2  Computer Structure V2
Hcs Topic 2 Computer Structure V2
 

Recently uploaded

AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
BrazilAccount1
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
Kerry Sado
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
Massimo Talia
 
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
AJAYKUMARPUND1
 
DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
FluxPrime1
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
seandesed
 
English lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdfEnglish lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdf
BrazilAccount1
 
WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234
AafreenAbuthahir2
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
Pipe Restoration Solutions
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
Pratik Pawar
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
Osamah Alsalih
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
zwunae
 
ethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.pptethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.ppt
Jayaprasanna4
 
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdfTop 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Teleport Manpower Consultant
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
R&R Consult
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
Kamal Acharya
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
MdTanvirMahtab2
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
Neometrix_Engineering_Pvt_Ltd
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
gerogepatton
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
JoytuBarua2
 

Recently uploaded (20)

AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
 
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
Pile Foundation by Venkatesh Taduvai (Sub Geotechnical Engineering II)-conver...
 
DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
 
English lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdfEnglish lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdf
 
WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
 
ethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.pptethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.ppt
 
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdfTop 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
 

Memory interleaving and superscalar processor

  • 1. Pipeline and vector processors often require s i m u l t a n e o u s access to mema from two or more sources. An instruction pipeline may require the fetching of an instruction and an operand at the same time from two ditferent segments Similarly, an arithmetic pipeline usually requires two or more operands to0 enter the pipeline at the same time. Instead of using two memory buses for simultaneous access, the memory can be partitioned into a number of mod- ules connected to a common memory address and data buses. A memory module is a memory array together with its own address and data registers Figure 9-13 shows a memory unit with four modules. Each memory array has its own address register AR and data register DR. The address registers receive information from a common address bus and the data registers communicate with a bidirectional data bus. The two least significant bits of the address can be used to distinguish between the four modules. The modular system permls one module to initiate a memory access while other modules are in u process of reading or writing a word and each module can honor a memoly request independent of the state of the other modules. nory Memory Interleaving Figure 9-13 Multiple module memory organization. Address bus AR AR AR AR Memory Memory array Memory Memory array array array DR DR DR DR Data bus
  • 2. 327 ECTION 9-6 Vector l'rocessing The advantage of a modular memory is thatit allows the use o! a tech nique caled interleaving. In an interleaved memory, different sets of addresses are assigned to ditferent memory modules. For example, in a two-modille memory system, the even addresses may be in one module and the ad addresses in the other. When the number of modules is a power of 2. the least signilicant bits of the address select a memory module and the remaining bits designate the specilic location to be accessed within the selected module A modular memory is useful in systems with pipeline and vector pro- cessing. A vector processor that uses an n-way interleaved memory can tetch n operands from n different modules. By staggering the memory access, the effective memory cycle time can be reduced by a factor close to the number of modules. A CPU with instruction pipeline can take advantage of muliple memory modules so that each segment in the pipeline can access memory independent of memory access from other segments. Superscalar Processors A superscalar processor architecture has a form of parallelism on a single chip allowing the system as a whole to run much faster than it would other- wise be able to at a given clock speed. A superscalar architecture fetches, exe- cutes, and returns results from more than one instruction during a single pipeline stage. A scalar processor processes one data item at a time. In a vec- tor processor, by contrast, a single instruction operates simultaneously on multiple data items. The difference is analogous to the difference between scalar and vector arithmetic. A superscalar processor is sort of a mixture of the two. Each instruction processes one data item, but there are multiple pro- cessing units so that multiple instructions can be processing separate data items at the same time. A superscalar processor normally has an execution rate in excess of one instruction per machine cycle. But just processing multiple instructions at the same time does not make an architecture superscalar. Simple pipelining, where a processor may be loading an instruction while doing arithmetic for the previous one and storing the results from the one before that (thus exe- cuting three instructions at the. same time) is not superscalar processing. In a superscalar processor, there are several unctional units of the same type, along with additional circuitry to dispatch instructions to the units. For instance, most superscalar designs include more than one arithmetic and logic unit. The dispatcher reads instructions trom memory and decides which ones can be run in parallel, dispatching them to the two units. Seymour Crav's CDC 6600 from 1965 is often mentioned as the first superscalar design. The Intel i96OCA (1988) and the AMD 29000-series 29050 (1990) microprocessors were the first commercial single-chiP superscalar microprocessors. The RS6000 from IBM was released in 1990 and was the world's first superscalar RISC microprocessor. Intel followed in 1993 with the Pentium, which with its two ALUs brought the x86 world into the superscalar era.