SlideShare a Scribd company logo
Chapter 18
Parallel Processing
(Multiprocessing)
Its All About Increasing Performance
• Processor performance can be measured by the
rate at which it executes instructions
MIPS rate = f * IPC (Millions of Instructions per Second)
— f is the processor clock frequency, in MHz
— IPC the is average Instructions Per Cycle
• Increase performance by
—increasing clock frequency and
—increasing instructions that complete during cycle
– May be reaching limit 
+ Complexity
+ Power consumption
Computer Organizations
Multiprogramming and Multiprocessing
Taxonomy of Parallel Processor Architectures
Multiple Processor Organization
• SISD - Single instruction, single data stream
• SIMD - Single instruction, multiple data stream
• MISD - Multiple instruction, single data stream
• MIMD - Multiple instruction, multiple data stream
SISD - Single Instruction, Single Data Stream
• Single processor
• Single instruction stream
• Data stored in single memory
• Uni-processor
Control Unit Processing Unit Memory Unit
SIMD - Single Instruction, Multiple Data Stream
• Single machine instruction
• Number of processing elements
• Each processing element has associated data memory
• Each instruction simultaneously executed on
different set of data by different processors
Typical Application - Vector and Array processors
MISD Multiple Instruction, Single Data Stream
• One sequence of data
• A set of processors
• Each processor executes different
instruction sequence
Not much practical application
MIMD - Multiple Instruction, Multiple Data Stream
• Set of processors
• Simultaneously execute different instruction sequences
• Different sets of data
— SMPs (Symmetric Multiprocessors)
— NUMA systems (Non-uniform Memory Access)
— Clusters (Groups of “partnering” computers)
Shared memory (SMP or NUMA) Distributed memory (Clusters)
MIMD - Overview
• General purpose processors
• Each can process all instructions
necessary
• Further classified by method of processor
communication & memory access
MIMD - Tightly Coupled
• Processors share memory
• Communicate via that shared memory
Symmetric Multiprocessor (SMP)
- Share single memory or pool
- Shared bus to access memory
- Memory access time to given area of memory is
approximately the same for each processor
Nonuniform memory access (NUMA)
- Access times to different regions of memory may differ
Block Diagram of Tightly Coupled Multiprocessor
MIMD - Loosely Coupled
Clusters
• Collection of independent
uniprocessors
• Interconnected to form a cluster
• Communication via fixed path or network
connections
Symmetric Multiprocessors (SMP)
• A stand alone “computer” with the following
characteristics:
—Two or more similar processors of comparable
capacity
– All processors can perform the same functions (hence
symmetric)
—Processors share same memory and I/O access
– Memory access time is approximately the same for each
processor (time shared bus or multi-port memory)
—Processors are connected by a bus or other internal
connection
—System controlled by integrated operating system
– Providing interaction between processors
– Providing interaction at job, task, file and data element
levels
SMP Advantages
• Performance
— If some work can be done in parallel
• Availability
— Since all processors can perform the same
functions, failure of a single processor
does not halt the system
• Incremental growth
— User can enhance performance by adding
additional processors
• Scaling
— Vendors can offer range of products based
on number of processors
Symmetric Multiprocessor Organization
Time Shared Bus (vs Multiport memory)
• Simplest form
• Structure and interface similar to single
processor system
• Following features provided
—Addressing - distinguish modules on bus
—Arbitration - any module can be temporary master
—Time sharing - if one module has the bus, others must
wait and may have to suspend
Time Share Bus - Advantages
Advantages:
• Simplicity
• Flexibility
• Reliability
Disadvantages
• Performance limited by bus cycle time
• Each processor must have local cache
— Reduce number of bus accesses
• Leads to problems with cache coherence
Operating System Issues
• Simultaneous concurrent processes
• Scheduling
• Synchronization
• Memory management
• Reliability and fault tolerance
• Cache Coherence
Cache Coherence
• Problem - multiple copies of same data
in different caches
• Can result in an inconsistent view of
memory
—Write back policy can lead to inconsistency
—Write through can also give problems unless
caches monitor memory traffic
 MESI Protocol (Modify - Exclusive - Shared - Invalid )
Software Solution to Cache Coherence
Compiler and operating system deal with
problem
• Overhead transferred to compile time
• Design complexity transferred from
hardware to software
— Analyze code to determine safe periods for
caching shared variables
— However, software tends to (must) make
conservative decisions
— Inefficient cache utilization
Hardware Solution to Cache Coherence
Cache coherence hardware protocols
• Dynamic recognition of potential
problems
• Run time solution
—More efficient use of cache
• Transparent to programmer / Compiler
Implemented with:
• Directory protocols
• Snoopy protocols
Directory & Snoopy Protocols
Directory Protocols
Effective in large scale systems with complex interconnection
schemes
• Collect and maintain information about copies of data in cache
— Directory stored in main memory
• Requests are checked against directory
— Appropriate transfers are performed
Creates central bottleneck
Snoopy Protocols
Suited to bus based multiprocessor
• Distribute cache coherence responsibility among cache
controllers
• Cache recognizes that a line is shared
• Updates announced to other caches
Increases bus traffic
Snoopy Protocols
• Write Update Protocol (Write Broadcast)
—Multiple readers and writers
—Updated word is distributed to all other processors
—Multiple readers, one writer
• Write Invalidate protocol (MESI)
—When a write is required, all other caches of the line
are invalidated
—Writing processor then has exclusive (cheap) access
until line is required by another processor
MESI Protocol - State of every line is marked as Modified,
Exclusive, Shared or Invalid
- two bits are included with each cache tag
Processor Designs
• Pipelined ALU
—Within operations
—Across operations
• Parallel ALUs
• Parallel processors

More Related Content

Similar to chapter-18-parallel-processing-multiprocessing (1).ppt

unit 4.pptx
unit 4.pptxunit 4.pptx
High performance computing
High performance computingHigh performance computing
High performance computing
punjab engineering college, chandigarh
 
22CS201 COA
22CS201 COA22CS201 COA
22CS201 COA
Kathirvel Ayyaswamy
 
CA UNIT IV.pptx
CA UNIT IV.pptxCA UNIT IV.pptx
CA UNIT IV.pptx
ssuser9dbd7e
 
Multiprocessor.pptx
 Multiprocessor.pptx Multiprocessor.pptx
Multiprocessor.pptx
Muhammad54342
 
BIL406-Chapter-2-Classifications of Parallel Systems.ppt
BIL406-Chapter-2-Classifications of Parallel Systems.pptBIL406-Chapter-2-Classifications of Parallel Systems.ppt
BIL406-Chapter-2-Classifications of Parallel Systems.ppt
Kadri20
 
EMBEDDED OS
EMBEDDED OSEMBEDDED OS
EMBEDDED OS
AJAL A J
 
Aca module 1
Aca module 1Aca module 1
Aca module 1
Avinash_N Rao
 
Introduction to parallel processing
Introduction to parallel processingIntroduction to parallel processing
Introduction to parallel processing
Page Maker
 
CSA unit5.pptx
CSA unit5.pptxCSA unit5.pptx
CSA unit5.pptx
AbcvDef
 
OS_MD_4.pdf
OS_MD_4.pdfOS_MD_4.pdf
OS_MD_4.pdf
SangeethaBS4
 
Classification of Parallel Computers.pptx
Classification of Parallel Computers.pptxClassification of Parallel Computers.pptx
Classification of Parallel Computers.pptx
Neeraj Singh
 
Advanced processor Principles
Advanced processor PrinciplesAdvanced processor Principles
Advanced processor Principles
Vinit Raut
 
Memory Management in Operating Systems for all
Memory Management in Operating Systems for allMemory Management in Operating Systems for all
Memory Management in Operating Systems for all
VSKAMCSPSGCT
 
Intro_ppt.pptx
Intro_ppt.pptxIntro_ppt.pptx
Intro_ppt.pptx
ssuser906c831
 
Cache Performance Evaluation
Cache Performance EvaluationCache Performance Evaluation
Cache Performance Evaluation
Alfred Mutanga
 

Similar to chapter-18-parallel-processing-multiprocessing (1).ppt (20)

unit 4.pptx
unit 4.pptxunit 4.pptx
unit 4.pptx
 
Parallel processing Concepts
Parallel processing ConceptsParallel processing Concepts
Parallel processing Concepts
 
Lecture4
Lecture4Lecture4
Lecture4
 
High performance computing
High performance computingHigh performance computing
High performance computing
 
22CS201 COA
22CS201 COA22CS201 COA
22CS201 COA
 
CA UNIT IV.pptx
CA UNIT IV.pptxCA UNIT IV.pptx
CA UNIT IV.pptx
 
Multiprocessor.pptx
 Multiprocessor.pptx Multiprocessor.pptx
Multiprocessor.pptx
 
BIL406-Chapter-2-Classifications of Parallel Systems.ppt
BIL406-Chapter-2-Classifications of Parallel Systems.pptBIL406-Chapter-2-Classifications of Parallel Systems.ppt
BIL406-Chapter-2-Classifications of Parallel Systems.ppt
 
EMBEDDED OS
EMBEDDED OSEMBEDDED OS
EMBEDDED OS
 
Multi processing
Multi processingMulti processing
Multi processing
 
Aca module 1
Aca module 1Aca module 1
Aca module 1
 
Introduction to parallel processing
Introduction to parallel processingIntroduction to parallel processing
Introduction to parallel processing
 
CSA unit5.pptx
CSA unit5.pptxCSA unit5.pptx
CSA unit5.pptx
 
OS_MD_4.pdf
OS_MD_4.pdfOS_MD_4.pdf
OS_MD_4.pdf
 
Classification of Parallel Computers.pptx
Classification of Parallel Computers.pptxClassification of Parallel Computers.pptx
Classification of Parallel Computers.pptx
 
Advanced processor Principles
Advanced processor PrinciplesAdvanced processor Principles
Advanced processor Principles
 
Memory Management in Operating Systems for all
Memory Management in Operating Systems for allMemory Management in Operating Systems for all
Memory Management in Operating Systems for all
 
Intro_ppt.pptx
Intro_ppt.pptxIntro_ppt.pptx
Intro_ppt.pptx
 
Cache Performance Evaluation
Cache Performance EvaluationCache Performance Evaluation
Cache Performance Evaluation
 
archintro.pdf
archintro.pdfarchintro.pdf
archintro.pdf
 

More from NANDHINIS109942

machine learning alg.pptx
machine learning alg.pptxmachine learning alg.pptx
machine learning alg.pptx
NANDHINIS109942
 
MACHINE LEARNING.pptx
MACHINE LEARNING.pptxMACHINE LEARNING.pptx
MACHINE LEARNING.pptx
NANDHINIS109942
 
parallel processing.ppt
parallel processing.pptparallel processing.ppt
parallel processing.ppt
NANDHINIS109942
 

More from NANDHINIS109942 (6)

machine learning alg.pptx
machine learning alg.pptxmachine learning alg.pptx
machine learning alg.pptx
 
ml.pptx
ml.pptxml.pptx
ml.pptx
 
MACHINE LEARNING.pptx
MACHINE LEARNING.pptxMACHINE LEARNING.pptx
MACHINE LEARNING.pptx
 
parallel processing.ppt
parallel processing.pptparallel processing.ppt
parallel processing.ppt
 
5.pptx
5.pptx5.pptx
5.pptx
 
5g tech.pptx
5g tech.pptx5g tech.pptx
5g tech.pptx
 

Recently uploaded

The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
Vivekanand Anglo Vedic Academy
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS Module
Celine George
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
kaushalkr1407
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
The Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve ThomasonThe Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve Thomason
Steve Thomason
 
Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)
rosedainty
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
EduSkills OECD
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
bennyroshan06
 
Ethnobotany and Ethnopharmacology ......
Ethnobotany and Ethnopharmacology ......Ethnobotany and Ethnopharmacology ......
Ethnobotany and Ethnopharmacology ......
Ashokrao Mane college of Pharmacy Peth-Vadgaon
 
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdfESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
Fundacja Rozwoju Społeczeństwa Przedsiębiorczego
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
GeoBlogs
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumers
PedroFerreira53928
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
beazzy04
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
Jisc
 
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
Nguyen Thanh Tu Collection
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
RaedMohamed3
 

Recently uploaded (20)

The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS Module
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
The Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve ThomasonThe Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve Thomason
 
Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
 
Ethnobotany and Ethnopharmacology ......
Ethnobotany and Ethnopharmacology ......Ethnobotany and Ethnopharmacology ......
Ethnobotany and Ethnopharmacology ......
 
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdfESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumers
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
 

chapter-18-parallel-processing-multiprocessing (1).ppt

  • 2. Its All About Increasing Performance • Processor performance can be measured by the rate at which it executes instructions MIPS rate = f * IPC (Millions of Instructions per Second) — f is the processor clock frequency, in MHz — IPC the is average Instructions Per Cycle • Increase performance by —increasing clock frequency and —increasing instructions that complete during cycle – May be reaching limit  + Complexity + Power consumption
  • 5. Taxonomy of Parallel Processor Architectures
  • 6. Multiple Processor Organization • SISD - Single instruction, single data stream • SIMD - Single instruction, multiple data stream • MISD - Multiple instruction, single data stream • MIMD - Multiple instruction, multiple data stream
  • 7. SISD - Single Instruction, Single Data Stream • Single processor • Single instruction stream • Data stored in single memory • Uni-processor Control Unit Processing Unit Memory Unit
  • 8. SIMD - Single Instruction, Multiple Data Stream • Single machine instruction • Number of processing elements • Each processing element has associated data memory • Each instruction simultaneously executed on different set of data by different processors Typical Application - Vector and Array processors
  • 9. MISD Multiple Instruction, Single Data Stream • One sequence of data • A set of processors • Each processor executes different instruction sequence Not much practical application
  • 10. MIMD - Multiple Instruction, Multiple Data Stream • Set of processors • Simultaneously execute different instruction sequences • Different sets of data — SMPs (Symmetric Multiprocessors) — NUMA systems (Non-uniform Memory Access) — Clusters (Groups of “partnering” computers) Shared memory (SMP or NUMA) Distributed memory (Clusters)
  • 11. MIMD - Overview • General purpose processors • Each can process all instructions necessary • Further classified by method of processor communication & memory access
  • 12. MIMD - Tightly Coupled • Processors share memory • Communicate via that shared memory Symmetric Multiprocessor (SMP) - Share single memory or pool - Shared bus to access memory - Memory access time to given area of memory is approximately the same for each processor Nonuniform memory access (NUMA) - Access times to different regions of memory may differ
  • 13. Block Diagram of Tightly Coupled Multiprocessor
  • 14. MIMD - Loosely Coupled Clusters • Collection of independent uniprocessors • Interconnected to form a cluster • Communication via fixed path or network connections
  • 15. Symmetric Multiprocessors (SMP) • A stand alone “computer” with the following characteristics: —Two or more similar processors of comparable capacity – All processors can perform the same functions (hence symmetric) —Processors share same memory and I/O access – Memory access time is approximately the same for each processor (time shared bus or multi-port memory) —Processors are connected by a bus or other internal connection —System controlled by integrated operating system – Providing interaction between processors – Providing interaction at job, task, file and data element levels
  • 16. SMP Advantages • Performance — If some work can be done in parallel • Availability — Since all processors can perform the same functions, failure of a single processor does not halt the system • Incremental growth — User can enhance performance by adding additional processors • Scaling — Vendors can offer range of products based on number of processors
  • 18. Time Shared Bus (vs Multiport memory) • Simplest form • Structure and interface similar to single processor system • Following features provided —Addressing - distinguish modules on bus —Arbitration - any module can be temporary master —Time sharing - if one module has the bus, others must wait and may have to suspend
  • 19. Time Share Bus - Advantages Advantages: • Simplicity • Flexibility • Reliability Disadvantages • Performance limited by bus cycle time • Each processor must have local cache — Reduce number of bus accesses • Leads to problems with cache coherence
  • 20. Operating System Issues • Simultaneous concurrent processes • Scheduling • Synchronization • Memory management • Reliability and fault tolerance • Cache Coherence
  • 21. Cache Coherence • Problem - multiple copies of same data in different caches • Can result in an inconsistent view of memory —Write back policy can lead to inconsistency —Write through can also give problems unless caches monitor memory traffic  MESI Protocol (Modify - Exclusive - Shared - Invalid )
  • 22. Software Solution to Cache Coherence Compiler and operating system deal with problem • Overhead transferred to compile time • Design complexity transferred from hardware to software — Analyze code to determine safe periods for caching shared variables — However, software tends to (must) make conservative decisions — Inefficient cache utilization
  • 23. Hardware Solution to Cache Coherence Cache coherence hardware protocols • Dynamic recognition of potential problems • Run time solution —More efficient use of cache • Transparent to programmer / Compiler Implemented with: • Directory protocols • Snoopy protocols
  • 24. Directory & Snoopy Protocols Directory Protocols Effective in large scale systems with complex interconnection schemes • Collect and maintain information about copies of data in cache — Directory stored in main memory • Requests are checked against directory — Appropriate transfers are performed Creates central bottleneck Snoopy Protocols Suited to bus based multiprocessor • Distribute cache coherence responsibility among cache controllers • Cache recognizes that a line is shared • Updates announced to other caches Increases bus traffic
  • 25. Snoopy Protocols • Write Update Protocol (Write Broadcast) —Multiple readers and writers —Updated word is distributed to all other processors —Multiple readers, one writer • Write Invalidate protocol (MESI) —When a write is required, all other caches of the line are invalidated —Writing processor then has exclusive (cheap) access until line is required by another processor MESI Protocol - State of every line is marked as Modified, Exclusive, Shared or Invalid - two bits are included with each cache tag
  • 26. Processor Designs • Pipelined ALU —Within operations —Across operations • Parallel ALUs • Parallel processors