SlideShare a Scribd company logo
1 of 26
Chapter 18
Parallel Processing
(Multiprocessing)
Its All About Increasing Performance
• Processor performance can be measured by the
rate at which it executes instructions
MIPS rate = f * IPC (Millions of Instructions per Second)
— f is the processor clock frequency, in MHz
— IPC the is average Instructions Per Cycle
• Increase performance by
—increasing clock frequency and
—increasing instructions that complete during cycle
– May be reaching limit 
+ Complexity
+ Power consumption
Computer Organizations
Multiprogramming and Multiprocessing
Taxonomy of Parallel Processor Architectures
Multiple Processor Organization
• SISD - Single instruction, single data stream
• SIMD - Single instruction, multiple data stream
• MISD - Multiple instruction, single data stream
• MIMD - Multiple instruction, multiple data stream
SISD - Single Instruction, Single Data Stream
• Single processor
• Single instruction stream
• Data stored in single memory
• Uni-processor
Control Unit Processing Unit Memory Unit
SIMD - Single Instruction, Multiple Data Stream
• Single machine instruction
• Number of processing elements
• Each processing element has associated data memory
• Each instruction simultaneously executed on
different set of data by different processors
Typical Application - Vector and Array processors
MISD Multiple Instruction, Single Data Stream
• One sequence of data
• A set of processors
• Each processor executes different
instruction sequence
Not much practical application
MIMD - Multiple Instruction, Multiple Data Stream
• Set of processors
• Simultaneously execute different instruction sequences
• Different sets of data
— SMPs (Symmetric Multiprocessors)
— NUMA systems (Non-uniform Memory Access)
— Clusters (Groups of “partnering” computers)
Shared memory (SMP or NUMA) Distributed memory (Clusters)
MIMD - Overview
• General purpose processors
• Each can process all instructions
necessary
• Further classified by method of processor
communication & memory access
MIMD - Tightly Coupled
• Processors share memory
• Communicate via that shared memory
Symmetric Multiprocessor (SMP)
- Share single memory or pool
- Shared bus to access memory
- Memory access time to given area of memory is
approximately the same for each processor
Nonuniform memory access (NUMA)
- Access times to different regions of memory may differ
Block Diagram of Tightly Coupled Multiprocessor
MIMD - Loosely Coupled
Clusters
• Collection of independent
uniprocessors
• Interconnected to form a cluster
• Communication via fixed path or network
connections
Symmetric Multiprocessors (SMP)
• A stand alone “computer” with the following
characteristics:
—Two or more similar processors of comparable
capacity
– All processors can perform the same functions (hence
symmetric)
—Processors share same memory and I/O access
– Memory access time is approximately the same for each
processor (time shared bus or multi-port memory)
—Processors are connected by a bus or other internal
connection
—System controlled by integrated operating system
– Providing interaction between processors
– Providing interaction at job, task, file and data element
levels
SMP Advantages
• Performance
— If some work can be done in parallel
• Availability
— Since all processors can perform the same
functions, failure of a single processor
does not halt the system
• Incremental growth
— User can enhance performance by adding
additional processors
• Scaling
— Vendors can offer range of products based
on number of processors
Symmetric Multiprocessor Organization
Time Shared Bus (vs Multiport memory)
• Simplest form
• Structure and interface similar to single
processor system
• Following features provided
—Addressing - distinguish modules on bus
—Arbitration - any module can be temporary master
—Time sharing - if one module has the bus, others must
wait and may have to suspend
Time Share Bus - Advantages
Advantages:
• Simplicity
• Flexibility
• Reliability
Disadvantages
• Performance limited by bus cycle time
• Each processor must have local cache
— Reduce number of bus accesses
• Leads to problems with cache coherence
Operating System Issues
• Simultaneous concurrent processes
• Scheduling
• Synchronization
• Memory management
• Reliability and fault tolerance
• Cache Coherence
Cache Coherence
• Problem - multiple copies of same data
in different caches
• Can result in an inconsistent view of
memory
—Write back policy can lead to inconsistency
—Write through can also give problems unless
caches monitor memory traffic
 MESI Protocol (Modify - Exclusive - Shared - Invalid )
Software Solution to Cache Coherence
Compiler and operating system deal with
problem
• Overhead transferred to compile time
• Design complexity transferred from
hardware to software
— Analyze code to determine safe periods for
caching shared variables
— However, software tends to (must) make
conservative decisions
— Inefficient cache utilization
Hardware Solution to Cache Coherence
Cache coherence hardware protocols
• Dynamic recognition of potential
problems
• Run time solution
—More efficient use of cache
• Transparent to programmer / Compiler
Implemented with:
• Directory protocols
• Snoopy protocols
Directory & Snoopy Protocols
Directory Protocols
Effective in large scale systems with complex interconnection
schemes
• Collect and maintain information about copies of data in cache
— Directory stored in main memory
• Requests are checked against directory
— Appropriate transfers are performed
Creates central bottleneck
Snoopy Protocols
Suited to bus based multiprocessor
• Distribute cache coherence responsibility among cache
controllers
• Cache recognizes that a line is shared
• Updates announced to other caches
Increases bus traffic
Snoopy Protocols
• Write Update Protocol (Write Broadcast)
—Multiple readers and writers
—Updated word is distributed to all other processors
—Multiple readers, one writer
• Write Invalidate protocol (MESI)
—When a write is required, all other caches of the line
are invalidated
—Writing processor then has exclusive (cheap) access
until line is required by another processor
MESI Protocol - State of every line is marked as Modified,
Exclusive, Shared or Invalid
- two bits are included with each cache tag
Processor Designs
• Pipelined ALU
—Within operations
—Across operations
• Parallel ALUs
• Parallel processors

More Related Content

Similar to parallel processing.ppt

Similar to parallel processing.ppt (20)

unit 4.pptx
unit 4.pptxunit 4.pptx
unit 4.pptx
 
Parallel processing Concepts
Parallel processing ConceptsParallel processing Concepts
Parallel processing Concepts
 
Lecture4
Lecture4Lecture4
Lecture4
 
High performance computing
High performance computingHigh performance computing
High performance computing
 
22CS201 COA
22CS201 COA22CS201 COA
22CS201 COA
 
CA UNIT IV.pptx
CA UNIT IV.pptxCA UNIT IV.pptx
CA UNIT IV.pptx
 
Multiprocessor.pptx
 Multiprocessor.pptx Multiprocessor.pptx
Multiprocessor.pptx
 
BIL406-Chapter-2-Classifications of Parallel Systems.ppt
BIL406-Chapter-2-Classifications of Parallel Systems.pptBIL406-Chapter-2-Classifications of Parallel Systems.ppt
BIL406-Chapter-2-Classifications of Parallel Systems.ppt
 
EMBEDDED OS
EMBEDDED OSEMBEDDED OS
EMBEDDED OS
 
Multi processing
Multi processingMulti processing
Multi processing
 
Aca module 1
Aca module 1Aca module 1
Aca module 1
 
Introduction to parallel processing
Introduction to parallel processingIntroduction to parallel processing
Introduction to parallel processing
 
CSA unit5.pptx
CSA unit5.pptxCSA unit5.pptx
CSA unit5.pptx
 
OS_MD_4.pdf
OS_MD_4.pdfOS_MD_4.pdf
OS_MD_4.pdf
 
Classification of Parallel Computers.pptx
Classification of Parallel Computers.pptxClassification of Parallel Computers.pptx
Classification of Parallel Computers.pptx
 
Advanced processor Principles
Advanced processor PrinciplesAdvanced processor Principles
Advanced processor Principles
 
Memory Management in Operating Systems for all
Memory Management in Operating Systems for allMemory Management in Operating Systems for all
Memory Management in Operating Systems for all
 
Intro_ppt.pptx
Intro_ppt.pptxIntro_ppt.pptx
Intro_ppt.pptx
 
Cache Performance Evaluation
Cache Performance EvaluationCache Performance Evaluation
Cache Performance Evaluation
 
archintro.pdf
archintro.pdfarchintro.pdf
archintro.pdf
 

More from NANDHINIS109942

More from NANDHINIS109942 (6)

machine learning alg.pptx
machine learning alg.pptxmachine learning alg.pptx
machine learning alg.pptx
 
ml.pptx
ml.pptxml.pptx
ml.pptx
 
MACHINE LEARNING.pptx
MACHINE LEARNING.pptxMACHINE LEARNING.pptx
MACHINE LEARNING.pptx
 
chapter-18-parallel-processing-multiprocessing (1).ppt
chapter-18-parallel-processing-multiprocessing (1).pptchapter-18-parallel-processing-multiprocessing (1).ppt
chapter-18-parallel-processing-multiprocessing (1).ppt
 
5.pptx
5.pptx5.pptx
5.pptx
 
5g tech.pptx
5g tech.pptx5g tech.pptx
5g tech.pptx
 

Recently uploaded

The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 

Recently uploaded (20)

TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 

parallel processing.ppt

  • 2. Its All About Increasing Performance • Processor performance can be measured by the rate at which it executes instructions MIPS rate = f * IPC (Millions of Instructions per Second) — f is the processor clock frequency, in MHz — IPC the is average Instructions Per Cycle • Increase performance by —increasing clock frequency and —increasing instructions that complete during cycle – May be reaching limit  + Complexity + Power consumption
  • 5. Taxonomy of Parallel Processor Architectures
  • 6. Multiple Processor Organization • SISD - Single instruction, single data stream • SIMD - Single instruction, multiple data stream • MISD - Multiple instruction, single data stream • MIMD - Multiple instruction, multiple data stream
  • 7. SISD - Single Instruction, Single Data Stream • Single processor • Single instruction stream • Data stored in single memory • Uni-processor Control Unit Processing Unit Memory Unit
  • 8. SIMD - Single Instruction, Multiple Data Stream • Single machine instruction • Number of processing elements • Each processing element has associated data memory • Each instruction simultaneously executed on different set of data by different processors Typical Application - Vector and Array processors
  • 9. MISD Multiple Instruction, Single Data Stream • One sequence of data • A set of processors • Each processor executes different instruction sequence Not much practical application
  • 10. MIMD - Multiple Instruction, Multiple Data Stream • Set of processors • Simultaneously execute different instruction sequences • Different sets of data — SMPs (Symmetric Multiprocessors) — NUMA systems (Non-uniform Memory Access) — Clusters (Groups of “partnering” computers) Shared memory (SMP or NUMA) Distributed memory (Clusters)
  • 11. MIMD - Overview • General purpose processors • Each can process all instructions necessary • Further classified by method of processor communication & memory access
  • 12. MIMD - Tightly Coupled • Processors share memory • Communicate via that shared memory Symmetric Multiprocessor (SMP) - Share single memory or pool - Shared bus to access memory - Memory access time to given area of memory is approximately the same for each processor Nonuniform memory access (NUMA) - Access times to different regions of memory may differ
  • 13. Block Diagram of Tightly Coupled Multiprocessor
  • 14. MIMD - Loosely Coupled Clusters • Collection of independent uniprocessors • Interconnected to form a cluster • Communication via fixed path or network connections
  • 15. Symmetric Multiprocessors (SMP) • A stand alone “computer” with the following characteristics: —Two or more similar processors of comparable capacity – All processors can perform the same functions (hence symmetric) —Processors share same memory and I/O access – Memory access time is approximately the same for each processor (time shared bus or multi-port memory) —Processors are connected by a bus or other internal connection —System controlled by integrated operating system – Providing interaction between processors – Providing interaction at job, task, file and data element levels
  • 16. SMP Advantages • Performance — If some work can be done in parallel • Availability — Since all processors can perform the same functions, failure of a single processor does not halt the system • Incremental growth — User can enhance performance by adding additional processors • Scaling — Vendors can offer range of products based on number of processors
  • 18. Time Shared Bus (vs Multiport memory) • Simplest form • Structure and interface similar to single processor system • Following features provided —Addressing - distinguish modules on bus —Arbitration - any module can be temporary master —Time sharing - if one module has the bus, others must wait and may have to suspend
  • 19. Time Share Bus - Advantages Advantages: • Simplicity • Flexibility • Reliability Disadvantages • Performance limited by bus cycle time • Each processor must have local cache — Reduce number of bus accesses • Leads to problems with cache coherence
  • 20. Operating System Issues • Simultaneous concurrent processes • Scheduling • Synchronization • Memory management • Reliability and fault tolerance • Cache Coherence
  • 21. Cache Coherence • Problem - multiple copies of same data in different caches • Can result in an inconsistent view of memory —Write back policy can lead to inconsistency —Write through can also give problems unless caches monitor memory traffic  MESI Protocol (Modify - Exclusive - Shared - Invalid )
  • 22. Software Solution to Cache Coherence Compiler and operating system deal with problem • Overhead transferred to compile time • Design complexity transferred from hardware to software — Analyze code to determine safe periods for caching shared variables — However, software tends to (must) make conservative decisions — Inefficient cache utilization
  • 23. Hardware Solution to Cache Coherence Cache coherence hardware protocols • Dynamic recognition of potential problems • Run time solution —More efficient use of cache • Transparent to programmer / Compiler Implemented with: • Directory protocols • Snoopy protocols
  • 24. Directory & Snoopy Protocols Directory Protocols Effective in large scale systems with complex interconnection schemes • Collect and maintain information about copies of data in cache — Directory stored in main memory • Requests are checked against directory — Appropriate transfers are performed Creates central bottleneck Snoopy Protocols Suited to bus based multiprocessor • Distribute cache coherence responsibility among cache controllers • Cache recognizes that a line is shared • Updates announced to other caches Increases bus traffic
  • 25. Snoopy Protocols • Write Update Protocol (Write Broadcast) —Multiple readers and writers —Updated word is distributed to all other processors —Multiple readers, one writer • Write Invalidate protocol (MESI) —When a write is required, all other caches of the line are invalidated —Writing processor then has exclusive (cheap) access until line is required by another processor MESI Protocol - State of every line is marked as Modified, Exclusive, Shared or Invalid - two bits are included with each cache tag
  • 26. Processor Designs • Pipelined ALU —Within operations —Across operations • Parallel ALUs • Parallel processors