SlideShare a Scribd company logo
1 of 10
1
Bottomline
• Can’t escape multi-cores today: it is the baseline
architecture
• Performance stagnates unless we learn to transform
traditional applications into parallel threads
• It’s all about the data!
Data management: distribution, coherence, consistency
• It’s also about the programming model: onus on
application writer / compiler / hardware
• It’s also about managing on-chip communication
2
Symmetric Multiprocessors (SMP)
• A collection of processors, a collection of memory: both
are connected through some interconnect (usually, the
fastest possible)
• Symmetric because latency for any processor to access
any memory is constant – uniform memory access (UMA)
Proc 1 Proc 2 Proc 3 Proc 4
Mem 1 Mem 2 Mem 3 Mem 4
3
Distributed Memory Multiprocessors
• Each processor has local memory that is accessible
through a fast interconnect
• The different nodes are connected as I/O devices with
(potentially) slower interconnect
• Local memory access is a lot faster than remote memory
– non-uniform memory access (NUMA)
• Advantage: can be built with commodity processors and
many applications will perform well thanks to locality
Proc 1 Mem 1 Proc 2 Mem 2 Proc 3 Mem 3 Proc 4 Mem 4
4
Shared Memory Architectures
• Key differentiating feature: the address space is shared,
i.e., any processor can directly address any memory
location and access them with load/store instructions
• Cooperation is similar to a bulletin board – a processor
writes to a location and that location is visible to reads
by other threads
5
Shared Address Space
Shared
Private
Private
Private
Process P1
Process P2
Process P3
Shared
Shared
Shared
Pvt P1
Pvt P2
Pvt P3
Virtual address space
of each process
Physical address space
6
Message Passing
• Programming model that can apply to clusters of workstations, SMPs,
and even a uniprocessor
• Sends and receives are used for effecting the data transfer – usually,
each process ends up making a copy of data that is relevant to it
• Each process can only name local addresses, other processes, and
a tag to help distinguish between multiple messages
• A send-receive match is a synchronization event – hence, we no
longer need locks or barriers to co-ordinate
7
Models for SEND and RECEIVE
• Synchronous: SEND returns control back to the program
only when the RECEIVE has completed
• Blocking Asynchronous: SEND returns control back to the
program after the OS has copied the message into its space
-- the program can now modify the sent data structure
• Nonblocking Asynchronous: SEND and RECEIVE return
control immediately – the message will get copied at some
point, so the process must overlap some other computation
with the communication – other primitives are used to
probe if the communication has finished or not
8
Deterministic Execution
• Need synch after every anti-diagonal
• Potential load imbalance
• Shared-memory vs. message passing
• Function of the model for SEND-RECEIVE
• Function of the algorithm: diagonal, red-black ordering
9
Cache Coherence
A multiprocessor system is cache coherent if
• a value written by a processor is eventually visible to
reads by other processors – write propagation
• two writes to the same location by two processors are
seen in the same order by all processors – write
serialization
10
Cache Coherence Protocols
• Directory-based: A single location (directory) keeps track
of the sharing status of a block of memory
• Snooping: Every cache block is accompanied by the sharing
status of that block – all cache controllers monitor the
shared bus so they can update the sharing status of the
block, if necessary
 Write-invalidate: a processor gains exclusive access of
a block before writing by invalidating all other copies
 Write-update: when a processor writes, it updates other
shared copies of that block

More Related Content

What's hot

Parallel architecture
Parallel architectureParallel architecture
Parallel architectureMr SMAK
 
Parallel architecture-programming
Parallel architecture-programmingParallel architecture-programming
Parallel architecture-programmingShaveta Banda
 
Parallel Processing Presentation2
Parallel Processing Presentation2Parallel Processing Presentation2
Parallel Processing Presentation2daniyalqureshi712
 
Introduction to parallel processing
Introduction to parallel processingIntroduction to parallel processing
Introduction to parallel processingPage Maker
 
Multithreaded processors ppt
Multithreaded processors pptMultithreaded processors ppt
Multithreaded processors pptSiddhartha Anand
 
Multithreading computer architecture
 Multithreading computer architecture  Multithreading computer architecture
Multithreading computer architecture Haris456
 
Hardware multithreading
Hardware multithreadingHardware multithreading
Hardware multithreadingFraboni Ec
 
What is simultaneous multithreading
What is simultaneous multithreadingWhat is simultaneous multithreading
What is simultaneous multithreadingFraboni Ec
 
Dichotomy of parallel computing platforms
Dichotomy of parallel computing platformsDichotomy of parallel computing platforms
Dichotomy of parallel computing platformsSyed Zaid Irshad
 
Superscalar & superpipeline processor
Superscalar & superpipeline processorSuperscalar & superpipeline processor
Superscalar & superpipeline processorMuhammad Ishaq
 
Cache coherence
Cache coherenceCache coherence
Cache coherenceEmployee
 
Client-centric Consistency Models
Client-centric Consistency ModelsClient-centric Consistency Models
Client-centric Consistency ModelsEnsar Basri Kahveci
 
Parallel computing
Parallel computingParallel computing
Parallel computingvirend111
 

What's hot (20)

Memory models
Memory modelsMemory models
Memory models
 
Parallel architecture
Parallel architectureParallel architecture
Parallel architecture
 
Parallel architecture-programming
Parallel architecture-programmingParallel architecture-programming
Parallel architecture-programming
 
Parallel processing extra
Parallel processing extraParallel processing extra
Parallel processing extra
 
Parallel processing
Parallel processingParallel processing
Parallel processing
 
Parallel processing
Parallel processingParallel processing
Parallel processing
 
Parallel Processing Presentation2
Parallel Processing Presentation2Parallel Processing Presentation2
Parallel Processing Presentation2
 
Cache coherence
Cache coherenceCache coherence
Cache coherence
 
Cache coherence
Cache coherenceCache coherence
Cache coherence
 
Introduction to parallel processing
Introduction to parallel processingIntroduction to parallel processing
Introduction to parallel processing
 
Scope of parallelism
Scope of parallelismScope of parallelism
Scope of parallelism
 
Multithreaded processors ppt
Multithreaded processors pptMultithreaded processors ppt
Multithreaded processors ppt
 
Multithreading computer architecture
 Multithreading computer architecture  Multithreading computer architecture
Multithreading computer architecture
 
Hardware multithreading
Hardware multithreadingHardware multithreading
Hardware multithreading
 
What is simultaneous multithreading
What is simultaneous multithreadingWhat is simultaneous multithreading
What is simultaneous multithreading
 
Dichotomy of parallel computing platforms
Dichotomy of parallel computing platformsDichotomy of parallel computing platforms
Dichotomy of parallel computing platforms
 
Superscalar & superpipeline processor
Superscalar & superpipeline processorSuperscalar & superpipeline processor
Superscalar & superpipeline processor
 
Cache coherence
Cache coherenceCache coherence
Cache coherence
 
Client-centric Consistency Models
Client-centric Consistency ModelsClient-centric Consistency Models
Client-centric Consistency Models
 
Parallel computing
Parallel computingParallel computing
Parallel computing
 

Viewers also liked

Genatic Algorithm
Genatic AlgorithmGenatic Algorithm
Genatic AlgorithmYasir Khan
 
Vo gay philipin
Vo gay philipinVo gay philipin
Vo gay philipinNgo Kim Du
 
Frstorder 9 sldes read
Frstorder 9 sldes readFrstorder 9 sldes read
Frstorder 9 sldes readYasir Khan
 
Pringkat pringkatperkembangankanak-kanakdanteori-teoriperkembanganyangberkait...
Pringkat pringkatperkembangankanak-kanakdanteori-teoriperkembanganyangberkait...Pringkat pringkatperkembangankanak-kanakdanteori-teoriperkembanganyangberkait...
Pringkat pringkatperkembangankanak-kanakdanteori-teoriperkembanganyangberkait...Rozaidi Yusof
 
Dir based imp_5
Dir based imp_5Dir based imp_5
Dir based imp_5Yasir Khan
 
computer graphics at openGL (2)
computer graphics at openGL (2)computer graphics at openGL (2)
computer graphics at openGL (2)Yasir Khan
 
Cs ps, sat, fol resolution strategies
Cs ps, sat, fol resolution strategiesCs ps, sat, fol resolution strategies
Cs ps, sat, fol resolution strategiesYasir Khan
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingYasir Khan
 
Information System Architecture and Audit Control Lecture 2
Information System Architecture and Audit Control Lecture 2Information System Architecture and Audit Control Lecture 2
Information System Architecture and Audit Control Lecture 2Yasir Khan
 
36760501 teori-dan-model-an-awal-kanak-kank
36760501 teori-dan-model-an-awal-kanak-kank36760501 teori-dan-model-an-awal-kanak-kank
36760501 teori-dan-model-an-awal-kanak-kankRozaidi Yusof
 
Knowledge Representation in Artificial intelligence
Knowledge Representation in Artificial intelligence Knowledge Representation in Artificial intelligence
Knowledge Representation in Artificial intelligence Yasir Khan
 

Viewers also liked (18)

Hpc 2
Hpc 2Hpc 2
Hpc 2
 
Genatic Algorithm
Genatic AlgorithmGenatic Algorithm
Genatic Algorithm
 
M6 game
M6 gameM6 game
M6 game
 
Hpc 6 7
Hpc 6 7Hpc 6 7
Hpc 6 7
 
M2 agents
M2 agentsM2 agents
M2 agents
 
Vo gay philipin
Vo gay philipinVo gay philipin
Vo gay philipin
 
Uncertainity
Uncertainity Uncertainity
Uncertainity
 
Hpc 4 5
Hpc 4 5Hpc 4 5
Hpc 4 5
 
Frstorder 9 sldes read
Frstorder 9 sldes readFrstorder 9 sldes read
Frstorder 9 sldes read
 
Pringkat pringkatperkembangankanak-kanakdanteori-teoriperkembanganyangberkait...
Pringkat pringkatperkembangankanak-kanakdanteori-teoriperkembanganyangberkait...Pringkat pringkatperkembangankanak-kanakdanteori-teoriperkembanganyangberkait...
Pringkat pringkatperkembangankanak-kanakdanteori-teoriperkembanganyangberkait...
 
Dir based imp_5
Dir based imp_5Dir based imp_5
Dir based imp_5
 
computer graphics at openGL (2)
computer graphics at openGL (2)computer graphics at openGL (2)
computer graphics at openGL (2)
 
C language
C languageC language
C language
 
Cs ps, sat, fol resolution strategies
Cs ps, sat, fol resolution strategiesCs ps, sat, fol resolution strategies
Cs ps, sat, fol resolution strategies
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Information System Architecture and Audit Control Lecture 2
Information System Architecture and Audit Control Lecture 2Information System Architecture and Audit Control Lecture 2
Information System Architecture and Audit Control Lecture 2
 
36760501 teori-dan-model-an-awal-kanak-kank
36760501 teori-dan-model-an-awal-kanak-kank36760501 teori-dan-model-an-awal-kanak-kank
36760501 teori-dan-model-an-awal-kanak-kank
 
Knowledge Representation in Artificial intelligence
Knowledge Representation in Artificial intelligence Knowledge Representation in Artificial intelligence
Knowledge Representation in Artificial intelligence
 

Similar to Introduction 1

247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
Multiprocessor.pptx
 Multiprocessor.pptx Multiprocessor.pptx
Multiprocessor.pptxMuhammad54342
 
Parallel & Distributed processing
Parallel & Distributed processingParallel & Distributed processing
Parallel & Distributed processingSyed Zaid Irshad
 
Multiprocessor_YChen.ppt
Multiprocessor_YChen.pptMultiprocessor_YChen.ppt
Multiprocessor_YChen.pptAberaZeleke1
 
Module2 MultiThreads.ppt
Module2 MultiThreads.pptModule2 MultiThreads.ppt
Module2 MultiThreads.pptshreesha16
 
Lecture 2
Lecture 2Lecture 2
Lecture 2Mr SMAK
 
18 parallel processing
18 parallel processing18 parallel processing
18 parallel processingdilip kumar
 
message passing vs shared memory
message passing vs shared memorymessage passing vs shared memory
message passing vs shared memoryHamza Zahid
 
Classification of Parallel Computers.pptx
Classification of Parallel Computers.pptxClassification of Parallel Computers.pptx
Classification of Parallel Computers.pptxNeeraj Singh
 
Lecture-7 Main Memroy.pptx
Lecture-7 Main Memroy.pptxLecture-7 Main Memroy.pptx
Lecture-7 Main Memroy.pptxAmanuelmergia
 
Multicore and shared multi processor
Multicore and shared multi processorMulticore and shared multi processor
Multicore and shared multi processorSou Jana
 
Multicore and shared multi processor
Multicore and shared multi processorMulticore and shared multi processor
Multicore and shared multi processorJanani S
 

Similar to Introduction 1 (20)

247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
Multiprocessor.pptx
 Multiprocessor.pptx Multiprocessor.pptx
Multiprocessor.pptx
 
22CS201 COA
22CS201 COA22CS201 COA
22CS201 COA
 
unit 4.pptx
unit 4.pptxunit 4.pptx
unit 4.pptx
 
unit 4.pptx
unit 4.pptxunit 4.pptx
unit 4.pptx
 
Lecture5
Lecture5Lecture5
Lecture5
 
Parallel & Distributed processing
Parallel & Distributed processingParallel & Distributed processing
Parallel & Distributed processing
 
Multiprocessor_YChen.ppt
Multiprocessor_YChen.pptMultiprocessor_YChen.ppt
Multiprocessor_YChen.ppt
 
Module2 MultiThreads.ppt
Module2 MultiThreads.pptModule2 MultiThreads.ppt
Module2 MultiThreads.ppt
 
High performance computing
High performance computingHigh performance computing
High performance computing
 
Snooping 2
Snooping 2Snooping 2
Snooping 2
 
parallel-processing.ppt
parallel-processing.pptparallel-processing.ppt
parallel-processing.ppt
 
Lecture 2
Lecture 2Lecture 2
Lecture 2
 
18 parallel processing
18 parallel processing18 parallel processing
18 parallel processing
 
message passing vs shared memory
message passing vs shared memorymessage passing vs shared memory
message passing vs shared memory
 
Classification of Parallel Computers.pptx
Classification of Parallel Computers.pptxClassification of Parallel Computers.pptx
Classification of Parallel Computers.pptx
 
Lecture-7 Main Memroy.pptx
Lecture-7 Main Memroy.pptxLecture-7 Main Memroy.pptx
Lecture-7 Main Memroy.pptx
 
CA UNIT IV.pptx
CA UNIT IV.pptxCA UNIT IV.pptx
CA UNIT IV.pptx
 
Multicore and shared multi processor
Multicore and shared multi processorMulticore and shared multi processor
Multicore and shared multi processor
 
Multicore and shared multi processor
Multicore and shared multi processorMulticore and shared multi processor
Multicore and shared multi processor
 

More from Yasir Khan (19)

Lecture 6
Lecture 6Lecture 6
Lecture 6
 
Lecture 4
Lecture 4Lecture 4
Lecture 4
 
Lecture 3
Lecture 3Lecture 3
Lecture 3
 
Lecture 2
Lecture 2Lecture 2
Lecture 2
 
Lec#1
Lec#1Lec#1
Lec#1
 
Ch10 (1)
Ch10 (1)Ch10 (1)
Ch10 (1)
 
Ch09
Ch09Ch09
Ch09
 
Ch05
Ch05Ch05
Ch05
 
Snooping protocols 3
Snooping protocols 3Snooping protocols 3
Snooping protocols 3
 
Hpc sys
Hpc sysHpc sys
Hpc sys
 
Hpc 3
Hpc 3Hpc 3
Hpc 3
 
Hpc 1
Hpc 1Hpc 1
Hpc 1
 
Flynns classification
Flynns classificationFlynns classification
Flynns classification
 
Logic
LogicLogic
Logic
 
M4 heuristics
M4 heuristicsM4 heuristics
M4 heuristics
 
M3 search
M3 searchM3 search
M3 search
 
M1 intro
M1 introM1 intro
M1 intro
 
Expert system 21 sldes
Expert system 21 sldesExpert system 21 sldes
Expert system 21 sldes
 
AI Robotics
AI RoboticsAI Robotics
AI Robotics
 

Recently uploaded

How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxCeline George
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...pradhanghanshyam7136
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxDr. Ravikiran H M Gowda
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxDr. Sarita Anand
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17Celine George
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSCeline George
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024Elizabeth Walsh
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structuredhanjurrannsibayan2
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxPooja Bhuva
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseAnaAcapella
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...Nguyen Thanh Tu Collection
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the ClassroomPooky Knightsmith
 

Recently uploaded (20)

How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 

Introduction 1

  • 1. 1 Bottomline • Can’t escape multi-cores today: it is the baseline architecture • Performance stagnates unless we learn to transform traditional applications into parallel threads • It’s all about the data! Data management: distribution, coherence, consistency • It’s also about the programming model: onus on application writer / compiler / hardware • It’s also about managing on-chip communication
  • 2. 2 Symmetric Multiprocessors (SMP) • A collection of processors, a collection of memory: both are connected through some interconnect (usually, the fastest possible) • Symmetric because latency for any processor to access any memory is constant – uniform memory access (UMA) Proc 1 Proc 2 Proc 3 Proc 4 Mem 1 Mem 2 Mem 3 Mem 4
  • 3. 3 Distributed Memory Multiprocessors • Each processor has local memory that is accessible through a fast interconnect • The different nodes are connected as I/O devices with (potentially) slower interconnect • Local memory access is a lot faster than remote memory – non-uniform memory access (NUMA) • Advantage: can be built with commodity processors and many applications will perform well thanks to locality Proc 1 Mem 1 Proc 2 Mem 2 Proc 3 Mem 3 Proc 4 Mem 4
  • 4. 4 Shared Memory Architectures • Key differentiating feature: the address space is shared, i.e., any processor can directly address any memory location and access them with load/store instructions • Cooperation is similar to a bulletin board – a processor writes to a location and that location is visible to reads by other threads
  • 5. 5 Shared Address Space Shared Private Private Private Process P1 Process P2 Process P3 Shared Shared Shared Pvt P1 Pvt P2 Pvt P3 Virtual address space of each process Physical address space
  • 6. 6 Message Passing • Programming model that can apply to clusters of workstations, SMPs, and even a uniprocessor • Sends and receives are used for effecting the data transfer – usually, each process ends up making a copy of data that is relevant to it • Each process can only name local addresses, other processes, and a tag to help distinguish between multiple messages • A send-receive match is a synchronization event – hence, we no longer need locks or barriers to co-ordinate
  • 7. 7 Models for SEND and RECEIVE • Synchronous: SEND returns control back to the program only when the RECEIVE has completed • Blocking Asynchronous: SEND returns control back to the program after the OS has copied the message into its space -- the program can now modify the sent data structure • Nonblocking Asynchronous: SEND and RECEIVE return control immediately – the message will get copied at some point, so the process must overlap some other computation with the communication – other primitives are used to probe if the communication has finished or not
  • 8. 8 Deterministic Execution • Need synch after every anti-diagonal • Potential load imbalance • Shared-memory vs. message passing • Function of the model for SEND-RECEIVE • Function of the algorithm: diagonal, red-black ordering
  • 9. 9 Cache Coherence A multiprocessor system is cache coherent if • a value written by a processor is eventually visible to reads by other processors – write propagation • two writes to the same location by two processors are seen in the same order by all processors – write serialization
  • 10. 10 Cache Coherence Protocols • Directory-based: A single location (directory) keeps track of the sharing status of a block of memory • Snooping: Every cache block is accompanied by the sharing status of that block – all cache controllers monitor the shared bus so they can update the sharing status of the block, if necessary  Write-invalidate: a processor gains exclusive access of a block before writing by invalidating all other copies  Write-update: when a processor writes, it updates other shared copies of that block