SlideShare a Scribd company logo
1 of 30
Parallel and Distributed
Computing
CS3006 (BCS-6E-6F)
Lecture 04
Danyal Farhat
School of Computing, FAST NUCES Lahore
13 February, 2023
2
Parallel and Distributed Computing
Dr. Haroon Mahmood
Network Topologies for Parallel
Architectures
3
Parallel and Distributed Computing
Dr. Haroon Mahmood
Network Topologies: Buses
 Simplest and earliest parallel machines used buses
 All processors access a common bus for exchanging data
 The distance between any two nodes is O(1) in a bus. The
bus also provides a convenient broadcast media.
 The bandwidth of the shared bus is a major bottleneck
 Typical bus based machines are limited to dozens of nodes.
Sun Enterprise servers and Intel Pentium based shared-bus
multiprocessors are examples of such architectures.
4
Parallel and Distributed Computing
Dr. Haroon Mahmood
Network Topologies: Crossbar
5
Parallel and Distributed Computing
Dr. Haroon Mahmood
Network Topologies: Crossbar
 Uses an p × m grid of switches to connect p inputs to m
outputs in a non-blocking manner
 The cost of a crossbar of p processors grows as O(p2)
 This is generally difficult to scale for large values of p
 Examples of machines that employ crossbars include the
Sun Ultra HPC 10000 and the Fujitsu VPP500
 Crossbars have excellent performance scalability but poor
cost scalability
 Buses have excellent cost scalability, but poor performance
scalability
6
Parallel and Distributed Computing
Dr. Haroon Mahmood
Network Topologies: Multistage Omega Network
 One of the most commonly used multistage interconnects
is the Omega network.
 This network consists of log p stages, where p is the
number of inputs/outputs.
 At each stage, input i is connected j
7
Parallel and Distributed Computing
Dr. Haroon Mahmood
Network Topologies: Multistage Omega Network
 Each stage of the Omega network implements a perfect
shuffle as follows:
8
Parallel and Distributed Computing
Dr. Haroon Mahmood
8 x 8 Omega network
 A complete Omega network with the perfect shuffle
interconnects and switches can now be illustrated:
 An omega network has p/2 × log p switching nodes, and
the cost of such a network grows as (p log p)
9
Parallel and Distributed Computing
Dr. Haroon Mahmood
Network Topologies: Multistage Omega Network
 Machines like IBM eServer p575 and SGI Altix 4000
use -network
 -network is much more interesting for large
number of processors
 Problem: the switches have to be fast enough, and
also the width of the links is important. 16 bit
parallel is better than serial links
 Multiprocessor vector-processors use instead
crossbars (because at most only 32 processors)
 Synchronization is usually performed with special
communication registers (CPU to CPU); if there is
little synchronization shared memory is admitted
10
Parallel and Distributed Computing
Dr. Haroon Mahmood
Shared Memory Interconnection Network
 Main problem is how to do interconnections
of the CPUs to each other and to the
memory
 There are three main network topologies
available:
 Crossbar (n2 connections - datapath without
sharing)
 -network (n log2 n connections - log2 n
switching stages and shared on a path)
 Central databus (1 connections - n shared)
11
Parallel and Distributed Computing
Dr. Haroon Mahmood
Advantages of shared memory machines
 User-friendly programming environment
due to global address space
 Data sharing is fast and uniform due to
proximity (nearness) of memory to CPUs
 Memory coherence is managed by the
operating system
12
Parallel and Distributed Computing
Dr. Haroon Mahmood
Disadvantages of shared memory machines
 Performance degradation due to “memory
contention” (several processors try to access the
same memory location)
 Programmer’s responsibility for synchronization
constructs (correct access to memory)
 Expensive to design shared memory computers
with increasing numbers of processors
 Adding processors can geometrically increase
traffic on the shared memory-CPU path and for
cache coherence management
13
Parallel and Distributed Computing
Dr. Haroon Mahmood
Distributed-memory SIMD (1)
 These machines are sometimes also known as
processor-array machines
 They work in lock-step, i.e. all processors execute
the same instruction at the same time (but on
different data items); no synchronization is required
 A control processor issues the instructions that are
to be executed by the processors in the processor
array
 Processors sometimes are of a very simple bit-serial
type (i.e the processors operate on the data items
bitwise, irrespective of their type), which can
operate on operands of any length (when operands
are little, this may result in speedups)
14
Parallel and Distributed Computing
Dr. Haroon Mahmood
Distributed-memory SIMD (2)
 Graphical Processing Units (GPUs) are similar to processor
array systems
 DM-SIMD are specialized on digital signal and image
processing and on certain types of Monte Carlo simulations
with virtually no exchange between processors
 Operations that cannot be executed by the processor array
or by the control processor are off-loaded to the front-end
system
 I/O may be through:
 the front end system
 the processor array (it is more efficient in providing data directly
to the memory of the processor array)
 both
15
Parallel and Distributed Computing
Dr. Haroon Mahmood
Distributed-memory MIMD
CPU Mem
Network
CPU Mem
CPU Mem
 Processors have their own local memory
 No concept of global address space across all
processors
 Distributed memory systems require a
communication network to connect inter-
processor memory
16
Parallel and Distributed Computing
Dr. Haroon Mahmood
Examples
Shared-memory SIMD systems
 The Hitachi S3600 series
Distributed-memory SIMD systems
 The Alenia Quadrics
 The Cambridge Parallel Processing
Gamma II
 The Digital Equipment Corp. MPP series
 The MasPar MP-1
 The MasPar MP-2
Shared-memory MIMD systems
 The Cray Research Inc. Cray J90-series,
T90 series
 The Hitachi S3800 series
 The HP/Convex C4 series
 The Digital Equipment Corp.
AlphaServer
 The NEC SX-4
 The Silicon Graphics Power Challenge
 The Tera MTA
Distributed-memory MIMD systems
 The Alex AVX 2
 The Avalon A12
 The C-DAC PARAM 9000/SS
 The Cray Research Inc. T3E
 The Fujitsu AP1000
 The Fujitsu VPP300 series
 The Hitachi SR2201 series
 The HP/Convex Exemplar SPP-1200
 The IBM 9076 SP2
 The Intel Paragon XP
 The Matsushita ADENART
 The Meiko Computing Surface 2
 The nCUBE 3
 The NEC Cenju-3
 The Parallel Computing Industries
system
 The Parsys TA9000
 The Parsytec GC/Power Plus
17
Parallel and Distributed Computing
Dr. Haroon Mahmood
DM-MIMD Routing Mechanism
 Routing mechanism determines the path a message takes
through network to reach from source to destination node.
 Data and task decomposition have to be dealt explicitly!
 Topology and speed of the data paths are crucial and have
to be balanced with costs.
 Routing can be classified as:
 Minimal
 Non-minimal
 It can also be classified as:
 Deterministic routing
 Adoptive routing
18
Parallel and Distributed Computing
Dr. Haroon Mahmood
Network Topologies
Linear Arrays
 In a linear array, each node has two neighbors, one to its left
and one to its right.
 If the nodes at either end are connected, we refer to it as a
1-D torus or a ring.
Linear arrays: (a) with no wraparound links; (b) with
wraparound link.
19
Parallel and Distributed Computing
Dr. Haroon Mahmood
Network Topologies
K-d Meshes
 A generalization has nodes with 4 neighbors, to the north,
south, east, and west.
 A further generalization to d dimensions has nodes with 2d
neighbors (i.e., 6 neighbors in case of 3d cube).
Two and three dimensional meshes: (a) 2-D mesh with no
wraparound; (b) 2-D mesh with wraparound link (2-D torus); and
(c) a 3-D mesh with no wraparound.
20
Parallel and Distributed Computing
Dr. Haroon Mahmood
Hypercube
 For an hypercube with 2d nodes the number of steps to be
taken between any two nodes is at most d (logarithmic
grow)
d = log p (dimensions = log(nodes))
 The distance between any two nodes is at most log p.
 Each node has log p neighbors.
Network Topologies
21
Parallel and Distributed Computing
Dr. Haroon Mahmood
Network Topologies: hypercube
 Rule of thumb is: “d-dimensional hypercube can be
constructed by connecting corresponding nodes of two
different (d-1)-dimensional hypercubes”
22
Parallel and Distributed Computing
Dr. Haroon Mahmood
Network Topologies: hypercube
 The processors are numbered with 3-bit binary numbers
which represent the X-Y-Z coordinates
 The distance between two nodes is given by the number of
bit positions at which the two nodes differ.
23
Parallel and Distributed Computing
Dr. Haroon Mahmood
Network Topologies: Tree based Networks
 A tree network is one in which there is one path between
any pair of nodes
 Linear arrays and star-connected networks are special cases
of tree-based networks
 In static tree network, each node represent a processing
element
 In dynamic tree network, leaf nodes represent processing
element while internal nodes are switching elements.
 The source node sends the message up the tree until it
reaches the node at the root of the smallest subtree
containing both the source and destination nodes.
 Trees can be laid out in 2D with no wire crossings. This is an
attractive property of trees.
 The distance between any two nodes is no more than 2logp.
24
Parallel and Distributed Computing
Dr. Haroon Mahmood
Complete Binary Tree
Complete binary tree networks: (a) a static tree network; and (b) a
dynamic tree network.
Network Topologies: Tree based Networks
25
Parallel and Distributed Computing
Dr. Haroon Mahmood
Routing - Fat Tree
 Another topology is the “fat tree”
 In a tree, a node can speak to another node passing
through the root so we have higher traffic at the root node.
 Fat tree amends this providing more bandwidth (with
multiple connections) in the higher levels of the tree
 N-ary fat tree is when the levels towards the root have N
times the number of connections in the level below it
26
Parallel and Distributed Computing
Dr. Haroon Mahmood
Evaluating Static Inter-Connections
in-terms of Diameter, Arc-
Connectivity, Bisection Width, and
Cost.
Cut-through Routing and Cost-
performance tradeoffs
27
Parallel and Distributed Computing
Dr. Haroon Mahmood
Evaluating Static Interconnections
The parameters to evaluate a static interconnection:-
 Cost: Usually depends on number of links for communication.
E.g., cost for linear array is p-1.
 Lower values are favorable
 Diameter: The shortest distance between the farthest two
nodes in the network. The diameter of a linear array is p − 1.
 Lower values are favorable
 Bisection Width: The minimum number of wires you must cut
to divide the network into two equal parts. The bisection
width of a linear array is 1.
 What it tells about performance of a topology?
28
Parallel and Distributed Computing
Dr. Haroon Mahmood
Evaluating Static Interconnections
 Arc-connectivity: The minimum number of arcs or links that
must be removed from the network, to break the network
into two disconnected networks
 Higher value are desirable
 It is minimum number of the links that must be cut to
separate the single node from the network
 Higher values means, that incase of link failure there are
multiple other routes to the node.
 Arc-connectivity of linear array is 1 and 2 for ring.
29
Parallel and Distributed Computing
Dr. Haroon Mahmood
Evaluating Static Interconnections
30
Parallel and Distributed Computing
Dr. Haroon Mahmood
Thank You!

More Related Content

Similar to PDCLECTURE.pptx

1.NggggggggggghhhhhhhhhhS UNIT - 1.pptx.pdf
1.NggggggggggghhhhhhhhhhS UNIT - 1.pptx.pdf1.NggggggggggghhhhhhhhhhS UNIT - 1.pptx.pdf
1.NggggggggggghhhhhhhhhhS UNIT - 1.pptx.pdfsadoyah492
 
1-introduction-to-computer-networking.ppt
1-introduction-to-computer-networking.ppt1-introduction-to-computer-networking.ppt
1-introduction-to-computer-networking.pptjeronimored
 
1 introduction-to-computer-networking
1 introduction-to-computer-networking1 introduction-to-computer-networking
1 introduction-to-computer-networkingshiv balapure
 
1 introduction-to-computer-networking
1 introduction-to-computer-networking1 introduction-to-computer-networking
1 introduction-to-computer-networkingmartuj
 
1 introduction-to-computer-networking
1 introduction-to-computer-networking1 introduction-to-computer-networking
1 introduction-to-computer-networkingAnuja Lad
 
1 introduction-to-computer-networking
1 introduction-to-computer-networking1 introduction-to-computer-networking
1 introduction-to-computer-networkingKhan Rahimeen
 
Design of an Efficient Communication Protocol for 3d Interconnection Network
Design of an Efficient Communication Protocol for 3d Interconnection NetworkDesign of an Efficient Communication Protocol for 3d Interconnection Network
Design of an Efficient Communication Protocol for 3d Interconnection NetworkIJMTST Journal
 
Advanced computer architecture
Advanced computer architectureAdvanced computer architecture
Advanced computer architectureKaviya452563
 
05 - Networking Components and Devices.ppt
05 - Networking Components and Devices.ppt05 - Networking Components and Devices.ppt
05 - Networking Components and Devices.pptssuserf7cd2b
 
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...Derryck Lamptey, MPhil, CISSP
 
Networking devices(siddique)
Networking devices(siddique)Networking devices(siddique)
Networking devices(siddique)Siddique Ibrahim
 
Networks copy
Networks   copyNetworks   copy
Networks copyAman Goel
 

Similar to PDCLECTURE.pptx (20)

COMPUTER NETWORKING
COMPUTER NETWORKINGCOMPUTER NETWORKING
COMPUTER NETWORKING
 
13. multiprocessing
13. multiprocessing13. multiprocessing
13. multiprocessing
 
Parallel Processing Concepts
Parallel Processing Concepts Parallel Processing Concepts
Parallel Processing Concepts
 
01 - Topology.ppt
01 - Topology.ppt01 - Topology.ppt
01 - Topology.ppt
 
1.NggggggggggghhhhhhhhhhS UNIT - 1.pptx.pdf
1.NggggggggggghhhhhhhhhhS UNIT - 1.pptx.pdf1.NggggggggggghhhhhhhhhhS UNIT - 1.pptx.pdf
1.NggggggggggghhhhhhhhhhS UNIT - 1.pptx.pdf
 
1-introduction-to-computer-networking.ppt
1-introduction-to-computer-networking.ppt1-introduction-to-computer-networking.ppt
1-introduction-to-computer-networking.ppt
 
1 introduction-to-computer-networking
1 introduction-to-computer-networking1 introduction-to-computer-networking
1 introduction-to-computer-networking
 
1 introduction-to-computer-networking
1 introduction-to-computer-networking1 introduction-to-computer-networking
1 introduction-to-computer-networking
 
1 introduction-to-computer-networking
1 introduction-to-computer-networking1 introduction-to-computer-networking
1 introduction-to-computer-networking
 
1 introduction-to-computer-networking
1 introduction-to-computer-networking1 introduction-to-computer-networking
1 introduction-to-computer-networking
 
Design of an Efficient Communication Protocol for 3d Interconnection Network
Design of an Efficient Communication Protocol for 3d Interconnection NetworkDesign of an Efficient Communication Protocol for 3d Interconnection Network
Design of an Efficient Communication Protocol for 3d Interconnection Network
 
Advanced computer architecture
Advanced computer architectureAdvanced computer architecture
Advanced computer architecture
 
05 - Networking Components and Devices.ppt
05 - Networking Components and Devices.ppt05 - Networking Components and Devices.ppt
05 - Networking Components and Devices.ppt
 
Microprocessor
MicroprocessorMicroprocessor
Microprocessor
 
Solution(1)
Solution(1)Solution(1)
Solution(1)
 
Routing protocols in ad hoc network
Routing protocols in ad hoc networkRouting protocols in ad hoc network
Routing protocols in ad hoc network
 
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
 
Networking devices(siddique)
Networking devices(siddique)Networking devices(siddique)
Networking devices(siddique)
 
What is 3d torus
What is 3d torusWhat is 3d torus
What is 3d torus
 
Networks copy
Networks   copyNetworks   copy
Networks copy
 

Recently uploaded

Call Girls Delhi {Rohini} 9711199012 high profile service
Call Girls Delhi {Rohini} 9711199012 high profile serviceCall Girls Delhi {Rohini} 9711199012 high profile service
Call Girls Delhi {Rohini} 9711199012 high profile servicerehmti665
 
Top Rated Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated  Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...Top Rated  Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...Call Girls in Nagpur High Profile
 
Dubai Call Girls O528786472 Call Girls In Dubai Wisteria
Dubai Call Girls O528786472 Call Girls In Dubai WisteriaDubai Call Girls O528786472 Call Girls In Dubai Wisteria
Dubai Call Girls O528786472 Call Girls In Dubai WisteriaUnited Arab Emirates
 
Call Girls Service Kolkata Aishwarya 🤌 8250192130 🚀 Vip Call Girls Kolkata
Call Girls Service Kolkata Aishwarya 🤌  8250192130 🚀 Vip Call Girls KolkataCall Girls Service Kolkata Aishwarya 🤌  8250192130 🚀 Vip Call Girls Kolkata
Call Girls Service Kolkata Aishwarya 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4
 
哪里办理美国宾夕法尼亚州立大学毕业证(本硕)psu成绩单原版一模一样
哪里办理美国宾夕法尼亚州立大学毕业证(本硕)psu成绩单原版一模一样哪里办理美国宾夕法尼亚州立大学毕业证(本硕)psu成绩单原版一模一样
哪里办理美国宾夕法尼亚州立大学毕业证(本硕)psu成绩单原版一模一样qaffana
 
Russian Call Girls In South Delhi Delhi 9711199012 💋✔💕😘 Independent Escorts D...
Russian Call Girls In South Delhi Delhi 9711199012 💋✔💕😘 Independent Escorts D...Russian Call Girls In South Delhi Delhi 9711199012 💋✔💕😘 Independent Escorts D...
Russian Call Girls In South Delhi Delhi 9711199012 💋✔💕😘 Independent Escorts D...nagunakhan
 
(PARI) Alandi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(PARI) Alandi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(PARI) Alandi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(PARI) Alandi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
Thane Escorts, (Pooja 09892124323), Thane Call Girls
Thane Escorts, (Pooja 09892124323), Thane Call GirlsThane Escorts, (Pooja 09892124323), Thane Call Girls
Thane Escorts, (Pooja 09892124323), Thane Call GirlsPooja Nehwal
 
Call Girls in Nagpur Bhavna Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Bhavna Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Bhavna Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Bhavna Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
《伯明翰城市大学毕业证成绩单购买》学历证书学位证书区别《复刻原版1:1伯明翰城市大学毕业证书|修改BCU成绩单PDF版》Q微信741003700《BCU学...
《伯明翰城市大学毕业证成绩单购买》学历证书学位证书区别《复刻原版1:1伯明翰城市大学毕业证书|修改BCU成绩单PDF版》Q微信741003700《BCU学...《伯明翰城市大学毕业证成绩单购买》学历证书学位证书区别《复刻原版1:1伯明翰城市大学毕业证书|修改BCU成绩单PDF版》Q微信741003700《BCU学...
《伯明翰城市大学毕业证成绩单购买》学历证书学位证书区别《复刻原版1:1伯明翰城市大学毕业证书|修改BCU成绩单PDF版》Q微信741003700《BCU学...ur8mqw8e
 
Lucknow 💋 Call Girls Adil Nagar | ₹,9500 Pay Cash 8923113531 Free Home Delive...
Lucknow 💋 Call Girls Adil Nagar | ₹,9500 Pay Cash 8923113531 Free Home Delive...Lucknow 💋 Call Girls Adil Nagar | ₹,9500 Pay Cash 8923113531 Free Home Delive...
Lucknow 💋 Call Girls Adil Nagar | ₹,9500 Pay Cash 8923113531 Free Home Delive...anilsa9823
 
Top Rated Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
Top Rated  Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...Top Rated  Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
Top Rated Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...Call Girls in Nagpur High Profile
 
Call Girls in Thane 9892124323, Vashi cAll girls Serivces Juhu Escorts, powai...
Call Girls in Thane 9892124323, Vashi cAll girls Serivces Juhu Escorts, powai...Call Girls in Thane 9892124323, Vashi cAll girls Serivces Juhu Escorts, powai...
Call Girls in Thane 9892124323, Vashi cAll girls Serivces Juhu Escorts, powai...Pooja Nehwal
 
Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...
Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...
Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...ranjana rawat
 
Beautiful Sapna Call Girls CP 9711199012 ☎ Call /Whatsapps
Beautiful Sapna Call Girls CP 9711199012 ☎ Call /WhatsappsBeautiful Sapna Call Girls CP 9711199012 ☎ Call /Whatsapps
Beautiful Sapna Call Girls CP 9711199012 ☎ Call /Whatsappssapnasaifi408
 
(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)
(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)
(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)kojalkojal131
 
VIP Call Girl Saharanpur Aashi 8250192130 Independent Escort Service Saharanpur
VIP Call Girl Saharanpur Aashi 8250192130 Independent Escort Service SaharanpurVIP Call Girl Saharanpur Aashi 8250192130 Independent Escort Service Saharanpur
VIP Call Girl Saharanpur Aashi 8250192130 Independent Escort Service SaharanpurSuhani Kapoor
 
Pallawi 9167673311 Call Girls in Thane , Independent Escort Service Thane
Pallawi 9167673311  Call Girls in Thane , Independent Escort Service ThanePallawi 9167673311  Call Girls in Thane , Independent Escort Service Thane
Pallawi 9167673311 Call Girls in Thane , Independent Escort Service ThanePooja Nehwal
 
Russian Call Girls Kolkata Chhaya 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls Kolkata Chhaya 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls Kolkata Chhaya 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls Kolkata Chhaya 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4
 

Recently uploaded (20)

Call Girls Delhi {Rohini} 9711199012 high profile service
Call Girls Delhi {Rohini} 9711199012 high profile serviceCall Girls Delhi {Rohini} 9711199012 high profile service
Call Girls Delhi {Rohini} 9711199012 high profile service
 
Top Rated Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated  Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...Top Rated  Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
 
Dubai Call Girls O528786472 Call Girls In Dubai Wisteria
Dubai Call Girls O528786472 Call Girls In Dubai WisteriaDubai Call Girls O528786472 Call Girls In Dubai Wisteria
Dubai Call Girls O528786472 Call Girls In Dubai Wisteria
 
Call Girls Service Kolkata Aishwarya 🤌 8250192130 🚀 Vip Call Girls Kolkata
Call Girls Service Kolkata Aishwarya 🤌  8250192130 🚀 Vip Call Girls KolkataCall Girls Service Kolkata Aishwarya 🤌  8250192130 🚀 Vip Call Girls Kolkata
Call Girls Service Kolkata Aishwarya 🤌 8250192130 🚀 Vip Call Girls Kolkata
 
哪里办理美国宾夕法尼亚州立大学毕业证(本硕)psu成绩单原版一模一样
哪里办理美国宾夕法尼亚州立大学毕业证(本硕)psu成绩单原版一模一样哪里办理美国宾夕法尼亚州立大学毕业证(本硕)psu成绩单原版一模一样
哪里办理美国宾夕法尼亚州立大学毕业证(本硕)psu成绩单原版一模一样
 
Russian Call Girls In South Delhi Delhi 9711199012 💋✔💕😘 Independent Escorts D...
Russian Call Girls In South Delhi Delhi 9711199012 💋✔💕😘 Independent Escorts D...Russian Call Girls In South Delhi Delhi 9711199012 💋✔💕😘 Independent Escorts D...
Russian Call Girls In South Delhi Delhi 9711199012 💋✔💕😘 Independent Escorts D...
 
(PARI) Alandi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(PARI) Alandi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(PARI) Alandi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(PARI) Alandi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
Thane Escorts, (Pooja 09892124323), Thane Call Girls
Thane Escorts, (Pooja 09892124323), Thane Call GirlsThane Escorts, (Pooja 09892124323), Thane Call Girls
Thane Escorts, (Pooja 09892124323), Thane Call Girls
 
Call Girls in Nagpur Bhavna Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Bhavna Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Bhavna Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Bhavna Call 7001035870 Meet With Nagpur Escorts
 
《伯明翰城市大学毕业证成绩单购买》学历证书学位证书区别《复刻原版1:1伯明翰城市大学毕业证书|修改BCU成绩单PDF版》Q微信741003700《BCU学...
《伯明翰城市大学毕业证成绩单购买》学历证书学位证书区别《复刻原版1:1伯明翰城市大学毕业证书|修改BCU成绩单PDF版》Q微信741003700《BCU学...《伯明翰城市大学毕业证成绩单购买》学历证书学位证书区别《复刻原版1:1伯明翰城市大学毕业证书|修改BCU成绩单PDF版》Q微信741003700《BCU学...
《伯明翰城市大学毕业证成绩单购买》学历证书学位证书区别《复刻原版1:1伯明翰城市大学毕业证书|修改BCU成绩单PDF版》Q微信741003700《BCU学...
 
Lucknow 💋 Call Girls Adil Nagar | ₹,9500 Pay Cash 8923113531 Free Home Delive...
Lucknow 💋 Call Girls Adil Nagar | ₹,9500 Pay Cash 8923113531 Free Home Delive...Lucknow 💋 Call Girls Adil Nagar | ₹,9500 Pay Cash 8923113531 Free Home Delive...
Lucknow 💋 Call Girls Adil Nagar | ₹,9500 Pay Cash 8923113531 Free Home Delive...
 
Top Rated Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
Top Rated  Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...Top Rated  Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
Top Rated Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
 
Call Girls in Thane 9892124323, Vashi cAll girls Serivces Juhu Escorts, powai...
Call Girls in Thane 9892124323, Vashi cAll girls Serivces Juhu Escorts, powai...Call Girls in Thane 9892124323, Vashi cAll girls Serivces Juhu Escorts, powai...
Call Girls in Thane 9892124323, Vashi cAll girls Serivces Juhu Escorts, powai...
 
🔝 9953056974🔝 Delhi Call Girls in Ajmeri Gate
🔝 9953056974🔝 Delhi Call Girls in Ajmeri Gate🔝 9953056974🔝 Delhi Call Girls in Ajmeri Gate
🔝 9953056974🔝 Delhi Call Girls in Ajmeri Gate
 
Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...
Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...
Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...
 
Beautiful Sapna Call Girls CP 9711199012 ☎ Call /Whatsapps
Beautiful Sapna Call Girls CP 9711199012 ☎ Call /WhatsappsBeautiful Sapna Call Girls CP 9711199012 ☎ Call /Whatsapps
Beautiful Sapna Call Girls CP 9711199012 ☎ Call /Whatsapps
 
(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)
(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)
(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)
 
VIP Call Girl Saharanpur Aashi 8250192130 Independent Escort Service Saharanpur
VIP Call Girl Saharanpur Aashi 8250192130 Independent Escort Service SaharanpurVIP Call Girl Saharanpur Aashi 8250192130 Independent Escort Service Saharanpur
VIP Call Girl Saharanpur Aashi 8250192130 Independent Escort Service Saharanpur
 
Pallawi 9167673311 Call Girls in Thane , Independent Escort Service Thane
Pallawi 9167673311  Call Girls in Thane , Independent Escort Service ThanePallawi 9167673311  Call Girls in Thane , Independent Escort Service Thane
Pallawi 9167673311 Call Girls in Thane , Independent Escort Service Thane
 
Russian Call Girls Kolkata Chhaya 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls Kolkata Chhaya 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls Kolkata Chhaya 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls Kolkata Chhaya 🤌 8250192130 🚀 Vip Call Girls Kolkata
 

PDCLECTURE.pptx

  • 1. Parallel and Distributed Computing CS3006 (BCS-6E-6F) Lecture 04 Danyal Farhat School of Computing, FAST NUCES Lahore 13 February, 2023
  • 2. 2 Parallel and Distributed Computing Dr. Haroon Mahmood Network Topologies for Parallel Architectures
  • 3. 3 Parallel and Distributed Computing Dr. Haroon Mahmood Network Topologies: Buses  Simplest and earliest parallel machines used buses  All processors access a common bus for exchanging data  The distance between any two nodes is O(1) in a bus. The bus also provides a convenient broadcast media.  The bandwidth of the shared bus is a major bottleneck  Typical bus based machines are limited to dozens of nodes. Sun Enterprise servers and Intel Pentium based shared-bus multiprocessors are examples of such architectures.
  • 4. 4 Parallel and Distributed Computing Dr. Haroon Mahmood Network Topologies: Crossbar
  • 5. 5 Parallel and Distributed Computing Dr. Haroon Mahmood Network Topologies: Crossbar  Uses an p × m grid of switches to connect p inputs to m outputs in a non-blocking manner  The cost of a crossbar of p processors grows as O(p2)  This is generally difficult to scale for large values of p  Examples of machines that employ crossbars include the Sun Ultra HPC 10000 and the Fujitsu VPP500  Crossbars have excellent performance scalability but poor cost scalability  Buses have excellent cost scalability, but poor performance scalability
  • 6. 6 Parallel and Distributed Computing Dr. Haroon Mahmood Network Topologies: Multistage Omega Network  One of the most commonly used multistage interconnects is the Omega network.  This network consists of log p stages, where p is the number of inputs/outputs.  At each stage, input i is connected j
  • 7. 7 Parallel and Distributed Computing Dr. Haroon Mahmood Network Topologies: Multistage Omega Network  Each stage of the Omega network implements a perfect shuffle as follows:
  • 8. 8 Parallel and Distributed Computing Dr. Haroon Mahmood 8 x 8 Omega network  A complete Omega network with the perfect shuffle interconnects and switches can now be illustrated:  An omega network has p/2 × log p switching nodes, and the cost of such a network grows as (p log p)
  • 9. 9 Parallel and Distributed Computing Dr. Haroon Mahmood Network Topologies: Multistage Omega Network  Machines like IBM eServer p575 and SGI Altix 4000 use -network  -network is much more interesting for large number of processors  Problem: the switches have to be fast enough, and also the width of the links is important. 16 bit parallel is better than serial links  Multiprocessor vector-processors use instead crossbars (because at most only 32 processors)  Synchronization is usually performed with special communication registers (CPU to CPU); if there is little synchronization shared memory is admitted
  • 10. 10 Parallel and Distributed Computing Dr. Haroon Mahmood Shared Memory Interconnection Network  Main problem is how to do interconnections of the CPUs to each other and to the memory  There are three main network topologies available:  Crossbar (n2 connections - datapath without sharing)  -network (n log2 n connections - log2 n switching stages and shared on a path)  Central databus (1 connections - n shared)
  • 11. 11 Parallel and Distributed Computing Dr. Haroon Mahmood Advantages of shared memory machines  User-friendly programming environment due to global address space  Data sharing is fast and uniform due to proximity (nearness) of memory to CPUs  Memory coherence is managed by the operating system
  • 12. 12 Parallel and Distributed Computing Dr. Haroon Mahmood Disadvantages of shared memory machines  Performance degradation due to “memory contention” (several processors try to access the same memory location)  Programmer’s responsibility for synchronization constructs (correct access to memory)  Expensive to design shared memory computers with increasing numbers of processors  Adding processors can geometrically increase traffic on the shared memory-CPU path and for cache coherence management
  • 13. 13 Parallel and Distributed Computing Dr. Haroon Mahmood Distributed-memory SIMD (1)  These machines are sometimes also known as processor-array machines  They work in lock-step, i.e. all processors execute the same instruction at the same time (but on different data items); no synchronization is required  A control processor issues the instructions that are to be executed by the processors in the processor array  Processors sometimes are of a very simple bit-serial type (i.e the processors operate on the data items bitwise, irrespective of their type), which can operate on operands of any length (when operands are little, this may result in speedups)
  • 14. 14 Parallel and Distributed Computing Dr. Haroon Mahmood Distributed-memory SIMD (2)  Graphical Processing Units (GPUs) are similar to processor array systems  DM-SIMD are specialized on digital signal and image processing and on certain types of Monte Carlo simulations with virtually no exchange between processors  Operations that cannot be executed by the processor array or by the control processor are off-loaded to the front-end system  I/O may be through:  the front end system  the processor array (it is more efficient in providing data directly to the memory of the processor array)  both
  • 15. 15 Parallel and Distributed Computing Dr. Haroon Mahmood Distributed-memory MIMD CPU Mem Network CPU Mem CPU Mem  Processors have their own local memory  No concept of global address space across all processors  Distributed memory systems require a communication network to connect inter- processor memory
  • 16. 16 Parallel and Distributed Computing Dr. Haroon Mahmood Examples Shared-memory SIMD systems  The Hitachi S3600 series Distributed-memory SIMD systems  The Alenia Quadrics  The Cambridge Parallel Processing Gamma II  The Digital Equipment Corp. MPP series  The MasPar MP-1  The MasPar MP-2 Shared-memory MIMD systems  The Cray Research Inc. Cray J90-series, T90 series  The Hitachi S3800 series  The HP/Convex C4 series  The Digital Equipment Corp. AlphaServer  The NEC SX-4  The Silicon Graphics Power Challenge  The Tera MTA Distributed-memory MIMD systems  The Alex AVX 2  The Avalon A12  The C-DAC PARAM 9000/SS  The Cray Research Inc. T3E  The Fujitsu AP1000  The Fujitsu VPP300 series  The Hitachi SR2201 series  The HP/Convex Exemplar SPP-1200  The IBM 9076 SP2  The Intel Paragon XP  The Matsushita ADENART  The Meiko Computing Surface 2  The nCUBE 3  The NEC Cenju-3  The Parallel Computing Industries system  The Parsys TA9000  The Parsytec GC/Power Plus
  • 17. 17 Parallel and Distributed Computing Dr. Haroon Mahmood DM-MIMD Routing Mechanism  Routing mechanism determines the path a message takes through network to reach from source to destination node.  Data and task decomposition have to be dealt explicitly!  Topology and speed of the data paths are crucial and have to be balanced with costs.  Routing can be classified as:  Minimal  Non-minimal  It can also be classified as:  Deterministic routing  Adoptive routing
  • 18. 18 Parallel and Distributed Computing Dr. Haroon Mahmood Network Topologies Linear Arrays  In a linear array, each node has two neighbors, one to its left and one to its right.  If the nodes at either end are connected, we refer to it as a 1-D torus or a ring. Linear arrays: (a) with no wraparound links; (b) with wraparound link.
  • 19. 19 Parallel and Distributed Computing Dr. Haroon Mahmood Network Topologies K-d Meshes  A generalization has nodes with 4 neighbors, to the north, south, east, and west.  A further generalization to d dimensions has nodes with 2d neighbors (i.e., 6 neighbors in case of 3d cube). Two and three dimensional meshes: (a) 2-D mesh with no wraparound; (b) 2-D mesh with wraparound link (2-D torus); and (c) a 3-D mesh with no wraparound.
  • 20. 20 Parallel and Distributed Computing Dr. Haroon Mahmood Hypercube  For an hypercube with 2d nodes the number of steps to be taken between any two nodes is at most d (logarithmic grow) d = log p (dimensions = log(nodes))  The distance between any two nodes is at most log p.  Each node has log p neighbors. Network Topologies
  • 21. 21 Parallel and Distributed Computing Dr. Haroon Mahmood Network Topologies: hypercube  Rule of thumb is: “d-dimensional hypercube can be constructed by connecting corresponding nodes of two different (d-1)-dimensional hypercubes”
  • 22. 22 Parallel and Distributed Computing Dr. Haroon Mahmood Network Topologies: hypercube  The processors are numbered with 3-bit binary numbers which represent the X-Y-Z coordinates  The distance between two nodes is given by the number of bit positions at which the two nodes differ.
  • 23. 23 Parallel and Distributed Computing Dr. Haroon Mahmood Network Topologies: Tree based Networks  A tree network is one in which there is one path between any pair of nodes  Linear arrays and star-connected networks are special cases of tree-based networks  In static tree network, each node represent a processing element  In dynamic tree network, leaf nodes represent processing element while internal nodes are switching elements.  The source node sends the message up the tree until it reaches the node at the root of the smallest subtree containing both the source and destination nodes.  Trees can be laid out in 2D with no wire crossings. This is an attractive property of trees.  The distance between any two nodes is no more than 2logp.
  • 24. 24 Parallel and Distributed Computing Dr. Haroon Mahmood Complete Binary Tree Complete binary tree networks: (a) a static tree network; and (b) a dynamic tree network. Network Topologies: Tree based Networks
  • 25. 25 Parallel and Distributed Computing Dr. Haroon Mahmood Routing - Fat Tree  Another topology is the “fat tree”  In a tree, a node can speak to another node passing through the root so we have higher traffic at the root node.  Fat tree amends this providing more bandwidth (with multiple connections) in the higher levels of the tree  N-ary fat tree is when the levels towards the root have N times the number of connections in the level below it
  • 26. 26 Parallel and Distributed Computing Dr. Haroon Mahmood Evaluating Static Inter-Connections in-terms of Diameter, Arc- Connectivity, Bisection Width, and Cost. Cut-through Routing and Cost- performance tradeoffs
  • 27. 27 Parallel and Distributed Computing Dr. Haroon Mahmood Evaluating Static Interconnections The parameters to evaluate a static interconnection:-  Cost: Usually depends on number of links for communication. E.g., cost for linear array is p-1.  Lower values are favorable  Diameter: The shortest distance between the farthest two nodes in the network. The diameter of a linear array is p − 1.  Lower values are favorable  Bisection Width: The minimum number of wires you must cut to divide the network into two equal parts. The bisection width of a linear array is 1.  What it tells about performance of a topology?
  • 28. 28 Parallel and Distributed Computing Dr. Haroon Mahmood Evaluating Static Interconnections  Arc-connectivity: The minimum number of arcs or links that must be removed from the network, to break the network into two disconnected networks  Higher value are desirable  It is minimum number of the links that must be cut to separate the single node from the network  Higher values means, that incase of link failure there are multiple other routes to the node.  Arc-connectivity of linear array is 1 and 2 for ring.
  • 29. 29 Parallel and Distributed Computing Dr. Haroon Mahmood Evaluating Static Interconnections
  • 30. 30 Parallel and Distributed Computing Dr. Haroon Mahmood Thank You!

Editor's Notes

  1. If p = 8, log(p) = 3.
  2. Perfect Shuffle: Left_rotate.
  3. (p/2) * log(p) => if p = 8, then (8/2) * log(8) = 4 * 3 = 12 switching nodes
  4. n log n connections = 8 * log(8) = 24 connections
  5. Memory Coherence => Multiple processors trying to access the same memory location
  6. Cache Coherence => Multiple processors trying to access the same cache.
  7. Monte Carlo Simulation (a Mathematical technique): Used to estimate possible outcome of an uncertain event
  8. Data Decomposition: Data distribution among distributed memory Tasks Decomposition: Tasks distribution among distributed processors / cores Minimal Routing: Shortest path from source to destination Non-Minimal Routing: Path from source to destination Deterministic Routing: Source to destination path is determined and followed accordingly. Adaptive Routing: Make routing decision dynamically depending upon the network condition. i.e. change path dynamically is case of contention
  9. 3d-weather modeling, 3d-structure modeling
  10. Corresponding nodes of 3-d hypercubes are connected to construct 4-d hypercube
  11. 001 011 -------- 012 => 2 difference
  12. Bisection width is related to available bandwidth of the network when first half of network communicates with second half of the network Our performance increases as no. of dimensions increase [1-d vs. 2-d network]
  13. log(p+1)-1 floors of links in binary trees Exponential formula=2^0 + 2^1+…+2^n=2^(n+1) -1 To solve hypercube induce concept of directed links Star bisection= not ⎣ p/2 ⎦ but 1 because we only have one link to transfer data to second half Determine expressions for all the interconnections Discuss scalability issues How adding or removing a node effects overall network