SlideShare a Scribd company logo
“
”
NADAR SARASWATHI COLLEGE OF ARTS AND SCIENCE
VADAPUTHUPATTI, THENI – 625 531
Department of Computer Science and Information Technology
PRESENTED BY
NIBIYA.G
I-MSC(INFORMATION TECHONOLGY)
ADVANCED COMPUTER
ARCHITECTURE
SEMINAR TOPIC:-
CONNECTION MACHINES-CM-5
CONNECTION MACHINES
• 1981:MIT AI-lab Technical memo on CM
• 1982:Thinking Machines Inc. founded
• 1985:Danny Hillis wins ACM “Best Phn” Award
• 1986:CM-1 ships
• 1987:CM-2 ships
• 1991:CM-5 Announced
• 1991:ships
Cm-1 and cm-2 architecture
• Original design goal to support neuron like simulations
• Up to 64k single bit processors(actually 3 bits in and 2 out)
• 16 processors/chips, 32chips/PCD, 16 PCBs/cube, 8cubes/hypercube
• Hypercube architecture-each 16-proc chip a hyper-node
• Each proc has 4k bits of bit addressable RAM
 Distributed Physical Memory
 Global Memory Addresses
Cm-1 and cm-2 architecture
• Up to 4 front-end computer talk to sequencers via 4X4 crossbar
• “Sequencers” issues SIMD instruction over a Broadcast Network
• Bit process communication via 2D local HW grid connection(“NEWS”)
• Bit process communication via hypercube network using MSG passing
• Lots of Twinkling Lights
CM-1 and CM-2 PROGRAMMING
• ISA support:
 Bit- oriented operations
 Arbitrary precision multi-bit scalar Ops using bit-serial implementation on bit process
 Full multi-Dimensional Vector Ops
• “Virtual processor” idea similar to CUDA threads but they are statically allocated
• OS and programming tools run on front-ends
• List as the initial programming language
• Later c* and CM- Fortran
CM-2 IMPROVEMENTS
• 1 weitex IEEE FP coprocessor per 32 1-bit process
• Up to 256k bits of memory per process
• Added ECC to memory
• Implemented the IO subsystem
 Up to 80 GB RAID array called “Data Vault” uses 39 Striped Disks and ECC, plus spare disks on standby
 High speed graphics output
• En-route MSG combining in H-cube router
• New implementation of multi-Dimensional NEWS on top of H-cube (special addressing mode)
CM-5 vs. CM-1 and CM-2
• Significant departure from CM-1 and CM-2
• Targeted at more scientific and business application
• More commercial off-the-shelf components (“COTS”)
• Large array of SPARC processing nodes
 1-bit processors are abandoned
• Abandoned “NEWS” grid and hyper-cube networks
• Delivered 1024 node machine, with claims 16K nodes possible
• Even more Twinkling lights
CM-5 OVERALL ARCHITECTURE
• “coordination homogeneous array of RISC processors” or “CHARM”
• Asymmetric coprocessors model
 Large array of processor nodes
 Small collection of control nodes
• 2 separate scalable networks
 One for data
 One for control and synchronization
• Still uses striped RAID for high disk Bandwidth
DIVISION OF LABOR
• Processor node can be assigned to a “partition”
• One control node per partition
• Control node runs scalar code, then broadcasts parallel work to processor nodes
• Processor nodes can access other node’s memory by reading or writing a global
memory address
• Processor nodes also communicate via MSG passing
• Processor nodes cannot issue system calls
CONTROL NODES
• Full sun workstations
• Running UNIX
• Connected to the “outside world”
• Handle partition time sharing
• Connected to both data and control networks
• Performs system diagnostics
PROCESSOR NODES
• Nodes are a 5-chip microprocessor
 off the shelf SPARC processor @40 MHz
 32MB local node memory
 Multi-port memory controller for added BW
 “caching techniques do not perform as well on large parallel machines”
 Proprietary 4-FPU vector coprocessor
 Proprietary network controller
DATA NETWORK ARCHITECTURE
• Point to point inter-node communication and i/o
• Implemented as a fat tree
 Fat tree invented by TMI employee Charles leiserson
• Claim: onsite bandwidth expandable
• Delivering 5GB/sec bisection BW on 1024 node machine
• Data router chip is a 8x8 crossbar switch
• Faulty nodes are mapped out of network
 Program can not assume a network topology
• Network can be flushed when time share swaps occur
• Network, not processors, guarantee end-to-end delivery
SEPARATE CONTROL NETWORK
• Synchronization & control network
• Complete binary tree organization
• Provides broadcast capability
• Implements barrier operations
• Implements interrupts for timesharing
• Performs reduction operators
(Sum, Max, AND, OR, Count, etc.,)
CM-5 PROGRAMMING
• Supports multiple parallel high level languages and programming styles
 Including data parallel model from CM-1 and CM-2
• Goal: hide many decisions from programmers
 CM-1, CM-2, vs. CM-5 ISA changes
 Use of processor node CPU vs. vectors coprocessors
 Partition wide synchronization generated by compiler
• Is it MIMD, SPMD, SIMD?
 “globally synchronized MIMD”
SAMPLE CM APPS
• Machine learning
• VLSI design
• Geophysics (oil exploration), plate tectonics
• Fluid flow simulation
• Computer vision
• Computer graphics, animation
• Protein sequence matching
• Global climate model simulation
CONNECTION MACHINES
• The light panels of FROSTBURG a CM-5 on display at the
“NATION CRUPTOLOGIC MUSEM”
• The panels were used to check the usage of the processing nodes & to run
diagnostics
ANY DOUBTS
Advanced Computer Architecture

More Related Content

What's hot

CONDOR: An automated framework to accelerate convolutional neural networks on...
CONDOR: An automated framework to accelerate convolutional neural networks on...CONDOR: An automated framework to accelerate convolutional neural networks on...
CONDOR: An automated framework to accelerate convolutional neural networks on...
NECST Lab @ Politecnico di Milano
 
PowerEdge Rack and Tower Server Masters AMD Processors.pptx
PowerEdge Rack and Tower Server Masters AMD Processors.pptxPowerEdge Rack and Tower Server Masters AMD Processors.pptx
PowerEdge Rack and Tower Server Masters AMD Processors.pptx
NeoKenj
 
PowerEdge Rack and Tower Server Masters - AMD Server Memory.pptx
PowerEdge Rack and Tower Server Masters - AMD Server Memory.pptxPowerEdge Rack and Tower Server Masters - AMD Server Memory.pptx
PowerEdge Rack and Tower Server Masters - AMD Server Memory.pptx
NeoKenj
 
CONDOR: An automated framework to accelerate convolutional neural networks on...
CONDOR: An automated framework to accelerate convolutional neural networks on...CONDOR: An automated framework to accelerate convolutional neural networks on...
CONDOR: An automated framework to accelerate convolutional neural networks on...
NECST Lab @ Politecnico di Milano
 
SOC System Design Approach
SOC System Design ApproachSOC System Design Approach
SOC System Design Approach
A B Shinde
 
AMD Bridges the X86 and ARM Ecosystems for the Data Center
AMD Bridges the X86 and ARM Ecosystems for the Data Center AMD Bridges the X86 and ARM Ecosystems for the Data Center
AMD Bridges the X86 and ARM Ecosystems for the Data Center
AMD
 
GPUs vs CPUs for Parallel Processing
GPUs vs CPUs for Parallel ProcessingGPUs vs CPUs for Parallel Processing
GPUs vs CPUs for Parallel Processing
Mohammed Billoo
 
NetApp C-mode for 7 mode engineers
NetApp C-mode for 7 mode engineersNetApp C-mode for 7 mode engineers
NetApp C-mode for 7 mode engineers
subtitle
 
Open Hardware and Future Computing
Open Hardware and Future ComputingOpen Hardware and Future Computing
Open Hardware and Future Computing
Ganesan Narayanasamy
 
SOC Chip Basics
SOC Chip BasicsSOC Chip Basics
SOC Chip Basics
A B Shinde
 
Sx 6-single-node
Sx 6-single-nodeSx 6-single-node
Sx 6-single-node
Léia de Sousa
 
System-on-Chip
System-on-ChipSystem-on-Chip
System-on-Chip
Lars Jacobs
 
Risc and cisc eugene clewlow
Risc and cisc   eugene clewlowRisc and cisc   eugene clewlow
Risc and cisc eugene clewlow
Manish Prajapati
 
CISC VS CISC
CISC VS CISCCISC VS CISC
System On Chip (SOC)
System On Chip (SOC)System On Chip (SOC)
System On Chip (SOC)Shivam Gupta
 
SOC Application Studies: Image Compression
SOC Application Studies: Image CompressionSOC Application Studies: Image Compression
SOC Application Studies: Image Compression
A B Shinde
 
RISC and CISC Processors
RISC and CISC ProcessorsRISC and CISC Processors
RISC and CISC Processors
Adeel Rasheed
 
CFD acceleration with FPGA (byteLAKE's presentation from PPAM 2019)
CFD acceleration with FPGA (byteLAKE's presentation from PPAM 2019)CFD acceleration with FPGA (byteLAKE's presentation from PPAM 2019)
CFD acceleration with FPGA (byteLAKE's presentation from PPAM 2019)
byteLAKE
 
Processors selection
Processors selectionProcessors selection
Processors selection
Pradeep Shankhwar
 
Risc and cisc computers
Risc and cisc computersRisc and cisc computers
Risc and cisc computers
ankita mundhra
 

What's hot (20)

CONDOR: An automated framework to accelerate convolutional neural networks on...
CONDOR: An automated framework to accelerate convolutional neural networks on...CONDOR: An automated framework to accelerate convolutional neural networks on...
CONDOR: An automated framework to accelerate convolutional neural networks on...
 
PowerEdge Rack and Tower Server Masters AMD Processors.pptx
PowerEdge Rack and Tower Server Masters AMD Processors.pptxPowerEdge Rack and Tower Server Masters AMD Processors.pptx
PowerEdge Rack and Tower Server Masters AMD Processors.pptx
 
PowerEdge Rack and Tower Server Masters - AMD Server Memory.pptx
PowerEdge Rack and Tower Server Masters - AMD Server Memory.pptxPowerEdge Rack and Tower Server Masters - AMD Server Memory.pptx
PowerEdge Rack and Tower Server Masters - AMD Server Memory.pptx
 
CONDOR: An automated framework to accelerate convolutional neural networks on...
CONDOR: An automated framework to accelerate convolutional neural networks on...CONDOR: An automated framework to accelerate convolutional neural networks on...
CONDOR: An automated framework to accelerate convolutional neural networks on...
 
SOC System Design Approach
SOC System Design ApproachSOC System Design Approach
SOC System Design Approach
 
AMD Bridges the X86 and ARM Ecosystems for the Data Center
AMD Bridges the X86 and ARM Ecosystems for the Data Center AMD Bridges the X86 and ARM Ecosystems for the Data Center
AMD Bridges the X86 and ARM Ecosystems for the Data Center
 
GPUs vs CPUs for Parallel Processing
GPUs vs CPUs for Parallel ProcessingGPUs vs CPUs for Parallel Processing
GPUs vs CPUs for Parallel Processing
 
NetApp C-mode for 7 mode engineers
NetApp C-mode for 7 mode engineersNetApp C-mode for 7 mode engineers
NetApp C-mode for 7 mode engineers
 
Open Hardware and Future Computing
Open Hardware and Future ComputingOpen Hardware and Future Computing
Open Hardware and Future Computing
 
SOC Chip Basics
SOC Chip BasicsSOC Chip Basics
SOC Chip Basics
 
Sx 6-single-node
Sx 6-single-nodeSx 6-single-node
Sx 6-single-node
 
System-on-Chip
System-on-ChipSystem-on-Chip
System-on-Chip
 
Risc and cisc eugene clewlow
Risc and cisc   eugene clewlowRisc and cisc   eugene clewlow
Risc and cisc eugene clewlow
 
CISC VS CISC
CISC VS CISCCISC VS CISC
CISC VS CISC
 
System On Chip (SOC)
System On Chip (SOC)System On Chip (SOC)
System On Chip (SOC)
 
SOC Application Studies: Image Compression
SOC Application Studies: Image CompressionSOC Application Studies: Image Compression
SOC Application Studies: Image Compression
 
RISC and CISC Processors
RISC and CISC ProcessorsRISC and CISC Processors
RISC and CISC Processors
 
CFD acceleration with FPGA (byteLAKE's presentation from PPAM 2019)
CFD acceleration with FPGA (byteLAKE's presentation from PPAM 2019)CFD acceleration with FPGA (byteLAKE's presentation from PPAM 2019)
CFD acceleration with FPGA (byteLAKE's presentation from PPAM 2019)
 
Processors selection
Processors selectionProcessors selection
Processors selection
 
Risc and cisc computers
Risc and cisc computersRisc and cisc computers
Risc and cisc computers
 

Similar to Advanced Computer Architecture

Connection Machine
Connection MachineConnection Machine
Connection Machinebutest
 
A Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural NetworksA Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural Networks
inside-BigData.com
 
PEARC17: Improving Uintah's Scalability Through the Use of Portable Kokkos-Ba...
PEARC17: Improving Uintah's Scalability Through the Use of Portable Kokkos-Ba...PEARC17: Improving Uintah's Scalability Through the Use of Portable Kokkos-Ba...
PEARC17: Improving Uintah's Scalability Through the Use of Portable Kokkos-Ba...
John Holmen
 
High performance computing
High performance computingHigh performance computing
High performance computing
punjab engineering college, chandigarh
 
Maxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorialMaxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorialmadhuinturi
 
AMC & VPX Form Factor Boards With High Speed SERDES: Embedded World 2010
AMC & VPX Form Factor Boards With High Speed SERDES: Embedded World 2010AMC & VPX Form Factor Boards With High Speed SERDES: Embedded World 2010
AMC & VPX Form Factor Boards With High Speed SERDES: Embedded World 2010Altera Corporation
 
02 computer evolution and performance
02 computer evolution and performance02 computer evolution and performance
02 computer evolution and performanceSher Shah Merkhel
 
Fundamentals of Computer
Fundamentals of ComputerFundamentals of Computer
Fundamentals of Computer
Jubayer Alam Shoikat
 
Evaluating UCIe based multi-die SoC to meet timing and power
Evaluating UCIe based multi-die SoC to meet timing and power Evaluating UCIe based multi-die SoC to meet timing and power
Evaluating UCIe based multi-die SoC to meet timing and power
Deepak Shankar
 
IBM and ASTRON 64bit μServer for DOME
IBM and ASTRON 64bit μServer for DOMEIBM and ASTRON 64bit μServer for DOME
IBM and ASTRON 64bit μServer for DOME
IBM Research
 
Available HPC resources at CSUC
Available HPC resources at CSUCAvailable HPC resources at CSUC
Hpc 4 5
Hpc 4 5Hpc 4 5
Hpc 4 5
Yasir Khan
 
Available HPC resources at CSUC
Available HPC resources at CSUCAvailable HPC resources at CSUC
Ca lecture 03
Ca lecture 03Ca lecture 03
Ca lecture 03
Haris456
 
supercomputer
supercomputersupercomputer
supercomputer
panjab university
 
Computer Evolution.ppt
Computer Evolution.pptComputer Evolution.ppt
Computer Evolution.ppt
VivekTrial
 
Challenges in Embedded Computing
Challenges in Embedded ComputingChallenges in Embedded Computing
Challenges in Embedded Computing
Pradeep Kumar TS
 
Mahti quick-start guide
Mahti quick-start guide Mahti quick-start guide
Mahti quick-start guide
CSC - IT Center for Science
 
ECESLU Microprocessors lecture
ECESLU Microprocessors lecture ECESLU Microprocessors lecture
ECESLU Microprocessors lecture
Jeffrey Des Binwag
 
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
BigDataEverywhere
 

Similar to Advanced Computer Architecture (20)

Connection Machine
Connection MachineConnection Machine
Connection Machine
 
A Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural NetworksA Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural Networks
 
PEARC17: Improving Uintah's Scalability Through the Use of Portable Kokkos-Ba...
PEARC17: Improving Uintah's Scalability Through the Use of Portable Kokkos-Ba...PEARC17: Improving Uintah's Scalability Through the Use of Portable Kokkos-Ba...
PEARC17: Improving Uintah's Scalability Through the Use of Portable Kokkos-Ba...
 
High performance computing
High performance computingHigh performance computing
High performance computing
 
Maxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorialMaxwell siuc hpc_description_tutorial
Maxwell siuc hpc_description_tutorial
 
AMC & VPX Form Factor Boards With High Speed SERDES: Embedded World 2010
AMC & VPX Form Factor Boards With High Speed SERDES: Embedded World 2010AMC & VPX Form Factor Boards With High Speed SERDES: Embedded World 2010
AMC & VPX Form Factor Boards With High Speed SERDES: Embedded World 2010
 
02 computer evolution and performance
02 computer evolution and performance02 computer evolution and performance
02 computer evolution and performance
 
Fundamentals of Computer
Fundamentals of ComputerFundamentals of Computer
Fundamentals of Computer
 
Evaluating UCIe based multi-die SoC to meet timing and power
Evaluating UCIe based multi-die SoC to meet timing and power Evaluating UCIe based multi-die SoC to meet timing and power
Evaluating UCIe based multi-die SoC to meet timing and power
 
IBM and ASTRON 64bit μServer for DOME
IBM and ASTRON 64bit μServer for DOMEIBM and ASTRON 64bit μServer for DOME
IBM and ASTRON 64bit μServer for DOME
 
Available HPC resources at CSUC
Available HPC resources at CSUCAvailable HPC resources at CSUC
Available HPC resources at CSUC
 
Hpc 4 5
Hpc 4 5Hpc 4 5
Hpc 4 5
 
Available HPC resources at CSUC
Available HPC resources at CSUCAvailable HPC resources at CSUC
Available HPC resources at CSUC
 
Ca lecture 03
Ca lecture 03Ca lecture 03
Ca lecture 03
 
supercomputer
supercomputersupercomputer
supercomputer
 
Computer Evolution.ppt
Computer Evolution.pptComputer Evolution.ppt
Computer Evolution.ppt
 
Challenges in Embedded Computing
Challenges in Embedded ComputingChallenges in Embedded Computing
Challenges in Embedded Computing
 
Mahti quick-start guide
Mahti quick-start guide Mahti quick-start guide
Mahti quick-start guide
 
ECESLU Microprocessors lecture
ECESLU Microprocessors lecture ECESLU Microprocessors lecture
ECESLU Microprocessors lecture
 
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
 

More from nibiganesh

soft computing.pptx
soft computing.pptxsoft computing.pptx
soft computing.pptx
nibiganesh
 
object oriented analysis data.pptx
object oriented analysis data.pptxobject oriented analysis data.pptx
object oriented analysis data.pptx
nibiganesh
 
advance computer architecture.pptx
advance computer architecture.pptxadvance computer architecture.pptx
advance computer architecture.pptx
nibiganesh
 
Women empowerment.pptx
Women empowerment.pptxWomen empowerment.pptx
Women empowerment.pptx
nibiganesh
 
internet of things.pptx
internet of things.pptxinternet of things.pptx
internet of things.pptx
nibiganesh
 
digital image processing.pptx
digital image processing.pptxdigital image processing.pptx
digital image processing.pptx
nibiganesh
 
big data analysis.pptx
big data analysis.pptxbig data analysis.pptx
big data analysis.pptx
nibiganesh
 
client server computing.pptx
client server computing.pptxclient server computing.pptx
client server computing.pptx
nibiganesh
 
python&programming
python&programmingpython&programming
python&programming
nibiganesh
 
distributing computing
distributing computingdistributing computing
distributing computing
nibiganesh
 
Datamining & warehouse
Datamining & warehouseDatamining & warehouse
Datamining & warehouse
nibiganesh
 
Artifical intelligent
Artifical intelligentArtifical intelligent
Artifical intelligent
nibiganesh
 
Advanced java programming
Advanced java programmingAdvanced java programming
Advanced java programming
nibiganesh
 
NETWORK AND INTERNET SECURITY
NETWORK AND INTERNET SECURITYNETWORK AND INTERNET SECURITY
NETWORK AND INTERNET SECURITY
nibiganesh
 
Data Structure Algorithm
Data Structure AlgorithmData Structure Algorithm
Data Structure Algorithm
nibiganesh
 

More from nibiganesh (15)

soft computing.pptx
soft computing.pptxsoft computing.pptx
soft computing.pptx
 
object oriented analysis data.pptx
object oriented analysis data.pptxobject oriented analysis data.pptx
object oriented analysis data.pptx
 
advance computer architecture.pptx
advance computer architecture.pptxadvance computer architecture.pptx
advance computer architecture.pptx
 
Women empowerment.pptx
Women empowerment.pptxWomen empowerment.pptx
Women empowerment.pptx
 
internet of things.pptx
internet of things.pptxinternet of things.pptx
internet of things.pptx
 
digital image processing.pptx
digital image processing.pptxdigital image processing.pptx
digital image processing.pptx
 
big data analysis.pptx
big data analysis.pptxbig data analysis.pptx
big data analysis.pptx
 
client server computing.pptx
client server computing.pptxclient server computing.pptx
client server computing.pptx
 
python&programming
python&programmingpython&programming
python&programming
 
distributing computing
distributing computingdistributing computing
distributing computing
 
Datamining & warehouse
Datamining & warehouseDatamining & warehouse
Datamining & warehouse
 
Artifical intelligent
Artifical intelligentArtifical intelligent
Artifical intelligent
 
Advanced java programming
Advanced java programmingAdvanced java programming
Advanced java programming
 
NETWORK AND INTERNET SECURITY
NETWORK AND INTERNET SECURITYNETWORK AND INTERNET SECURITY
NETWORK AND INTERNET SECURITY
 
Data Structure Algorithm
Data Structure AlgorithmData Structure Algorithm
Data Structure Algorithm
 

Recently uploaded

special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
Chapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdfChapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdf
Kartik Tiwari
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
Mohammed Sikander
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
vaibhavrinwa19
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
heathfieldcps1
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
Vikramjit Singh
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
Levi Shapiro
 
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBCSTRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
kimdan468
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
Scholarhat
 

Recently uploaded (20)

special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
Chapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdfChapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdf
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
 
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBCSTRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
 

Advanced Computer Architecture

  • 1. “ ” NADAR SARASWATHI COLLEGE OF ARTS AND SCIENCE VADAPUTHUPATTI, THENI – 625 531 Department of Computer Science and Information Technology PRESENTED BY NIBIYA.G I-MSC(INFORMATION TECHONOLGY)
  • 3. CONNECTION MACHINES • 1981:MIT AI-lab Technical memo on CM • 1982:Thinking Machines Inc. founded • 1985:Danny Hillis wins ACM “Best Phn” Award • 1986:CM-1 ships • 1987:CM-2 ships • 1991:CM-5 Announced • 1991:ships
  • 4. Cm-1 and cm-2 architecture • Original design goal to support neuron like simulations • Up to 64k single bit processors(actually 3 bits in and 2 out) • 16 processors/chips, 32chips/PCD, 16 PCBs/cube, 8cubes/hypercube • Hypercube architecture-each 16-proc chip a hyper-node • Each proc has 4k bits of bit addressable RAM  Distributed Physical Memory  Global Memory Addresses
  • 5. Cm-1 and cm-2 architecture • Up to 4 front-end computer talk to sequencers via 4X4 crossbar • “Sequencers” issues SIMD instruction over a Broadcast Network • Bit process communication via 2D local HW grid connection(“NEWS”) • Bit process communication via hypercube network using MSG passing • Lots of Twinkling Lights
  • 6.
  • 7. CM-1 and CM-2 PROGRAMMING • ISA support:  Bit- oriented operations  Arbitrary precision multi-bit scalar Ops using bit-serial implementation on bit process  Full multi-Dimensional Vector Ops • “Virtual processor” idea similar to CUDA threads but they are statically allocated • OS and programming tools run on front-ends • List as the initial programming language • Later c* and CM- Fortran
  • 8. CM-2 IMPROVEMENTS • 1 weitex IEEE FP coprocessor per 32 1-bit process • Up to 256k bits of memory per process • Added ECC to memory • Implemented the IO subsystem  Up to 80 GB RAID array called “Data Vault” uses 39 Striped Disks and ECC, plus spare disks on standby  High speed graphics output • En-route MSG combining in H-cube router • New implementation of multi-Dimensional NEWS on top of H-cube (special addressing mode)
  • 9.
  • 10. CM-5 vs. CM-1 and CM-2 • Significant departure from CM-1 and CM-2 • Targeted at more scientific and business application • More commercial off-the-shelf components (“COTS”) • Large array of SPARC processing nodes  1-bit processors are abandoned • Abandoned “NEWS” grid and hyper-cube networks • Delivered 1024 node machine, with claims 16K nodes possible • Even more Twinkling lights
  • 11.
  • 12. CM-5 OVERALL ARCHITECTURE • “coordination homogeneous array of RISC processors” or “CHARM” • Asymmetric coprocessors model  Large array of processor nodes  Small collection of control nodes • 2 separate scalable networks  One for data  One for control and synchronization • Still uses striped RAID for high disk Bandwidth
  • 13. DIVISION OF LABOR • Processor node can be assigned to a “partition” • One control node per partition • Control node runs scalar code, then broadcasts parallel work to processor nodes • Processor nodes can access other node’s memory by reading or writing a global memory address • Processor nodes also communicate via MSG passing • Processor nodes cannot issue system calls
  • 14. CONTROL NODES • Full sun workstations • Running UNIX • Connected to the “outside world” • Handle partition time sharing • Connected to both data and control networks • Performs system diagnostics
  • 15. PROCESSOR NODES • Nodes are a 5-chip microprocessor  off the shelf SPARC processor @40 MHz  32MB local node memory  Multi-port memory controller for added BW  “caching techniques do not perform as well on large parallel machines”  Proprietary 4-FPU vector coprocessor  Proprietary network controller
  • 16.
  • 17. DATA NETWORK ARCHITECTURE • Point to point inter-node communication and i/o • Implemented as a fat tree  Fat tree invented by TMI employee Charles leiserson • Claim: onsite bandwidth expandable • Delivering 5GB/sec bisection BW on 1024 node machine • Data router chip is a 8x8 crossbar switch • Faulty nodes are mapped out of network  Program can not assume a network topology • Network can be flushed when time share swaps occur • Network, not processors, guarantee end-to-end delivery
  • 18.
  • 19. SEPARATE CONTROL NETWORK • Synchronization & control network • Complete binary tree organization • Provides broadcast capability • Implements barrier operations • Implements interrupts for timesharing • Performs reduction operators (Sum, Max, AND, OR, Count, etc.,)
  • 20. CM-5 PROGRAMMING • Supports multiple parallel high level languages and programming styles  Including data parallel model from CM-1 and CM-2 • Goal: hide many decisions from programmers  CM-1, CM-2, vs. CM-5 ISA changes  Use of processor node CPU vs. vectors coprocessors  Partition wide synchronization generated by compiler • Is it MIMD, SPMD, SIMD?  “globally synchronized MIMD”
  • 21. SAMPLE CM APPS • Machine learning • VLSI design • Geophysics (oil exploration), plate tectonics • Fluid flow simulation • Computer vision • Computer graphics, animation • Protein sequence matching • Global climate model simulation
  • 22. CONNECTION MACHINES • The light panels of FROSTBURG a CM-5 on display at the “NATION CRUPTOLOGIC MUSEM” • The panels were used to check the usage of the processing nodes & to run diagnostics