SlideShare a Scribd company logo
Parallel and Distributed Systems
Instructor: Xin Yuan
Department of Computer Science
Florida State University
Parallel and distributed systems
• What is a parallel computer?
– A collection of processing elements that
communicate and coorperate to solve large
problems fast.
• What is a distributed system?
– A collection of independent computers that
appear to its users as a single coherent system.
– A parallel computer is implicitly a distributed
system.
What are they?
• More than one processing element
• Elements communicate and cooperate
• Appear as one system
• PDS hardware is everywhere.
– diablo.cs.fsu.edu (2-CPU SMP)
– Multicore workstations/laptops
– linprog.cs.fsu.edu (4-node cluster, 2-CPU SMP, 8 cores per node)
– Hadoop clusters
– IBM Blue Gene/L at DOE/NNSA/LLNL (262992 processors).
– Data centers
– SETI@home.
Why parallel and distributed
computing?
• Moore’s law: the number of transistors
double every 18 months.
Why parallel and distributed
computing?
• How to make good use of the increasing number of
transistors on a chip?
– Increase the computation width (70’s and 80’s)
• 4bit->8bit->16bit->32bit->64bit->….
– Instruction level parallelism (90’s)
• Pipeline, LIW, etc
– ILP to the extreme (early 00’s)
• Out of order execution, 6-way issues, etc
– Sequential program performance keeps increasing.
• The clock rate keeps increasing, clock cycles get smaller
and smaller.
Why parallel and distributed
computing?
• The fundament limit of the speed of a sequential
processor.
– Power wall (high frequency results in heat)
– Latency wall (speed of light does not change)
Power wall: what will happen if
we keep pushing ILP?
Latency wall
• Speed of light = 300000000m/s
• One cycle at 4Ghz = 0.00000000025s
• The distance that the light can move at one cycle:
– 0.00000000025 * 300000000 = 7.5cm
Intel chip dimension = 1.47 in x 1.47 in
= 3.73cm x 3.73cm
Why parallel and distributed
computing?
• Power wall and latency wall indicate that the
era of single thread performance improvement
through Moore’s law is ending/has ended.
• More transistors on a chip are now applied to
increase system throughput, the total number
of instructions executed, but not latency, the
time for a job to finish.
– Improving ILP improves both.
– We see a multi-core era
• The marching of multi-core (2004-now)
– Mainstream processors:
• INTEL Quad-core Xeon (4 cores),
• AMD Quad-core Opteron (4 cores),
• SUN Niagara (8 cores),
• IBM Power6 (2 cores)
– Others
• Intel Tflops (80 cores)
• CISCO CSR-1 (180 cores) (network processors)
– Increase the throughput without increasing the
clock rate.
• Why parallel and distributed computing?
– Programming multi-core systems is fundamentally
different from programming traditional computers.
• Parallelism needs to be explicitly expressed in the program.
– This is traditionally referred to as parallel computing.
• PDC is not really a choice anymore.
For (I=0; I<500; I++)
a[I] = 0;
1970 1980 200620001990
Performance
GO PARALLEL
• In the foreseeable future, parallel systems
with multi-core processors are going
mainstream.
– Lots of hardware, software, and human issues
in mainstream parallel/distributed computing
• How to make use a large number of processors
simultaneously
• How to write parallel programs easily and
efficiently?
• What kind of software supports is needed?
• Analyzing and optimizing parallel programs is still a
hard problem.
• How to debug concurrent program?
Parallel computing issues
• Data sharing – single versus multiple
address space.
• Process coordination – synchronization
using locks, messages, and other means.
• Distributed versus centralized memory.
• Connectivity – single shared bus versus
network with many different topologies.
• Fault tolerance/reliability.
Distributed systems issues
• Transparency
– Access (data representation), location,
migration, relocation, replication, concurrency,
failure, persistence
• Scalability
– Size, geographically
• Reliability/fault tolerance
What we will be doing in this
course?
• This is the first class in PDC.
– Systematically introduce most mainstream
parallel computing platforms and programming
paradigms.
• The coverage is not necessary in depth.

More Related Content

What's hot

Parallel Computing
Parallel ComputingParallel Computing
Parallel Computing
Ameya Waghmare
 
Research Scope in Parallel Computing And Parallel Programming
Research Scope in Parallel Computing And Parallel ProgrammingResearch Scope in Parallel Computing And Parallel Programming
Research Scope in Parallel Computing And Parallel Programming
Shitalkumar Sukhdeve
 
Distributed computing ).ppt him
Distributed computing ).ppt himDistributed computing ).ppt him
Distributed computing ).ppt him
Himanshu Saini
 
Parallel Algorithms Advantages and Disadvantages
Parallel Algorithms Advantages and DisadvantagesParallel Algorithms Advantages and Disadvantages
Parallel Algorithms Advantages and Disadvantages
Murtadha Alsabbagh
 
Introduction to Parallel Distributed Computer Systems
Introduction to Parallel Distributed Computer SystemsIntroduction to Parallel Distributed Computer Systems
Introduction to Parallel Distributed Computer Systems
MrMaKKaWi
 
System models for distributed and cloud computing
System models for distributed and cloud computingSystem models for distributed and cloud computing
System models for distributed and cloud computingpurplesea
 
Distributed Computing
Distributed ComputingDistributed Computing
Distributed Computing
Prashant Tiwari
 
Cluster Computing Seminar.
Cluster Computing Seminar.Cluster Computing Seminar.
Cluster Computing Seminar.
Balvant Biradar
 
Distributed Systems
Distributed SystemsDistributed Systems
Distributed SystemsRupsee
 
Parallel computing
Parallel computingParallel computing
Parallel computing
Engr Zardari Saddam
 
Cluster computing
Cluster computingCluster computing
Cluster computing
Adarsh110
 
Patterns For Parallel Computing
Patterns For Parallel ComputingPatterns For Parallel Computing
Patterns For Parallel Computing
David Chou
 
Wireless Sensor System Architecture
Wireless Sensor System ArchitectureWireless Sensor System Architecture
Wireless Sensor System Architecture
varun kumar
 
Cluster computing
Cluster computingCluster computing
High performance computing with accelarators
High performance computing with accelaratorsHigh performance computing with accelarators
High performance computing with accelarators
Emmanuel college
 
Cluster Computing
Cluster ComputingCluster Computing
Cluster ComputingNIKHIL NAIR
 
Models in ds
Models in dsModels in ds
Models in ds
DUNCAN OPIYO
 
Cluster Tutorial
Cluster TutorialCluster Tutorial
Cluster Tutorialcybercbm
 

What's hot (19)

Parallel Computing
Parallel ComputingParallel Computing
Parallel Computing
 
Research Scope in Parallel Computing And Parallel Programming
Research Scope in Parallel Computing And Parallel ProgrammingResearch Scope in Parallel Computing And Parallel Programming
Research Scope in Parallel Computing And Parallel Programming
 
Distributed computing ).ppt him
Distributed computing ).ppt himDistributed computing ).ppt him
Distributed computing ).ppt him
 
Parallel Computing
Parallel Computing Parallel Computing
Parallel Computing
 
Parallel Algorithms Advantages and Disadvantages
Parallel Algorithms Advantages and DisadvantagesParallel Algorithms Advantages and Disadvantages
Parallel Algorithms Advantages and Disadvantages
 
Introduction to Parallel Distributed Computer Systems
Introduction to Parallel Distributed Computer SystemsIntroduction to Parallel Distributed Computer Systems
Introduction to Parallel Distributed Computer Systems
 
System models for distributed and cloud computing
System models for distributed and cloud computingSystem models for distributed and cloud computing
System models for distributed and cloud computing
 
Distributed Computing
Distributed ComputingDistributed Computing
Distributed Computing
 
Cluster Computing Seminar.
Cluster Computing Seminar.Cluster Computing Seminar.
Cluster Computing Seminar.
 
Distributed Systems
Distributed SystemsDistributed Systems
Distributed Systems
 
Parallel computing
Parallel computingParallel computing
Parallel computing
 
Cluster computing
Cluster computingCluster computing
Cluster computing
 
Patterns For Parallel Computing
Patterns For Parallel ComputingPatterns For Parallel Computing
Patterns For Parallel Computing
 
Wireless Sensor System Architecture
Wireless Sensor System ArchitectureWireless Sensor System Architecture
Wireless Sensor System Architecture
 
Cluster computing
Cluster computingCluster computing
Cluster computing
 
High performance computing with accelarators
High performance computing with accelaratorsHigh performance computing with accelarators
High performance computing with accelarators
 
Cluster Computing
Cluster ComputingCluster Computing
Cluster Computing
 
Models in ds
Models in dsModels in ds
Models in ds
 
Cluster Tutorial
Cluster TutorialCluster Tutorial
Cluster Tutorial
 

Similar to Pdc lecture1

CSA unit5.pptx
CSA unit5.pptxCSA unit5.pptx
CSA unit5.pptx
AbcvDef
 
chap-0 .ppt
chap-0 .pptchap-0 .ppt
chap-0 .ppt
Lookly Sam
 
e-Infrastructure available for research, using the right tool for the right job
e-Infrastructure available for research, using the right tool for the right jobe-Infrastructure available for research, using the right tool for the right job
e-Infrastructure available for research, using the right tool for the right job
David Wallom
 
Overview of HPC.pptx
Overview of HPC.pptxOverview of HPC.pptx
Overview of HPC.pptx
sundariprabhu
 
Aca module 1
Aca module 1Aca module 1
Aca module 1
Avinash_N Rao
 
Chap 2 classification of parralel architecture and introduction to parllel p...
Chap 2  classification of parralel architecture and introduction to parllel p...Chap 2  classification of parralel architecture and introduction to parllel p...
Chap 2 classification of parralel architecture and introduction to parllel p...
Malobe Lottin Cyrille Marcel
 
Operating System
Operating SystemOperating System
Operating System
Hitesh Mohapatra
 
CC unit 1.pptx
CC unit 1.pptxCC unit 1.pptx
CC unit 1.pptx
DivyaRadharapu1
 
High performance computing
High performance computingHigh performance computing
High performance computing
punjab engineering college, chandigarh
 
distributed system lab materials about ad
distributed system lab materials about addistributed system lab materials about ad
distributed system lab materials about ad
milkesa13
 
EMBEDDED OS
EMBEDDED OSEMBEDDED OS
EMBEDDED OS
AJAL A J
 
Lecture 1 (distributed systems)
Lecture 1 (distributed systems)Lecture 1 (distributed systems)
Lecture 1 (distributed systems)
Fazli Amin
 
Distributed systems and scalability rules
Distributed systems and scalability rulesDistributed systems and scalability rules
Distributed systems and scalability rules
Oleg Tsal-Tsalko
 
Fundamentals.pptx
Fundamentals.pptxFundamentals.pptx
Fundamentals.pptx
dhivyak49
 
Array Processors & Architectural Classification Schemes_Computer Architecture...
Array Processors & Architectural Classification Schemes_Computer Architecture...Array Processors & Architectural Classification Schemes_Computer Architecture...
Array Processors & Architectural Classification Schemes_Computer Architecture...
Sumalatha A
 
CCUnit1.pdf
CCUnit1.pdfCCUnit1.pdf
CCUnit1.pdf
AnayGupta26
 
Distributed DNN training: Infrastructure, challenges, and lessons learned
Distributed DNN training: Infrastructure, challenges, and lessons learnedDistributed DNN training: Infrastructure, challenges, and lessons learned
Distributed DNN training: Infrastructure, challenges, and lessons learned
Wee Hyong Tok
 

Similar to Pdc lecture1 (20)

CSA unit5.pptx
CSA unit5.pptxCSA unit5.pptx
CSA unit5.pptx
 
chap-0 .ppt
chap-0 .pptchap-0 .ppt
chap-0 .ppt
 
e-Infrastructure available for research, using the right tool for the right job
e-Infrastructure available for research, using the right tool for the right jobe-Infrastructure available for research, using the right tool for the right job
e-Infrastructure available for research, using the right tool for the right job
 
Overview of HPC.pptx
Overview of HPC.pptxOverview of HPC.pptx
Overview of HPC.pptx
 
Aca module 1
Aca module 1Aca module 1
Aca module 1
 
Chap 2 classification of parralel architecture and introduction to parllel p...
Chap 2  classification of parralel architecture and introduction to parllel p...Chap 2  classification of parralel architecture and introduction to parllel p...
Chap 2 classification of parralel architecture and introduction to parllel p...
 
Introduction
IntroductionIntroduction
Introduction
 
Operating System
Operating SystemOperating System
Operating System
 
CC unit 1.pptx
CC unit 1.pptxCC unit 1.pptx
CC unit 1.pptx
 
High performance computing
High performance computingHigh performance computing
High performance computing
 
distributed system lab materials about ad
distributed system lab materials about addistributed system lab materials about ad
distributed system lab materials about ad
 
EMBEDDED OS
EMBEDDED OSEMBEDDED OS
EMBEDDED OS
 
Lecture 1 (distributed systems)
Lecture 1 (distributed systems)Lecture 1 (distributed systems)
Lecture 1 (distributed systems)
 
Distributed systems and scalability rules
Distributed systems and scalability rulesDistributed systems and scalability rules
Distributed systems and scalability rules
 
Fundamentals.pptx
Fundamentals.pptxFundamentals.pptx
Fundamentals.pptx
 
Coa presentation4
Coa presentation4Coa presentation4
Coa presentation4
 
Array Processors & Architectural Classification Schemes_Computer Architecture...
Array Processors & Architectural Classification Schemes_Computer Architecture...Array Processors & Architectural Classification Schemes_Computer Architecture...
Array Processors & Architectural Classification Schemes_Computer Architecture...
 
CCUnit1.pdf
CCUnit1.pdfCCUnit1.pdf
CCUnit1.pdf
 
Unit 2(oss) (1)
Unit 2(oss) (1)Unit 2(oss) (1)
Unit 2(oss) (1)
 
Distributed DNN training: Infrastructure, challenges, and lessons learned
Distributed DNN training: Infrastructure, challenges, and lessons learnedDistributed DNN training: Infrastructure, challenges, and lessons learned
Distributed DNN training: Infrastructure, challenges, and lessons learned
 

Recently uploaded

J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang,  ICLR 2024, MLILAB, KAIST AI.pdfJ.Yang,  ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
MLILAB
 
The role of big data in decision making.
The role of big data in decision making.The role of big data in decision making.
The role of big data in decision making.
ankuprajapati0525
 
Halogenation process of chemical process industries
Halogenation process of chemical process industriesHalogenation process of chemical process industries
Halogenation process of chemical process industries
MuhammadTufail242431
 
Quality defects in TMT Bars, Possible causes and Potential Solutions.
Quality defects in TMT Bars, Possible causes and Potential Solutions.Quality defects in TMT Bars, Possible causes and Potential Solutions.
Quality defects in TMT Bars, Possible causes and Potential Solutions.
PrashantGoswami42
 
CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Elective
karthi keyan
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
AhmedHussein950959
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
Neometrix_Engineering_Pvt_Ltd
 
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
MLILAB
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Dr.Costas Sachpazis
 
DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
FluxPrime1
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
Pipe Restoration Solutions
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
gerogepatton
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
Amil Baba Dawood bangali
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
fxintegritypublishin
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
JoytuBarua2
 
addressing modes in computer architecture
addressing modes  in computer architectureaddressing modes  in computer architecture
addressing modes in computer architecture
ShahidSultan24
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
Kamal Acharya
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
Kamal Acharya
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
obonagu
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation & Control
 

Recently uploaded (20)

J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang,  ICLR 2024, MLILAB, KAIST AI.pdfJ.Yang,  ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
 
The role of big data in decision making.
The role of big data in decision making.The role of big data in decision making.
The role of big data in decision making.
 
Halogenation process of chemical process industries
Halogenation process of chemical process industriesHalogenation process of chemical process industries
Halogenation process of chemical process industries
 
Quality defects in TMT Bars, Possible causes and Potential Solutions.
Quality defects in TMT Bars, Possible causes and Potential Solutions.Quality defects in TMT Bars, Possible causes and Potential Solutions.
Quality defects in TMT Bars, Possible causes and Potential Solutions.
 
CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Elective
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
 
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
 
DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
 
addressing modes in computer architecture
addressing modes  in computer architectureaddressing modes  in computer architecture
addressing modes in computer architecture
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
 

Pdc lecture1

  • 1. Parallel and Distributed Systems Instructor: Xin Yuan Department of Computer Science Florida State University
  • 2. Parallel and distributed systems • What is a parallel computer? – A collection of processing elements that communicate and coorperate to solve large problems fast. • What is a distributed system? – A collection of independent computers that appear to its users as a single coherent system. – A parallel computer is implicitly a distributed system.
  • 3. What are they? • More than one processing element • Elements communicate and cooperate • Appear as one system • PDS hardware is everywhere. – diablo.cs.fsu.edu (2-CPU SMP) – Multicore workstations/laptops – linprog.cs.fsu.edu (4-node cluster, 2-CPU SMP, 8 cores per node) – Hadoop clusters – IBM Blue Gene/L at DOE/NNSA/LLNL (262992 processors). – Data centers – SETI@home.
  • 4. Why parallel and distributed computing? • Moore’s law: the number of transistors double every 18 months.
  • 5. Why parallel and distributed computing? • How to make good use of the increasing number of transistors on a chip? – Increase the computation width (70’s and 80’s) • 4bit->8bit->16bit->32bit->64bit->…. – Instruction level parallelism (90’s) • Pipeline, LIW, etc – ILP to the extreme (early 00’s) • Out of order execution, 6-way issues, etc – Sequential program performance keeps increasing. • The clock rate keeps increasing, clock cycles get smaller and smaller.
  • 6. Why parallel and distributed computing? • The fundament limit of the speed of a sequential processor. – Power wall (high frequency results in heat) – Latency wall (speed of light does not change)
  • 7. Power wall: what will happen if we keep pushing ILP?
  • 8. Latency wall • Speed of light = 300000000m/s • One cycle at 4Ghz = 0.00000000025s • The distance that the light can move at one cycle: – 0.00000000025 * 300000000 = 7.5cm Intel chip dimension = 1.47 in x 1.47 in = 3.73cm x 3.73cm
  • 9. Why parallel and distributed computing? • Power wall and latency wall indicate that the era of single thread performance improvement through Moore’s law is ending/has ended. • More transistors on a chip are now applied to increase system throughput, the total number of instructions executed, but not latency, the time for a job to finish. – Improving ILP improves both. – We see a multi-core era
  • 10. • The marching of multi-core (2004-now) – Mainstream processors: • INTEL Quad-core Xeon (4 cores), • AMD Quad-core Opteron (4 cores), • SUN Niagara (8 cores), • IBM Power6 (2 cores) – Others • Intel Tflops (80 cores) • CISCO CSR-1 (180 cores) (network processors) – Increase the throughput without increasing the clock rate.
  • 11. • Why parallel and distributed computing? – Programming multi-core systems is fundamentally different from programming traditional computers. • Parallelism needs to be explicitly expressed in the program. – This is traditionally referred to as parallel computing. • PDC is not really a choice anymore.
  • 12. For (I=0; I<500; I++) a[I] = 0; 1970 1980 200620001990 Performance GO PARALLEL
  • 13. • In the foreseeable future, parallel systems with multi-core processors are going mainstream. – Lots of hardware, software, and human issues in mainstream parallel/distributed computing • How to make use a large number of processors simultaneously • How to write parallel programs easily and efficiently? • What kind of software supports is needed? • Analyzing and optimizing parallel programs is still a hard problem. • How to debug concurrent program?
  • 14. Parallel computing issues • Data sharing – single versus multiple address space. • Process coordination – synchronization using locks, messages, and other means. • Distributed versus centralized memory. • Connectivity – single shared bus versus network with many different topologies. • Fault tolerance/reliability.
  • 15. Distributed systems issues • Transparency – Access (data representation), location, migration, relocation, replication, concurrency, failure, persistence • Scalability – Size, geographically • Reliability/fault tolerance
  • 16. What we will be doing in this course? • This is the first class in PDC. – Systematically introduce most mainstream parallel computing platforms and programming paradigms. • The coverage is not necessary in depth.