SlideShare a Scribd company logo
Introduction to Parallel Computing
Ananth Grama, Anshul Gupta, George Karypis, and Vipin Kumar
To accompany the text ``Introduction to Parallel Computing'',
Addison Wesley, 2003.
Topic Overview
• Motivating Parallelism
• Scope of Parallel Computing Applications
• Organization and Contents of the Course
Motivating Parallelism
• The role of parallelism in accelerating computing speeds
has been recognized for several decades.
• Its role in providing multiplicity of datapaths and
increased access to storage elements has been
significant in commercial applications.
• The scalable performance and lower cost of parallel
platforms is reflected in the wide variety of applications.
Motivating Parallelism
• Developing parallel hardware and software has traditionally been
time and effort intensive.
• If one is to view this in the context of rapidly improving uniprocessor
speeds, one is tempted to question the need for parallel computing.
• There are some unmistakable trends in hardware design, which
indicate that uniprocessor (or implicitly parallel) architectures may
not be able to sustain the rate of realizable performance increments
in the future.
• This is the result of a number of fundamental physical and
computational limitations.
• The emergence of standardized parallel programming environments,
libraries, and hardware have significantly reduced time to (parallel)
solution.
The Computational Power Argument
Moore's law states [1965]:
``The complexity for minimum component costs has
increased at a rate of roughly a factor of two per year.
Certainly over the short term this rate can be expected to
continue, if not to increase. Over the longer term, the
rate of increase is a bit more uncertain, although there is
no reason to believe it will not remain nearly constant for
at least 10 years. That means by 1975, the number of
components per integrated circuit for minimum cost will
be 65,000.''
The Computational Power Argument
Moore attributed this doubling rate to exponential
behavior of die sizes, finer minimum dimensions, and
``circuit and device cleverness''.
In 1975, he revised this law as follows:
``There is no room left to squeeze anything out by
being clever. Going forward from here we have to
depend on the two size factors - bigger dies and finer
dimensions.''
He revised his rate of circuit complexity doubling to
18 months and projected from 1975 onwards at this
reduced rate.
The Computational Power Argument
• If one is to buy into Moore's law, the question still
remains - how does one translate transistors into useful
OPS (operations per second)?
• The logical recourse is to rely on parallelism, both
implicit and explicit.
• Most serial (or seemingly serial) processors rely
extensively on implicit parallelism.
• We focus in this class, for the most part, on explicit
parallelism.
The Memory/Disk Speed Argument
• While clock rates of high-end processors have increased at roughly
40% per year over the past decade, DRAM access times have only
improved at the rate of roughly 10% per year over this interval.
• This mismatch in speeds causes significant performance
bottlenecks.
• Parallel platforms provide increased bandwidth to the memory
system.
• Parallel platforms also provide higher aggregate caches.
• Principles of locality of data reference and bulk access, which guide
parallel algorithm design also apply to memory optimization.
• Some of the fastest growing applications of parallel computing utilize
not their raw computational speed, rather their ability to pump data
to memory and disk faster.
The Data Communication Argument
• As the network evolves, the vision of the Internet as one
large computing platform has emerged.
• This view is exploited by applications such as
SETI@home and Folding@home.
• In many other applications (typically databases and data
mining) the volume of data is such that they cannot be
moved.
• Any analyses on this data must be performed over the
network using parallel techniques.
Scope of Parallel Computing Applications
• Parallelism finds applications in very diverse application
domains for different motivating reasons.
• These range from improved application performance to
cost considerations.
Applications in Engineering and Design
• Design of airfoils (optimizing lift, drag, stability), internal
combustion engines (optimizing charge distribution,
burn), high-speed circuits (layouts for delays and
capacitive and inductive effects), and structures
(optimizing structural integrity, design parameters, cost,
etc.).
• Design and simulation of micro- and nano-scale
systems.
• Process optimization, operations research.
Scientific Applications
• Functional and structural characterization of genes and
proteins.
• Advances in computational physics and chemistry have
explored new materials, understanding of chemical
pathways, and more efficient processes.
• Applications in astrophysics have explored the evolution
of galaxies, thermonuclear processes, and the analysis
of extremely large datasets from telescopes.
• Weather modeling, mineral prospecting, flood prediction,
etc., are other important applications.
• Bioinformatics and astrophysics also present some of
the most challenging problems with respect to analyzing
extremely large datasets.
Commercial Applications
• Some of the largest parallel computers power the wall
street!
• Data mining and analysis for optimizing business and
marketing decisions.
• Large scale servers (mail and web servers) are often
implemented using parallel platforms.
• Applications such as information retrieval and search are
typically powered by large clusters.
Applications in Computer Systems
• Network intrusion detection, cryptography, multiparty
computations are some of the core users of parallel
computing techniques.
• Embedded systems increasingly rely on distributed
control algorithms.
• A modern automobile consists of tens of processors
communicating to perform complex tasks for optimizing
handling and performance.
• Conventional structured peer-to-peer networks impose
overlay networks and utilize algorithms directly from
parallel computing.
Organization and Contents of this Course
• Fundamentals: This part of the class covers basic
parallel platforms, principles of algorithm design, group
communication primitives, and analytical modeling
techniques.
• Parallel Programming: This part of the class deals with
programming using message passing libraries and
threads.
• Parallel Algorithms: This part of the class covers basic
algorithms for matrix computations, graphs, sorting,
discrete optimization, and dynamic programming.

More Related Content

What's hot

Real time system
Real time systemReal time system
Real time system
Aftab Hossain
 
Decomposition technique In Software Engineering
Decomposition technique In Software Engineering Decomposition technique In Software Engineering
Decomposition technique In Software Engineering
Bilal Hassan
 
Elaboration and domain model
Elaboration and domain modelElaboration and domain model
Elaboration and domain model
Vignesh Saravanan
 
Knowledge based systems
Knowledge based systemsKnowledge based systems
Knowledge based systems
Yowan Rdotexe
 
Case based reasoning
Case based reasoningCase based reasoning
Case based reasoning
ParthVichhi1
 
Draw and explain the architecture of general purpose microprocessor
Draw and explain the architecture of general purpose microprocessor Draw and explain the architecture of general purpose microprocessor
Draw and explain the architecture of general purpose microprocessor
Pundra university Science and technology
 
Google App Engine
Google App EngineGoogle App Engine
Google App Engine
Saiteja Kaparthi
 
Software engineering lecture notes
Software engineering lecture notesSoftware engineering lecture notes
Software engineering lecture notesSiva Ayyakutti
 
System engineering
System engineeringSystem engineering
System engineering
Lisa Elisa
 
distributed shared memory
 distributed shared memory distributed shared memory
distributed shared memoryAshish Kumar
 
Feng’s classification
Feng’s classificationFeng’s classification
Feng’s classification
Narayan Kandel
 
Domain Modeling
Domain ModelingDomain Modeling
Domain Modeling
Harsh Jegadeesan
 
Quadric surfaces
Quadric surfacesQuadric surfaces
Quadric surfaces
Ankur Kumar
 
Basic communication operations - One to all Broadcast
Basic communication operations - One to all BroadcastBasic communication operations - One to all Broadcast
Basic communication operations - One to all Broadcast
RashiJoshi11
 
raster and random scan
raster and random scanraster and random scan
raster and random scan
Sonia Pahuja
 
Object oriented analysis
Object oriented analysisObject oriented analysis
Object oriented analysisMahesh Bhalerao
 
Lec 4 (program and network properties)
Lec 4 (program and network properties)Lec 4 (program and network properties)
Lec 4 (program and network properties)
Sudarshan Mondal
 
Object modeling
Object modelingObject modeling
Object modeling
Preeti Mishra
 
advanced computer architesture-conditions of parallelism
advanced computer architesture-conditions of parallelismadvanced computer architesture-conditions of parallelism
advanced computer architesture-conditions of parallelism
Pankaj Kumar Jain
 

What's hot (20)

Real time system
Real time systemReal time system
Real time system
 
Decomposition technique In Software Engineering
Decomposition technique In Software Engineering Decomposition technique In Software Engineering
Decomposition technique In Software Engineering
 
Parallel Algorithms
Parallel AlgorithmsParallel Algorithms
Parallel Algorithms
 
Elaboration and domain model
Elaboration and domain modelElaboration and domain model
Elaboration and domain model
 
Knowledge based systems
Knowledge based systemsKnowledge based systems
Knowledge based systems
 
Case based reasoning
Case based reasoningCase based reasoning
Case based reasoning
 
Draw and explain the architecture of general purpose microprocessor
Draw and explain the architecture of general purpose microprocessor Draw and explain the architecture of general purpose microprocessor
Draw and explain the architecture of general purpose microprocessor
 
Google App Engine
Google App EngineGoogle App Engine
Google App Engine
 
Software engineering lecture notes
Software engineering lecture notesSoftware engineering lecture notes
Software engineering lecture notes
 
System engineering
System engineeringSystem engineering
System engineering
 
distributed shared memory
 distributed shared memory distributed shared memory
distributed shared memory
 
Feng’s classification
Feng’s classificationFeng’s classification
Feng’s classification
 
Domain Modeling
Domain ModelingDomain Modeling
Domain Modeling
 
Quadric surfaces
Quadric surfacesQuadric surfaces
Quadric surfaces
 
Basic communication operations - One to all Broadcast
Basic communication operations - One to all BroadcastBasic communication operations - One to all Broadcast
Basic communication operations - One to all Broadcast
 
raster and random scan
raster and random scanraster and random scan
raster and random scan
 
Object oriented analysis
Object oriented analysisObject oriented analysis
Object oriented analysis
 
Lec 4 (program and network properties)
Lec 4 (program and network properties)Lec 4 (program and network properties)
Lec 4 (program and network properties)
 
Object modeling
Object modelingObject modeling
Object modeling
 
advanced computer architesture-conditions of parallelism
advanced computer architesture-conditions of parallelismadvanced computer architesture-conditions of parallelism
advanced computer architesture-conditions of parallelism
 

Similar to Chap1 slides

Introduction to Parallel Computing
Introduction to Parallel ComputingIntroduction to Parallel Computing
Introduction to Parallel Computing
Roshan Karunarathna
 
3 - Grid Computing.pptx
3 - Grid Computing.pptx3 - Grid Computing.pptx
3 - Grid Computing.pptx
RiazSalim1
 
Grid computing
Grid computingGrid computing
42 grid computing
42 grid computing42 grid computing
42 grid computingSALMAN SHAIKH
 
42 grid computing
42 grid computing42 grid computing
42 grid computingSALMAN SHAIKH
 
Cluster Computing
Cluster ComputingCluster Computing
Cluster Computing
Hitesh Mohapatra
 
Grid computing Seminar PPT
Grid computing Seminar PPTGrid computing Seminar PPT
Grid computing Seminar PPTUpender Upr
 
Cloud Computing and Virtualization Overview by Amr Ali
Cloud Computing and Virtualization Overview by Amr AliCloud Computing and Virtualization Overview by Amr Ali
Cloud Computing and Virtualization Overview by Amr Ali
Amr Ali
 
CF-Chapitre1.pdf
CF-Chapitre1.pdfCF-Chapitre1.pdf
CF-Chapitre1.pdf
mohameddouhabi2002
 
Cloud computing
Cloud computingCloud computing
Cloud computing
anumidha
 
Elastic Software Infrastructure to Support the Industrial Internet
Elastic Software Infrastructure to Support the Industrial InternetElastic Software Infrastructure to Support the Industrial Internet
Elastic Software Infrastructure to Support the Industrial Internet
Real-Time Innovations (RTI)
 
Simulating Heterogeneous Resources in CloudLightning
Simulating Heterogeneous Resources in CloudLightningSimulating Heterogeneous Resources in CloudLightning
Simulating Heterogeneous Resources in CloudLightning
CloudLightning
 
pp01.pptx
pp01.pptxpp01.pptx
pp01.pptx
SusheelGeorgeJoseph
 
A Case Study On Implementation Of Grid Computing To Academic Institution
A Case Study On Implementation Of Grid Computing To Academic InstitutionA Case Study On Implementation Of Grid Computing To Academic Institution
A Case Study On Implementation Of Grid Computing To Academic Institution
Arlene Smith
 
Simulation of Heterogeneous Cloud Infrastructures
Simulation of Heterogeneous Cloud InfrastructuresSimulation of Heterogeneous Cloud Infrastructures
Simulation of Heterogeneous Cloud Infrastructures
CloudLightning
 
Cloud-mod1-chap1.pptx
Cloud-mod1-chap1.pptxCloud-mod1-chap1.pptx
Cloud-mod1-chap1.pptx
chayashree12
 
CC unit 1.pptx
CC unit 1.pptxCC unit 1.pptx
CC unit 1.pptx
DivyaRadharapu1
 
High–Performance Computing
High–Performance ComputingHigh–Performance Computing
High–Performance Computing
BRAC University Computer Club
 
CS8791 CLOUD COMPUTING_UNIT-I_FINAL_ppt (1).pptx
CS8791 CLOUD COMPUTING_UNIT-I_FINAL_ppt (1).pptxCS8791 CLOUD COMPUTING_UNIT-I_FINAL_ppt (1).pptx
CS8791 CLOUD COMPUTING_UNIT-I_FINAL_ppt (1).pptx
MALATHYANANDAN
 

Similar to Chap1 slides (20)

Introduction to Parallel Computing
Introduction to Parallel ComputingIntroduction to Parallel Computing
Introduction to Parallel Computing
 
3 - Grid Computing.pptx
3 - Grid Computing.pptx3 - Grid Computing.pptx
3 - Grid Computing.pptx
 
Grid computing
Grid computingGrid computing
Grid computing
 
42 grid computing
42 grid computing42 grid computing
42 grid computing
 
42 grid computing
42 grid computing42 grid computing
42 grid computing
 
Cluster Computing
Cluster ComputingCluster Computing
Cluster Computing
 
Grid computing Seminar PPT
Grid computing Seminar PPTGrid computing Seminar PPT
Grid computing Seminar PPT
 
Cloud Computing and Virtualization Overview by Amr Ali
Cloud Computing and Virtualization Overview by Amr AliCloud Computing and Virtualization Overview by Amr Ali
Cloud Computing and Virtualization Overview by Amr Ali
 
CF-Chapitre1.pdf
CF-Chapitre1.pdfCF-Chapitre1.pdf
CF-Chapitre1.pdf
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
Elastic Software Infrastructure to Support the Industrial Internet
Elastic Software Infrastructure to Support the Industrial InternetElastic Software Infrastructure to Support the Industrial Internet
Elastic Software Infrastructure to Support the Industrial Internet
 
Simulating Heterogeneous Resources in CloudLightning
Simulating Heterogeneous Resources in CloudLightningSimulating Heterogeneous Resources in CloudLightning
Simulating Heterogeneous Resources in CloudLightning
 
pp01.pptx
pp01.pptxpp01.pptx
pp01.pptx
 
A Case Study On Implementation Of Grid Computing To Academic Institution
A Case Study On Implementation Of Grid Computing To Academic InstitutionA Case Study On Implementation Of Grid Computing To Academic Institution
A Case Study On Implementation Of Grid Computing To Academic Institution
 
Simulation of Heterogeneous Cloud Infrastructures
Simulation of Heterogeneous Cloud InfrastructuresSimulation of Heterogeneous Cloud Infrastructures
Simulation of Heterogeneous Cloud Infrastructures
 
Cloud-mod1-chap1.pptx
Cloud-mod1-chap1.pptxCloud-mod1-chap1.pptx
Cloud-mod1-chap1.pptx
 
CC unit 1.pptx
CC unit 1.pptxCC unit 1.pptx
CC unit 1.pptx
 
High–Performance Computing
High–Performance ComputingHigh–Performance Computing
High–Performance Computing
 
CS8791 CLOUD COMPUTING_UNIT-I_FINAL_ppt (1).pptx
CS8791 CLOUD COMPUTING_UNIT-I_FINAL_ppt (1).pptxCS8791 CLOUD COMPUTING_UNIT-I_FINAL_ppt (1).pptx
CS8791 CLOUD COMPUTING_UNIT-I_FINAL_ppt (1).pptx
 
Grid computing
Grid computingGrid computing
Grid computing
 

More from BaliThorat1

Lec15 sfm
Lec15 sfmLec15 sfm
Lec15 sfm
BaliThorat1
 
Lec14 multiview stereo
Lec14 multiview stereoLec14 multiview stereo
Lec14 multiview stereo
BaliThorat1
 
Lec13 stereo converted
Lec13 stereo convertedLec13 stereo converted
Lec13 stereo converted
BaliThorat1
 
Lec12 epipolar
Lec12 epipolarLec12 epipolar
Lec12 epipolar
BaliThorat1
 
Lec11 single view-converted
Lec11 single view-convertedLec11 single view-converted
Lec11 single view-converted
BaliThorat1
 
Lec10 alignment
Lec10 alignmentLec10 alignment
Lec10 alignment
BaliThorat1
 
Lec09 hough
Lec09 houghLec09 hough
Lec09 hough
BaliThorat1
 
Lec08 fitting
Lec08 fittingLec08 fitting
Lec08 fitting
BaliThorat1
 
8 operating system concept
8 operating system concept8 operating system concept
8 operating system concept
BaliThorat1
 
7 processor
7 processor7 processor
7 processor
BaliThorat1
 
6 input output devices
6 input output devices6 input output devices
6 input output devices
BaliThorat1
 
2 windows operating system
2 windows operating system2 windows operating system
2 windows operating system
BaliThorat1
 
5 computer memory
5 computer memory5 computer memory
5 computer memory
BaliThorat1
 
4 computer languages
4 computer languages4 computer languages
4 computer languages
BaliThorat1
 
1 fundamentals of computer
1 fundamentals of computer1 fundamentals of computer
1 fundamentals of computer
BaliThorat1
 
1 fundamentals of computer system
1 fundamentals of computer system1 fundamentals of computer system
1 fundamentals of computer system
BaliThorat1
 
Computer generation and classification
Computer generation and classificationComputer generation and classification
Computer generation and classification
BaliThorat1
 
Algorithm and flowchart
Algorithm and flowchartAlgorithm and flowchart
Algorithm and flowchart
BaliThorat1
 
6 cpu scheduling
6 cpu scheduling6 cpu scheduling
6 cpu scheduling
BaliThorat1
 
5 process synchronization
5 process synchronization5 process synchronization
5 process synchronization
BaliThorat1
 

More from BaliThorat1 (20)

Lec15 sfm
Lec15 sfmLec15 sfm
Lec15 sfm
 
Lec14 multiview stereo
Lec14 multiview stereoLec14 multiview stereo
Lec14 multiview stereo
 
Lec13 stereo converted
Lec13 stereo convertedLec13 stereo converted
Lec13 stereo converted
 
Lec12 epipolar
Lec12 epipolarLec12 epipolar
Lec12 epipolar
 
Lec11 single view-converted
Lec11 single view-convertedLec11 single view-converted
Lec11 single view-converted
 
Lec10 alignment
Lec10 alignmentLec10 alignment
Lec10 alignment
 
Lec09 hough
Lec09 houghLec09 hough
Lec09 hough
 
Lec08 fitting
Lec08 fittingLec08 fitting
Lec08 fitting
 
8 operating system concept
8 operating system concept8 operating system concept
8 operating system concept
 
7 processor
7 processor7 processor
7 processor
 
6 input output devices
6 input output devices6 input output devices
6 input output devices
 
2 windows operating system
2 windows operating system2 windows operating system
2 windows operating system
 
5 computer memory
5 computer memory5 computer memory
5 computer memory
 
4 computer languages
4 computer languages4 computer languages
4 computer languages
 
1 fundamentals of computer
1 fundamentals of computer1 fundamentals of computer
1 fundamentals of computer
 
1 fundamentals of computer system
1 fundamentals of computer system1 fundamentals of computer system
1 fundamentals of computer system
 
Computer generation and classification
Computer generation and classificationComputer generation and classification
Computer generation and classification
 
Algorithm and flowchart
Algorithm and flowchartAlgorithm and flowchart
Algorithm and flowchart
 
6 cpu scheduling
6 cpu scheduling6 cpu scheduling
6 cpu scheduling
 
5 process synchronization
5 process synchronization5 process synchronization
5 process synchronization
 

Recently uploaded

Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
Jheel Barad
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
EduSkills OECD
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
Anna Sz.
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
MIRIAMSALINAS13
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
kaushalkr1407
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
bennyroshan06
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
Jisc
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS Module
Celine George
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
Vivekanand Anglo Vedic Academy
 

Recently uploaded (20)

Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS Module
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
 

Chap1 slides

  • 1. Introduction to Parallel Computing Ananth Grama, Anshul Gupta, George Karypis, and Vipin Kumar To accompany the text ``Introduction to Parallel Computing'', Addison Wesley, 2003.
  • 2. Topic Overview • Motivating Parallelism • Scope of Parallel Computing Applications • Organization and Contents of the Course
  • 3. Motivating Parallelism • The role of parallelism in accelerating computing speeds has been recognized for several decades. • Its role in providing multiplicity of datapaths and increased access to storage elements has been significant in commercial applications. • The scalable performance and lower cost of parallel platforms is reflected in the wide variety of applications.
  • 4. Motivating Parallelism • Developing parallel hardware and software has traditionally been time and effort intensive. • If one is to view this in the context of rapidly improving uniprocessor speeds, one is tempted to question the need for parallel computing. • There are some unmistakable trends in hardware design, which indicate that uniprocessor (or implicitly parallel) architectures may not be able to sustain the rate of realizable performance increments in the future. • This is the result of a number of fundamental physical and computational limitations. • The emergence of standardized parallel programming environments, libraries, and hardware have significantly reduced time to (parallel) solution.
  • 5. The Computational Power Argument Moore's law states [1965]: ``The complexity for minimum component costs has increased at a rate of roughly a factor of two per year. Certainly over the short term this rate can be expected to continue, if not to increase. Over the longer term, the rate of increase is a bit more uncertain, although there is no reason to believe it will not remain nearly constant for at least 10 years. That means by 1975, the number of components per integrated circuit for minimum cost will be 65,000.''
  • 6. The Computational Power Argument Moore attributed this doubling rate to exponential behavior of die sizes, finer minimum dimensions, and ``circuit and device cleverness''. In 1975, he revised this law as follows: ``There is no room left to squeeze anything out by being clever. Going forward from here we have to depend on the two size factors - bigger dies and finer dimensions.'' He revised his rate of circuit complexity doubling to 18 months and projected from 1975 onwards at this reduced rate.
  • 7. The Computational Power Argument • If one is to buy into Moore's law, the question still remains - how does one translate transistors into useful OPS (operations per second)? • The logical recourse is to rely on parallelism, both implicit and explicit. • Most serial (or seemingly serial) processors rely extensively on implicit parallelism. • We focus in this class, for the most part, on explicit parallelism.
  • 8. The Memory/Disk Speed Argument • While clock rates of high-end processors have increased at roughly 40% per year over the past decade, DRAM access times have only improved at the rate of roughly 10% per year over this interval. • This mismatch in speeds causes significant performance bottlenecks. • Parallel platforms provide increased bandwidth to the memory system. • Parallel platforms also provide higher aggregate caches. • Principles of locality of data reference and bulk access, which guide parallel algorithm design also apply to memory optimization. • Some of the fastest growing applications of parallel computing utilize not their raw computational speed, rather their ability to pump data to memory and disk faster.
  • 9. The Data Communication Argument • As the network evolves, the vision of the Internet as one large computing platform has emerged. • This view is exploited by applications such as SETI@home and Folding@home. • In many other applications (typically databases and data mining) the volume of data is such that they cannot be moved. • Any analyses on this data must be performed over the network using parallel techniques.
  • 10. Scope of Parallel Computing Applications • Parallelism finds applications in very diverse application domains for different motivating reasons. • These range from improved application performance to cost considerations.
  • 11. Applications in Engineering and Design • Design of airfoils (optimizing lift, drag, stability), internal combustion engines (optimizing charge distribution, burn), high-speed circuits (layouts for delays and capacitive and inductive effects), and structures (optimizing structural integrity, design parameters, cost, etc.). • Design and simulation of micro- and nano-scale systems. • Process optimization, operations research.
  • 12. Scientific Applications • Functional and structural characterization of genes and proteins. • Advances in computational physics and chemistry have explored new materials, understanding of chemical pathways, and more efficient processes. • Applications in astrophysics have explored the evolution of galaxies, thermonuclear processes, and the analysis of extremely large datasets from telescopes. • Weather modeling, mineral prospecting, flood prediction, etc., are other important applications. • Bioinformatics and astrophysics also present some of the most challenging problems with respect to analyzing extremely large datasets.
  • 13. Commercial Applications • Some of the largest parallel computers power the wall street! • Data mining and analysis for optimizing business and marketing decisions. • Large scale servers (mail and web servers) are often implemented using parallel platforms. • Applications such as information retrieval and search are typically powered by large clusters.
  • 14. Applications in Computer Systems • Network intrusion detection, cryptography, multiparty computations are some of the core users of parallel computing techniques. • Embedded systems increasingly rely on distributed control algorithms. • A modern automobile consists of tens of processors communicating to perform complex tasks for optimizing handling and performance. • Conventional structured peer-to-peer networks impose overlay networks and utilize algorithms directly from parallel computing.
  • 15. Organization and Contents of this Course • Fundamentals: This part of the class covers basic parallel platforms, principles of algorithm design, group communication primitives, and analytical modeling techniques. • Parallel Programming: This part of the class deals with programming using message passing libraries and threads. • Parallel Algorithms: This part of the class covers basic algorithms for matrix computations, graphs, sorting, discrete optimization, and dynamic programming.