SlideShare a Scribd company logo
1 of 33
Jaliya Ekanayake and Geoffrey Fox
           School of Informatics and Computing
               Indiana University Bloomington
Cloud Computing and Software Services: Theory and Techniques
                         July, 2010
                                                     Presented by:
                                                   Inderjeet Singh
   Introduction
   Problem
   Data Analysis Applications
   Evaluations and Analysis
   Performance of MPI on Clouds
   Benchmarks and Results
   Conclusions and Future Work
   Critique
   Apache Hadoop (OpenSource version of Google MapReduce)
   DryadLINQ (Microsoft API for Dryad)
   CGL-MapReduce (Iterative version of MapReduce)

Cloud technologies/Parallel Runtimes/Cloud Runtimes
   On demand provisioning of resources
   Customizable Virtual Machines (VM)
   Root privileges
   Provisioning is very fast (within minutes)
   You pay only for what you use
   Better resource utilization
Cloud Technologies
 Moving computation to data
 Better Quality of Service (QoS)
 Simple communication topologies
 Distributed file system (HDFS,GFS)


Most HPC applications are based upon MPI
 Many fine grained communication topologies
 Usage of fast network
Software framework to support distributed computing
    on large datasets on cluster of computers

   Map step - The master node takes the input, partitions it
    up into smaller sub-problems, and distributes them to
    worker nodes. A worker node may do this again in
    turn, leading to a multi-level tree structure. The worker
    node processes the smaller problem, and passes the
    answer back to its master node

   Reduce step - The master node collects the answers to all
    the sub-problems and combines them in some way to
    form the output or answer
Large data/compute intensive applications
Traditional approach
 Execution on Clusters/grid/supercomputers
 Moving     both application and data to     available
  computational power
 Efficiency decreases with large datasets


Better approach
 Execution with Cloud technologies
 Moving computations to data to perform processing
 More data centric approach
Comparisons of features supported by different
cloud technologies and MPI
   What applications are best handled by cloud
    technologies?
   What overheads do they introduce?
   Can traditional parallel runtimes such as MPI
    be used in cloud?
   If so, what overheads do they have?
Types of Applications (Based upon
    communication)

   Map only (Cap3)
   Map Reduce (HEP)
   Iterative/Complex style (Matrix Multiplication and
    K-Means Clustering)
   Cap3 - Sequence assembly program that operates
    on a collection of gene sequence files to produce
    several outputs

   HEP - High Energy Physics data analysis application

   K-Means clustering - Performs iteratively refining
    computation of clusters

   Matrix Multiplication – Cannon’s algorithm
   MapReduce does not support iterative/complex style
    applications so [Fox] build CGL- MapReduce
   CGL-Mapreduce – Supports long running tasks and retains
    static data in memory across invocations
   Performance (average running time)
   Overhead = [P * T(P) – T(1)]/T(1)
    P = No. of processes




                                         DryadLINQ


                                         Hadoop/
                                         CGL
                                         MapReduce/M
                                         PI
   CAP3 (map only) and HEP (mapreduce) perform well
    with cloud runtimes
   K-means clustering (iterative) and matrix
    multiplications (iterative) show high overheads with
    cloud runtimes compared to MPI runtime
   CGL-Mapreduce also gives less overhead for large
    datasets
Goals
   Overhead of Virtual Machines (VM) on parallel
    applications in MPI
   How applications with different
    communication/computation (c/c) ratio perform on
    cloud?
   Effect of different CPU core assignment strategies
    on VMs and running these MPI applications on
    these VMs
Three MPI applications with different c/c
    ratios requirements

   Matrix multiplication (Cannon’s algorithm)
   K-Means clustering
   Concurrent wave solver
Computation and Communication complexities of the
different MPI applications used
   Eucalyptus     and          Xen       based           cloud
    infrastructure
      16 nodes with 2 Quad Core Intel Xeon processors and
       32 GB of memory
      Nodes connected with 1 gigabit Ethernet connection
   Same s/w configuration for both bare-metal
    nodes and VMs
     • OS - Red Hat Enterprise Linux Server release 5.2
     • OpenMP version 1.3.2
Different CPU core/virtual machines assignment strategies


Invariant to select the number of MPI processes
 Number of MPI processes = Number of CPU cores used
Performance – 64 CPU Cores      Speedup – Fixed Matrix size
                                (5184*5184)

 ◦ Speedup decrease 34% between Bare metal and 8-VM/node
   at 81 processes
 ◦ Exchange of large messages and more communication
Performance – 128 CPU Cores          Total overhead (Number of MPI
                                     Processes =128)
    ◦ Communication is very less than computations
    ◦ Communication here depends upon number of clusters formed
    ◦ Overhead is large for small data sizes, so less speedup is
      observed
Total Overhead (Number of MPI
Performance – 128 CPU Cores         Processes = 128)

   ◦ Amount of communications is fixed, less data transfer rates
   ◦ Lower c/c ratio of O(1/n) leads to more latency and lower
     performance on VMs
   ◦ 8-VMs per node has 7% more overhead than bare metal node
Communication between dom0 and domUs when 1-VM per node is deployed
(top). Communication between dom0 and domUs when 8-VMs per node are
deployed (bottom)

◦ In multi VMs configuration scheduling of I/O
  operation of DomUs (user domains) happens via
  Dom0 (privileged OS)
Figure: LAM vs. OpenMPI in different VM configurations


   When using mutliple VMs on multi-core CPUs, it is good to
    use runtimes supporting in-node communications
    (OpenMP vs LAM-MPI)
   Cloud runtimes work well for pleasing parallel (map
    only and mapreduce) applications with large
    datasets
   Overheads of cloud runtimes are high with parallel
    applications    that    require    iterative/complex
    communication patterns (MPI based applications)
   Work needs to be done on finding algorithms for
    these applications that are cloud friendly
   CGL-MapReduce is efficient for iterative style
    mapreduce applications (k-means)
   Overheads for MPI applications increase as number
    of VMs/node increase (22-50% degradation)
   In-node communication in important
   MapReduce applications (not susceptible to
    latencies) may perform well on VMs deployed on
    clouds
   Integration of MapReduce and MPI (biological DNA
    sequencing application)
   No results of implementation of pleasing parallel
    applications (Cap3, HEP) with MPI, missing MPI and
    cloud runtimes time comparisons
   Missing     evaluations    of    HPC     applications
    implemented with cloud runtimes on private
    cloud, which is critical to show the effect of multi
    VMs/multi-core configurations on performances of
    these applications
   Difference in memory sizes (16/32 GB) for clusters
    of different OS. This could lead to biased results
   Ekanayake Jaliya and Fox Geoffrey, High Performance Parallel
    Computing with Clouds and Cloud Technologies, Lecture Notes of
    the Institute for Computer Sciences, Social Informatics and
    Telecommunications Engineering (2010), Pages 20, Volume 34

   High Performance Parallel Computing with Clouds and Cloud
    Technologies. http://www.slideshare.net/jaliyae/high-performance-
    parallel-computing-with-clouds-and-cloud-technologies

   Map Reduce, Wikipedia: http://en.wikipedia.org/wiki/MapReduce
HPC with Clouds and Cloud Technologies

More Related Content

What's hot

SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHYSPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHYcsandit
 
Energy and latency aware application
Energy and latency aware applicationEnergy and latency aware application
Energy and latency aware applicationcsandit
 
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...IJCNCJournal
 
PADAL19: Runtime-Assisted Locality Abstraction Using Elastic Places and Virtu...
PADAL19: Runtime-Assisted Locality Abstraction Using Elastic Places and Virtu...PADAL19: Runtime-Assisted Locality Abstraction Using Elastic Places and Virtu...
PADAL19: Runtime-Assisted Locality Abstraction Using Elastic Places and Virtu...LEGATO project
 
Apache Hadoop India Summit 2011 talk "Adaptive Parallel Computing over Distri...
Apache Hadoop India Summit 2011 talk "Adaptive Parallel Computing over Distri...Apache Hadoop India Summit 2011 talk "Adaptive Parallel Computing over Distri...
Apache Hadoop India Summit 2011 talk "Adaptive Parallel Computing over Distri...Yahoo Developer Network
 
Accelerating Real Time Applications on Heterogeneous Platforms
Accelerating Real Time Applications on Heterogeneous PlatformsAccelerating Real Time Applications on Heterogeneous Platforms
Accelerating Real Time Applications on Heterogeneous PlatformsIJMER
 
Estimation of Optimized Energy and Latency Constraint for Task Allocation in ...
Estimation of Optimized Energy and Latency Constraint for Task Allocation in ...Estimation of Optimized Energy and Latency Constraint for Task Allocation in ...
Estimation of Optimized Energy and Latency Constraint for Task Allocation in ...ijcsit
 
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDM O...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDM O...PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDM O...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDM O...ijgca
 
pMatlab on BlueGene
pMatlab on BlueGenepMatlab on BlueGene
pMatlab on BlueGenevsachde
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentIJERD Editor
 
An optimized scientific workflow scheduling in cloud computing
An optimized scientific workflow scheduling in cloud computingAn optimized scientific workflow scheduling in cloud computing
An optimized scientific workflow scheduling in cloud computingDIGVIJAY SHINDE
 
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENT
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENTDYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENT
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENTIJCNCJournal
 
Hadoop scheduler with deadline constraint
Hadoop scheduler with deadline constraintHadoop scheduler with deadline constraint
Hadoop scheduler with deadline constraintijccsa
 
Introduction to National Supercomputer center in Tianjin TH-1A Supercomputer
Introduction to National Supercomputer center in Tianjin TH-1A SupercomputerIntroduction to National Supercomputer center in Tianjin TH-1A Supercomputer
Introduction to National Supercomputer center in Tianjin TH-1A SupercomputerFörderverein Technische Fakultät
 
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...ijgca
 

What's hot (18)

SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHYSPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
 
TechreportTan14
TechreportTan14TechreportTan14
TechreportTan14
 
Energy and latency aware application
Energy and latency aware applicationEnergy and latency aware application
Energy and latency aware application
 
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...
 
PADAL19: Runtime-Assisted Locality Abstraction Using Elastic Places and Virtu...
PADAL19: Runtime-Assisted Locality Abstraction Using Elastic Places and Virtu...PADAL19: Runtime-Assisted Locality Abstraction Using Elastic Places and Virtu...
PADAL19: Runtime-Assisted Locality Abstraction Using Elastic Places and Virtu...
 
Apache Hadoop India Summit 2011 talk "Adaptive Parallel Computing over Distri...
Apache Hadoop India Summit 2011 talk "Adaptive Parallel Computing over Distri...Apache Hadoop India Summit 2011 talk "Adaptive Parallel Computing over Distri...
Apache Hadoop India Summit 2011 talk "Adaptive Parallel Computing over Distri...
 
Accelerating Real Time Applications on Heterogeneous Platforms
Accelerating Real Time Applications on Heterogeneous PlatformsAccelerating Real Time Applications on Heterogeneous Platforms
Accelerating Real Time Applications on Heterogeneous Platforms
 
Estimation of Optimized Energy and Latency Constraint for Task Allocation in ...
Estimation of Optimized Energy and Latency Constraint for Task Allocation in ...Estimation of Optimized Energy and Latency Constraint for Task Allocation in ...
Estimation of Optimized Energy and Latency Constraint for Task Allocation in ...
 
International Journal of Engineering Inventions (IJEI)
International Journal of Engineering Inventions (IJEI)International Journal of Engineering Inventions (IJEI)
International Journal of Engineering Inventions (IJEI)
 
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDM O...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDM O...PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDM O...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDM O...
 
pMatlab on BlueGene
pMatlab on BlueGenepMatlab on BlueGene
pMatlab on BlueGene
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
 
An optimized scientific workflow scheduling in cloud computing
An optimized scientific workflow scheduling in cloud computingAn optimized scientific workflow scheduling in cloud computing
An optimized scientific workflow scheduling in cloud computing
 
cloud schedualing
cloud schedualingcloud schedualing
cloud schedualing
 
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENT
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENTDYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENT
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENT
 
Hadoop scheduler with deadline constraint
Hadoop scheduler with deadline constraintHadoop scheduler with deadline constraint
Hadoop scheduler with deadline constraint
 
Introduction to National Supercomputer center in Tianjin TH-1A Supercomputer
Introduction to National Supercomputer center in Tianjin TH-1A SupercomputerIntroduction to National Supercomputer center in Tianjin TH-1A Supercomputer
Introduction to National Supercomputer center in Tianjin TH-1A Supercomputer
 
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...
 

Viewers also liked

Determining Relevance Rankings from Search Click Logs
Determining Relevance Rankings from Search Click LogsDetermining Relevance Rankings from Search Click Logs
Determining Relevance Rankings from Search Click LogsInderjeet Singh
 
Determining Relevance Rankings with Search Click Logs
Determining Relevance Rankings with Search Click LogsDetermining Relevance Rankings with Search Click Logs
Determining Relevance Rankings with Search Click LogsInderjeet Singh
 
All Pair Shortest Path Algorithm – Parallel Implementation and Analysis
All Pair Shortest Path Algorithm – Parallel Implementation and AnalysisAll Pair Shortest Path Algorithm – Parallel Implementation and Analysis
All Pair Shortest Path Algorithm – Parallel Implementation and AnalysisInderjeet Singh
 
Neural Network Classification and its Applications in Insurance Industry
Neural Network Classification and its Applications in Insurance IndustryNeural Network Classification and its Applications in Insurance Industry
Neural Network Classification and its Applications in Insurance IndustryInderjeet Singh
 
Neural Network Classification and its Applications in Insurance Industry
Neural Network Classification and its Applications in Insurance IndustryNeural Network Classification and its Applications in Insurance Industry
Neural Network Classification and its Applications in Insurance IndustryInderjeet Singh
 

Viewers also liked (6)

Project
ProjectProject
Project
 
Determining Relevance Rankings from Search Click Logs
Determining Relevance Rankings from Search Click LogsDetermining Relevance Rankings from Search Click Logs
Determining Relevance Rankings from Search Click Logs
 
Determining Relevance Rankings with Search Click Logs
Determining Relevance Rankings with Search Click LogsDetermining Relevance Rankings with Search Click Logs
Determining Relevance Rankings with Search Click Logs
 
All Pair Shortest Path Algorithm – Parallel Implementation and Analysis
All Pair Shortest Path Algorithm – Parallel Implementation and AnalysisAll Pair Shortest Path Algorithm – Parallel Implementation and Analysis
All Pair Shortest Path Algorithm – Parallel Implementation and Analysis
 
Neural Network Classification and its Applications in Insurance Industry
Neural Network Classification and its Applications in Insurance IndustryNeural Network Classification and its Applications in Insurance Industry
Neural Network Classification and its Applications in Insurance Industry
 
Neural Network Classification and its Applications in Insurance Industry
Neural Network Classification and its Applications in Insurance IndustryNeural Network Classification and its Applications in Insurance Industry
Neural Network Classification and its Applications in Insurance Industry
 

Similar to HPC with Clouds and Cloud Technologies

Hardback solution to accelerate multimedia computation through mgp in cmp
Hardback solution to accelerate multimedia computation through mgp in cmpHardback solution to accelerate multimedia computation through mgp in cmp
Hardback solution to accelerate multimedia computation through mgp in cmpeSAT Publishing House
 
Qiu bosc2010
Qiu bosc2010Qiu bosc2010
Qiu bosc2010BOSC 2010
 
2023comp90024_Spartan.pdf
2023comp90024_Spartan.pdf2023comp90024_Spartan.pdf
2023comp90024_Spartan.pdfLevLafayette1
 
Applying Cloud Techniques to Address Complexity in HPC System Integrations
Applying Cloud Techniques to Address Complexity in HPC System IntegrationsApplying Cloud Techniques to Address Complexity in HPC System Integrations
Applying Cloud Techniques to Address Complexity in HPC System Integrationsinside-BigData.com
 
Map Reduce in the Clouds (http://salsahpc.indiana.edu/mapreduceroles4azure/)
Map Reduce in the Clouds (http://salsahpc.indiana.edu/mapreduceroles4azure/)Map Reduce in the Clouds (http://salsahpc.indiana.edu/mapreduceroles4azure/)
Map Reduce in the Clouds (http://salsahpc.indiana.edu/mapreduceroles4azure/)Thilina Gunarathne
 
Optical Switching in the Datacenter
Optical Switching in the DatacenterOptical Switching in the Datacenter
Optical Switching in the DatacenterKostas Katrinis
 
Clustering by AKASHMSHAH
Clustering by AKASHMSHAHClustering by AKASHMSHAH
Clustering by AKASHMSHAHAkash M Shah
 
Distributed and Cloud Computing 1st Edition Hwang Solutions Manual
Distributed and Cloud Computing 1st Edition Hwang Solutions ManualDistributed and Cloud Computing 1st Edition Hwang Solutions Manual
Distributed and Cloud Computing 1st Edition Hwang Solutions Manualkyxeminut
 
Presented by Ahmed Abdulhakim Al-Absi - Scaling map reduce applications acro...
Presented by Ahmed Abdulhakim Al-Absi -  Scaling map reduce applications acro...Presented by Ahmed Abdulhakim Al-Absi -  Scaling map reduce applications acro...
Presented by Ahmed Abdulhakim Al-Absi - Scaling map reduce applications acro...Absi Ahmed
 
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...Yahoo Developer Network
 
Fugaku, the Successes and the Lessons Learned
Fugaku, the Successes and the Lessons LearnedFugaku, the Successes and the Lessons Learned
Fugaku, the Successes and the Lessons LearnedRCCSRENKEI
 
Application-Aware Big Data Deduplication in Cloud Environment
Application-Aware Big Data Deduplication in Cloud EnvironmentApplication-Aware Big Data Deduplication in Cloud Environment
Application-Aware Big Data Deduplication in Cloud EnvironmentSafayet Hossain
 
OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020OpenACC
 

Similar to HPC with Clouds and Cloud Technologies (20)

Hardback solution to accelerate multimedia computation through mgp in cmp
Hardback solution to accelerate multimedia computation through mgp in cmpHardback solution to accelerate multimedia computation through mgp in cmp
Hardback solution to accelerate multimedia computation through mgp in cmp
 
Qiu bosc2010
Qiu bosc2010Qiu bosc2010
Qiu bosc2010
 
Paper
PaperPaper
Paper
 
Gupta_Keynote_VTDC-3
Gupta_Keynote_VTDC-3Gupta_Keynote_VTDC-3
Gupta_Keynote_VTDC-3
 
E031201032036
E031201032036E031201032036
E031201032036
 
Chap 1(one) general introduction
Chap 1(one)  general introductionChap 1(one)  general introduction
Chap 1(one) general introduction
 
2023comp90024_Spartan.pdf
2023comp90024_Spartan.pdf2023comp90024_Spartan.pdf
2023comp90024_Spartan.pdf
 
Applying Cloud Techniques to Address Complexity in HPC System Integrations
Applying Cloud Techniques to Address Complexity in HPC System IntegrationsApplying Cloud Techniques to Address Complexity in HPC System Integrations
Applying Cloud Techniques to Address Complexity in HPC System Integrations
 
Map Reduce in the Clouds (http://salsahpc.indiana.edu/mapreduceroles4azure/)
Map Reduce in the Clouds (http://salsahpc.indiana.edu/mapreduceroles4azure/)Map Reduce in the Clouds (http://salsahpc.indiana.edu/mapreduceroles4azure/)
Map Reduce in the Clouds (http://salsahpc.indiana.edu/mapreduceroles4azure/)
 
Optical Switching in the Datacenter
Optical Switching in the DatacenterOptical Switching in the Datacenter
Optical Switching in the Datacenter
 
Clustering by AKASHMSHAH
Clustering by AKASHMSHAHClustering by AKASHMSHAH
Clustering by AKASHMSHAH
 
CLOUD BIOINFORMATICS Part1
 CLOUD BIOINFORMATICS Part1 CLOUD BIOINFORMATICS Part1
CLOUD BIOINFORMATICS Part1
 
Distributed and Cloud Computing 1st Edition Hwang Solutions Manual
Distributed and Cloud Computing 1st Edition Hwang Solutions ManualDistributed and Cloud Computing 1st Edition Hwang Solutions Manual
Distributed and Cloud Computing 1st Edition Hwang Solutions Manual
 
53
5353
53
 
Presented by Ahmed Abdulhakim Al-Absi - Scaling map reduce applications acro...
Presented by Ahmed Abdulhakim Al-Absi -  Scaling map reduce applications acro...Presented by Ahmed Abdulhakim Al-Absi -  Scaling map reduce applications acro...
Presented by Ahmed Abdulhakim Al-Absi - Scaling map reduce applications acro...
 
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
 
Fugaku, the Successes and the Lessons Learned
Fugaku, the Successes and the Lessons LearnedFugaku, the Successes and the Lessons Learned
Fugaku, the Successes and the Lessons Learned
 
Gurpinder_Resume
Gurpinder_ResumeGurpinder_Resume
Gurpinder_Resume
 
Application-Aware Big Data Deduplication in Cloud Environment
Application-Aware Big Data Deduplication in Cloud EnvironmentApplication-Aware Big Data Deduplication in Cloud Environment
Application-Aware Big Data Deduplication in Cloud Environment
 
OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020
 

Recently uploaded

Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 

Recently uploaded (20)

Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 

HPC with Clouds and Cloud Technologies

  • 1. Jaliya Ekanayake and Geoffrey Fox School of Informatics and Computing Indiana University Bloomington Cloud Computing and Software Services: Theory and Techniques July, 2010 Presented by: Inderjeet Singh
  • 2. Introduction  Problem  Data Analysis Applications  Evaluations and Analysis  Performance of MPI on Clouds  Benchmarks and Results  Conclusions and Future Work  Critique
  • 3.
  • 4. Apache Hadoop (OpenSource version of Google MapReduce)  DryadLINQ (Microsoft API for Dryad)  CGL-MapReduce (Iterative version of MapReduce) Cloud technologies/Parallel Runtimes/Cloud Runtimes
  • 5. On demand provisioning of resources  Customizable Virtual Machines (VM)  Root privileges  Provisioning is very fast (within minutes)  You pay only for what you use  Better resource utilization
  • 6. Cloud Technologies  Moving computation to data  Better Quality of Service (QoS)  Simple communication topologies  Distributed file system (HDFS,GFS) Most HPC applications are based upon MPI  Many fine grained communication topologies  Usage of fast network
  • 7. Software framework to support distributed computing on large datasets on cluster of computers  Map step - The master node takes the input, partitions it up into smaller sub-problems, and distributes them to worker nodes. A worker node may do this again in turn, leading to a multi-level tree structure. The worker node processes the smaller problem, and passes the answer back to its master node  Reduce step - The master node collects the answers to all the sub-problems and combines them in some way to form the output or answer
  • 8. Large data/compute intensive applications Traditional approach  Execution on Clusters/grid/supercomputers  Moving both application and data to available computational power  Efficiency decreases with large datasets Better approach  Execution with Cloud technologies  Moving computations to data to perform processing  More data centric approach
  • 9. Comparisons of features supported by different cloud technologies and MPI
  • 10. What applications are best handled by cloud technologies?  What overheads do they introduce?  Can traditional parallel runtimes such as MPI be used in cloud?  If so, what overheads do they have?
  • 11. Types of Applications (Based upon communication)  Map only (Cap3)  Map Reduce (HEP)  Iterative/Complex style (Matrix Multiplication and K-Means Clustering)
  • 12. Cap3 - Sequence assembly program that operates on a collection of gene sequence files to produce several outputs  HEP - High Energy Physics data analysis application  K-Means clustering - Performs iteratively refining computation of clusters  Matrix Multiplication – Cannon’s algorithm
  • 13.
  • 14. MapReduce does not support iterative/complex style applications so [Fox] build CGL- MapReduce  CGL-Mapreduce – Supports long running tasks and retains static data in memory across invocations
  • 15. Performance (average running time)  Overhead = [P * T(P) – T(1)]/T(1) P = No. of processes DryadLINQ Hadoop/ CGL MapReduce/M PI
  • 16.
  • 17.
  • 18. CAP3 (map only) and HEP (mapreduce) perform well with cloud runtimes  K-means clustering (iterative) and matrix multiplications (iterative) show high overheads with cloud runtimes compared to MPI runtime  CGL-Mapreduce also gives less overhead for large datasets
  • 19. Goals  Overhead of Virtual Machines (VM) on parallel applications in MPI  How applications with different communication/computation (c/c) ratio perform on cloud?  Effect of different CPU core assignment strategies on VMs and running these MPI applications on these VMs
  • 20. Three MPI applications with different c/c ratios requirements  Matrix multiplication (Cannon’s algorithm)  K-Means clustering  Concurrent wave solver
  • 21. Computation and Communication complexities of the different MPI applications used
  • 22. Eucalyptus and Xen based cloud infrastructure  16 nodes with 2 Quad Core Intel Xeon processors and 32 GB of memory  Nodes connected with 1 gigabit Ethernet connection  Same s/w configuration for both bare-metal nodes and VMs • OS - Red Hat Enterprise Linux Server release 5.2 • OpenMP version 1.3.2
  • 23. Different CPU core/virtual machines assignment strategies Invariant to select the number of MPI processes Number of MPI processes = Number of CPU cores used
  • 24. Performance – 64 CPU Cores Speedup – Fixed Matrix size (5184*5184) ◦ Speedup decrease 34% between Bare metal and 8-VM/node at 81 processes ◦ Exchange of large messages and more communication
  • 25. Performance – 128 CPU Cores Total overhead (Number of MPI Processes =128) ◦ Communication is very less than computations ◦ Communication here depends upon number of clusters formed ◦ Overhead is large for small data sizes, so less speedup is observed
  • 26. Total Overhead (Number of MPI Performance – 128 CPU Cores Processes = 128) ◦ Amount of communications is fixed, less data transfer rates ◦ Lower c/c ratio of O(1/n) leads to more latency and lower performance on VMs ◦ 8-VMs per node has 7% more overhead than bare metal node
  • 27. Communication between dom0 and domUs when 1-VM per node is deployed (top). Communication between dom0 and domUs when 8-VMs per node are deployed (bottom) ◦ In multi VMs configuration scheduling of I/O operation of DomUs (user domains) happens via Dom0 (privileged OS)
  • 28. Figure: LAM vs. OpenMPI in different VM configurations  When using mutliple VMs on multi-core CPUs, it is good to use runtimes supporting in-node communications (OpenMP vs LAM-MPI)
  • 29. Cloud runtimes work well for pleasing parallel (map only and mapreduce) applications with large datasets  Overheads of cloud runtimes are high with parallel applications that require iterative/complex communication patterns (MPI based applications)  Work needs to be done on finding algorithms for these applications that are cloud friendly  CGL-MapReduce is efficient for iterative style mapreduce applications (k-means)
  • 30. Overheads for MPI applications increase as number of VMs/node increase (22-50% degradation)  In-node communication in important  MapReduce applications (not susceptible to latencies) may perform well on VMs deployed on clouds  Integration of MapReduce and MPI (biological DNA sequencing application)
  • 31. No results of implementation of pleasing parallel applications (Cap3, HEP) with MPI, missing MPI and cloud runtimes time comparisons  Missing evaluations of HPC applications implemented with cloud runtimes on private cloud, which is critical to show the effect of multi VMs/multi-core configurations on performances of these applications  Difference in memory sizes (16/32 GB) for clusters of different OS. This could lead to biased results
  • 32. Ekanayake Jaliya and Fox Geoffrey, High Performance Parallel Computing with Clouds and Cloud Technologies, Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering (2010), Pages 20, Volume 34  High Performance Parallel Computing with Clouds and Cloud Technologies. http://www.slideshare.net/jaliyae/high-performance- parallel-computing-with-clouds-and-cloud-technologies  Map Reduce, Wikipedia: http://en.wikipedia.org/wiki/MapReduce