For our experiments, we use the Stampede computer cluster at the Texas Advanced Computing Center.
Stampede consists of 6,400 powerful machines providing 522,080 processing cores. For our
tests, we use up to 256 of those machines, each containing dual 8-core Intel Xeon E5-2680
processors and 32 gigabytes of memory. At the time of this research, the Stampede computer
cluster is the 10th fastest computer in the world [5].
Results
The following graphs show the obtained results for the proposed solution.
The Strong Scaling graph shows how the processing time varies for a fixed-size input when dividing
the work among up to 16 parallel threads on a single CPU. The Weak Scaling graph, on the other hand,
keeps the data per thread fixed, so the total input grows with the number of threads.
The presented OpenMP results show that the multithreaded implementation is able to take advantage
of the multi-core architecture up to the point where the number of running threads matches the
number of available processing cores.
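The two scaling measures can be made concrete with a short sketch. It assumes the usual textbook definitions of efficiency, which the poster does not state explicitly, and the timings are hypothetical values chosen only to illustrate saturation on an 8-core CPU; they are not measurements from this work.

```python
def strong_scaling_efficiency(t1, tp, p):
    """Fixed total input: the ideal time on p workers is t1 / p."""
    return t1 / (p * tp)


def weak_scaling_efficiency(t1, tp):
    """Fixed input per worker: the ideal time stays equal to t1."""
    return t1 / tp


# Hypothetical strong-scaling timings in seconds (NOT measured data).
# Time stops improving once the thread count exceeds 8 cores.
strong_times = {1: 16.0, 2: 8.0, 4: 4.0, 8: 2.0, 16: 2.0}

for threads, seconds in strong_times.items():
    eff = strong_scaling_efficiency(strong_times[1], seconds, threads)
    print(f"{threads:2d} threads: efficiency {eff:.2f}")
```

An efficiency close to 1 means the extra threads are fully used; a drop below 1 at 16 threads in this made-up data mirrors threads competing for the same cores.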
On top of the multithreaded OpenMP implementation, MPI is used to further improve the scalability of
our solution by introducing distributed computing capabilities. The same Strong Scaling and Weak
Scaling measurements were taken, and the graphs below show our results for running the proposed
solution on up to 512 CPUs, totaling 4,096 parallel working threads.
SCALABLE AND DISTRIBUTED APPROACH FOR RSA DECRYPTION ALGORITHM
Alysson Almeida1, Tiago de Almeida1
Prof. Dr. Christopher Stone1
1. Department of Computer Science,
Loyola University of Chicago, Chicago, IL
Introduction
The RSA algorithm was first described in 1977 by R.L. Rivest, A. Shamir, and L. Adleman as an
implementation of the concept of “public-key cryptosystems”. This concept was introduced by Diffie
and Hellman; however, their work did not develop a practical implementation [1][2]. RSA is the most
popular and widely deployed public-key cryptosystem and is used for both encryption and digital
signatures. The algorithm is based on modular arithmetic, and due to the magnitude of the numbers
used, the decryption process is computationally intensive. In this research, we discuss the design,
implementation and results of a scalable, multithreaded and distributed RSA decryption algorithm,
which uses parallel and high-performance computing techniques to drastically reduce the
processing time. OpenMP is used to distribute the work among multiple parallel threads running on the
same CPU, while MPI is used to create a distributed computing scenario in which hundreds of
computers work in parallel to decrypt data. The presented algorithm scales almost
ideally, maintaining nearly constant efficiency and benefiting greatly from
the parallel implementation. The experiments were performed on the Stampede computer cluster,
located at the Texas Advanced Computing Center.
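As a minimal illustration of the modular arithmetic involved, the sketch below implements textbook RSA on a toy key. The primes, exponent, and function names are assumptions chosen for readability; real keys are thousands of bits long and use padding, and this is not the implementation used in this work. It shows that each ciphertext block is decrypted independently of the others, which is the property a block-parallel approach relies on.

```python
def make_toy_key(p, q, e):
    """Build a textbook RSA key from two small primes (illustration only)."""
    n = p * q
    phi = (p - 1) * (q - 1)
    d = pow(e, -1, phi)  # modular inverse of e; requires Python 3.8+
    return n, e, d


def encrypt_block(m, e, n):
    """Encrypt one integer block m < n."""
    return pow(m, e, n)


def decrypt_block(c, d, n):
    """Decrypt one integer block by modular exponentiation."""
    return pow(c, d, n)


# Toy parameters, far too small to be secure.
n, e, d = make_toy_key(61, 53, 17)

message = [72, 105, 33]  # each value acts as one plaintext block
cipher = [encrypt_block(m, e, n) for m in message]
plain = [decrypt_block(c, d, n) for c in cipher]
print(plain == message)
```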
Keywords — Multithreading, Distributed Computing, OpenMP, MPI, RSA
Research Objectives
The aim of the proposed research is to develop software implementing a parallel solution for
the RSA decryption algorithm. Multithreaded and distributed computing methods are used to reach
this objective. This effort includes the development of a hybrid OpenMP/MPI program to
maximize the use of computational resources and, consequently, decrease the time needed to decrypt large
ciphertexts.
Methods
The proposed method works as follows. The input ciphertext is divided into N blocks, which are
distributed to N processes using the Message Passing Interface (MPI). Each process runs on an
independent CPU which, in turn, provides 8 cores for running parallel threads. To take full
advantage of that scenario, we divide each block again into 8 smaller blocks and distribute them to 8
threads that run in parallel on a single CPU. OpenMP is used to create and manage the threads. As a
result, the input ciphertext is divided into N*8 blocks, which are decrypted by N processes
running a total of N*8 parallel threads. At the end, all blocks are assembled to create the final
decrypted data. The figure below shows the proposed parallel and distributed approach in contrast
with the original serial algorithm.
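The two-level decomposition described above can be sketched as follows. This is only a single-machine stand-in under stated assumptions: a plain loop takes the place of the N MPI processes, a Python thread pool takes the place of the 8 OpenMP threads (Python threads do not give true CPU parallelism), and the toy key and helper names are hypothetical. What it shows is the splitting into N*8 sub-blocks and the in-order reassembly.

```python
from concurrent.futures import ThreadPoolExecutor

THREADS_PER_PROCESS = 8  # matches the 8 cores per CPU described above


def split(items, parts):
    """Split items into `parts` contiguous chunks whose sizes differ by at most one."""
    base, extra = divmod(len(items), parts)
    chunks, start = [], 0
    for i in range(parts):
        size = base + (1 if i < extra else 0)
        chunks.append(items[start:start + size])
        start += size
    return chunks


def decrypt_chunk(chunk, d, n):
    """Decrypt every integer block in one sub-block."""
    return [pow(c, d, n) for c in chunk]


def process_work(block, d, n):
    """Stand-in for one MPI process: split its block among 8 threads, then rejoin."""
    sub_blocks = split(block, THREADS_PER_PROCESS)
    with ThreadPoolExecutor(max_workers=THREADS_PER_PROCESS) as pool:
        # map() returns results in submission order, so order is preserved.
        results = list(pool.map(lambda sb: decrypt_chunk(sb, d, n), sub_blocks))
    return [m for part in results for m in part]


def decrypt_all(ciphertext, num_processes, d, n):
    """Split the ciphertext into N blocks and reassemble the decrypted output."""
    blocks = split(ciphertext, num_processes)
    return [m for block in blocks for m in process_work(block, d, n)]


# Toy key (assumption, not secure): n = 61 * 53, e = 17, d = 2753.
n, e, d = 3233, 17, 2753
plaintext = list(range(100))
ciphertext = [pow(m, e, n) for m in plaintext]
print(decrypt_all(ciphertext, 4, d, n) == plaintext)
```

Because every chunk is contiguous and results are concatenated in order, the final assembly step reduces to a simple concatenation.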
The Efficiency graphs for the OpenMP implementation confirm the previously observed behavior,
where the parallel implementation scales up to the hardware limit. The MPI Efficiency graph
shows that using more than 256 CPUs to process a 1 MB input sample brings diminishing returns,
due to the overhead of distributed computing.
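One way to see why a small input stops benefiting from more processes is a toy cost model. The model below is an assumption made for illustration (compute time shrinks as 1/p while coordination overhead grows linearly with p), and its constants are hypothetical; it is not fitted to the measurements of this work.

```python
def modeled_time(work_seconds, p, overhead_per_process):
    """Toy model: perfectly divisible work plus a per-process coordination cost."""
    return work_seconds / p + overhead_per_process * p


def modeled_efficiency(work_seconds, p, overhead_per_process):
    """Strong-scaling efficiency under the toy model."""
    t1 = modeled_time(work_seconds, 1, overhead_per_process)
    return t1 / (p * modeled_time(work_seconds, p, overhead_per_process))


OVERHEAD = 0.001  # hypothetical seconds of overhead per process

for work in (100.0, 10000.0):  # hypothetical serial compute times in seconds
    for p in (64, 256, 512):
        eff = modeled_efficiency(work, p, OVERHEAD)
        print(f"work={work:8.0f}s  p={p:3d}  efficiency={eff:.2f}")
```

In this model the smaller workload loses efficiency quickly as p grows, while the larger one stays close to ideal, which is the qualitative pattern described above.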
Conclusions
The proposed multithreaded and distributed implementation of the RSA
decryption algorithm scales almost ideally. The initial parallel approach, using only
OpenMP, takes full advantage of the multiple cores available in the CPU, saturating only when
threads need to compete for processing time. The second approach fixes the number of threads at 8
and adds MPI distributed computing capabilities on top. This algorithm also shows near-ideal
scalability for all numbers of processors we tested, and only showed a decrease in performance
when the input data was too small. In that case, the overhead of managing the distributed work
across hundreds of processors becomes apparent, and increasing the number of parallel
processes above 256 brings diminishing returns. For large enough input data, however, the proposed
solution showed no scalability limit in our tests.
Acknowledgment
The authors would like to thank the research funding agency CAPES Foundation, Ministry of
Education of Brazil for the scholarships granted to the post-graduate students participating in the
study. They would also like to thank Rajorshi Biswas, Shibdas Bandyopadhyay, and Anirban Banerjee for
distributing their code under the GNU General Public License. Funding to attend and present in
March 2016 at the BRASCON conference (Cambridge, MA) was provided by Loyola University
Chicago Graduate School.
References
[1] R. Rivest, A. Shamir, and L. Adleman, 'A method for obtaining digital signatures and public-key
cryptosystems', Communications of the ACM, vol. 21, no. 2, pp. 120-126, 1978.
[2] W. Diffie and M. Hellman, 'New directions in cryptography', IEEE Transactions on Information
Theory, vol. IT-22, no. 6, pp. 644-654, Nov. 1976.
[3] R. Biswas, S. Bandyopadhyay, and A. Banerjee, 'A Fast Implementation of the RSA Algorithm
Using the GNU MP Library'.
[4] A. Abusharekh and K. Gaj, 'Comparative Analysis of Software Libraries for Public Key Cryptography'.
http://www.hyperelliptic.org/SPEED/slides/Abusharekh_Gaj_SPEED.pdf
[5] Top 500 supercomputing site.
http://www.top500.org/lists/2015/11/ (accessed February 8, 2016).
[Figure: OpenMP, Weak Scaling Time. Time (s) vs. threads (1 to 16). Series: 100 KB, 1 MB.]
[Figure: OpenMP + MPI, Strong Scaling Time. Time (s) vs. processes (1 to 512). Series: 1 MB, 10 MB, 100 MB.]
[Figure: OpenMP + MPI, Weak Scaling Efficiency. Efficiency vs. processes (4 to 256). Series: 100 KB, 1 MB.]
[Figure: OpenMP + MPI, Weak Scaling Time. Time (s) vs. processes (1 to 256). Series: 100 KB, 1 MB.]
[Figure: OpenMP, Strong Scaling Efficiency. Efficiency vs. threads (2 to 16). Series: 100 KB, 1 MB, 10 MB.]
[Figure: OpenMP, Weak Scaling Efficiency. Efficiency vs. threads (2 to 16). Series: 100 KB, 1 MB.]
[Figure: OpenMP, Strong Scaling Time. Time (s) vs. threads (1 to 16). Series: 100 KB, 1 MB, 10 MB.]
[Figure: OpenMP + MPI, Strong Scaling Efficiency. Efficiency vs. processes (2 to 512). Series: 1 MB, 10 MB, 100 MB.]

 
Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media Component
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptx
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptx
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of management
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 

Harvard poster

For our experiments, we use the Stampede computer cluster at the Texas Advanced Computing Center. Stampede consists of 6,400 compute nodes providing 522,080 processing cores. For our tests, we use up to 256 of those nodes, each containing dual 8-core Intel Xeon E5-2680 processors and 32 gigabytes of memory. At the time of this research, Stampede is the 10th fastest computer in the world [5].

Results
The following graphs show the results obtained for the proposed solution.
The Strong Scaling graph shows how the processing time varies for a fixed-size input when the work is divided among up to 16 parallel threads in a single CPU. The Weak Scaling graph, on the other hand, keeps the amount of data per thread fixed, so the total input grows with the number of threads.
The OpenMP results show that the multithreaded implementation takes advantage of the multi-core architecture up to the point where the number of running threads matches the number of available processing cores.
On top of the multithreaded OpenMP implementation, MPI is used to further improve the scalability of our solution by introducing distributed computing capabilities. The same Strong Scaling and Weak Scaling measurements were taken, and the graphs below show our results for running the proposed solution on up to 512 CPUs, totaling 4096 parallel working threads.

SCALABLE AND DISTRIBUTED APPROACH FOR RSA DECRYPTION ALGORITHM
Alysson Almeida¹, Tiago de Almeida¹, Prof. Dr. Christopher Stone¹
¹ Department of Computer Science, Loyola University of Chicago, Chicago, IL

Introduction
The RSA algorithm was first described in 1977 by R.L. Rivest, A. Shamir, and L. Adleman as an implementation of the concept of “public-key cryptosystems”. This concept was introduced by Diffie and Hellman; however, their work did not include a practical implementation [1][2].
RSA is the most popular and widely deployed public-key cryptosystem and is used for both encryption and digital signatures. Its algorithm is based on modular arithmetic, and due to the magnitude of the numbers used, the decryption process is computationally intensive.
In this research, we discuss the design, implementation, and results of a scalable, multithreaded, and distributed RSA decryption algorithm, which uses parallel and high-performance computing techniques to drastically reduce the processing time. OpenMP is used to distribute the work to multiple parallel threads running on the same CPU, while MPI is used to create a distributed computing scenario in which hundreds of computers work in parallel to decrypt data. The presented algorithm scales almost ideally, maintaining nearly constant efficiency and benefiting greatly from the parallel implementation. The experiments were performed on the Stampede computer cluster, located at the Texas Advanced Computing Center.
Keywords: Multithreading, Distributed Computing, OpenMP, MPI, RSA

Research Objectives
The aim of the proposed research is to develop software that implements a parallel solution for the RSA decryption algorithm. Multithreaded and distributed computing methods are used to reach this objective. This effort includes the development of a hybrid OpenMP/MPI program to maximize the use of computational resources and, consequently, decrease the time needed to decrypt large ciphertexts.

Methods
The proposed method works as follows. The input ciphertext is divided into N blocks, which are distributed to N processes using the Message Passing Interface (MPI). Each process runs on an independent CPU which, in turn, provides 8 cores for running parallel threads. To take full advantage of that scenario, we divide each block again into 8 smaller blocks and distribute them to 8 threads that run in parallel on a single CPU.
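The two-level split described in the Methods can be sketched in a few lines of Python. This is only an illustration of the decomposition, not the authors' program, which is a hybrid OpenMP/MPI implementation. Every name here (`split`, `decrypt_chunk`, `decrypt_on_process`, `hybrid_decrypt`) is hypothetical, a plain loop stands in for the MPI processes, a thread pool stands in for the OpenMP threads, and the key is a tiny textbook key rather than a realistic one. Python threads do not give a real speedup for this workload, so the sketch shows only how blocks are divided and reassembled in order.

```python
from concurrent.futures import ThreadPoolExecutor

THREADS_PER_PROCESS = 8  # the poster fixes 8 threads per CPU


def split(items, parts):
    """Split a list into `parts` contiguous chunks whose sizes differ by at most one."""
    base, extra = divmod(len(items), parts)
    chunks, start = [], 0
    for i in range(parts):
        end = start + base + (1 if i < extra else 0)
        chunks.append(items[start:end])
        start = end
    return chunks


def decrypt_chunk(chunk, d, n):
    """Textbook RSA decryption of each ciphertext integer: m = c^d mod n."""
    return [pow(c, d, n) for c in chunk]


def decrypt_on_process(block, d, n):
    """Stand-in for one MPI process: split its block across 8 threads."""
    sub_blocks = split(block, THREADS_PER_PROCESS)
    with ThreadPoolExecutor(max_workers=THREADS_PER_PROCESS) as pool:
        # pool.map returns results in submission order, so the plaintext stays ordered
        parts = list(pool.map(lambda sb: decrypt_chunk(sb, d, n), sub_blocks))
    return [m for part in parts for m in part]


def hybrid_decrypt(ciphertext, d, n, num_processes):
    """Stand-in for the MPI level: a sequential loop over N blocks (assumption)."""
    plain = []
    for block in split(ciphertext, num_processes):
        plain.extend(decrypt_on_process(block, d, n))
    return plain


# Toy key (assumption, for illustration only): p = 61, q = 53, n = 3233, e = 17, d = 2753
N_MOD, E, D = 3233, 17, 2753
message = list(b"parallel RSA decryption")
ciphertext = [pow(m, E, N_MOD) for m in message]
recovered = hybrid_decrypt(ciphertext, D, N_MOD, num_processes=4)
```

With `num_processes=4`, the ciphertext is cut into 4 × 8 = 32 sub-blocks, matching the N*8 decomposition, and concatenating the results in block order restores the original message.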
OpenMP is used to create and manage the threads. As a result, the input ciphertext is divided into N*8 blocks, which are decrypted by N processes running a total of N*8 parallel threads. At the end, all blocks are assembled to create the final decrypted data. The figure below shows the proposed parallel and distributed approach in contrast with the original serial algorithm.

The Efficiency graphs for the OpenMP implementation confirm the previously observed behavior, where the parallel implementation scales up to the hardware limit. The MPI Efficiency graph shows that using more than 256 CPUs to process a 1 MB input sample brings diminishing returns, due to the overhead of distributed computing.

Conclusions
The proposed multithreaded and distributed implementation of the RSA decryption algorithm scales almost ideally. The initial parallel approach, using only OpenMP, takes full advantage of the multiple cores available in the CPU and only saturates when threads need to compete for processing time. The second approach fixes the number of threads at 8 and adds MPI distributed computing capabilities on top. This algorithm also shows near-ideal scalability for all numbers of processors we tested, and only showed a decrease in performance when the input data was too small. In that case, the overhead of managing the distributed work across hundreds of processors becomes apparent, and increasing the number of parallel processes above 256 brings diminishing returns. For large enough input data, however, the proposed solution did not show any scalability limit.

Acknowledgment
The authors would like to thank the research funding agency CAPES Foundation, Ministry of Education of Brazil, for the scholarships granted to the post-graduate students participating in the study. They would also like to thank Rajorshi Biswas, Shibdas Bandyopadhyay, and Anirban Banerjee for distributing their code under the GNU General Public License.
Funding to attend and present in March 2016 at the BRASCON conference (Cambridge, MA) was provided by Loyola University Chicago Graduate School.

References
[1] R. Rivest, A. Shamir, and L. Adleman, ‘A method for obtaining digital signatures and public-key cryptosystems’, Communications of the ACM, vol. 21, no. 2, pp. 120-126, 1978.
[2] W. Diffie and M. Hellman, ‘New directions in cryptography’, IEEE Trans. Inform. Theory, vol. IT-22, pp. 644-654, Nov. 1976.
[3] R. Biswas, S. Bandyopadhyay, and A. Banerjee, ‘A Fast Implementation Of The RSA Algorithm Using The GNU MP Library’.
[4] A. Abusharekh and K. Gaj, ‘Comparative Analysis of Software Libraries for Public Key Cryptography’. http://www.hyperelliptic.org/SPEED/slides/Abusharekh_Gaj_SPEED.pdf
[5] Top 500 supercomputing site. http://www.top500.org/lists/2015/11/ (accessed February 8, 2016).

Figures (plots not reproduced in this transcript):
OpenMP Strong Scaling Time: time (s) versus threads (1 to 16) for 100 KB, 1 MB, and 10 MB inputs.
OpenMP Strong Scaling Efficiency: efficiency versus threads (2 to 16) for 100 KB, 1 MB, and 10 MB inputs.
OpenMP Weak Scaling Time: time (s) versus threads (1 to 16) for 100 KB and 1 MB inputs.
OpenMP Weak Scaling Efficiency: efficiency versus threads (2 to 16) for 100 KB and 1 MB inputs.
OpenMP + MPI Strong Scaling Time: time (s) versus processes (1 to 512) for 1 MB, 10 MB, and 100 MB inputs.
OpenMP + MPI Strong Scaling Efficiency: efficiency versus processes (2 to 512) for 1 MB, 10 MB, and 100 MB inputs.
OpenMP + MPI Weak Scaling Time: time (s) versus processes (1 to 256) for 100 KB and 1 MB inputs.
OpenMP + MPI Weak Scaling Efficiency: efficiency versus processes (4 to 256) for 100 KB and 1 MB inputs.
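The poster does not state how the efficiency values in the scaling graphs were computed. The sketch below uses the conventional definitions, which is an assumption: strong-scaling efficiency is the single-worker time divided by the number of workers times the parallel time, and weak-scaling efficiency is the single-worker time divided by the parallel time when the per-worker input is held fixed. The function names and the timings are hypothetical and are not measurements from this study.

```python
def strong_scaling_efficiency(t1, tp, p):
    """Fixed total input: E = T(1) / (p * T(p)). A value of 1.0 means ideal scaling."""
    return t1 / (p * tp)


def weak_scaling_efficiency(t1, tp):
    """Fixed input per worker: E = T(1) / T(p). A value of 1.0 means ideal scaling."""
    return t1 / tp


# Hypothetical timings in seconds, chosen only to illustrate the formulas
strong = strong_scaling_efficiency(t1=8.0, tp=1.25, p=8)  # 8.0 / (8 * 1.25)
weak = weak_scaling_efficiency(t1=2.0, tp=2.5)            # 2.0 / 2.5
```

Under these definitions, an efficiency curve that stays near 1.0 as workers are added corresponds to the "almost ideal" scaling described in the Conclusions, while a curve that drops away signals overhead such as the distributed-computing cost noted for small inputs.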