Kepler vs Xeon Phi: our measurements
and their complete source code
http://www.hpcmagazine.fr/en-couverture/kepler-vs-xeon-phi-nos-mesures/
Florent Duguet, PhD
CEO - Altimesh
http://www.altimesh.com/
... article in French
Presentation & translation by
Ronan Keryell (SILKAN / Aptina)
2 different architectures
Some functional analogies...
● Vendor data
● Flops/memop: minimum ratio needed to avoid waiting for memory
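To make the flops/memop figure concrete, here is a minimal sketch of how such a ratio can be derived from a peak arithmetic rate and a memory bandwidth. The function name and its parameters are illustrative assumptions, and no vendor figure is hard-coded: the caller supplies the numbers.

```c
#include <assert.h>

/* Minimum flops-per-memory-operation ratio needed to keep the compute
 * units busy: peak arithmetic rate divided by the rate at which operands
 * can be streamed from memory. Hypothetical helper, not from the article. */
double flops_per_memop(double peak_gflops, double bandwidth_gb_s,
                       double bytes_per_operand)
{
    assert(peak_gflops > 0.0 && bandwidth_gb_s > 0.0 && bytes_per_operand > 0.0);
    /* GB/s divided by bytes per operand gives operands per second (in 1e9). */
    double gmemops_per_s = bandwidth_gb_s / bytes_per_operand;
    return peak_gflops / gmemops_per_s;
}
```

For example, with placeholder inputs of 1000 GFLOP/s, 200 GB/s and 4-byte operands (made-up values, not measurements), the function returns 20: a kernel would need about 20 floating-point operations per loaded operand to avoid stalling on memory.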
3 microbenchmarks
From theory to practice...
● 1 memory-bound: read a vector
– K20: Naïve/vectorized with float4/use texture cache
– Phi: Naïve/vectorized/gather/aligned vector load
● 1 compute-bound: Horner approximation of expm1(), iterated
12 times (= 12 add, 24 mul, 60 madd)
– K20: Naïve/vectorized with float4 or double4
– Phi: Naïve/intrinsics
● 1 latency-bound: b[i] += a[i + index[k]]
– K20: Naïve/loop interchange/__ldg to load through the read-only data cache
– Phi: Naïve/vectorized/gather/aligned vector load
Memory bound
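As a rough illustration of the "read a vector" benchmark in its naïve form, the following plain C sketch streams once through an array. It is an assumption about the general shape of such a kernel, not the article's code; the function name is hypothetical, and the vectorized, texture-cache and gather variants listed earlier are not shown.

```c
#include <assert.h>
#include <stddef.h>

/* Naive memory-bound kernel: one pass over the vector, one load per element.
 * The running sum exists only so the loads cannot be optimized away. */
float read_vector(const float *a, size_t n)
{
    assert(a != NULL || n == 0);
    float sum = 0.0f;
    for (size_t i = 0; i < n; ++i)
        sum += a[i];
    return sum;
}
```

With one addition per 4-byte load, the arithmetic is negligible and the run time is governed almost entirely by how fast memory can deliver the data.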
Compute bound
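The compute-bound benchmark repeatedly applies a Horner-form polynomial approximation of expm1(). The sketch below shows the idea in plain C. The degree-5 Taylor coefficients are an illustrative assumption, so the per-iteration operation count differs from the mix quoted on the slide, and the function names are hypothetical.

```c
#include <assert.h>

/* Taylor coefficients of expm1(x)/x = 1 + x/2 + x^2/6 + x^3/24 + x^4/120,
 * highest degree first. Illustrative choice, not the article's polynomial. */
static const float COEF[5] = {1.0f / 120.0f, 1.0f / 24.0f, 1.0f / 6.0f, 0.5f, 1.0f};

/* One Horner evaluation: x * (1 + x*(1/2 + x*(1/6 + x*(1/24 + x/120)))). */
float expm1_horner(float x)
{
    float p = COEF[0];
    for (int k = 1; k < 5; ++k)
        p = p * x + COEF[k];  /* multiply-add; a compiler may fuse it into one FMA */
    return p * x;
}

/* Compute-bound kernel: feed the result back in `iters` times, so the work
 * is all arithmetic on a register value with no memory traffic. */
float expm1_iterated(float x, int iters)
{
    assert(iters >= 0);
    for (int i = 0; i < iters; ++i)
        x = expm1_horner(x);
    return x;
}
```

Calling expm1_iterated(x, 12) mirrors the 12-fold iteration mentioned for the benchmark: each call does a fixed amount of arithmetic per value and touches memory only to read the input and write the result.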
Latency bound
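The latency-bound benchmark is built around the update b[i] += a[i + index[k]]. The plain C sketch below writes that update with two possible loop orders, to show what a loop interchange changes. Which order the article treats as naïve is not stated on the slides, so both function names and the choice of "first" order are assumptions.

```c
#include <assert.h>
#include <stddef.h>

/* Order 1: for each output element, walk all offsets.
 * a must hold at least n + max(index[k]) elements. */
void gather_naive(float *b, const float *a, const int *index,
                  size_t n, size_t m)
{
    for (size_t i = 0; i < n; ++i)
        for (size_t k = 0; k < m; ++k) {
            assert(index[k] >= 0);
            b[i] += a[i + (size_t)index[k]];
        }
}

/* Order 2 (loops interchanged): for a fixed offset, i sweeps a and b
 * contiguously, which is friendlier to caches and vector units. */
void gather_interchanged(float *b, const float *a, const int *index,
                         size_t n, size_t m)
{
    for (size_t k = 0; k < m; ++k) {
        assert(index[k] >= 0);
        for (size_t i = 0; i < n; ++i)
            b[i] += a[i + (size_t)index[k]];
    }
}
```

Both versions add the same terms to each b[i] in the same order, so they produce identical results; only the memory access pattern differs.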
Conclusion
● Values in parentheses (...) are vendor data
● Warning: in this experiment an FMA counts as
1 FLOP instead of the usual (and the vendors'!)
2 FLOP
● Disclaimer : examples available :-) on
http://www.hpcmagazine.fr/files/sources/003-Kepler-vs-Phi.zip
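The effect of the FMA counting convention mentioned in the warning above can be checked with a little arithmetic. The helper below is a hypothetical sketch (its name and signature are not from the article) that totals FLOP for a given instruction mix under either convention.

```c
#include <assert.h>

/* Total FLOP for an instruction mix. fma_weight is 1 under the convention
 * used in this experiment and 2 under the usual (vendor) convention. */
long count_flop(long adds, long muls, long fmas, int fma_weight)
{
    assert(fma_weight == 1 || fma_weight == 2);
    return adds + muls + fmas * (long)fma_weight;
}
```

With the mix quoted for the compute-bound benchmark (12 add, 24 mul, 60 madd), count_flop(12, 24, 60, 1) gives 96 and count_flop(12, 24, 60, 2) gives 156, so figures measured with the one-FLOP convention are not directly comparable with vendor peak numbers.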

 
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMSA SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
 
132/33KV substation case study Presentation
132/33KV substation case study Presentation132/33KV substation case study Presentation
132/33KV substation case study Presentation
 
DfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributionsDfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributions
 
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptxML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
 
Understanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine LearningUnderstanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine Learning
 
Low power architecture of logic gates using adiabatic techniques
Low power architecture of logic gates using adiabatic techniquesLow power architecture of logic gates using adiabatic techniques
Low power architecture of logic gates using adiabatic techniques
 
Modelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdfModelagem de um CSTR com reação endotermica.pdf
Modelagem de um CSTR com reação endotermica.pdf
 
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
 
Technical Drawings introduction to drawing of prisms
Technical Drawings introduction to drawing of prismsTechnical Drawings introduction to drawing of prisms
Technical Drawings introduction to drawing of prisms
 
Literature Review Basics and Understanding Reference Management.pptx
Literature Review Basics and Understanding Reference Management.pptxLiterature Review Basics and Understanding Reference Management.pptx
Literature Review Basics and Understanding Reference Management.pptx
 
Series of visio cisco devices Cisco_Icons.ppt
Series of visio cisco devices Cisco_Icons.pptSeries of visio cisco devices Cisco_Icons.ppt
Series of visio cisco devices Cisco_Icons.ppt
 
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming PipelinesHarnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
 
sieving analysis and results interpretation
sieving analysis and results interpretationsieving analysis and results interpretation
sieving analysis and results interpretation
 
Heat Resistant Concrete Presentation ppt
Heat Resistant Concrete Presentation pptHeat Resistant Concrete Presentation ppt
Heat Resistant Concrete Presentation ppt
 
6th International Conference on Machine Learning & Applications (CMLA 2024)
6th International Conference on Machine Learning & Applications (CMLA 2024)6th International Conference on Machine Learning & Applications (CMLA 2024)
6th International Conference on Machine Learning & Applications (CMLA 2024)
 

Kepler vs Xeon Phi

  • 1. Kepler vs Xeon Phi: our measures and their complete source code
    http://www.hpcmagazine.fr/en-couverture/kepler-vs-xeon-phi-nos-mesures/
    Florent Duguet, PhD
    CEO - Altimesh
    http://www.altimesh.com/
    ... article in French
    Presentation & translation by Ronan Keryell (SILKAN / Aptina)
  • 3. Some functional analogies...
    ● Vendor data
    ● Flops/memop: minimal ratio to avoid waiting for memory
  • 4. 3 microbenchmarks
    From theory to practice...
    ● 1 memory bound: read a vector
      – K20: naïve / vectorized with float4 / use texture cache
      – Phi: naïve / vectorized / gather / aligned vector load
    ● 1 compute bound: Horner approximation of expm1(), iterated: (expm1())^12 (= 12 add, 24 mul, 60 madd)
      – K20: naïve / vectorized with float4 or double4
      – Phi: naïve / intrinsics
    ● 1 latency bound: b[i] += a[i + index[k]]
      – K20: naïve / loop interchange / __ldg to skip L2$
      – Phi: naïve / vectorized / gather / aligned vector load
  • 11. Conclusion
    ● (...) = (vendor data)
    ● Warning: in this experiment an FMA counts as 1 FLOP instead of the usual (... and the vendors'!) 2 FLOP
    ● Disclaimer: examples available :-) on http://www.hpcmagazine.fr/files/sources/003-Kepler-vs-Phi.zip
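
The transcript names the benchmarks without showing them, so here is a minimal CPU-side sketch in plain C of what slides 3, 4 and 11 describe. It is not the author's code (that is in the zip linked above). Every function and parameter name is hypothetical. The expm1 polynomial is an assumption: Taylor coefficients chosen only so that one call costs 5 multiply-adds, 2 multiplies and 1 add, which reproduces the 12 add / 24 mul / 60 madd mix when chained 12 times. Reading "(expm1())^12" as 12 chained calls is also an assumption.

```c
#include <assert.h>
#include <stddef.h>

/* Slide 3: minimal FLOP per memory operation needed to avoid waiting for
   memory. All three inputs are supplied by the caller, not by the slides. */
double min_flops_per_memop(double peak_flops, double bytes_per_s,
                           double bytes_per_memop) {
    double memops_per_s = bytes_per_s / bytes_per_memop;
    return peak_flops / memops_per_s;
}

/* Memory bound, naive: one load per add. */
float read_naive(const float *v, size_t n) {
    float sum = 0.0f;
    for (size_t i = 0; i < n; ++i)
        sum += v[i];
    return sum;
}

/* Memory bound, four elements per step: a scalar stand-in for float4 loads. */
float read_by4(const float *v, size_t n) {
    float s0 = 0.0f, s1 = 0.0f, s2 = 0.0f, s3 = 0.0f;
    size_t i = 0;
    for (; i + 4 <= n; i += 4) {
        s0 += v[i];
        s1 += v[i + 1];
        s2 += v[i + 2];
        s3 += v[i + 3];
    }
    for (; i < n; ++i) /* leftover tail when n is not a multiple of 4 */
        s0 += v[i];
    return (s0 + s1) + (s2 + s3);
}

/* Compute bound: expm1(x) ~ x + x^2 * Q(x), Q evaluated by Horner's rule.
   Assumed polynomial; whether each q * x + c becomes a hardware FMA is up
   to the compiler. */
float expm1_horner(float x) {
    float q = 1.0f / 5040.0f;
    q = q * x + 1.0f / 720.0f; /* multiply-add 1 */
    q = q * x + 1.0f / 120.0f; /* multiply-add 2 */
    q = q * x + 1.0f / 24.0f;  /* multiply-add 3 */
    q = q * x + 1.0f / 6.0f;   /* multiply-add 4 */
    q = q * x + 0.5f;          /* multiply-add 5 */
    float x2 = x * x;          /* multiply 1 */
    float t = x2 * q;          /* multiply 2 */
    return x + t;              /* add */
}

/* Twelve chained calls: 12 add, 24 mul, 60 madd in total. */
float expm1_iterated12(float x) {
    for (int it = 0; it < 12; ++it)
        x = expm1_horner(x);
    return x;
}

/* Latency bound, naive loop order: b[i] += a[i + index[k]]. */
void gather_naive(float *b, const float *a, size_t a_len,
                  const size_t *index, size_t n, size_t n_index) {
    for (size_t i = 0; i < n; ++i)
        for (size_t k = 0; k < n_index; ++k) {
            assert(i + index[k] < a_len); /* caller must size a[] accordingly */
            b[i] += a[i + index[k]];
        }
}

/* Same update with the loops interchanged: each index[k] is read once, and
   the inner loop walks a[] and b[] with unit stride. */
void gather_interchanged(float *b, const float *a, size_t a_len,
                         const size_t *index, size_t n, size_t n_index) {
    for (size_t k = 0; k < n_index; ++k) {
        const size_t off = index[k];
        assert(n == 0 || (n - 1) + off < a_len);
        for (size_t i = 0; i < n; ++i)
            b[i] += a[i + off];
    }
}

/* Slide 11: FLOP total for an operation mix under a given FMA convention. */
long flop_count(long add, long mul, long madd, int flops_per_fma) {
    return add + mul + madd * (long)flops_per_fma;
}
```

With the slide's convention, `flop_count(12, 24, 60, 1)` gives 96 FLOP per iterated evaluation; with the usual convention, `flop_count(12, 24, 60, 2)` gives 156. The deck's figures should therefore be rescaled before comparing them with vendor peak numbers. As a made-up illustration of the slide 3 ratio, `min_flops_per_memop(1e12, 1e11, 4.0)` is 40: a device with those invented characteristics would need about 40 FLOP per 4-byte load to stay compute bound.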