SlideShare a Scribd company logo
1 of 34
Download to read offline
Towards Automatic Code Selection 
with ppOpen‐AT: A Case of FDM 
‐ Variants of Numerical Computations and 
Its Impact on a Multi‐core Processor ‐
Takahiro Katagiri i),ii) , Masaharu Matsumoto i),ii),
Satoshi Ohshima i),ii)
1
i) Information Technology Center, The University of Tokyo
ii)JST, CREST
SPNS2015 (International Workshop on Software for Peta‐Scale Numerical Simulation)
Earthquake Research Institute (ERI), The University of Tokyo,
3rd December 2015, Session 3: Auto‐Tuning, 14:45‐15:15
Outline
1. Background
2. ppOpen‐HPC and ppOpen‐AT
3. Code Selection in Seism3D and 
Its Implementation by ppOpen‐AT 
4. Performance Evaluation
5. Conclusion
Outline
1. Background
2. ppOpen‐HPC and ppOpen‐AT
3. Code Selection in Seism3D and 
Its Implementation by ppOpen‐AT 
4. Performance Evaluation
5. Conclusion
Database for
Tuning Knowledge
Development Flow of HPC Software
1. Phase of Specification
2. Phase of
Programming
!ABCLib$ install unroll (i,k) region start
!ABCLib$ name MyMatMul
!ABCLib$ varied (i,k) from 1 to 8
do i=1, n
do j=1, n
do k=1, n
C( i, j ) = C( i, j ) + A( i, k ) * B( k, j )
enddo
enddo
enddo
!ABCLib$ install unroll (i,k) region end
do i=1, n
do j=1, n
do k=1, n
C( i, j ) = C( i, j ) + A( i, k ) * B( k, j )
enddo
enddo
enddo
do i=1, n, 2
do j=1, n
do k=1, n
C( i, j ) = C( i, j ) + A( i, k ) * B( k, j )
C( i+1, j ) = C( i+1, j ) + A( i+1, k ) * B( k, j )
enddo
enddo
enddo
do i=1, n, 2
do j=1, n
Ctmp1 = C( i, j )
Ctmp2 = C( i+1, j )
do k=1, n
Btmp = B( k, j )
Ctmp1 = Ctmp1 + A( i, k ) * Btmp
Ctmp2 = Ctmp2 + A( i+1, k ) * Btmp
enddo
C( i, j ) = Ctmp1
C( i+1, j ) = Ctmp2
enddo
enddo
do i=1, n, 2
do j=1, n
Ctmp1 = C( i, j )
Ctmp2 = C( i+1, j )
do k=1, n, 2
Btmp1 = B( k, j )
Btmp2 = B( k+1, j )
Ctmp1 = Ctmp1 + A( i, k ) * Btmp1
+ A( i, k+1) * Btmp2
Ctmp2 = Ctmp2 + A( i+1, k ) * Btmp1
+ A( i+1, k+1) * Btmp2
enddo
C( i, j )=Ctmp1
C( i+1, j )=Ctmp2
enddo
enddo
3. Phase of Optimization
Code
Generation
4. Phase of
Database and
Knowledge of
Discovery
forTuning
Analyzing of Results
Target Computers
Compile and Run
Increase of
Number of Cores,
Programming
Models, and Code
Optimizations
(Architecture
Kinds)
Outline
1. Background
2. ppOpen‐HPC and ppOpen‐AT
3. Code Selection in Seism3D and 
Its Implementation by ppOpen‐AT 
4. Performance Evaluation
5. Conclusion
ppOpen‐HPC Project
• Middleware for HPC and Its AT Technology
– Supported by JST, CREST, from FY2011 to FY2015.
– PI: Professor Kengo Nakajima (U. Tokyo)
• ppOpen‐HPC 1)
– An open source infrastructure for reliable simulation codes on 
post‐peta (pp) scale parallel computers.
– Consists of various types of libraries, which covers 
5 kinds of discretization methods for scientific computations. 
• ppOpen‐AT 6),7),8) 
– An auto‐tuning language for ppOpen‐HPC codes 
– Using knowledge of previous project: ABCLibScript Project 4). 
– Auto‐tuning language based on directives for AT.
6
Software Architecture of 
ppOpen‐HPC
7
FVM DEMFDMFEM
Many‐core 
CPUs
GPUs
Multi‐core
CPUs
MG
COMM
Auto‐Tuning Facility
Code Generation for Optimization Candidates
Search for the best candidate
Automatic Execution for the optimization
ppOpen‐APPL
ppOpen‐MATH
BEM
ppOpen‐AT
User Program
GRAPH VIS MP
STATIC DYNAMIC
ppOpen‐SYS FT
Optimize 
memory 
accesses 
ppOpen‐AT System  (Based on FIBER 2),3),4),5) ) 
ppOpen‐APPL /*
ppOpen‐AT
Directives
User 
KnowledgeLibrary 
Developer
① Before 
Release‐time
Candidate
1
Candidate
2
Candidate
3
Candidate
nppOpen‐AT
Auto‐Tuner
ppOpen‐APPL / *
Automatic
Code
Generation②
:Target 
Computers
Execution Time④
Library User
③
Library Call
Selection
⑤
⑥
Auto‐tuned
Kernel
Execution
Run‐
time
This user 
benefited 
from AT.
Scenario of AT for ppOpen‐APPL/FDM
9
Execution with optimized
kernels without AT process.
Library User
Set AT parameters,
and execute the library
(OAT_AT_EXEC=1)
■Execute auto-tuner:
With fixed loop lengths
(by specifying problem size and number of
MPI processes)
Time measurement for target kernels
Store the best variant information.
Set AT parameters, and
execute the library
(OAT_AT_EXEC=0)
Store the fastest kernel
information
Using the fastest kernel without AT 
(except for varying problem size, number 
of MPI processes and OpenMP threads.)
Specify problem size,
number of MPI processes
and OpenMP threads.
AT Timings of ppOpen‐AT (FIBER Framework)
OAT_ATexec()
…
do i=1, MAX_ITER
Target_kernel_k()
…
Target_kernel_m()
…
enddo
Read the best parameter
Is this first call?
Yes
Read the best parameter
Is this first call?
Yes
One time execution (except 
for varying problem size and 
number of MPI processes )
AT for Before Execute‐time
Execute Target_Kernel_k() with varying 
parameters
Execute Target_Kernel_m() with varying 
parameters
parameter 
Store the best 
parameter 
…
Outline
1. Background
2. ppOpen‐HPC and ppOpen‐AT
3. Code Selection in Seism3D and 
Its Implementation by ppOpen‐AT 
4. Performance Evaluation
5. Conclusion
Target Application
• Seism3D: 
Simulation for seismic wave analysis.
• Developed by Professor T.Furumura
at the University of Tokyo.
–The code is re‐constructed as 
ppOpen‐APPL/FDM.
• Finite Differential Method (FDM) 
• 3D simulation
–3D arrays are allocated.
• Data type: Single Precision (real*4) 12
Flow Diagram of 
ppOpen‐APPL/FDM
13
),,,(
}],,,)
2
1
({},,)
2
1
({[
1
),,(
2/
1
zyxqp
zyxmxzyxmxc
x
zyx
dx
d
pqpq
M
m
m
pq



 


 Space difference by FDM.
),,(,
12
1
2
1
zyxptf
zyx
uu n
p
n
zp
n
yp
n
xpn
p
n
p 

















 


 Explicit time expansion by central 
difference.
Initialization
Velocity Derivative (def_vel)
Velocity Update (update_vel)
Stress Derivative (def_stress)
Stress Update (update_stress)
Stop Iteration?
NO
YES
End
Velocity PML condition (update_vel_sponge)
Velocity Passing (MPI) (passing_vel)
Stress PML condition (update_stress_sponge)
Stress Passing (MPI) (passing_stress)
Original Implementation (For Vector Machines)
call ppohFDM_pdiffx3_m4_OAT( VX,DXVX, NXP,NYP,NZP,NXP0,NXP1,NYP0,…)
call ppohFDM_pdiffy3_p4_OAT( VX,DYVX, NXP,NYP,NZP,NXP0,NXP1,NYP0,…)
call ppohFDM_pdiffz3_p4_OAT( VX,DZVX, NXP,NYP,NZP,NXP0,NXP1,NYP0,…)
call ppohFDM_pdiffy3_m4_OAT( VY,DYVY, NXP,NYP,NZP,NXP0,NXP1,NYP0,… )
call ppohFDM_pdiffx3_p4_OAT( VY,DXVY, NXP,NYP,NZP,NXP0,NXP1,NYP0,… )
call ppohFDM_pdiffz3_p4_OAT( VY,DZVY, NXP,NYP,NZP,NXP0,NXP1,NYP0,…)
call ppohFDM_pdiffx3_p4_OAT( VZ,DXVZ, NXP,NYP,NZP,NXP0,NXP1,NYP0,…)
call ppohFDM_pdiffy3_p4_OAT( VZ,DYVZ, NXP,NYP,NZP,NXP0,NXP1,NYP0,… )
call ppohFDM_pdiffz3_m4_OAT( VZ,DZVZ, NXP,NYP,NZP,NXP0,NXP1,NYP0,…)
if( is_fs .or. is_nearfs ) then
call ppohFDM_bc_vel_deriv( KFSZ,NIFS,NJFS,IFSX,IFSY,IFSZ,JFSX,JFSY,JFSZ )
end if
call ppohFDM_update_stress(1, NXP, 1, NYP, 1, NZP)
Fourth‐order accurate central‐difference scheme 
for velocity. (def_stress)
Process of model boundary. 
Explicit time expansion by leap‐frog scheme. (update_stress)
Original Implementation (For Vector Machines)
subroutine OAT_InstallppohFDMupdate_stress(..)
!$omp parallel do private(i,j,k,RL1,RM1,RM2,RLRM2,DXVX1,DYVY1,DZVZ1,…)
do k = NZ00, NZ01
do j = NY00, NY01
do i = NX00, NX01
RL1   = LAM (I,J,K); RM1   = RIG (I,J,K);  RM2   = RM1 + RM1; RLRM2 = RL1+RM2
DXVX1 = DXVX(I,J,K);  DYVY1 = DYVY(I,J,K);  DZVZ1 = DZVZ(I,J,K)
D3V3  = DXVX1 + DYVY1 + DZVZ1
SXX (I,J,K) = SXX (I,J,K) + (RLRM2*(D3V3)‐RM2*(DZVZ1+DYVY1) ) * DT
SYY (I,J,K) = SYY (I,J,K)  + (RLRM2*(D3V3)‐RM2*(DXVX1+DZVZ1) ) * DT
SZZ (I,J,K) = SZZ (I,J,K)  + (RLRM2*(D3V3)‐RM2*(DXVX1+DYVY1) ) * DT
DXVYDYVX1 = DXVY(I,J,K)+DYVX(I,J,K);  DXVZDZVX1 = DXVZ(I,J,K)+DZVX(I,J,K)
DYVZDZVY1 = DYVZ(I,J,K)+DZVY(I,J,K)
SXY (I,J,K) = SXY (I,J,K) + RM1 * DXVYDYVX1 * DT
SXZ (I,J,K) = SXZ (I,J,K) + RM1 * DXVZDZVX1 * DT
SYZ (I,J,K) = SYZ (I,J,K) + RM1 * DYVZDZVY1 * DT
end do
end do
end do
retuen
end
Explicit time 
expansion by 
leap‐frog scheme. 
(update_stress)
Input and output for arrays Input and output for arrays 
in each call ‐> Increase of 
B/F ratio: ~1.7
Implementation Strategy for
Code Variants 
• Change of computation order based on mathematics.
• With experimental knowledge, we implement the 
following codes for the variants.
• For time expansion of stress tensor, the loop can be 
separated to two loops with respect to nature of 
isotropic elastic body (Loop Spilit):
– Loop1: Sxx, Syy, Szz, and 
– Loop2: Sxy, Sxz, Syz, 
where the stress tenser S is defined by: 
S = [(Sxx, Sxy, Sxz), (Sxy, Syy, Syz), (Sxz, Syz, Szz)]  
• The three nested loop for the above two loops is 
collapsed with the outer two loops.
The Code Variants (For Scalar Machines)
 Variant1 (IF‐statements inside)
– The followings include inside loop:
1. Fourth‐order accurate central‐difference scheme for velocity.
2. Process of model boundary.
3. Explicit time expansion by leap‐frog scheme.
 Variant2 (IF‐free, but there is IF‐statements inside loop for 
process of model boundary. )
– To remove IF sentences from the variant1, the loops are 
reconstructed. 
– The order of computations is changed, but the result without round‐
off errors is same. 
– [Main Loop]
1. Fourth‐order accurate central‐difference scheme for velocity.
2. Explicit time expansion by leap‐frog scheme.
– [Loop for process of model boundary]
1. Fourth‐order accurate central‐difference scheme for velocity.
2. Process of model boundary.
3. Explicit time expansion by leap‐frog scheme.
Code selection by ppOpen‐AT and hierarchical AT
Program main
….
!OAT$ install select region start
!OAT$ name ppohFDMupdate_vel_select
!OAT$ select sub region start
call ppohFDM_pdiffx3_p4( SXX,DXSXX,NXP,NYP,NZP,….)
call ppohFDM_pdiffy3_p4( SYY,DYSYY, NXP,NYP,NZP,…..)
…
if( is_fs .or. is_nearfs ) then
call ppohFDM_bc_stress_deriv( KFSZ,NIFS,NJFS,IFSX,….)
end if
call ppohFDM_update_vel    ( 1, NXP, 1, NYP, 1, NZP )
!OAT$ select sub region end
!OAT$ select sub region start
Call ppohFDM_update_vel_Intel  ( 1, NXP, 1, NYP, 1, NZP )
!OAT$ select sub region end
!OAT$ install select region end
Upper Code
With Select clause, 
code selection can be 
specified.
subroutine ppohFDM_update_vel(….)
….
!OAT$ install LoopFusion region start
!OAT$ name ppohFDMupdate_vel
!OAT$ debug (pp)
!$omp parallel do private(i,j,k,ROX,ROY,ROZ)
do k = NZ00, NZ01
do j = NY00, NY01
do i = NX00, NX01
…..
….
Lower Code
subroutine ppohFDM_pdiffx3_p4(….)
….
!OAT$ install LoopFusion region start
….
Call tree graph by the AT
Start
Stress Derivative (def_stress)
Stress Update (update_stress)
Stop iteration?
NO
YES
End
Velocity PML condition (update_vel_sponge)
Velocity Passing (MPI) (passing_vel)
Stress PML condition (update_stress_sponge)
Stress Passing (MPI) (passing_stress)
Velocity Derivative (def_vel)
Velocity Update (update_vel)
Main
Program
Velocity Update (update_vel_Scalar)
Stress Update (update_stress_Scalar)
Selection
Selection
update_vel_select
update_stress_select
update_vel_select
update_stress_select
:auto‐generated codes
Velocity Update (update_vel_IF_free)
Selection
Stress Update (update_stress_IF_free)
Selection
CandidateCandidateCandidateCandidate
CandidateCandidateCandidateCandidate
CandidateCandidateCandidateCandidate
CandidateCandidateCandidateCandidate
CandidateCandidateCandidateCandidate
CandidateCandidateCandidateCandidate
CandidateCandidateCandidateCandidate
Execution Order of the AT
Velocity PML condition (update_vel_sponge)
Velocity Passing (MPI) (passing_vel)
Stress PML condition (update_stress_sponge)
Stress Passing (MPI) (passing_stress)
update_vel_select
update_stress_select
Stress Derivative 
(def_stress)
Stress Update 
(update_stress)
Velocity Derivative
(def_vel)
Velocity Update
(update_vel)
Velocity Update 
(update_vel_Scalar)
Stress Update
(update_stress_Scalar)
Def_* AT Candidates
Update_vel
AT Candidates
Update_stress
AT Candidates
Update_vel_sponge
AT Candidates
Update_stress_sponge
AT Candidates
Passing_vel
AT Candidates
Passing_stress
AT Candidates
①
②
②
③
③
④
④
④
④
Velocity Update 
(update_vel_IF_free)
Stress Update
(update_stress_IF_free)
We can specify the 
order via a directive of 
ppOpen‐AT.
(an extended function)
Outline
1. Background
2. ppOpen‐HPC and ppOpen‐AT
3. Code Selection in Seism3D and 
Its Implementation by ppOpen‐AT 
4. Performance Evaluation
5. Conclusion
The Number of AT Candidates (ppOpen‐APPL/FDM)
22
Kernel Names AT Objects The Number of Candidates
1. update_stress ・Loop Collapses and Splits :8 Kinds
・Code Selections : 2 Kinds
10
2. update_vel ・Loop Collapses, Splits, and re‐
ordering of statements: :6 Kinds
・Code Selections: 2 Kinds
8
3. update_stress_sponge ・Loop Collapses:3 Kinds 3
4. update_vel_sponge ・Loop Collapses:3 Kinds 3
5. ppohFDM_pdiffx3_p4 Kernel Names:def_update、def_vel
・Loop Collapses:3 Kinds
3
6. ppohFDM_pdiffx3_m4 3
7. ppohFDM_pdiffy3_p4 3
8. ppohFDM_pdiffy3_m4 3
9. ppohFDM_pdiffz3_p4 3
10.ppohFDM_pdiffz3_m4 3
11. ppohFDM_ps_pack Data packing and unpacking
・Loop Collapses: 3 Kinds
3
12. ppohFDM_ps_unpack 3
13. ppohFDM_pv_pack 3
14. ppohFDM_pv_unpack 3
 Total:54 Kinds
 Hybrid 
MPI/OpenMP:
7 Kinds
 54×7 = 
378 Kinds
Machine Environment 
(8 nodes of the Xeon Phi)
 The Intel Xeon Phi 
 Xeon Phi 5110P (1.053 GHz), 60 cores
 Memory Amount:8 GB (GDDR5)
 Theoretical Peak Performance:1.01 TFLOPS
 One board per node of the Xeon phi cluster
 InfiniBand FDR x 2 Ports 
 Mellanox Connect‐IB
 PCI‐E Gen3 x16
 56Gbps x 2
 Theoretical Peak bandwidth 13.6GB/s
 Full‐Bisection
 Intel MPI
 Based on MPICH2, MVAPICH2
 Version 5.0 Update 3 Build 20150128 
 Compiler:Intel Fortran version 15.0.3 20150407
 Compiler Options:
‐ipo20 ‐O3 ‐warn all ‐openmp ‐mcmodel=medium ‐shared‐intel –mmic
‐align array64byte
 KMP_AFFINITY=granularity=fine, balanced (Uniform Distribution of threads 
between sockets)
Execution Details
• ppOpen‐APPL/FDM ver.0.2
• ppOpen‐AT ver.0.2
• The number of time step: 2000 steps
• The number of nodes: 8 node
• Native Mode Execution
• Target Problem Size 
(Almost maximum size with 8 GB/node)
– NX * NY * NZ = 512 x  512 x 512 / 8 Node
– NX * NY * NZ = 256 * 256 * 256 / node
(!= per MPI Process)
• The number of iterations for kernels 
to do auto‐tuning: 100
Execution Details of Hybrid 
MPI/OpenMP
• Target MPI Processes and OMP Threads on the Xeon Phi
– The Xeon Phi with 4 HT (Hyper Threading) 
– PX TY: X MPI Processes and Y Threads per process
– P8T240 : Minimum Hybrid MPI/OpenMP execution for 
ppOpen‐APPL/FDM, since it needs minimum 8 MPI Processes.
– P16T120
– P32T60
– P64T30
– P128T15  
– P240T8
– P480T4
– Less than P960T2 cause an MPI error in this environment.
#
0
#
1
#
2
#
3
#
4
#
5
#
6
#
7
#
8
#
9
#
1
0
#
1
1
#
1
2
#
1
3
#
1
4
#
1
5
P2T8
#
0
#
1
#
2
#
3
#
4
#
5
#
6
#
7
#
8
#
9
#
1
0
#
1
1
#
1
2
#
1
3
#
1
4
#
1
5
P4T4 Target of cores for 
one MPI Process
BREAK DOWN OF TIMINGS 
THE BEST IMPLEMENTATION
Outline
1. Background
2. ppOpen‐HPC and ppOpen‐AT
3. Code Selection in Seism3D and 
Its Implementation by ppOpen‐AT 
4. Performance Evaluation
5. Conclusion
RELATED WORK
Originality (AT Languages)
AT Language 
/ Items
#
1
#
2
#
3
#
4
#
5
#
6
#
7
#
8
ppOpen‐AT OAT Directives ✔ ✔ ✔ ✔ ✔ None
Vendor Compilers Out of Target Limited ‐
Transformation 
Recipes 
Recipe
Descriptions
✔ ✔ ChiLL
POET Xform Description ✔ ✔ POET translator, ROSE
X language 
Xlang Pragmas ✔ ✔ X Translation,
‘C and tcc
SPL SPL Expressions ✔ ✔ ✔ A Script Language
ADAPT
ADAPT 
Language
✔ ✔ Polaris Compiler 
Infrastructure, 
Remote Procedure 
Call (RPC)
Atune‐IL
Atune Pragmas ✔ A Monitoring 
Daemon
PEPPHER
PEPPHER Pragmas
(interface)
✔ ✔ ✔ PEPPHER task graph
and run-time
Xevolver
Directive Extension
(Recipe Descriptions)
(✔) (✔) (✔) ROSE,
XSLT Translator
#1: Method for supporting multi-computer environments. #2: Obtaining loop length in run-time.
#3: Loop split with increase of computations6) ,and loop collapses to the split loops6),7),8) .
#4: Re-ordering of inner-loop sentences8) . #5:Code selection with loop transformations (Hierarchical AT
descriptions*) *This is originality in current researches of AT as of 2015. #6:Algorithm selection.
#7: Code generation with execution feedback. #8: Software requirement.
(Users need to define rules. )
Conclusion Remarks
 Propose an IF‐free kernel: An effective kernel 
implementation of an application with FDM by merging 
computations of central‐difference and explicit time 
expansion schemes. 
 Use AT language to adapt code selection for new kernels: 
The effectiveness of the new implementation depends 
on the CPU architecture and execution situation, such as 
problem size and the number of MPI processes and 
OpenMP threads. 
To obtain free code (MIT Licensing):
http://ppopenhpc.cc.u‐tokyo.ac.jp/
Future Work
• Improving Search Algorithm
– We use a brute‐force search in current implementation.
• This is feasible by applying knowledge of application.
– We have implemented a new search algorithm based on black box 
performance models.
• d‐Spline Model (interpolation and incremental addition
based) ,collaborated with Prof. Tanaka (Kogakuin U.)
• Surrogate Model (interpolation and probability based) collaborated 
with Prof. Wang (National Taiwan U.) 
• Off‐loading Implementation Selection 
(for the Xeon Phi)
– If problem size is too small to do off‐loading, the target execution is 
performed on CPU automatically.
• Adaptation of OpenACC for GPU computing
– Selection of OpenACC directives with ppOpenAT 
by Dr. Ohshima.
• gang, vector, parallel, etc. 
http://ppopenhpc.cc.u‐tokyo.ac.jp/
Thank you for your attention!
Questions?
References
1) K. Nakajima, M. Satoh, T. Furumura, H. Okuda, T. Iwashita, H. Sakaguchi, T. Katagiri, M. Matsumoto, S. Ohshima, 
H. Jitsumoto, T. Arakawa, F. Mori, T. Kitayama, A. Ida and M. Y. Matsuo, ppOpen‐HPC: Open Source Infrastructure 
for Development and Execution of Large‐Scale Scientific Applications on Post‐Peta‐Scale Supercomputers with 
Automatic Tuning (AT), Optimization in the Real World, Mathematics for Industry 13, K. Fujisawa et al. (eds.), 
Springer, pp.15‐35 (2016)
2) Takahiro Katagiri, Kenji Kise, Hiroki Honda, Toshitsugu Yuba: FIBER: A general framework for auto‐tuning 
software, The Fifth International Symposium on High Performance Computing (ISHPC‐V), Springer LNCS 2858, pp. 
146‐159 (2003)
3) Takahiro Katagiri, Kenji Kise, Hiroki Honda, Toshitsugu Yuba: Effect of auto‐tuning with user's knowledge for 
numerical software, Proceedings of the ACM 1st conference on Computing frontiers (CF2004), pp.12‐25, (2004)
4) Takahiro Katagiri, Kenji Kise, Hiroki Honda, Toshitsugu Yuba: ABCLibScript: A directive to support specification of 
an auto‐tuning facility for numerical software, Parallel Computing ,32 (1), pp.92‐112 (2006)
5) Takahiro Katagiri, Kenji Kise, Hiroki Honda, and Toshitsugu Yuba: ABCLib_DRSSED: A Parallel Eigensolver with an 
Auto‐tuning Facility, Parallel Computing, Vol.32, Issue 3, pp.231‐250 (2006)
6) Takahiro Katagiri, Satoshi Ito, Satoshi Ohshima: Early experiences for adaptation of auto‐tuning by ppOpen‐AT to 
an explicit method, Special Session: Auto‐Tuning for Multicore and GPU (ATMG) (In Conjunction with the IEEE 
MCSoC‐13), Proceedings of MCSoC‐13, pp.123‐128 (2013) 
7) Takahiro Katagiri, Satoshi Ohshima, Masaharu Matsumoto: Auto‐tuning of computation kernels from an FDM 
Code with ppOpen‐AT, Special Session: Auto‐Tuning for Multicore and GPU (ATMG) (In Conjunction with the IEEE 
MCSoC‐14), Proceedings of MCSoC‐14, pp.253‐260 (2014) 
8) Takahiro Katagiri, Satoshi Ohshima, Masaharu Matsumoto: Directive‐based Auto‐tuning for the Finite Difference 
Method on the Xeon Phi, The Tenth International Workshop on Automatic Performance Tuning (iWAPT2015) (In 
Conjunction with the IEEE IPDPS2015 ), Proceedings of IPDPSW2015, pp.1221‐1230 (2015)

More Related Content

What's hot

Shop 2 presentation slide
Shop 2 presentation slideShop 2 presentation slide
Shop 2 presentation slideAndrea Tucci
 
Auto‐Tuning of Hierarchical Computations with ppOpen‐AT
Auto‐Tuning of Hierarchical Computations with ppOpen‐ATAuto‐Tuning of Hierarchical Computations with ppOpen‐AT
Auto‐Tuning of Hierarchical Computations with ppOpen‐ATTakahiro Katagiri
 
A New Cross Diamond Search Motion Estimation Algorithm for HEVC
A New Cross Diamond Search Motion Estimation Algorithm for HEVCA New Cross Diamond Search Motion Estimation Algorithm for HEVC
A New Cross Diamond Search Motion Estimation Algorithm for HEVCIJERA Editor
 
Fault Tolerant Parallel Filters Based On Bch Codes
Fault Tolerant Parallel Filters Based On Bch CodesFault Tolerant Parallel Filters Based On Bch Codes
Fault Tolerant Parallel Filters Based On Bch CodesIJERA Editor
 
Dsp lab manual
Dsp lab manualDsp lab manual
Dsp lab manualamanabr
 
FPGA Implementation of FIR Filter using Various Algorithms: A Retrospective
FPGA Implementation of FIR Filter using Various Algorithms: A RetrospectiveFPGA Implementation of FIR Filter using Various Algorithms: A Retrospective
FPGA Implementation of FIR Filter using Various Algorithms: A RetrospectiveIJORCS
 
Sucha_ICC_2012
Sucha_ICC_2012Sucha_ICC_2012
Sucha_ICC_2012sucha
 
Reducing computational complexity of Mathematical functions using FPGA
Reducing computational complexity of Mathematical functions using FPGAReducing computational complexity of Mathematical functions using FPGA
Reducing computational complexity of Mathematical functions using FPGAnehagaur339
 
Algorithm And analysis Lecture 03& 04-time complexity.
 Algorithm And analysis Lecture 03& 04-time complexity. Algorithm And analysis Lecture 03& 04-time complexity.
Algorithm And analysis Lecture 03& 04-time complexity.Tariq Khan
 
D I G I T A L I C A P P L I C A T I O N S J N T U M O D E L P A P E R{Www
D I G I T A L  I C  A P P L I C A T I O N S  J N T U  M O D E L  P A P E R{WwwD I G I T A L  I C  A P P L I C A T I O N S  J N T U  M O D E L  P A P E R{Www
D I G I T A L I C A P P L I C A T I O N S J N T U M O D E L P A P E R{Wwwguest3f9c6b
 
FPGA FIR filter implementation (Audio signal processing)
FPGA FIR filter implementation (Audio signal processing)FPGA FIR filter implementation (Audio signal processing)
FPGA FIR filter implementation (Audio signal processing)Hocine Merabti
 

What's hot (18)

Shop 2 presentation slide
Shop 2 presentation slideShop 2 presentation slide
Shop 2 presentation slide
 
[IJCT-V3I2P17] Authors: Sheng Lai, Xiaohua Meng, Dongqin Zheng
[IJCT-V3I2P17] Authors: Sheng Lai, Xiaohua Meng, Dongqin Zheng[IJCT-V3I2P17] Authors: Sheng Lai, Xiaohua Meng, Dongqin Zheng
[IJCT-V3I2P17] Authors: Sheng Lai, Xiaohua Meng, Dongqin Zheng
 
Auto‐Tuning of Hierarchical Computations with ppOpen‐AT
Auto‐Tuning of Hierarchical Computations with ppOpen‐ATAuto‐Tuning of Hierarchical Computations with ppOpen‐AT
Auto‐Tuning of Hierarchical Computations with ppOpen‐AT
 
NLP in 2020
NLP in 2020NLP in 2020
NLP in 2020
 
A New Cross Diamond Search Motion Estimation Algorithm for HEVC
A New Cross Diamond Search Motion Estimation Algorithm for HEVCA New Cross Diamond Search Motion Estimation Algorithm for HEVC
A New Cross Diamond Search Motion Estimation Algorithm for HEVC
 
Lecture 1
Lecture 1Lecture 1
Lecture 1
 
Fault Tolerant Parallel Filters Based On Bch Codes
Fault Tolerant Parallel Filters Based On Bch CodesFault Tolerant Parallel Filters Based On Bch Codes
Fault Tolerant Parallel Filters Based On Bch Codes
 
Dsp lab manual
Dsp lab manualDsp lab manual
Dsp lab manual
 
FPGA Implementation of FIR Filter using Various Algorithms: A Retrospective
FPGA Implementation of FIR Filter using Various Algorithms: A RetrospectiveFPGA Implementation of FIR Filter using Various Algorithms: A Retrospective
FPGA Implementation of FIR Filter using Various Algorithms: A Retrospective
 
Sucha_ICC_2012
Sucha_ICC_2012Sucha_ICC_2012
Sucha_ICC_2012
 
3rd 3DDRESD: Floorplacer
3rd 3DDRESD: Floorplacer3rd 3DDRESD: Floorplacer
3rd 3DDRESD: Floorplacer
 
Time andspacecomplexity
Time andspacecomplexityTime andspacecomplexity
Time andspacecomplexity
 
Reducing computational complexity of Mathematical functions using FPGA
Reducing computational complexity of Mathematical functions using FPGAReducing computational complexity of Mathematical functions using FPGA
Reducing computational complexity of Mathematical functions using FPGA
 
Algorithm And analysis Lecture 03& 04-time complexity.
 Algorithm And analysis Lecture 03& 04-time complexity. Algorithm And analysis Lecture 03& 04-time complexity.
Algorithm And analysis Lecture 03& 04-time complexity.
 
D I G I T A L I C A P P L I C A T I O N S J N T U M O D E L P A P E R{Www
D I G I T A L  I C  A P P L I C A T I O N S  J N T U  M O D E L  P A P E R{WwwD I G I T A L  I C  A P P L I C A T I O N S  J N T U  M O D E L  P A P E R{Www
D I G I T A L I C A P P L I C A T I O N S J N T U M O D E L P A P E R{Www
 
Complexity of Algorithm
Complexity of AlgorithmComplexity of Algorithm
Complexity of Algorithm
 
EEDC Programming Models
EEDC Programming ModelsEEDC Programming Models
EEDC Programming Models
 
FPGA FIR filter implementation (Audio signal processing)
FPGA FIR filter implementation (Audio signal processing)FPGA FIR filter implementation (Audio signal processing)
FPGA FIR filter implementation (Audio signal processing)
 

Viewers also liked

ppOpen-AT : Yet Another Directive-base AT Language
ppOpen-AT : Yet Another Directive-base AT LanguageppOpen-AT : Yet Another Directive-base AT Language
ppOpen-AT : Yet Another Directive-base AT LanguageTakahiro Katagiri
 
ppOpen-HPCコードを自動チューニングする言語ppOpen-ATの現状と新展開
ppOpen-HPCコードを自動チューニングする言語ppOpen-ATの現状と新展開ppOpen-HPCコードを自動チューニングする言語ppOpen-ATの現状と新展開
ppOpen-HPCコードを自動チューニングする言語ppOpen-ATの現状と新展開Takahiro Katagiri
 
ATTA2014基盤B導入(片桐)
ATTA2014基盤B導入(片桐)ATTA2014基盤B導入(片桐)
ATTA2014基盤B導入(片桐)Takahiro Katagiri
 
Towards Auto‐tuning for the Finite Difference Method in Era of 200+ Thread Pa...
Towards Auto‐tuning for the Finite Difference Method in Era of 200+ Thread Pa...Towards Auto‐tuning for the Finite Difference Method in Era of 200+ Thread Pa...
Towards Auto‐tuning for the Finite Difference Method in Era of 200+ Thread Pa...Takahiro Katagiri
 
SCG-AT:静的コード生成のみによる自動チューニング実現方式
SCG-AT:静的コード生成のみによる自動チューニング実現方式SCG-AT:静的コード生成のみによる自動チューニング実現方式
SCG-AT:静的コード生成のみによる自動チューニング実現方式Takahiro Katagiri
 
Extreme‐Scale Parallel Symmetric Eigensolver for Very Small‐Size Matrices Usi...
Extreme‐Scale Parallel Symmetric Eigensolver for Very Small‐Size Matrices Usi...Extreme‐Scale Parallel Symmetric Eigensolver for Very Small‐Size Matrices Usi...
Extreme‐Scale Parallel Symmetric Eigensolver for Very Small‐Size Matrices Usi...Takahiro Katagiri
 
Impact of Auto-tuning of Kernel Loop Transformation by using ppOpen-AT
Impact of Auto-tuning of Kernel Loop Transformation by using ppOpen-ATImpact of Auto-tuning of Kernel Loop Transformation by using ppOpen-AT
Impact of Auto-tuning of Kernel Loop Transformation by using ppOpen-ATTakahiro Katagiri
 
自動チューニングとビックデータ:機械学習の適用の可能性
自動チューニングとビックデータ:機械学習の適用の可能性自動チューニングとビックデータ:機械学習の適用の可能性
自動チューニングとビックデータ:機械学習の適用の可能性Takahiro Katagiri
 
ppOpen-ATによる静的コード生成で実現する 自動チューニング方式の評価
ppOpen-ATによる静的コード生成で実現する自動チューニング方式の評価ppOpen-ATによる静的コード生成で実現する自動チューニング方式の評価
ppOpen-ATによる静的コード生成で実現する 自動チューニング方式の評価Takahiro Katagiri
 
Towards Auto-tuning Facilities into Supercomputers in Operation - The FIBER a...
Towards Auto-tuning Facilities into Supercomputers in Operation - The FIBER a...Towards Auto-tuning Facilities into Supercomputers in Operation - The FIBER a...
Towards Auto-tuning Facilities into Supercomputers in Operation - The FIBER a...Takahiro Katagiri
 
ソフトウェア自動チューニング研究紹介
ソフトウェア自動チューニング研究紹介ソフトウェア自動チューニング研究紹介
ソフトウェア自動チューニング研究紹介Takahiro Katagiri
 

Viewers also liked (13)

ppOpen-AT : Yet Another Directive-base AT Language
ppOpen-AT : Yet Another Directive-base AT LanguageppOpen-AT : Yet Another Directive-base AT Language
ppOpen-AT : Yet Another Directive-base AT Language
 
ppOpen-HPCコードを自動チューニングする言語ppOpen-ATの現状と新展開
ppOpen-HPCコードを自動チューニングする言語ppOpen-ATの現状と新展開ppOpen-HPCコードを自動チューニングする言語ppOpen-ATの現状と新展開
ppOpen-HPCコードを自動チューニングする言語ppOpen-ATの現状と新展開
 
ATTA2014基盤B導入(片桐)
ATTA2014基盤B導入(片桐)ATTA2014基盤B導入(片桐)
ATTA2014基盤B導入(片桐)
 
Towards Auto‐tuning for the Finite Difference Method in Era of 200+ Thread Pa...
Towards Auto‐tuning for the Finite Difference Method in Era of 200+ Thread Pa...Towards Auto‐tuning for the Finite Difference Method in Era of 200+ Thread Pa...
Towards Auto‐tuning for the Finite Difference Method in Era of 200+ Thread Pa...
 
SCG-AT:静的コード生成のみによる自動チューニング実現方式
SCG-AT:静的コード生成のみによる自動チューニング実現方式SCG-AT:静的コード生成のみによる自動チューニング実現方式
SCG-AT:静的コード生成のみによる自動チューニング実現方式
 
Extreme‐Scale Parallel Symmetric Eigensolver for Very Small‐Size Matrices Usi...
Extreme‐Scale Parallel Symmetric Eigensolver for Very Small‐Size Matrices Usi...Extreme‐Scale Parallel Symmetric Eigensolver for Very Small‐Size Matrices Usi...
Extreme‐Scale Parallel Symmetric Eigensolver for Very Small‐Size Matrices Usi...
 
iWAPT2015_katagiri
iWAPT2015_katagiriiWAPT2015_katagiri
iWAPT2015_katagiri
 
Impact of Auto-tuning of Kernel Loop Transformation by using ppOpen-AT
Impact of Auto-tuning of Kernel Loop Transformation by using ppOpen-ATImpact of Auto-tuning of Kernel Loop Transformation by using ppOpen-AT
Impact of Auto-tuning of Kernel Loop Transformation by using ppOpen-AT
 
自動チューニングとビックデータ:機械学習の適用の可能性
自動チューニングとビックデータ:機械学習の適用の可能性自動チューニングとビックデータ:機械学習の適用の可能性
自動チューニングとビックデータ:機械学習の適用の可能性
 
ppOpen-ATによる静的コード生成で実現する 自動チューニング方式の評価
ppOpen-ATによる静的コード生成で実現する自動チューニング方式の評価ppOpen-ATによる静的コード生成で実現する自動チューニング方式の評価
ppOpen-ATによる静的コード生成で実現する 自動チューニング方式の評価
 
Ase20 20151016 hp
Ase20 20151016 hpAse20 20151016 hp
Ase20 20151016 hp
 
Towards Auto-tuning Facilities into Supercomputers in Operation - The FIBER a...
Towards Auto-tuning Facilities into Supercomputers in Operation - The FIBER a...Towards Auto-tuning Facilities into Supercomputers in Operation - The FIBER a...
Towards Auto-tuning Facilities into Supercomputers in Operation - The FIBER a...
 
ソフトウェア自動チューニング研究紹介
ソフトウェア自動チューニング研究紹介ソフトウェア自動チューニング研究紹介
ソフトウェア自動チューニング研究紹介
 

Similar to Towards Automatic Code Selection with ppOpen-AT: A Case of FDM - Variants of Numerical Computations and Its Impact on a Multi-core Processor -

Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...EL-Hachemi Guerrout
 
Data Structures and Algorithm Analysis
Data Structures  and  Algorithm AnalysisData Structures  and  Algorithm Analysis
Data Structures and Algorithm AnalysisMary Margarat
 
Overview of Qiskit Ignis - Struggle with errors -
Overview of Qiskit Ignis   - Struggle with errors - Overview of Qiskit Ignis   - Struggle with errors -
Overview of Qiskit Ignis - Struggle with errors - Shin Nishio
 
Stack Hybridization: A Mechanism for Bridging Two Compilation Strategies in a...
Stack Hybridization: A Mechanism for Bridging Two Compilation Strategies in a...Stack Hybridization: A Mechanism for Bridging Two Compilation Strategies in a...
Stack Hybridization: A Mechanism for Bridging Two Compilation Strategies in a...Yusuke Izawa
 
Computer Architecture and Organization
Computer Architecture and OrganizationComputer Architecture and Organization
Computer Architecture and Organizationssuserdfc773
 
A Configurable and Low Power Hard-Decision Viterbi Decoder in VLSI Architecture
A Configurable and Low Power Hard-Decision Viterbi Decoder in VLSI ArchitectureA Configurable and Low Power Hard-Decision Viterbi Decoder in VLSI Architecture
A Configurable and Low Power Hard-Decision Viterbi Decoder in VLSI ArchitectureIRJET Journal
 
IP Core Design of Hight Lightweight Cipher and its Implementation
IP Core Design of Hight Lightweight Cipher and its Implementation IP Core Design of Hight Lightweight Cipher and its Implementation
IP Core Design of Hight Lightweight Cipher and its Implementation csandit
 
IP CORE DESIGN OF HIGHT LIGHTWEIGHT CIPHER AND ITS IMPLEMENTATION
IP CORE DESIGN OF HIGHT LIGHTWEIGHT CIPHER AND ITS IMPLEMENTATIONIP CORE DESIGN OF HIGHT LIGHTWEIGHT CIPHER AND ITS IMPLEMENTATION
IP CORE DESIGN OF HIGHT LIGHTWEIGHT CIPHER AND ITS IMPLEMENTATIONcscpconf
 
Combining Phase Identification and Statistic Modeling for Automated Parallel ...
Combining Phase Identification and Statistic Modeling for Automated Parallel ...Combining Phase Identification and Statistic Modeling for Automated Parallel ...
Combining Phase Identification and Statistic Modeling for Automated Parallel ...Mingliang Liu
 
Gpu workshop cluster universe: scripting cuda
Gpu workshop cluster universe: scripting cudaGpu workshop cluster universe: scripting cuda
Gpu workshop cluster universe: scripting cudaFerdinand Jamitzky
 
Full stack component of software and middleware for quantum machine
Full stack component of software and middleware for quantum machineFull stack component of software and middleware for quantum machine
Full stack component of software and middleware for quantum machineYuichiro MInato
 
On Implementation of Neuron Network(Back-propagation)
On Implementation of Neuron Network(Back-propagation)On Implementation of Neuron Network(Back-propagation)
On Implementation of Neuron Network(Back-propagation)Yu Liu
 
The CAOS framework: democratize the acceleration of compute intensive applica...
The CAOS framework: democratize the acceleration of compute intensive applica...The CAOS framework: democratize the acceleration of compute intensive applica...
The CAOS framework: democratize the acceleration of compute intensive applica...NECST Lab @ Politecnico di Milano
 
Plan_design and FPGA implement of MIMO OFDM SDM systems
Plan_design and FPGA implement of MIMO OFDM SDM systemsPlan_design and FPGA implement of MIMO OFDM SDM systems
Plan_design and FPGA implement of MIMO OFDM SDM systemsTan Vo
 
Python Introduction | JNTUA | R19 | UNIT 1
Python Introduction | JNTUA | R19 | UNIT 1 Python Introduction | JNTUA | R19 | UNIT 1
Python Introduction | JNTUA | R19 | UNIT 1 FabMinds
 
Static analysis of C++ source code
Static analysis of C++ source codeStatic analysis of C++ source code
Static analysis of C++ source codePVS-Studio
 
Static analysis of C++ source code
Static analysis of C++ source codeStatic analysis of C++ source code
Static analysis of C++ source codeAndrey Karpov
 

Similar to Towards Automatic Code Selection with ppOpen-AT: A Case of FDM - Variants of Numerical Computations and Its Impact on a Multi-core Processor - (20)

6. Implementation
6. Implementation6. Implementation
6. Implementation
 
Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
 
Dsp lab manual 15 11-2016
Dsp lab manual 15 11-2016Dsp lab manual 15 11-2016
Dsp lab manual 15 11-2016
 
Data Structures and Algorithm Analysis
Data Structures  and  Algorithm AnalysisData Structures  and  Algorithm Analysis
Data Structures and Algorithm Analysis
 
Overview of Qiskit Ignis - Struggle with errors -
Overview of Qiskit Ignis   - Struggle with errors - Overview of Qiskit Ignis   - Struggle with errors -
Overview of Qiskit Ignis - Struggle with errors -
 
Stack Hybridization: A Mechanism for Bridging Two Compilation Strategies in a...
Stack Hybridization: A Mechanism for Bridging Two Compilation Strategies in a...Stack Hybridization: A Mechanism for Bridging Two Compilation Strategies in a...
Stack Hybridization: A Mechanism for Bridging Two Compilation Strategies in a...
 
Computer Architecture and Organization
Computer Architecture and OrganizationComputer Architecture and Organization
Computer Architecture and Organization
 
A Configurable and Low Power Hard-Decision Viterbi Decoder in VLSI Architecture
A Configurable and Low Power Hard-Decision Viterbi Decoder in VLSI ArchitectureA Configurable and Low Power Hard-Decision Viterbi Decoder in VLSI Architecture
A Configurable and Low Power Hard-Decision Viterbi Decoder in VLSI Architecture
 
IP Core Design of Hight Lightweight Cipher and its Implementation
IP Core Design of Hight Lightweight Cipher and its Implementation IP Core Design of Hight Lightweight Cipher and its Implementation
IP Core Design of Hight Lightweight Cipher and its Implementation
 
IP CORE DESIGN OF HIGHT LIGHTWEIGHT CIPHER AND ITS IMPLEMENTATION
IP CORE DESIGN OF HIGHT LIGHTWEIGHT CIPHER AND ITS IMPLEMENTATIONIP CORE DESIGN OF HIGHT LIGHTWEIGHT CIPHER AND ITS IMPLEMENTATION
IP CORE DESIGN OF HIGHT LIGHTWEIGHT CIPHER AND ITS IMPLEMENTATION
 
Combining Phase Identification and Statistic Modeling for Automated Parallel ...
Combining Phase Identification and Statistic Modeling for Automated Parallel ...Combining Phase Identification and Statistic Modeling for Automated Parallel ...
Combining Phase Identification and Statistic Modeling for Automated Parallel ...
 
Project Report
Project ReportProject Report
Project Report
 
Gpu workshop cluster universe: scripting cuda
Gpu workshop cluster universe: scripting cudaGpu workshop cluster universe: scripting cuda
Gpu workshop cluster universe: scripting cuda
 
Full stack component of software and middleware for quantum machine
Full stack component of software and middleware for quantum machineFull stack component of software and middleware for quantum machine
Full stack component of software and middleware for quantum machine
 
On Implementation of Neuron Network(Back-propagation)
On Implementation of Neuron Network(Back-propagation)On Implementation of Neuron Network(Back-propagation)
On Implementation of Neuron Network(Back-propagation)
 
The CAOS framework: democratize the acceleration of compute intensive applica...
The CAOS framework: democratize the acceleration of compute intensive applica...The CAOS framework: democratize the acceleration of compute intensive applica...
The CAOS framework: democratize the acceleration of compute intensive applica...
 
Plan_design and FPGA implement of MIMO OFDM SDM systems
Plan_design and FPGA implement of MIMO OFDM SDM systemsPlan_design and FPGA implement of MIMO OFDM SDM systems
Plan_design and FPGA implement of MIMO OFDM SDM systems
 
Python Introduction | JNTUA | R19 | UNIT 1
Python Introduction | JNTUA | R19 | UNIT 1 Python Introduction | JNTUA | R19 | UNIT 1
Python Introduction | JNTUA | R19 | UNIT 1
 
Static analysis of C++ source code
Static analysis of C++ source codeStatic analysis of C++ source code
Static analysis of C++ source code
 
Static analysis of C++ source code
Static analysis of C++ source codeStatic analysis of C++ source code
Static analysis of C++ source code
 

Recently uploaded

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetEnjoy Anytime
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group
 

Recently uploaded (20)

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2
 

Towards Automatic Code Selection with ppOpen-AT: A Case of FDM - Variants of Numerical Computations and Its Impact on a Multi-core Processor -