SlideShare a Scribd company logo
Università degli Studi di Napoli “Federico II”

CLASSE DELLE LAUREE MAGISTRALI IN INGEGNERIA ELETTRONICA, LM-29
TESI DI LAUREA

Accelerating NMR
via NUFFT algorithms on GPUs
Supervisors:

Candidate:

Prof. Amedeo Capozzoli

Federico Maria Ariù
matr. M61/000013

Ing. Claudio Curcio
Ing. Angelo Liseno
Anno accademico 2012/2013
MRI
Magnetic Resonance Imaging (MRI) is a medical diagnostic technique based on
the physical principle of nuclear magnetic resonance and allows to reconstruct
images of the human body.

MRI has become one of the most used medical diagnostic modalities and it is
used for many purposes, e.g. brain and orthopedic surveys.

Accelerating NMR via NUFFT algorithms on GPUs
Sampling and processing speed in MRI
In many recent applications the need for decrease sampling and processing
times arises as much as possible.

Decreasing the required times for the patient
immobility

Monitoring activities and reactions of patients (e.g.
functional MRI, fMRI)
Continuous monitoring of patient’s conditions
during surgery

Accelerating NMR via NUFFT algorithms on GPUs
Factors of sampling and processing speed

1- ACQUISITION METHODS

2- PROCESSING ALGORITHMS

3-PROCESSING HARDWARE

Accelerating NMR via NUFFT algorithms on GPUs
Thesis statement

The purpose of the thesis was to implement an MRI algorithm based on the
suitable use of parallel computing systems (Graphic Processing Unit, or GPU)
and fast processing algorithms able to handle non-standard acquired data.

Accelerating NMR via NUFFT algorithms on GPUs
MRI reconstruction
MRI system’s data-unknown relation is :
m(k x (t ), k y (t ))  S ( x, y) ( x, y)e

 j 2 [ k x ( t ) x  k y ( t ) y ]

- Cartesian grid discretization
- Time discretization
m(k x (tk ), k y (tk ))  xy  S ( x p , yq )  ( x p , yq )e
p

dxdy
m: measured data
S: sensitivity map
ρ: spin density
t: time parameter

 j 2 [ k x ( t k ) x p  k y ( t k ) yq ]

q

Data are linked to spin density through sensitivity map as a Fourier transform relation.
Data are the sampled values of the spectrum on the (Kx(t), Ky(t)) trajectory described by
varying time

Trasformata di Fourier

The relation between ρ and m could be written as a Discrete Fourier Transform (DFT)
Accelerating NMR via NUFFT algorithms on GPUs
Fast sampling
It is possible to use several data sampling method:
Cartesian

-

slow

+

straightforward
processing

Uniform DFT

(k x (t ), k y (t )) curve

Radial

-

elaborate
processing

Spiral

+

fast

-

elaborate
processing

+

fast

Non Uniform DFT (NUDFT)

About cartesian sampling, processing can be attributed to a uniform DFT
accelerated by a standard FFT algorithm in order to obtain a time complexity
O(NlogN)
Accelerating NMR via NUFFT algorithms on GPUs
Fast data processing
More sophisticated algorithms allowing to use faster sampling methods are
required.

From a numerical point of view, Non Uniform Fast Fourier Transform (NUFFT)
algorithms restore time complexity of FFT algorithms.

3 algorithms NUFFT types can be characterized:
-NUFFT of type 1 – NED (Non-Equispaced Data)
-NUFFT of type 2 – NER (Non-Equispaced Results)
-NUFFT of type 3 – (Non-Equispaced Data and Results)

Accelerating NMR via NUFFT algorithms on GPUs
Fast data processing
MRI system’s data-unknown relation is :
m(k x (tk ), k y (tk ))  xy  S ( x p , yq )  ( x p , yq ) e

 j 2 [ k x ( t k ) x p  k y ( t k ) y q ]

p

q

m: measured data
S: sensitivity map
ρ: spin density
t: time parameter

The relation above can be rewritten as a system
of linear equations:

A  m
(NER-NUFFT)





A A  A m
(NED-NUFFT)

where

A (m)   mi e


 j 2 [ k x ( t ) x p  k y ( t ) y q ]

i

In order to reverse the relation, multiplying the system by the adjoint matrix,
depending on the chosen sampling method, matrix-vector multiplications can be
more rapidly processed by using NER and NED NUFFT algorithms.
Such system of linear equations has been solved by applying the biconjugate
gradient stabilized method (BICGSTAB).
Accelerating NMR via NUFFT algorithms on GPUs
GPU implementation
An MRI algorithm based on a BICGSTAB routine using NER and NED NUFFT was
developed both for the GPU and CPU version. Firstly the algorithm was implemented in
Matlab language, after in C language and finally in CUDA C for the GPU version.

Matlab

C

CUDA C

The convenience of a GPU-implemented algorithm is the data parallelism, that is the
possibility to solve simultaneously arithmetic operations on various data, obtaining
better performance in terms of speed and processing.

GPU acceleration was executed on Nvidia Tesla Kepler K20c.

Accelerating NMR via NUFFT algorithms on GPUs
MRI numerical results
Processing times of MRI application were validated using simulated data from
a nonuniform (spyral type) sampling method.
The image reconsruction pointed out the same accuracy shown in the
literature [“A fast wavelet-based reconstruction method for MRI”, by
Guerquin-Kern, IEEE Transaction on Medical Imaging] linked to the simulator.

Sampling grid

Phantom reconstruction

Computation time[sec]

CPU 196
GPU 3

speedup

TimeCPU
 65
TimeGPU

Accelerating NMR via NUFFT algorithms on GPUs
Non Uniform Fast Fourier Transform of
type 3 (NUFFT-3)
NUFFT-3 allows to use a multi resolution approach on the MRI application when the
examination with more details of certain areas rather than others is required.
N 1

F ( sk , t k )   f j e

isk x j

e

t k y j

j 0

NUFFT -3 2D steps – Greengard&Lee approach:
N 1

f x y (n1x, n2 y )   f j g  x y (n1x  x j , n2 y  y j )

1. Projection of the samples on a uniform grid
 x y

f x y

2. Precompensation
3. FFT

 

F x yx y (m1s, m2 t ) 

xy
2

(n1x, n2 y ) 

( M r X 1) ( M r y 1)
2
2



n1  

M rX
2



n2  

M ry

j 0

1

 x ( n1 x ) 2 y ( n2  y ) 2

4 x y

e

f x y (n1x.n2 y )

 

f x yx y (n1x, n2 y)e im1n1 x  s e im2 n2yt

2
sp
st
 
F x yx y (m1s, m2t ) g x y (m1s  sk ,m2t  tk )
2 m1 ,m2  msp

m

4. Spectrum reconstruction on irregular frequencies
5. Postcompensation

F x y ( sk , tk ) 

F ( sk , t k ) 

1
4 x y

t 2

e sk x e k y F x y ( sk , t k )
2

Accelerating NMR via NUFFT algorithms on GPUs
NUFFT-3 execution times:
CPU vs GPU
Tests on the execution times of the NUFFT-3, for the GPU and CPU version,
were carried out by varying the number of samples.

speedupNUFFT 32 D 

TimeCPU
 20
TimeGPU
Accelerating NMR via NUFFT algorithms on GPUs
Conclusions
An MRI reconstruction algorithm from simulated data sampled on a spiral grid
was implemented.
The convenience of using NER and NED NUFFT algorithms was demonstrated.
MRI processing was entirely developed on GPU.
A two-dimensional NUFFT-3 for the processing of nonuniform samples for multi
resolution imaging applications was implemented.
The performance of the NUFFT-3 implemented algorithm was verified.

Accelerating NMR via NUFFT algorithms on GPUs
Thank you for your time

Accelerating NMR via NUFFT algorithms on GPUs

More Related Content

What's hot

DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (...
DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (...DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (...
DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (...
IJDKP
 
thesis
thesisthesis
APPLYING DYNAMIC MODEL FOR MULTIPLE MANOEUVRING TARGET TRACKING USING PARTICL...
APPLYING DYNAMIC MODEL FOR MULTIPLE MANOEUVRING TARGET TRACKING USING PARTICL...APPLYING DYNAMIC MODEL FOR MULTIPLE MANOEUVRING TARGET TRACKING USING PARTICL...
APPLYING DYNAMIC MODEL FOR MULTIPLE MANOEUVRING TARGET TRACKING USING PARTICL...
IJITCA Journal
 
Problem Decomposition and Information Minimization for the Global, Concurrent...
Problem Decomposition and Information Minimization for the Global, Concurrent...Problem Decomposition and Information Minimization for the Global, Concurrent...
Problem Decomposition and Information Minimization for the Global, Concurrent...
gerogepatton
 
Vibration Analysis and Modelling of a Cantilever Beam
Vibration Analysis and Modelling of a Cantilever Beam Vibration Analysis and Modelling of a Cantilever Beam
Vibration Analysis and Modelling of a Cantilever Beam
Muhammad Usman
 
Multi-core GPU – Fast parallel SAR image generation
Multi-core GPU – Fast parallel SAR image generationMulti-core GPU – Fast parallel SAR image generation
Multi-core GPU – Fast parallel SAR image generation
Mahesh Khadatare
 
presentation_btp
presentation_btppresentation_btp
presentation_btp
Anshul Goyal, EIT
 
Master's thesis
Master's thesisMaster's thesis
Master's thesis
Smrati Kumar Katiyar
 
Optical sensing techniques and signal processing 5
Optical sensing techniques and signal processing 5Optical sensing techniques and signal processing 5
Optical sensing techniques and signal processing 5
ali alavi
 
UNF Undergrad Physics
UNF Undergrad PhysicsUNF Undergrad Physics
UNF Undergrad Physics
Nick Kypreos
 
Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material Design
aimsnist
 
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
KAMAL CHOUDHARY
 
Molecular Activity Prediction Using Graph Convolutional Deep Neural Network C...
Molecular Activity Prediction Using Graph Convolutional Deep Neural Network C...Molecular Activity Prediction Using Graph Convolutional Deep Neural Network C...
Molecular Activity Prediction Using Graph Convolutional Deep Neural Network C...
Masahito Ohue
 
Ahi terahertz 1
Ahi terahertz 1Ahi terahertz 1
Ahi terahertz 1
Kiarash Ahi
 
Computational methods and vibrational properties applied to materials modeling
Computational methods and vibrational properties applied to materials modelingComputational methods and vibrational properties applied to materials modeling
Computational methods and vibrational properties applied to materials modeling
cippo1987Ita
 
Quantum Computation for Predicting Electron and Phonon Properties of Solids
Quantum Computation for Predicting Electron and Phonon Properties of SolidsQuantum Computation for Predicting Electron and Phonon Properties of Solids
Quantum Computation for Predicting Electron and Phonon Properties of Solids
KAMAL CHOUDHARY
 
Gpuslides
GpuslidesGpuslides
Gpuslides
Storti Mario
 
AbratenkoTalk
AbratenkoTalkAbratenkoTalk
AbratenkoTalk
Polina Abratenko
 
temporal_analysis_forest_cover_using_hmm.ppt
temporal_analysis_forest_cover_using_hmm.ppttemporal_analysis_forest_cover_using_hmm.ppt
temporal_analysis_forest_cover_using_hmm.ppt
grssieee
 
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
KAMAL CHOUDHARY
 

What's hot (20)

DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (...
DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (...DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (...
DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (...
 
thesis
thesisthesis
thesis
 
APPLYING DYNAMIC MODEL FOR MULTIPLE MANOEUVRING TARGET TRACKING USING PARTICL...
APPLYING DYNAMIC MODEL FOR MULTIPLE MANOEUVRING TARGET TRACKING USING PARTICL...APPLYING DYNAMIC MODEL FOR MULTIPLE MANOEUVRING TARGET TRACKING USING PARTICL...
APPLYING DYNAMIC MODEL FOR MULTIPLE MANOEUVRING TARGET TRACKING USING PARTICL...
 
Problem Decomposition and Information Minimization for the Global, Concurrent...
Problem Decomposition and Information Minimization for the Global, Concurrent...Problem Decomposition and Information Minimization for the Global, Concurrent...
Problem Decomposition and Information Minimization for the Global, Concurrent...
 
Vibration Analysis and Modelling of a Cantilever Beam
Vibration Analysis and Modelling of a Cantilever Beam Vibration Analysis and Modelling of a Cantilever Beam
Vibration Analysis and Modelling of a Cantilever Beam
 
Multi-core GPU – Fast parallel SAR image generation
Multi-core GPU – Fast parallel SAR image generationMulti-core GPU – Fast parallel SAR image generation
Multi-core GPU – Fast parallel SAR image generation
 
presentation_btp
presentation_btppresentation_btp
presentation_btp
 
Master's thesis
Master's thesisMaster's thesis
Master's thesis
 
Optical sensing techniques and signal processing 5
Optical sensing techniques and signal processing 5Optical sensing techniques and signal processing 5
Optical sensing techniques and signal processing 5
 
UNF Undergrad Physics
UNF Undergrad PhysicsUNF Undergrad Physics
UNF Undergrad Physics
 
Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material Design
 
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
 
Molecular Activity Prediction Using Graph Convolutional Deep Neural Network C...
Molecular Activity Prediction Using Graph Convolutional Deep Neural Network C...Molecular Activity Prediction Using Graph Convolutional Deep Neural Network C...
Molecular Activity Prediction Using Graph Convolutional Deep Neural Network C...
 
Ahi terahertz 1
Ahi terahertz 1Ahi terahertz 1
Ahi terahertz 1
 
Computational methods and vibrational properties applied to materials modeling
Computational methods and vibrational properties applied to materials modelingComputational methods and vibrational properties applied to materials modeling
Computational methods and vibrational properties applied to materials modeling
 
Quantum Computation for Predicting Electron and Phonon Properties of Solids
Quantum Computation for Predicting Electron and Phonon Properties of SolidsQuantum Computation for Predicting Electron and Phonon Properties of Solids
Quantum Computation for Predicting Electron and Phonon Properties of Solids
 
Gpuslides
GpuslidesGpuslides
Gpuslides
 
AbratenkoTalk
AbratenkoTalkAbratenkoTalk
AbratenkoTalk
 
temporal_analysis_forest_cover_using_hmm.ppt
temporal_analysis_forest_cover_using_hmm.ppttemporal_analysis_forest_cover_using_hmm.ppt
temporal_analysis_forest_cover_using_hmm.ppt
 
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
 

Similar to Accelerating NMR via NUFFT algorithms on GPUs

Photoacoustic tomography based on the application of virtual detectors
Photoacoustic tomography based on the application of virtual detectorsPhotoacoustic tomography based on the application of virtual detectors
Photoacoustic tomography based on the application of virtual detectors
IAEME Publication
 
G234247
G234247G234247
Ml srhwt-machine-learning-based-superlative-rapid-haar-wavelet-transformation...
Ml srhwt-machine-learning-based-superlative-rapid-haar-wavelet-transformation...Ml srhwt-machine-learning-based-superlative-rapid-haar-wavelet-transformation...
Ml srhwt-machine-learning-based-superlative-rapid-haar-wavelet-transformation...
Jumlesha Shaik
 
Cyclostationary analysis of polytime coded signals for lpi radars
Cyclostationary analysis of polytime coded signals for lpi radarsCyclostationary analysis of polytime coded signals for lpi radars
Cyclostationary analysis of polytime coded signals for lpi radars
eSAT Journals
 
Consistent Nonparametric Spectrum Estimation Via Cepstrum Thresholding
Consistent Nonparametric Spectrum Estimation Via Cepstrum ThresholdingConsistent Nonparametric Spectrum Estimation Via Cepstrum Thresholding
Consistent Nonparametric Spectrum Estimation Via Cepstrum Thresholding
CSCJournals
 
Application of lasers
Application of lasersApplication of lasers
Application of lasers
Alexander Decker
 
Adaptive and inteligence
Adaptive and inteligenceAdaptive and inteligence
Adaptive and inteligence
FinitoTheEnd
 
Performance analysis of compressive sensing recovery algorithms for image pr...
Performance analysis of compressive sensing recovery  algorithms for image pr...Performance analysis of compressive sensing recovery  algorithms for image pr...
Performance analysis of compressive sensing recovery algorithms for image pr...
IJECEIAES
 
First and second order semi-Markov chains for wind speed modeling
First and second order semi-Markov chains for wind speed modelingFirst and second order semi-Markov chains for wind speed modeling
First and second order semi-Markov chains for wind speed modeling
Nozir Shokirov
 
INVERSIONOF MAGNETIC ANOMALIES DUE TO 2-D CYLINDRICAL STRUCTURES –BY AN ARTIF...
INVERSIONOF MAGNETIC ANOMALIES DUE TO 2-D CYLINDRICAL STRUCTURES –BY AN ARTIF...INVERSIONOF MAGNETIC ANOMALIES DUE TO 2-D CYLINDRICAL STRUCTURES –BY AN ARTIF...
INVERSIONOF MAGNETIC ANOMALIES DUE TO 2-D CYLINDRICAL STRUCTURES –BY AN ARTIF...
ijsc
 
INVERSIONOF MAGNETIC ANOMALIES DUE TO 2-D CYLINDRICAL STRUCTURES –BY AN ARTIF...
INVERSIONOF MAGNETIC ANOMALIES DUE TO 2-D CYLINDRICAL STRUCTURES –BY AN ARTIF...INVERSIONOF MAGNETIC ANOMALIES DUE TO 2-D CYLINDRICAL STRUCTURES –BY AN ARTIF...
INVERSIONOF MAGNETIC ANOMALIES DUE TO 2-D CYLINDRICAL STRUCTURES –BY AN ARTIF...
ijsc
 
Inversion of Magnetic Anomalies Due to 2-D Cylindrical Structures – By an Art...
Inversion of Magnetic Anomalies Due to 2-D Cylindrical Structures – By an Art...Inversion of Magnetic Anomalies Due to 2-D Cylindrical Structures – By an Art...
Inversion of Magnetic Anomalies Due to 2-D Cylindrical Structures – By an Art...
ijsc
 
IEEE International Conference Presentation
IEEE International Conference PresentationIEEE International Conference Presentation
IEEE International Conference Presentation
Anmol Dwivedi
 
1789 1800
1789 18001789 1800
1789 1800
Editor IJARCET
 
1789 1800
1789 18001789 1800
1789 1800
Editor IJARCET
 
Using Subspace Pursuit Algorithm to Improve Performance of the Distributed Co...
Using Subspace Pursuit Algorithm to Improve Performance of the Distributed Co...Using Subspace Pursuit Algorithm to Improve Performance of the Distributed Co...
Using Subspace Pursuit Algorithm to Improve Performance of the Distributed Co...
Polytechnique Montreal
 
IOMAC2009
IOMAC2009IOMAC2009
IOMAC2009
M Jafari
 
A New Enhanced Method of Non Parametric power spectrum Estimation.
A New Enhanced Method of Non Parametric power spectrum Estimation.A New Enhanced Method of Non Parametric power spectrum Estimation.
A New Enhanced Method of Non Parametric power spectrum Estimation.
CSCJournals
 
SPECTRAL ESTIMATE FOR STABLE SIGNALS WITH P-ADIC TIME AND OPTIMAL SELECTION O...
SPECTRAL ESTIMATE FOR STABLE SIGNALS WITH P-ADIC TIME AND OPTIMAL SELECTION O...SPECTRAL ESTIMATE FOR STABLE SIGNALS WITH P-ADIC TIME AND OPTIMAL SELECTION O...
SPECTRAL ESTIMATE FOR STABLE SIGNALS WITH P-ADIC TIME AND OPTIMAL SELECTION O...
sipij
 
KZ Spatial Waves Separations
KZ Spatial Waves SeparationsKZ Spatial Waves Separations
KZ Spatial Waves Separations
QUESTJOURNAL
 

Similar to Accelerating NMR via NUFFT algorithms on GPUs (20)

Photoacoustic tomography based on the application of virtual detectors
Photoacoustic tomography based on the application of virtual detectorsPhotoacoustic tomography based on the application of virtual detectors
Photoacoustic tomography based on the application of virtual detectors
 
G234247
G234247G234247
G234247
 
Ml srhwt-machine-learning-based-superlative-rapid-haar-wavelet-transformation...
Ml srhwt-machine-learning-based-superlative-rapid-haar-wavelet-transformation...Ml srhwt-machine-learning-based-superlative-rapid-haar-wavelet-transformation...
Ml srhwt-machine-learning-based-superlative-rapid-haar-wavelet-transformation...
 
Cyclostationary analysis of polytime coded signals for lpi radars
Cyclostationary analysis of polytime coded signals for lpi radarsCyclostationary analysis of polytime coded signals for lpi radars
Cyclostationary analysis of polytime coded signals for lpi radars
 
Consistent Nonparametric Spectrum Estimation Via Cepstrum Thresholding
Consistent Nonparametric Spectrum Estimation Via Cepstrum ThresholdingConsistent Nonparametric Spectrum Estimation Via Cepstrum Thresholding
Consistent Nonparametric Spectrum Estimation Via Cepstrum Thresholding
 
Application of lasers
Application of lasersApplication of lasers
Application of lasers
 
Adaptive and inteligence
Adaptive and inteligenceAdaptive and inteligence
Adaptive and inteligence
 
Performance analysis of compressive sensing recovery algorithms for image pr...
Performance analysis of compressive sensing recovery  algorithms for image pr...Performance analysis of compressive sensing recovery  algorithms for image pr...
Performance analysis of compressive sensing recovery algorithms for image pr...
 
First and second order semi-Markov chains for wind speed modeling
First and second order semi-Markov chains for wind speed modelingFirst and second order semi-Markov chains for wind speed modeling
First and second order semi-Markov chains for wind speed modeling
 
INVERSIONOF MAGNETIC ANOMALIES DUE TO 2-D CYLINDRICAL STRUCTURES –BY AN ARTIF...
INVERSIONOF MAGNETIC ANOMALIES DUE TO 2-D CYLINDRICAL STRUCTURES –BY AN ARTIF...INVERSIONOF MAGNETIC ANOMALIES DUE TO 2-D CYLINDRICAL STRUCTURES –BY AN ARTIF...
INVERSIONOF MAGNETIC ANOMALIES DUE TO 2-D CYLINDRICAL STRUCTURES –BY AN ARTIF...
 
INVERSIONOF MAGNETIC ANOMALIES DUE TO 2-D CYLINDRICAL STRUCTURES –BY AN ARTIF...
INVERSIONOF MAGNETIC ANOMALIES DUE TO 2-D CYLINDRICAL STRUCTURES –BY AN ARTIF...INVERSIONOF MAGNETIC ANOMALIES DUE TO 2-D CYLINDRICAL STRUCTURES –BY AN ARTIF...
INVERSIONOF MAGNETIC ANOMALIES DUE TO 2-D CYLINDRICAL STRUCTURES –BY AN ARTIF...
 
Inversion of Magnetic Anomalies Due to 2-D Cylindrical Structures – By an Art...
Inversion of Magnetic Anomalies Due to 2-D Cylindrical Structures – By an Art...Inversion of Magnetic Anomalies Due to 2-D Cylindrical Structures – By an Art...
Inversion of Magnetic Anomalies Due to 2-D Cylindrical Structures – By an Art...
 
IEEE International Conference Presentation
IEEE International Conference PresentationIEEE International Conference Presentation
IEEE International Conference Presentation
 
1789 1800
1789 18001789 1800
1789 1800
 
1789 1800
1789 18001789 1800
1789 1800
 
Using Subspace Pursuit Algorithm to Improve Performance of the Distributed Co...
Using Subspace Pursuit Algorithm to Improve Performance of the Distributed Co...Using Subspace Pursuit Algorithm to Improve Performance of the Distributed Co...
Using Subspace Pursuit Algorithm to Improve Performance of the Distributed Co...
 
IOMAC2009
IOMAC2009IOMAC2009
IOMAC2009
 
A New Enhanced Method of Non Parametric power spectrum Estimation.
A New Enhanced Method of Non Parametric power spectrum Estimation.A New Enhanced Method of Non Parametric power spectrum Estimation.
A New Enhanced Method of Non Parametric power spectrum Estimation.
 
SPECTRAL ESTIMATE FOR STABLE SIGNALS WITH P-ADIC TIME AND OPTIMAL SELECTION O...
SPECTRAL ESTIMATE FOR STABLE SIGNALS WITH P-ADIC TIME AND OPTIMAL SELECTION O...SPECTRAL ESTIMATE FOR STABLE SIGNALS WITH P-ADIC TIME AND OPTIMAL SELECTION O...
SPECTRAL ESTIMATE FOR STABLE SIGNALS WITH P-ADIC TIME AND OPTIMAL SELECTION O...
 
KZ Spatial Waves Separations
KZ Spatial Waves SeparationsKZ Spatial Waves Separations
KZ Spatial Waves Separations
 

Recently uploaded

“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website
Pixlogix Infotech
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
TIPNGVN2
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 

Recently uploaded (20)

“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 

Accelerating NMR via NUFFT algorithms on GPUs

  • 1. Università degli Studi di Napoli “Federico II” CLASSE DELLE LAUREE MAGISTRALI IN INGEGNERIA ELETTRONICA, LM-29 TESI DI LAUREA Accelerating NMR via NUFFT algorithms on GPUs Supervisors: Candidate: Prof. Amedeo Capozzoli Federico Maria Ariù matr. M61/000013 Ing. Claudio Curcio Ing. Angelo Liseno Anno accademico 2012/2013
  • 2. MRI Magnetic Resonance Imaging (MRI) is a medical diagnostic technique based on the physical principle of nuclear magnetic resonance and allows to reconstruct images of the human body. MRI has become one of the most used medical diagnostic modalities and it is used for many purposes, e.g. brain and orthopedic surveys. Accelerating NMR via NUFFT algorithms on GPUs
  • 3. Sampling and processing speed in MRI In many recent applications the need for decrease sampling and processing times arises as much as possible. Decreasing the required times for the patient immobility Monitoring activities and reactions of patients (e.g. functional MRI, fMRI) Continuous monitoring of patient’s conditions during surgery Accelerating NMR via NUFFT algorithms on GPUs
  • 4. Factors of sampling and processing speed 1- ACQUISITION METHODS 2- PROCESSING ALGORITHMS 3-PROCESSING HARDWARE Accelerating NMR via NUFFT algorithms on GPUs
  • 5. Thesis statement The purpose of the thesis was to implement an MRI algorithm based on the suitable use of parallel computing systems (Graphic Processing Unit, or GPU) and fast processing algorithms able to handle non-standard acquired data. Accelerating NMR via NUFFT algorithms on GPUs
  • 6. MRI reconstruction MRI system’s data-unknown relation is : m(k x (t ), k y (t ))  S ( x, y) ( x, y)e  j 2 [ k x ( t ) x  k y ( t ) y ] - Cartesian grid discretization - Time discretization m(k x (tk ), k y (tk ))  xy  S ( x p , yq )  ( x p , yq )e p dxdy m: measured data S: sensitivity map ρ: spin density t: time parameter  j 2 [ k x ( t k ) x p  k y ( t k ) yq ] q Data are linked to spin density through sensitivity map as a Fourier transform relation. Data are the sampled values of the spectrum on the (Kx(t), Ky(t)) trajectory described by varying time Trasformata di Fourier The relation between ρ and m could be written as a Discrete Fourier Transform (DFT) Accelerating NMR via NUFFT algorithms on GPUs
  • 7. Fast sampling It is possible to use several data sampling method: Cartesian - slow + straightforward processing Uniform DFT (k x (t ), k y (t )) curve Radial - elaborate processing Spiral + fast - elaborate processing + fast Non Uniform DFT (NUDFT) About cartesian sampling, processing can be attributed to a uniform DFT accelerated by a standard FFT algorithm in order to obtain a time complexity O(NlogN) Accelerating NMR via NUFFT algorithms on GPUs
  • 8. Fast data processing More sophisticated algorithms allowing to use faster sampling methods are required. From a numerical point of view, Non Uniform Fast Fourier Transform (NUFFT) algorithms restore time complexity of FFT algorithms. 3 algorithms NUFFT types can be characterized: -NUFFT of type 1 – NED (Non-Equispaced Data) -NUFFT of type 2 – NER (Non-Equispaced Results) -NUFFT of type 3 – (Non-Equispaced Data and Results) Accelerating NMR via NUFFT algorithms on GPUs
  • 9. Fast data processing MRI system’s data-unknown relation is : m(k x (tk ), k y (tk ))  xy  S ( x p , yq )  ( x p , yq ) e  j 2 [ k x ( t k ) x p  k y ( t k ) y q ] p q m: measured data S: sensitivity map ρ: spin density t: time parameter The relation above can be rewritten as a system of linear equations: A  m (NER-NUFFT)   A A  A m (NED-NUFFT) where A (m)   mi e   j 2 [ k x ( t ) x p  k y ( t ) y q ] i In order to reverse the relation, multiplying the system by the adjoint matrix, depending on the chosen sampling method, matrix-vector multiplications can be more rapidly processed by using NER and NED NUFFT algorithms. Such system of linear equations has been solved by applying the biconjugate gradient stabilized method (BICGSTAB). Accelerating NMR via NUFFT algorithms on GPUs
  • 10. GPU implementation An MRI algorithm based on a BICGSTAB routine using NER and NED NUFFT was developed both for the GPU and CPU version. Firstly the algorithm was implemented in Matlab language, after in C language and finally in CUDA C for the GPU version. Matlab C CUDA C The convenience of a GPU-implemented algorithm is the data parallelism, that is the possibility to solve simultaneously arithmetic operations on various data, obtaining better performance in terms of speed and processing. GPU acceleration was executed on Nvidia Tesla Kepler K20c. Accelerating NMR via NUFFT algorithms on GPUs
  • 11. MRI numerical results Processing times of MRI application were validated using simulated data from a nonuniform (spyral type) sampling method. The image reconsruction pointed out the same accuracy shown in the literature [“A fast wavelet-based reconstruction method for MRI”, by Guerquin-Kern, IEEE Transaction on Medical Imaging] linked to the simulator. Sampling grid Phantom reconstruction Computation time[sec] CPU 196 GPU 3 speedup TimeCPU  65 TimeGPU Accelerating NMR via NUFFT algorithms on GPUs
  • 12. Non Uniform Fast Fourier Transform of type 3 (NUFFT-3) NUFFT-3 allows to use a multi resolution approach on the MRI application when the examination with more details of certain areas rather than others is required. N 1 F ( sk , t k )   f j e isk x j e t k y j j 0 NUFFT -3 2D steps – Greengard&Lee approach: N 1 f x y (n1x, n2 y )   f j g  x y (n1x  x j , n2 y  y j ) 1. Projection of the samples on a uniform grid  x y f x y 2. Precompensation 3. FFT   F x yx y (m1s, m2 t )  xy 2 (n1x, n2 y )  ( M r X 1) ( M r y 1) 2 2  n1   M rX 2  n2   M ry j 0 1  x ( n1 x ) 2 y ( n2  y ) 2 4 x y e f x y (n1x.n2 y )   f x yx y (n1x, n2 y)e im1n1 x  s e im2 n2yt 2 sp st   F x yx y (m1s, m2t ) g x y (m1s  sk ,m2t  tk ) 2 m1 ,m2  msp m 4. Spectrum reconstruction on irregular frequencies 5. Postcompensation F x y ( sk , tk )  F ( sk , t k )  1 4 x y t 2 e sk x e k y F x y ( sk , t k ) 2 Accelerating NMR via NUFFT algorithms on GPUs
  • 13. NUFFT-3 execution times: CPU vs GPU Tests on the execution times of the NUFFT-3, for the GPU and CPU version, were carried out by varying the number of samples. speedupNUFFT 32 D  TimeCPU  20 TimeGPU Accelerating NMR via NUFFT algorithms on GPUs
  • 14. Conclusions An MRI reconstruction algorithm from simulated data sampled on a spiral grid was implemented. The convenience of using NER and NED NUFFT algorithms was demonstrated. MRI processing was entirely developed on GPU. A two-dimensional NUFFT-3 for the processing of nonuniform samples for multi resolution imaging applications was implemented. The performance of the NUFFT-3 implemented algorithm was verified. Accelerating NMR via NUFFT algorithms on GPUs
  • 15. Thank you for your time Accelerating NMR via NUFFT algorithms on GPUs