SlideShare a Scribd company logo
1 of 22
Biological patterns(electronic nose) data classification and
recognition machine learning approaches using GPGPU
Pavels Kartasevs
Msc Applied Bioinformatics course
Cranfield University
Contents
● Electronic nose
● SVM and ANN
● Comparison of developed solution
● Heterogeneous processing
● Results
● Further improvements and conclusions
Problem description
● Prediction tools allows to analyze information
from different sources
● Application: Meat spoilage prediction
● Meat spoilage problem (from manufacturer to
producer)
● Fast enough solution and availability of free
software
Meat spoilage
● Problem that can impact health
● Cause – many different bacteria
● Sensory panel/laboratory analysis
disadvantage
● Automatic analysis tools
Electronic nose
● Wide emerging field of cheap analysis devices
● Can be used for food science
● Automatic food quality determination
Electronic nose in prediction of meat
spoilage
● Electronic nose generates data
● Low cost of the device
● Fast result
● E-nose results interpretation
SVM and Neural networks
● SVM
Support vector machines are relatively new
form of supervised machine learning.
● Artificial neural
networks
Artificial neural network by their model mimics
human brain structure.
Difference between SVM and ANN
● SVM is fast
● Must preform grid
search to find
optimum solution
● Construct
mathematical model
of problem
● ANN learns, opposite
to SVM
● Can work efficiently
than SVM
● Processing speed
depends on neuron
count
SVM Performance comparison
ensembleSVM_Count_ALL.R CPP_BIO Program
0
50
100
150
200
250
300
350
112
10
308
19
"Intel® Xeon(R) CPU X5492 @ 3.40GHz × 8
DDR2 800 Mhz, Ubuntu 64-bit"
Core i5-3210M / 4 Gb DDR3
Minutes
Implementation
● To get such speed all application/algorithm
was reimplemented in C/C++ programming
language which is the fastest programming
language
● LibSVM C/C++ library
Prediction performance of R SVM
1 iterations, C param. from 1 to 50
with step 1, gamma from 0.1 to 10 with
step 0.1, 80 SVM
Time 60 min. Time 6.5 min.
GPU as co-processor
● Gpu is good on
parallel computations
● GPU memory latency
● GPU library call
latency
●
GPU libraries results
Easy-cpu Easy-gpu Svm-train(cpu) Gpusvm-0.2
0
5
10
15
20
25
30
35
Processing time of 2Mb beef_fillets_fitr data
Library
Time(Seconds)
Why is GPU slower?
GPU Ensemble
● Due to small data amount running one SVM
on the GPU in inefficient
● But using GPU structure is making sense to
run ensemble of SVM on the GPU in parallel
Re-implementation of libsvm on the
GPU
● 2 different approaches
Target NVIDIA “FERMI” GPU Target ALL NVIDIA GPU
GPU re-implementation results
● Full GPU processing time: 1 minute vs 12
seconds on the CPU
● As accurate as CPU
GPU re-implementation results(2)
● Heterogeneous GPU processing
cpu gpu
0
0.5
1
1.5
2
2.5
1.8
2
Time (seconds)
GPU implementation is slower by 10%
GPU re-implementation results(3)
30.00%
70.00%
Time performing by CPU to calaulate SVM matrix
SVM Kernel calculation
Other computing
SVM Kernel matrix calculation on GPU saves ~30% of the CPU time, CPU
is free to do other calculations
Solution for Graphical User Interface
1
2
3
3
4
Future improvements
● Further improvements of solution might
include:
● Re-implement solution fully in Java languarge
to make portable and library and platform
independent
● Add Web-interface to the solution
● Write installation application to easy install
solution
Conclusions
● Implemented solution is 10 times faster, than
existing R framework solution
● Graphical interface implemented
● Different analysis types
● Heterogeneous computing
Thank you.
Any questions?

More Related Content

Similar to Msc presentation Bioinformatics

DeepLearningAlgorithmAccelerationOnHardwarePlatforms_V2.0
DeepLearningAlgorithmAccelerationOnHardwarePlatforms_V2.0DeepLearningAlgorithmAccelerationOnHardwarePlatforms_V2.0
DeepLearningAlgorithmAccelerationOnHardwarePlatforms_V2.0
Sahil Kaw
 
Testing and Benchmarking (Joel)
Testing and Benchmarking (Joel)Testing and Benchmarking (Joel)
Testing and Benchmarking (Joel)
iteclearners
 
Presentación GPUs MAEB 2012
Presentación GPUs MAEB 2012Presentación GPUs MAEB 2012
Presentación GPUs MAEB 2012
gustavo_romero
 

Similar to Msc presentation Bioinformatics (20)

20211119 ntuh azure hpc workshop final
20211119 ntuh azure hpc workshop final20211119 ntuh azure hpc workshop final
20211119 ntuh azure hpc workshop final
 
Bioinformatics on GPU
Bioinformatics on GPUBioinformatics on GPU
Bioinformatics on GPU
 
From Hours to Minutes: The Journey of Optimizing Mask-RCNN and BERT Using MXNet
From Hours to Minutes: The Journey of Optimizing Mask-RCNN and BERT Using MXNetFrom Hours to Minutes: The Journey of Optimizing Mask-RCNN and BERT Using MXNet
From Hours to Minutes: The Journey of Optimizing Mask-RCNN and BERT Using MXNet
 
A complete fingerprint matching algorithm on gpu
A complete fingerprint matching algorithm on gpuA complete fingerprint matching algorithm on gpu
A complete fingerprint matching algorithm on gpu
 
AES encryption on modern consumer architectures
AES encryption on modern consumer architecturesAES encryption on modern consumer architectures
AES encryption on modern consumer architectures
 
Monte Carlo on GPUs
Monte Carlo on GPUsMonte Carlo on GPUs
Monte Carlo on GPUs
 
Parallelism in a NumPy-based program
Parallelism in a NumPy-based programParallelism in a NumPy-based program
Parallelism in a NumPy-based program
 
A Framework for Scene Recognition Using Convolutional Neural Network as Featu...
A Framework for Scene Recognition Using Convolutional Neural Network as Featu...A Framework for Scene Recognition Using Convolutional Neural Network as Featu...
A Framework for Scene Recognition Using Convolutional Neural Network as Featu...
 
Machine-Learning-based Performance Heuristics for Runtime CPU/GPU Selection
Machine-Learning-based Performance Heuristics for Runtime CPU/GPU SelectionMachine-Learning-based Performance Heuristics for Runtime CPU/GPU Selection
Machine-Learning-based Performance Heuristics for Runtime CPU/GPU Selection
 
DeepLearningAlgorithmAccelerationOnHardwarePlatforms_V2.0
DeepLearningAlgorithmAccelerationOnHardwarePlatforms_V2.0DeepLearningAlgorithmAccelerationOnHardwarePlatforms_V2.0
DeepLearningAlgorithmAccelerationOnHardwarePlatforms_V2.0
 
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
 
WTF?
WTF?WTF?
WTF?
 
AI Food detector; A model of Generative adversarial network for food Classifier
AI Food detector; A model of Generative adversarial network for food ClassifierAI Food detector; A model of Generative adversarial network for food Classifier
AI Food detector; A model of Generative adversarial network for food Classifier
 
Testing and Benchmarking (Joel)
Testing and Benchmarking (Joel)Testing and Benchmarking (Joel)
Testing and Benchmarking (Joel)
 
Cheap HPC
Cheap HPCCheap HPC
Cheap HPC
 
Benchmark of common AI accelerators: NVIDIA GPU vs. Intel Movidius
Benchmark of common AI accelerators: NVIDIA GPU vs. Intel MovidiusBenchmark of common AI accelerators: NVIDIA GPU vs. Intel Movidius
Benchmark of common AI accelerators: NVIDIA GPU vs. Intel Movidius
 
Presentación GPUs MAEB 2012
Presentación GPUs MAEB 2012Presentación GPUs MAEB 2012
Presentación GPUs MAEB 2012
 
Visual analytics for genomic variants in Jalview 2.11. Jim Procter, U. Dundee...
Visual analytics for genomic variants in Jalview 2.11. Jim Procter, U. Dundee...Visual analytics for genomic variants in Jalview 2.11. Jim Procter, U. Dundee...
Visual analytics for genomic variants in Jalview 2.11. Jim Procter, U. Dundee...
 
20120140505010
2012014050501020120140505010
20120140505010
 
Cache Optimization Techniques for General Purpose Graphic Processing Units
Cache Optimization Techniques for General Purpose Graphic Processing UnitsCache Optimization Techniques for General Purpose Graphic Processing Units
Cache Optimization Techniques for General Purpose Graphic Processing Units
 

Recently uploaded

SPLICE Working Group: Reusable Code Examples
SPLICE Working Group:Reusable Code ExamplesSPLICE Working Group:Reusable Code Examples
SPLICE Working Group: Reusable Code Examples
Peter Brusilovsky
 
Personalisation of Education by AI and Big Data - Lourdes Guàrdia
Personalisation of Education by AI and Big Data - Lourdes GuàrdiaPersonalisation of Education by AI and Big Data - Lourdes Guàrdia
Personalisation of Education by AI and Big Data - Lourdes Guàrdia
EADTU
 
Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPSSpellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
AnaAcapella
 

Recently uploaded (20)

SPLICE Working Group: Reusable Code Examples
SPLICE Working Group:Reusable Code ExamplesSPLICE Working Group:Reusable Code Examples
SPLICE Working Group: Reusable Code Examples
 
Personalisation of Education by AI and Big Data - Lourdes Guàrdia
Personalisation of Education by AI and Big Data - Lourdes GuàrdiaPersonalisation of Education by AI and Big Data - Lourdes Guàrdia
Personalisation of Education by AI and Big Data - Lourdes Guàrdia
 
Observing-Correct-Grammar-in-Making-Definitions.pptx
Observing-Correct-Grammar-in-Making-Definitions.pptxObserving-Correct-Grammar-in-Making-Definitions.pptx
Observing-Correct-Grammar-in-Making-Definitions.pptx
 
Graduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptxGraduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptx
 
Improved Approval Flow in Odoo 17 Studio App
Improved Approval Flow in Odoo 17 Studio AppImproved Approval Flow in Odoo 17 Studio App
Improved Approval Flow in Odoo 17 Studio App
 
PSYPACT- Practicing Over State Lines May 2024.pptx
PSYPACT- Practicing Over State Lines May 2024.pptxPSYPACT- Practicing Over State Lines May 2024.pptx
PSYPACT- Practicing Over State Lines May 2024.pptx
 
AIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.pptAIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.ppt
 
Sternal Fractures & Dislocations - EMGuidewire Radiology Reading Room
Sternal Fractures & Dislocations - EMGuidewire Radiology Reading RoomSternal Fractures & Dislocations - EMGuidewire Radiology Reading Room
Sternal Fractures & Dislocations - EMGuidewire Radiology Reading Room
 
Spring gala 2024 photo slideshow - Celebrating School-Community Partnerships
Spring gala 2024 photo slideshow - Celebrating School-Community PartnershipsSpring gala 2024 photo slideshow - Celebrating School-Community Partnerships
Spring gala 2024 photo slideshow - Celebrating School-Community Partnerships
 
8 Tips for Effective Working Capital Management
8 Tips for Effective Working Capital Management8 Tips for Effective Working Capital Management
8 Tips for Effective Working Capital Management
 
How to Manage Website in Odoo 17 Studio App.pptx
How to Manage Website in Odoo 17 Studio App.pptxHow to Manage Website in Odoo 17 Studio App.pptx
How to Manage Website in Odoo 17 Studio App.pptx
 
An overview of the various scriptures in Hinduism
An overview of the various scriptures in HinduismAn overview of the various scriptures in Hinduism
An overview of the various scriptures in Hinduism
 
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
 
Major project report on Tata Motors and its marketing strategies
Major project report on Tata Motors and its marketing strategiesMajor project report on Tata Motors and its marketing strategies
Major project report on Tata Motors and its marketing strategies
 
UChicago CMSC 23320 - The Best Commit Messages of 2024
UChicago CMSC 23320 - The Best Commit Messages of 2024UChicago CMSC 23320 - The Best Commit Messages of 2024
UChicago CMSC 23320 - The Best Commit Messages of 2024
 
DEMONSTRATION LESSON IN ENGLISH 4 MATATAG CURRICULUM
DEMONSTRATION LESSON IN ENGLISH 4 MATATAG CURRICULUMDEMONSTRATION LESSON IN ENGLISH 4 MATATAG CURRICULUM
DEMONSTRATION LESSON IN ENGLISH 4 MATATAG CURRICULUM
 
Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPSSpellings Wk 4 and Wk 5 for Grade 4 at CAPS
Spellings Wk 4 and Wk 5 for Grade 4 at CAPS
 
Basic Civil Engineering notes on Transportation Engineering & Modes of Transport
Basic Civil Engineering notes on Transportation Engineering & Modes of TransportBasic Civil Engineering notes on Transportation Engineering & Modes of Transport
Basic Civil Engineering notes on Transportation Engineering & Modes of Transport
 
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjj
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjjStl Algorithms in C++ jjjjjjjjjjjjjjjjjj
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjj
 
OS-operating systems- ch05 (CPU Scheduling) ...
OS-operating systems- ch05 (CPU Scheduling) ...OS-operating systems- ch05 (CPU Scheduling) ...
OS-operating systems- ch05 (CPU Scheduling) ...
 

Msc presentation Bioinformatics

  • 1. Biological patterns(electronic nose) data classification and recognition machine learning approaches using GPGPU Pavels Kartasevs Msc Applied Bioinformatics course Cranfield University
  • 2. Contents ● Electronic nose ● SVM and ANN ● Comparison of developed solution ● Heterogeneous processing ● Results ● Further improvements and conclusions
  • 3. Problem description ● Prediction tools allows to analyze information from different sources ● Application: Meat spoilage prediction ● Meat spoilage problem (from manufacturer to producer) ● Fast enough solution and availability of free software
  • 4. Meat spoilage ● Problem that can impact health ● Cause – many different bacteria ● Sensory panel/laboratory analysis disadvantage ● Automatic analysis tools
  • 5. Electronic nose ● Wide emerging field of cheap analysis devices ● Can be used for food science ● Automatic food quality determination
  • 6. Electronic nose in prediction of meat spoilage ● Electronic nose generates data ● Low cost of the device ● Fast result ● E-nose results interpretation
  • 7. SVM and Neural networks ● SVM Support vector machines are relatively new form of supervised machine learning. ● Artificial neural networks Artificial neural network by their model mimics human brain structure.
  • 8. Difference between SVM and ANN ● SVM is fast ● Must preform grid search to find optimum solution ● Construct mathematical model of problem ● ANN learns, opposite to SVM ● Can work efficiently than SVM ● Processing speed depends on neuron count
  • 9. SVM Performance comparison ensembleSVM_Count_ALL.R CPP_BIO Program 0 50 100 150 200 250 300 350 112 10 308 19 "Intel® Xeon(R) CPU X5492 @ 3.40GHz × 8 DDR2 800 Mhz, Ubuntu 64-bit" Core i5-3210M / 4 Gb DDR3 Minutes
  • 10. Implementation ● To get such speed all application/algorithm was reimplemented in C/C++ programming language which is the fastest programming language ● LibSVM C/C++ library
  • 11. Prediction performance of R SVM 1 iterations, C param. from 1 to 50 with step 1, gamma from 0.1 to 10 with step 0.1, 80 SVM Time 60 min. Time 6.5 min.
  • 12. GPU as co-processor ● Gpu is good on parallel computations ● GPU memory latency ● GPU library call latency ●
  • 13. GPU libraries results Easy-cpu Easy-gpu Svm-train(cpu) Gpusvm-0.2 0 5 10 15 20 25 30 35 Processing time of 2Mb beef_fillets_fitr data Library Time(Seconds) Why is GPU slower?
  • 14. GPU Ensemble ● Due to small data amount running one SVM on the GPU in inefficient ● But using GPU structure is making sense to run ensemble of SVM on the GPU in parallel
  • 15. Re-implementation of libsvm on the GPU ● 2 different approaches Target NVIDIA “FERMI” GPU Target ALL NVIDIA GPU
  • 16. GPU re-implementation results ● Full GPU processing time: 1 minute vs 12 seconds on the CPU ● As accurate as CPU
  • 17. GPU re-implementation results(2) ● Heterogeneous GPU processing cpu gpu 0 0.5 1 1.5 2 2.5 1.8 2 Time (seconds) GPU implementation is slower by 10%
  • 18. GPU re-implementation results(3) 30.00% 70.00% Time performing by CPU to calaulate SVM matrix SVM Kernel calculation Other computing SVM Kernel matrix calculation on GPU saves ~30% of the CPU time, CPU is free to do other calculations
  • 19. Solution for Graphical User Interface 1 2 3 3 4
  • 20. Future improvements ● Further improvements of solution might include: ● Re-implement solution fully in Java languarge to make portable and library and platform independent ● Add Web-interface to the solution ● Write installation application to easy install solution
  • 21. Conclusions ● Implemented solution is 10 times faster, than existing R framework solution ● Graphical interface implemented ● Different analysis types ● Heterogeneous computing