SlideShare a Scribd company logo
Study on Data Augmentation Methods for
Sonar Image Analysis
Hokkaido University
Graduate School of Information Science and Technology
Harmonious Systems Engineering Laboratory
MIN JIE
0
Background 2
• For aquatic resources management, catching
restricted fish species should be avoided.
• Electromagnetic wave is interfered severely by
environment noise, so sonar image is commonly
used for solving fishing problems.
• Neural network has been used in sonar image to
find specific kind of fish.[1]
• Data augmentation can improve performance of
neural network.
[1]平間 友大, 横山 想一郎, 山下 倫央, 川村 秀憲,鈴木 恵二,和田 雅昭 : CNNを用いた音響画像に基づく
定置網内の魚種推定の精度向上, 第18回情報科学技術フォーラム(FIT2019), CF-009, 岡山(2019)
①
②
③
Echo sonder
Introduction of prediction system 3
Convolutional
neural
networkSegmentation
Determine whether cast nets
Output
percentage of
target fish
existance
Threshold
• System of avoiding restricted fish species for the
management of aquatic resources
Sonar images
from echo
sounder Input
Prediction
Motivation
• Sonar images have different composition compared
with real world images.
• Different tricks for neural network.
– Data augmentations (random flip, translate)
– Change the structure of the network
– Add Dropout into the network
– Fine tuning and add weight decay
• Selecting and combining basic methods in different
ways such as AutoAugment[2] and RandAugment[3]
from google research achieved best performance.
3
[3]RandAugment: Ekin D. Cubuk, Barret Zoph, Jonathon Shlens, Quoc V.
Le.’’Practical automated data augmentation with a reduced search space’’
[2]AutoAugment: Ekin D. Cubuk, Barret Zoph, Dandelion Mane, Vijay
Vasudevan, Quoc V. Le. “Learning Augmentation Policies from Data”
Purpose
• Avoid catching restricted fish species for aquatic
resources management.
• Make a proposal of data augmentation methods
for sonar image to improve the performance of
prediction system.
– RandAugment perform well for natural image.
– Could be used for sonar image.
• Figure out fitness of basic methods used in
RandAugment for sonar image according to their
variations with original images.
5
Introduction of sonar image
• How to collect sonar image data
– Echo sounder
• Emits a 50kHz sonic wave every three seconds.
• The intensity image of the reflected wave is output.
– Sonar image
• The stronger the reflected wave, the brighter the image.
• Shape feature comes from sonic wave reflected form fish.
50.23m/px
3sec/px
165px
(38.65m)
25px(75s)
fishseabed
Intensity time series image
Segmentation for sonar image
Make label for images
• Positive samples: collected from 10 days with tuna catches in
the image.
• Negative samples: collected from 211 days without tuna
catches.
7
Numbers of total dataset
Positive samples Negative samples
positive negative
Train dataset 10,308 148,154
Valid dataset 2,756 37,039
Test dataset 3,321 46,299
Train data
augmentated
in same way
Data Augmentation
• Single method to process all the images.
– Commonly used augmentation methods
• SMOTE
• Mixup
• Cutout
• Random Erasing
– Basic single augmentation methods.
• Uncommon usage for bad performance
• Combination for basic methods perform better in image
classification compared with single method.
– State of art methods : AutoAugment,RandAugment
7
Single
method
Original
images
Combined method
1. How to select basic method?
2. How much magnitude apply on image?
– AutoAugment
• 16 methods
• Range of magnitudes :10
• Probability of applying :11 values
• 5 sub-policies : 10 methods in total
– RandAugment
• 14 methods
• Same magnitude M :sample 4 to 5 values from (1, 30)
• Randomly select N(2,3) methods from all 14 method
16 × 10 × 11 10
search space
2 × 4, 5 search space
faster and same accuracy
8
Basic augmentation methods 10
• Basic methods of RandAugment
– ShearX(Y): shear the image along the horizontal(vertical) axis.
– TranslateX(Y): translate the image in horizontal(vertical) axis.
– Rotate: rotate the image.
– Invert: invert the pixels of image.
– Equalize: equalize the image histogram.
– AutoContrast: maximize the image contrast.
– Brightness: adjust the brightness balance of the image.
– Solarize: reduce the value of pixels for each pixel.
– Posterize: invert the pixels above the threshold.
– Contrast: control the contrast of image.
– Color: adjust the color balance of the image.
– Sharpness: adjust the sharpness balance of the image.
• All these methods have different effects on different datasets.
• Important to figure out their fitness for sonar image datasets.
Position
Brightness
No extend
RandAugment 11
• Improvement method of AutoAugment, it can get high
accuracy quickly by narrowing the search space.
• Search for 2*4 hyperparameters combinations.
・N{2, 3} : number of randomly selected methods.
・M{5, 10, 15, 20} : magnitudes sampled from (1,30)
for every methods .
• Find the match of N and M with best accuracy.
Identity, AutoContrast, Rotate,
Solarize, Posterize, Contrast,
Brightness, Sharpness, ShearX(Y),
Translate(Y), Invert , Equalize
Contrast, ShearX,
Rotate
Randomly select 3 methods
from 14 methods above
apply CNN
Train process
ShearXAutoContrast Rotateoriginal
Example N=3, M=15
Method
Bigger M bigger change
ShearX
Brightness
M=15 M=30
Enlarged image
RandAugment 11
• Improvement method of AutoAugment, that can find best
methods quickly by narrowing the search space
• Search for 3*4 hyperparameters combinations.
・N{2, 3, 4} : number of randomly selected methods.
・M{5, 10, 15, 20} : magnitude sampled from (0,30)
for every methods .
• Find the match of N and M with best accuracy.
Identity, AutoContrast, Rotate,
Solarize, Posterize, Contrast,
Brightness, Sharpness, ShearX(Y),
Translate(Y), Invert , Equalize
ShearX,
AutoContrast, Rotate
Randomly select 3 methods
from 14 methods above
apply CNN
Train process
ShearXAutoContrast Rotateoriginal
Example N=3, M=15
Method
Bigger M bigger change
ShearX
Brightness
M=15 M=30
M=15
Enlarged image
Augmented image
RandAugment 11
• Improvement method of AutoAugment, that can achieve
same accuracy quickly by narrowing the search space
• Search for 3*4 hyperparameters combinations.
・N{2, 3, 4} : number of randomly selected methods.
・M{5, 10, 15, 20} : magnitude sampled from (0,30)
for every methods .
• Find the match of N and M with best accuracy.
Identity, AutoContrast, Rotate,
Solarize, Posterize, Contrast,
Brightness, Sharpness, ShearX(Y),
Translate(Y), Invert , Equalize
ShearX,
AutoContrast, Rotate
Randomly select 3 methods
from 14 methods above
apply CNN
Train process
ShearXAutoContrast Rotateoriginal
Example N=3, M=15
Method
Bigger M bigger change
ShearX
Brightness
M=15 M=30
Criterion for fitness of methods
• Variation of methods for dataset
– Judge by appearance of images
– Need a numerical criterion for variation.
– Mean square error(MSE) to evaluate the variation of
augmented images and original images.
12
Equalize InvertShearXOriginal
MSE for all methods with different M 13
For different M, define MSE in range(100,2000) as proper, and
train the model with these methods by RandAugment.
Equalize
Invert
ShearX
Original
Method M=5 M=15 M=25
Invert 55,164 55,146 55,146
Equalize 15,867 15,867 15,867
AutoContrast 1,013 1,013 1,013
TranslateY 200 525 590
Rotate 135 322 414
ShearY 112 305 341
TranslateX 29 212 259
Brightness 8.18 106 237
ShearX 11 56 61.8
Contrast 6.54 50 250
Solarize 0.53 4.42 1,107
Posterize 0.16 2.59 10
Sharpness 0.002 0.22 0.65
Color 0 0 0
56
15878
55526
Too large MSE
cause bad
performance
Conjecture1:
Too low MSE
cause bad
performance
Conjecture2:
MSE show same
tendency with
appearance .
Model in experiment 14
• Convolutional neural network for image classification
• Resnet50[4]
– Easier to optimize the residual mapping than to optimize the original
unreferenced mapping.
• Optimizer : Momentum SGD
• Loss function : softmax cross entropy
• Input image size(224,224) : resize sonar image to same size for
pretrained model.
[4]He, Kaiming, et al. "Deep residual learning for image recognition." Proceedings of
the IEEE conference on computer vision and pattern recognition. 2016.
weight layer
weight layer
relu
F(x)
H(x)=F(x)+x relu
x
x
identity
Resize to
(224,224)
Residual block
Imbalanced dataset
• Train dataset
– Number of positive samples is P
– Number of negative samples is N
– P:N≈1:14 in this experiment
• Weighted loss function
– Weight of positive samples :
– Weight of negative samples:
– Weighted cross entropy:
15
𝑁
𝑃 + 𝑁
P
𝑃 + 𝑁
𝑦 means true label and 𝑦′ means predicted label
𝑤 𝑃 =
𝑤 𝑁 =
𝐿 𝑤 = −𝑤 𝑃 ⋅ 𝑦 log 𝑦′
− 𝑤 𝑁 1 − 𝑦 log 1 − 𝑦′
(1)
Performance evaluation 16
• Accuracy
– In binary classification, when the dataset is extremely imbalanced(1:14 in
this experiment), accuracy is not an effective criterion for performance
evaluation.
• Confusion matrix
• Precision ,recall and f1 score
– Precision=TP/(TP+FP): efficiency of targeting tuna
– Recall=TP/(TP+FN): rate of targeting tuna
– f1 score=2/(1/precision+1/recall): harmonic value of precision and recall
True class
Positive Negative
Prediction
class
0
1 TP
TNFN
PF
Experiment 17
Baseline of
sonar image
dataset
Result with
RandAugment
Result with
RandAugment
RandAugment
with proper
basic methods
1. If RandAugment perform well on sonar image?
2. If RandAugment with proper basic methods
perform better than with all methods on sonar image?
Compare
Compare
Experiment 1
• Baselines without RandAugment and RandAugment
with all methods.
– Condition:
• Train for the baseline of sonar image dataset.
• Train for RandAugment including all basic methods with
N=3, M=(5, 15, 25)
18
Numbers of total dataset
positive negative
Train dataset 10,308 148,154
Valid dataset 2,756 37,039
Test dataset 3,321 46,299
Results of experiment 1
• Baseline train without RandAugment.
• Results of RandAugment with all methods
19
Baseline F1 Precision Recall
Rate 0.8640 0.8971 0.7985
• Overall 2% improvement compared with baseline for sonar image
M(N=3) F1 Precision Recall
5 0.8846 0.8947 0.8749
15 0.8779 0.9076 0.8640
25 0.8823 0.8907 0.8815
Results of experiment 1
• Train process figures with and without RandAugment.
20
• Train process become more stable with RandAugment
Without RandAugment With RandAugment
Experiment 2
• RandAugment with proper MSE methods
-(100, 2000) in table
21
Method M=5 M=15 M=25
Invert 55,164 55,146 55,146
Equalize 15,867 15,867 15,867
AutoContrast 1,013 1,013 1,013
TranslateY 200 525 590
Rotate 135 322 414
ShearY 112 305 341
TranslateX 29 212 259
Brightness 8.18 106 237
ShearX 11 56 61.8
Contrast 6.54 50 250
Solarize 0.53 4.42 1,107
Posterize 0.16 2.59 10
Sharpness 0.002 0.22 0.65
4 methods when M=5
6 methods when M=15
8 methods when M=25
Result of experiment 2
• RandAugment with appropriate MSE methods
22
M(N=3) F1 Precision Recall
5 0.8938 0.9041 0.8861
15 0.8985 0.8800 0.9104
25 0.9006 0.8935 0.9088
M(N=3) F1 Precision Recall
5 0.8846 0.8947 0.8749
15 0.8779 0.9076 0.8640
25 0.8823 0.8907 0.8815
• Overall improvement about 2% except M=5.
Result of all methods in RandAugment
Result of proper methods in RandAugment
4 methods when M=5
Conclusion
In this sonar image dataset collected from 2 years
• RandAugment can improve performance of network.
– F1 score and recall
– Stability of train process
• RandAugment with proper MSE basic methods
performs better than that with all methods for sonar
image dataset.
• Still need to be tested in more sonar image datasets
and complicated conditions before putting into use.
23

More Related Content

What's hot

Lecture 5
Lecture 5Lecture 5
Lecture 5
Wael Sharba
 
Image Enhancement
Image Enhancement Image Enhancement
Image Enhancement
Deven Sahu
 
Spatial domain and filtering
Spatial domain and filteringSpatial domain and filtering
Spatial domain and filtering
University of Potsdam
 
Digital image processing short quesstion answers
Digital image processing short quesstion answersDigital image processing short quesstion answers
Digital image processing short quesstion answers
Ateeq Zada
 
Digital Image Processing - Image Restoration
Digital Image Processing - Image RestorationDigital Image Processing - Image Restoration
Digital Image Processing - Image Restoration
Mathankumar S
 
Chapter 1 introduction (Image Processing)
Chapter 1 introduction (Image Processing)Chapter 1 introduction (Image Processing)
Chapter 1 introduction (Image Processing)
Varun Ojha
 
Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...
researchinventy
 
Image transforms
Image transformsImage transforms
Image transforms
BCET, Balasore
 
Digital image processing
Digital image processing  Digital image processing
Digital image processing
kavitha muneeshwaran
 
Image Acquisition and Representation
Image Acquisition and RepresentationImage Acquisition and Representation
Image Acquisition and Representation
Amnaakhaan
 
04 image enhancement edge detection
04 image enhancement edge detection04 image enhancement edge detection
04 image enhancement edge detection
Rumah Belajar
 
SPATIAL FILTER
SPATIAL FILTERSPATIAL FILTER
Image degradation and noise by Md.Naseem Ashraf
Image degradation and noise by Md.Naseem AshrafImage degradation and noise by Md.Naseem Ashraf
Image degradation and noise by Md.Naseem Ashraf
MD Naseem Ashraf
 
Adaptive unsharp masking
Adaptive unsharp maskingAdaptive unsharp masking
Adaptive unsharp masking
Ravi Teja
 
3 intensity transformations and spatial filtering slides
3 intensity transformations and spatial filtering slides3 intensity transformations and spatial filtering slides
3 intensity transformations and spatial filtering slides
BHAGYAPRASADBUGGE
 
Tree models with Scikit-Learn: Great models with little assumptions
Tree models with Scikit-Learn: Great models with little assumptionsTree models with Scikit-Learn: Great models with little assumptions
Tree models with Scikit-Learn: Great models with little assumptions
Gilles Louppe
 
Image Retrieval with Fisher Vectors of Binary Features (MIRU'14)
Image Retrieval with Fisher Vectors of Binary Features (MIRU'14)Image Retrieval with Fisher Vectors of Binary Features (MIRU'14)
Image Retrieval with Fisher Vectors of Binary Features (MIRU'14)
Yusuke Uchida
 
Cg36496501
Cg36496501Cg36496501
Cg36496501
IJERA Editor
 
Filtering Corrupted Image and Edge Detection in Restored Grayscale Image Usin...
Filtering Corrupted Image and Edge Detection in Restored Grayscale Image Usin...Filtering Corrupted Image and Edge Detection in Restored Grayscale Image Usin...
Filtering Corrupted Image and Edge Detection in Restored Grayscale Image Usin...
CSCJournals
 
ADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORS
ADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORSADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORS
ADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORS
Soma Boubou
 

What's hot (20)

Lecture 5
Lecture 5Lecture 5
Lecture 5
 
Image Enhancement
Image Enhancement Image Enhancement
Image Enhancement
 
Spatial domain and filtering
Spatial domain and filteringSpatial domain and filtering
Spatial domain and filtering
 
Digital image processing short quesstion answers
Digital image processing short quesstion answersDigital image processing short quesstion answers
Digital image processing short quesstion answers
 
Digital Image Processing - Image Restoration
Digital Image Processing - Image RestorationDigital Image Processing - Image Restoration
Digital Image Processing - Image Restoration
 
Chapter 1 introduction (Image Processing)
Chapter 1 introduction (Image Processing)Chapter 1 introduction (Image Processing)
Chapter 1 introduction (Image Processing)
 
Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...
 
Image transforms
Image transformsImage transforms
Image transforms
 
Digital image processing
Digital image processing  Digital image processing
Digital image processing
 
Image Acquisition and Representation
Image Acquisition and RepresentationImage Acquisition and Representation
Image Acquisition and Representation
 
04 image enhancement edge detection
04 image enhancement edge detection04 image enhancement edge detection
04 image enhancement edge detection
 
SPATIAL FILTER
SPATIAL FILTERSPATIAL FILTER
SPATIAL FILTER
 
Image degradation and noise by Md.Naseem Ashraf
Image degradation and noise by Md.Naseem AshrafImage degradation and noise by Md.Naseem Ashraf
Image degradation and noise by Md.Naseem Ashraf
 
Adaptive unsharp masking
Adaptive unsharp maskingAdaptive unsharp masking
Adaptive unsharp masking
 
3 intensity transformations and spatial filtering slides
3 intensity transformations and spatial filtering slides3 intensity transformations and spatial filtering slides
3 intensity transformations and spatial filtering slides
 
Tree models with Scikit-Learn: Great models with little assumptions
Tree models with Scikit-Learn: Great models with little assumptionsTree models with Scikit-Learn: Great models with little assumptions
Tree models with Scikit-Learn: Great models with little assumptions
 
Image Retrieval with Fisher Vectors of Binary Features (MIRU'14)
Image Retrieval with Fisher Vectors of Binary Features (MIRU'14)Image Retrieval with Fisher Vectors of Binary Features (MIRU'14)
Image Retrieval with Fisher Vectors of Binary Features (MIRU'14)
 
Cg36496501
Cg36496501Cg36496501
Cg36496501
 
Filtering Corrupted Image and Edge Detection in Restored Grayscale Image Usin...
Filtering Corrupted Image and Edge Detection in Restored Grayscale Image Usin...Filtering Corrupted Image and Edge Detection in Restored Grayscale Image Usin...
Filtering Corrupted Image and Edge Detection in Restored Grayscale Image Usin...
 
ADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORS
ADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORSADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORS
ADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORS
 

Similar to Study on Data Augmentation Methods for Sonar Image Analysis

Neural Inverse Rendering for General Reflectance Photometric Stereo (ICML 2018)
Neural Inverse Rendering for General Reflectance Photometric Stereo (ICML 2018)Neural Inverse Rendering for General Reflectance Photometric Stereo (ICML 2018)
Neural Inverse Rendering for General Reflectance Photometric Stereo (ICML 2018)
Tatsunori Taniai
 
Digital Image Fundamentals
Digital Image FundamentalsDigital Image Fundamentals
Digital Image Fundamentals
Kalyan Acharjya
 
Chapter 6 image quality in ct
Chapter 6 image quality in ct Chapter 6 image quality in ct
Chapter 6 image quality in ct
Muntaser S.Ahmad
 
Conjugate Gradient method for Brain Magnetic Resonance Images Segmentation
Conjugate Gradient method for Brain Magnetic Resonance Images SegmentationConjugate Gradient method for Brain Magnetic Resonance Images Segmentation
Conjugate Gradient method for Brain Magnetic Resonance Images Segmentation
EL-Hachemi Guerrout
 
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
AIST
 
Feasibility of EEG Super-Resolution Using Deep Convolutional Networks
Feasibility of EEG Super-Resolution Using Deep Convolutional NetworksFeasibility of EEG Super-Resolution Using Deep Convolutional Networks
Feasibility of EEG Super-Resolution Using Deep Convolutional Networks
Sangjun Han
 
Exploring Methods to Improve Edge Detection with Canny Algorithm
Exploring Methods to Improve Edge Detection with Canny AlgorithmExploring Methods to Improve Edge Detection with Canny Algorithm
Exploring Methods to Improve Edge Detection with Canny Algorithm
Prasad Thakur
 
Super resolution in deep learning era - Jaejun Yoo
Super resolution in deep learning era - Jaejun YooSuper resolution in deep learning era - Jaejun Yoo
Super resolution in deep learning era - Jaejun Yoo
JaeJun Yoo
 
Biologically Inspired Methods for Adversarially Robust Deep Learning
Biologically Inspired Methods for Adversarially Robust Deep LearningBiologically Inspired Methods for Adversarially Robust Deep Learning
Biologically Inspired Methods for Adversarially Robust Deep Learning
MuhammadAhmedShah2
 
Non-Local Compressive Sampling Recovery
Non-Local Compressive Sampling RecoveryNon-Local Compressive Sampling Recovery
Non-Local Compressive Sampling Recovery
shuxianbiao
 
SophieZhangXeroxPosterFinal2015
SophieZhangXeroxPosterFinal2015SophieZhangXeroxPosterFinal2015
SophieZhangXeroxPosterFinal2015
Sophie Zhang
 
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
ActiveEon
 
Computer aided detection of pulmonary nodules using genetic programming
Computer aided detection of pulmonary nodules using genetic programmingComputer aided detection of pulmonary nodules using genetic programming
Computer aided detection of pulmonary nodules using genetic programming
Wookjin Choi
 
seminar reprtv hdchjbjfkdbf dgusghdfs gsdgjsbk
seminar reprtv hdchjbjfkdbf dgusghdfs gsdgjsbkseminar reprtv hdchjbjfkdbf dgusghdfs gsdgjsbk
seminar reprtv hdchjbjfkdbf dgusghdfs gsdgjsbk
RajeshKotian11
 
riken-RBlur-slides.pptx
riken-RBlur-slides.pptxriken-RBlur-slides.pptx
riken-RBlur-slides.pptx
MuhammadAhmedShah2
 
FMRI medical imagining
FMRI  medical imaginingFMRI  medical imagining
FMRI medical imagining
Vishwas N
 
Lecture 3
Lecture 3Lecture 3
Lecture 3
Wael Sharba
 
Deep Local Parametric Filters for Image Enhancement
Deep Local Parametric Filters for Image EnhancementDeep Local Parametric Filters for Image Enhancement
Deep Local Parametric Filters for Image Enhancement
Sean Moran
 
Presentation-09022015
Presentation-09022015Presentation-09022015
Presentation-09022015
Bo Dong
 
Robustness of Median Filter For Suppression of Salt and Pepper Noise (SPN) an...
Robustness of Median Filter For Suppression of Salt and Pepper Noise (SPN) an...Robustness of Median Filter For Suppression of Salt and Pepper Noise (SPN) an...
Robustness of Median Filter For Suppression of Salt and Pepper Noise (SPN) an...
CSCJournals
 

Similar to Study on Data Augmentation Methods for Sonar Image Analysis (20)

Neural Inverse Rendering for General Reflectance Photometric Stereo (ICML 2018)
Neural Inverse Rendering for General Reflectance Photometric Stereo (ICML 2018)Neural Inverse Rendering for General Reflectance Photometric Stereo (ICML 2018)
Neural Inverse Rendering for General Reflectance Photometric Stereo (ICML 2018)
 
Digital Image Fundamentals
Digital Image FundamentalsDigital Image Fundamentals
Digital Image Fundamentals
 
Chapter 6 image quality in ct
Chapter 6 image quality in ct Chapter 6 image quality in ct
Chapter 6 image quality in ct
 
Conjugate Gradient method for Brain Magnetic Resonance Images Segmentation
Conjugate Gradient method for Brain Magnetic Resonance Images SegmentationConjugate Gradient method for Brain Magnetic Resonance Images Segmentation
Conjugate Gradient method for Brain Magnetic Resonance Images Segmentation
 
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
 
Feasibility of EEG Super-Resolution Using Deep Convolutional Networks
Feasibility of EEG Super-Resolution Using Deep Convolutional NetworksFeasibility of EEG Super-Resolution Using Deep Convolutional Networks
Feasibility of EEG Super-Resolution Using Deep Convolutional Networks
 
Exploring Methods to Improve Edge Detection with Canny Algorithm
Exploring Methods to Improve Edge Detection with Canny AlgorithmExploring Methods to Improve Edge Detection with Canny Algorithm
Exploring Methods to Improve Edge Detection with Canny Algorithm
 
Super resolution in deep learning era - Jaejun Yoo
Super resolution in deep learning era - Jaejun YooSuper resolution in deep learning era - Jaejun Yoo
Super resolution in deep learning era - Jaejun Yoo
 
Biologically Inspired Methods for Adversarially Robust Deep Learning
Biologically Inspired Methods for Adversarially Robust Deep LearningBiologically Inspired Methods for Adversarially Robust Deep Learning
Biologically Inspired Methods for Adversarially Robust Deep Learning
 
Non-Local Compressive Sampling Recovery
Non-Local Compressive Sampling RecoveryNon-Local Compressive Sampling Recovery
Non-Local Compressive Sampling Recovery
 
SophieZhangXeroxPosterFinal2015
SophieZhangXeroxPosterFinal2015SophieZhangXeroxPosterFinal2015
SophieZhangXeroxPosterFinal2015
 
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
Online Stochastic Tensor Decomposition for Background Subtraction in Multispe...
 
Computer aided detection of pulmonary nodules using genetic programming
Computer aided detection of pulmonary nodules using genetic programmingComputer aided detection of pulmonary nodules using genetic programming
Computer aided detection of pulmonary nodules using genetic programming
 
seminar reprtv hdchjbjfkdbf dgusghdfs gsdgjsbk
seminar reprtv hdchjbjfkdbf dgusghdfs gsdgjsbkseminar reprtv hdchjbjfkdbf dgusghdfs gsdgjsbk
seminar reprtv hdchjbjfkdbf dgusghdfs gsdgjsbk
 
riken-RBlur-slides.pptx
riken-RBlur-slides.pptxriken-RBlur-slides.pptx
riken-RBlur-slides.pptx
 
FMRI medical imagining
FMRI  medical imaginingFMRI  medical imagining
FMRI medical imagining
 
Lecture 3
Lecture 3Lecture 3
Lecture 3
 
Deep Local Parametric Filters for Image Enhancement
Deep Local Parametric Filters for Image EnhancementDeep Local Parametric Filters for Image Enhancement
Deep Local Parametric Filters for Image Enhancement
 
Presentation-09022015
Presentation-09022015Presentation-09022015
Presentation-09022015
 
Robustness of Median Filter For Suppression of Salt and Pepper Noise (SPN) an...
Robustness of Median Filter For Suppression of Salt and Pepper Noise (SPN) an...Robustness of Median Filter For Suppression of Salt and Pepper Noise (SPN) an...
Robustness of Median Filter For Suppression of Salt and Pepper Noise (SPN) an...
 

More from harmonylab

Generating Automatic Feedback on UI Mockups with Large Language Models
Generating Automatic Feedback on UI Mockups with Large Language ModelsGenerating Automatic Feedback on UI Mockups with Large Language Models
Generating Automatic Feedback on UI Mockups with Large Language Models
harmonylab
 
【DLゼミ】XFeat: Accelerated Features for Lightweight Image Matching
【DLゼミ】XFeat: Accelerated Features for Lightweight Image Matching【DLゼミ】XFeat: Accelerated Features for Lightweight Image Matching
【DLゼミ】XFeat: Accelerated Features for Lightweight Image Matching
harmonylab
 
【修士論文】代替出勤者の選定業務における依頼順決定方法に関する研究   千坂知也
【修士論文】代替出勤者の選定業務における依頼順決定方法に関する研究   千坂知也【修士論文】代替出勤者の選定業務における依頼順決定方法に関する研究   千坂知也
【修士論文】代替出勤者の選定業務における依頼順決定方法に関する研究   千坂知也
harmonylab
 
【修士論文】経路探索のための媒介中心性に基づく道路ネットワーク階層化手法に関する研究
【修士論文】経路探索のための媒介中心性に基づく道路ネットワーク階層化手法に関する研究【修士論文】経路探索のための媒介中心性に基づく道路ネットワーク階層化手法に関する研究
【修士論文】経路探索のための媒介中心性に基づく道路ネットワーク階層化手法に関する研究
harmonylab
 
A Study on Decision Support System for Snow Removal Dispatch using Road Surfa...
A Study on Decision Support System for Snow Removal Dispatch using Road Surfa...A Study on Decision Support System for Snow Removal Dispatch using Road Surfa...
A Study on Decision Support System for Snow Removal Dispatch using Road Surfa...
harmonylab
 
【卒業論文】印象タグを用いた衣服画像生成システムに関する研究
【卒業論文】印象タグを用いた衣服画像生成システムに関する研究【卒業論文】印象タグを用いた衣服画像生成システムに関する研究
【卒業論文】印象タグを用いた衣服画像生成システムに関する研究
harmonylab
 
【卒業論文】大規模言語モデルを用いたマニュアル文章修正手法に関する研究
【卒業論文】大規模言語モデルを用いたマニュアル文章修正手法に関する研究【卒業論文】大規模言語モデルを用いたマニュアル文章修正手法に関する研究
【卒業論文】大規模言語モデルを用いたマニュアル文章修正手法に関する研究
harmonylab
 
DLゼミ:Primitive Generation and Semantic-related Alignment for Universal Zero-S...
DLゼミ:Primitive Generation and Semantic-related Alignment for Universal Zero-S...DLゼミ:Primitive Generation and Semantic-related Alignment for Universal Zero-S...
DLゼミ:Primitive Generation and Semantic-related Alignment for Universal Zero-S...
harmonylab
 
DLゼミ: MobileOne: An Improved One millisecond Mobile Backbone
DLゼミ: MobileOne: An Improved One millisecond Mobile BackboneDLゼミ: MobileOne: An Improved One millisecond Mobile Backbone
DLゼミ: MobileOne: An Improved One millisecond Mobile Backbone
harmonylab
 
DLゼミ: Llama 2: Open Foundation and Fine-Tuned Chat Models
DLゼミ: Llama 2: Open Foundation and Fine-Tuned Chat ModelsDLゼミ: Llama 2: Open Foundation and Fine-Tuned Chat Models
DLゼミ: Llama 2: Open Foundation and Fine-Tuned Chat Models
harmonylab
 
DLゼミ: ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
DLゼミ: ViTPose: Simple Vision Transformer Baselines for Human Pose EstimationDLゼミ: ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
DLゼミ: ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
harmonylab
 
Voyager: An Open-Ended Embodied Agent with Large Language Models
Voyager: An Open-Ended Embodied Agent with Large Language ModelsVoyager: An Open-Ended Embodied Agent with Large Language Models
Voyager: An Open-Ended Embodied Agent with Large Language Models
harmonylab
 
DLゼミ: Ego-Body Pose Estimation via Ego-Head Pose Estimation
DLゼミ: Ego-Body Pose Estimation via Ego-Head Pose EstimationDLゼミ: Ego-Body Pose Estimation via Ego-Head Pose Estimation
DLゼミ: Ego-Body Pose Estimation via Ego-Head Pose Estimation
harmonylab
 
ReAct: Synergizing Reasoning and Acting in Language Models
ReAct: Synergizing Reasoning and Acting in Language ModelsReAct: Synergizing Reasoning and Acting in Language Models
ReAct: Synergizing Reasoning and Acting in Language Models
harmonylab
 
形態素解析を用いた帝国議会議事速記録の変遷に関する研究
形態素解析を用いた帝国議会議事速記録の変遷に関する研究形態素解析を用いた帝国議会議事速記録の変遷に関する研究
形態素解析を用いた帝国議会議事速記録の変遷に関する研究
harmonylab
 
【卒業論文】深層生成モデルを用いたユーザ意図に基づく衣服画像の生成に関する研究
【卒業論文】深層生成モデルを用いたユーザ意図に基づく衣服画像の生成に関する研究【卒業論文】深層生成モデルを用いたユーザ意図に基づく衣服画像の生成に関する研究
【卒業論文】深層生成モデルを用いたユーザ意図に基づく衣服画像の生成に関する研究
harmonylab
 
灯油タンク内の液面高計測を用いた 灯油残量推定システムに関する研究
灯油タンク内の液面高計測を用いた灯油残量推定システムに関する研究灯油タンク内の液面高計測を用いた灯油残量推定システムに関する研究
灯油タンク内の液面高計測を用いた 灯油残量推定システムに関する研究
harmonylab
 
深層自己回帰モデルを用いた俳句の生成と評価に関する研究
深層自己回帰モデルを用いた俳句の生成と評価に関する研究深層自己回帰モデルを用いた俳句の生成と評価に関する研究
深層自己回帰モデルを用いた俳句の生成と評価に関する研究
harmonylab
 
競輪におけるレーティングシステムを用いた予想記事生成に関する研究
競輪におけるレーティングシステムを用いた予想記事生成に関する研究競輪におけるレーティングシステムを用いた予想記事生成に関する研究
競輪におけるレーティングシステムを用いた予想記事生成に関する研究
harmonylab
 
【卒業論文】B2Bオークションにおけるユーザ別 入札行動予測に関する研究
【卒業論文】B2Bオークションにおけるユーザ別 入札行動予測に関する研究【卒業論文】B2Bオークションにおけるユーザ別 入札行動予測に関する研究
【卒業論文】B2Bオークションにおけるユーザ別 入札行動予測に関する研究
harmonylab
 

More from harmonylab (20)

Generating Automatic Feedback on UI Mockups with Large Language Models
Generating Automatic Feedback on UI Mockups with Large Language ModelsGenerating Automatic Feedback on UI Mockups with Large Language Models
Generating Automatic Feedback on UI Mockups with Large Language Models
 
【DLゼミ】XFeat: Accelerated Features for Lightweight Image Matching
【DLゼミ】XFeat: Accelerated Features for Lightweight Image Matching【DLゼミ】XFeat: Accelerated Features for Lightweight Image Matching
【DLゼミ】XFeat: Accelerated Features for Lightweight Image Matching
 
【修士論文】代替出勤者の選定業務における依頼順決定方法に関する研究   千坂知也
【修士論文】代替出勤者の選定業務における依頼順決定方法に関する研究   千坂知也【修士論文】代替出勤者の選定業務における依頼順決定方法に関する研究   千坂知也
【修士論文】代替出勤者の選定業務における依頼順決定方法に関する研究   千坂知也
 
【修士論文】経路探索のための媒介中心性に基づく道路ネットワーク階層化手法に関する研究
【修士論文】経路探索のための媒介中心性に基づく道路ネットワーク階層化手法に関する研究【修士論文】経路探索のための媒介中心性に基づく道路ネットワーク階層化手法に関する研究
【修士論文】経路探索のための媒介中心性に基づく道路ネットワーク階層化手法に関する研究
 
A Study on Decision Support System for Snow Removal Dispatch using Road Surfa...
A Study on Decision Support System for Snow Removal Dispatch using Road Surfa...A Study on Decision Support System for Snow Removal Dispatch using Road Surfa...
A Study on Decision Support System for Snow Removal Dispatch using Road Surfa...
 
【卒業論文】印象タグを用いた衣服画像生成システムに関する研究
【卒業論文】印象タグを用いた衣服画像生成システムに関する研究【卒業論文】印象タグを用いた衣服画像生成システムに関する研究
【卒業論文】印象タグを用いた衣服画像生成システムに関する研究
 
【卒業論文】大規模言語モデルを用いたマニュアル文章修正手法に関する研究
【卒業論文】大規模言語モデルを用いたマニュアル文章修正手法に関する研究【卒業論文】大規模言語モデルを用いたマニュアル文章修正手法に関する研究
【卒業論文】大規模言語モデルを用いたマニュアル文章修正手法に関する研究
 
DLゼミ:Primitive Generation and Semantic-related Alignment for Universal Zero-S...
DLゼミ:Primitive Generation and Semantic-related Alignment for Universal Zero-S...DLゼミ:Primitive Generation and Semantic-related Alignment for Universal Zero-S...
DLゼミ:Primitive Generation and Semantic-related Alignment for Universal Zero-S...
 
DLゼミ: MobileOne: An Improved One millisecond Mobile Backbone
DLゼミ: MobileOne: An Improved One millisecond Mobile BackboneDLゼミ: MobileOne: An Improved One millisecond Mobile Backbone
DLゼミ: MobileOne: An Improved One millisecond Mobile Backbone
 
DLゼミ: Llama 2: Open Foundation and Fine-Tuned Chat Models
DLゼミ: Llama 2: Open Foundation and Fine-Tuned Chat ModelsDLゼミ: Llama 2: Open Foundation and Fine-Tuned Chat Models
DLゼミ: Llama 2: Open Foundation and Fine-Tuned Chat Models
 
DLゼミ: ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
DLゼミ: ViTPose: Simple Vision Transformer Baselines for Human Pose EstimationDLゼミ: ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
DLゼミ: ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
 
Voyager: An Open-Ended Embodied Agent with Large Language Models
Voyager: An Open-Ended Embodied Agent with Large Language ModelsVoyager: An Open-Ended Embodied Agent with Large Language Models
Voyager: An Open-Ended Embodied Agent with Large Language Models
 
DLゼミ: Ego-Body Pose Estimation via Ego-Head Pose Estimation
DLゼミ: Ego-Body Pose Estimation via Ego-Head Pose EstimationDLゼミ: Ego-Body Pose Estimation via Ego-Head Pose Estimation
DLゼミ: Ego-Body Pose Estimation via Ego-Head Pose Estimation
 
ReAct: Synergizing Reasoning and Acting in Language Models
ReAct: Synergizing Reasoning and Acting in Language ModelsReAct: Synergizing Reasoning and Acting in Language Models
ReAct: Synergizing Reasoning and Acting in Language Models
 
形態素解析を用いた帝国議会議事速記録の変遷に関する研究
形態素解析を用いた帝国議会議事速記録の変遷に関する研究形態素解析を用いた帝国議会議事速記録の変遷に関する研究
形態素解析を用いた帝国議会議事速記録の変遷に関する研究
 
【卒業論文】深層生成モデルを用いたユーザ意図に基づく衣服画像の生成に関する研究
【卒業論文】深層生成モデルを用いたユーザ意図に基づく衣服画像の生成に関する研究【卒業論文】深層生成モデルを用いたユーザ意図に基づく衣服画像の生成に関する研究
【卒業論文】深層生成モデルを用いたユーザ意図に基づく衣服画像の生成に関する研究
 
灯油タンク内の液面高計測を用いた 灯油残量推定システムに関する研究
灯油タンク内の液面高計測を用いた灯油残量推定システムに関する研究灯油タンク内の液面高計測を用いた灯油残量推定システムに関する研究
灯油タンク内の液面高計測を用いた 灯油残量推定システムに関する研究
 
深層自己回帰モデルを用いた俳句の生成と評価に関する研究
深層自己回帰モデルを用いた俳句の生成と評価に関する研究深層自己回帰モデルを用いた俳句の生成と評価に関する研究
深層自己回帰モデルを用いた俳句の生成と評価に関する研究
 
競輪におけるレーティングシステムを用いた予想記事生成に関する研究
競輪におけるレーティングシステムを用いた予想記事生成に関する研究競輪におけるレーティングシステムを用いた予想記事生成に関する研究
競輪におけるレーティングシステムを用いた予想記事生成に関する研究
 
【卒業論文】B2Bオークションにおけるユーザ別 入札行動予測に関する研究
【卒業論文】B2Bオークションにおけるユーザ別 入札行動予測に関する研究【卒業論文】B2Bオークションにおけるユーザ別 入札行動予測に関する研究
【卒業論文】B2Bオークションにおけるユーザ別 入札行動予測に関する研究
 

Recently uploaded

GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
Zilliz
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 

Recently uploaded (20)

GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 

Study on Data Augmentation Methods for Sonar Image Analysis

  • 1. Study on Data Augmentation Methods for Sonar Image Analysis Hokkaido University Graduate School of Information Science and Technology Harmonious Systems Engineering Laboratory MIN JIE 0
  • 2. Background 2 • For aquatic resources management, catching restricted fish species should be avoided. • Electromagnetic wave is interfered severely by environment noise, so sonar image is commonly used for solving fishing problems. • Neural network has been used in sonar image to find specific kind of fish.[1] • Data augmentation can improve performance of neural network. [1]平間 友大, 横山 想一郎, 山下 倫央, 川村 秀憲,鈴木 恵二,和田 雅昭 : CNNを用いた音響画像に基づく 定置網内の魚種推定の精度向上, 第18回情報科学技術フォーラム(FIT2019), CF-009, 岡山(2019)
  • 3. ① ② ③ Echo sonder Introduction of prediction system 3 Convolutional neural networkSegmentation Determine whether cast nets Output percentage of target fish existance Threshold • System of avoiding restricted fish species for the management of aquatic resources Sonar images from echo sounder Input Prediction
  • 4. Motivation • Sonar images have different composition compared with real world images. • Different tricks for neural network. – Data augmentations (random flip, translate) – Change the structure of the network – Add Dropout into the network – Fine tuning and add weight decay • Selecting and combining basic methods in different ways such as AutoAugment[2] and RandAugment[3] from google research achieved best performance. 3 [3]RandAugment: Ekin D. Cubuk, Barret Zoph, Jonathon Shlens, Quoc V. Le.’’Practical automated data augmentation with a reduced search space’’ [2]AutoAugment: Ekin D. Cubuk, Barret Zoph, Dandelion Mane, Vijay Vasudevan, Quoc V. Le. “Learning Augmentation Policies from Data”
  • 5. Purpose • Avoid catching restricted fish species for aquatic resources management. • Make a proposal of data augmentation methods for sonar image to improve the performance of prediction system. – RandAugment perform well for natural image. – Could be used for sonar image. • Figure out fitness of basic methods used in RandAugment for sonar image according to their variations with original images. 5
  • 6. Introduction of sonar image • How to collect sonar image data – Echo sounder • Emits a 50kHz sonic wave every three seconds. • The intensity image of the reflected wave is output. – Sonar image • The stronger the reflected wave, the brighter the image. • Shape feature comes from sonic wave reflected form fish. 50.23m/px 3sec/px 165px (38.65m) 25px(75s) fishseabed Intensity time series image Segmentation for sonar image
  • 7. Make label for images • Positive samples: collected from 10 days with tuna catches in the image. • Negative samples: collected from 211 days without tuna catches. 7 Numbers of total dataset Positive samples Negative samples positive negative Train dataset 10,308 148,154 Valid dataset 2,756 37,039 Test dataset 3,321 46,299
  • 8. Train data augmentated in same way Data Augmentation • Single method to process all the images. – Commonly used augmentation methods • SMOTE • Mixup • Cutout • Random Erasing – Basic single augmentation methods. • Uncommon usage for bad performance • Combination for basic methods perform better in image classification compared with single method. – State of art methods : AutoAugment,RandAugment 7 Single method Original images
  • 9. Combined method 1. How to select basic method? 2. How much magnitude apply on image? – AutoAugment • 16 methods • Range of magnitudes :10 • Probability of applying :11 values • 5 sub-policies : 10 methods in total – RandAugment • 14 methods • Same magnitude M :sample 4 to 5 values from (1, 30) • Randomly select N(2,3) methods from all 14 method 16 × 10 × 11 10 search space 2 × 4, 5 search space faster and same accuracy 8
  • 10. Basic augmentation methods 10 • Basic methods of RandAugment – ShearX(Y): shear the image along the horizontal(vertical) axis. – TranslateX(Y): translate the image in horizontal(vertical) axis. – Rotate: rotate the image. – Invert: invert the pixels of image. – Equalize: equalize the image histogram. – AutoContrast: maximize the image contrast. – Brightness: adjust the brightness balance of the image. – Solarize: reduce the value of pixels for each pixel. – Posterize: invert the pixels above the threshold. – Contrast: control the contrast of image. – Color: adjust the color balance of the image. – Sharpness: adjust the sharpness balance of the image. • All these methods have different effects on different datasets. • Important to figure out their fitness for sonar image datasets. Position Brightness No extend
  • 11. RandAugment 11 • Improvement method of AutoAugment, it can get high accuracy quickly by narrowing the search space. • Search for 2*4 hyperparameters combinations. ・N{2, 3} : number of randomly selected methods. ・M{5, 10, 15, 20} : magnitudes sampled from (1,30) for every methods . • Find the match of N and M with best accuracy. Identity, AutoContrast, Rotate, Solarize, Posterize, Contrast, Brightness, Sharpness, ShearX(Y), Translate(Y), Invert , Equalize Contrast, ShearX, Rotate Randomly select 3 methods from 14 methods above apply CNN Train process ShearXAutoContrast Rotateoriginal Example N=3, M=15 Method Bigger M bigger change ShearX Brightness M=15 M=30
  • 13. RandAugment 11 • Improvement method of AutoAugment, that can find best methods quickly by narrowing the search space • Search for 3*4 hyperparameters combinations. ・N{2, 3, 4} : number of randomly selected methods. ・M{5, 10, 15, 20} : magnitude sampled from (0,30) for every methods . • Find the match of N and M with best accuracy. Identity, AutoContrast, Rotate, Solarize, Posterize, Contrast, Brightness, Sharpness, ShearX(Y), Translate(Y), Invert , Equalize ShearX, AutoContrast, Rotate Randomly select 3 methods from 14 methods above apply CNN Train process ShearXAutoContrast Rotateoriginal Example N=3, M=15 Method Bigger M bigger change ShearX Brightness M=15 M=30 M=15
  • 15. RandAugment 11 • Improvement method of AutoAugment, that can achieve same accuracy quickly by narrowing the search space • Search for 3*4 hyperparameters combinations. ・N{2, 3, 4} : number of randomly selected methods. ・M{5, 10, 15, 20} : magnitude sampled from (0,30) for every methods . • Find the match of N and M with best accuracy. Identity, AutoContrast, Rotate, Solarize, Posterize, Contrast, Brightness, Sharpness, ShearX(Y), Translate(Y), Invert , Equalize ShearX, AutoContrast, Rotate Randomly select 3 methods from 14 methods above apply CNN Train process ShearXAutoContrast Rotateoriginal Example N=3, M=15 Method Bigger M bigger change ShearX Brightness M=15 M=30
  • 16. Criterion for fitness of methods • Variation of methods for dataset – Judge by appearance of images – Need a numerical criterion for variation. – Mean square error(MSE) to evaluate the variation of augmented images and original images. 12 Equalize InvertShearXOriginal
  • 17. MSE for all methods with different M 13 For different M, define MSE in range(100,2000) as proper, and train the model with these methods by RandAugment. Equalize Invert ShearX Original Method M=5 M=15 M=25 Invert 55,164 55,146 55,146 Equalize 15,867 15,867 15,867 AutoContrast 1,013 1,013 1,013 TranslateY 200 525 590 Rotate 135 322 414 ShearY 112 305 341 TranslateX 29 212 259 Brightness 8.18 106 237 ShearX 11 56 61.8 Contrast 6.54 50 250 Solarize 0.53 4.42 1,107 Posterize 0.16 2.59 10 Sharpness 0.002 0.22 0.65 Color 0 0 0 56 15878 55526 Too large MSE cause bad performance Conjecture1: Too low MSE cause bad performance Conjecture2: MSE show same tendency with appearance .
  • 18. Model in experiment 14 • Convolutional neural network for image classification • Resnet50[4] – Easier to optimize the residual mapping than to optimize the original unreferenced mapping. • Optimizer : Momentum SGD • Loss function : softmax cross entropy • Input image size(224,224) : resize sonar image to same size for pretrained model. [4]He, Kaiming, et al. "Deep residual learning for image recognition." Proceedings of the IEEE conference on computer vision and pattern recognition. 2016. weight layer weight layer relu F(x) H(x)=F(x)+x relu x x identity Resize to (224,224) Residual block
  • 19. Imbalanced dataset • Train dataset – Number of positive samples is P – Number of negative samples is N – P:N≈1:14 in this experiment • Weighted loss function – Weight of positive samples : – Weight of negative samples: – Weighted cross entropy: 15 𝑁 𝑃 + 𝑁 P 𝑃 + 𝑁 𝑦 means true label and 𝑦′ means predicted label 𝑤 𝑃 = 𝑤 𝑁 = 𝐿 𝑤 = −𝑤 𝑃 ⋅ 𝑦 log 𝑦′ − 𝑤 𝑁 1 − 𝑦 log 1 − 𝑦′ (1)
  • 20. Performance evaluation 16 • Accuracy – In binary classification, when the dataset is extremely imbalanced(1:14 in this experiment), accuracy is not an effective criterion for performance evaluation. • Confusion matrix • Precision ,recall and f1 score – Precision=TP/(TP+FP): efficiency of targeting tuna – Recall=TP/(TP+FN): rate of targeting tuna – f1 score=2/(1/precision+1/recall): harmonic value of precision and recall True class Positive Negative Prediction class 0 1 TP TNFN PF
  • 21. Experiment 17 Baseline of sonar image dataset Result with RandAugment Result with RandAugment RandAugment with proper basic methods 1. If RandAugment perform well on sonar image? 2. If RandAugment with proper basic methods perform better than with all methods on sonar image? Compare Compare
  • 22. Experiment 1 • Baselines without RandAugment and RandAugment with all methods. – Condition: • Train for the baseline of sonar image dataset. • Train for RandAugment including all basic methods with N=3, M=(5, 15, 25) 18 Numbers of total dataset positive negative Train dataset 10,308 148,154 Valid dataset 2,756 37,039 Test dataset 3,321 46,299
  • 23. Results of experiment 1 • Baseline train without RandAugment. • Results of RandAugment with all methods 19 Baseline F1 Precision Recall Rate 0.8640 0.8971 0.7985 • Overall 2% improvement compared with baseline for sonar image M(N=3) F1 Precision Recall 5 0.8846 0.8947 0.8749 15 0.8779 0.9076 0.8640 25 0.8823 0.8907 0.8815
  • 24. Results of experiment 1 • Train process figures with and without RandAugment. 20 • Train process become more stable with RandAugment Without RandAugment With RandAugment
  • 25. Experiment 2 • RandAugment with proper MSE methods -(100, 2000) in table 21 Method M=5 M=15 M=25 Invert 55,164 55,146 55,146 Equalize 15,867 15,867 15,867 AutoContrast 1,013 1,013 1,013 TranslateY 200 525 590 Rotate 135 322 414 ShearY 112 305 341 TranslateX 29 212 259 Brightness 8.18 106 237 ShearX 11 56 61.8 Contrast 6.54 50 250 Solarize 0.53 4.42 1,107 Posterize 0.16 2.59 10 Sharpness 0.002 0.22 0.65 4 methods when M=5 6 methods when M=15 8 methods when M=25
  • 26. Result of experiment 2 • RandAugment with appropriate MSE methods 22 M(N=3) F1 Precision Recall 5 0.8938 0.9041 0.8861 15 0.8985 0.8800 0.9104 25 0.9006 0.8935 0.9088 M(N=3) F1 Precision Recall 5 0.8846 0.8947 0.8749 15 0.8779 0.9076 0.8640 25 0.8823 0.8907 0.8815 • Overall improvement about 2% except M=5. Result of all methods in RandAugment Result of proper methods in RandAugment 4 methods when M=5
  • 27. Conclusion In this sonar image dataset collected from 2 years • RandAugment can improve performance of network. – F1 score and recall – Stability of train process • RandAugment with proper MSE basic methods performs better than that with all methods for sonar image dataset. • Still need to be tested in more sonar image datasets and complicated conditions before putting into use. 23