SlideShare a Scribd company logo
1 of 39
Download to read offline
When Remote Sensing Meets
Artificial Intelligence
Taoyuan, Taiwan
4 Desember 2021
UNIVERSITAS NEGERI MAKASSAR
WAHYU RAHMANIAR, Ph.D.
Postdoctoral Researcher
National Taipei University of Technology
Biography
Education
▪ S.Si.: Elektronika dan Instrumentasi (ELINS), Universitas Gajah Mada
▪ M.Sc. - Ph.D.: Electrical Engineering, National Central University
Works Experience
▪ Laboratory Assistant, ELINS, Universitas Gajah Mada
▪ Research Assistant, Fasilkom, Universitas Indonesia
▪ Research Assistant, FTIF, Institut Teknologi Sepuluh Nopember
▪ Freelance
▪ Image Processing/Computer Vision Algorithm, Issa Technology
Internship Experience
▪ Ubiik, Hsinchu, Taiwan
▪ Shibaura Institute of Technology, Toyosu, Tokyo, Japan
Occupation
Office: National Taipei University of Technology, Taiwan
Position: Postdoctoral Researcher
Research Interest
Robotics, image processing, computer vision, intelligent control, and artificial intelligence
Biography
Undergraduate Research
Robotic Prototype For Volcano Eruption Monitoring Based on Temperature Detection Using Amplitude Shift Keying
Modulation Telemetry
Master Research
▪ Sensor Integration for Real-Time Data Acquisition in Aerial Surveillance
▪ A Novel Object Detection Method based on Fuzzy Sets Theory and Speed-Up Robust Feature
Doctoral Research
▪ Real-Time Automated Segmentation and Classification of Calcaneal Fractures in CT Images
▪ Real-Time Detection and Recognition of Multiple Moving Objects for Aerial Surveillance
▪ Online Digital Image Stabilization for an Unmanned Aerial Vehicle
▪ Real-Time Bi-Directional People Counting using an RGB-D Camera
▪ Distance Measurements of Unmanned Aerial Vehicles using Vision-Based Systems in Unknown Environments
H-index: 4 (Google Scholar); 4 (Scopus)
Journal Reviewer: Journal of Robotics and Control (JRC), Big Data and Cognitive Computing - MDPI, Sensor Review
(IF: 1.442) , Circuit World (IF: 1.698) , Journal of Electronic Imaging (IF: 0.78), Scientific Reports – Nature (IF: 4.379)
GitHub: https://github.com/wahyurahmaniar
Content
INTRODUCTION
COMPUTERVISION APPLICATIONS
RESEARCH EXAMPLE IN ARTIFICIAL INTELLIGENCE
ARTIFICIAL INTELLIGENCE IN REMOTE SENSING
1
2
3
4
INTRODUCTION
Image Processing vs Computer Vision
Image Processing Example
1) Rescaling: zoom in, zoom out, cropping
2) Correcting illumination: brightness and contrast
3) Color manipulations: black and white, gray-scale, HSV, RGB, BGR
4) Filter: mean, median, low pass, high pass, gaussian, laplacian
5) Edge detection
Original Sobel Laplacian Canny
Image Processing Example
6) Morphology:
- Erode (local minimum)
- Dilate (local maximum)
- Opening (erosion then dilation)
- Closing (dilation then erosion)
- Morphology Gradient (MG)
(difference between dilation and erosion)
- Top Hat (TH)
(difference between image and opening)
- Black Hat
(difference between image and closing)
Computer Vision Example
▪ Scene modeling
▪ Object detection
▪ Object recognition
▪ Object tracking
▪ Pose estimation
▪ Motion estimation
▪ Image restoration
Artificial Intelligence (AI)
▪ AI is intelligence demonstrated by machines, as
opposed to the natural intelligence displayed by
humans or animals.
▪ Goals: reasoning, problem solving, knowledge
representation, planning, learning, natural
language processing, perception, prediction,
motion and manipulation, social intelligence, and
general intelligence.
▪ Media: image, sound, and data.
▪ AI before ML and DL: rules engines, expert
systems, knowledge graphs or symbolic AI.
▪ ML: supervised learning, unsupervised learning,
semi-supervised learning, reinforcement learning,
dimensionality reduction.
▪ DL: deep neural networks, deep belief networks,
deep reinforcement learning, recurrent neural
networks and convolutional neural networks.
Machine Learning vs Deep Learning
COMPUTER VISION
APPLICATIONS
Traditional AI
1) Traditional computer vision method: background subtraction
Disadvantage:
- Can't handle some lighting changes, including when it's dark
- Often detects 2 cars or more as 1 car
- Slow and less accurate than machine learning and deep learning
- Can't recognize the type and color of the car
Machine Learning
2) Machine learning: Haar cascade
Advantage:
- Faster computation time
Disadvantage:
- Can't handle dark illumination
- Less accurate than deep learning
- Can't recognize the type and color of the car
Deep Learning
2) Deep learning: YOLOv3
Advantage:
- Can handle any illumination
- Very accurate
- Can recognize the type and color of the car
Disadvantage:
- Requires large computation
RESEARCH EXAMPLE
Research 1: Moving Object Detection
Research 1: Moving Object Detection
Contribution
- Real-time computing performance for the detection and
recognition of multiple moving objects
- Accurate detection of moving objects using a UAV in complex
backgrounds
Rahmaniar, W.; Wang, W.-J.; Chen, H.-C. “Real-time detection and recognition of
multiple moving objects for aerial surveillance”, Electronics, vol. 8, no. 12, pp.
1373-1390, 2019. (SCI/IE, Q2, IF: 2.397)
System overview of moving objects detection and recognition
UAV movement modelling
Research 1: Object Detection
Cascade classifier for object detection and recognition
The angular value of the motion vector:
(a) (b)
Optical flow estimation.
(a) Original image. (b) Motion vectors.
( ) ( ) ( )
( ) ( )
1 180
tan
x
y
t
t
t


−
 

 
 = 
 

 
Research 1: Result
a. Motion vectors result without image stabilization
b. Motion vectors result with image stabilization
Image stabilization result
Method
Computation
Time (fps)
Accuracy (%)
PR Recall F-Measure
Proposed 47.08 94 91 92
Yazid et al. 1 70 76 72
Saif et al. - 66 86 74
Maier et al. - 94 89 91
Kalantar et al. 1.6 - - 73
Cai et al. 5 - - 76
Comparison of performance results
Research 2: Bi-directional People Counting
Research 2: Bi-directional People Counting
System overview of the proposed people counting
Contribution
- Fast and accurate object detection in the Depth images
- Map and register objects in the RGB images
- Fast and accurate tracking and counting objects
- Apply the algorithm to the embedded system
- Detect and count objects for “in” and “out” directions simultaneously
Rahmaniar, W.; Wang, W.-J.; Chiu, C.-W.; Hakim, N. L. “Real-time
bi-directional people counting using an RGB-D camera”, Sensor
Review, vol. 41, no. 4, pp. 341-349, 2021. (SCI/IE, Q2, IF: 1.442)
Research 2: Object Detection
(d) (e)
(a) (b) (c)
Foreground segmentation.
(a) Background removal.
(b) Closer object segmentation.
(c) Farther object segmentation.
(d) Binary image of the closer object.
(e) Binary image of the farther object.
Research 2: Object Registration
• The registered object’s heads are mapped to an RGB image:
(a) (b)
Object registration.
(a) Object detected in the depth RGB image. (b) Object registered in the RGB image.
( , ) ( ( , ) 2) ( , )
RGB
head n t head n t R shift x y
=  +
Research 2: Result
Hardware setup
➢ Current precision rate: 99%.
➢ Average computation time: 30~40 fps.
➢ Detect and track up to 6 people in a frame.
Comparison of performance results
Method
Computation time (fps) Accuracy
PC Embedded system PR Recall F-Measure
Proposed 35 18 0.99 0.96 0.97
Cho et al. 20 4 0.96 0.98 0.97
Tokta et al. 10 - 0.87 - -
Garcia et al. 28 - 0.96 - -
Bondi et al. 20 - 0.97 0.89 0.93
Research 3: Indoor UAV distance measurement
Research 3: Indoor UAV distance measurement
System overview
Contribution
- Fast and accurate distance measurement of the UAV in the building
- UAV positioning in the unknown environments
- Camera stabilization and positioning within UAV movement
Rahmaniar, W.; Wang, W.-J.; Caesarendra, W.; Glowacz, A.;
Oprzędkiewicz, K; Sulowicz, M, Irfan, M. “Distance Measurement of
Unmanned Aerial Vehicles Using Vision-based Systems in Unknown
Environments”, Electronics, vol. 10, no. 14, pp.1647-1660, 2021.
(SCI/IE, Q2, IF: 2.397)
Indoor segmentation
ARTIFICIAL INTELLIGENCE IN
REMOTE SENSING
Research Example
▪ Land Use and Land Cover Mapping
▪ Object Detection in Optical, Radar and Lidar Images
▪ Object Detection from Aerial and Space Platforms
▪ Moving Object Detection and Control
▪ Robotics and AI for Infrastructure Inspection and Monitoring
▪ Marine, Ocean and Climate Change Monitoring
▪ Plant Disease and Tree Health Detection
Research 1: Small Object Detection
Research 1: Small Object Detection
▪ To improve detection accuracy on remote sensing images, researchers have used deep convolutional
neural network (CNN)-based super-resolution (SR) techniques to generate artificial images and then
detect objects
▪ Generative Adversarial Network (GAN)-based methods such as super-resolution GAN (SRGAN) and
enhanced super-resolution GAN (ESRGAN) showed remarkable performance in enhancing low-resolution
(LR) images with and without noise
▪ These models have two subnetworks: a generator and a discriminator. Both subnetworks consist of deep
CNNs.
▪ Datasets containing HR and LR image pairs are used for training and testing the models. The generator
generates HR images from LR input images, and the discriminator predicts whether generated image is a
real HR image or an upscaled LR image.
▪ After sufficient training, the generator generates HR images that are similar to the ground truth HR
images, and the discriminator cannot correctly discriminate between real and fake images anymore.
Research 1: Small Object Detection
▪ Detection on LR (low-resolution) images (60cm/pixel)
is shown in (I); in (II), we show the detection on
generated SR (super resolution) images (15 cm/pixel).
▪ The first row of this figure represents the COWC (car
overhead with context) dataset, and the second row
represents the OGST (oil and gas storage tank) dataset.
▪ FRCNN (faster region-based CNN) detector is used for
LR images for detection.
▪ Then EESRGAN (edge-enhanced SRGAN) and FRCNN
architecture (EESRGAN-FRCNN) are used to generate
SR images and simultaneously detect objects from the
SR images.
▪ The red bounding boxes represent true positives, and
yellow bounding boxes represent false negatives.
Research 1: Small Object Detection
▪ The generator G generates intermediate super-
resolution (ISR) images, and then final SR images are
generated after applying the EEN network.
▪ The discriminator (DRa) discriminates between ground
truth (GT) HR images and ISR. The inverted gradients of
DRa are backpropagated into the generator G in order to
create SR images allowing for accurate object detection.
▪ Edge information is extracted from ISR, and the EEN
network enhances these edges. Afterwards, the
enhanced edges are again added to the ISR after
subtracting the original edges extracted by the Laplacian
operator and the output SR images are generated with
enhanced edges.
▪ Finally, objects are detected from the SR images using
the detector network.
▪ Two different loss functions are used for EEN: one
compares the difference between SR and ground truth
images, and the other compares the difference between
the extracted edge from ISR and ground truth.
Research 1: Small Object Detection
Research 2: Building Segmentation
Research 2: Building Segmentation
▪ A deep multi-task learning (MTL) framework is proposed to simultaneously address building footprint
segmentation as the main task; building boundary extraction, and image reconstruction as auxiliary tasks for very
high-resolution RS images
▪ The proposed framework takes input image, ground truth map, and boundary extracted ground truth map as an
input and outputs; (1) segmentation map, (2) boundary segmentation map, and (3) reconstructed input image.
Research 2: Building Segmentation
▪ The objectives aimed to optimize in this paper are, cross-entropy for semantic segmentation and boundary
extraction tasks and mean absolute error (MAE or L1 loss) for the reconstruction task.
▪ The canny edge detection is used algorithm with a radius of 3, together with dilation to extract the
boundaries from whole ground truth masks.
▪ During experiments, the input images and corresponding masks are randomly cropped by 512 x 512 patches
as network inputs. The dataset is partitioned by a 7:2:1 ratio for train, validation, and test set, respectively.
▪ Data augmentation is applied to increase the dataset size artificially by deploying the following
transformations: horizontal-vertical flip, additive Gaussian noise, perspective, random brightness, gamma,
contrast, blur.
▪ ResNet101 is used as a backbone for the feature encoder.
▪ The encoder-decoder type network architecture is based on U-Net, with five encoder and following decoder
blocks with the filter sizes of (256, 128, 64, 32, 16)
Research 2: Building Segmentation
Q & A ARE
WELCOME

More Related Content

What's hot

Wastian, Brunmeir - Data Analyses in Industrial Applications: From Predictive...
Wastian, Brunmeir - Data Analyses in Industrial Applications: From Predictive...Wastian, Brunmeir - Data Analyses in Industrial Applications: From Predictive...
Wastian, Brunmeir - Data Analyses in Industrial Applications: From Predictive...Vienna Data Science Group
 
SSII2020TS: Event-Based Camera の基礎と ニューラルネットワークによる信号処理 〜 生き物のように「変化」を捉えるビジョンセ...
SSII2020TS: Event-Based Camera の基礎と ニューラルネットワークによる信号処理 〜 生き物のように「変化」を捉えるビジョンセ...SSII2020TS: Event-Based Camera の基礎と ニューラルネットワークによる信号処理 〜 生き物のように「変化」を捉えるビジョンセ...
SSII2020TS: Event-Based Camera の基礎と ニューラルネットワークによる信号処理 〜 生き物のように「変化」を捉えるビジョンセ...SSII
 
Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016
Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016
Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016Universitat Politècnica de Catalunya
 
Deep Learning: a birds eye view
Deep Learning: a birds eye viewDeep Learning: a birds eye view
Deep Learning: a birds eye viewRoelof Pieters
 
Synthesizing pseudo 2.5 d content from monocular videos for mixed reality
Synthesizing pseudo 2.5 d content from monocular videos for mixed realitySynthesizing pseudo 2.5 d content from monocular videos for mixed reality
Synthesizing pseudo 2.5 d content from monocular videos for mixed realityNAVER Engineering
 
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
HUMAN MOTION  DETECTION AND TRACKING FOR VIDEO SURVEILLANCEHUMAN MOTION  DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCENEHA THADEUS
 
IRJET - Direct Me-Nevigation for Blind People
IRJET -  	  Direct Me-Nevigation for Blind PeopleIRJET -  	  Direct Me-Nevigation for Blind People
IRJET - Direct Me-Nevigation for Blind PeopleIRJET Journal
 
TRACKING OF PARTIALLY OCCLUDED OBJECTS IN VIDEO SEQUENCES
TRACKING OF PARTIALLY OCCLUDED OBJECTS IN VIDEO SEQUENCESTRACKING OF PARTIALLY OCCLUDED OBJECTS IN VIDEO SEQUENCES
TRACKING OF PARTIALLY OCCLUDED OBJECTS IN VIDEO SEQUENCESPraveen Pallav
 
【ECCV 2016 BNMW】Human Action Recognition without Human
【ECCV 2016 BNMW】Human Action Recognition without Human【ECCV 2016 BNMW】Human Action Recognition without Human
【ECCV 2016 BNMW】Human Action Recognition without HumanHirokatsu Kataoka
 
Deep Learning and Reinforcement Learning
Deep Learning and Reinforcement LearningDeep Learning and Reinforcement Learning
Deep Learning and Reinforcement LearningRenārs Liepiņš
 
An Object Detection, Tracking And Parametric Classification– A Review
An Object Detection, Tracking And Parametric Classification– A ReviewAn Object Detection, Tracking And Parametric Classification– A Review
An Object Detection, Tracking And Parametric Classification– A ReviewIRJET Journal
 
Visual Object Tracking: review
Visual Object Tracking: reviewVisual Object Tracking: review
Visual Object Tracking: reviewDmytro Mishkin
 
TechnicalBackgroundOverview
TechnicalBackgroundOverviewTechnicalBackgroundOverview
TechnicalBackgroundOverviewMotaz El-Saban
 
Detection and Tracking of Moving Object: A Survey
Detection and Tracking of Moving Object: A SurveyDetection and Tracking of Moving Object: A Survey
Detection and Tracking of Moving Object: A SurveyIJERA Editor
 
Introduction to Object recognition
Introduction to Object recognitionIntroduction to Object recognition
Introduction to Object recognitionAshiq Ullah
 

What's hot (20)

Wastian, Brunmeir - Data Analyses in Industrial Applications: From Predictive...
Wastian, Brunmeir - Data Analyses in Industrial Applications: From Predictive...Wastian, Brunmeir - Data Analyses in Industrial Applications: From Predictive...
Wastian, Brunmeir - Data Analyses in Industrial Applications: From Predictive...
 
Presentation of Visual Tracking
Presentation of Visual TrackingPresentation of Visual Tracking
Presentation of Visual Tracking
 
SSII2020TS: Event-Based Camera の基礎と ニューラルネットワークによる信号処理 〜 生き物のように「変化」を捉えるビジョンセ...
SSII2020TS: Event-Based Camera の基礎と ニューラルネットワークによる信号処理 〜 生き物のように「変化」を捉えるビジョンセ...SSII2020TS: Event-Based Camera の基礎と ニューラルネットワークによる信号処理 〜 生き物のように「変化」を捉えるビジョンセ...
SSII2020TS: Event-Based Camera の基礎と ニューラルネットワークによる信号処理 〜 生き物のように「変化」を捉えるビジョンセ...
 
Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016
Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016
Deep Learning for Computer Vision (1/4): Image Analytics @ laSalle 2016
 
SECURE: Semantics Empowered resCUe enviRonmEnt
SECURE: Semantics Empowered resCUe enviRonmEntSECURE: Semantics Empowered resCUe enviRonmEnt
SECURE: Semantics Empowered resCUe enviRonmEnt
 
Deep Learning: a birds eye view
Deep Learning: a birds eye viewDeep Learning: a birds eye view
Deep Learning: a birds eye view
 
Synthesizing pseudo 2.5 d content from monocular videos for mixed reality
Synthesizing pseudo 2.5 d content from monocular videos for mixed realitySynthesizing pseudo 2.5 d content from monocular videos for mixed reality
Synthesizing pseudo 2.5 d content from monocular videos for mixed reality
 
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
HUMAN MOTION  DETECTION AND TRACKING FOR VIDEO SURVEILLANCEHUMAN MOTION  DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
 
IRJET - Direct Me-Nevigation for Blind People
IRJET -  	  Direct Me-Nevigation for Blind PeopleIRJET -  	  Direct Me-Nevigation for Blind People
IRJET - Direct Me-Nevigation for Blind People
 
TRACKING OF PARTIALLY OCCLUDED OBJECTS IN VIDEO SEQUENCES
TRACKING OF PARTIALLY OCCLUDED OBJECTS IN VIDEO SEQUENCESTRACKING OF PARTIALLY OCCLUDED OBJECTS IN VIDEO SEQUENCES
TRACKING OF PARTIALLY OCCLUDED OBJECTS IN VIDEO SEQUENCES
 
【ECCV 2016 BNMW】Human Action Recognition without Human
【ECCV 2016 BNMW】Human Action Recognition without Human【ECCV 2016 BNMW】Human Action Recognition without Human
【ECCV 2016 BNMW】Human Action Recognition without Human
 
Deep Learning and Reinforcement Learning
Deep Learning and Reinforcement LearningDeep Learning and Reinforcement Learning
Deep Learning and Reinforcement Learning
 
Image recognition
Image recognitionImage recognition
Image recognition
 
An Object Detection, Tracking And Parametric Classification– A Review
An Object Detection, Tracking And Parametric Classification– A ReviewAn Object Detection, Tracking And Parametric Classification– A Review
An Object Detection, Tracking And Parametric Classification– A Review
 
Visual Object Tracking: review
Visual Object Tracking: reviewVisual Object Tracking: review
Visual Object Tracking: review
 
TechnicalBackgroundOverview
TechnicalBackgroundOverviewTechnicalBackgroundOverview
TechnicalBackgroundOverview
 
Detection and Tracking of Moving Object: A Survey
Detection and Tracking of Moving Object: A SurveyDetection and Tracking of Moving Object: A Survey
Detection and Tracking of Moving Object: A Survey
 
Introduction to Object recognition
Introduction to Object recognitionIntroduction to Object recognition
Introduction to Object recognition
 
MOTION FLOW
MOTION FLOWMOTION FLOW
MOTION FLOW
 
Object recognition
Object recognitionObject recognition
Object recognition
 

Similar to When Remote Sensing Meets Artificial Intelligence

Scrdet++ analysis
Scrdet++ analysisScrdet++ analysis
Scrdet++ analysisNEHA Kapoor
 
Wang midterm-defence
Wang midterm-defenceWang midterm-defence
Wang midterm-defenceZhipeng Wang
 
Development of wearable object detection system & blind stick for visuall...
Development of wearable object detection system & blind stick for visuall...Development of wearable object detection system & blind stick for visuall...
Development of wearable object detection system & blind stick for visuall...Arkadev Kundu
 
Introduction talk to Computer Vision
Introduction talk to Computer Vision Introduction talk to Computer Vision
Introduction talk to Computer Vision Chen Sagiv
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentIJERD Editor
 
最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - 最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - Hiroshi Fukui
 
Moving object detection
Moving object detectionMoving object detection
Moving object detectionManav Mittal
 
understanding the planet using satellites and deep learning
understanding the planet using satellites and deep learningunderstanding the planet using satellites and deep learning
understanding the planet using satellites and deep learningAlbert Pujol Torras
 
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...Tulipp. Eu
 
Moving Object Detection And Tracking Using CNN
Moving Object Detection And Tracking Using CNNMoving Object Detection And Tracking Using CNN
Moving Object Detection And Tracking Using CNNNITISHKUMAR1401
 
Survey on video object detection & tracking
Survey on video object detection & trackingSurvey on video object detection & tracking
Survey on video object detection & trackingijctet
 
Fast Feature Pyramids for Object Detection
Fast Feature Pyramids for Object DetectionFast Feature Pyramids for Object Detection
Fast Feature Pyramids for Object Detectionsuthi
 
Ijmsr 2016-10
Ijmsr 2016-10Ijmsr 2016-10
Ijmsr 2016-10ijmsr
 
Integrated Hidden Markov Model and Kalman Filter for Online Object Tracking
Integrated Hidden Markov Model and Kalman Filter for Online Object TrackingIntegrated Hidden Markov Model and Kalman Filter for Online Object Tracking
Integrated Hidden Markov Model and Kalman Filter for Online Object Trackingijsrd.com
 
Object Detection and Tracking AI Robot
Object Detection and Tracking AI RobotObject Detection and Tracking AI Robot
Object Detection and Tracking AI RobotIRJET Journal
 
[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...
[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...
[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...NAVER Engineering
 
resume_Yuli_Liang
resume_Yuli_Liangresume_Yuli_Liang
resume_Yuli_LiangYuli Liang
 

Similar to When Remote Sensing Meets Artificial Intelligence (20)

Scrdet++ analysis
Scrdet++ analysisScrdet++ analysis
Scrdet++ analysis
 
Wang midterm-defence
Wang midterm-defenceWang midterm-defence
Wang midterm-defence
 
Development of wearable object detection system & blind stick for visuall...
Development of wearable object detection system & blind stick for visuall...Development of wearable object detection system & blind stick for visuall...
Development of wearable object detection system & blind stick for visuall...
 
Introduction talk to Computer Vision
Introduction talk to Computer Vision Introduction talk to Computer Vision
Introduction talk to Computer Vision
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
 
F1063337
F1063337F1063337
F1063337
 
Motion Human Detection & Tracking Based On Background Subtraction
Motion Human Detection & Tracking Based On Background SubtractionMotion Human Detection & Tracking Based On Background Subtraction
Motion Human Detection & Tracking Based On Background Subtraction
 
最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - 最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に -
 
Moving object detection
Moving object detectionMoving object detection
Moving object detection
 
Exploring EEG for object detection and retrieval
Exploring EEG  for object detection and retrievalExploring EEG  for object detection and retrieval
Exploring EEG for object detection and retrieval
 
understanding the planet using satellites and deep learning
understanding the planet using satellites and deep learningunderstanding the planet using satellites and deep learning
understanding the planet using satellites and deep learning
 
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
 
Moving Object Detection And Tracking Using CNN
Moving Object Detection And Tracking Using CNNMoving Object Detection And Tracking Using CNN
Moving Object Detection And Tracking Using CNN
 
Survey on video object detection & tracking
Survey on video object detection & trackingSurvey on video object detection & tracking
Survey on video object detection & tracking
 
Fast Feature Pyramids for Object Detection
Fast Feature Pyramids for Object DetectionFast Feature Pyramids for Object Detection
Fast Feature Pyramids for Object Detection
 
Ijmsr 2016-10
Ijmsr 2016-10Ijmsr 2016-10
Ijmsr 2016-10
 
Integrated Hidden Markov Model and Kalman Filter for Online Object Tracking
Integrated Hidden Markov Model and Kalman Filter for Online Object TrackingIntegrated Hidden Markov Model and Kalman Filter for Online Object Tracking
Integrated Hidden Markov Model and Kalman Filter for Online Object Tracking
 
Object Detection and Tracking AI Robot
Object Detection and Tracking AI RobotObject Detection and Tracking AI Robot
Object Detection and Tracking AI Robot
 
[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...
[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...
[CVPR 2018] Utilizing unlabeled or noisy labeled data (classification, detect...
 
resume_Yuli_Liang
resume_Yuli_Liangresume_Yuli_Liang
resume_Yuli_Liang
 

Recently uploaded

Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxdhanalakshmis0310
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxcallscotland1987
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfSherif Taha
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...pradhanghanshyam7136
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxnegromaestrong
 

Recently uploaded (20)

Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 

When Remote Sensing Meets Artificial Intelligence

  • 1. When Remote Sensing Meets Artificial Intelligence Taoyuan, Taiwan 4 Desember 2021 UNIVERSITAS NEGERI MAKASSAR WAHYU RAHMANIAR, Ph.D. Postdoctoral Researcher National Taipei University of Technology
  • 2. Biography Education ▪ S.Si.: Elektronika dan Instrumentasi (ELINS), Universitas Gajah Mada ▪ M.Sc. - Ph.D.: Electrical Engineering, National Central University Works Experience ▪ Laboratory Assistant, ELINS, Universitas Gajah Mada ▪ Research Assistant, Fasilkom, Universitas Indonesia ▪ Research Assistant, FTIF, Institut Teknologi Sepuluh Nopember ▪ Freelance ▪ Image Processing/Computer Vision Algorithm, Issa Technology Internship Experience ▪ Ubiik, Hsinchu, Taiwan ▪ Shibaura Institute of Technology, Toyosu, Tokyo, Japan Occupation Office: National Taipei University of Technology, Taiwan Position: Postdoctoral Researcher Research Interest Robotics, image processing, computer vision, intelligent control, and artificial intelligence
  • 3. Biography Undergraduate Research Robotic Prototype For Volcano Eruption Monitoring Based on Temperature Detection Using Amplitude Shift Keying Modulation Telemetry Master Research ▪ Sensor Integration for Real-Time Data Acquisition in Aerial Surveillance ▪ A Novel Object Detection Method based on Fuzzy Sets Theory and Speed-Up Robust Feature Doctoral Research ▪ Real-Time Automated Segmentation and Classification of Calcaneal Fractures in CT Images ▪ Real-Time Detection and Recognition of Multiple Moving Objects for Aerial Surveillance ▪ Online Digital Image Stabilization for an Unmanned Aerial Vehicle ▪ Real-Time Bi-Directional People Counting using an RGB-D Camera ▪ Distance Measurements of Unmanned Aerial Vehicles using Vision-Based Systems in Unknown Environments H-index: 4 (Google Scholar); 4 (Scopus) Journal Reviewer: Journal of Robotics and Control (JRC), Big Data and Cognitive Computing - MDPI, Sensor Review (IF: 1.442) , Circuit World (IF: 1.698) , Journal of Electronic Imaging (IF: 0.78), Scientific Reports – Nature (IF: 4.379) GitHub: https://github.com/wahyurahmaniar
  • 4. Content INTRODUCTION COMPUTERVISION APPLICATIONS RESEARCH EXAMPLE IN ARTIFICIAL INTELLIGENCE ARTIFICIAL INTELLIGENCE IN REMOTE SENSING 1 2 3 4
  • 6. Image Processing vs Computer Vision
  • 7. Image Processing Example 1) Rescaling: zoom in, zoom out, cropping 2) Correcting illumination: brightness and contrast 3) Color manipulations: black and white, gray-scale, HSV, RGB, BGR 4) Filter: mean, median, low pass, high pass, gaussian, laplacian 5) Edge detection Original Sobel Laplacian Canny
  • 8. Image Processing Example 6) Morphology: - Erode (local minimum) - Dilate (local maximum) - Opening (erosion then dilation) - Closing (dilation then erosion) - Morphology Gradient (MG) (difference between dilation and erosion) - Top Hat (TH) (difference between image and opening) - Black Hat (difference between image and closing)
  • 9. Computer Vision Example ▪ Scene modeling ▪ Object detection ▪ Object recognition ▪ Object tracking ▪ Pose estimation ▪ Motion estimation ▪ Image restoration
  • 10. Artificial Intelligence (AI) ▪ AI is intelligence demonstrated by machines, as opposed to the natural intelligence displayed by humans or animals. ▪ Goals: reasoning, problem solving, knowledge representation, planning, learning, natural language processing, perception, prediction, motion and manipulation, social intelligence, and general intelligence. ▪ Media: image, sound, and data. ▪ AI before ML and DL: rules engines, expert systems, knowledge graphs or symbolic AI. ▪ ML: supervised learning, unsupervised learning, semi-supervised learning, reinforcement learning, dimensionality reduction. ▪ DL: deep neural networks, deep belief networks, deep reinforcement learning, recurrent neural networks and convolutional neural networks.
  • 11. Machine Learning vs Deep Learning
  • 13. Traditional AI 1) Traditional computer vision method: background subtraction Disadvantage: - Can't handle some lighting changes, including when it's dark - Often detects 2 cars or more as 1 car - Slow and less accurate than machine learning and deep learning - Can't recognize the type and color of the car
  • 14. Machine Learning 2) Machine learning: Haar cascade Advantage: - Faster computation time Disadvantage: - Can't handle dark illumination - Less accurate than deep learning - Can't recognize the type and color of the car
  • 15. Deep Learning 2) Deep learning: YOLOv3 Advantage: - Can handle any illumination - Very accurate - Can recognize the type and color of the car Disadvantage: - Requires large computation
  • 17. Research 1: Moving Object Detection
  • 18. Research 1: Moving Object Detection Contribution - Real-time computing performance for the detection and recognition of multiple moving objects - Accurate detection of moving objects using a UAV in complex backgrounds Rahmaniar, W.; Wang, W.-J.; Chen, H.-C. “Real-time detection and recognition of multiple moving objects for aerial surveillance”, Electronics, vol. 8, no. 12, pp. 1373-1390, 2019. (SCI/IE, Q2, IF: 2.397) System overview of moving objects detection and recognition UAV movement modelling
  • 19. Research 1: Object Detection Cascade classifier for object detection and recognition The angular value of the motion vector: (a) (b) Optical flow estimation. (a) Original image. (b) Motion vectors. ( ) ( ) ( ) ( ) ( ) 1 180 tan x y t t t   −       =      
  • 20. Research 1: Result a. Motion vectors result without image stabilization b. Motion vectors result with image stabilization Image stabilization result Method Computation Time (fps) Accuracy (%) PR Recall F-Measure Proposed 47.08 94 91 92 Yazid et al. 1 70 76 72 Saif et al. - 66 86 74 Maier et al. - 94 89 91 Kalantar et al. 1.6 - - 73 Cai et al. 5 - - 76 Comparison of performance results
  • 21. Research 2: Bi-directional People Counting
  • 22. Research 2: Bi-directional People Counting System overview of the proposed people counting Contribution - Fast and accurate object detection in the Depth images - Map and register objects in the RGB images - Fast and accurate tracking and counting objects - Apply the algorithm to the embedded system - Detect and count objects for “in” and “out” directions simultaneously Rahmaniar, W.; Wang, W.-J.; Chiu, C.-W.; Hakim, N. L. “Real-time bi-directional people counting using an RGB-D camera”, Sensor Review, vol. 41, no. 4, pp. 341-349, 2021. (SCI/IE, Q2, IF: 1.442)
  • 23. Research 2: Object Detection (d) (e) (a) (b) (c) Foreground segmentation. (a) Background removal. (b) Closer object segmentation. (c) Farther object segmentation. (d) Binary image of the closer object. (e) Binary image of the farther object.
  • 24. Research 2: Object Registration • The registered object’s heads are mapped to an RGB image: (a) (b) Object registration. (a) Object detected in the depth RGB image. (b) Object registered in the RGB image. ( , ) ( ( , ) 2) ( , ) RGB head n t head n t R shift x y =  +
  • 25. Research 2: Result Hardware setup ➢ Current precision rate: 99%. ➢ Average computation time: 30~40 fps. ➢ Detect and track up to 6 people in a frame. Comparison of performance results Method Computation time (fps) Accuracy PC Embedded system PR Recall F-Measure Proposed 35 18 0.99 0.96 0.97 Cho et al. 20 4 0.96 0.98 0.97 Tokta et al. 10 - 0.87 - - Garcia et al. 28 - 0.96 - - Bondi et al. 20 - 0.97 0.89 0.93
  • 26. Research 3: Indoor UAV distance measurement
  • 27. Research 3: Indoor UAV distance measurement System overview Contribution - Fast and accurate distance measurement of the UAV in the building - UAV positioning in the unknown environments - Camera stabilization and positioning within UAV movement Rahmaniar, W.; Wang, W.-J.; Caesarendra, W.; Glowacz, A.; Oprzędkiewicz, K; Sulowicz, M, Irfan, M. “Distance Measurement of Unmanned Aerial Vehicles Using Vision-based Systems in Unknown Environments”, Electronics, vol. 10, no. 14, pp.1647-1660, 2021. (SCI/IE, Q2, IF: 2.397) Indoor segmentation
  • 29. Research Example ▪ Land Use and Land Cover Mapping ▪ Object Detection in Optical, Radar and Lidar Images ▪ Object Detection from Aerial and Space Platforms ▪ Moving Object Detection and Control ▪ Robotics and AI for Infrastructure Inspection and Monitoring ▪ Marine, Ocean and Climate Change Monitoring ▪ Plant Disease and Tree Health Detection
  • 30. Research 1: Small Object Detection
  • 31. Research 1: Small Object Detection ▪ To improve detection accuracy on remote sensing images, researchers have used deep convolutional neural network (CNN)-based super-resolution (SR) techniques to generate artificial images and then detect objects ▪ Generative Adversarial Network (GAN)-based methods such as super-resolution GAN (SRGAN) and enhanced super-resolution GAN (ESRGAN) showed remarkable performance in enhancing low-resolution (LR) images with and without noise ▪ These models have two subnetworks: a generator and a discriminator. Both subnetworks consist of deep CNNs. ▪ Datasets containing HR and LR image pairs are used for training and testing the models. The generator generates HR images from LR input images, and the discriminator predicts whether generated image is a real HR image or an upscaled LR image. ▪ After sufficient training, the generator generates HR images that are similar to the ground truth HR images, and the discriminator cannot correctly discriminate between real and fake images anymore.
  • 32. Research 1: Small Object Detection ▪ Detection on LR (low-resolution) images (60cm/pixel) is shown in (I); in (II), we show the detection on generated SR (super resolution) images (15 cm/pixel). ▪ The first row of this figure represents the COWC (car overhead with context) dataset, and the second row represents the OGST (oil and gas storage tank) dataset. ▪ FRCNN (faster region-based CNN) detector is used for LR images for detection. ▪ Then EESRGAN (edge-enhanced SRGAN) and FRCNN architecture (EESRGAN-FRCNN) are used to generate SR images and simultaneously detect objects from the SR images. ▪ The red bounding boxes represent true positives, and yellow bounding boxes represent false negatives.
  • 33. Research 1: Small Object Detection ▪ The generator G generates intermediate super- resolution (ISR) images, and then final SR images are generated after applying the EEN network. ▪ The discriminator (DRa) discriminates between ground truth (GT) HR images and ISR. The inverted gradients of DRa are backpropagated into the generator G in order to create SR images allowing for accurate object detection. ▪ Edge information is extracted from ISR, and the EEN network enhances these edges. Afterwards, the enhanced edges are again added to the ISR after subtracting the original edges extracted by the Laplacian operator and the output SR images are generated with enhanced edges. ▪ Finally, objects are detected from the SR images using the detector network. ▪ Two different loss functions are used for EEN: one compares the difference between SR and ground truth images, and the other compares the difference between the extracted edge from ISR and ground truth.
  • 34. Research 1: Small Object Detection
  • 35. Research 2: Building Segmentation
  • 36. Research 2: Building Segmentation ▪ A deep multi-task learning (MTL) framework is proposed to simultaneously address building footprint segmentation as the main task; building boundary extraction, and image reconstruction as auxiliary tasks for very high-resolution RS images ▪ The proposed framework takes input image, ground truth map, and boundary extracted ground truth map as an input and outputs; (1) segmentation map, (2) boundary segmentation map, and (3) reconstructed input image.
  • 37. Research 2: Building Segmentation ▪ The objectives aimed to optimize in this paper are, cross-entropy for semantic segmentation and boundary extraction tasks and mean absolute error (MAE or L1 loss) for the reconstruction task. ▪ The canny edge detection is used algorithm with a radius of 3, together with dilation to extract the boundaries from whole ground truth masks. ▪ During experiments, the input images and corresponding masks are randomly cropped by 512 x 512 patches as network inputs. The dataset is partitioned by a 7:2:1 ratio for train, validation, and test set, respectively. ▪ Data augmentation is applied to increase the dataset size artificially by deploying the following transformations: horizontal-vertical flip, additive Gaussian noise, perspective, random brightness, gamma, contrast, blur. ▪ ResNet101 is used as a backbone for the feature encoder. ▪ The encoder-decoder type network architecture is based on U-Net, with five encoder and following decoder blocks with the filter sizes of (256, 128, 64, 32, 16)
  • 38. Research 2: Building Segmentation
  • 39. Q & A ARE WELCOME