SlideShare a Scribd company logo
更具適應性的AOI	
National	Tsing	Hua	University
Min	Sun
孫民
VSLab
Myself please visit aliensunmin.github.io
Vision Science Lab (VSLab)
Analyzing
Street	Views
Understanding
Personal	Videos
3D	&	Robot	Vision Human	Sensing
Research	Topicsin ComputerVision & Machine Learning
Wearable	Camera	Applications
Make3D
3
Challenges
p AOI is similar to fine-grained Recognition
p How to adapt to changes (e.g., due to different sensors/viewpoints)?
What kind of bird? Attentionshould help
image source
http://yassersouri.github.io/pages/fast-bird-part.html
Domain shift
image source
http://vision.cs.uml.edu/adaptation.html
Domain adaption should help
AttentionModel
p Attention	for	Image	Captioning
Xu et	al.	Show,	Attend	and	Tell:	Neural	Image	Caption	Generation	with	Visual	Attention.	ICML	2015
AttentionModel
Soft-attention
hard-attention
AttentionModel
p Fine-grained	Recognition Fu	et	al.	Look	Closer	to	See	Better:	Recurrent	Attention	Convolutional	Neural	Network	for	Fine-grained	Image	Recognition.	CVPR 2017
Motivation	– Adapting	Changes
p State-of-the-art	segmenter suffers	from	domain	shift
n The	appearances	of road	scenes	are	quite	different	across	domains (cities).
Taipei	
Rio
Cairo
New York
Frankfurt
Tokyo
Motivation
p Effect	of	domain	shift:
n Domain	bias	will	result	in	inferior	performance	on	target	domain	when	one	
applies	a	segmenter trained	on	the	source	domain.
Feature Space
Linear Classifier
Frankfurt (Src)
Taipei	(Tgt)
Segmenter
trained	on	
Src Domain
Frankfurt (Src)
Taipei	(Tgt)
No	More	Discrimination:
Cross	City	Adaptation	of	Road	Scene	
Segmenters
Yi-Hsin Chen Wei-Yu Chen Yu-Ting Chen Bo-Cheng Tsai Yu-Chiang Frank Wang Min Sun
Motivation
p Goal:	use	domain	adaptation	to	mitigate	the	effect	of	domain	shift.
p Approaches:
n Supervised	Fine-Tuning:	CAN access	the	label	on	the	target	domain.
• Straightforward	but	time-consuming	and	expensive.
n Unsupervised	Adaptation:	CAN’T access	the	label	on	the	target	domain.
• More	challenging	but	low	cost.
Pixel	labeling	of	one	
Cityscapes	 image	takes	
90	minutes on	average.[4]
[4] M. Cordts, M.Omran, S. Ramos, T. Rehfeld,M. Enzweiler, R. Benenson,U.Franke,S.Roth, and B. Schiele, “The cityscapes dataset for semantic urban scene
understanding,” in CVPR,IEEE,2016.
a
Practical	in	real	life	!
Data	Collection
p Use	Google	Streetview API	to	download	images	of	different	cities.
n Randomly	sample	locations	at	each	city	to	ensure	sufficient	variations	in	visual	appearance.
p Use	Time-Machine	feature	to	collect	images	pairs	at	the	same	location	but	different	
times.	
Tokyo
Rome
Rio
Taipei
T1 T2 T1 T2
Location	B	Location	A	
TimeTime
Unlabeled Image Pairs
Same location / Different Times
Our	Dataset
p We	propose	a	new	dataset	of	complex	road	scenes,	with:
n Diverse	Appearance:	includes	4 different	cities	across	continents.
n Temporal	Information:	each	city	includes	1600 image	pairs	which	provide	helpful	supervision	
without	any	human	interaction.
n Dense	pixel	annotations :	each city includes	100	high-quality	 annotated	images.	
Please	visit	:	https://yihsinchen.github.io/segmentation_adaptation/
Overview
𝐿"#"$% = 𝐿"$'( +	𝜆* 𝐿* + 𝜆,%$'' 𝐿,%$''
Global	Domain	Alignment
Global	Domain	Alignment	
p How	to	extend	the	idea	of	domain	adversarial	learning	for	adapting	
cross-domain	image	segmentation?
n We	take	each	gridin	the	𝒇 𝒄 𝟕 feature	map	of	the	FCN-based	segmenter as	an	
instance.
Global	Domain	Alignment	
p Our	objective	is	to	minimize	 by	iteratively update	domain	classifier
and	feature	extractor	:
• and	:	the	images	from	source	and	target	domain	respectively.
• :	the	number	of	grids	in	each	map.	
• and			the	feature	maps	of	source	and	target	domain	images.
• :	the	probability	that	the	grid	n	of	image	x	belongs	to	 the	source	domain,	where	is	the	
sigmoid	function.
Class-wise	Domain	Alignment
Class-wise	Domain	Alignment	
p Let	each	class	do	domain	adversarial	learning	individually.
p But	we	must	first	address	some	problem	:
n Under	the	unsupervisedsetting,	we	don’t	have	any	label	on	target	domain	to	
link	with	source	domain.
• Can’t	do	domain	adversarial	learning	against	source	domain.
n In	global	domain	adaptation,	we	define	each	grid	𝑛 in	the	feature	space	as	one	
instance.
• Can’t	directly	use	the	labels	which	are	in	the	image(pixel)	space.	
a pseudo	label
agrid-level	soft	label	
Up-Sampling
Input	Image Network Prediction
feature	space	 Pixel	space
Class-wise	Domain	Alignment	--- Grid-Level Soft	Label
p (In	source	domain)
n Calculate	grid-wise	soft	label	Φ2
, (𝐼4)	as	the	
probability	of	grid	𝑛 belonging	to	class	𝑐:	
• 𝑖:	is	the	pixel	index	in	image	space.
• 𝑛:	is	the	grid	index	in	feature	space.
• 𝑅(𝑛):	is	the	set	of	pixels	that	correspond	 to	grid	
n.
• 𝑦= 𝐼4 : denotes	the	ground	truth	label	of	pixel	𝑖.
Pixel-Level	 Ground-truthGrid-Level	Soft	Label
Class-wise	Domain	Alignment	--- Pseudo	Label
p (In	target	domain)
n Calculate	target-domain	grid-wise	soft	
pseudo	label	Φ2
, (𝐼>)	as	the	probability	of	
grid	𝑛 belonging	to	class	𝑐:
• 𝑖:	is	the	pixel	index	in	image	space.
• 𝑛:	is	the	grid	index	in	feature	space.
• 𝑅(𝑛):	is	the	set	of	pixels	that	correspond	 to	
grid	n.
• 𝜙=
,
𝐼> : is	the	pixel-wise	soft	pseudo	label	of	
pixel	𝑖 corresponding	 to	class	c
Pixel-Level	 Pseudo	LabelGrid-Level	Soft	Label
Class-wise	Domain	Alignment
p Due	to	the	pseudo label and	soft label,	we	could	“link”	each	class	
between	source	and	target	domain.
p Using	the	same	adversarial	learning	framework	can	be	achieved.	
Road
Car
Source Domain
(Ground Truth)
Target Domain
(Pseudo Label)
High
Low
Links of
Road
Probability	
bar
Class-wise	Domain	Alignment	⎯ Static-Object
Prior
Static-Object	
Prior
• Static-objects:	building,	road,	sidewalk…etc.
• Non-static-objects:	person,	car,	motorbike…etc
Class-wise	Domain	Alignment	⎯ Static-Object	Prior	
p Download	image	pair	at	the	same	location	but	different	times
Class-wise	Domain	Alignment	⎯ Static-Object	Prior	
p Perform	Dense	Match	(find	matched	points)
Class-wise	Domain	Alignment	⎯ Static-Object	Prior	
p Identify	superpixel containing	k>=3	matched	points	as	the	static	object	
prior
Class-wise	Domain	Alignment---Static-Object	Prior	
p Use static-object prior to refine pseudo label.
p For	pixel	that	belongs	to	static-object	prior,	we	suppress its	probability	
corresponding	to	non-static	objects.
• 𝑃'"$"=,(𝐼>) :	the	set	of	pixels	belong	to	static-object	prior	.
• 𝐶'"$"=,:	the	set	of	static-object	classes.
• Static-objects:	building,	 road,	sidewalk…etc.
• Non-static-objects:	 person,	car,	motorbike…etc
Experiments
p We	adapt	a	model	pretrained on	
Cityscapes	to	other	cities	in	our	
dataset.
nSource	domain:
• The	training	set	of	Cityscapes.
• 2975	road	scene	images	with	annotation.
nTarget	domain:
• 4 different	cities	of	our	datasets.
• Each	city	have	1600	images	without any	
annotation.
adapt
Experiments	⎯ Quantitative	Results
Experiments ⎯ Quantitative	Results
• Global	alignment	method	contributes	2.6%	mIoU gain.
Experiments	⎯ Quantitative	Results
• Class-wise	alignment	method	also	contributes	0.9%	mIoU gain.
Experiments	⎯ Quantitative	Results
• The	static-object	priors	contributes	another	0.6%	mIOU improvement.
Experiments	⎯ T-SNE	Visualization
p From	pre-trained,	GA	only,	to	
GA+CA(prior),	we	could	observe	
the	bias	between	domains	keep	
decreasing.
nGA	stands	for	Global	Domain	
Alignment
nCA	stands	for	Class-wise	Domain	
Alignment
Experiments ⎯ Typical	Examples
Recap
p AOI is similar to fine-grained Recognition
p How to adapt to changes (e.g., due to different sensors/viewpoints)?
What kind of bird? Attentionshould help
image source
http://yassersouri.github.io/pages/fast-bird-part.html
Domain shift
image source
http://vision.cs.uml.edu/adaptation.html
Domain adaption should help
Thank you

More Related Content

What's hot

Recent Progress on Single-Image Super-Resolution
Recent Progress on Single-Image Super-ResolutionRecent Progress on Single-Image Super-Resolution
Recent Progress on Single-Image Super-Resolution
Hiroto Honda
 
Recent Breakthroughs in AI + Learning Visual-Linguistic Representation in the...
Recent Breakthroughs in AI + Learning Visual-Linguistic Representation in the...Recent Breakthroughs in AI + Learning Visual-Linguistic Representation in the...
Recent Breakthroughs in AI + Learning Visual-Linguistic Representation in the...
Sangmin Woo
 
PR-355: Masked Autoencoders Are Scalable Vision Learners
PR-355: Masked Autoencoders Are Scalable Vision LearnersPR-355: Masked Autoencoders Are Scalable Vision Learners
PR-355: Masked Autoencoders Are Scalable Vision Learners
Jinwon Lee
 
A brief introduction to recent segmentation methods
A brief introduction to recent segmentation methodsA brief introduction to recent segmentation methods
A brief introduction to recent segmentation methods
Shunta Saito
 
160205 NeuralArt - Understanding Neural Representation
160205 NeuralArt - Understanding Neural Representation160205 NeuralArt - Understanding Neural Representation
160205 NeuralArt - Understanding Neural Representation
Junho Cho
 
ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]
Dongmin Choi
 
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Learn to Build an App to Find Similar Images using Deep Learning- Piotr TeterwakLearn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
PyData
 
Recent Object Detection Research & Person Detection
Recent Object Detection Research & Person DetectionRecent Object Detection Research & Person Detection
Recent Object Detection Research & Person Detection
Kai-Wen Zhao
 
Master Thesis of Computer Engineering SuperResoluton Giuseppe Caliendo
Master Thesis of Computer Engineering SuperResoluton Giuseppe CaliendoMaster Thesis of Computer Engineering SuperResoluton Giuseppe Caliendo
Master Thesis of Computer Engineering SuperResoluton Giuseppe Caliendo
GiuseppeCaliendo2
 
Image Translation with GAN
Image Translation with GANImage Translation with GAN
Image Translation with GAN
Junho Cho
 
Instance Segmentation - Míriam Bellver - UPC Barcelona 2018
Instance Segmentation - Míriam Bellver - UPC Barcelona 2018Instance Segmentation - Míriam Bellver - UPC Barcelona 2018
Instance Segmentation - Míriam Bellver - UPC Barcelona 2018
Universitat Politècnica de Catalunya
 
Visual Saliency Prediction with Deep Learning - Kevin McGuinness - UPC Barcel...
Visual Saliency Prediction with Deep Learning - Kevin McGuinness - UPC Barcel...Visual Saliency Prediction with Deep Learning - Kevin McGuinness - UPC Barcel...
Visual Saliency Prediction with Deep Learning - Kevin McGuinness - UPC Barcel...
Universitat Politècnica de Catalunya
 
Transformer in Computer Vision
Transformer in Computer VisionTransformer in Computer Vision
Transformer in Computer Vision
Dongmin Choi
 
Deep Learning for Computer Vision: Object Detection (UPC 2016)
Deep Learning for Computer Vision: Object Detection (UPC 2016)Deep Learning for Computer Vision: Object Detection (UPC 2016)
Deep Learning for Computer Vision: Object Detection (UPC 2016)
Universitat Politècnica de Catalunya
 
Face Recognition - Elisa Sayrol - UPC Barcelona 2018
Face Recognition - Elisa Sayrol - UPC Barcelona 2018Face Recognition - Elisa Sayrol - UPC Barcelona 2018
Face Recognition - Elisa Sayrol - UPC Barcelona 2018
Universitat Politècnica de Catalunya
 
Deformable DETR Review [CDM]
Deformable DETR Review [CDM]Deformable DETR Review [CDM]
Deformable DETR Review [CDM]
Dongmin Choi
 
Review: You Only Look One-level Feature
Review: You Only Look One-level FeatureReview: You Only Look One-level Feature
Review: You Only Look One-level Feature
Dongmin Choi
 
Adaptive object detection using adjacency and zoom prediction
Adaptive object detection using adjacency and zoom predictionAdaptive object detection using adjacency and zoom prediction
Adaptive object detection using adjacency and zoom prediction
Universitat Politècnica de Catalunya
 
150807 Fast R-CNN
150807 Fast R-CNN150807 Fast R-CNN
150807 Fast R-CNN
Junho Cho
 

What's hot (20)

Recent Progress on Single-Image Super-Resolution
Recent Progress on Single-Image Super-ResolutionRecent Progress on Single-Image Super-Resolution
Recent Progress on Single-Image Super-Resolution
 
Recent Breakthroughs in AI + Learning Visual-Linguistic Representation in the...
Recent Breakthroughs in AI + Learning Visual-Linguistic Representation in the...Recent Breakthroughs in AI + Learning Visual-Linguistic Representation in the...
Recent Breakthroughs in AI + Learning Visual-Linguistic Representation in the...
 
PR-355: Masked Autoencoders Are Scalable Vision Learners
PR-355: Masked Autoencoders Are Scalable Vision LearnersPR-355: Masked Autoencoders Are Scalable Vision Learners
PR-355: Masked Autoencoders Are Scalable Vision Learners
 
A brief introduction to recent segmentation methods
A brief introduction to recent segmentation methodsA brief introduction to recent segmentation methods
A brief introduction to recent segmentation methods
 
160205 NeuralArt - Understanding Neural Representation
160205 NeuralArt - Understanding Neural Representation160205 NeuralArt - Understanding Neural Representation
160205 NeuralArt - Understanding Neural Representation
 
Computer Vision Introduction
Computer Vision IntroductionComputer Vision Introduction
Computer Vision Introduction
 
ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]ViT (Vision Transformer) Review [CDM]
ViT (Vision Transformer) Review [CDM]
 
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Learn to Build an App to Find Similar Images using Deep Learning- Piotr TeterwakLearn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
 
Recent Object Detection Research & Person Detection
Recent Object Detection Research & Person DetectionRecent Object Detection Research & Person Detection
Recent Object Detection Research & Person Detection
 
Master Thesis of Computer Engineering SuperResoluton Giuseppe Caliendo
Master Thesis of Computer Engineering SuperResoluton Giuseppe CaliendoMaster Thesis of Computer Engineering SuperResoluton Giuseppe Caliendo
Master Thesis of Computer Engineering SuperResoluton Giuseppe Caliendo
 
Image Translation with GAN
Image Translation with GANImage Translation with GAN
Image Translation with GAN
 
Instance Segmentation - Míriam Bellver - UPC Barcelona 2018
Instance Segmentation - Míriam Bellver - UPC Barcelona 2018Instance Segmentation - Míriam Bellver - UPC Barcelona 2018
Instance Segmentation - Míriam Bellver - UPC Barcelona 2018
 
Visual Saliency Prediction with Deep Learning - Kevin McGuinness - UPC Barcel...
Visual Saliency Prediction with Deep Learning - Kevin McGuinness - UPC Barcel...Visual Saliency Prediction with Deep Learning - Kevin McGuinness - UPC Barcel...
Visual Saliency Prediction with Deep Learning - Kevin McGuinness - UPC Barcel...
 
Transformer in Computer Vision
Transformer in Computer VisionTransformer in Computer Vision
Transformer in Computer Vision
 
Deep Learning for Computer Vision: Object Detection (UPC 2016)
Deep Learning for Computer Vision: Object Detection (UPC 2016)Deep Learning for Computer Vision: Object Detection (UPC 2016)
Deep Learning for Computer Vision: Object Detection (UPC 2016)
 
Face Recognition - Elisa Sayrol - UPC Barcelona 2018
Face Recognition - Elisa Sayrol - UPC Barcelona 2018Face Recognition - Elisa Sayrol - UPC Barcelona 2018
Face Recognition - Elisa Sayrol - UPC Barcelona 2018
 
Deformable DETR Review [CDM]
Deformable DETR Review [CDM]Deformable DETR Review [CDM]
Deformable DETR Review [CDM]
 
Review: You Only Look One-level Feature
Review: You Only Look One-level FeatureReview: You Only Look One-level Feature
Review: You Only Look One-level Feature
 
Adaptive object detection using adjacency and zoom prediction
Adaptive object detection using adjacency and zoom predictionAdaptive object detection using adjacency and zoom prediction
Adaptive object detection using adjacency and zoom prediction
 
150807 Fast R-CNN
150807 Fast R-CNN150807 Fast R-CNN
150807 Fast R-CNN
 

Similar to 更適應性的AOI-深度強化學習之應用

5 ray casting computer graphics
5 ray casting computer graphics5 ray casting computer graphics
5 ray casting computer graphics
cairo university
 
Computer Graphics Notes
Computer Graphics NotesComputer Graphics Notes
Computer Graphics Notes
Gurpreet singh
 
Image segmentation with deep learning
Image segmentation with deep learningImage segmentation with deep learning
Image segmentation with deep learning
Antonio Rueda-Toicen
 
Mirko Lucchese - Deep Image Processing
Mirko Lucchese - Deep Image ProcessingMirko Lucchese - Deep Image Processing
Mirko Lucchese - Deep Image Processing
MeetupDataScienceRoma
 
Lecture1
Lecture1Lecture1
Lecture1
Mobeen Mustafa
 
Conception_et_realisation_dun_site_Web_d.pdf
Conception_et_realisation_dun_site_Web_d.pdfConception_et_realisation_dun_site_Web_d.pdf
Conception_et_realisation_dun_site_Web_d.pdf
SofianeHassine2
 
IRJET- Wearable AI Device for Blind
IRJET- Wearable AI Device for BlindIRJET- Wearable AI Device for Blind
IRJET- Wearable AI Device for Blind
IRJET Journal
 
Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...
Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...
Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...
Catalina Arango
 
Seeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper reviewSeeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper review
QuantUniversity
 
Cartoonization of images using machine Learning
Cartoonization of images using machine LearningCartoonization of images using machine Learning
Cartoonization of images using machine Learning
IRJET Journal
 
Build Your Own 3D Scanner: 3D Scanning with Swept-Planes
Build Your Own 3D Scanner: 3D Scanning with Swept-PlanesBuild Your Own 3D Scanner: 3D Scanning with Swept-Planes
Build Your Own 3D Scanner: 3D Scanning with Swept-Planes
Douglas Lanman
 
16 OpenCV Functions to Start your Computer Vision journey.docx
16 OpenCV Functions to Start your Computer Vision journey.docx16 OpenCV Functions to Start your Computer Vision journey.docx
16 OpenCV Functions to Start your Computer Vision journey.docx
ssuser90e017
 
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDSFACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
IRJET Journal
 
Log polar coordinates
Log polar coordinatesLog polar coordinates
Log polar coordinates
Oğul Göçmen
 
IMAGE GENERATION FROM CAPTION
IMAGE GENERATION FROM CAPTIONIMAGE GENERATION FROM CAPTION
IMAGE GENERATION FROM CAPTION
ijscai
 
Image Generation from Caption
Image Generation from Caption Image Generation from Caption
Image Generation from Caption
IJSCAI Journal
 
Scene Description From Images To Sentences
Scene Description From Images To SentencesScene Description From Images To Sentences
Scene Description From Images To Sentences
IRJET Journal
 
Substanceshanghaippt repacked
Substanceshanghaippt repackedSubstanceshanghaippt repacked
Substanceshanghaippt repacked
Lee Jungpyo
 
Currency recognition on mobile phones
Currency recognition on mobile phonesCurrency recognition on mobile phones
Currency recognition on mobile phones
habeebsab
 

Similar to 更適應性的AOI-深度強化學習之應用 (20)

5 ray casting computer graphics
5 ray casting computer graphics5 ray casting computer graphics
5 ray casting computer graphics
 
Computer Graphics Notes
Computer Graphics NotesComputer Graphics Notes
Computer Graphics Notes
 
Image segmentation with deep learning
Image segmentation with deep learningImage segmentation with deep learning
Image segmentation with deep learning
 
Mirko Lucchese - Deep Image Processing
Mirko Lucchese - Deep Image ProcessingMirko Lucchese - Deep Image Processing
Mirko Lucchese - Deep Image Processing
 
Lecture1
Lecture1Lecture1
Lecture1
 
Conception_et_realisation_dun_site_Web_d.pdf
Conception_et_realisation_dun_site_Web_d.pdfConception_et_realisation_dun_site_Web_d.pdf
Conception_et_realisation_dun_site_Web_d.pdf
 
IRJET- Wearable AI Device for Blind
IRJET- Wearable AI Device for BlindIRJET- Wearable AI Device for Blind
IRJET- Wearable AI Device for Blind
 
Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...
Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...
Let's paint a Picasso - A Look at Generative Adversarial Networks (GAN) and i...
 
Seeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper reviewSeeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper review
 
Cartoonization of images using machine Learning
Cartoonization of images using machine LearningCartoonization of images using machine Learning
Cartoonization of images using machine Learning
 
Build Your Own 3D Scanner: 3D Scanning with Swept-Planes
Build Your Own 3D Scanner: 3D Scanning with Swept-PlanesBuild Your Own 3D Scanner: 3D Scanning with Swept-Planes
Build Your Own 3D Scanner: 3D Scanning with Swept-Planes
 
16 OpenCV Functions to Start your Computer Vision journey.docx
16 OpenCV Functions to Start your Computer Vision journey.docx16 OpenCV Functions to Start your Computer Vision journey.docx
16 OpenCV Functions to Start your Computer Vision journey.docx
 
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDSFACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
 
Log polar coordinates
Log polar coordinatesLog polar coordinates
Log polar coordinates
 
IMAGE GENERATION FROM CAPTION
IMAGE GENERATION FROM CAPTIONIMAGE GENERATION FROM CAPTION
IMAGE GENERATION FROM CAPTION
 
Image Generation from Caption
Image Generation from Caption Image Generation from Caption
Image Generation from Caption
 
Scene Description From Images To Sentences
Scene Description From Images To SentencesScene Description From Images To Sentences
Scene Description From Images To Sentences
 
Substanceshanghaippt repacked
Substanceshanghaippt repackedSubstanceshanghaippt repacked
Substanceshanghaippt repacked
 
Currency recognition on mobile phones
Currency recognition on mobile phonesCurrency recognition on mobile phones
Currency recognition on mobile phones
 
427lects
427lects427lects
427lects
 

More from CHENHuiMei

小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵
小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵
小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵
CHENHuiMei
 
QIF對AOI設備業之衝擊與機會
QIF對AOI設備業之衝擊與機會QIF對AOI設備業之衝擊與機會
QIF對AOI設備業之衝擊與機會
CHENHuiMei
 
產研融合推手-台大AOI設備研發聯盟_台大陳亮嘉
產研融合推手-台大AOI設備研發聯盟_台大陳亮嘉產研融合推手-台大AOI設備研發聯盟_台大陳亮嘉
產研融合推手-台大AOI設備研發聯盟_台大陳亮嘉
CHENHuiMei
 
基於少樣本深度學習之橡膠墊片檢測系統
基於少樣本深度學習之橡膠墊片檢測系統基於少樣本深度學習之橡膠墊片檢測系統
基於少樣本深度學習之橡膠墊片檢測系統
CHENHuiMei
 
AOI智慧升級─AI訓練師在地養成計畫_台灣人工智慧學校
AOI智慧升級─AI訓練師在地養成計畫_台灣人工智慧學校AOI智慧升級─AI訓練師在地養成計畫_台灣人工智慧學校
AOI智慧升級─AI訓練師在地養成計畫_台灣人工智慧學校
CHENHuiMei
 
IIoT發展趨勢及設備業者因應之_微軟葉怡君
IIoT發展趨勢及設備業者因應之_微軟葉怡君IIoT發展趨勢及設備業者因應之_微軟葉怡君
IIoT發展趨勢及設備業者因應之_微軟葉怡君
CHENHuiMei
 
精密機械的空間軌跡精度光學檢測法_台大范光照
精密機械的空間軌跡精度光學檢測法_台大范光照精密機械的空間軌跡精度光學檢測法_台大范光照
精密機械的空間軌跡精度光學檢測法_台大范光照
CHENHuiMei
 
Report
ReportReport
Report
CHENHuiMei
 
Deep learning
Deep learningDeep learning
Deep learning
CHENHuiMei
 
When AOI meets AI
When AOI meets AIWhen AOI meets AI
When AOI meets AI
CHENHuiMei
 
2018AOI論壇_基於生成對抗網路之非監督式AOI技術_工研院蔡雅惠
2018AOI論壇_基於生成對抗網路之非監督式AOI技術_工研院蔡雅惠2018AOI論壇_基於生成對抗網路之非監督式AOI技術_工研院蔡雅惠
2018AOI論壇_基於生成對抗網路之非監督式AOI技術_工研院蔡雅惠
CHENHuiMei
 
2018AOIEA論壇Keynote_眺望趨勢 量測設備未來10年發展重點_致茂曾一士
2018AOIEA論壇Keynote_眺望趨勢 量測設備未來10年發展重點_致茂曾一士2018AOIEA論壇Keynote_眺望趨勢 量測設備未來10年發展重點_致茂曾一士
2018AOIEA論壇Keynote_眺望趨勢 量測設備未來10年發展重點_致茂曾一士
CHENHuiMei
 
2018AOI論壇Keynote_AI入魂製造領域現況與趨勢_工研院熊治民
2018AOI論壇Keynote_AI入魂製造領域現況與趨勢_工研院熊治民2018AOI論壇Keynote_AI入魂製造領域現況與趨勢_工研院熊治民
2018AOI論壇Keynote_AI入魂製造領域現況與趨勢_工研院熊治民
CHENHuiMei
 
2018AOI論壇_AOI and IoT產線應用_工研院周森益
2018AOI論壇_AOI and IoT產線應用_工研院周森益2018AOI論壇_AOI and IoT產線應用_工研院周森益
2018AOI論壇_AOI and IoT產線應用_工研院周森益
CHENHuiMei
 
2018AOI論壇_AOI參與整廠協作之實務建議_達明機器人黃鐘賢
2018AOI論壇_AOI參與整廠協作之實務建議_達明機器人黃鐘賢2018AOI論壇_AOI參與整廠協作之實務建議_達明機器人黃鐘賢
2018AOI論壇_AOI參與整廠協作之實務建議_達明機器人黃鐘賢
CHENHuiMei
 
2018AOI論壇_深度學習在電腦視覺應用上的疑問_中央大學曾定章
2018AOI論壇_深度學習在電腦視覺應用上的疑問_中央大學曾定章2018AOI論壇_深度學習在電腦視覺應用上的疑問_中央大學曾定章
2018AOI論壇_深度學習在電腦視覺應用上的疑問_中央大學曾定章
CHENHuiMei
 
2018AOI論壇_時機已到 AOI導入邊緣運算_SAS林育宏
2018AOI論壇_時機已到 AOI導入邊緣運算_SAS林育宏2018AOI論壇_時機已到 AOI導入邊緣運算_SAS林育宏
2018AOI論壇_時機已到 AOI導入邊緣運算_SAS林育宏
CHENHuiMei
 
2018AOI論壇_如何導入深度學習來提升工業瑕疵檢測技術_工研院賴璟皓
2018AOI論壇_如何導入深度學習來提升工業瑕疵檢測技術_工研院賴璟皓2018AOI論壇_如何導入深度學習來提升工業瑕疵檢測技術_工研院賴璟皓
2018AOI論壇_如何導入深度學習來提升工業瑕疵檢測技術_工研院賴璟皓
CHENHuiMei
 
200704 Dr. Schenk 產品策略
200704 Dr. Schenk 產品策略200704 Dr. Schenk 產品策略
200704 Dr. Schenk 產品策略
CHENHuiMei
 
2007 TFT LCD-AOI教學檔案
2007 TFT LCD-AOI教學檔案2007 TFT LCD-AOI教學檔案
2007 TFT LCD-AOI教學檔案
CHENHuiMei
 

More from CHENHuiMei (20)

小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵
小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵
小數據如何實現電腦視覺,微軟AI研究首席剖析關鍵
 
QIF對AOI設備業之衝擊與機會
QIF對AOI設備業之衝擊與機會QIF對AOI設備業之衝擊與機會
QIF對AOI設備業之衝擊與機會
 
產研融合推手-台大AOI設備研發聯盟_台大陳亮嘉
產研融合推手-台大AOI設備研發聯盟_台大陳亮嘉產研融合推手-台大AOI設備研發聯盟_台大陳亮嘉
產研融合推手-台大AOI設備研發聯盟_台大陳亮嘉
 
基於少樣本深度學習之橡膠墊片檢測系統
基於少樣本深度學習之橡膠墊片檢測系統基於少樣本深度學習之橡膠墊片檢測系統
基於少樣本深度學習之橡膠墊片檢測系統
 
AOI智慧升級─AI訓練師在地養成計畫_台灣人工智慧學校
AOI智慧升級─AI訓練師在地養成計畫_台灣人工智慧學校AOI智慧升級─AI訓練師在地養成計畫_台灣人工智慧學校
AOI智慧升級─AI訓練師在地養成計畫_台灣人工智慧學校
 
IIoT發展趨勢及設備業者因應之_微軟葉怡君
IIoT發展趨勢及設備業者因應之_微軟葉怡君IIoT發展趨勢及設備業者因應之_微軟葉怡君
IIoT發展趨勢及設備業者因應之_微軟葉怡君
 
精密機械的空間軌跡精度光學檢測法_台大范光照
精密機械的空間軌跡精度光學檢測法_台大范光照精密機械的空間軌跡精度光學檢測法_台大范光照
精密機械的空間軌跡精度光學檢測法_台大范光照
 
Report
ReportReport
Report
 
Deep learning
Deep learningDeep learning
Deep learning
 
When AOI meets AI
When AOI meets AIWhen AOI meets AI
When AOI meets AI
 
2018AOI論壇_基於生成對抗網路之非監督式AOI技術_工研院蔡雅惠
2018AOI論壇_基於生成對抗網路之非監督式AOI技術_工研院蔡雅惠2018AOI論壇_基於生成對抗網路之非監督式AOI技術_工研院蔡雅惠
2018AOI論壇_基於生成對抗網路之非監督式AOI技術_工研院蔡雅惠
 
2018AOIEA論壇Keynote_眺望趨勢 量測設備未來10年發展重點_致茂曾一士
2018AOIEA論壇Keynote_眺望趨勢 量測設備未來10年發展重點_致茂曾一士2018AOIEA論壇Keynote_眺望趨勢 量測設備未來10年發展重點_致茂曾一士
2018AOIEA論壇Keynote_眺望趨勢 量測設備未來10年發展重點_致茂曾一士
 
2018AOI論壇Keynote_AI入魂製造領域現況與趨勢_工研院熊治民
2018AOI論壇Keynote_AI入魂製造領域現況與趨勢_工研院熊治民2018AOI論壇Keynote_AI入魂製造領域現況與趨勢_工研院熊治民
2018AOI論壇Keynote_AI入魂製造領域現況與趨勢_工研院熊治民
 
2018AOI論壇_AOI and IoT產線應用_工研院周森益
2018AOI論壇_AOI and IoT產線應用_工研院周森益2018AOI論壇_AOI and IoT產線應用_工研院周森益
2018AOI論壇_AOI and IoT產線應用_工研院周森益
 
2018AOI論壇_AOI參與整廠協作之實務建議_達明機器人黃鐘賢
2018AOI論壇_AOI參與整廠協作之實務建議_達明機器人黃鐘賢2018AOI論壇_AOI參與整廠協作之實務建議_達明機器人黃鐘賢
2018AOI論壇_AOI參與整廠協作之實務建議_達明機器人黃鐘賢
 
2018AOI論壇_深度學習在電腦視覺應用上的疑問_中央大學曾定章
2018AOI論壇_深度學習在電腦視覺應用上的疑問_中央大學曾定章2018AOI論壇_深度學習在電腦視覺應用上的疑問_中央大學曾定章
2018AOI論壇_深度學習在電腦視覺應用上的疑問_中央大學曾定章
 
2018AOI論壇_時機已到 AOI導入邊緣運算_SAS林育宏
2018AOI論壇_時機已到 AOI導入邊緣運算_SAS林育宏2018AOI論壇_時機已到 AOI導入邊緣運算_SAS林育宏
2018AOI論壇_時機已到 AOI導入邊緣運算_SAS林育宏
 
2018AOI論壇_如何導入深度學習來提升工業瑕疵檢測技術_工研院賴璟皓
2018AOI論壇_如何導入深度學習來提升工業瑕疵檢測技術_工研院賴璟皓2018AOI論壇_如何導入深度學習來提升工業瑕疵檢測技術_工研院賴璟皓
2018AOI論壇_如何導入深度學習來提升工業瑕疵檢測技術_工研院賴璟皓
 
200704 Dr. Schenk 產品策略
200704 Dr. Schenk 產品策略200704 Dr. Schenk 產品策略
200704 Dr. Schenk 產品策略
 
2007 TFT LCD-AOI教學檔案
2007 TFT LCD-AOI教學檔案2007 TFT LCD-AOI教學檔案
2007 TFT LCD-AOI教學檔案
 

Recently uploaded

Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
ThomasParaiso2
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
Alex Pruden
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 

Recently uploaded (20)

Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 

更適應性的AOI-深度強化學習之應用