The following resources come from the 2009/10 BSc (Hons) in Multimedia Technology (course number 2ELE0075) from the University of Hertfordshire. All the mini projects are designed as level two modules of the undergraduate programmes.
The objectives of this project are to demonstrate abilities to:
• Handle camera setup, calibrate and capture still and video faces
• Pre-process images and extract features
• Perform face recognition by a) using existing methods and b) trying new techniques.
This project requires the students to apply their abilities to handle image capture hardware and software. Since this is an active area of research, students will need to perform literature survey and discuss ( through brainstorm sessions) their performance characteristics. In addition, they will need to design and implement pre-processing and recognition codes leading to face recognition.
The following resources come from the 2009/10 BSc (Hons) in Multimedia Technology (course number 2ELE0075) from the University of Hertfordshire. All the mini projects are designed as level two modules of the undergraduate programmes.
The objectives of this project are to demonstrate abilities to:
• Handle camera setup, calibrate and capture still and video faces
• Pre-process images and extract features
• Perform face recognition by a) using existing methods and b) trying new techniques.
This project requires the students to apply their abilities to handle image capture hardware and software. Since this is an active area of research, students will need to perform literature survey and discuss ( through brainstorm sessions) their performance characteristics. In addition, they will need to design and implement pre-processing and recognition codes leading to face recognition.
Image processing is among rapidly growing technologies today, with its applications in various aspects of a business. Image Processing forms core research area within electronics engineering and computer science disciplines too. Image Processing is a technique to enhance raw images received from satellites, space probes, aircrafts, military reconnaissance flights or pictures taken in normal day-to-day life from normal cameras. The field is becoming powerful and popular because of technically powerful personal computers, large memories of available devices as well as graphic softwares and tools available with that devices and gadgets. Image acquisition, pre-processing, segmentation, representation, recognition and interpretation are the different basic steps through which image processing is carried out. [3][4].
Hyperloop is the new mode of transportation after air, water, rails and roads. It could be a realistic high speed as well as economical way of transportation apart from a fantasized means of transportation called the "teleportation".
Lung Cancer Detection on CT Images by using Image Processingijtsrd
This project is mainly based on image processing technique. In this work MATLAB have been used through every procedure made. Image processing techniques are widely use in bio-medical sector. The objective of our work is noise removal operation, thresholding, gray scale imaging, histogram equalization, texture segmentation, and morphological operation. Detection of lung cancer from computed tomography (CT) images is done by using MATLAB software. By using these methods the work has been done on CT images and the final tumor area has been shown with pixel values. Bindiya Patel | Dr. Pankaj Kumar Mishra | Prof. Amit Kolhe"Lung Cancer Detection on CT Images by using Image Processing" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-2 | Issue-3 , April 2018, URL: http://www.ijtsrd.com/papers/ijtsrd11674.pdf http://www.ijtsrd.com/engineering/electronics-and-communication-engineering/11674/lung-cancer-detection-on-ct-images-by-using-image-processing/bindiya-patel
Image segmentation techniques
More information on this research can be found in:
Hussein, Rania, Frederic D. McKenzie. “Identifying Ambiguous Prostate Gland Contours from Histology Using Capsule Shape Information and Least Squares Curve Fitting.” The International Journal of Computer Assisted Radiology and Surgery ( IJCARS), Volume 2 Numbers 3-4, pp. 143-150, December 2007.
신종주(isaac.shin) / kakao corp.(멀티미디어처리파트)
---
Computer Vision 의 여러 영역 중 얼굴인식은 실생활에 가장 유용한 분야 중 하나입니다. 얼굴 인식도 딥러닝을 이용하면서 성능이 많이 향상되었습니다. 얼굴 인식 분야가 어떻게 발전되었고, 최근에는 어떤 연구가 진행 중인지 알아보겠습니다. 마지막으로 카카오에서 얼굴 인식 적용사례를 소개합니다.
Polygon is a figure having many slides. It may be represented as a number of line segments end to end to form a closed figure.
The line segments which form the boundary of the polygon are called edges or slides of the polygon.
The end of the side is called the polygon vertices.
Triangle is the most simple form of polygon having three side and three vertices.
The polygon may be of any shape.
Image processing is among rapidly growing technologies today, with its applications in various aspects of a business. Image Processing forms core research area within electronics engineering and computer science disciplines too. Image Processing is a technique to enhance raw images received from satellites, space probes, aircrafts, military reconnaissance flights or pictures taken in normal day-to-day life from normal cameras. The field is becoming powerful and popular because of technically powerful personal computers, large memories of available devices as well as graphic softwares and tools available with that devices and gadgets. Image acquisition, pre-processing, segmentation, representation, recognition and interpretation are the different basic steps through which image processing is carried out. [3][4].
Hyperloop is the new mode of transportation after air, water, rails and roads. It could be a realistic high speed as well as economical way of transportation apart from a fantasized means of transportation called the "teleportation".
Lung Cancer Detection on CT Images by using Image Processingijtsrd
This project is mainly based on image processing technique. In this work MATLAB have been used through every procedure made. Image processing techniques are widely use in bio-medical sector. The objective of our work is noise removal operation, thresholding, gray scale imaging, histogram equalization, texture segmentation, and morphological operation. Detection of lung cancer from computed tomography (CT) images is done by using MATLAB software. By using these methods the work has been done on CT images and the final tumor area has been shown with pixel values. Bindiya Patel | Dr. Pankaj Kumar Mishra | Prof. Amit Kolhe"Lung Cancer Detection on CT Images by using Image Processing" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-2 | Issue-3 , April 2018, URL: http://www.ijtsrd.com/papers/ijtsrd11674.pdf http://www.ijtsrd.com/engineering/electronics-and-communication-engineering/11674/lung-cancer-detection-on-ct-images-by-using-image-processing/bindiya-patel
Image segmentation techniques
More information on this research can be found in:
Hussein, Rania, Frederic D. McKenzie. “Identifying Ambiguous Prostate Gland Contours from Histology Using Capsule Shape Information and Least Squares Curve Fitting.” The International Journal of Computer Assisted Radiology and Surgery ( IJCARS), Volume 2 Numbers 3-4, pp. 143-150, December 2007.
신종주(isaac.shin) / kakao corp.(멀티미디어처리파트)
---
Computer Vision 의 여러 영역 중 얼굴인식은 실생활에 가장 유용한 분야 중 하나입니다. 얼굴 인식도 딥러닝을 이용하면서 성능이 많이 향상되었습니다. 얼굴 인식 분야가 어떻게 발전되었고, 최근에는 어떤 연구가 진행 중인지 알아보겠습니다. 마지막으로 카카오에서 얼굴 인식 적용사례를 소개합니다.
Polygon is a figure having many slides. It may be represented as a number of line segments end to end to form a closed figure.
The line segments which form the boundary of the polygon are called edges or slides of the polygon.
The end of the side is called the polygon vertices.
Triangle is the most simple form of polygon having three side and three vertices.
The polygon may be of any shape.
we talk about some cases of trouble shooting and how it can impact to java performance. Also, we introduce some kind of tools for checking matters efficiently and approaching easy to user.
3월달 "강화학습의 이론과 실제" 로 강의했던 강의자료 배포합니다.
1.Dynamic Programming
2.Policy iteration
3.Value iteration
4.Monte Carlo method
5.Temporal-Difference Learning
6.Sarsa
7.Q-learning
8.딥러닝 프레임워크 케라스 소개 및 슈퍼마리오 환경 구축
9.DQN을 이용한 인공지능 슈퍼마리오 만들기
이 흐름으로 강의를 했는데
브레이크아웃 설명은 양혁렬 (Hyuk Ryeol Yang)님의 코드를 참고 하였고
8번,9번은 새로운 환경이 나왔으니 무시해도 좋겠습니다.
이 환경에 대한 자료는 주말까지 작성하고 업로드 할 예정입니다.
2. CNN Visualization technique
Implementation Detail
• 가장 결과가 좋은 Grad-CAM과, 그와 같이 쓸 수 있는 Guided
Backpropagation을 pytorch(0.4.0)로 구현하는 법에 대해 알아보자!
• pytorch의 hook 함수와 opencv의 여러가지 유틸 함수를 사용하면 쉽게
구현할 수 있다. pytorch에 익숙치 않은 사람에게 적절한 난이도
3. Overall
• 위의 이미지에서 시각화에 준 condition(Cat / Dog)에 따라 그에
대응되는 부분만 표시해주는 것을 볼 수 있음
• 이미지에서 heatmap을 계산하는 것이므로 앞에서 나온 Guided
Backpropagation과 결합해 Guided-Grad-CAM으로도 사용 가능
5. Overall
• 목표는 앞의 내용과 같이 임의의 주어진 pytorch CNN model에 대해 쉽
게 Grad-CAM을 적용하게 코딩하는 것
• 모델을 학습하는 것이 주된 내용이 아니므로 모델은 torchvision에서 제공
하는 ResNet101 pretrained model을 사용!
6. Utils
• 제일 중요한 2가지 util 함수
1. 이미지 전처리 함수
2. heatmap 구성 함수
• 이미지 전처리의 경우 주어진 모델의 test image preprocessing을
따르도록 하는 것이 좋음
• heatmap의 경우 opencv의 컬러맵을 활용
7. Utils
• torchvision의 transform 기능을 이용해서 이미지 전처리
• normaliz의 mean과 std 값은 torchvision에서 제공되는 모델이 사용했
던 hyperparameter 들. 원래는 이미지 픽셀의 평균값을 그때그때 계산해
서 사용하지만 torchvision에서 미리 계산해서 상수 값으로 사용함
8. Utils
• opencv의 resize와 color map을 사용해서 heatmap 구성
• Windows의 경우 COLORMAP_JET의 색 구성이 우분투와 반대로 되어
있는 경우가 있음. 해당하는 경우에만 52번 라인 사용해서 뒤집어줌
9. Grad-CAM
• 해야 하는 것
1. 주어진 레이블에 대한 target layer weight의 gradient 계산
2. target layer output과 위의 gradient를 weighted sum
• 1)은 pytorch의 backward_hook을 사용해서 저장 가능
• 2)는 pytorch의 forward_hook을 사용해서 저장 가능
10. Grad-CAM
• 1)은 pytorch의 backward_hook을 사용해서 저장 가능 => #21
• 2)는 pytorch의 forward_hook을 사용해서 저장 가능 => #22
11. Grad-CAM
• forward hook의 파라미터는 module, input, output
• module은 해당 layer의 object, input은 해당 레이어에 들어온 입력
텐서, output은 해당 layer가 forward pass한 결과 텐서
12. Grad-CAM
• backward hook의 파라미터는 module, grad_input, grad_output
• module은 forward hook과 동일, grad_input은 해당 레이어에 들어온
입력의 gradient, grad_output은 forward pass의 결과로 나온 텐서의
gradient. 즉 grad_outpu로 grad_input을 새로 계산함(역전파)
13. Grad-CAM
• 주의할 점은 backward_hook에선 주어진 args를 변경하면 안 됨.
대신 backward_hook에서 텐서를 반환하면 해당 텐서를 grad_input 대
신 역전파 시켜 줌
14. Grad-CAM
• 이제 forward pass를 한 뒤 계산된
feature map과 gradient로
weighted sum 한 것을 반환해
heatmap을 만들 수 있음
• 주의할 점은 model의 forward pass를
호출할 때 model을 evaluation mode
로 설정해 줘야 함. 그렇지 않을 경우
batch norm 등에서 문제가 발생함
15. Guided Backpropagation
• 해야 하는 것
Backpropagation이 진행될 때 ReLU의 경우 해당 위치가 forward pass
의 output에서도 positive고 backward pass의 input에서도 positive
인 경우만 backpropagation을 진행
• 다행히 ReLU이기 때문에 원래 backpropagation일 때도 forward pass
의 outpu이 positive인 경우만 역전파가 진행됨.
• 따라서 backpropagation에선 뒤쪽 layer으로부터 역전파된 gradient
의 값만 참조해서 수정하면 됨
17. Guided Backpropagation
보통 고정된 입력인 이미지에 대해선
gradient 계산을 하지 않지만,
guided backpropagation의 목적
은 이미지에 대한 gradient를 계산하
는 것이므로
image 텐서에 requires_grad 함수
를 호출해 플래그를 켜줘야 함