Computer Vision Lab

•Download as PPTX, PDF•

0 likes•113 views

I had the opportunity to teach a lab of computer vision for the amazing women studying one of the AllWomen courses. Here I share the slides used for teaching these 2h labs where I tried to cover in a very high level some of the basic concepts of computer vision.

Data & Analytics

hello!
I am Karenne Mata
You can find the material for this lab in github
https://github.com/karenne/ComputerVisionLab
2

“Computer vision and machine learning
have really started to take off, but for
most people, the whole idea of what is a
computer seeing when it's looking at an
image is relatively obscure.
Mike Krieger
3

What is computer vision?
"Computer vision is an interdisciplinary scientific field that deals with
how computers can be made to gain high-level understanding from
digital images or videos. From the perspective of engineering, it seeks to
automate tasks that the human visual system can do.” - Wikipedia
4
?

Still a little obscure, isn’t it?
5
I am the Machine
looking at this image
A Black Box,
actually my brain
I am probably seeing a
5!
But I need some training before
getting so good at seeing numbers

HISTORY OF COMPUTER VISION
or how have we built the black box
Optical Character Recognision
1870 -1974: Development of the
technology that allows the
translation from image to text
1974 – 2000: Massive
commercialization
2000 – Now: Open software
realeased (Adobe, GoogleDrive,
WebOCR)
The fathers of computer vision
Larry Roberts (1963); “Machine
Perception of three
dimensional solids”; MIT PhD.
Thesis.
David Marr (1982); “Vision. A
Computational Investigation
into the Human
Representation and Processing
of Visual Information”; MIT
Press
Convolutional Nets
1980 Fukushima. The
Neocognitron
1998 Yann LeCun et al. LeNet5
2012 AlexNet
2015 ResNet-152
6

Some necessary maths…. What is a neural network
7
a11
a1n…
am1 amn…
…
y1
ym
…
x1x1 xn…
x2
xn
X =
zh = wTX + b
ah = g(zh)
zo = wT ah + b
ao = g(zo)

convolutional neural networks (CNN).
The foundations of image classification
8
Input
Output 0
0
0
0
0
1
0
0
0
0
✘ Convolutional Layer:
✘ Kernel
✘ Padding
✘ Stride
✘ Activation Function
✘ Pooling Layer
p11
p1n…
pm1 pmn…
p11
p1n…
pm1 pmn…
p11
p1n…
pm1 pmn…
* More detail on:
https://www.analyticsvidhya.com/blog/2020/02/mathematics-
behind-convolutional-neural-network/

Opening convolutional neural networks
9
* Figure 4 from paper Zeiler et al. (2013) Visualizing and Understanding Convolutional Networks.

Business stories
1. Autonomous vehicles
Self driving cars are constantly
streaming the environment to react
according the signals and other cars
or people.
2. Google Translate app
Google allows the translation of the
text embedded in images thus the
user doesn’t need to write the words
down.
3. Facial recognition
Security systems can use advanced
algorithms of computer vision to
recognize a person only showing the
face.
4. Healthcare
Diagnostic of X-rays and all the
image based sources in medicine.
5. Real-time sports tracking
See in real time the performance of
a player and the deployment of a
strategy.
6. Manufacturing
Packaging and labeling quality
assessments.
10
* Bernard Marr; 7 Amazing Examples Of Computer And Machine Vision In Practice (8 Apr 2019) Forbes

Another business story:
Emotion AI
1. Medical diagnosis.
Diagnosis of some mental health
diseases such as depression or
anxiety
2. Truth detector
Emotion AI could be used as a
truth detection device that can
be used e.g. in fraud detection
3. Marketing
Knowing how the customer feels
regarding a product or the mood of
the person can improve the quality
of the campaigns
11
* 13 Surprising Uses For Emotion AI Technology, Smarter with Garner (Susan Moore. September 11, 2018)
"Emotion AI is a subset of artificial intelligence (the broad term for machines replicating the way humans think)
that measures, understands, simulates, and reacts to human emotions” Meredith Somers, Emotion AI, explained
(March 8, 2019)

12
An example
Check this github repository to use an API for face
expression recognision:
https://github.com/justadudewhohacks/face-api.js
This is fun!

thanks!
Any questions?
You can find me at
https://www.linkedin.com/in/karenne-mata-4b81a0116/
13

What's hot

Artificial intelligence in cyber defenseUjjwal Tripathi

Ai pptKartikRathore10

Artificial intelligence a bane or boon-pdfAvlin Roy

Machine learning vs deep learningUSM Systems

Artificial intellegencegeetinsaa

Computer vision introduction Wael Badawy

New Artifitial Intelligence that can predicts Human ActionsShreya Shetty

Artificial intelligenceSURBHI SAROHA

How artificial intelligence changing the worldUSM Systems

Artificial Intelligence (AI) Interview Questions and Answers | EdurekaEdureka!

Advantages of Artificial Intelligence - Avantika UniversityAvantika University

An overview on aiMidhun chakkaravarthy

Machine Learning: Inteligencia Artificial no es sólo un tema de Ciencia Ficci....NET Conf UY

Artificial Intelligence and Expert SystemDr.R. Gunavathi Ramasamy

BE-EEE-8th sem-Presentation Artificial intelligence in security managenentMOHAMMED SAQIB

5 latest technologies in it copySaurabh Sengar

Artificial Intelligence - For AllWeDigTech

Computational Intelligence and ApplicationsChetan Kumar S

Artificial Intelligence Overview PowerPoint Presentation Slides SlideTeam

Artificial Intelligencearpitnot4u

What's hot (20)

Artificial intelligence in cyber defense

Ai ppt

Artificial intelligence a bane or boon-pdf

Machine learning vs deep learning

Artificial intellegence

Computer vision introduction

New Artifitial Intelligence that can predicts Human Actions

Artificial intelligence

How artificial intelligence changing the world

Artificial Intelligence (AI) Interview Questions and Answers | Edureka

Advantages of Artificial Intelligence - Avantika University

An overview on ai

Machine Learning: Inteligencia Artificial no es sólo un tema de Ciencia Ficci...

Artificial Intelligence and Expert System

BE-EEE-8th sem-Presentation Artificial intelligence in security managenent

5 latest technologies in it copy

Artificial Intelligence - For All

Computational Intelligence and Applications

Artificial Intelligence Overview PowerPoint Presentation Slides

Artificial Intelligence

Similar to Computer Vision Lab

Saksham presentationSakshamTurki

Applications unit 4SURBHI SAROHA

Computer vision lightning talk castaway weekChristopher Decker

Computer visionyusifagalar

ARTIFICIAL INTELLIGENCE.pptxBryCunal

What is Computer Vision?Kavika Roy

wepik-seeing-is-believing-a-beginners-guide-to-computer-vision-20231002053054...PradeepAnand18

Artificial Intelligence: Classification, Applications, Opportunities, and Cha...Abdullah al Mamun

seminar_computer_vision.pptxTejaswiniShirode

Everything You Need to Know About Computer VisionKavika Roy

Face Mask Detection System Using Artificial IntelligenceIRJET Journal

UNCOVERING FAKE NEWS BY MEANS OF SOCIAL NETWORK ANALYSISpijans

AI KIMSRAD.pptxDr. Manoj Krishnan Sarojam

0-1--Introduction FPCV-0-1.pdfPatrickMatthewChan

Area's of Artificial Inteligence .pptxJIMSVKII

Computer VisionNitin Sharma

Ai & deep learning with tensor flow iq online trainingvinayiqbusiness

ASSISTANCE SYSTEM FOR DRIVERS USING IOTIRJET Journal

[DSC Europe 23] Shahab Anbarjafari - Generative AI: Impact of Responsible AIDataScienceConferenc1

Similar to Computer Vision Lab (20)

Saksham presentation

Applications unit 4

Computer vision lightning talk castaway week

Computer vision

ARTIFICIAL INTELLIGENCE.pptx

What is Computer Vision?

wepik-seeing-is-believing-a-beginners-guide-to-computer-vision-20231002053054...

Artificial Intelligence: Classification, Applications, Opportunities, and Cha...

seminar_computer_vision.pptx

Everything You Need to Know About Computer Vision

Face Mask Detection System Using Artificial Intelligence

UNCOVERING FAKE NEWS BY MEANS OF SOCIAL NETWORK ANALYSIS

AI KIMSRAD.pptx

0-1--Introduction FPCV-0-1.pdf

Area's of Artificial Inteligence .pptx

Computer Vision

Ai & deep learning with tensor flow iq online training

ASSISTANCE SYSTEM FOR DRIVERS USING IOT

[DSC Europe 23] Shahab Anbarjafari - Generative AI: Impact of Responsible AI

Recently uploaded

Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863

Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda

04242024_CCC TUG_Joins and Relationshipsccctableauusergroup

Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster

Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha

VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure

代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo

(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat

꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure

Industrialised data - the key to AI success.pdfLars Albertsson

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408

From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck

How we prevented account sharing with MFAAndrei Kaleshka

High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh

Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa

办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss

VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor

Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh

Call Girls in Saket 99530🔝 56974 Escort Service9953056974 Low Rate Call Girls In Saket, Delhi NCR

Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation

Recently uploaded (20)

Dubai Call Girls Wifey O52&786472 Call Girls Dubai

Customer Service Analytics - Make Sense of All Your Data.pptx

04242024_CCC TUG_Joins and Relationships

Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024

Call Girls In Dwarka 9654467111 Escorts Service

VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...

代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改

(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service

꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...

Industrialised data - the key to AI success.pdf

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps

From idea to production in a day – Leveraging Azure ML and Streamlit to build...

How we prevented account sharing with MFA

High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...

Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf

办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一

VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...

Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝

Call Girls in Saket 99530🔝 56974 Escort Service

Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...

Computer Vision Lab

1. computer vision lab by Karenne mata

2. hello! I am Karenne Mata You can find the material for this lab in github https://github.com/karenne/ComputerVisionLab 2

3. “Computer vision and machine learning have really started to take off, but for most people, the whole idea of what is a computer seeing when it's looking at an image is relatively obscure. Mike Krieger 3

4. What is computer vision? "Computer vision is an interdisciplinary scientific field that deals with how computers can be made to gain high-level understanding from digital images or videos. From the perspective of engineering, it seeks to automate tasks that the human visual system can do.” - Wikipedia 4 ?

5. Still a little obscure, isn’t it? 5 I am the Machine looking at this image A Black Box, actually my brain I am probably seeing a 5! But I need some training before getting so good at seeing numbers

6. HISTORY OF COMPUTER VISION or how have we built the black box Optical Character Recognision 1870 -1974: Development of the technology that allows the translation from image to text 1974 – 2000: Massive commercialization 2000 – Now: Open software realeased (Adobe, GoogleDrive, WebOCR) The fathers of computer vision Larry Roberts (1963); “Machine Perception of three dimensional solids”; MIT PhD. Thesis. David Marr (1982); “Vision. A Computational Investigation into the Human Representation and Processing of Visual Information”; MIT Press Convolutional Nets 1980 Fukushima. The Neocognitron 1998 Yann LeCun et al. LeNet5 2012 AlexNet 2015 ResNet-152 6

7. Some necessary maths…. What is a neural network 7 a11 a1n… am1 amn… … y1 ym … x1x1 xn… x2 xn X = zh = wTX + b ah = g(zh) zo = wT ah + b ao = g(zo)

8. convolutional neural networks (CNN). The foundations of image classification 8 Input Output 0 0 0 0 0 1 0 0 0 0 ✘ Convolutional Layer: ✘ Kernel ✘ Padding ✘ Stride ✘ Activation Function ✘ Pooling Layer p11 p1n… pm1 pmn… p11 p1n… pm1 pmn… p11 p1n… pm1 pmn… * More detail on: https://www.analyticsvidhya.com/blog/2020/02/mathematics- behind-convolutional-neural-network/

9. Opening convolutional neural networks 9 * Figure 4 from paper Zeiler et al. (2013) Visualizing and Understanding Convolutional Networks.

10. Business stories 1. Autonomous vehicles Self driving cars are constantly streaming the environment to react according the signals and other cars or people. 2. Google Translate app Google allows the translation of the text embedded in images thus the user doesn’t need to write the words down. 3. Facial recognition Security systems can use advanced algorithms of computer vision to recognize a person only showing the face. 4. Healthcare Diagnostic of X-rays and all the image based sources in medicine. 5. Real-time sports tracking See in real time the performance of a player and the deployment of a strategy. 6. Manufacturing Packaging and labeling quality assessments. 10 * Bernard Marr; 7 Amazing Examples Of Computer And Machine Vision In Practice (8 Apr 2019) Forbes

11. Another business story: Emotion AI 1. Medical diagnosis. Diagnosis of some mental health diseases such as depression or anxiety 2. Truth detector Emotion AI could be used as a truth detection device that can be used e.g. in fraud detection 3. Marketing Knowing how the customer feels regarding a product or the mood of the person can improve the quality of the campaigns 11 * 13 Surprising Uses For Emotion AI Technology, Smarter with Garner (Susan Moore. September 11, 2018) "Emotion AI is a subset of artificial intelligence (the broad term for machines replicating the way humans think) that measures, understands, simulates, and reacts to human emotions” Meredith Somers, Emotion AI, explained (March 8, 2019)

12. 12 An example Check this github repository to use an API for face expression recognision: https://github.com/justadudewhohacks/face-api.js This is fun!

13. thanks! Any questions? You can find me at https://www.linkedin.com/in/karenne-mata-4b81a0116/ 13

Editor's Notes

Larry Roberts – How to represent images from 2D arrays to 3D using topology and algebra. Representation of the image rather than trying to convert to text. David Marr creates the bottom up approach, low level image processing meaning detecting corners, edges and motion Fukushima: A Self-organizing Neural Network Model for a Mechanism of Pattern Recognition Unaffected by Shift in Position” Yann LeCun “Gradient-Based Learning Applied to Document Recognition” 1989 LeCun CNN with backpropagation
Stride is the number of pixels shifts over the input matrix. Pad the picture with zeros (zero-padding) so that it fits ReLU stands for Rectified Linear Unit for a non-linear operation. The output is ƒ(x) = max(0,x). Pooling layers section would reduce the number of parameters when the images are too large. Spatial pooling also called subsampling or downsampling which reduces the dimensionality of each map but retains important information. Spatial pooling can be of different types: Max Pooling Average Pooling Sum Pooling
Stride is the number of pixels shifts over the input matrix. Pad the picture with zeros (zero-padding) so that it fits ReLU stands for Rectified Linear Unit for a non-linear operation. The output is ƒ(x) = max(0,x). Pooling layers section would reduce the number of parameters when the images are too large. Spatial pooling also called subsampling or downsampling which reduces the dimensionality of each map but retains important information. Spatial pooling can be of different types: Max Pooling Average Pooling Sum Pooling

Computer Vision Lab

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Computer Vision Lab

Similar to Computer Vision Lab (20)

Recently uploaded

Recently uploaded (20)

Computer Vision Lab

Editor's Notes