SlideShare a Scribd company logo
Computer Vision
(Machine Learning for
Developers)
Rachhek Shrestha
Software Developer
Microsoft Innovation Center Nepal
“Making Sense
of what you see”
▪Most advanced computers still
struggle at “making sense”
▪What if machines could see?
“Collectively we as a society
are blind because our
smartest machines are blind.”
-Fei-Fei Li
Director of Standford’s AI
Lab
“Our ultimate goal is to
make computers see
just as we do.”
“Khagendra”
Carpet
Standing
on
Cake
Happy
• Naming Objects
• Identifying People
• 3D Geometry
• Understanding
• Relations
• Emotions
• Actions
• Intentions
How do you teach a
computer to see an
object?
OutputInput
In Simple terms, imagine showing training
Images to learn from
Create a training model
Cat
In this example a cat
How
hard can
this be?
Mathematically
a cat has a
round face, two
pointy ears,
chubby body
and a long tail
Cat
Instead of creating object
models, hundreds of
thousands of pictures were
collected from the internet
and an algorithm was used.
24 M nodes, 140 M parameters
15 B connections
Machine Learning Example
Machine Learning Example
Machine Learning Example
Need for an algorithm that
can write the rules for us so
that we don’t have to write it
by hand.
Machine Learning Example
Train a Classifier(Function)
Classifier takes some data as input and assigns label to it as output
Machine Learning Example
Example of Classifying
Machine Learning Example
Microsoft’s Machine
Learning APIs
Microsoft Cognitive Services
Microsoft Emotion API
https://www.microsoft.com/cognitive-services/en-us/emotion-api
Microsoft Emotion API
https://www.microsoft.com/cognitive-services/en-us/emotion-api
Microsoft Vision API
https://www.microsoft.com/cognitive-services/en-us/vision-api
Extract rich information from images to categorize and process visual data
Microsoft Vision API
https://www.microsoft.com/cognitive-services/en-us/vision-api
Extract rich information from images to categorize and process visual data
Microsoft Face API
https://www.microsoft.com/cognitive-services/en-us/face-api
Extract rich information from images to categorize and process visual data
Interesting Apps
Microsoft Cognitive Services
Captionbot.ai
Extract rich information from images to categorize and process visual data
Mimicker Alarm
How-old.net
Thank you !
References
▪ Fei-Fei Li: How we're teaching
computers to understand pictures
– http://www.ted.com/talks/fei_fei_li_how_we_r
e_teaching_computers_to_understand_pictur
es
▪ Microsoft Cognitive services
▪ https://www.youtube.com/user/Google
Developers

More Related Content

Similar to Computer vision (machine learning for developers)

INTRODUCTION TO ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING
INTRODUCTION TO ARTIFICIAL INTELLIGENCE AND MACHINE LEARNINGINTRODUCTION TO ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING
INTRODUCTION TO ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING
sowmyamPSGRKCW
 

Similar to Computer vision (machine learning for developers) (20)

INTRODUCTION TO ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING
INTRODUCTION TO ARTIFICIAL INTELLIGENCE AND MACHINE LEARNINGINTRODUCTION TO ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING
INTRODUCTION TO ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING
 
AI - Artificial Intelligence - Implications for Libraries
AI - Artificial Intelligence - Implications for LibrariesAI - Artificial Intelligence - Implications for Libraries
AI - Artificial Intelligence - Implications for Libraries
 
MFPE 23 AI presentation.pptx
MFPE 23 AI presentation.pptxMFPE 23 AI presentation.pptx
MFPE 23 AI presentation.pptx
 
Introduction to Artificial Intelligence: AIM tinkering Lab Unit 1
Introduction to Artificial Intelligence: AIM tinkering Lab Unit 1Introduction to Artificial Intelligence: AIM tinkering Lab Unit 1
Introduction to Artificial Intelligence: AIM tinkering Lab Unit 1
 
Deep learning
Deep learningDeep learning
Deep learning
 
OA Introduction to AI from Object Automation
OA Introduction to AI from Object Automation OA Introduction to AI from Object Automation
OA Introduction to AI from Object Automation
 
Machine Learning and Artificial Intelligence Introduction for Beginners
Machine Learning and Artificial Intelligence Introduction for BeginnersMachine Learning and Artificial Intelligence Introduction for Beginners
Machine Learning and Artificial Intelligence Introduction for Beginners
 
Lets Chat AI - and Not Just ChatGPT
Lets Chat AI - and Not Just ChatGPTLets Chat AI - and Not Just ChatGPT
Lets Chat AI - and Not Just ChatGPT
 
Lets Chat AI – And Not Just ChatGPT
Lets Chat AI – And Not Just ChatGPTLets Chat AI – And Not Just ChatGPT
Lets Chat AI – And Not Just ChatGPT
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Machine learning in startup
Machine learning in startupMachine learning in startup
Machine learning in startup
 
Artificial intelligence
Artificial intelligenceArtificial intelligence
Artificial intelligence
 
An Elementary Introduction to Artificial Intelligence, Data Science and Machi...
An Elementary Introduction to Artificial Intelligence, Data Science and Machi...An Elementary Introduction to Artificial Intelligence, Data Science and Machi...
An Elementary Introduction to Artificial Intelligence, Data Science and Machi...
 
unleshing the the Power Azure Open AI - MCT Summit middle east 2024 Riyhad.pptx
unleshing the the Power Azure Open AI - MCT Summit middle east 2024 Riyhad.pptxunleshing the the Power Azure Open AI - MCT Summit middle east 2024 Riyhad.pptx
unleshing the the Power Azure Open AI - MCT Summit middle east 2024 Riyhad.pptx
 
¿Qué es real? Cuando la IA intenta engañar al ojo humano
¿Qué es real? Cuando la IA intenta engañar al ojo humano¿Qué es real? Cuando la IA intenta engañar al ojo humano
¿Qué es real? Cuando la IA intenta engañar al ojo humano
 
AN INTRODUCTION TO EMERGING TECHNOLOGY
AN INTRODUCTION TO EMERGING TECHNOLOGYAN INTRODUCTION TO EMERGING TECHNOLOGY
AN INTRODUCTION TO EMERGING TECHNOLOGY
 
Artificial intelligence
Artificial intelligenceArtificial intelligence
Artificial intelligence
 
Unit 1 part 1
Unit 1   part 1Unit 1   part 1
Unit 1 part 1
 
Machine learning limits (What can Machine Learning do and what it can't)
Machine learning limits (What can Machine Learning do and what it can't)Machine learning limits (What can Machine Learning do and what it can't)
Machine learning limits (What can Machine Learning do and what it can't)
 
Hybrid Intelligence: The New Paradigm
Hybrid Intelligence: The New ParadigmHybrid Intelligence: The New Paradigm
Hybrid Intelligence: The New Paradigm
 

Recently uploaded

Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Peter Udo Diehl
 

Recently uploaded (20)

When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 

Computer vision (machine learning for developers)