SlideShare a Scribd company logo
1 of 21
Become
A Photo Pro
With AI and Deep Learning
Dan Zeitman - Developer Advocate - Cloudinary
Marek Sadowski - Developer Advocate - IBM
Outline
● What is AI / Deep Learning powered Photography?
● INPUT: Taking better and more professional photos
○ AI in New Camera Devices
● OUTPUT: AI powered Websites
○ Automated Curation of images, using Auto-tagging,
Deep Learning & Visual Analysis
○ Advances in algorithmic and AI-based Image
manipulation, filtering, optimization and speedy
delivery
● Demos
○ AI Playground
○ Selfie Camera
○ Upload Widget
Key Features For Visual Recognition
● Object, Scene, and Activity detection
● Facial Recognition
● Facial analysis
● Person tracking
● Content Moderation - Unsafe content detection
● Celebrity recognition
● OCR - Text in images
Object, Scene, and Activity detection
Source: AWS Rekognition
Facial Recognition
Source: AWS Rekognition
Facial Analysis
Source: AWS Rekognition
Person Tracking
Source: AWS Rekognition
Content Moderation - Unsafe Content
Source: AWS Rekognition
Celebrity Recognition
Source: AWS Rekognition
OCR - Text
Source: AWS Rekognition
AI / Deep Learning Cameras
Google Clips - Consumer Product that captures family members
and their pets.
AWS DeepLens - Learning tool aimed at AI Developers.
Lighthouse - Security camera (on Steroids)
Furbo (Furbo) - Pet Camera that dispenses treats
Spectacles (SnapChat V2) - Popular eyeglass camera will have AI
labels and AR capabilities in the next version
Arsenal Camera Assistant - Black box device to control DLSR
cameras
Use Case: Google Clips
Google Clips features
Moment IQ, a machine
learning algorithm that’s
smart enough to
recognize great
expressions, lighting and
framing. And it’s always
learning.
Source:Google Clips
Use Case: DeepLens
The world’s first deep
learning enabled video
camera for developers
AWS DeepLens helps put
deep learning in the
hands of developers,
literally, with a fully
programmable video
camera, tutorials, code,
and pre-trained models
designed to expand deep
learning skills.
Source: AWS DeepLens
Use Case: Lighthouse
Lighthouse is an
interactive assistant using
advanced camera
technology and machine
learning for your home.
You tell it the security, pet
and family related
activities you care about,
and it tells you when
those things happen.
Source:Lighthouse
Use Case: Arsenal Camera Assistant
Arsenal’s ultralight
hardware uses state
of the art AI to take
better photos in any
condition.
Source: Arsenal Camera Assistant
Use Case: Arsenal Camera Assistant
Arsenal quickly examines the scene. It uses image recognition to
identify environment and subject-specific needs (e.g. fast shutter
for birds or camera vibration)
Arsenal then finds great settings by comparing the current scene
with thousands of professional photos using a convolutional deep
neural network.
Lastly, Arsenal optimizes settings based on 18 different factors,
like hyperfocal distance, sensor dynamic range and lens
transmission.
Source: Arsenal Camera Assistant
Websites powered by AI / Deep Learning
Tinder
Yelp
Source: Tinder, Yelp
Use Case: Yelp
Yelp users upload around 100,000 photos a day to a collection of tens of millions, and
that rate continues to grow.
Yelp turned to various computer vision techniques, trying to discover intrinsic features of
a given image that could be associated with a quality score
Source: Yelp
Use Case: Yelp
At Yelp, each business’s page showcases a few of its best photos, which we call cover
photos.
First, this system was highly subject to selection bias. Cover photos are viewed and clicked
significantly more often than average. As a result, once a photo ends up on the business
page, it is highly likely to remain there, even if more attractive and useful photos are
uploaded at a later date.
Additionally, relying solely on likes to determine prominent photos can end up promoting
“clickbait” photos- that is, those that may have low relevance and quality but are upvoted
due to their provocative nature.
Source:, Yelp
Cloudinary Search Demo (TJ Bot)
● Overview of Cloudinary DAM / Admin console
● Search Bot Demo
Cloudinary & Watson Demos
GitHub Projects
https://github.com/blumareks/cloudinary-watson
Watson Sign-Up
https://developer.ibm.com/code/patterns/apply-
cognitive-to-mobile-images-on-the-go/?cm_mmc=dw-_-
cloudinarymeetup-_-imc-_-email
Cloudinary Sign-Up
https://cloudinary.com/users/register/free?source=ibm
meetup-slideshare

More Related Content

Similar to Become a photo pro with ai and deep learning

Globant - Amazon recognition workshop - 2018
Globant - Amazon recognition workshop - 2018  Globant - Amazon recognition workshop - 2018
Globant - Amazon recognition workshop - 2018 Globant
 
Best Practices for Integrating Amazon Rekognition into Your Own Applications
Best Practices for Integrating Amazon Rekognition into Your Own ApplicationsBest Practices for Integrating Amazon Rekognition into Your Own Applications
Best Practices for Integrating Amazon Rekognition into Your Own ApplicationsAmazon Web Services
 
BDA301 An Introduction to Amazon Rekognition
BDA301 An Introduction to Amazon RekognitionBDA301 An Introduction to Amazon Rekognition
BDA301 An Introduction to Amazon RekognitionAmazon Web Services
 
Adding Image and Video Analysis to your Applications (May 2018)
Adding Image and Video Analysis to your Applications (May 2018)Adding Image and Video Analysis to your Applications (May 2018)
Adding Image and Video Analysis to your Applications (May 2018)Julien SIMON
 
Adding Image and Video Analysis to your applications
Adding Image and Video Analysis to your applicationsAdding Image and Video Analysis to your applications
Adding Image and Video Analysis to your applicationsAmazon Web Services
 
AWS 기반 인공지능 비디오 분석 서비스 소개::Ranju Das::AWS Summit Seoul 2018
AWS 기반 인공지능 비디오 분석 서비스 소개::Ranju Das::AWS Summit Seoul 2018AWS 기반 인공지능 비디오 분석 서비스 소개::Ranju Das::AWS Summit Seoul 2018
AWS 기반 인공지능 비디오 분석 서비스 소개::Ranju Das::AWS Summit Seoul 2018Amazon Web Services Korea
 
AWS AI Media & Entertainment Seminar - NYC, August 15, 2017
AWS AI Media & Entertainment Seminar - NYC, August 15, 2017AWS AI Media & Entertainment Seminar - NYC, August 15, 2017
AWS AI Media & Entertainment Seminar - NYC, August 15, 2017Amazon Web Services
 
AWS Summit Berlin 2017
AWS Summit Berlin 2017AWS Summit Berlin 2017
AWS Summit Berlin 2017Rino Montiel
 
Build Computer Vision Applications with Amazon Rekognition
Build Computer Vision Applications with Amazon RekognitionBuild Computer Vision Applications with Amazon Rekognition
Build Computer Vision Applications with Amazon RekognitionAmazon Web Services
 
AWS re:Invent 2016: NEW LAUNCH! Introducing Amazon Rekognition (MAC203)
AWS re:Invent 2016: NEW LAUNCH! Introducing Amazon Rekognition (MAC203)AWS re:Invent 2016: NEW LAUNCH! Introducing Amazon Rekognition (MAC203)
AWS re:Invent 2016: NEW LAUNCH! Introducing Amazon Rekognition (MAC203)Amazon Web Services
 
Best practices for integrating Amazon Rekognition into your own application
Best practices for integrating Amazon Rekognition into your own applicationBest practices for integrating Amazon Rekognition into your own application
Best practices for integrating Amazon Rekognition into your own applicationAmazon Web Services
 
How to Test Computer Vision Apps like Google Lens and Google Photos.pdf
How to Test Computer Vision Apps like Google Lens and Google Photos.pdfHow to Test Computer Vision Apps like Google Lens and Google Photos.pdf
How to Test Computer Vision Apps like Google Lens and Google Photos.pdfpCloudy
 
BDA 301 An Introduction to Amazon Rekognition, for Deep Learning-based Comput...
BDA 301 An Introduction to Amazon Rekognition, for Deep Learning-based Comput...BDA 301 An Introduction to Amazon Rekognition, for Deep Learning-based Comput...
BDA 301 An Introduction to Amazon Rekognition, for Deep Learning-based Comput...Amazon Web Services
 
Real time video analytics with InfoSphere Streams, OpenCV and R
Real time video analytics with InfoSphere Streams, OpenCV and RReal time video analytics with InfoSphere Streams, OpenCV and R
Real time video analytics with InfoSphere Streams, OpenCV and RStephan Reimann
 
Darin Briskman_Amazon_June_9_2017_Presentation
Darin Briskman_Amazon_June_9_2017_PresentationDarin Briskman_Amazon_June_9_2017_Presentation
Darin Briskman_Amazon_June_9_2017_PresentationTriNimbus
 
Artificial Intelligence on the AWS Platform
Artificial Intelligence on the AWS PlatformArtificial Intelligence on the AWS Platform
Artificial Intelligence on the AWS PlatformAdrian Hornsby
 
Aggiungere analisi di video e immagini alle vostre applicazioni
Aggiungere analisi di video e immagini alle vostre applicazioniAggiungere analisi di video e immagini alle vostre applicazioni
Aggiungere analisi di video e immagini alle vostre applicazioniAmazon Web Services
 
AI in Finance: Moving forward!
AI in Finance: Moving forward!AI in Finance: Moving forward!
AI in Finance: Moving forward!Adrian Hornsby
 

Similar to Become a photo pro with ai and deep learning (20)

Globant - Amazon recognition workshop - 2018
Globant - Amazon recognition workshop - 2018  Globant - Amazon recognition workshop - 2018
Globant - Amazon recognition workshop - 2018
 
Best Practices for Integrating Amazon Rekognition into Your Own Applications
Best Practices for Integrating Amazon Rekognition into Your Own ApplicationsBest Practices for Integrating Amazon Rekognition into Your Own Applications
Best Practices for Integrating Amazon Rekognition into Your Own Applications
 
BDA301 An Introduction to Amazon Rekognition
BDA301 An Introduction to Amazon RekognitionBDA301 An Introduction to Amazon Rekognition
BDA301 An Introduction to Amazon Rekognition
 
Adding Image and Video Analysis to your Applications (May 2018)
Adding Image and Video Analysis to your Applications (May 2018)Adding Image and Video Analysis to your Applications (May 2018)
Adding Image and Video Analysis to your Applications (May 2018)
 
Adding Image and Video Analysis to your applications
Adding Image and Video Analysis to your applicationsAdding Image and Video Analysis to your applications
Adding Image and Video Analysis to your applications
 
AWS 기반 인공지능 비디오 분석 서비스 소개::Ranju Das::AWS Summit Seoul 2018
AWS 기반 인공지능 비디오 분석 서비스 소개::Ranju Das::AWS Summit Seoul 2018AWS 기반 인공지능 비디오 분석 서비스 소개::Ranju Das::AWS Summit Seoul 2018
AWS 기반 인공지능 비디오 분석 서비스 소개::Ranju Das::AWS Summit Seoul 2018
 
AWS AI Media & Entertainment Seminar - NYC, August 15, 2017
AWS AI Media & Entertainment Seminar - NYC, August 15, 2017AWS AI Media & Entertainment Seminar - NYC, August 15, 2017
AWS AI Media & Entertainment Seminar - NYC, August 15, 2017
 
AWS Summit Berlin 2017
AWS Summit Berlin 2017AWS Summit Berlin 2017
AWS Summit Berlin 2017
 
Ai use cases
Ai use casesAi use cases
Ai use cases
 
Build Computer Vision Applications with Amazon Rekognition
Build Computer Vision Applications with Amazon RekognitionBuild Computer Vision Applications with Amazon Rekognition
Build Computer Vision Applications with Amazon Rekognition
 
AWS re:Invent 2016: NEW LAUNCH! Introducing Amazon Rekognition (MAC203)
AWS re:Invent 2016: NEW LAUNCH! Introducing Amazon Rekognition (MAC203)AWS re:Invent 2016: NEW LAUNCH! Introducing Amazon Rekognition (MAC203)
AWS re:Invent 2016: NEW LAUNCH! Introducing Amazon Rekognition (MAC203)
 
Best practices for integrating Amazon Rekognition into your own application
Best practices for integrating Amazon Rekognition into your own applicationBest practices for integrating Amazon Rekognition into your own application
Best practices for integrating Amazon Rekognition into your own application
 
How to Test Computer Vision Apps like Google Lens and Google Photos.pdf
How to Test Computer Vision Apps like Google Lens and Google Photos.pdfHow to Test Computer Vision Apps like Google Lens and Google Photos.pdf
How to Test Computer Vision Apps like Google Lens and Google Photos.pdf
 
BDA 301 An Introduction to Amazon Rekognition, for Deep Learning-based Comput...
BDA 301 An Introduction to Amazon Rekognition, for Deep Learning-based Comput...BDA 301 An Introduction to Amazon Rekognition, for Deep Learning-based Comput...
BDA 301 An Introduction to Amazon Rekognition, for Deep Learning-based Comput...
 
Real time video analytics with InfoSphere Streams, OpenCV and R
Real time video analytics with InfoSphere Streams, OpenCV and RReal time video analytics with InfoSphere Streams, OpenCV and R
Real time video analytics with InfoSphere Streams, OpenCV and R
 
Demo day poster
Demo day posterDemo day poster
Demo day poster
 
Darin Briskman_Amazon_June_9_2017_Presentation
Darin Briskman_Amazon_June_9_2017_PresentationDarin Briskman_Amazon_June_9_2017_Presentation
Darin Briskman_Amazon_June_9_2017_Presentation
 
Artificial Intelligence on the AWS Platform
Artificial Intelligence on the AWS PlatformArtificial Intelligence on the AWS Platform
Artificial Intelligence on the AWS Platform
 
Aggiungere analisi di video e immagini alle vostre applicazioni
Aggiungere analisi di video e immagini alle vostre applicazioniAggiungere analisi di video e immagini alle vostre applicazioni
Aggiungere analisi di video e immagini alle vostre applicazioni
 
AI in Finance: Moving forward!
AI in Finance: Moving forward!AI in Finance: Moving forward!
AI in Finance: Moving forward!
 

Recently uploaded

SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, YardstickSaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, Yardsticksaastr
 
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdfThe workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdfSenaatti-kiinteistöt
 
Mathematics of Finance Presentation.pptx
Mathematics of Finance Presentation.pptxMathematics of Finance Presentation.pptx
Mathematics of Finance Presentation.pptxMoumonDas2
 
Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Vipesco
 
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptxMohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptxmohammadalnahdi22
 
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxNikitaBankoti2
 
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
Air breathing and respiratory adaptations in diver animals
Air breathing and respiratory adaptations in diver animalsAir breathing and respiratory adaptations in diver animals
Air breathing and respiratory adaptations in diver animalsaqsarehman5055
 
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
Microsoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AIMicrosoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AITatiana Gurgel
 
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxChiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxraffaeleoman
 
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Kayode Fayemi
 
If this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaIf this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaKayode Fayemi
 
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyCall Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyPooja Nehwal
 
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Delhi Call girls
 
Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)Chameera Dedduwage
 
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024eCommerce Institute
 
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...Sheetaleventcompany
 
Report Writing Webinar Training
Report Writing Webinar TrainingReport Writing Webinar Training
Report Writing Webinar TrainingKylaCullinane
 
George Lever - eCommerce Day Chile 2024
George Lever -  eCommerce Day Chile 2024George Lever -  eCommerce Day Chile 2024
George Lever - eCommerce Day Chile 2024eCommerce Institute
 

Recently uploaded (20)

SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, YardstickSaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
 
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdfThe workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
 
Mathematics of Finance Presentation.pptx
Mathematics of Finance Presentation.pptxMathematics of Finance Presentation.pptx
Mathematics of Finance Presentation.pptx
 
Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510
 
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptxMohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
 
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
 
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
 
Air breathing and respiratory adaptations in diver animals
Air breathing and respiratory adaptations in diver animalsAir breathing and respiratory adaptations in diver animals
Air breathing and respiratory adaptations in diver animals
 
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
 
Microsoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AIMicrosoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AI
 
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxChiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
 
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
 
If this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaIf this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New Nigeria
 
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyCall Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
 
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
 
Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)
 
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
 
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
 
Report Writing Webinar Training
Report Writing Webinar TrainingReport Writing Webinar Training
Report Writing Webinar Training
 
George Lever - eCommerce Day Chile 2024
George Lever -  eCommerce Day Chile 2024George Lever -  eCommerce Day Chile 2024
George Lever - eCommerce Day Chile 2024
 

Become a photo pro with ai and deep learning

  • 1. Become A Photo Pro With AI and Deep Learning Dan Zeitman - Developer Advocate - Cloudinary Marek Sadowski - Developer Advocate - IBM
  • 2. Outline ● What is AI / Deep Learning powered Photography? ● INPUT: Taking better and more professional photos ○ AI in New Camera Devices ● OUTPUT: AI powered Websites ○ Automated Curation of images, using Auto-tagging, Deep Learning & Visual Analysis ○ Advances in algorithmic and AI-based Image manipulation, filtering, optimization and speedy delivery ● Demos ○ AI Playground ○ Selfie Camera ○ Upload Widget
  • 3. Key Features For Visual Recognition ● Object, Scene, and Activity detection ● Facial Recognition ● Facial analysis ● Person tracking ● Content Moderation - Unsafe content detection ● Celebrity recognition ● OCR - Text in images
  • 4. Object, Scene, and Activity detection Source: AWS Rekognition
  • 8. Content Moderation - Unsafe Content Source: AWS Rekognition
  • 10. OCR - Text Source: AWS Rekognition
  • 11. AI / Deep Learning Cameras Google Clips - Consumer Product that captures family members and their pets. AWS DeepLens - Learning tool aimed at AI Developers. Lighthouse - Security camera (on Steroids) Furbo (Furbo) - Pet Camera that dispenses treats Spectacles (SnapChat V2) - Popular eyeglass camera will have AI labels and AR capabilities in the next version Arsenal Camera Assistant - Black box device to control DLSR cameras
  • 12. Use Case: Google Clips Google Clips features Moment IQ, a machine learning algorithm that’s smart enough to recognize great expressions, lighting and framing. And it’s always learning. Source:Google Clips
  • 13. Use Case: DeepLens The world’s first deep learning enabled video camera for developers AWS DeepLens helps put deep learning in the hands of developers, literally, with a fully programmable video camera, tutorials, code, and pre-trained models designed to expand deep learning skills. Source: AWS DeepLens
  • 14. Use Case: Lighthouse Lighthouse is an interactive assistant using advanced camera technology and machine learning for your home. You tell it the security, pet and family related activities you care about, and it tells you when those things happen. Source:Lighthouse
  • 15. Use Case: Arsenal Camera Assistant Arsenal’s ultralight hardware uses state of the art AI to take better photos in any condition. Source: Arsenal Camera Assistant
  • 16. Use Case: Arsenal Camera Assistant Arsenal quickly examines the scene. It uses image recognition to identify environment and subject-specific needs (e.g. fast shutter for birds or camera vibration) Arsenal then finds great settings by comparing the current scene with thousands of professional photos using a convolutional deep neural network. Lastly, Arsenal optimizes settings based on 18 different factors, like hyperfocal distance, sensor dynamic range and lens transmission. Source: Arsenal Camera Assistant
  • 17. Websites powered by AI / Deep Learning Tinder Yelp Source: Tinder, Yelp
  • 18. Use Case: Yelp Yelp users upload around 100,000 photos a day to a collection of tens of millions, and that rate continues to grow. Yelp turned to various computer vision techniques, trying to discover intrinsic features of a given image that could be associated with a quality score Source: Yelp
  • 19. Use Case: Yelp At Yelp, each business’s page showcases a few of its best photos, which we call cover photos. First, this system was highly subject to selection bias. Cover photos are viewed and clicked significantly more often than average. As a result, once a photo ends up on the business page, it is highly likely to remain there, even if more attractive and useful photos are uploaded at a later date. Additionally, relying solely on likes to determine prominent photos can end up promoting “clickbait” photos- that is, those that may have low relevance and quality but are upvoted due to their provocative nature. Source:, Yelp
  • 20. Cloudinary Search Demo (TJ Bot) ● Overview of Cloudinary DAM / Admin console ● Search Bot Demo
  • 21. Cloudinary & Watson Demos GitHub Projects https://github.com/blumareks/cloudinary-watson Watson Sign-Up https://developer.ibm.com/code/patterns/apply- cognitive-to-mobile-images-on-the-go/?cm_mmc=dw-_- cloudinarymeetup-_-imc-_-email Cloudinary Sign-Up https://cloudinary.com/users/register/free?source=ibm meetup-slideshare

Editor's Notes

  1. Become a Photo Pro with AI and Deep Learning Dan Zeitman - Developer Advocate - Cloudinary Marek Sadowski - Developer Advocate - IBM
  2. What is AI / Deep Learning powered Photography? INPUT: Taking better and more professional photos AI in New Camera Devices OUTPUT: AI powered Websites Automated Curation of images, using Auto-tagging, Deep Learning & Visual Analysis Advances in algorithmic and AI-based Image manipulation, filtering, optimization and speedy delivery https://techcrunch.com/2017/12/01/crunch-report-tinder-is-using-ai-to-get-you-hooked-up/ https://techcrunch.com/2017/08/30/veo/ https://engineeringblog.yelp.com/2016/11/finding-beautiful-yelp-photos-using-deep-learning.html
  3. Facial recognition Fast and accurate search capability allows you to identify a person in a photo or video using your private repository of face images.
  4. Object, scene, and activity detection With AI, you can identify thousands of objects (e.g. bike, telephone, building) and scenes (e.g. parking lot, beach, city). When analyzing video, you can also identify specific activities happening in the frame, such as "delivering a package" or "playing soccer".
  5. Facial recognition Fast and accurate search capability allows you to identify a person in a photo or video using your private repository of face images.
  6. Facial analysis You can analyze the attributes of faces in images and videos to determine things like happiness, age range, eyes open, glasses, facial hair, etc. In video, you can also measure how these things change over time, such as constructing a timeline of the emotions of an actor.
  7. Person tracking Track people through a video even when their faces are not visible, or as they go in and out of the scene. You can also identify their movements in the frame to tell things like whether someone was entering or exiting a building.
  8. Unsafe content detection Content Moderation helps you identify potentially unsafe or inappropriate content across both image and video assets and provides you with detailed labels that allow you to accurately control what you want to allow based on your needs.
  9. Celebrity recognition You can quickly identify well known people in your video and image libraries to catalog footage and photos for marketing, advertising, and media industry use cases.
  10. Text in images Specifically built to work with real world images, AI can detect and recognize text from images, such as street names, captions, product names, and license plates.
  11. https://www.youtube.com/watch?v=JXh1yyvXpwo Google Clips - Consumer Product that captures family members and their pets. AWS DeepLens - Learning tool aimed at AI Developers. Lighthouse - Security camera (on Steroids) Furbo (Furbo) - Pet Camera that dispenses treats Spectacles (SnapChat V2) - Popular eyeglass camera will have AI labels and AR capabilities in the next version Arsenal Camera Assistant - Black box device to control DLSR cameras https://aws.amazon.com/deeplens/ Lighthouse - https://www.light.house Arsenal https://witharsenal.com https://techcrunch.com/2017/11/14/furbo-unveils-treat-tossing-dog-camera-with-smart-alerts-like-when-your-dog-is-pacing/ Furbo is calling this the “first AI-powered dog camera,” which uses machine learning and computer vision to detect when your dog is chewing, pacing back and forth, or playing with another pup. Furbo will also automatically take a photo of your pup when it’s looking at the camera and let you know when a human (like a dog-walker or puppy thief) comes into view. http://www.mobyaffiliates.com/blog/snap-to-launch-second-version-of-spectacles-with-ai-capabilities/
  12. Google Clips features Moment IQ, a machine learning algorithm that’s smart enough to recognize great expressions, lighting and framing. And it’s always learning. Google Clips is smart enough to recognize great expressions, lighting and framing. So the camera captures beautiful, spontaneous images. And it gets smarter over time. Clips learns to recognize familiar faces over time. You can help it learn faster by pressing the shutter button to shoot a portrait of a friend or family member.
  13. Amazon is calling DeepLens the world’s first deep learning enabled video camera for developers. AWS DeepLens helps put deep learning in the hands of developers, literally, with a fully programmable video camera, tutorials, code, and pre-trained models designed to expand deep learning skills.
  14. Lighthouse is an interactive assistant using advanced camera technology and machine learning for your home. You tell it the security, pet and family related activities you care about, and it tells you when those things happen. Product Developer’s Comments: Ai / Deep learning: A small amount is done on the device (mainly to filter out objects too small to be classified), but the much of the heavy lifting is done in our neural networks on the cloud. Closed Source: (?) Lighthouse built their own natural language processing, computer vision algorithms and custom 3D sensing hardware. Biggest Challenges for a product manufacturer? Everything! It's hard to build hardware! Getting the quality right, at the right place, at the right scale, at the right time. Lots goes into it. Have you considered 3rd party integration with a service like Cloudinary? We're launching a product with the most complicated and sophisticated computer vision that's ever been created on a consumer product. That's hard enough! At some point we'll focus on integrations, but that's not a priority for us quite yet.
  15. Arsenal’s smart assistant AI suggests settings based on your subject and environment. It uses an advanced neural network to pick the optimal settings for any scene (using similar algorithms to those in self driving cars). Like any good assistant, it then lets you control the final shot. Arsenal’s ultralight hardware uses state of the art AI to take better photos in any condition. https://witharsenal.com/features
  16. Arsenal quickly examines the scene. It uses image recognition to identify environment and subject-specific needs (e.g. fast shutter for birds or camera vibration) Arsenal then finds great settings by comparing the current scene with thousands of professional photos using a convolutional deep neural network. Lastly, Arsenal optimizes settings based on 18 different factors, like hyperfocal distance, sensor dynamic range and lens transmission. https://witharsenal.com/features
  17. Tinder: If you’ve ever quickly swiped through Tinder, you know that sometimes your fingers can get away from you – and, all of a sudden, you’ve Super Liked someone without meaning to. Oops! Tinder today is addressing that problem with a new feature now testing in select markets that will make Super Liking a more intentional experience. Called “Super Likeable,” the feature will pop up at random times in the app to offer you a free Super Like which can be used on one of four people presented on the Super Likeable card. Tinder says the experience itself is powered by artificial intelligence that helps select the people it thinks will be “of special interest to you.” According to TechCrunch, Tinder tell us, broadly, it’s using a history of your interactions on the service to figure out who sparks your interest. Veo: https://techcrunch.com/2017/12/01/crunch-report-tinder-is-using-ai-to-get-you-hooked-up/ https://techcrunch.com/2017/08/30/veo/ https://engineeringblog.yelp.com/2016/11/finding-beautiful-yelp-photos-using-deep-learning.html
  18. Yelp users upload around 100,000 photos a day to a collection of tens of millions, and that rate continues to grow. Yelp turned to various computer vision techniques, trying to discover intrinsic features of a given image that could be associated with a quality score Depth of Field: For example, one important feature for photographers is depth of field, which measures how much of the image is in focus. Using a “shallow” depth of field can be an excellent way to distinguish the subject of an image from its background, and photos uploaded to Yelp are no exception. In many cases, the most beautiful images of a given restaurant were very sharply focused on a specific entrée. Contrast: Contrast measures the difference in brightness and color between an object in an image and other nearby objects. There are several formulas for contrast, but most involve comparing the luminance, or light intensity of neighboring regions of an image. Alignment: Finally, the location of objects in an image with respect to one another can be a significant aesthetic consideration. Studies have shown, for example, that people have an innate predisposition towards symmetry in art. In addition, some photographers also promote what is called the “rule of thirds,” a method of aligning important elements of an image along certain axes to create a sense of motion or energy. https://engineeringblog.yelp.com/2016/11/finding-beautiful-yelp-photos-using-deep-learning.html https://engineeringblog.yelp.com/2015/10/how-we-use-deep-learning-to-classify-business-photos-at-yelp.html
  19. At Yelp, each business’s page showcases a few of its best cover photos. The key issue is that this system is highly subject to selection bias. Cover photos are viewed and clicked significantly more often than average. As a result, once a photo ends up on the business page, it is highly likely to remain there, even if more attractive and useful photos are uploaded at a later date. Additionally, relying solely on “likes” to determine prominent photos can end up promoting “clickbait” photos- that is, those that may have low relevance and quality but are upvoted due to their provocative nature. The engineering team at Yelp believes that the quality of cover photos for restaurants has significantly improved. https://engineeringblog.yelp.com/2016/11/finding-beautiful-yelp-photos-using-deep-learning.html https://engineeringblog.yelp.com/2015/10/how-we-use-deep-learning-to-classify-business-photos-at-yelp.html