SlideShare a Scribd company logo
1 of 27
Download to read offline
Drones with IBM
Watson
Upkar Lidder, IBM
@lidderupk
https://github.com/lidderupk/
ulidder@us.ibm.com
Slides - https://github.com/lidderupk/watson-vr-nyc-javasig
IBM Cloud - https://ibm.biz/ibm-nyjavasig
2
1500 Drones Are Ready To Fly
IBM Developer Drone Challenge
Challenge runs from November 12 through December 16 and those 18 years of
age or older residing in the US and Canada are eligible to enter
Enter at https://developer.ibm.com/contest
Once per week during the 5 weeks of the contest a random drawing
Will be held to determine the winners
(watch for the drawing on Twitch)
Winners will receive a DJI Tello programmable drone,
an IBM Developer T-shirt, and an IBM Developer laptop sticker
Winners will also receive a special code to unlock more challenges and given
the opportunity to contribute back to the challenge
#IBMDroneDrop
IBM Cloud
3
Open Source
@ IBM
4
IBM Code Patterns
https://developer.ibm.com/patterns/ -> Try it out !
5
Classify vehicle damage images
6
1. The user captures an image with the mobile application.
2. The user sends the image on the mobile phone to the server application running in the cloud.
3. The server sends the image to Watson Visual Recognition service for analysis.
4. Watson Visual Recognition service classifies the image and returns the information to the server.
Chatbot - Anger detection, NLU, and removal of explicit images
7
1. The user interacts with the Slack app and either sends a text or uploads an image.
2. A bot passes the text or image used in Slack to an IBM Cloud Functions API.
3. Watson Visual Recognition categorizes the uploaded image using the default and explicit classifiers.
4. Watson Natural Language Processing categorizes the text if text is sent as part of the Slack communication.
5. IBM Cloud Functions receives the response, and if the text is not polite, the bot sends a message to the
Slack user to be more polite using the Slack post message API. If an image that is considered explicit is used,
the image is deleted by IBM Cloud Functions using the Slack files delete API.
Watson Studio
8
Classifiers/Models
9
A set of built-in models provides highly accurate results without training:
•General model: Default classification from thousands of classes.
•Face model: Facial analysis with age and gender.
•Explicit model (Beta): Whether an image is inappropriate for general use.
•Food model (Beta): Specifically for images of food items.
•Text model (Private beta): Text extraction from natural scene images.
Requires training:
•Custom model: Train a custom classifier by providing positive and negative images.
General Model
10
•Animals (including birds, reptiles, amphibians, etc.)
•Person and people-oriented information and activities
•Food (including cooked food and beverages)
•Plants (including trees, shrubs, aquatic plants, vegetables)
•Sports
•Nature (including many types of natural formations, geological structures)
•Transportation (land, water, air)
•And many more, including furnishings, fruits, musical instruments, tools, colors, gadgets, devices,
instruments, weapons, buildings, structures and man-made objects, clothing and garments, and flowers,
among others.
General Model Example
11
Face Detect
12
Photo by Tom Kelly / CC BY
Face Detect
13
Photo by Alan Kotok / CC BY
classify detect_faces
Food
14
classifier_ids=“” classifier_ids=food
Photo by Lola Williams / CC BY
Custom Models
15
Initial Training Update Training
Watson API Explorer
16
17
DEMO
ulidder@us.ibm.com
twitter.com/lidderupk
github.com/lidderupk
- Custom Classifier
- Test Tab
- Implementation Tab
- Java API
- SWT App
Java Application
18
Java Application – Util Class
19
Java Application – Default Classifier
20
Java Application – Custom Classifier
21
Custom Models - Tips and Tricks
22
•Provide training examples that are similar to what you plan to analyze.
•Example: training with tiger in a zoo and then using the model to classify tiger in the wild. The
background, surroundings matter, angle, lighting, distance, size of subject, etc matter.
•If the quality and content of training data is the same, then classifiers/classes that are trained on more
images will do better.
•Time vs accuracy - the benefits of more images plateaus around 5000 images.
•Parallelize by groups of images. You can train a class asynchronously.
•Recommend 150-200 images per .zip file with image size of 320 x 320. Do not need high resolution.
•Specify only classifiers you want the result for using the “classifier_id” parameter.
developer.ibm.com/code
23
24
Difficult Use Cases
25
While Watson Visual Recognition is highly flexible, there have been a number of recurring use case that
we’ve seen the API either struggle on or require significant pre/post-work from the user.
•Face Recognition: Visual Recognition is capable of face detection (detecting the presence of faces) not
face recognition (identifying individuals).
•Detecting details: Occasionally, users want to classify an image based on a small section of an image or
details scattered within an image. Because Watson analyzes the entire image when training, it may struggle
on classifications that depend on small details. Some users have adopted the strategy of breaking the image
into pieces or zooming into relevant parts of an image.
•Emotion: Emotion classification (whether facial emotion or contextual emotion) is not a feature currently
supported by Visual Recognition. Some users have attempted to do this through custom classifiers, but this
is an edge case and we cannot estimate the accuracy of this type of training.
Size Limitations
26
There are size limitations for training calls and data:
•The service accepts a maximum of 10,000 images or 100 MB per .zip file
•The service requires a minimum of 10 images per .zip file.
•The service accepts a maximum of 256 MB per training call.
•Minimum recommended size of an image is 32X32 pixels.
There are also size limitations when classifying images or detecting faces:
•Limitations for the methods to classify images:
• Maximum image size is 10 MB.
• Maximum .zip file size is 100 MB with up to 20 images.
•Limitations for the methods to detect faces:
• Maximum image size is 10 MB.
• Maximum .zip file size is 100 MB with up to 15 images.
Testing the custom model in UI
27

More Related Content

Similar to Watson Visual Recognition - NYC JavaSig

Graphics on the Go
Graphics on the GoGraphics on the Go
Graphics on the GoGil Irizarry
 
Rapid object detection using boosted cascade of simple features
Rapid object detection using boosted  cascade of simple featuresRapid object detection using boosted  cascade of simple features
Rapid object detection using boosted cascade of simple featuresHirantha Pradeep
 
Custom Image Classifier with Visual Recognition: Building with Watson
Custom Image Classifier with Visual Recognition: Building with Watson Custom Image Classifier with Visual Recognition: Building with Watson
Custom Image Classifier with Visual Recognition: Building with Watson IBM Watson
 
Building real time image classifiers for mobile apps with azure custom vision
Building real time image classifiers for mobile apps with azure custom visionBuilding real time image classifiers for mobile apps with azure custom vision
Building real time image classifiers for mobile apps with azure custom visionLuis Beltran
 
Image Processing and Computer Vision in iOS
Image Processing and Computer Vision in iOSImage Processing and Computer Vision in iOS
Image Processing and Computer Vision in iOSOge Marques
 
Automated_attendance_system_project.pptx
Automated_attendance_system_project.pptxAutomated_attendance_system_project.pptx
Automated_attendance_system_project.pptxNaveensai51
 
Getting Intimate with Images on Android with James Halpern
Getting Intimate with Images on Android with James HalpernGetting Intimate with Images on Android with James Halpern
Getting Intimate with Images on Android with James HalpernFITC
 
OReilly AI Transfer Learning
OReilly AI Transfer LearningOReilly AI Transfer Learning
OReilly AI Transfer LearningDanielle Dean
 
Introduction to GluonCV
Introduction to GluonCVIntroduction to GluonCV
Introduction to GluonCVApache MXNet
 
Making Netflix Machine Learning Algorithms Reliable
Making Netflix Machine Learning Algorithms ReliableMaking Netflix Machine Learning Algorithms Reliable
Making Netflix Machine Learning Algorithms ReliableJustin Basilico
 
A Responsive Design Case Study - What We Did Wrong Building ResponsiveDesign....
A Responsive Design Case Study - What We Did Wrong Building ResponsiveDesign....A Responsive Design Case Study - What We Did Wrong Building ResponsiveDesign....
A Responsive Design Case Study - What We Did Wrong Building ResponsiveDesign....Aidan Foster
 
[AWS Innovate 온라인 컨퍼런스] 간단한 Python 코드만으로 높은 성능의 기계 학습 모델 만들기 - 김무현, AWS Sr.데이...
[AWS Innovate 온라인 컨퍼런스] 간단한 Python 코드만으로 높은 성능의 기계 학습 모델 만들기 - 김무현, AWS Sr.데이...[AWS Innovate 온라인 컨퍼런스] 간단한 Python 코드만으로 높은 성능의 기계 학습 모델 만들기 - 김무현, AWS Sr.데이...
[AWS Innovate 온라인 컨퍼런스] 간단한 Python 코드만으로 높은 성능의 기계 학습 모델 만들기 - 김무현, AWS Sr.데이...Amazon Web Services Korea
 
Cutting Edge Computer Vision for Everyone
Cutting Edge Computer Vision for EveryoneCutting Edge Computer Vision for Everyone
Cutting Edge Computer Vision for EveryoneIvo Andreev
 
Mi0041 java and web design
Mi0041  java and web designMi0041  java and web design
Mi0041 java and web designsmumbahelp
 
Machine Learning Model for Gender Detection
Machine Learning Model for Gender DetectionMachine Learning Model for Gender Detection
Machine Learning Model for Gender DetectionTecnoIncentive
 
Human age and gender Detection
Human age and gender DetectionHuman age and gender Detection
Human age and gender DetectionAbhiAchalla
 
B4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearningB4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearningHoa Le
 
Deep-learning-for-computer-vision-applications-using-matlab.pdf
Deep-learning-for-computer-vision-applications-using-matlab.pdfDeep-learning-for-computer-vision-applications-using-matlab.pdf
Deep-learning-for-computer-vision-applications-using-matlab.pdfAubainYro1
 

Similar to Watson Visual Recognition - NYC JavaSig (20)

Recsys 2016
Recsys 2016Recsys 2016
Recsys 2016
 
Graphics on the Go
Graphics on the GoGraphics on the Go
Graphics on the Go
 
Rapid object detection using boosted cascade of simple features
Rapid object detection using boosted  cascade of simple featuresRapid object detection using boosted  cascade of simple features
Rapid object detection using boosted cascade of simple features
 
Custom Image Classifier with Visual Recognition: Building with Watson
Custom Image Classifier with Visual Recognition: Building with Watson Custom Image Classifier with Visual Recognition: Building with Watson
Custom Image Classifier with Visual Recognition: Building with Watson
 
Building real time image classifiers for mobile apps with azure custom vision
Building real time image classifiers for mobile apps with azure custom visionBuilding real time image classifiers for mobile apps with azure custom vision
Building real time image classifiers for mobile apps with azure custom vision
 
Image Processing and Computer Vision in iOS
Image Processing and Computer Vision in iOSImage Processing and Computer Vision in iOS
Image Processing and Computer Vision in iOS
 
Automated_attendance_system_project.pptx
Automated_attendance_system_project.pptxAutomated_attendance_system_project.pptx
Automated_attendance_system_project.pptx
 
Getting Intimate with Images on Android with James Halpern
Getting Intimate with Images on Android with James HalpernGetting Intimate with Images on Android with James Halpern
Getting Intimate with Images on Android with James Halpern
 
OReilly AI Transfer Learning
OReilly AI Transfer LearningOReilly AI Transfer Learning
OReilly AI Transfer Learning
 
Introduction to GluonCV
Introduction to GluonCVIntroduction to GluonCV
Introduction to GluonCV
 
Making Netflix Machine Learning Algorithms Reliable
Making Netflix Machine Learning Algorithms ReliableMaking Netflix Machine Learning Algorithms Reliable
Making Netflix Machine Learning Algorithms Reliable
 
A Responsive Design Case Study - What We Did Wrong Building ResponsiveDesign....
A Responsive Design Case Study - What We Did Wrong Building ResponsiveDesign....A Responsive Design Case Study - What We Did Wrong Building ResponsiveDesign....
A Responsive Design Case Study - What We Did Wrong Building ResponsiveDesign....
 
[AWS Innovate 온라인 컨퍼런스] 간단한 Python 코드만으로 높은 성능의 기계 학습 모델 만들기 - 김무현, AWS Sr.데이...
[AWS Innovate 온라인 컨퍼런스] 간단한 Python 코드만으로 높은 성능의 기계 학습 모델 만들기 - 김무현, AWS Sr.데이...[AWS Innovate 온라인 컨퍼런스] 간단한 Python 코드만으로 높은 성능의 기계 학습 모델 만들기 - 김무현, AWS Sr.데이...
[AWS Innovate 온라인 컨퍼런스] 간단한 Python 코드만으로 높은 성능의 기계 학습 모델 만들기 - 김무현, AWS Sr.데이...
 
Ad world
Ad worldAd world
Ad world
 
Cutting Edge Computer Vision for Everyone
Cutting Edge Computer Vision for EveryoneCutting Edge Computer Vision for Everyone
Cutting Edge Computer Vision for Everyone
 
Mi0041 java and web design
Mi0041  java and web designMi0041  java and web design
Mi0041 java and web design
 
Machine Learning Model for Gender Detection
Machine Learning Model for Gender DetectionMachine Learning Model for Gender Detection
Machine Learning Model for Gender Detection
 
Human age and gender Detection
Human age and gender DetectionHuman age and gender Detection
Human age and gender Detection
 
B4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearningB4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearning
 
Deep-learning-for-computer-vision-applications-using-matlab.pdf
Deep-learning-for-computer-vision-applications-using-matlab.pdfDeep-learning-for-computer-vision-applications-using-matlab.pdf
Deep-learning-for-computer-vision-applications-using-matlab.pdf
 

Recently uploaded

Science lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lessonScience lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lessonJericReyAuditor
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaVirag Sontakke
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,Virag Sontakke
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfadityarao40181
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 

Recently uploaded (20)

Science lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lessonScience lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lesson
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of India
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdf
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 

Watson Visual Recognition - NYC JavaSig

  • 1. Drones with IBM Watson Upkar Lidder, IBM @lidderupk https://github.com/lidderupk/ ulidder@us.ibm.com Slides - https://github.com/lidderupk/watson-vr-nyc-javasig IBM Cloud - https://ibm.biz/ibm-nyjavasig
  • 2. 2 1500 Drones Are Ready To Fly IBM Developer Drone Challenge Challenge runs from November 12 through December 16 and those 18 years of age or older residing in the US and Canada are eligible to enter Enter at https://developer.ibm.com/contest Once per week during the 5 weeks of the contest a random drawing Will be held to determine the winners (watch for the drawing on Twitch) Winners will receive a DJI Tello programmable drone, an IBM Developer T-shirt, and an IBM Developer laptop sticker Winners will also receive a special code to unlock more challenges and given the opportunity to contribute back to the challenge #IBMDroneDrop
  • 6. Classify vehicle damage images 6 1. The user captures an image with the mobile application. 2. The user sends the image on the mobile phone to the server application running in the cloud. 3. The server sends the image to Watson Visual Recognition service for analysis. 4. Watson Visual Recognition service classifies the image and returns the information to the server.
  • 7. Chatbot - Anger detection, NLU, and removal of explicit images 7 1. The user interacts with the Slack app and either sends a text or uploads an image. 2. A bot passes the text or image used in Slack to an IBM Cloud Functions API. 3. Watson Visual Recognition categorizes the uploaded image using the default and explicit classifiers. 4. Watson Natural Language Processing categorizes the text if text is sent as part of the Slack communication. 5. IBM Cloud Functions receives the response, and if the text is not polite, the bot sends a message to the Slack user to be more polite using the Slack post message API. If an image that is considered explicit is used, the image is deleted by IBM Cloud Functions using the Slack files delete API.
  • 9. Classifiers/Models 9 A set of built-in models provides highly accurate results without training: •General model: Default classification from thousands of classes. •Face model: Facial analysis with age and gender. •Explicit model (Beta): Whether an image is inappropriate for general use. •Food model (Beta): Specifically for images of food items. •Text model (Private beta): Text extraction from natural scene images. Requires training: •Custom model: Train a custom classifier by providing positive and negative images.
  • 10. General Model 10 •Animals (including birds, reptiles, amphibians, etc.) •Person and people-oriented information and activities •Food (including cooked food and beverages) •Plants (including trees, shrubs, aquatic plants, vegetables) •Sports •Nature (including many types of natural formations, geological structures) •Transportation (land, water, air) •And many more, including furnishings, fruits, musical instruments, tools, colors, gadgets, devices, instruments, weapons, buildings, structures and man-made objects, clothing and garments, and flowers, among others.
  • 12. Face Detect 12 Photo by Tom Kelly / CC BY
  • 13. Face Detect 13 Photo by Alan Kotok / CC BY classify detect_faces
  • 19. Java Application – Util Class 19
  • 20. Java Application – Default Classifier 20
  • 21. Java Application – Custom Classifier 21
  • 22. Custom Models - Tips and Tricks 22 •Provide training examples that are similar to what you plan to analyze. •Example: training with tiger in a zoo and then using the model to classify tiger in the wild. The background, surroundings matter, angle, lighting, distance, size of subject, etc matter. •If the quality and content of training data is the same, then classifiers/classes that are trained on more images will do better. •Time vs accuracy - the benefits of more images plateaus around 5000 images. •Parallelize by groups of images. You can train a class asynchronously. •Recommend 150-200 images per .zip file with image size of 320 x 320. Do not need high resolution. •Specify only classifiers you want the result for using the “classifier_id” parameter.
  • 24. 24
  • 25. Difficult Use Cases 25 While Watson Visual Recognition is highly flexible, there have been a number of recurring use case that we’ve seen the API either struggle on or require significant pre/post-work from the user. •Face Recognition: Visual Recognition is capable of face detection (detecting the presence of faces) not face recognition (identifying individuals). •Detecting details: Occasionally, users want to classify an image based on a small section of an image or details scattered within an image. Because Watson analyzes the entire image when training, it may struggle on classifications that depend on small details. Some users have adopted the strategy of breaking the image into pieces or zooming into relevant parts of an image. •Emotion: Emotion classification (whether facial emotion or contextual emotion) is not a feature currently supported by Visual Recognition. Some users have attempted to do this through custom classifiers, but this is an edge case and we cannot estimate the accuracy of this type of training.
  • 26. Size Limitations 26 There are size limitations for training calls and data: •The service accepts a maximum of 10,000 images or 100 MB per .zip file •The service requires a minimum of 10 images per .zip file. •The service accepts a maximum of 256 MB per training call. •Minimum recommended size of an image is 32X32 pixels. There are also size limitations when classifying images or detecting faces: •Limitations for the methods to classify images: • Maximum image size is 10 MB. • Maximum .zip file size is 100 MB with up to 20 images. •Limitations for the methods to detect faces: • Maximum image size is 10 MB. • Maximum .zip file size is 100 MB with up to 15 images.
  • 27. Testing the custom model in UI 27