Hot Dog Not Hot Dog!
Generate new training data
Without taking more photos
Annie Flippo
October, 2017
Southern California Data Science Conference
Hot Dog, Not Hot Dog!
Image Source: HBO
The Tech is good. Just needs more training data.
Image Source: HBO
I work as a Manager of Analytics
At Thinknear by Telenav
We are a location-based advertising platform
GPS navigation for vehicles and mobile phones
Cat
Dog
Mug
Hat
Deep Neural Network
Neural Network Visualized
http://scs.ryerson.ca/~aharley/vis/conv/
Perfect Training Data
Real-life Data
Hot Dogs or Meatballs?
Image recognition is one of the most important
developments in AI. Close your eyes and imagine:
- Taking a shower
- Eating your lunch
- Driving
Real-life Driving: Distracted
Our Eyes Only
Autonomous Driving: Sign Recognition
Real Signs
Why is object recognition so hard?
Image Source: Stanford University, cs231n.github.io/classification
The Humble Stop Sign
Be Aware of Image Hacking 1
Image Source: https://codewords.recurse.com/issues/five/why-do-neural-networks-think-a-panda-is-a-vulture
Be Aware of Image Hacking 2
Image Source: https://codewords.recurse.com/issues/five/why-do-neural-networks-think-a-panda-is-a-vulture
Be Aware of Image Hacking 3
Image Source: https://medium.com/@ageitgey/machine-learning-is-fun-part-8-how-to-intentionally-trick-neural-networks-b55da32b7196
Be Aware of Image Hacking - Fake Fakes
This could be a devastating example if the autonomous vehicle is trained
to drive over newspaper!
Pedestrian (98.5%) Newspaper (95.9%)
What can we do?
I need better training!
Any ideas?
Generate Training Data Idea 1
Augment source image by:
Rotation, Perspective Transformation
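Rotation and perspective transformation can both be expressed as a 3x3 matrix applied to pixel coordinates. The sketch below is one possible NumPy-only illustration, not the code from the talk: the helper names `warp`, `rotation_matrix`, and `perspective_matrix` are hypothetical, and nearest-neighbour sampling is assumed for simplicity.

```python
import numpy as np

def warp(image, matrix):
    """Warp an H x W (or H x W x C) image by a 3x3 matrix, nearest neighbour.

    Each output pixel (x, y) is filled from the source location
    inverse(matrix) @ (x, y, 1). Pixels that map outside the source stay 0.
    """
    h, w = image.shape[:2]
    ys, xs = np.mgrid[0:h, 0:w]
    dst = np.stack([xs.ravel(), ys.ravel(), np.ones(h * w)])  # 3 x N
    src = np.linalg.inv(matrix) @ dst
    src_x = np.rint(src[0] / src[2]).astype(int)
    src_y = np.rint(src[1] / src[2]).astype(int)
    valid = (src_x >= 0) & (src_x < w) & (src_y >= 0) & (src_y < h)
    out = np.zeros(image.shape, dtype=image.dtype)
    flat_out = out.reshape(h * w, -1)      # view onto `out`
    flat_in = image.reshape(h * w, -1)
    flat_out[valid] = flat_in[src_y[valid] * w + src_x[valid]]
    return out

def rotation_matrix(angle_deg, center):
    """Rotate about `center` = (cx, cy). Positive angles turn clockwise
    on screen because the image y axis points down."""
    cx, cy = center
    t = np.deg2rad(angle_deg)
    c, s = np.cos(t), np.sin(t)
    return np.array([
        [c, -s, cx - c * cx + s * cy],
        [s,  c, cy - s * cx - c * cy],
        [0.0, 0.0, 1.0],
    ])

def perspective_matrix(kx, ky):
    """A mild perspective distortion (hypothetical parameterisation).
    Keep kx and ky small and non-negative so the divisor stays positive."""
    return np.array([
        [1.0, 0.0, 0.0],
        [0.0, 1.0, 0.0],
        [kx, ky, 1.0],
    ])
```

For example, `warp(img, rotation_matrix(15, ((w - 1) / 2, (h - 1) / 2)))` tilts an image slightly about its centre, and `warp(img, perspective_matrix(0.002, 0.0))` simulates viewing it from an angle.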
Generate Training Data Idea 2
Blurring, Brightness Adjustment
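Blurring and brightness adjustment can be sketched in a few lines of NumPy. This is an illustration under stated assumptions rather than the talk's code: images are assumed to be `uint8` arrays in the range 0 to 255, a simple box blur stands in for whichever blur was actually used, and the function names are hypothetical.

```python
import numpy as np

def box_blur(image, radius=1):
    """Blur by averaging each pixel with its (2*radius+1)^2 neighbourhood.

    Borders are handled by repeating the edge pixels.
    """
    img = image.astype(np.float64)
    pad = [(radius, radius), (radius, radius)] + [(0, 0)] * (img.ndim - 2)
    padded = np.pad(img, pad, mode="edge")
    h, w = img.shape[:2]
    k = 2 * radius + 1
    total = np.zeros_like(img)
    # Sum the k*k shifted copies of the image, then divide by the count.
    for dy in range(k):
        for dx in range(k):
            total += padded[dy:dy + h, dx:dx + w]
    return np.clip(np.rint(total / (k * k)), 0, 255).astype(np.uint8)

def adjust_brightness(image, factor):
    """Scale intensities: factor > 1 brightens, factor < 1 darkens.

    Values are clipped to the valid uint8 range.
    """
    scaled = image.astype(np.float64) * factor
    return np.clip(np.rint(scaled), 0, 255).astype(np.uint8)
```

Drawing `radius` and `factor` at random for each training example gives a different variation every time the same source photo is used.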
Generate Training Data Idea 3
Add Noise, Shearing
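Noise and shearing can be sketched the same way. The code below is a hedged illustration, not the talk's implementation: it assumes `uint8` images, Gaussian noise (the slides do not say which noise model was used), and a horizontal shear done by shifting whole rows. The function names are hypothetical.

```python
import numpy as np

def add_gaussian_noise(image, sigma=10.0, rng=None):
    """Add zero-mean Gaussian noise with standard deviation `sigma`."""
    rng = np.random.default_rng() if rng is None else rng
    noisy = image.astype(np.float64) + rng.normal(0.0, sigma, image.shape)
    return np.clip(np.rint(noisy), 0, 255).astype(np.uint8)

def shear_horizontal(image, factor):
    """Shift each row sideways in proportion to its distance from the
    middle row. Vacated pixels are filled with 0."""
    h, w = image.shape[:2]
    out = np.zeros_like(image)
    for y in range(h):
        shift = int(round(factor * (y - (h - 1) / 2)))
        if abs(shift) >= w:
            continue  # row is pushed entirely out of the frame
        if shift >= 0:
            out[y, shift:] = image[y, :w - shift]
        else:
            out[y, :w + shift] = image[y, -shift:]
    return out
```

Passing a seeded `rng` makes the noise reproducible, which helps when debugging an augmentation pipeline.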
Apply Blurring
Apply Noise
Adjust Brightness
Apply Rotations
All Adjustments
Dropout in Neural Network
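Dropout randomly zeroes units during training so the network cannot depend on any single feature. The sketch below shows one common formulation, inverted dropout, in NumPy. The slides do not say which variant the talk used, so treat the function name, signature, and scaling choice as assumptions.

```python
import numpy as np

def dropout(activations, rate, rng, training=True):
    """Inverted dropout (assumed variant).

    During training, each unit is zeroed with probability `rate` and the
    survivors are scaled by 1 / (1 - rate) so the expected activation is
    unchanged. At inference time the input is returned untouched.
    """
    if not 0.0 <= rate < 1.0:
        raise ValueError("rate must be in [0, 1)")
    if not training or rate == 0.0:
        return activations
    keep_prob = 1.0 - rate
    mask = rng.random(activations.shape) < keep_prob
    return activations * mask / keep_prob
```

With `rate=0.5`, roughly half of the activations are dropped on each forward pass, and a different half is dropped the next time.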
Time Saving Tip
Generate many variations per source image
but:
● Don’t save to disk (slow and takes space)
● Run it as a Python generator
○ Generate images as you train (fast and scalable)
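The tip above can be sketched as a Python generator that builds each augmented batch in memory only when the training loop asks for it. This is a minimal illustration, not the talk's code: the function name, the parameters, and the choice of brightness and noise as the two augmentations are all assumptions, and images are assumed to be a `uint8` NumPy array with the batch dimension first.

```python
import numpy as np

def augmented_batches(images, labels, batch_size, rng,
                      max_brightness_shift=0.3, noise_sigma=8.0):
    """Yield (batch_images, batch_labels) forever, augmenting on the fly.

    Nothing is written to disk. Every batch is a fresh random variation
    of randomly chosen source images.
    """
    n = len(images)
    while True:
        idx = rng.integers(0, n, size=batch_size)
        batch = images[idx].astype(np.float64)
        # One random brightness factor per image, broadcast over its pixels.
        factor_shape = (batch_size,) + (1,) * (batch.ndim - 1)
        factors = rng.uniform(1.0 - max_brightness_shift,
                              1.0 + max_brightness_shift,
                              size=factor_shape)
        batch = batch * factors
        # Independent Gaussian noise on every pixel.
        batch += rng.normal(0.0, noise_sigma, batch.shape)
        yield np.clip(np.rint(batch), 0, 255).astype(np.uint8), labels[idx]
```

A training loop can then pull batches with `next(gen)` or iterate over the generator for a fixed number of steps. Memory use stays constant no matter how many variations are produced.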
Train a Winning Model
… and become a billionaire!
Image Source: HBO
Thank You
acflippo @acflippo
Slides and code: bit.ly/scds-acflippo

Techniques to generate training data

Editor's Notes

  1. A cast of thirty-something techies continuously tries to get funding for their ideas. They go to tech competitions and conferences but are always thwarted by their competitor, nemesis, and billionaire CEO, Gavin Belson. (Silicon Valley opening scene, season 1.)
  2. Erlich, the serial entrepreneur, runs an incubator from his house that includes this group of startup guys. One of them is Jian-Yang, the freeloader of the group.
  3. Finally, through some miscommunication, Erlich sold Jian-Yang's idea for a food identification app. During the demo, the app identifies only hot dogs and labels all other foods as not hot dogs.
  4. Perplexed by the success of the app, Dinesh wonders ...
  5. Erlich had the idea to use the students from his friend's Stanford class to scrape the internet for food pictures to train Jian-Yang's model. But the students steal his idea, rendering his company worthless. Erlich again had his butt handed to him because he doesn't understand well enough how to train a neural network.
  6. See these perfect specimens of hot dog pictures, probably taken by a professional food photographer. For any neural network, training data is very important. Consider the simple task of identifying a hot dog: in how many ways can one simple hot dog be represented?
  7. For most activities, we humans only need to use our eyes: to drive, to eat, to catch a ball. We haven't needed radar, lidar, or other electronic sensors.
  8. I'll show you some signs, and you will all recognize them. They're shown under perfect conditions.
  9. Life is not like that. Even for just the School Zone sign, there are lots of variants. Some are very localized to the region. Please use shotguns only because a handgun would just be too dangerous!
  10. Another way to get a robust classification neural net model is to use dropout. It sounds kind of crazy and unintuitive. Why would you want to wipe out information in the middle of training your network? Because you don't want the model to count on any specific feature, since at any time that feature may not be there.
  11. Go forth, generate a gazillion training images for your winning model, and become a billionaire!