SlideShare a Scribd company logo
Rahul Kumar
AI Scientist | Deep Learning Practitioner | Author
Deep Learning
for
Industries
About me
@hellorahulk goodrahstar
hellorahulk.com
Chief AI Scientist
Jatana.ai
Author of
@hellorahulk goodrahstar
hellorahulk.com
DE3p Larenn1g mhica3ns wrok smliair to hOw biarns wrok.
Tehse mahcnies wrok by s33nig f22Uy pa773rns and cnonc3t1ng
t3Hm t0 fU22y cnoc3tps. T3hy wRok l4y3r by ly43r, j5ut lK1e a
f1L37r, t4k1NG cmopl3x scn33s aNd br3k41ng tH3m dwon itno
s1pmLe iD34s.
Only Data Scientist can read it :
Deep Learning mechanism work similar to how brain work.
These machines works by seeing funny patterns and connecting
them to funny concepts. They work layer by layer, just like a filter,
taking complex scenes and breaking them down into simple
ideas.
Only Data Scientist can read it :
I want to be a
good
Data Scientist
Learning paradigm
NNML DL QC ……
NNML DL QC ……
Learning paradigm
NNML DL QC ……
2 papers a week
Follow NeurIPs, AAAI, CVPR, ICAPS, ICML
Learning paradigm
Listen to podcast
Follow influencers on twitter etc.
NNML DL QC ……
Depth
Learning paradigm
NNML DL QC ……
Depth
Learning paradigm
NNML DL QC ……
Depth
Learning paradigm
Is that it … ?
Let’s get
practical …
End to End Pipeline
End to End Pipeline
End to End Pipeline : INPUTS
Data sources:
• Cameras
• CSV/ Excels
• Big data
• SQL databases
• Mongo DB
Key takeaway:
• Learn to build connectors
• Avoid I/O ops
• Use binary like pickles
End to End Pipeline
End to End Pipeline : PRE-PROCESSING
Why do I care ?
Text classification
Entity Extraction
End to End Pipeline : PRE-PROCESSING
Why do I care ?
Data that you will getEnd to End Pipeline : PRE-PROCESSING
But…
Data that you will getEnd to End Pipeline : PRE-PROCESSING
Data that you will get
Data that you will get
End to End Pipeline : PRE-PROCESSING
Learn to visualization the data
End to End Pipeline : PRE-PROCESSING
Understand the important patterns
Extracting Capital letter words
Source: https://regexr.com
End to End Pipeline : PRE-PROCESSING
Extract Noise from the data
Extracting emails
Source: https://regexr.com
End to End Pipeline : PRE-PROCESSING
Understand the data distribution, outliers, skewed data
Perform Data Argumentation Balanced dataset
End to End Pipeline : PRE-PROCESSING
Understand the data distribution, outliers, skewed data
End to End Pipeline : PRE-PROCESSING
1. Better generalization of the model.
2. Better accuracy.
3. Boosts the training speed.
End to End Pipeline
End to End Pipeline : FEATURE CREATION
Representing the word using various feature representations:
•Morphological = [(prefix, over-), (root, fit), (suffix=imperfect tense, -ing)]
•Unigrams = ['o', 'v', 'e', 'r', 'f', 'i', 't', 't', 'i', 'n', 'g']
•Bigrams = ['ov', 've', 'er', 'rf', 'fi', 'it', 'tt', 'ti', 'in', 'ng']
•Trigrams = ['ove', 'ver', 'erf', 'rfi', 'fit', 'itt', 'tti', 'tin', 'ing']
•One-hot = [0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0]
•Word vector = [-0.26, 0.34, 0.48, -0.06, 0.16, 0.11, 0.13, -0.15, 0.47, -0.49, 0.07,
-0.39, -0.13, -0.15, 0.06, 0.09]
• ...
End to End Pipeline : FEATURE CREATION
End to End Pipeline
End to End Pipeline : TRAIN MODEL
Source: http://bit.ly/PlotNeurons
Design very basic Architecture
Think about the inputs Think about the outputs
Added more complexity
Train
“Build your own baseline”
End to End Pipeline : TRAIN MODEL
End to End Pipeline : TRAIN MODEL
End to End Pipeline : TRAIN MODEL
Hyperparameter optimization
End to End Pipeline : TRAIN MODEL
Hyperparameter optimization
Colab link: http://bit.ly/talosHPO
End to End Pipeline
End to End Pipeline : DEPLOY MODEL
End to End Pipeline : DEPLOY MODEL
•Freezing: That is, converting the variables stored in a
checkpoint file of the SavedModel into constants
stored directly in the model graph.
‘Shrinking model size (to have less memory and disk footprints), and improving prediction latency.’
Post training optimizations
End to End Pipeline : DEPLOY MODEL
•Quantisation: That is, converting any large float Const op into an eight-bit equivalent, followed by a float
conversion op so that the result is usable by subsequent nodes.
Post training optimizations
‘Shrinking model size (to have less memory and disk footprints), and improving prediction latency.’
End to End Pipeline
Must to have:
• Micro-service architecture
• Optimize each modules
• Real-time monitoring
• Complete RESTful mode
Let’s Deploy
Code : https://github.com/goodrahstar/tf-sentiment-docker
Thank you
@hellorahulk goodrahstar
hellorahulk.com
Book : https://www.amazon.com/dp/1788997093
Code : http://bit.ly/DeepLearningCode

More Related Content

Similar to Deeplearning for industries | Data to Production

From Idea to Model: Productionizing Data Pipelines with Apache Airflow
From Idea to Model: Productionizing Data Pipelines with Apache AirflowFrom Idea to Model: Productionizing Data Pipelines with Apache Airflow
From Idea to Model: Productionizing Data Pipelines with Apache Airflow
Databricks
 
Project
ProjectProject
Project
Xu Liu
 
CrossRef How-to: A Technical Introduction to the Basics of CrossRef, Chuck Ko...
CrossRef How-to: A Technical Introduction to the Basics of CrossRef, Chuck Ko...CrossRef How-to: A Technical Introduction to the Basics of CrossRef, Chuck Ko...
CrossRef How-to: A Technical Introduction to the Basics of CrossRef, Chuck Ko...
Crossref
 
Deeplearning and dev ops azure
Deeplearning and dev ops azureDeeplearning and dev ops azure
Deeplearning and dev ops azure
Vishwas N
 
Powering Custom Apps at Facebook using Spark Script Transformation
Powering Custom Apps at Facebook using Spark Script TransformationPowering Custom Apps at Facebook using Spark Script Transformation
Powering Custom Apps at Facebook using Spark Script Transformation
Databricks
 
SQL to NoSQL: Top 6 Questions
SQL to NoSQL: Top 6 QuestionsSQL to NoSQL: Top 6 Questions
SQL to NoSQL: Top 6 Questions
Mike Broberg
 
Using pandas library for data analysis in python
Using pandas library for data analysis in pythonUsing pandas library for data analysis in python
Using pandas library for data analysis in python
Bruce Jenks
 
Fast and Reliable Apache Spark SQL Releases
Fast and Reliable Apache Spark SQL ReleasesFast and Reliable Apache Spark SQL Releases
Fast and Reliable Apache Spark SQL Releases
DataWorks Summit
 
Deep Learning Automated Helpdesk
Deep Learning Automated HelpdeskDeep Learning Automated Helpdesk
Deep Learning Automated Helpdesk
Pranav Sharma
 
High Performance Machine Learning in R with H2O
High Performance Machine Learning in R with H2OHigh Performance Machine Learning in R with H2O
High Performance Machine Learning in R with H2O
Sri Ambati
 
Data-Analytics using python (Module 4).pptx
Data-Analytics using python (Module 4).pptxData-Analytics using python (Module 4).pptx
Data-Analytics using python (Module 4).pptx
DRSHk10
 
Data oriented design and c++
Data oriented design and c++Data oriented design and c++
Data oriented design and c++
Mike Acton
 
RAMSES: Robust Analytic Models for Science at Extreme Scales
RAMSES: Robust Analytic Models for Science at Extreme ScalesRAMSES: Robust Analytic Models for Science at Extreme Scales
RAMSES: Robust Analytic Models for Science at Extreme Scales
Ian Foster
 
P9 speed of-light faceted search via oracle in-memory option by alexander tok...
P9 speed of-light faceted search via oracle in-memory option by alexander tok...P9 speed of-light faceted search via oracle in-memory option by alexander tok...
P9 speed of-light faceted search via oracle in-memory option by alexander tok...
Alexander Tokarev
 
Microservices in Clojure
Microservices in ClojureMicroservices in Clojure
Microservices in Clojure
Lucas Cavalcanti dos Santos
 
Find Anything In Your APEX App - Fuzzy Search with Oracle Text
Find Anything In Your APEX App - Fuzzy Search with Oracle TextFind Anything In Your APEX App - Fuzzy Search with Oracle Text
Find Anything In Your APEX App - Fuzzy Search with Oracle Text
Carsten Czarski
 
Raptor user manual3.0
Raptor user manual3.0Raptor user manual3.0
Raptor user manual3.0
Elizabeth Reyna
 
COinS (eng version)
COinS (eng version)COinS (eng version)
COinS (eng version)
Milan Janíček
 
Introduction to H2O and Model Stacking Use Cases
Introduction to H2O and Model Stacking Use CasesIntroduction to H2O and Model Stacking Use Cases
Introduction to H2O and Model Stacking Use Cases
Jo-fai Chow
 
Rental Cars and Industrialized Learning to Rank with Sean Downes
Rental Cars and Industrialized Learning to Rank with Sean DownesRental Cars and Industrialized Learning to Rank with Sean Downes
Rental Cars and Industrialized Learning to Rank with Sean Downes
Databricks
 

Similar to Deeplearning for industries | Data to Production (20)

From Idea to Model: Productionizing Data Pipelines with Apache Airflow
From Idea to Model: Productionizing Data Pipelines with Apache AirflowFrom Idea to Model: Productionizing Data Pipelines with Apache Airflow
From Idea to Model: Productionizing Data Pipelines with Apache Airflow
 
Project
ProjectProject
Project
 
CrossRef How-to: A Technical Introduction to the Basics of CrossRef, Chuck Ko...
CrossRef How-to: A Technical Introduction to the Basics of CrossRef, Chuck Ko...CrossRef How-to: A Technical Introduction to the Basics of CrossRef, Chuck Ko...
CrossRef How-to: A Technical Introduction to the Basics of CrossRef, Chuck Ko...
 
Deeplearning and dev ops azure
Deeplearning and dev ops azureDeeplearning and dev ops azure
Deeplearning and dev ops azure
 
Powering Custom Apps at Facebook using Spark Script Transformation
Powering Custom Apps at Facebook using Spark Script TransformationPowering Custom Apps at Facebook using Spark Script Transformation
Powering Custom Apps at Facebook using Spark Script Transformation
 
SQL to NoSQL: Top 6 Questions
SQL to NoSQL: Top 6 QuestionsSQL to NoSQL: Top 6 Questions
SQL to NoSQL: Top 6 Questions
 
Using pandas library for data analysis in python
Using pandas library for data analysis in pythonUsing pandas library for data analysis in python
Using pandas library for data analysis in python
 
Fast and Reliable Apache Spark SQL Releases
Fast and Reliable Apache Spark SQL ReleasesFast and Reliable Apache Spark SQL Releases
Fast and Reliable Apache Spark SQL Releases
 
Deep Learning Automated Helpdesk
Deep Learning Automated HelpdeskDeep Learning Automated Helpdesk
Deep Learning Automated Helpdesk
 
High Performance Machine Learning in R with H2O
High Performance Machine Learning in R with H2OHigh Performance Machine Learning in R with H2O
High Performance Machine Learning in R with H2O
 
Data-Analytics using python (Module 4).pptx
Data-Analytics using python (Module 4).pptxData-Analytics using python (Module 4).pptx
Data-Analytics using python (Module 4).pptx
 
Data oriented design and c++
Data oriented design and c++Data oriented design and c++
Data oriented design and c++
 
RAMSES: Robust Analytic Models for Science at Extreme Scales
RAMSES: Robust Analytic Models for Science at Extreme ScalesRAMSES: Robust Analytic Models for Science at Extreme Scales
RAMSES: Robust Analytic Models for Science at Extreme Scales
 
P9 speed of-light faceted search via oracle in-memory option by alexander tok...
P9 speed of-light faceted search via oracle in-memory option by alexander tok...P9 speed of-light faceted search via oracle in-memory option by alexander tok...
P9 speed of-light faceted search via oracle in-memory option by alexander tok...
 
Microservices in Clojure
Microservices in ClojureMicroservices in Clojure
Microservices in Clojure
 
Find Anything In Your APEX App - Fuzzy Search with Oracle Text
Find Anything In Your APEX App - Fuzzy Search with Oracle TextFind Anything In Your APEX App - Fuzzy Search with Oracle Text
Find Anything In Your APEX App - Fuzzy Search with Oracle Text
 
Raptor user manual3.0
Raptor user manual3.0Raptor user manual3.0
Raptor user manual3.0
 
COinS (eng version)
COinS (eng version)COinS (eng version)
COinS (eng version)
 
Introduction to H2O and Model Stacking Use Cases
Introduction to H2O and Model Stacking Use CasesIntroduction to H2O and Model Stacking Use Cases
Introduction to H2O and Model Stacking Use Cases
 
Rental Cars and Industrialized Learning to Rank with Sean Downes
Rental Cars and Industrialized Learning to Rank with Sean DownesRental Cars and Industrialized Learning to Rank with Sean Downes
Rental Cars and Industrialized Learning to Rank with Sean Downes
 

Recently uploaded

Levelised Cost of Hydrogen (LCOH) Calculator Manual
Levelised Cost of Hydrogen  (LCOH) Calculator ManualLevelised Cost of Hydrogen  (LCOH) Calculator Manual
Levelised Cost of Hydrogen (LCOH) Calculator Manual
Massimo Talia
 
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
Paris Salesforce Developer Group
 
EV BMS WITH CHARGE MONITOR AND FIRE DETECTION.pptx
EV BMS WITH CHARGE MONITOR AND FIRE DETECTION.pptxEV BMS WITH CHARGE MONITOR AND FIRE DETECTION.pptx
EV BMS WITH CHARGE MONITOR AND FIRE DETECTION.pptx
nikshimanasa
 
Open Channel Flow: fluid flow with a free surface
Open Channel Flow: fluid flow with a free surfaceOpen Channel Flow: fluid flow with a free surface
Open Channel Flow: fluid flow with a free surface
Indrajeet sahu
 
AI in customer support Use cases solutions development and implementation.pdf
AI in customer support Use cases solutions development and implementation.pdfAI in customer support Use cases solutions development and implementation.pdf
AI in customer support Use cases solutions development and implementation.pdf
mahaffeycheryld
 
一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理
一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理
一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理
nedcocy
 
Height and depth gauge linear metrology.pdf
Height and depth gauge linear metrology.pdfHeight and depth gauge linear metrology.pdf
Height and depth gauge linear metrology.pdf
q30122000
 
Impartiality as per ISO /IEC 17025:2017 Standard
Impartiality as per ISO /IEC 17025:2017 StandardImpartiality as per ISO /IEC 17025:2017 Standard
Impartiality as per ISO /IEC 17025:2017 Standard
MuhammadJazib15
 
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
MadhavJungKarki
 
This study Examines the Effectiveness of Talent Procurement through the Imple...
This study Examines the Effectiveness of Talent Procurement through the Imple...This study Examines the Effectiveness of Talent Procurement through the Imple...
This study Examines the Effectiveness of Talent Procurement through the Imple...
DharmaBanothu
 
Digital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptxDigital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptx
aryanpankaj78
 
Blood finder application project report (1).pdf
Blood finder application project report (1).pdfBlood finder application project report (1).pdf
Blood finder application project report (1).pdf
Kamal Acharya
 
一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理
uqyfuc
 
Ericsson LTE Throughput Troubleshooting Techniques.ppt
Ericsson LTE Throughput Troubleshooting Techniques.pptEricsson LTE Throughput Troubleshooting Techniques.ppt
Ericsson LTE Throughput Troubleshooting Techniques.ppt
wafawafa52
 
Call Girls Chennai +91-8824825030 Vip Call Girls Chennai
Call Girls Chennai +91-8824825030 Vip Call Girls ChennaiCall Girls Chennai +91-8824825030 Vip Call Girls Chennai
Call Girls Chennai +91-8824825030 Vip Call Girls Chennai
paraasingh12 #V08
 
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
upoux
 
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
PriyankaKilaniya
 
SCALING OF MOS CIRCUITS m .pptx
SCALING OF MOS CIRCUITS m                 .pptxSCALING OF MOS CIRCUITS m                 .pptx
SCALING OF MOS CIRCUITS m .pptx
harshapolam10
 
Null Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAMNull Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAM
Divyanshu
 
Call For Paper -3rd International Conference on Artificial Intelligence Advan...
Call For Paper -3rd International Conference on Artificial Intelligence Advan...Call For Paper -3rd International Conference on Artificial Intelligence Advan...
Call For Paper -3rd International Conference on Artificial Intelligence Advan...
ijseajournal
 

Recently uploaded (20)

Levelised Cost of Hydrogen (LCOH) Calculator Manual
Levelised Cost of Hydrogen  (LCOH) Calculator ManualLevelised Cost of Hydrogen  (LCOH) Calculator Manual
Levelised Cost of Hydrogen (LCOH) Calculator Manual
 
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
AI + Data Community Tour - Build the Next Generation of Apps with the Einstei...
 
EV BMS WITH CHARGE MONITOR AND FIRE DETECTION.pptx
EV BMS WITH CHARGE MONITOR AND FIRE DETECTION.pptxEV BMS WITH CHARGE MONITOR AND FIRE DETECTION.pptx
EV BMS WITH CHARGE MONITOR AND FIRE DETECTION.pptx
 
Open Channel Flow: fluid flow with a free surface
Open Channel Flow: fluid flow with a free surfaceOpen Channel Flow: fluid flow with a free surface
Open Channel Flow: fluid flow with a free surface
 
AI in customer support Use cases solutions development and implementation.pdf
AI in customer support Use cases solutions development and implementation.pdfAI in customer support Use cases solutions development and implementation.pdf
AI in customer support Use cases solutions development and implementation.pdf
 
一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理
一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理
一比一原版(爱大毕业证书)爱荷华大学毕业证如何办理
 
Height and depth gauge linear metrology.pdf
Height and depth gauge linear metrology.pdfHeight and depth gauge linear metrology.pdf
Height and depth gauge linear metrology.pdf
 
Impartiality as per ISO /IEC 17025:2017 Standard
Impartiality as per ISO /IEC 17025:2017 StandardImpartiality as per ISO /IEC 17025:2017 Standard
Impartiality as per ISO /IEC 17025:2017 Standard
 
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
1FIDIC-CONSTRUCTION-CONTRACT-2ND-ED-2017-RED-BOOK.pdf
 
This study Examines the Effectiveness of Talent Procurement through the Imple...
This study Examines the Effectiveness of Talent Procurement through the Imple...This study Examines the Effectiveness of Talent Procurement through the Imple...
This study Examines the Effectiveness of Talent Procurement through the Imple...
 
Digital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptxDigital Twins Computer Networking Paper Presentation.pptx
Digital Twins Computer Networking Paper Presentation.pptx
 
Blood finder application project report (1).pdf
Blood finder application project report (1).pdfBlood finder application project report (1).pdf
Blood finder application project report (1).pdf
 
一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理
 
Ericsson LTE Throughput Troubleshooting Techniques.ppt
Ericsson LTE Throughput Troubleshooting Techniques.pptEricsson LTE Throughput Troubleshooting Techniques.ppt
Ericsson LTE Throughput Troubleshooting Techniques.ppt
 
Call Girls Chennai +91-8824825030 Vip Call Girls Chennai
Call Girls Chennai +91-8824825030 Vip Call Girls ChennaiCall Girls Chennai +91-8824825030 Vip Call Girls Chennai
Call Girls Chennai +91-8824825030 Vip Call Girls Chennai
 
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
一比一原版(uofo毕业证书)美国俄勒冈大学毕业证如何办理
 
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
Prediction of Electrical Energy Efficiency Using Information on Consumer's Ac...
 
SCALING OF MOS CIRCUITS m .pptx
SCALING OF MOS CIRCUITS m                 .pptxSCALING OF MOS CIRCUITS m                 .pptx
SCALING OF MOS CIRCUITS m .pptx
 
Null Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAMNull Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAM
 
Call For Paper -3rd International Conference on Artificial Intelligence Advan...
Call For Paper -3rd International Conference on Artificial Intelligence Advan...Call For Paper -3rd International Conference on Artificial Intelligence Advan...
Call For Paper -3rd International Conference on Artificial Intelligence Advan...
 

Deeplearning for industries | Data to Production

  • 1. Rahul Kumar AI Scientist | Deep Learning Practitioner | Author Deep Learning for Industries
  • 4. DE3p Larenn1g mhica3ns wrok smliair to hOw biarns wrok. Tehse mahcnies wrok by s33nig f22Uy pa773rns and cnonc3t1ng t3Hm t0 fU22y cnoc3tps. T3hy wRok l4y3r by ly43r, j5ut lK1e a f1L37r, t4k1NG cmopl3x scn33s aNd br3k41ng tH3m dwon itno s1pmLe iD34s. Only Data Scientist can read it :
  • 5. Deep Learning mechanism work similar to how brain work. These machines works by seeing funny patterns and connecting them to funny concepts. They work layer by layer, just like a filter, taking complex scenes and breaking them down into simple ideas. Only Data Scientist can read it :
  • 6. I want to be a good Data Scientist
  • 8. NNML DL QC …… Learning paradigm
  • 9. NNML DL QC …… 2 papers a week Follow NeurIPs, AAAI, CVPR, ICAPS, ICML Learning paradigm Listen to podcast Follow influencers on twitter etc.
  • 10. NNML DL QC …… Depth Learning paradigm
  • 11. NNML DL QC …… Depth Learning paradigm
  • 12. NNML DL QC …… Depth Learning paradigm
  • 13. Is that it … ? Let’s get practical …
  • 14. End to End Pipeline
  • 15. End to End Pipeline
  • 16. End to End Pipeline : INPUTS Data sources: • Cameras • CSV/ Excels • Big data • SQL databases • Mongo DB Key takeaway: • Learn to build connectors • Avoid I/O ops • Use binary like pickles
  • 17. End to End Pipeline
  • 18. End to End Pipeline : PRE-PROCESSING Why do I care ? Text classification Entity Extraction
  • 19. End to End Pipeline : PRE-PROCESSING Why do I care ?
  • 20. Data that you will getEnd to End Pipeline : PRE-PROCESSING But…
  • 21. Data that you will getEnd to End Pipeline : PRE-PROCESSING
  • 22. Data that you will get
  • 23. Data that you will get
  • 24. End to End Pipeline : PRE-PROCESSING Learn to visualization the data
  • 25. End to End Pipeline : PRE-PROCESSING Understand the important patterns Extracting Capital letter words Source: https://regexr.com
  • 26. End to End Pipeline : PRE-PROCESSING Extract Noise from the data Extracting emails Source: https://regexr.com
  • 27. End to End Pipeline : PRE-PROCESSING Understand the data distribution, outliers, skewed data Perform Data Argumentation Balanced dataset
  • 28. End to End Pipeline : PRE-PROCESSING Understand the data distribution, outliers, skewed data
  • 29. End to End Pipeline : PRE-PROCESSING 1. Better generalization of the model. 2. Better accuracy. 3. Boosts the training speed.
  • 30. End to End Pipeline
  • 31. End to End Pipeline : FEATURE CREATION Representing the word using various feature representations: •Morphological = [(prefix, over-), (root, fit), (suffix=imperfect tense, -ing)] •Unigrams = ['o', 'v', 'e', 'r', 'f', 'i', 't', 't', 'i', 'n', 'g'] •Bigrams = ['ov', 've', 'er', 'rf', 'fi', 'it', 'tt', 'ti', 'in', 'ng'] •Trigrams = ['ove', 'ver', 'erf', 'rfi', 'fit', 'itt', 'tti', 'tin', 'ing'] •One-hot = [0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0] •Word vector = [-0.26, 0.34, 0.48, -0.06, 0.16, 0.11, 0.13, -0.15, 0.47, -0.49, 0.07, -0.39, -0.13, -0.15, 0.06, 0.09] • ...
  • 32. End to End Pipeline : FEATURE CREATION
  • 33. End to End Pipeline
  • 34. End to End Pipeline : TRAIN MODEL Source: http://bit.ly/PlotNeurons Design very basic Architecture Think about the inputs Think about the outputs Added more complexity Train “Build your own baseline”
  • 35. End to End Pipeline : TRAIN MODEL
  • 36. End to End Pipeline : TRAIN MODEL
  • 37. End to End Pipeline : TRAIN MODEL Hyperparameter optimization
  • 38. End to End Pipeline : TRAIN MODEL Hyperparameter optimization Colab link: http://bit.ly/talosHPO
  • 39. End to End Pipeline
  • 40. End to End Pipeline : DEPLOY MODEL
  • 41. End to End Pipeline : DEPLOY MODEL •Freezing: That is, converting the variables stored in a checkpoint file of the SavedModel into constants stored directly in the model graph. ‘Shrinking model size (to have less memory and disk footprints), and improving prediction latency.’ Post training optimizations
  • 42. End to End Pipeline : DEPLOY MODEL •Quantisation: That is, converting any large float Const op into an eight-bit equivalent, followed by a float conversion op so that the result is usable by subsequent nodes. Post training optimizations ‘Shrinking model size (to have less memory and disk footprints), and improving prediction latency.’
  • 43. End to End Pipeline Must to have: • Micro-service architecture • Optimize each modules • Real-time monitoring • Complete RESTful mode
  • 44. Let’s Deploy Code : https://github.com/goodrahstar/tf-sentiment-docker
  • 45. Thank you @hellorahulk goodrahstar hellorahulk.com Book : https://www.amazon.com/dp/1788997093 Code : http://bit.ly/DeepLearningCode

Editor's Notes

  1. So good pre-processing leads to: 1. Better generalization of the model. 2. Better accuracy. 3. Boosts the training speed.