Internship: Final Presentation
Neal Gopani
Introduction
Confidential Property of App Orchid Inc Copyright : 2017 - 2018
Background
● Sophomore at Rutgers University
● Pursuing a Bachelor of Science in Computer
Science and Mathematics
● Very interested in Machine Learning.
My Task
● Implemented an ensemble of BERTs that
were trained on the SQuAD 2.0 Dataset.
● Went from knowing very little about
Machine Learning to learning PyTorch
and experimenting with a recent NLP
architecture.
SQuAD 2.0 Dataset
Question answering is one of
the most actively researched
downstream NLP tasks today.
The SQuAD 2.0 Dataset is a
commonly used benchmark
dataset for this task.
Example From the SQuAD 2.0 Dataset
Rajpurkar et al., 2018
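To make the dataset layout concrete, here is a minimal sketch of how a SQuAD 2.0 style record can be read. The record below is hand-written for illustration and is not an actual dataset entry; the helper name iter_questions is hypothetical. The one thing that distinguishes version 2.0 is the is_impossible flag, which marks questions that the paragraph cannot answer.

```python
import json

# A hand-written record in the SQuAD 2.0 JSON layout (illustrative content,
# not an actual entry from the dataset).
sample = json.loads("""
{
  "version": "v2.0",
  "data": [{
    "title": "Cat",
    "paragraphs": [{
      "context": "The cat is a small domesticated carnivorous mammal.",
      "qas": [
        {"id": "q1", "question": "What kind of mammal is the cat?",
         "is_impossible": false,
         "answers": [{"text": "carnivorous", "answer_start": 32}]},
        {"id": "q2", "question": "How long do cats live?",
         "is_impossible": true,
         "answers": []}
      ]
    }]
  }]
}
""")


def iter_questions(dataset):
    """Yield (id, question, answer span or None) for every question."""
    for article in dataset["data"]:
        for paragraph in article["paragraphs"]:
            context = paragraph["context"]
            for qa in paragraph["qas"]:
                if qa.get("is_impossible", False):
                    # Unanswerable question: the model should abstain.
                    yield qa["id"], qa["question"], None
                else:
                    answer = qa["answers"][0]
                    start = answer["answer_start"]
                    # Answers are character spans inside the context.
                    span = context[start:start + len(answer["text"])]
                    yield qa["id"], qa["question"], span


rows = list(iter_questions(sample))
for qid, question, span in rows:
    print(qid, question, span)
```

A model for this task therefore has to do two things: locate a span when one exists, and decline to answer when it does not.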
Transformer Architecture
http://jalammar.github.io/illustrated-transformer/
BERT
http://jalammar.github.io/illustrated-bert/
My Task
Task: Train an ensemble of BERTs on the SQuAD
2.0 dataset.
The BERT and BiDAF code was provided by
HuggingFace and a GitHub repository, respectively.
Zhou et al., 2019
How Linguistic Knowledge Post Processing Works
Zhou et al., 2019
How the Ensemble Works
The ensemble algorithm takes as input the
prediction files produced by each of the
BERT models and by BiDAF.
Example question-answers pair from model0’s n-best predictions
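The slides do not state how the prediction files are combined, so the following is only a sketch of one common approach: sum each candidate answer's probability across models and keep the highest-scoring text. It assumes each n-best file maps a question id to a list of candidates with "text" and "probability" fields, and that an empty string stands for "no answer". The function name ensemble_nbest and the optional weights are hypothetical.

```python
from collections import defaultdict


def ensemble_nbest(nbest_per_model, weights=None):
    """Combine n-best predictions by summing weighted probabilities.

    nbest_per_model: one dict per model, mapping question id to a list of
    candidates shaped like {"text": str, "probability": float}.
    This voting rule is an assumption, not the method used in the slides.
    """
    if weights is None:
        weights = [1.0] * len(nbest_per_model)

    # scores[question_id][answer_text] accumulates the votes.
    scores = defaultdict(lambda: defaultdict(float))
    for weight, nbest in zip(weights, nbest_per_model):
        for qid, candidates in nbest.items():
            for cand in candidates:
                scores[qid][cand["text"]] += weight * cand["probability"]

    # Keep the answer text with the largest accumulated score.
    return {qid: max(by_text, key=by_text.get)
            for qid, by_text in scores.items()}


# Made-up n-best lists for a single question from three models.
model0 = {"q1": [{"text": "carnivorous", "probability": 0.6},
                 {"text": "", "probability": 0.4}]}
model1 = {"q1": [{"text": "", "probability": 0.55},
                 {"text": "carnivorous", "probability": 0.45}]}
model2 = {"q1": [{"text": "carnivorous", "probability": 0.7},
                 {"text": "small", "probability": 0.3}]}

combined = ensemble_nbest([model0, model1, model2])
print(combined)
```

Here model1 on its own would abstain, but the other two models outvote it. Agreement between models is what lets an ensemble correct individual mistakes.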
My Results
Trial  model         batch_size  max_seq_length  learning_rate  epochs  dev_f1  dev_em
1      base_uncased  4           384             3.00e-5        3       74.318  70.589
2      base_uncased  8           384             3.00e-5        3       76.016  72.400
3      base_uncased  8           384             2.00e-5        3       75.646  72.155
4      base_uncased  8           384             5.00e-5        3       75.781  72.256
5      base_uncased  6           384             3.00e-5        2       76.704  73.259
6      base_uncased  6           384             5.00e-5        4       72.957  69.266
7      base_cased    8           384             3.00e-5        3       74.946  71.111
Ens1   1+2+3+4+5+6+7                                                    78.208  75.213
Ens2   Ens1 + BiDAF                                                     78.566  75.620
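For readers unfamiliar with the two metric columns, the sketch below shows how exact match (EM) and token-level F1 are typically computed for a single prediction in SQuAD-style evaluation. It follows the usual normalization (lowercase, strip punctuation and articles, collapse whitespace). It is a simplified illustration rather than the evaluation script used for the table, and it assumes the dev_f1 and dev_em values are averages over the dev set expressed as percentages.

```python
import re
import string
from collections import Counter

PUNCTUATION = set(string.punctuation)


def normalize_answer(text):
    """Lowercase, drop punctuation and articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in PUNCTUATION)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, truth):
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(truth))


def f1_score(prediction, truth):
    """Harmonic mean of token precision and recall."""
    pred_tokens = normalize_answer(prediction).split()
    true_tokens = normalize_answer(truth).split()
    if not pred_tokens or not true_tokens:
        # For unanswerable questions both sides must be empty to score.
        return float(pred_tokens == true_tokens)
    common = Counter(pred_tokens) & Counter(true_tokens)
    overlap = sum(common.values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(true_tokens)
    return 2 * precision * recall / (precision + recall)


print(exact_match("The Cat.", "cat"))
print(f1_score("a small cat", "small domesticated cat"))
```

F1 gives partial credit for overlapping tokens while EM does not, which is consistent with every dev_f1 value in the table being higher than the corresponding dev_em value.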
Demo
https://en.wikipedia.org/wiki/Cat
Issues I encountered
● I had only about a week and a half to set up the right environment, train the models,
and create the ensemble.
○ I spent about 1.5 months understanding the code.
○ With more time, I could have run more experiments with different hyperparameters and
augmented data.
● GPU constraints: because these models are massive, I ran into a couple of
problems.
○ Ran out of memory when training the BERT large models.
Plans for the Future
● I am now even more motivated to pursue a
career in Machine Learning.
● I plan to learn more about NLP and fill in
the gaps of what I learned these past 2
months.
● I plan to gain experience by pursuing a
research position at my university.
● I plan to create a couple of projects
using ML and reach a level where I can
implement research papers.
● I plan to learn Computer Vision.
Google Deep Dream
