SlideShare a Scribd company logo
1 of 25
Download to read offline
Machine Learning HW4
ML TAs
○ mlta-2023-spring@googlegroups.com
Outline
● Task Description
● Dataset
● Data Segmentation
● Model Architecture
● Baselines
● Report
● Guidelines
All links
● Kaggle Competition
● Sample Code on Colab
● Sample Code on Kaggle
● Gradescope
● Data
Speaker Identification
Task: Multiclass Classification
Predict speaker class from given speech.
Dataset -VoxCeleb2
● Training: 56666 processed audio features with labels.
● Testing: 4000 processed audio features (public & private) without labels.
● Label: 600 classes in total, each class represents a speaker.
VoxCeleb2: Link
Data Preprocessing
Ref. Prof. Hung-Yi Lee
[2020Spring DLHLP] Speech
Recognition
Data Format
● Data Directory
○ metadata.json
○ testdata.json
○ mapping.json
○ uttr-{audioID}.pt
● The information in metadata
○ "n_mels": The dimention of mel-spectrogram.
○ "speakers": A dictionary.
■ Key: speaker ids
■ Value: "feature_path" and "mel_len"
Data Segmentation During Training
Different length:
Data Segmentation During Training
Different length:
Segment during
training
Segment = 2
in training:
Segment = 128
Model Architecture
Encoder
Pooling
Layer
Predict
Layer
Input
(preprocessed)
Features
Input
Representation
Prediction
dim=600
Tranformer
Sample Code
● Baseline Methods
○ Simple: Run sample code & know how to use Transformer.
○ Medium: Adjust hyper-parameters of Transformer and prediction layer.
○ Strong: Construct Conformer, which is a variety of Transformer.
○ Boss: Implement Self-Attention Pooling & Additive Margin Softmax to further boost the
performance.
Requirements - Simple
● Build a self-attention network to classify speakers with sample code.
● Simple public baseline: 0.66025
● Estimate training time: 30~40 mins on Colab.
Requirements - Medium
● Modify the parameters of the transformer modules in the sample code.
● Medium public baseline: 0.81750
Estimate training time:
1~1.5 hour on Colab
Requirements – Strong
Estimate training time:
1~1.5 hour on Colab
● Construct Conformer, which is a variety of Transformer.
● Strong public baseline: 0.88500
Hints
● Conformer
Ref. Prof. Hung-Yi Lee
[2021Spring ML] Network
Compression
Hints
● Implement Self-Attention Pooling & Additive Margin Softmax to further
boost the performance
● Self-Attention Pooling.
Hints
● Additive Margin Softmax
Grading
● Evaluate Metric: @1 Accuracy
● Simple Baseline (Public / Private)
● Medium Baseline (Public / Private)
● Strong Baseline (Public / Private)
● Boss Baseline (Public / Private)
● Code Submission
● Gradescope
+0.5 pt / 0.5 pt
+0.5 pt / 0.5 pt
+0.5 pt / 0.5 pt
+0.5 pt / 0.5 pt
+2 pts
+4 pts
Submission Format
● "Id, Category" split by ',' in the first row.
● Followed by 8000 lines of "filename, speaker name" split by ','.
Code Submission
● Submit your code to NTU COOL (2 pts).
○ We can only see your last submission.
○ Do NOT submit the model or dataset.
○ If your codes are not reasonable, your final grade will be x 0.9
○ You should compress your code into a single zip file:
■ ex. b08902126_hw4.zip
<Student ID>_hw4.zip
Gradescope (4 pts)
1. Make a brief introduction about one of the variant of Transformer,
and use a image of the structure of the model to help explain (2 pts)
see also: ref
2. Briefly explain what’s the advantages of this variant under certain
situations. (2 pts)
Deadline
● Kaggle: 2023/04/07 23:59 (UTC+8)
● NTU COOL: 2023/04/07 23:59 (UTC+8)
● Gradescope: 2023/04/07 23:59 (UTC+8)
Regulation
● You should NOT plagiarize, if you use any other resource, you should cite it in
the reference. (*)
● You should NOT modify your prediction files manually.
● Do NOT share codes or prediction files with any living creatures.
● Do NOT use any approaches to submit your results more than 5 times a day.
● Do NOT search or use additional data or pre-trained models.
● Your final grade x 0.9 if you violate any of the above rules.
● Prof. Lee & TAs preserve the rights to change the rules & grades.
(*) Academic Ethics Guidelines for
Researchers by the Ministry of Science and
Technology (MOST)
If any questions, you can ask us via...
● NTU COOL (Recommended)
○ https://cool.ntu.edu.tw/courses/24108/discussion_topics/184435
● Email
○ mlta-2023-spring@googlegroups.com
○ The title should begin with “[hw4]”
● TA hour
○ Mon 20:00~21:00 (mandarin)
○ Fri 20:00~21:00 (english)
Appendix
● Colab 縮排問題
○ 工具 -> 設定

More Related Content

Similar to HW04.pdf

Bimbo Final Project Presentation
Bimbo Final Project PresentationBimbo Final Project Presentation
Bimbo Final Project Presentation
Can Köklü
 

Similar to HW04.pdf (20)

MN-3, MN-Core and HPL - SC21 Green500 BOF
MN-3, MN-Core and HPL - SC21 Green500 BOFMN-3, MN-Core and HPL - SC21 Green500 BOF
MN-3, MN-Core and HPL - SC21 Green500 BOF
 
The Pupil Has Become the Master: Teacher-Student Model-Based Word Embedding D...
The Pupil Has Become the Master: Teacher-Student Model-Based Word Embedding D...The Pupil Has Become the Master: Teacher-Student Model-Based Word Embedding D...
The Pupil Has Become the Master: Teacher-Student Model-Based Word Embedding D...
 
End to end MLworkflows
End to end MLworkflowsEnd to end MLworkflows
End to end MLworkflows
 
AlphaPy
AlphaPyAlphaPy
AlphaPy
 
AlphaPy: A Data Science Pipeline in Python
AlphaPy: A Data Science Pipeline in PythonAlphaPy: A Data Science Pipeline in Python
AlphaPy: A Data Science Pipeline in Python
 
Bimbo Final Project Presentation
Bimbo Final Project PresentationBimbo Final Project Presentation
Bimbo Final Project Presentation
 
Production-Ready BIG ML Workflows - from zero to hero
Production-Ready BIG ML Workflows - from zero to heroProduction-Ready BIG ML Workflows - from zero to hero
Production-Ready BIG ML Workflows - from zero to hero
 
Machine learning testing survey, landscapes and horizons, the Cliff Notes
Machine learning testing  survey, landscapes and horizons, the Cliff NotesMachine learning testing  survey, landscapes and horizons, the Cliff Notes
Machine learning testing survey, landscapes and horizons, the Cliff Notes
 
Iterative Methodology for Personalization Models Optimization
 Iterative Methodology for Personalization Models Optimization Iterative Methodology for Personalization Models Optimization
Iterative Methodology for Personalization Models Optimization
 
Essentials of Professional VLSI Digital Design Training
Essentials of Professional VLSI Digital Design TrainingEssentials of Professional VLSI Digital Design Training
Essentials of Professional VLSI Digital Design Training
 
Resume
ResumeResume
Resume
 
Production ready big ml workflows from zero to hero daniel marcous @ waze
Production ready big ml workflows from zero to hero daniel marcous @ wazeProduction ready big ml workflows from zero to hero daniel marcous @ waze
Production ready big ml workflows from zero to hero daniel marcous @ waze
 
Large Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLarge Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and Repair
 
Supervised embedding techniques in search ranking system
Supervised embedding techniques in search ranking systemSupervised embedding techniques in search ranking system
Supervised embedding techniques in search ranking system
 
Fugue: Unifying Spark and Non-Spark Ecosystems for Big Data Analytics
Fugue: Unifying Spark and Non-Spark Ecosystems for Big Data AnalyticsFugue: Unifying Spark and Non-Spark Ecosystems for Big Data Analytics
Fugue: Unifying Spark and Non-Spark Ecosystems for Big Data Analytics
 
Important Concepts for Machine Learning
Important Concepts for Machine LearningImportant Concepts for Machine Learning
Important Concepts for Machine Learning
 
Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016
Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016
Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016
 
MLConf 2016 SigOpt Talk by Scott Clark
MLConf 2016 SigOpt Talk by Scott ClarkMLConf 2016 SigOpt Talk by Scott Clark
MLConf 2016 SigOpt Talk by Scott Clark
 
AI hype or reality
AI  hype or realityAI  hype or reality
AI hype or reality
 
Universal job embedding in recommendation (public ver.)
Universal job embedding in recommendation (public ver.)Universal job embedding in recommendation (public ver.)
Universal job embedding in recommendation (public ver.)
 

Recently uploaded

Call Girls Bidadi Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Bidadi Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Bidadi Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Bidadi Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Vip Mumbai Call Girls Ghatkopar Call On 9920725232 With Body to body massage ...
Vip Mumbai Call Girls Ghatkopar Call On 9920725232 With Body to body massage ...Vip Mumbai Call Girls Ghatkopar Call On 9920725232 With Body to body massage ...
Vip Mumbai Call Girls Ghatkopar Call On 9920725232 With Body to body massage ...
amitlee9823
 
Call Girls Hoodi Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hoodi Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Hoodi Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hoodi Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Nandini Layout Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangal...
Nandini Layout Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangal...Nandini Layout Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangal...
Nandini Layout Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangal...
amitlee9823
 
Call Girls Jayanagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jayanagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jayanagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jayanagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 
Call Girls Bidadi ☎ 7737669865☎ Book Your One night Stand (Bangalore)
Call Girls Bidadi ☎ 7737669865☎ Book Your One night Stand (Bangalore)Call Girls Bidadi ☎ 7737669865☎ Book Your One night Stand (Bangalore)
Call Girls Bidadi ☎ 7737669865☎ Book Your One night Stand (Bangalore)
amitlee9823
 
Call Girls Hosur Road Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hosur Road Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hosur Road Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hosur Road Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
Call Girls Hosur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hosur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Hosur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hosur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 

Recently uploaded (20)

Call Girls Bidadi Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Bidadi Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Bidadi Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Bidadi Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Guide to a Winning Interview May 2024 for MCWN
Guide to a Winning Interview May 2024 for MCWNGuide to a Winning Interview May 2024 for MCWN
Guide to a Winning Interview May 2024 for MCWN
 
Hot Call Girls |Delhi |Janakpuri ☎ 9711199171 Book Your One night Stand
Hot Call Girls |Delhi |Janakpuri ☎ 9711199171 Book Your One night StandHot Call Girls |Delhi |Janakpuri ☎ 9711199171 Book Your One night Stand
Hot Call Girls |Delhi |Janakpuri ☎ 9711199171 Book Your One night Stand
 
CFO_SB_Career History_Multi Sector Experience
CFO_SB_Career History_Multi Sector ExperienceCFO_SB_Career History_Multi Sector Experience
CFO_SB_Career History_Multi Sector Experience
 
Call Girls Alandi Road Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Road Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Road Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Road Call Me 7737669865 Budget Friendly No Advance Booking
 
Resumes, Cover Letters, and Applying Online
Resumes, Cover Letters, and Applying OnlineResumes, Cover Letters, and Applying Online
Resumes, Cover Letters, and Applying Online
 
Vip Mumbai Call Girls Ghatkopar Call On 9920725232 With Body to body massage ...
Vip Mumbai Call Girls Ghatkopar Call On 9920725232 With Body to body massage ...Vip Mumbai Call Girls Ghatkopar Call On 9920725232 With Body to body massage ...
Vip Mumbai Call Girls Ghatkopar Call On 9920725232 With Body to body massage ...
 
Top Rated Pune Call Girls Warje ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated  Pune Call Girls Warje ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...Top Rated  Pune Call Girls Warje ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated Pune Call Girls Warje ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
 
Personal Brand Exploration - Fernando Negron
Personal Brand Exploration - Fernando NegronPersonal Brand Exploration - Fernando Negron
Personal Brand Exploration - Fernando Negron
 
Call Girls Hoodi Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hoodi Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Hoodi Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hoodi Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
WhatsApp 📞 8448380779 ✅Call Girls In Salarpur Sector 81 ( Noida)
WhatsApp 📞 8448380779 ✅Call Girls In Salarpur Sector 81 ( Noida)WhatsApp 📞 8448380779 ✅Call Girls In Salarpur Sector 81 ( Noida)
WhatsApp 📞 8448380779 ✅Call Girls In Salarpur Sector 81 ( Noida)
 
Presentation for the country presentation
Presentation for the country presentationPresentation for the country presentation
Presentation for the country presentation
 
Nandini Layout Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangal...
Nandini Layout Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangal...Nandini Layout Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangal...
Nandini Layout Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangal...
 
Call Girls Jayanagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jayanagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jayanagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jayanagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
Miletti Gabriela_Vision Plan for artist Jahzel.pdf
Miletti Gabriela_Vision Plan for artist Jahzel.pdfMiletti Gabriela_Vision Plan for artist Jahzel.pdf
Miletti Gabriela_Vision Plan for artist Jahzel.pdf
 
Call Girls Bidadi ☎ 7737669865☎ Book Your One night Stand (Bangalore)
Call Girls Bidadi ☎ 7737669865☎ Book Your One night Stand (Bangalore)Call Girls Bidadi ☎ 7737669865☎ Book Your One night Stand (Bangalore)
Call Girls Bidadi ☎ 7737669865☎ Book Your One night Stand (Bangalore)
 
Call Girls Hosur Road Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hosur Road Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hosur Road Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hosur Road Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Dubai Call Girls Kiki O525547819 Call Girls Dubai Koko
Dubai Call Girls Kiki O525547819 Call Girls Dubai KokoDubai Call Girls Kiki O525547819 Call Girls Dubai Koko
Dubai Call Girls Kiki O525547819 Call Girls Dubai Koko
 
TEST BANK For Evidence-Based Practice for Nurses Appraisal and Application of...
TEST BANK For Evidence-Based Practice for Nurses Appraisal and Application of...TEST BANK For Evidence-Based Practice for Nurses Appraisal and Application of...
TEST BANK For Evidence-Based Practice for Nurses Appraisal and Application of...
 
Call Girls Hosur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hosur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Hosur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hosur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 

HW04.pdf

  • 1. Machine Learning HW4 ML TAs ○ mlta-2023-spring@googlegroups.com
  • 2. Outline ● Task Description ● Dataset ● Data Segmentation ● Model Architecture ● Baselines ● Report ● Guidelines All links ● Kaggle Competition ● Sample Code on Colab ● Sample Code on Kaggle ● Gradescope ● Data
  • 3. Speaker Identification Task: Multiclass Classification Predict speaker class from given speech.
  • 4. Dataset -VoxCeleb2 ● Training: 56666 processed audio features with labels. ● Testing: 4000 processed audio features (public & private) without labels. ● Label: 600 classes in total, each class represents a speaker. VoxCeleb2: Link
  • 5. Data Preprocessing Ref. Prof. Hung-Yi Lee [2020Spring DLHLP] Speech Recognition
  • 6. Data Format ● Data Directory ○ metadata.json ○ testdata.json ○ mapping.json ○ uttr-{audioID}.pt ● The information in metadata ○ "n_mels": The dimention of mel-spectrogram. ○ "speakers": A dictionary. ■ Key: speaker ids ■ Value: "feature_path" and "mel_len"
  • 7. Data Segmentation During Training Different length:
  • 8. Data Segmentation During Training Different length: Segment during training Segment = 2 in training: Segment = 128
  • 11. Sample Code ● Baseline Methods ○ Simple: Run sample code & know how to use Transformer. ○ Medium: Adjust hyper-parameters of Transformer and prediction layer. ○ Strong: Construct Conformer, which is a variety of Transformer. ○ Boss: Implement Self-Attention Pooling & Additive Margin Softmax to further boost the performance.
  • 12. Requirements - Simple ● Build a self-attention network to classify speakers with sample code. ● Simple public baseline: 0.66025 ● Estimate training time: 30~40 mins on Colab.
  • 13. Requirements - Medium ● Modify the parameters of the transformer modules in the sample code. ● Medium public baseline: 0.81750 Estimate training time: 1~1.5 hour on Colab
  • 14. Requirements – Strong Estimate training time: 1~1.5 hour on Colab ● Construct Conformer, which is a variety of Transformer. ● Strong public baseline: 0.88500
  • 15. Hints ● Conformer Ref. Prof. Hung-Yi Lee [2021Spring ML] Network Compression
  • 16. Hints ● Implement Self-Attention Pooling & Additive Margin Softmax to further boost the performance ● Self-Attention Pooling.
  • 18. Grading ● Evaluate Metric: @1 Accuracy ● Simple Baseline (Public / Private) ● Medium Baseline (Public / Private) ● Strong Baseline (Public / Private) ● Boss Baseline (Public / Private) ● Code Submission ● Gradescope +0.5 pt / 0.5 pt +0.5 pt / 0.5 pt +0.5 pt / 0.5 pt +0.5 pt / 0.5 pt +2 pts +4 pts
  • 19. Submission Format ● "Id, Category" split by ',' in the first row. ● Followed by 8000 lines of "filename, speaker name" split by ','.
  • 20. Code Submission ● Submit your code to NTU COOL (2 pts). ○ We can only see your last submission. ○ Do NOT submit the model or dataset. ○ If your codes are not reasonable, your final grade will be x 0.9 ○ You should compress your code into a single zip file: ■ ex. b08902126_hw4.zip <Student ID>_hw4.zip
  • 21. Gradescope (4 pts) 1. Make a brief introduction about one of the variant of Transformer, and use a image of the structure of the model to help explain (2 pts) see also: ref 2. Briefly explain what’s the advantages of this variant under certain situations. (2 pts)
  • 22. Deadline ● Kaggle: 2023/04/07 23:59 (UTC+8) ● NTU COOL: 2023/04/07 23:59 (UTC+8) ● Gradescope: 2023/04/07 23:59 (UTC+8)
  • 23. Regulation ● You should NOT plagiarize, if you use any other resource, you should cite it in the reference. (*) ● You should NOT modify your prediction files manually. ● Do NOT share codes or prediction files with any living creatures. ● Do NOT use any approaches to submit your results more than 5 times a day. ● Do NOT search or use additional data or pre-trained models. ● Your final grade x 0.9 if you violate any of the above rules. ● Prof. Lee & TAs preserve the rights to change the rules & grades. (*) Academic Ethics Guidelines for Researchers by the Ministry of Science and Technology (MOST)
  • 24. If any questions, you can ask us via... ● NTU COOL (Recommended) ○ https://cool.ntu.edu.tw/courses/24108/discussion_topics/184435 ● Email ○ mlta-2023-spring@googlegroups.com ○ The title should begin with “[hw4]” ● TA hour ○ Mon 20:00~21:00 (mandarin) ○ Fri 20:00~21:00 (english)