SlideShare a Scribd company logo
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
CS231n:
Deep Learning for Computer Vision
Lecture 1 - Overview
1
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Today’s agenda
● A brief history of computer vision
● CS231n overview
2
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Today’s agenda
● A brief history of computer vision
● CS231n overview
3
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Image Classification: A core task in Computer Vision
4
cat
This image by Nikita is
licensed under CC-BY 2.0
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
5
There are many visual recognition problems that
are related to image classification, such as
object detection, image captioning, image
segmentation, visual question answering, visual
instruction navigation, video understanding, etc.
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
6
Deep Learning for
Computer Vision
Hierarchical computing systems with many “layers”, that are very loosely
inspired by Neuroscience
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Neural Networks
x h
W1 s
W2
3072 100 10
cat
7
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
A class of Neural Networks that have become an
important tool for visual recognition
Convolutional Neural Networks
for Visual Recognition
8
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
9
Beyond Convolutional Neural Networks
Recurrent neural network Attention mechanism / Transformers
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Beyond Image Classification
Classification
Semantic
Segmentation
Object
Detection
Instance
Segmentation
CAT GRASS, CAT,
TREE, SKY
DOG, DOG, CAT DOG, DOG, CAT
No spatial extent Multiple Object
No objects, just pixels This image is CC0 public domain
10
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
11
Beyond 2D Images
Gkioxari et al., “Mesh R-CNN”, ICCV 2019
Choy et al., 3D-R2N2: Recurrent Reconstruction Neural Network (2016)
Simonyan and Zisserman, “Two-stream convolutional networks for action
recognition in videos”, NeurIPS 2014
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
12
Mandlekar and Xu et al., Learning to Generalize Across
Long-Horizon Tasks from Human Demonstrations (2020)
Xu et al., PointFusion: Deep Sensor Fusion for 3D Bounding Box Estimation (2018)
Beyond Vision
Wang et al., 6-PACK: Category-level 6D Pose Tracker with
Anchor-Based Keypoints (2020)
Gao et al., ObjectFolder 2.0: A Multisensory Object Dataset for Sim2Real Transfer (2022)
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
2018 Turing Award for deep learning
most prestigious technical award, is given for major contributions of lasting importance to computing.
13
This image is CC0 public domain
Yann LeCun
This image is CC0 public domain
This image is CC0 public domain
Jeffrey Hinton Yoshua Bengio
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
IEEE PAMI Longuet-Higgins Prize
Award recognizes ONE Computer Vision paper from ten years ago with significant impact on computer
vision research.
14
At CVPR 2019, it was awarded to the 2009 original ImageNet paper
That’s Fei-Fei
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
15
2020
2021
>8k submissions, 2,067 accepted papers
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
16
Logistics
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
17
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Lectures
- Tuesdays and Thursdays between 1:30pm to 3:00pm at NVIDIA Auditorium
- Slides will be posted on the course website shortly before each lecture
- All lectures will be recorded and uploaded to Canvas after the lecture under the
“Panopto Course Videos” Tab.
18
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
19
Course website
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Friday Discussion Sections
6 Discussion sections Fridays 1:30pm - 2:30pm over Zoom
Hands-on tutorials, with more practical details than the main lecture
Check canvas for the Zoom link of the discussion sessions!
This Friday: Python / numpy / Colab (Presenter: Manasi Sharma)
20
04/01 Python / Numpy Review Session
04/08 Backprop Review Session
04/15 Final Project Overview and Guidelines
04/22 PyTorch / TensorFlow Review Session
04/29 Detection software & RNNs
05/06 Midterm Review Session
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Ed
For questions about midterm, projects, logistics, etc, use Ed!
SCPD students: Use your @stanford.edu address to register for Ed; contact
scpd-customerservice@stanford.edu for help.
21
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
We'll be using Zoom to hold office hours and QueueStatus to setup queues
- please see Canvas or Ed for the QueueStatus link
- TAs will admit students to their Zoom meeting rooms for 1-1 conversations
when it’s your turn using QueueStatus.
- Office hours is listed on the course webpage!
Office Hours
22
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Optional textbook resources
- Deep Learning
- by Goodfellow, Bengio, and Courville
- Here is a free version
- Mathematics of deep learning
- Chapters 5, 6 7 are useful to understand vector calculus and continuous optimization
- Free online version
- Dive into deep learning
- An interactive deep learning book with code, math, and discussions, based on the NumPy
interface.
- Free online version
23
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Assignments
All assignments will be completed using Google Colab
Assignment 1: Will be out Friday, due 4/15 by 11:59pm
- K-Nearest Neighbor
- Linear classifiers: SVM, Softmax
- Two-layer neural network
- Image features
24
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Grading
25
All assignments, coding and written portions, will be submitted via Gradescope.
An auto-grading system:
- A consistent grading scheme
- Public tests:
- Students see results of public tests immediately
- Private tests
- Generalizations of the public tests to thoroughly test your implementation
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Grading
26
3 Assignments: 10% + 20% + 15% = 45%
In-Class Midterm Exam: 20%
Course Project: 35%
- Project Proposal: 1%
- Milestone: 2%
- Final Project Report: 29%
- Poster & Poster Session: 3%
Participation Extra Credit: up to 3%
Late policy
- 4 free late days – use up to 2 late days per assignment
- Afterwards, 25% off per day late
- No late days for project report
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
AWS
27
We will have AWS Cloud credits available for projects
- Not for HWs (only for final projects)
We will be distributing coupons to all enrolled students who need it
We will have a tutorial for walking through the AWS setup
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Course Website: http://cs231n.stanford.edu/
- Syllabus, lecture slides, links to assignment downloads, etc
Ed:
- Use this for most communication with course staff
- Ask questions about homework, grading, logistics, etc
- Use private questions only if your post will violate honor code if you release publicly.
Gradescope:
- For turning in homework and receiving grades
Canvas:
- For watching recorded lectures
- For watching recorded discussion sessions
Overview on communication
28
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Prerequisites
Proficiency in Python
- All class assignments will be in Python (and use numpy)
- Later in the class, you will be using Pytorch and TensorFlow
- A Python tutorial available on course website
College Calculus, Linear Algebra
No longer need CS229 (Machine Learning)
29
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Collaboration policy
We follow the Stanford Honor Code and the CS Department Honor Code – read
them!
● Rule 1: Don’t look at solutions or code that are not your own; everything you
submit should be your own work
● Rule 2: Don’t share your solution code with others; however discussing ideas
or general strategies is fine and encouraged
● Rule 3: Indicate in your submissions anyone you worked with
Turning in something late / incomplete is better than violating the honor code
30
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Learning objectives
Formalize computer vision applications into tasks
- Formalize inputs and outputs for vision-related problems
- Understand what data and computational requirements you need to train a model
Develop and train vision models
- Learn to code, debug, and train convolutional neural networks.
- Learn how to use software frameworks like PyTorch and TensorFlow
Gain an understanding of where the field is and where it is headed
- What new research has come out in the last 0-5 years?
- What are open research challenges?
- What ethical and societal considerations should we consider before deployment?
31
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Why should you take this class?
Become a vision researcher (an incomplete list of conferences)
- Get involved with vision research at Stanford: apply using this form.
- CVPR 2022 conference
- ICCV 2021 conference
Become a vision engineer in industry (an incomplete list of industry teams)
- Perception team at Google AI, Vision at Google Cloud
- Vision at Meta AI
- Vision at Amazon AWS
- Nvidia, Tesla, Apple, Salesforce, ……
General interest
32
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
33
CS231n: Deep Learning for Computer Vision
● Deep Learning Basics (Lecture 2 – 4)
● Perceiving and Understanding the Visual World (Lecture 5 – 12)
● Reconstructing and Interacting with the Visual World (Lecture 13 – 16)
● Human-Centered Artificial Intelligence (Lecture 17 – 18)
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Syllabus
34
Deep Learning Basics Convolutional Neural Networks
Data-driven learning
Linear classification & kNN
Loss functions
Optimization
Backpropagation
Multi-layer perceptrons
Neural Networks
Computer Vision Applications
Convolutions
PyTorch / TensorFlow
Activation functions
Batch normalization
Transfer learning
Data augmentation
Momentum / RMSProp / Adam
Architecture design
RNNs / Attention / Transformers
Image captioning
Object detection and segmentation
Style transfer
Video understanding
Generative models
Self-supervised learning
3D vision
Human-centered AI
Fairness & ethics
Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022
Next time: Image classification with Linear Classifiers
Linear classification
35
Plot created using Wolfram Cloud
k- nearest neighbor

More Related Content

Similar to lecture_1_2_ruohan.pdf

Experience with Online Teaching with Open Source MOOC Technology
Experience with Online Teaching with Open Source MOOC TechnologyExperience with Online Teaching with Open Source MOOC Technology
Experience with Online Teaching with Open Source MOOC Technology
Geoffrey Fox
 
DevOps Syllabus summer 2020
DevOps Syllabus summer 2020DevOps Syllabus summer 2020
DevOps Syllabus summer 2020
Len Bass
 
EDUC5101G Session 5 Presentation (March 8, 2016)
EDUC5101G Session 5 Presentation (March 8, 2016)EDUC5101G Session 5 Presentation (March 8, 2016)
EDUC5101G Session 5 Presentation (March 8, 2016)
Robert Power
 
Experience with Online Teaching with Open Source MOOC Technology
Experience with Online Teaching with Open Source MOOC TechnologyExperience with Online Teaching with Open Source MOOC Technology
Experience with Online Teaching with Open Source MOOC Technology
Geoffrey Fox
 
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNINGHANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
IRJET Journal
 
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNINGHANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
IRJET Journal
 
Syllabus 512 fall 10
Syllabus  512 fall 10Syllabus  512 fall 10
Syllabus 512 fall 10
loraevanouski
 
Introduction_image_processing_and_applications_.pptx
Introduction_image_processing_and_applications_.pptxIntroduction_image_processing_and_applications_.pptx
Introduction_image_processing_and_applications_.pptx
VINOTHRAJR1
 
discussion_3_project.pdf
discussion_3_project.pdfdiscussion_3_project.pdf
discussion_3_project.pdf
Kuan-Tsae Huang
 
An Introduction to Video Production for Digital Media (2012)
An Introduction to Video Production for Digital Media (2012)An Introduction to Video Production for Digital Media (2012)
An Introduction to Video Production for Digital Media (2012)
Eddie Vega
 
Online Learning Management System and Analytics using Deep Learning
Online Learning Management System and Analytics using Deep LearningOnline Learning Management System and Analytics using Deep Learning
Online Learning Management System and Analytics using Deep Learning
Dr. Amarjeet Singh
 
Smart Sessional with QR Attendance
Smart Sessional with QR AttendanceSmart Sessional with QR Attendance
Smart Sessional with QR Attendance
rashidalyasuog
 
Datt2500 week01
Datt2500 week01Datt2500 week01
Datt2500 week01
Joe Hambleton
 
IoT business and university partnership
IoT business and university partnershipIoT business and university partnership
IoT business and university partnership
APPAU_Ukraine
 
Survey on self supervised image segmentation
Survey on self supervised image segmentationSurvey on self supervised image segmentation
Survey on self supervised image segmentation
Arithmer Inc.
 
EdTech 2021: Tutors Time
EdTech 2021: Tutors TimeEdTech 2021: Tutors Time
EdTech 2021: Tutors Time
Peter Windle
 
EDUPUB Implementation Demo Showcase - Reference SW using Readium JS
EDUPUB Implementation Demo Showcase - Reference SW using Readium JSEDUPUB Implementation Demo Showcase - Reference SW using Readium JS
EDUPUB Implementation Demo Showcase - Reference SW using Readium JS
Open Cyber University of Korea
 
Preliminry report
 Preliminry report Preliminry report
Preliminry report
Jiten Ahuja
 
E content development and editing
E content development and  editingE content development and  editing
E content development and editing
Sankaranarayanan K B
 
Edu 741 tech outline
Edu 741 tech outlineEdu 741 tech outline
Edu 741 tech outline
caren_friedman1
 

Similar to lecture_1_2_ruohan.pdf (20)

Experience with Online Teaching with Open Source MOOC Technology
Experience with Online Teaching with Open Source MOOC TechnologyExperience with Online Teaching with Open Source MOOC Technology
Experience with Online Teaching with Open Source MOOC Technology
 
DevOps Syllabus summer 2020
DevOps Syllabus summer 2020DevOps Syllabus summer 2020
DevOps Syllabus summer 2020
 
EDUC5101G Session 5 Presentation (March 8, 2016)
EDUC5101G Session 5 Presentation (March 8, 2016)EDUC5101G Session 5 Presentation (March 8, 2016)
EDUC5101G Session 5 Presentation (March 8, 2016)
 
Experience with Online Teaching with Open Source MOOC Technology
Experience with Online Teaching with Open Source MOOC TechnologyExperience with Online Teaching with Open Source MOOC Technology
Experience with Online Teaching with Open Source MOOC Technology
 
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNINGHANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
 
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNINGHANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
HANDWRITTEN DIGIT RECOGNITION USING MACHINE LEARNING
 
Syllabus 512 fall 10
Syllabus  512 fall 10Syllabus  512 fall 10
Syllabus 512 fall 10
 
Introduction_image_processing_and_applications_.pptx
Introduction_image_processing_and_applications_.pptxIntroduction_image_processing_and_applications_.pptx
Introduction_image_processing_and_applications_.pptx
 
discussion_3_project.pdf
discussion_3_project.pdfdiscussion_3_project.pdf
discussion_3_project.pdf
 
An Introduction to Video Production for Digital Media (2012)
An Introduction to Video Production for Digital Media (2012)An Introduction to Video Production for Digital Media (2012)
An Introduction to Video Production for Digital Media (2012)
 
Online Learning Management System and Analytics using Deep Learning
Online Learning Management System and Analytics using Deep LearningOnline Learning Management System and Analytics using Deep Learning
Online Learning Management System and Analytics using Deep Learning
 
Smart Sessional with QR Attendance
Smart Sessional with QR AttendanceSmart Sessional with QR Attendance
Smart Sessional with QR Attendance
 
Datt2500 week01
Datt2500 week01Datt2500 week01
Datt2500 week01
 
IoT business and university partnership
IoT business and university partnershipIoT business and university partnership
IoT business and university partnership
 
Survey on self supervised image segmentation
Survey on self supervised image segmentationSurvey on self supervised image segmentation
Survey on self supervised image segmentation
 
EdTech 2021: Tutors Time
EdTech 2021: Tutors TimeEdTech 2021: Tutors Time
EdTech 2021: Tutors Time
 
EDUPUB Implementation Demo Showcase - Reference SW using Readium JS
EDUPUB Implementation Demo Showcase - Reference SW using Readium JSEDUPUB Implementation Demo Showcase - Reference SW using Readium JS
EDUPUB Implementation Demo Showcase - Reference SW using Readium JS
 
Preliminry report
 Preliminry report Preliminry report
Preliminry report
 
E content development and editing
E content development and  editingE content development and  editing
E content development and editing
 
Edu 741 tech outline
Edu 741 tech outlineEdu 741 tech outline
Edu 741 tech outline
 

More from Kuan-Tsae Huang

TWCC22_PPT_v3_KL.pdf quantum computer, quantum computer , quantum computer ,...
TWCC22_PPT_v3_KL.pdf  quantum computer, quantum computer , quantum computer ,...TWCC22_PPT_v3_KL.pdf  quantum computer, quantum computer , quantum computer ,...
TWCC22_PPT_v3_KL.pdf quantum computer, quantum computer , quantum computer ,...
Kuan-Tsae Huang
 
0305吳家樂lecture qva lecture , letcutre , my talk.pdf
0305吳家樂lecture qva lecture , letcutre , my talk.pdf0305吳家樂lecture qva lecture , letcutre , my talk.pdf
0305吳家樂lecture qva lecture , letcutre , my talk.pdf
Kuan-Tsae Huang
 
lecture_16_jiajun.pdf
lecture_16_jiajun.pdflecture_16_jiajun.pdf
lecture_16_jiajun.pdf
Kuan-Tsae Huang
 
MIT-Lec-11- RSA.pdf
MIT-Lec-11- RSA.pdfMIT-Lec-11- RSA.pdf
MIT-Lec-11- RSA.pdf
Kuan-Tsae Huang
 
PQS_7_ruohan.pdf
PQS_7_ruohan.pdfPQS_7_ruohan.pdf
PQS_7_ruohan.pdf
Kuan-Tsae Huang
 
lecture_6_jiajun.pdf
lecture_6_jiajun.pdflecture_6_jiajun.pdf
lecture_6_jiajun.pdf
Kuan-Tsae Huang
 
經濟部科研成果價值創造計畫-簡報格式111.05.30.pptx
經濟部科研成果價值創造計畫-簡報格式111.05.30.pptx經濟部科研成果價值創造計畫-簡報格式111.05.30.pptx
經濟部科研成果價值創造計畫-簡報格式111.05.30.pptx
Kuan-Tsae Huang
 
半導體實驗室.pptx
半導體實驗室.pptx半導體實驗室.pptx
半導體實驗室.pptx
Kuan-Tsae Huang
 
B 每產生1度氫電排放多少.pptx
B 每產生1度氫電排放多少.pptxB 每產生1度氫電排放多少.pptx
B 每產生1度氫電排放多少.pptx
Kuan-Tsae Huang
 
20230418 GPT Impact on Accounting & Finance -Reg.pdf
20230418 GPT Impact on Accounting & Finance -Reg.pdf20230418 GPT Impact on Accounting & Finance -Reg.pdf
20230418 GPT Impact on Accounting & Finance -Reg.pdf
Kuan-Tsae Huang
 
電動拖板車100208.pptx
電動拖板車100208.pptx電動拖板車100208.pptx
電動拖板車100208.pptx
Kuan-Tsae Huang
 
電動拖板車100121.pptx
電動拖板車100121.pptx電動拖板車100121.pptx
電動拖板車100121.pptx
Kuan-Tsae Huang
 
cs330_2021_lifelong_learning.pdf
cs330_2021_lifelong_learning.pdfcs330_2021_lifelong_learning.pdf
cs330_2021_lifelong_learning.pdf
Kuan-Tsae Huang
 
FIN330 Chapter 20.pptx
FIN330 Chapter 20.pptxFIN330 Chapter 20.pptx
FIN330 Chapter 20.pptx
Kuan-Tsae Huang
 
FIN330 Chapter 22.pptx
FIN330 Chapter 22.pptxFIN330 Chapter 22.pptx
FIN330 Chapter 22.pptx
Kuan-Tsae Huang
 
FIN330 Chapter 01.pptx
FIN330 Chapter 01.pptxFIN330 Chapter 01.pptx
FIN330 Chapter 01.pptx
Kuan-Tsae Huang
 
Thoughtworks Q1 2022 Investor Presentation.pdf
Thoughtworks Q1 2022 Investor Presentation.pdfThoughtworks Q1 2022 Investor Presentation.pdf
Thoughtworks Q1 2022 Investor Presentation.pdf
Kuan-Tsae Huang
 
20220828-How-do-you-build-an-effic.9658238.ppt.pptx
20220828-How-do-you-build-an-effic.9658238.ppt.pptx20220828-How-do-you-build-an-effic.9658238.ppt.pptx
20220828-How-do-you-build-an-effic.9658238.ppt.pptx
Kuan-Tsae Huang
 
Amine solvent development for carbon dioxide capture by yang du, 2016 (doctor...
Amine solvent development for carbon dioxide capture by yang du, 2016 (doctor...Amine solvent development for carbon dioxide capture by yang du, 2016 (doctor...
Amine solvent development for carbon dioxide capture by yang du, 2016 (doctor...
Kuan-Tsae Huang
 
A novel approach to carbon dioxide capture and storage by brett p. spigarelli...
A novel approach to carbon dioxide capture and storage by brett p. spigarelli...A novel approach to carbon dioxide capture and storage by brett p. spigarelli...
A novel approach to carbon dioxide capture and storage by brett p. spigarelli...
Kuan-Tsae Huang
 

More from Kuan-Tsae Huang (20)

TWCC22_PPT_v3_KL.pdf quantum computer, quantum computer , quantum computer ,...
TWCC22_PPT_v3_KL.pdf  quantum computer, quantum computer , quantum computer ,...TWCC22_PPT_v3_KL.pdf  quantum computer, quantum computer , quantum computer ,...
TWCC22_PPT_v3_KL.pdf quantum computer, quantum computer , quantum computer ,...
 
0305吳家樂lecture qva lecture , letcutre , my talk.pdf
0305吳家樂lecture qva lecture , letcutre , my talk.pdf0305吳家樂lecture qva lecture , letcutre , my talk.pdf
0305吳家樂lecture qva lecture , letcutre , my talk.pdf
 
lecture_16_jiajun.pdf
lecture_16_jiajun.pdflecture_16_jiajun.pdf
lecture_16_jiajun.pdf
 
MIT-Lec-11- RSA.pdf
MIT-Lec-11- RSA.pdfMIT-Lec-11- RSA.pdf
MIT-Lec-11- RSA.pdf
 
PQS_7_ruohan.pdf
PQS_7_ruohan.pdfPQS_7_ruohan.pdf
PQS_7_ruohan.pdf
 
lecture_6_jiajun.pdf
lecture_6_jiajun.pdflecture_6_jiajun.pdf
lecture_6_jiajun.pdf
 
經濟部科研成果價值創造計畫-簡報格式111.05.30.pptx
經濟部科研成果價值創造計畫-簡報格式111.05.30.pptx經濟部科研成果價值創造計畫-簡報格式111.05.30.pptx
經濟部科研成果價值創造計畫-簡報格式111.05.30.pptx
 
半導體實驗室.pptx
半導體實驗室.pptx半導體實驗室.pptx
半導體實驗室.pptx
 
B 每產生1度氫電排放多少.pptx
B 每產生1度氫電排放多少.pptxB 每產生1度氫電排放多少.pptx
B 每產生1度氫電排放多少.pptx
 
20230418 GPT Impact on Accounting & Finance -Reg.pdf
20230418 GPT Impact on Accounting & Finance -Reg.pdf20230418 GPT Impact on Accounting & Finance -Reg.pdf
20230418 GPT Impact on Accounting & Finance -Reg.pdf
 
電動拖板車100208.pptx
電動拖板車100208.pptx電動拖板車100208.pptx
電動拖板車100208.pptx
 
電動拖板車100121.pptx
電動拖板車100121.pptx電動拖板車100121.pptx
電動拖板車100121.pptx
 
cs330_2021_lifelong_learning.pdf
cs330_2021_lifelong_learning.pdfcs330_2021_lifelong_learning.pdf
cs330_2021_lifelong_learning.pdf
 
FIN330 Chapter 20.pptx
FIN330 Chapter 20.pptxFIN330 Chapter 20.pptx
FIN330 Chapter 20.pptx
 
FIN330 Chapter 22.pptx
FIN330 Chapter 22.pptxFIN330 Chapter 22.pptx
FIN330 Chapter 22.pptx
 
FIN330 Chapter 01.pptx
FIN330 Chapter 01.pptxFIN330 Chapter 01.pptx
FIN330 Chapter 01.pptx
 
Thoughtworks Q1 2022 Investor Presentation.pdf
Thoughtworks Q1 2022 Investor Presentation.pdfThoughtworks Q1 2022 Investor Presentation.pdf
Thoughtworks Q1 2022 Investor Presentation.pdf
 
20220828-How-do-you-build-an-effic.9658238.ppt.pptx
20220828-How-do-you-build-an-effic.9658238.ppt.pptx20220828-How-do-you-build-an-effic.9658238.ppt.pptx
20220828-How-do-you-build-an-effic.9658238.ppt.pptx
 
Amine solvent development for carbon dioxide capture by yang du, 2016 (doctor...
Amine solvent development for carbon dioxide capture by yang du, 2016 (doctor...Amine solvent development for carbon dioxide capture by yang du, 2016 (doctor...
Amine solvent development for carbon dioxide capture by yang du, 2016 (doctor...
 
A novel approach to carbon dioxide capture and storage by brett p. spigarelli...
A novel approach to carbon dioxide capture and storage by brett p. spigarelli...A novel approach to carbon dioxide capture and storage by brett p. spigarelli...
A novel approach to carbon dioxide capture and storage by brett p. spigarelli...
 

Recently uploaded

Digital Marketing Performance Marketing Sample .pdf
Digital Marketing Performance Marketing  Sample .pdfDigital Marketing Performance Marketing  Sample .pdf
Digital Marketing Performance Marketing Sample .pdf
Vineet
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
Vietnam Cotton & Spinning Association
 
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
mkkikqvo
 
社内勉強会資料_Hallucination of LLMs               .
社内勉強会資料_Hallucination of LLMs               .社内勉強会資料_Hallucination of LLMs               .
社内勉強会資料_Hallucination of LLMs               .
NABLAS株式会社
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
bmucuha
 
一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理
一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理
一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理
uevausa
 
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理 原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
tzu5xla
 
一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理
一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理
一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理
z6osjkqvd
 
Data Scientist Machine Learning Profiles .pdf
Data Scientist Machine Learning  Profiles .pdfData Scientist Machine Learning  Profiles .pdf
Data Scientist Machine Learning Profiles .pdf
Vineet
 
reading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdf
reading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdfreading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdf
reading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdf
perranet1
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
Vietnam Cotton & Spinning Association
 
How To Control IO Usage using Resource Manager
How To Control IO Usage using Resource ManagerHow To Control IO Usage using Resource Manager
How To Control IO Usage using Resource Manager
Alireza Kamrani
 
A gentle exploration of Retrieval Augmented Generation
A gentle exploration of Retrieval Augmented GenerationA gentle exploration of Retrieval Augmented Generation
A gentle exploration of Retrieval Augmented Generation
dataschool1
 
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Marlon Dumas
 
Overview IFM June 2024 Consumer Confidence INDEX Report.pdf
Overview IFM June 2024 Consumer Confidence INDEX Report.pdfOverview IFM June 2024 Consumer Confidence INDEX Report.pdf
Overview IFM June 2024 Consumer Confidence INDEX Report.pdf
nhutnguyen355078
 
一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理
一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理
一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理
osoyvvf
 
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
hyfjgavov
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
ElizabethGarrettChri
 
一比一原版雷丁大学毕业证(UoR毕业证书)学历如何办理
一比一原版雷丁大学毕业证(UoR毕业证书)学历如何办理一比一原版雷丁大学毕业证(UoR毕业证书)学历如何办理
一比一原版雷丁大学毕业证(UoR毕业证书)学历如何办理
mbawufebxi
 
Namma-Kalvi-11th-Physics-Study-Material-Unit-1-EM-221086.pdf
Namma-Kalvi-11th-Physics-Study-Material-Unit-1-EM-221086.pdfNamma-Kalvi-11th-Physics-Study-Material-Unit-1-EM-221086.pdf
Namma-Kalvi-11th-Physics-Study-Material-Unit-1-EM-221086.pdf
22ad0301
 

Recently uploaded (20)

Digital Marketing Performance Marketing Sample .pdf
Digital Marketing Performance Marketing  Sample .pdfDigital Marketing Performance Marketing  Sample .pdf
Digital Marketing Performance Marketing Sample .pdf
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
 
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
 
社内勉強会資料_Hallucination of LLMs               .
社内勉強会資料_Hallucination of LLMs               .社内勉強会資料_Hallucination of LLMs               .
社内勉強会資料_Hallucination of LLMs               .
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理
一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理
一比一原版加拿大渥太华大学毕业证(uottawa毕业证书)如何办理
 
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理 原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
 
一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理
一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理
一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理
 
Data Scientist Machine Learning Profiles .pdf
Data Scientist Machine Learning  Profiles .pdfData Scientist Machine Learning  Profiles .pdf
Data Scientist Machine Learning Profiles .pdf
 
reading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdf
reading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdfreading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdf
reading_sample_sap_press_operational_data_provisioning_with_sap_bw4hana (1).pdf
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
 
How To Control IO Usage using Resource Manager
How To Control IO Usage using Resource ManagerHow To Control IO Usage using Resource Manager
How To Control IO Usage using Resource Manager
 
A gentle exploration of Retrieval Augmented Generation
A gentle exploration of Retrieval Augmented GenerationA gentle exploration of Retrieval Augmented Generation
A gentle exploration of Retrieval Augmented Generation
 
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
 
Overview IFM June 2024 Consumer Confidence INDEX Report.pdf
Overview IFM June 2024 Consumer Confidence INDEX Report.pdfOverview IFM June 2024 Consumer Confidence INDEX Report.pdf
Overview IFM June 2024 Consumer Confidence INDEX Report.pdf
 
一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理
一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理
一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理
 
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
 
一比一原版雷丁大学毕业证(UoR毕业证书)学历如何办理
一比一原版雷丁大学毕业证(UoR毕业证书)学历如何办理一比一原版雷丁大学毕业证(UoR毕业证书)学历如何办理
一比一原版雷丁大学毕业证(UoR毕业证书)学历如何办理
 
Namma-Kalvi-11th-Physics-Study-Material-Unit-1-EM-221086.pdf
Namma-Kalvi-11th-Physics-Study-Material-Unit-1-EM-221086.pdfNamma-Kalvi-11th-Physics-Study-Material-Unit-1-EM-221086.pdf
Namma-Kalvi-11th-Physics-Study-Material-Unit-1-EM-221086.pdf
 

lecture_1_2_ruohan.pdf

  • 1. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 CS231n: Deep Learning for Computer Vision Lecture 1 - Overview 1
  • 2. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Today’s agenda ● A brief history of computer vision ● CS231n overview 2
  • 3. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Today’s agenda ● A brief history of computer vision ● CS231n overview 3
  • 4. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Image Classification: A core task in Computer Vision 4 cat This image by Nikita is licensed under CC-BY 2.0
  • 5. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 5 There are many visual recognition problems that are related to image classification, such as object detection, image captioning, image segmentation, visual question answering, visual instruction navigation, video understanding, etc.
  • 6. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 6 Deep Learning for Computer Vision Hierarchical computing systems with many “layers”, that are very loosely inspired by Neuroscience
  • 7. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Neural Networks x h W1 s W2 3072 100 10 cat 7
  • 8. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 A class of Neural Networks that have become an important tool for visual recognition Convolutional Neural Networks for Visual Recognition 8
  • 9. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 9 Beyond Convolutional Neural Networks Recurrent neural network Attention mechanism / Transformers
  • 10. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Beyond Image Classification Classification Semantic Segmentation Object Detection Instance Segmentation CAT GRASS, CAT, TREE, SKY DOG, DOG, CAT DOG, DOG, CAT No spatial extent Multiple Object No objects, just pixels This image is CC0 public domain 10
  • 11. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 11 Beyond 2D Images Gkioxari et al., “Mesh R-CNN”, ICCV 2019 Choy et al., 3D-R2N2: Recurrent Reconstruction Neural Network (2016) Simonyan and Zisserman, “Two-stream convolutional networks for action recognition in videos”, NeurIPS 2014
  • 12. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 12 Mandlekar and Xu et al., Learning to Generalize Across Long-Horizon Tasks from Human Demonstrations (2020) Xu et al., PointFusion: Deep Sensor Fusion for 3D Bounding Box Estimation (2018) Beyond Vision Wang et al., 6-PACK: Category-level 6D Pose Tracker with Anchor-Based Keypoints (2020) Gao et al., ObjectFolder 2.0: A Multisensory Object Dataset for Sim2Real Transfer (2022)
  • 13. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 2018 Turing Award for deep learning most prestigious technical award, is given for major contributions of lasting importance to computing. 13 This image is CC0 public domain Yann LeCun This image is CC0 public domain This image is CC0 public domain Jeffrey Hinton Yoshua Bengio
  • 14. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 IEEE PAMI Longuet-Higgins Prize Award recognizes ONE Computer Vision paper from ten years ago with significant impact on computer vision research. 14 At CVPR 2019, it was awarded to the 2009 original ImageNet paper That’s Fei-Fei
  • 15. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 15 2020 2021 >8k submissions, 2,067 accepted papers
  • 16. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 16 Logistics
  • 17. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 17
  • 18. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Lectures - Tuesdays and Thursdays between 1:30pm to 3:00pm at NVIDIA Auditorium - Slides will be posted on the course website shortly before each lecture - All lectures will be recorded and uploaded to Canvas after the lecture under the “Panopto Course Videos” Tab. 18
  • 19. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 19 Course website
  • 20. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Friday Discussion Sections 6 Discussion sections Fridays 1:30pm - 2:30pm over Zoom Hands-on tutorials, with more practical details than the main lecture Check canvas for the Zoom link of the discussion sessions! This Friday: Python / numpy / Colab (Presenter: Manasi Sharma) 20 04/01 Python / Numpy Review Session 04/08 Backprop Review Session 04/15 Final Project Overview and Guidelines 04/22 PyTorch / TensorFlow Review Session 04/29 Detection software & RNNs 05/06 Midterm Review Session
  • 21. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Ed For questions about midterm, projects, logistics, etc, use Ed! SCPD students: Use your @stanford.edu address to register for Ed; contact scpd-customerservice@stanford.edu for help. 21
  • 22. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 We'll be using Zoom to hold office hours and QueueStatus to setup queues - please see Canvas or Ed for the QueueStatus link - TAs will admit students to their Zoom meeting rooms for 1-1 conversations when it’s your turn using QueueStatus. - Office hours is listed on the course webpage! Office Hours 22
  • 23. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Optional textbook resources - Deep Learning - by Goodfellow, Bengio, and Courville - Here is a free version - Mathematics of deep learning - Chapters 5, 6 7 are useful to understand vector calculus and continuous optimization - Free online version - Dive into deep learning - An interactive deep learning book with code, math, and discussions, based on the NumPy interface. - Free online version 23
  • 24. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Assignments All assignments will be completed using Google Colab Assignment 1: Will be out Friday, due 4/15 by 11:59pm - K-Nearest Neighbor - Linear classifiers: SVM, Softmax - Two-layer neural network - Image features 24
  • 25. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Grading 25 All assignments, coding and written portions, will be submitted via Gradescope. An auto-grading system: - A consistent grading scheme - Public tests: - Students see results of public tests immediately - Private tests - Generalizations of the public tests to thoroughly test your implementation
  • 26. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Grading 26 3 Assignments: 10% + 20% + 15% = 45% In-Class Midterm Exam: 20% Course Project: 35% - Project Proposal: 1% - Milestone: 2% - Final Project Report: 29% - Poster & Poster Session: 3% Participation Extra Credit: up to 3% Late policy - 4 free late days – use up to 2 late days per assignment - Afterwards, 25% off per day late - No late days for project report
  • 27. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 AWS 27 We will have AWS Cloud credits available for projects - Not for HWs (only for final projects) We will be distributing coupons to all enrolled students who need it We will have a tutorial for walking through the AWS setup
  • 28. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Course Website: http://cs231n.stanford.edu/ - Syllabus, lecture slides, links to assignment downloads, etc Ed: - Use this for most communication with course staff - Ask questions about homework, grading, logistics, etc - Use private questions only if your post will violate honor code if you release publicly. Gradescope: - For turning in homework and receiving grades Canvas: - For watching recorded lectures - For watching recorded discussion sessions Overview on communication 28
  • 29. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Prerequisites Proficiency in Python - All class assignments will be in Python (and use numpy) - Later in the class, you will be using Pytorch and TensorFlow - A Python tutorial available on course website College Calculus, Linear Algebra No longer need CS229 (Machine Learning) 29
  • 30. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Collaboration policy We follow the Stanford Honor Code and the CS Department Honor Code – read them! ● Rule 1: Don’t look at solutions or code that are not your own; everything you submit should be your own work ● Rule 2: Don’t share your solution code with others; however discussing ideas or general strategies is fine and encouraged ● Rule 3: Indicate in your submissions anyone you worked with Turning in something late / incomplete is better than violating the honor code 30
  • 31. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Learning objectives Formalize computer vision applications into tasks - Formalize inputs and outputs for vision-related problems - Understand what data and computational requirements you need to train a model Develop and train vision models - Learn to code, debug, and train convolutional neural networks. - Learn how to use software frameworks like PyTorch and TensorFlow Gain an understanding of where the field is and where it is headed - What new research has come out in the last 0-5 years? - What are open research challenges? - What ethical and societal considerations should we consider before deployment? 31
  • 32. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Why should you take this class? Become a vision researcher (an incomplete list of conferences) - Get involved with vision research at Stanford: apply using this form. - CVPR 2022 conference - ICCV 2021 conference Become a vision engineer in industry (an incomplete list of industry teams) - Perception team at Google AI, Vision at Google Cloud - Vision at Meta AI - Vision at Amazon AWS - Nvidia, Tesla, Apple, Salesforce, …… General interest 32
  • 33. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 33 CS231n: Deep Learning for Computer Vision ● Deep Learning Basics (Lecture 2 – 4) ● Perceiving and Understanding the Visual World (Lecture 5 – 12) ● Reconstructing and Interacting with the Visual World (Lecture 13 – 16) ● Human-Centered Artificial Intelligence (Lecture 17 – 18)
  • 34. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Syllabus 34 Deep Learning Basics Convolutional Neural Networks Data-driven learning Linear classification & kNN Loss functions Optimization Backpropagation Multi-layer perceptrons Neural Networks Computer Vision Applications Convolutions PyTorch / TensorFlow Activation functions Batch normalization Transfer learning Data augmentation Momentum / RMSProp / Adam Architecture design RNNs / Attention / Transformers Image captioning Object detection and segmentation Style transfer Video understanding Generative models Self-supervised learning 3D vision Human-centered AI Fairness & ethics
  • 35. Fei-Fei Li, Jiajun Wu, Ruohan Gao Lecture 1 - March 29, 2022 Next time: Image classification with Linear Classifiers Linear classification 35 Plot created using Wolfram Cloud k- nearest neighbor