2. 陳縕儂
Yun-Nung (Vivian) Chen
國立台灣大學資訊工程學系 副教授
美國微軟研究院 博士後研究員
美國卡內基美隆大學 碩士 / 博士
國立台灣大學資訊工程學 學士 / 碩士
台灣傑出女科學家新秀獎
吳大猷先生紀念獎
傑出人才基金會年輕學者創新獎
Amazon AWS ML Research Awards
Google Faulty Research Awards
3. AI & ML
Artificial intelligence (AI) is intelligence—perceiving, synthesizing, and
inferring information—demonstrated by machines, as opposed to
intelligence displayed by animals and humans.
Machine learning (ML) is a field of inquiry devoted to understanding
and building methods that “learn”, that is, methods that leverage data to
improve performance on some set of tasks.
It is seen as a part of artificial intelligence.
4. • Speech Recognition
• Handwritten Recognition
• Weather forecast
Learning ≈ Looking for a Function
f
f
f
“2”
“你好”
“ Tuesday”
Sunday
5. Deep Neural Network
• Cascading the neurons to form a neural network
5
1
x
2
x
…
…
Layer 1
…
…
1
y
2
y
…
…
Layer 2
…
…
Layer L
…
…
…
…
…
Input Output
M
y
N
x
Each layer is a simple function in the production line
6. History of Deep Learning
• 1960s: Perceptron (single layer neural network)
• 1969: Perceptron has limitation
• 1980s: Multi-layer perceptron
• 1986: Backpropagation
• 1989: 1 hidden layer is “good enough”, why deep?
• 2006: RBM initialization (breakthrough)
• 2009: GPU
• 2010: breakthrough in Speech Recognition (Dahl et al., 2010)
• 2012: breakthrough in ImageNet (Krizhevsky et al. 2012)
• 2015: “superhuman” results in Image and Speech Recognition
• 2016: AlphaGo “superhuman” results in Go playing
• 2022: ChatGPT “human-level” results in diverse domains
6
Why does deep learning show breakthrough in applications after 2010?
13. 微軟小冰 vs. ChatGPT
• Extractive-based
o can only answer questions similar to seen ones
o answers may be more reliable
o consider short context in a conversation
• Generative-based
o can answer any unseen questions
o answers needs to be validated
o consider whole context in a conversation
14. Beginners vs. Professionals
• Extractive-based
o can only answer questions similar to seen ones
o easy to control
• Generative-based
o can answer any unseen questions based on the
known domain knowledge
o difficult to control
15. Learning Pyramid
● Bloom’s Taxonomy (2001)
CREATE
EVALUATE
ANALYZE
APPLY
UNDERSTAND
REMEMBER
Use information to create some thing new
Grasp meaning of instructional materials
Take apart the known and identify relationships
Use information in a new (but similar) situation
Examine information and make judgements
Recall specific facts
High-order
thinking
skills
Low-order
thinking
skills
16. GPT: Generative Pretrained Transformer
(Radford et al., 2018)
• Pre-trained on BooksCorpus (~7000 books; 5GB)
16
Transformer
Vivian goes to make tasty tea
goes to make tasty tea
Radford, Alec, Karthik Narasimhan, Tim Salimans, and Ilya Sutskever. "Improving language understanding by generative pre-training." (2018).
<EOS>
<BOS>
Vivian
Vivian goes to make tasty tea
文字接龍
18. 18
OpenAI GPT Paradigm
Pre-Trained Data
#Parameters
Model
5GB
0.117 B
GPT (Radford et al., 2018)
40GB
1.5 B
GPT-2 (Radford et al., 2019)
45TB
175 B
GPT-3 (Brown et al., 2020)
?
?
GPT-4 (OpenAI, 2023)
20. AI Objectives
Helpful 有用
help the user solve their task
Honest 誠實
shouldn’t fabricate information or
mislead users
Harmless 無傷害性
should not cause physical,
psychological, or social harm to
people or the environment
22. Honest
• Hallucinations: output that cannot be verified from the source content
o Pros: factual hallucination improves informativeness of the generated text
o Cons: unverifiable information increases the risk from a safety perspective
22
Source Inform (name=pickwick hotel, pricerange=moderate)
Output the hotel named pickwick hotel in san diego is in a moderate price range
24. Issues in Pre-Trained Models
1. Helpful – help the user solve their task
Unable to follow the user’s instructions
2. Honest – shouldn’t fabricate information or mislead the
user
Hallucinations
3. Harmless – should not cause physical, psychological,
or social harm to people or the environment
Not easy to detect/identify
24
Solution: RLHF (Reinforcement Learning from Human Feedback)
25. ChatGPT: Reinforcement Learning from Human Feedback
• Improving GPT via teacher’s feedback
generation update via reinforcement learning
Teacher-given scores
a conversation history
a model-generated output
: 我想了解台灣的知名女歌手。
: 台灣知名女歌手包含蔡依林、…
: 可以替我介紹㇐下蔡依林嗎?
當然沒問題!蔡依林曾獲x次金曲獎… Human Feedback
77. Responsible AI
Transparency and Accountability: transparency in how AI
systems make decisions and operate; accountable for the outcomes
o 《歐盟人工智慧法》將要求公司們,對於這些模型多加說明。如果公司想在歐盟銷售或使用 AI 產
品,但不遵守這些透明化的義務,可能就會面臨最高全球總營業額 6% 的罰款。
Ethical and Fair Use: ensuring fairness by reducing biases in AI
systems to avoid discrimination
Privacy and Safety: privacy protection and robustness against
attacks and harm
The EU wants to regulate your favorite AI tools | MIT Technology Review
80. Real or Fake?
• Google helps identify if the image is
generated by AI
1) Show where on the web it first
appeared and which sites it’s
appeared on
81. • Google helps identify if the image is
generated by AI
1) Show where on the web it first
appeared and which sites it’s
appeared on
2) Self-labeling in meta-data
Real or Fake?