SlideShare a Scribd company logo
1 of 36
Download to read offline
# SEMINAR
발표자:
Sparks of AGI, MLLMs,
Superintelligence
지동환
1
@ Korea Univ.
Sparks of Artificial General Intelligence: Early experiments with GPT-4
2
Sparks of AGI
@ Korea Univ.
The edge of change
3
출처
당신이 그래프 상에서 이 지점에 서 있는다면 어떤 느낌이겠는가?
@ Korea Univ.
The edge of change
4
출처
시간 차트 위에서는 그래프 오른편을 볼 수 없을 것이다. 그렇다면…
@ Korea Univ.
The edge of change
5
출처
평범하고 시시하다.
@ Korea Univ.
The edge of change
6
출처
하지만… 그 때가 다가오고 있을 수도 있다.
202X…?
@ Korea Univ.
Artificial General Intelligence
• Oracle. 모든 분야에서 정확하고 유용한 정보를 주는 시스템.
• 인간이 하는 모든 일을 할 수 있는 AI
• “narrow AI”와 반대되는 개념
7
자비스? (출처 : Wikipedia)
@ Korea Univ.
왜 호들갑이냐?
• “Sparks of AGI”
• Microsoft Research
8
arxiv
이 제목을 어떻게 참지?
@ Korea Univ.
TL;DR
• 기존 Metric 위주의 평가로는 General Intelligence를 평가하지 못한다.
• 다양한 일에서 General하고 잘 하는 Model을 만들었다.
• 우리 GPT-4가 짱이다. LLM의 새 지평을 열었다!
9
!!
@ Korea Univ.
How can we evaluate “general” intelligence?
• memorization?
• Classic metrics?
• Beyond simple metrics
• 소수의 무한성을 시로 표현해라
• TikZ로 Unicorn을 그려라 (non-multimodal GPT-4!)
10
@ Korea Univ.
소수의 무한성 증명 (시 ver.)
11
arxiv
다양한 지식과 Context를 이해하고, 융합할 수 있다.
@ Korea Univ.
Draw a unicorn in TikZ!
• non-multimodal version GPT-4
12
arxiv
@ Korea Univ.
Image generation beyond memorization
• 그냥 training code 베끼는거 아님?
• ㄴㄴ! 다양한 변형을 해도 잘 알아 듣는다. 진짜 이해함!
13
arxiv
@ Korea Univ.
Draw a unicorn in TikZ!
• training 과정 중 발전하는 GPT-4
14
arxiv
@ Korea Univ.
Directions and Conclusion
• intelligence, AI, AGI가 무엇인가?
• 더 general한 AI로 향하는 길
• 정확히 무엇이 일어나고 있는가?
15
@ Korea Univ.
Actually, I think…
• …Seriously?
• 너네 Model이 잘 하는건 알겠어.
• 근데 그게 왜 “Sparks of AGI”인건데?
• Paper?
• No, Tech report
• No… flyer…
16
@ Korea Univ.
Language Is Not All You Need: Aligning Perception with Language Models
17
Multimodal Large Language Models
@ Korea Univ.
Kosmos-1
• ‘지능의 기초가 되기 위해,
현실 세계에 대한 지식 획득과 이해의 관점에서,
multimodal perception은
필수적으로 AGI를 달성하는데 필요하다.’
• LLMs + Multimodal perception
18
arxiv
@ Korea Univ.
Kosmos-1
19
arxiv
@ Korea Univ.
선행 연구 : MetaLM
20
Language Models are General-Purpose Interfaces
@ Korea Univ.
Semi-Causal Language Model
21
Language Models are General-Purpose Interfaces
@ Korea Univ.
Input Representation
22
Image as a Foreign Language: BEIT Pretraining for All Vision and Vision-Language Tasks
@ Korea Univ.
Multiway Transformer
23
VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts
@ Korea Univ.
Large-Scale Transformer
• Magneto [arxiv]
• Layer Norm을 각각의 Sublayer에서
• 이론적으로 정립된 더 나은 초기화 방식
• xPos [arxiv]
• 더 나은 long-context modelling을 위한 relative position encoding
• 다는 못 읽어 봤습니다! 죄송합니다!! 🥹
• Detail은 건너뛰겠습니다!
24
@ Korea Univ.
Multimodal Large Language Model
25
arxiv
@ Korea Univ.
From LLMs to MLLMs (to AGI)
26
🥲
“Generality”
@ Korea Univ.
In MLV…
27
출처 : MLV Notion
@ Korea Univ.
The Singularity Is Near : When Humans Transcend Biology
28
Superintelligence
@ Korea Univ.
Linear estimation
29
출처
@ Korea Univ.
An Intelligence Explosion
30
출처 출처
@ Korea Univ.
Artificial Super Intelligence
31
출처
출처
@ Korea Univ.
Alchemy in the 21st Century
32
출처
출처
@ Korea Univ.
Governance of superintelligence
33
OpenAI
@ Korea Univ.
[홍보] ETRI 자율성장 인공지능 경진대회 FASHION-HOW
• Classification, Continual Learning, Zero-Shot Learning
• 7/28 접수 마감. (파티원 2/6명. 절찬리에 모집 중)
• 목표는 입상!
34
@ Korea Univ.
참고 자료
• Sam Altman (OpenAI CEO) Lex Fridman 인터뷰 [링크]
• MIT Seminar [링크]
• The AI Revolution: The Road to Superintelligence [1][2][번역]
• 슈퍼인텔리전스(닉 보스트롬) [링크]
• multimodal 관련 논문
• KOSMOS [1][2]
• BEiT-3 [링크]
• 제가 공부하는 Notion [링크]
35
@ Korea Univ.
감사합니다.
36

More Related Content

Featured

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Featured (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

AIKU_23-2_Seminar_AGI.pdf

  • 1. # SEMINAR 발표자: Sparks of AGI, MLLMs, Superintelligence 지동환 1
  • 2. @ Korea Univ. Sparks of Artificial General Intelligence: Early experiments with GPT-4 2 Sparks of AGI
  • 3. @ Korea Univ. The edge of change 3 출처 당신이 그래프 상에서 이 지점에 서 있는다면 어떤 느낌이겠는가?
  • 4. @ Korea Univ. The edge of change 4 출처 시간 차트 위에서는 그래프 오른편을 볼 수 없을 것이다. 그렇다면…
  • 5. @ Korea Univ. The edge of change 5 출처 평범하고 시시하다.
  • 6. @ Korea Univ. The edge of change 6 출처 하지만… 그 때가 다가오고 있을 수도 있다. 202X…?
  • 7. @ Korea Univ. Artificial General Intelligence • Oracle. 모든 분야에서 정확하고 유용한 정보를 주는 시스템. • 인간이 하는 모든 일을 할 수 있는 AI • “narrow AI”와 반대되는 개념 7 자비스? (출처 : Wikipedia)
  • 8. @ Korea Univ. 왜 호들갑이냐? • “Sparks of AGI” • Microsoft Research 8 arxiv 이 제목을 어떻게 참지?
  • 9. @ Korea Univ. TL;DR • 기존 Metric 위주의 평가로는 General Intelligence를 평가하지 못한다. • 다양한 일에서 General하고 잘 하는 Model을 만들었다. • 우리 GPT-4가 짱이다. LLM의 새 지평을 열었다! 9 !!
  • 10. @ Korea Univ. How can we evaluate “general” intelligence? • memorization? • Classic metrics? • Beyond simple metrics • 소수의 무한성을 시로 표현해라 • TikZ로 Unicorn을 그려라 (non-multimodal GPT-4!) 10
  • 11. @ Korea Univ. 소수의 무한성 증명 (시 ver.) 11 arxiv 다양한 지식과 Context를 이해하고, 융합할 수 있다.
  • 12. @ Korea Univ. Draw a unicorn in TikZ! • non-multimodal version GPT-4 12 arxiv
  • 13. @ Korea Univ. Image generation beyond memorization • 그냥 training code 베끼는거 아님? • ㄴㄴ! 다양한 변형을 해도 잘 알아 듣는다. 진짜 이해함! 13 arxiv
  • 14. @ Korea Univ. Draw a unicorn in TikZ! • training 과정 중 발전하는 GPT-4 14 arxiv
  • 15. @ Korea Univ. Directions and Conclusion • intelligence, AI, AGI가 무엇인가? • 더 general한 AI로 향하는 길 • 정확히 무엇이 일어나고 있는가? 15
  • 16. @ Korea Univ. Actually, I think… • …Seriously? • 너네 Model이 잘 하는건 알겠어. • 근데 그게 왜 “Sparks of AGI”인건데? • Paper? • No, Tech report • No… flyer… 16
  • 17. @ Korea Univ. Language Is Not All You Need: Aligning Perception with Language Models 17 Multimodal Large Language Models
  • 18. @ Korea Univ. Kosmos-1 • ‘지능의 기초가 되기 위해, 현실 세계에 대한 지식 획득과 이해의 관점에서, multimodal perception은 필수적으로 AGI를 달성하는데 필요하다.’ • LLMs + Multimodal perception 18 arxiv
  • 20. @ Korea Univ. 선행 연구 : MetaLM 20 Language Models are General-Purpose Interfaces
  • 21. @ Korea Univ. Semi-Causal Language Model 21 Language Models are General-Purpose Interfaces
  • 22. @ Korea Univ. Input Representation 22 Image as a Foreign Language: BEIT Pretraining for All Vision and Vision-Language Tasks
  • 23. @ Korea Univ. Multiway Transformer 23 VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts
  • 24. @ Korea Univ. Large-Scale Transformer • Magneto [arxiv] • Layer Norm을 각각의 Sublayer에서 • 이론적으로 정립된 더 나은 초기화 방식 • xPos [arxiv] • 더 나은 long-context modelling을 위한 relative position encoding • 다는 못 읽어 봤습니다! 죄송합니다!! 🥹 • Detail은 건너뛰겠습니다! 24
  • 25. @ Korea Univ. Multimodal Large Language Model 25 arxiv
  • 26. @ Korea Univ. From LLMs to MLLMs (to AGI) 26 🥲 “Generality”
  • 27. @ Korea Univ. In MLV… 27 출처 : MLV Notion
  • 28. @ Korea Univ. The Singularity Is Near : When Humans Transcend Biology 28 Superintelligence
  • 29. @ Korea Univ. Linear estimation 29 출처
  • 30. @ Korea Univ. An Intelligence Explosion 30 출처 출처
  • 31. @ Korea Univ. Artificial Super Intelligence 31 출처 출처
  • 32. @ Korea Univ. Alchemy in the 21st Century 32 출처 출처
  • 33. @ Korea Univ. Governance of superintelligence 33 OpenAI
  • 34. @ Korea Univ. [홍보] ETRI 자율성장 인공지능 경진대회 FASHION-HOW • Classification, Continual Learning, Zero-Shot Learning • 7/28 접수 마감. (파티원 2/6명. 절찬리에 모집 중) • 목표는 입상! 34
  • 35. @ Korea Univ. 참고 자료 • Sam Altman (OpenAI CEO) Lex Fridman 인터뷰 [링크] • MIT Seminar [링크] • The AI Revolution: The Road to Superintelligence [1][2][번역] • 슈퍼인텔리전스(닉 보스트롬) [링크] • multimodal 관련 논문 • KOSMOS [1][2] • BEiT-3 [링크] • 제가 공부하는 Notion [링크] 35