SlideShare a Scribd company logo
Transfer Learning
์‹ค๋ฌด์—์„œ ํ™œ์šฉํ•˜๊ธฐ
Produced By Tae Young Lee
์ถœ์ฒ˜ : https://greenlogic.com.au/blog/what-is-transfer-learning/
์ธ์ƒ์—์„œ ๋ฌธ์ œ๊ฐ€ ์ƒ๊ฒผ์„ ๋•Œ ๊ณ ๋ ค์‚ฌํ•ญ
๋‚ด๊ฐ€ ๋ฌด์—‡์„ ๊ฐ€์ง€๊ณ  ์žˆ๋Š๋ƒ? ( Resource )
๋‚ด๊ฐ€ ๊ฐ€์ง€๊ณ  ์žˆ๋Š” ๊ฒƒ์ด ๋ฌด์–ด๋ƒ?
Time
Money
Deep Learning Modeling
๊ด€์ ์—์„œ ์žฌํ•ด์„ํ•ด ๋ณด์ž!
Model Training์˜ ๋ฌธ์ œ
Data ํ™•๋ณด์˜ ๋ฌธ์ œ
Training ๊ฐ€๋Šฅํ•œ ๋ฐ์ดํ„ฐ๋Š” ๋งŽ์ง€๊ฐ€ ์•Š๋‹ค.
์—…๋ฌด ์‹œ์Šคํ…œ ์ค‘์‹ฌ์œผ๋กœ ๋ฐ์ดํ„ฐ๊ฐ€ ๋ณด๊ด€๋˜์–ด ์žˆ๋‹ค.
์—…๋ฌด ์‹œ์Šคํ…œ์— ๋ถ€ํ•˜๋ฅผ ์ฃผ์ง€ ์•Š๋Š” ๋ฐ์ดํ„ฐ ํ™•๋ณด ๋ฐฉ์•ˆ ์ˆ˜๋ฆฝ
Data Transform์ด ํ•„์š”ํ•˜๊ณ  Augmentation์ด ํ•„์ˆ˜์ 
๋ฌด์—‡์„ ๋ชจ๋ธ๋งํ•  ๊ฒƒ์ธ์ง€๋ฅผ ์„ ํ–‰ํ•ด์„œ ํ™•์ธ ํ•„์š”
๋งŽ์€ ํ•™์Šต ์‹œ๊ฐ„ ์†Œ์š”
์ œ์•ฝ์ ์ธ Infra Resource
Training ์‹œ ๋ฐ์ดํ„ฐ์— ๋Œ€ํ•œ ๊ณ ๋ ค ํ•„์š”
์ดˆ๊ธฐ์—๋Š” Mini Batch์˜ ํ˜•ํƒœ๋กœ ํ•™์Šต
์„ฑ๋Šฅ์ด ์–ด๋Š์ •๋„ ๋ณด์žฅ๋œ๋‹ค ์‹ถ์œผ๋ฉด Full Batch๋กœ ์ „ํ™˜
ModelCheckPoint์„ค์ •๊ณผ Early Stopping์„ ํ†ตํ•œ Best Model์ถ”์ถœ
์ค‘๊ฐ„์— ํ•™์Šต์ด ๋Š์–ด์งˆ ๊ฒƒ์„ ๋Œ€๋น„ํ•˜์—ฌ ์ค‘๊ฐ„ ์ค‘๊ฐ„ ๊ฒฐ๊ณผ๊ฐ’ ์ €์žฅ ํ•„์š”
( Epoch Number Restoring )
Training ํ•  ๋•Œ ๊ณ ๋ ค์‚ฌํ•ญ
์–ผ๋งˆ๋‚˜ ๋งŽ์€ ๋ˆ๊ณผ ์‹œ๊ฐ„์„ ๋“ค์—ฌ์•ผ ํ•˜๋Š๋ƒ?
Time Money
๋น ๋ฅธ Training Time ํ•„์š”
Why? ๋ชจ๋ธ ๊ฒ€์ฆ์„ ์œ„ํ•ด์„œ
( ๋†’์€ ์ •ํ™•๋„ , ์‹ค ์ ์šฉ ์—ฌ๋ถ€ ํŒ๋‹จ)
GPU
๊ฐœ์„ ์„ ์œ„ํ•œ
๋‹ค์–‘ํ•œ ๋ฐฉ์•ˆ ์ˆ˜๋ฆฝ
์‹ค์ „์—์„œ๋Š” ๋น ๋ฅธ ํ•ด๊ฒฐ๊ณผ ๋Œ€์‘์„ ์›ํ•œ๋‹ค
์ „์ดํ•™์Šต์€ ๋†’์€ ์ •ํ™•๋„๋ฅผ ๋น„๊ต์  ์งง์€ ์‹œ๊ฐ„ ๋‚ด์— ๋‹ฌ์„ฑํ•  ์ˆ˜ ์žˆ๊ธฐ ๋•Œ๋ฌธ์— ์ปดํ“จํ„ฐ ๋น„์ „ ๋ถ„์•ผ์—์„œ ์œ ๋ช…ํ•œ ๋ฐฉ๋ฒ•๋ก  ์ค‘ ํ•˜๋‚˜
(Rawat & Wang 2017).
์ „์ดํ•™์Šต์„ ์ด์šฉํ•˜๋ฉด, ์ด๋ฏธ ํ•™์Šตํ•œ ๋ฌธ์ œ์™€ ๋‹ค๋ฅธ ๋ฌธ์ œ๋ฅผ ํ’€ ๋•Œ์—๋„, ๋ฐ‘๋ฐ”๋‹ฅ์—์„œ๋ถ€ํ„ฐ ๋ชจ๋ธ์„ ์Œ“์•„์˜ฌ๋ฆฌ๋Š” ๋Œ€์‹ ์— ์ด๋ฏธ
ํ•™์Šต๋˜์–ด์žˆ๋Š” ํŒจํ„ด๋“ค์„ ํ™œ์šฉํ•ด์„œ ์ ์šฉ์‹œํ‚ฌ ์ˆ˜ ์žˆ์Œ
์ด๋ฅผ ์ƒค๋ฅดํŠธ๋ฅด ์‹์œผ๋กœ ํ‘œํ˜„ํ•˜๋ฉด ๊ฑฐ์ธ์˜ ์–ด๊นจ์— ์„œ์„œ(standing on the soulder of giants) ํ•™์Šตํ•˜๋Š” ๊ฒƒ
์ปดํ“จํ„ฐ ๋น„์ „์—์„œ ๋งํ•˜๋Š” ์ „์ดํ•™์Šต์€ ์ฃผ๋กœ ์‚ฌ์ „ํ•™์Šต ๋œ ๋ชจ๋ธ (pre-trained model) ์„ ์ด์šฉํ•˜๋Š” ๊ฒƒ
์‚ฌ์ „ํ•™์Šต ๋œ ๋ชจ๋ธ์ด๋ž€, ๋‚ด๊ฐ€ ํ’€๊ณ ์ž ํ•˜๋Š” ๋ฌธ์ œ์™€ ๋น„์Šทํ•˜๋ฉด์„œ ์‚ฌ์ด์ฆˆ๊ฐ€ ํฐ ๋ฐ์ดํ„ฐ๋กœ ์ด๋ฏธ ํ•™์Šต์ด ๋˜์–ด ์žˆ๋Š” ๋ชจ๋ธ
๊ทธ๋Ÿฐ ํฐ ๋ฐ์ดํ„ฐ๋กœ ๋ชจ๋ธ์„ ํ•™์Šต์‹œํ‚ค๋Š” ๊ฒƒ์€ ์˜ค๋žœ ์‹œ๊ฐ„๊ณผ ์—ฐ์‚ฐ๋Ÿ‰์ด ํ•„์š”ํ•˜๋ฏ€๋กœ, ๊ด€๋ก€์ ์œผ๋กœ๋Š” ์ด๋ฏธ ๊ณต๊ฐœ๋˜์–ด์žˆ๋Š” ๋ชจ๋ธ๋“ค์„ ๊ทธ์ €
importํ•ด์„œ ์‚ฌ์šฉ
์ถœ์ฒ˜ : https://jeinalog.tistory.com/13
Pretrained Model์‚ฌ์šฉ ์‹œ ๊ฐ€์žฅ
์ค‘์š”ํ•œ ํฌ์ธํŠธ
Fine Tunning Scope ์ •์˜
์ฐธ๊ณ  : https://developer.ibm.com/technologies/artificial-intelligence/articles/transfer-learning-for-deep-learning/
Feature transfer
์ฐธ๊ณ  : https://developer.ibm.com/technologies/artificial-intelligence/articles/transfer-learning-for-deep-learning/
Fine Tunning
์‹ค์ „์—์„œ ํ™œ์šฉ ์‹œ ์œ ์˜์‚ฌํ•ญ
Fine Tunning
Transfer Learning
Pretrained Model
Data Optimization
๋ฐ์ดํ„ฐ ํ™•์ธ
๋ชจ๋ธ ๊ฒ€์ฆ ๋ฐฉ์•ˆ ์ˆ˜๋ฆฝ
ambiguity uncertainty
Neural Network Architecture ๊ฒ€ํ† 
Neural Net Architecture์˜
"๊ณ„์ธต์ ์ธ ํŠน์ง•"์„ ๊ณ ๋ ค
๋…๋ฆฝ์ ์ธ ์ปดํฌ๋„ŒํŠธ์˜ ์ด์งˆ์  ์š”์†Œ
( ๋ถ„๋ฆฌ ํฌ์ธํŠธ๋ฅผ ์ฐพ์ž )
์ถœ์ฒ˜ : https://jeinalog.tistory.com/13
์ด๋Ÿฌํ•œ ๋”ฅ๋Ÿฌ๋‹ ๋ชจ๋ธ์˜ ์ค‘์š”ํ•œ ์„ฑ๊ฒฉ ์ค‘ ํ•˜๋‚˜๋Š” ๋ฐ”๋กœ "๊ณ„์ธต์ ์ธ ํŠน์ง•"์„
"์Šค์Šค๋กœ" ํ•™์Šตํ•œ๋‹ค๋Š” ์ 
๊ณ„์ธต์ ์ธ ํŠน์ง•์„ ํ•™์Šตํ•œ๋‹ค ๋Š” ๋ง์˜ ์˜๋ฏธ
๋ชจ๋ธ์˜ ์ฒซ ๋ฒˆ์งธ ์ธต์€ "์ผ๋ฐ˜์ ์ธ(general)" ํŠน์ง•์„ ์ถ”์ถœํ•˜๋„๋ก ํ•˜๋Š” ํ•™์Šต
๋ชจ๋ธ์˜ ๋งˆ์ง€๋ง‰ ์ธต์— ๊ฐ€๊นŒ์›Œ์งˆ์ˆ˜๋ก ํŠน์ • ๋ฐ์ดํ„ฐ์…‹ ๋˜๋Š” ํŠน์ • ๋ฌธ์ œ์—์„œ๋งŒ ๋‚˜ํƒ€๋‚  ์ˆ˜ ์žˆ๋Š”
"๊ตฌ์ฒด์ ์ธ(specific)" ํŠน์ง•์„ ์ถ”์ถœํ•ด๋‚ด๋„๋ก ํ•˜๋Š” ๊ณ ๋„ํ™”๋œ ํ•™์Šต
๋”ฐ๋ผ์„œ ์•ž๋‹จ์— ์žˆ๋Š” ๊ณ„์ธต๋“ค์€ ๋‹ค๋ฅธ ๋ฐ์ดํ„ฐ์…‹์˜ ์ด๋ฏธ์ง€๋“ค์„ ํ•™์Šตํ•  ๋•Œ๋„ ์žฌ์‚ฌ์šฉ ๊ฐ€๋Šฅ
๋’ท๋‹จ์˜ ๊ณ„์ธต๋“ค์€ ์ƒˆ๋กœ์šด ๋ฌธ์ œ๋ฅผ ๋งž์ดํ•  ๋•Œ๋งˆ๋‹ค ์ƒˆ๋กœ ํ•™์Šต์ด ํ•„์š”ํ•จ
Yosinski et al. (2014) ๋…ผ๋ฌธ์—์„œ๋Š” ์ด๋Ÿฌํ•œ ๋”ฅ๋Ÿฌ๋‹์˜ ํŠน์„ฑ์— ๋Œ€ํ•ด,
'๋งŒ์•ฝ ์ฒซ ๋ฒˆ์งธ ๊ณ„์ธต์—์„œ ์ถ”์ถœ๋œ ํŠน์ง•์ด ์ผ๋ฐ˜์ ์ธ ํŠน์ง•์ด๊ณ 
๋งˆ์ง€๋ง‰ ์ธต์—์„œ ์ถ”์ถœ๋œ ํŠน์ง•์ด ๊ตฌ์ฒด์ ์ธ ํŠน์ง•์ด๋ผ๋ฉด,
๋„คํŠธ์›Œํฌ ๋‚ด์˜ ์–ด๋”˜๊ฐ€์— ์ผ๋ฐ˜์ ์ธ ์ˆ˜์ค€์—์„œ ๊ตฌ์ฒด์ ์ธ ์ˆ˜์ค€์œผ๋กœ ๋„˜์–ด๊ฐ€๋Š” ์ „ํ™˜์ ์ด
๋ถ„๋ช… ์กด์žฌํ•  ๊ฒƒ'
๊ฒฐ๋ก ์ ์œผ๋กœ, ์šฐ๋ฆฌ๊ฐ€ ์‚ดํŽด๋ณธ CNN ๋ชจ๋ธ์˜ Convolutional base ๋ถ€๋ถ„,
๊ทธ ์ค‘์—์„œ๋„ ํŠนํžˆ ๋‚ฎ์€ ๋ ˆ๋ฒจ์˜ ๊ณ„์ธต(input์— ๊ฐ€๊นŒ์šด ๊ณ„์ธต)์ผ์ˆ˜๋ก ์ผ๋ฐ˜์ ์ธ ํŠน์ง•์„ ์ถ”์ถœ
๊ทธ์™€ ๋ฐ˜๋Œ€๋กœ Convolutional base ์˜ ๋†’์€ ๋ ˆ๋ฒจ์˜ ๊ณ„์ธต(output์— ๊ฐ€๊นŒ์šด ๊ณ„์ธต)๊ณผ Classifier
๋ถ€๋ถ„์€ ๋ณด๋‹ค ๊ตฌ์ฒด์ ์ด๊ณ  ํŠน์œ ํ•œ ํŠน์ง•๋“ค์„ ์ถ”์ถœ.
์ถœ์ฒ˜ : https://jeinalog.tistory.com/13
์‚ฌ์ „ ํ•™์Šต๋œ ๋ชจ๋ธ์„ ์ด์ œ ๋‚˜์˜ ํ”„๋กœ์ ํŠธ์— ๋งž๊ฒŒ ์žฌ์ •์˜ํ•œ๋‹ค๋ฉด,
๋จผ์ € ์›๋ž˜ ๋ชจ๋ธ์— ์žˆ๋˜ classifier๋ฅผ ์—†์• ๋Š” ๊ฒƒ์œผ๋กœ ์‹œ์ž‘ํ•ฉ๋‹ˆ๋‹ค.
์›๋ž˜์˜ classifier๋Š” ์‚ญ์ œํ•˜๊ณ ,
๋‚ด ๋ชฉ์ ์— ๋งž๋Š” ์ƒˆ๋กœ์šด classifier๋ฅผ ์ถ”๊ฐ€ํ•ฉ๋‹ˆ๋‹ค.
๊ทธ ํ›„ ๋งˆ์ง€๋ง‰์œผ๋กœ๋Š” ์ƒˆ๋กญ๊ฒŒ ๋งŒ๋“ค์–ด์ง„ ๋‚˜์˜ ๋ชจ๋ธ์„
๋‹ค์Œ ์„ธ ๊ฐ€์ง€ ์ „๋žต ์ค‘ ํ•œ ๊ฐ€์ง€ ๋ฐฉ๋ฒ•์„ ์ด์šฉํ•ด ํŒŒ์ธํŠœ๋‹(fine-tune)์„ ์ง„ํ–‰
์ถœ์ฒ˜ : https://jeinalog.tistory.com/13
์ „๋žต๋ณ„ ํŠน์ง•
( Fine Tunning Scope ์ •์˜)
# ์ „๋žต 1 : ์ „์ฒด ๋ชจ๋ธ์„ ์ƒˆ๋กœ ํ•™์Šต์‹œํ‚ค๊ธฐ
์ด ๋ฐฉ๋ฒ•์€ ์‚ฌ์ „ํ•™์Šต ๋ชจ๋ธ์˜ ๊ตฌ์กฐ๋งŒ ์‚ฌ์šฉํ•˜๋ฉด์„œ,
๋‚ด ๋ฐ์ดํ„ฐ์…‹์— ๋งž๊ฒŒ ์ „๋ถ€ ์ƒˆ๋กœ ํ•™์Šต์‹œํ‚ค๋Š” ๋ฐฉ๋ฒ•
๋ชจ๋ธ์„ ๋ฐ‘๋ฐ”๋‹ฅ์—์„œ๋ถ€ํ„ฐ ์ƒˆ๋กœ ํ•™์Šต์‹œํ‚ค๋Š” ๊ฒƒ์ด๋ฏ€๋กœ, ํฐ ์‚ฌ์ด์ฆˆ์˜ ๋ฐ์ดํ„ฐ์…‹์ด ํ•„์š”
(์ปดํ“จํŒ… ์—ฐ์‚ฐ ๋Šฅ๋ ฅ์„ ์œ„ํ•œ ๋งŽ์€ Infra Resource ํ•„์š”)
์ถœ์ฒ˜ : https://jeinalog.tistory.com/13
# ์ „๋žต 2 : Convolutional base์˜ ์ผ๋ถ€๋ถ„์€ ๊ณ ์ •์‹œํ‚จ ์ƒํƒœ๋กœ, ๋‚˜๋จธ์ง€
๊ณ„์ธต๊ณผ classifier๋ฅผ ์ƒˆ๋กœ ํ•™์Šต์‹œํ‚ค๊ธฐ
์•ž์„œ ์–ธ๊ธ‰ํ–ˆ๋“ฏ์ด, ๋‚ฎ์€ ๋ ˆ๋ฒจ์˜ ๊ณ„์ธต์€ ์ผ๋ฐ˜์ ์ธ ํŠน์ง•(์–ด๋–ค ๋ฌธ์ œ๋ฅผ ํ‘ธ๋Š๋ƒ์—
์ƒ๊ด€ ์—†์ด ๋…๋ฆฝ์ ์ธ ํŠน์ง•)์„ ์ถ”์ถœํ•˜๊ณ , ๋†’์€ ๋ ˆ๋ฒจ์˜ ๊ณ„์ธต์€ ๊ตฌ์ฒด์ ์ด๊ณ  ํŠน์œ ํ•œ
ํŠน์ง•(๋ฌธ์ œ์— ๋”ฐ๋ผ ๋‹ฌ๋ผ์ง€๋Š” ํŠน์ง•)์„ ์ถ”์ถœํ•ฉ๋‹ˆ๋‹ค. ์ด๋Ÿฐ ํŠน์„ฑ์„ ์ด์šฉํ•ด์„œ, ์šฐ๋ฆฌ๋Š”
์‹ ๊ฒฝ๋ง์˜ ํŒŒ๋ผ๋ฏธํ„ฐ ์ค‘ ์–ด๋Š ์ •๋„๊นŒ์ง€๋ฅผ ์žฌํ•™์Šต์‹œํ‚ฌ์ง€๋ฅผ ์ •ํ•  ์ˆ˜ ์žˆ์Œ
์ฃผ๋กœ, ๋งŒ์•ฝ ๋ฐ์ดํ„ฐ์…‹์ด ์ž‘๊ณ  ๋ชจ๋ธ์˜ ํŒŒ๋ผ๋ฏธํ„ฐ๊ฐ€ ๋งŽ๋‹ค๋ฉด, ์˜ค๋ฒ„ํ”ผํŒ…์ด ๋  ์œ„ํ—˜์ด
์žˆ์œผ๋ฏ€๋กœ ๋” ๋งŽ์€ ๊ณ„์ธต์„ ๊ฑด๋“ค์ง€ ์•Š๊ณ  ๊ทธ๋Œ€๋กœ ๋‘ .
๋ฐ˜๋ฉด์—, ๋ฐ์ดํ„ฐ์…‹์ด ํฌ๊ณ  ๊ทธ์— ๋น„ํ•ด ๋ชจ๋ธ์ด ์ž‘์•„์„œ ํŒŒ๋ผ๋ฏธํ„ฐ๊ฐ€ ์ ๋‹ค๋ฉด,
์˜ค๋ฒ„ํ”ผํŒ…์— ๋Œ€ํ•œ ๊ฑฑ์ •์„ ํ•  ํ•„์š”๊ฐ€ ์—†์œผ๋ฏ€๋กœ ๋” ๋งŽ์€ ๊ณ„์ธต์„ ํ•™์Šต์‹œ์ผœ์„œ ๋‚ด
ํ”„๋กœ์ ํŠธ์— ๋” ์ ํ•ฉํ•œ ๋ชจ๋ธ๋กœ ๋ฐœ์ „
์ถœ์ฒ˜ : https://jeinalog.tistory.com/13
# ์ „๋žต 3 : Convloutional base๋Š” ๊ณ ์ •์‹œํ‚ค๊ณ , classifier๋งŒ ์ƒˆ๋กœ
ํ•™์Šต์‹œํ‚ค๊ธฐ
์ด ๊ฒฝ์šฐ๋Š” ๋ณด๋‹ค ๊ทน๋‹จ์ ์ธ ์ƒํ™ฉ์ผ ๋•Œ ์ƒ๊ฐํ•  ์ˆ˜ ์žˆ๋Š” ์ผ€์ด์Šค
convolutional base๋Š” ๊ฑด๋“ค์ง€ ์•Š๊ณ  ๊ทธ๋Œ€๋กœ ๋‘๋ฉด์„œ ํŠน์ง• ์ถ”์ถœ ๋ฉ”์ปค๋‹ˆ์ฆ˜์œผ๋กœ์จ
ํ™œ์šฉํ•˜๊ณ , classifier๋งŒ ์žฌํ•™์Šต์‹œํ‚ค๋Š” ๋ฐฉ๋ฒ•
์ด ๋ฐฉ๋ฒ•์€ ์ปดํ“จํŒ… ์—ฐ์‚ฐ ๋Šฅ๋ ฅ์ด ๋ถ€์กฑํ•˜๊ฑฐ๋‚˜ ๋ฐ์ดํ„ฐ์…‹์ด ๋„ˆ๋ฌด ์ž‘์„๋•Œ, ๊ทธ๋ฆฌ๊ณ /๋˜๋Š”
ํ’€๊ณ ์ž ํ•˜๋Š” ๋ฌธ์ œ๊ฐ€ ์‚ฌ์ „ํ•™์Šต๋ชจ๋ธ์ด ์ด๋ฏธ ํ•™์Šตํ•œ ๋ฐ์ดํ„ฐ์…‹๊ณผ ๋งค์šฐ ๋น„์Šทํ•  ๋•Œ ๊ฒ€ํ† 
์ถœ์ฒ˜ : https://jeinalog.tistory.com/13
์ฃผ์˜ํ•ด์•ผ ํ•  ์ 
Model ์ƒ์„ฑ ๋ฐ์ดํ„ฐ ๋„๋ฉ”์ธ ํ™•์ธ
CNN ๋ฒ ์ด์Šค์˜ ์‚ฌ์ „ํ•™์Šต ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•  ๋•Œ์—๋Š”,
์ด์ „์— ํ•™์Šตํ•œ ๋‚ด์šฉ๋“ค์„ ๋ชจ๋‘ ์žŠ์–ด๋ฒ„๋ฆด ์œ„ํ—˜์ด ์žˆ๊ธฐ ๋•Œ๋ฌธ์— ์ž‘์€ learning rate๋ฅผ ์‚ฌ์šฉ
์‚ฌ์ „ํ•™์Šต ๋ชจ๋ธ์ด ์ž˜ ํ•™์Šต๋˜์—ˆ๋‹ค๋Š” ๊ฐ€์ •ํ•˜์—,
์ž‘์€ learning rate์œผ๋กœ ํ•™์Šต์„ ์‹œํ‚จ๋‹ค๋ฉด
CNN ๋ชจ๋ธ์˜ ํŒŒ๋ผ๋ฏธํ„ฐ๋“ค์„ ๋„ˆ๋ฌด ๋น ๋ฅด๊ฒŒ,
ํ˜น์€ ๋„ˆ๋ฌด ๋งŽ์ด ์™œ๊ณก์‹œํ‚ค์ง€ ์•Š๊ณ 
์›๋ž˜ ํ•™์Šต๋˜์–ด์žˆ๋˜ ์ง€์‹์„ ์ž˜ ๋ณด์กดํ•˜๋ฉด์„œ
์ถ”๊ฐ€๋กœ ํ•™์Šต์„ ํ•ด์•ผ ํ•จ
์ž‘์€ Learning rate๋ฅผ ์‚ฌ์šฉํ•œ๋‹ค๋Š” ๊ฒƒ์€ Pretrained Model์ด ์ž˜ ํ•™์Šต๋˜์—ˆ๋‹ค๋Š” ๊ฐ€์ •ํ•˜์— ์„ฑ๋ฆฝ๋จ
Pretrained Model์„ ์‚ฌ์šฉํ•˜์ง€ ์•Š๋Š” ์ผ๋ฐ˜์ ์ธ Trainingํ™˜๊ฒฝ์—์„œ
์ž‘์€ Learning rate๋Š” ์ผ๋‹จ ์ˆ˜๋ ดํ•˜๋Š” ์†๋„๊ฐ€ ๋„ˆ๋ฌด ๋Š๋ฆฌ๊ณ ,
local minimum์— ๋น ์งˆ ํ™•๋ฅ ์ด ์ฆ๊ฐ€
์ถœ์ฒ˜ : https://jeinalog.tistory.com/13
[์ฐธ๊ณ ]
Pretrained Model์„ ์‚ฌ์šฉํ•˜์ง€ ์•Š๋Š”
์ผ๋ฐ˜์ ์ธ ํ•™์Šต์˜ ๊ฒฝ์šฐ
Learning rate๋ฅผ ์„ค์ •ํ•  ๋•Œ ์ฃผ์˜ํ•  ์ 
1) Learning rate๊ฐ€ ๋„ˆ๋ฌด ํด ๋•Œ,
์ตœ์ ์˜ ๊ฐ’์œผ๋กœ ์ˆ˜๋ ดํ•˜์ง€ ์•Š๊ณ , ๋ฐœ์‚ฐํ•ด๋ฒ„๋ฆฌ๋Š” ๊ฒฝ์šฐ๊ฐ€ ๋ฐœ์ƒ โ†’ OverShooting
2) Learning rate๊ฐ€ ๋„ˆ๋ฌด ์ž‘์„ ๋•Œ,
์ผ๋‹จ ์ˆ˜๋ ดํ•˜๋Š” ์†๋„๊ฐ€ ๋„ˆ๋ฌด ๋Š๋ฆฌ๊ณ , local minimum์— ๋น ์งˆ ํ™•๋ฅ ์ด ์ฆ๊ฐ€
Learning rate๋ฅผ ์ž˜ ์ฐพ๋Š” ๋ฐฉ๋ฒ•์€ ๋”ฐ๋กœ ์—†์ง€๋งŒ, Learning rate๋ฅผ ์ž˜ ์ฐพ๊ธฐ ์œ„ํ•ด ๋„์™€์ค„ ์ˆ˜ ์žˆ๋„๋ก
๋ฐ์ดํ„ฐ๋“ค์„ ์ „์ฒ˜๋ฆฌ(preprocessing)ํ•˜๋Š” ๊ณผ์ •์ด ํ•„์š”
๊ฐ๊ฐ์˜ ๋ฐ์ดํ„ฐ๋“ค์˜ ๊ฐ’์ด ๋„ˆ๋ฌด ๋งŽ์ด ์ฐจ์ด๋‚  ๊ฒฝ์šฐ
Learning rate๋ฅผ ์ž˜ ๋ชป์„ค์ •ํ–ˆ์„ ๋•Œ์™€ ๋น„์Šทํ•œ ํ˜„์ƒ์ด ๋ฐœ์ƒ
์ด๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด feature๋“ค์„ Scaling ํ•œ๋‹ค.
Feature Scaling์—๋Š” 2๊ฐ€์ง€ ๋ฐฉ๋ฒ•์ด ์žˆ๋Š”๋ฐ,
1) Normalization - ํ‘œ๋ณธ๋“ค์˜ ๊ฐ’์„ ๋ชจ๋‘ 0 ~ 1 ์‚ฌ์ด์˜ ๊ฐ’์œผ๋กœ ๋ณ€ํ™˜ ํ•˜๋Š” ๋ฐฉ๋ฒ•.
2) Standardization - ํ‘œ๋ณธ๋“ค์˜ ๊ฐ’์„ ๊ฐ€์šฐ์‹œ์•ˆ ๋ถ„ํฌ์˜ ๊ฐ’์œผ๋กœ ๋ณ€ํ™˜ ํ•˜๋Š” ๋ฐฉ๋ฒ•.
Resnet๊ณผ VGG16์—์„œ
Transfer Learning์„ ์‚ฌ์šฉํ•˜๋Š” ์ด์œ 
Layer ๋ถ„๋ฆฌ๊ฐ€ ๊ฐ€๋Šฅํ•˜๋‹ค
์˜คํ”ˆ๋œ Pretrained Model์ด ๋งŽ๋‹ค
์ถœ์ฒ˜ : https://towardsdatascience.com/deep-learning-using-transfer-learning-python-code-for-resnet50-8acdfb3a2d38
์ถœ์ฒ˜ : https://towardsdatascience.com/deep-learning-using-transfer-learning-python-code-for-resnet50-8acdfb3a2d38
์‹ค์ „์—์„  Resnet Architecture๋งŒ ์ค€์šฉ
๋ชจ๋ธ์„ ์„œ๋น„์Šค์— ๋งž์ถ”์–ด ๊ตฌํ˜„ํ•˜๋ ค๋ฉด
ํ˜„์‹ค ์„ธ๊ณ„์—์„  Domain๋ณ„ Data Dependency
ํ™•์ธ ํ•„์š”
# ์ „๋žต 1 : ์ „์ฒด ๋ชจ๋ธ์„ ์ƒˆ๋กœ ํ•™์Šต์‹œํ‚ค๊ธฐ
์ด ๋ฐฉ๋ฒ•์€ ResNet์˜ ๊ตฌ์กฐ๋งŒ ์‚ฌ์šฉํ•˜๋ฉด์„œ,
์‹ค์ œ ๋ฐ์ดํ„ฐ์…‹์— ๋งž๊ฒŒ ์ „๋ถ€ ์ƒˆ๋กœ ํ•™์Šต์‹œํ‚ค๋Š” ๋ฐฉ๋ฒ•์ž…๋‹ˆ๋‹ค.
๋ชจ๋ธ์„ ๋ฐ‘๋ฐ”๋‹ฅ์—์„œ๋ถ€ํ„ฐ ์ƒˆ๋กœ ํ•™์Šต์‹œํ‚ค๋Š” ๊ฒƒ์ด๋ฏ€๋กœ, ํฐ ์‚ฌ์ด์ฆˆ์˜ ๋ฐ์ดํ„ฐ์…‹์ด ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.
(๊ทธ๋ฆฌ๊ณ , ์ข‹์€ ์ปดํ“จํŒ… ์—ฐ์‚ฐ ๋Šฅ๋ ฅ๋„์š”!)
์ถœ์ฒ˜ : https://jeinalog.tistory.com/13
๊ทธ๋ž˜์„œ Annotation Scope์ด ์ค‘์š”ํ•˜๋‹ค.
์ƒ์„ฑ๋œ ์ด๋ฏธ์ง€ ๋ถ„๋ฅ˜ ๋ชจ๋ธ์„ ํ†ตํ•œ
์ž๋™ Batch Annotation ์ˆ˜ํ–‰๋„ ๋ณ‘ํ–‰ํ•œ๋‹ค.
์„ฑ๋Šฅํ–ฅ์ƒ์„ ์œ„ํ•ด Joint Training์„ ํ†ตํ•ด
๋‹ค์ˆ˜์˜ ์ด๋ฏธ์ง€ ๋ชจ๋ธ์˜ ๊ฒฐํ•ฉ์„ ์ˆ˜ํ–‰ํ•œ๋‹ค.
๋‹จ๊ณ„0. ๋ฌด์ž‘์œ„ ์ดˆ๊ธฐํ™” ์ž…๋ ฅ ์ด๋ฏธ์ง€
๋ชจ๋ธ
๊ฐ€์ค‘์น˜
Forward
์˜ˆ์ธก๊ฐ’ Ground Truth (์ •๋‹ต)
Backward
๋‹จ๊ณ„1. ์ž…๋ ฅ์— ๋Œ€ํ•œ ์˜ˆ์ธก
๋‹จ๊ณ„2. ์†์‹ค ํ•จ์ˆ˜ ๊ณ„์‚ฐ
๋‹จ๊ณ„3. ์ตœ์ ํ™” ์•Œ๊ณ ๋ฆฌ์ฆ˜ ์†์‹ค
์—…๋ฐ์ดํŠธ
Loss = Cost = Error
๋‹จ๊ณ„2. ์†์‹ค ํ•จ์ˆ˜ ๊ณ„์‚ฐ ( Softmax ์†์‹ค ํ•จ์ˆ˜ ) Loss = Cost = Error
๋งˆ์ง€๋ง‰ ์ถœ๋ ฅ์ธต์€ ์ „ ๋‹จ๊ณ„์—์„œ ์ถ”์ถœ๋œ ํŠน์ง• ๋ฒกํ„ฐ๋ฅผ N๊ฐœ์˜ ๋ฒ”์ฃผ๋กœ ๋ถ„๋ฅ˜ํ•˜๊ธฐ ์œ„ํ•ด ๋ฐฐ์น˜๋œ fully-connected layer์™€
softmaxํ•จ์ˆ˜๋กœ ๊ตฌ์„ฑ๋จ
์ดํ›„ cross entropy๊ฐ€ softmaxํ•จ์ˆ˜๋ฅผ ํ†ตํ•ด ๋‚˜์˜จ ํ™•๋ฅ  ๋ถ„ํฌ์™€ ์ •๋‹ต ๋ถ„ํฌ ์‚ฌ์ด์˜ ์˜ค์ฐจ๋ฅผ ๊ณ„์‚ฐ
Softmax ํ™•๋ฅ ๊ฐ’์„ ์ด์šฉํ•œ๋‹ค๋Š” ์ ์—์„œ ์ด ์†์‹ค ํ•จ์ˆ˜๋ฅผ Softmax ์†์‹ค ํ•จ์ˆ˜๋ผ๊ณ  ๋ถ€๋ฅธ๋‹ค.
๋ถ„๋ฅ˜์— ์‚ฌ์šฉ๋˜๋Š” ํ™œ์„ฑํ™” ํ•จ์ˆ˜ โ†’ Softmaxํ•จ์ˆ˜ : ๋ชจ๋“  ์ž…๋ ฅ ์‹ ํ˜ธ๋กœ๋ถ€ํ„ฐ ์˜ํ–ฅ์„ ๋ฐ›์Œ
Softmaxํ•จ์ˆ˜ ์ฃผ์˜์ 
- ์ง€์ˆ˜ ํ•จ์ˆ˜๋กœ ๋˜์–ด์žˆ์–ด ์˜ค๋ฒ„ํ”Œ๋กœ (๋ฌดํ•œ๋Œ€ ๊ฐ’ ๋ฐœ์ƒ) ๋ฌธ์ œ๊ฐ€ ์ƒ๊ธธ ์ˆ˜ ์žˆ์Œ
1๊ฐœ์˜ ๋ชจ๋ธ์—์„œ n๊ฐœ์˜ output์„ ์ถœ๋ ฅํ•œ๋‹ค๋ฉด ์ด์— ๋งž์ถฐ n๊ฐœ์˜ ์„œ๋กœ ๋‹ค๋ฅธ loss๋ฅผ ๋ฝ‘์•„ ๋‚ผ ์ˆ˜ ์žˆ๋‹ค.
์—ฌ๋Ÿฌ๊ฐœ์˜ Loss๋“ค์„ ์–ด๋–ป๊ฒŒ ์ฒ˜๋ฆฌํ•˜๋Š๋ƒ์— ๋”ฐ๋ผ Joint Training๊ณผ Alternate Training๋ฐฉ์‹์œผ๋กœ ๋‚˜๋‰  ์ˆ˜ ์žˆ๋‹ค.
Joint Training์ด๋ž€ ์—ฌ๋Ÿฌ ๊ฐœ์˜ loss๋“ค์„ ํ•˜๋‚˜์˜ ๊ฐ’์œผ๋กœ ๋”ํ•ด์„œ ์ตœ์ข… Loss๋กœ ์‚ฌ์šฉํ•˜๋Š” ํ›ˆ๋ จ ๋ฐฉ์‹
Neural Net Architecture ๋งˆ์ง€๋ง‰ ์ธต์˜ output์„ channel๋‹จ์œ„๋กœ ์ž˜๋ผ์„œ
์„œ๋กœ ๋‹ค๋ฅธ ์—ญํ• ์„ ๋ถ€์—ฌํ•œ ๊ตฌ์กฐ๋กœ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์Œ
๊ฐ ์ฑ„๋„์ด ์„œ๋กœ ๋‹ค๋ฅธ ์—ญํ• ์„ ์ˆ˜ํ–‰ํ•˜๊ฒŒ ํ•˜๋ ค๋ฉด ์ด์— ๋งž๊ฒŒ loss function์„ ์ •์˜ํ•ด์ค˜์•ผ ํ•˜๊ธฐ ๋•Œ๋ฌธ์—
Channel๋‹จ์œ„ (์„œ๋กœ ๋‹ค๋ฅธ ์—ญํ• ) ๋‹จ์œ„์˜ Loss Function ๋ถ€์—ฌ ํ•„์š”
ํ•ต์‹ฌ์€ Total Loss = loss1 + loss2 + loss3 + loss4 + loss5
Joint Training์€ ์—ฌ๋Ÿฌ ๊ฐœ์˜ Loss๋“ค์„ ๋”ํ•ด์„œ ๋ชจ๋“  task๋“ค์„ ํ•œ๋ฒˆ์— ํ•™์Šตํ•˜๋Š” ๋ฐฉ์‹
๋‹ค์ˆ˜์˜ ์ด๋ฏธ์ง€ ๋ชจ๋ธ์˜ ๊ฒฐํ•ฉ ์‹œ์—๋„ ํ™œ์šฉ ๊ฐ€๋Šฅํ•˜๋‹ค.
์œ„์™€ ๊ฐ™์ด Joint Training์„ ์‚ฌ์šฉํ•˜๋Š” ์ด์œ ๋Š” ๋‹ค์ˆ˜์˜ ๋ชจ๋ธ๋กœ ํ•˜๋‚˜์˜ Task์ฒ˜๋ฆฌ ์‹œ์—
์ตœ์ข… ๋ชจ๋ธ ํ‰๊ฐ€ ๋ฐ ์„ฑ๋Šฅ ์ธก์ • ๊ด€๋ จํ•˜์—ฌ ์žฌ์ •์˜๊ฐ€ ํ•„์š”ํ•˜๋‹ค.
Resnet๊ณผ Bert์˜ ๊ฒฐํ•ฉ
OCR์˜ ๊ฒฝ์šฐ ์ด๋ฏธ์ง€ ๋ชจ๋ธ๊ณผ ์–ธ์–ด ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•œ๋‹ค.
์ถœ์ฒ˜ : https://arxiv.org/pdf/1908.05054.pdf
์ถœ์ฒ˜ : https://arxiv.org/pdf/1908.05054.pdf
์ถœ์ฒ˜ : https://arxiv.org/pdf/1908.05054.pdf
Bert ๋ชจ๋ธ์„ ํ•™์Šตํ•œ๋‹ค๋Š” ๊ฑด ์•„๋งˆ์ถ”์–ด
Bert Weight Load๋ฅผ ํ™œ์šฉํ•˜์ž
Pretrained Bert Model ํ™œ์šฉ
์ถœ์ฒ˜ : https://www.kaggle.com/hiromoon166/load-bert-fine-tuning-model
https://tfhub.dev/google/bert_multi_cased_L-12_H-768_A-12/1
ํ˜„์žฌ๋Š” Transfer Learning ์‹œ๋Œ€
Q & A
Thank you

More Related Content

What's hot

Bert3q KorQuAD Finetuning NLP Challenge
Bert3q KorQuAD Finetuning NLP ChallengeBert3q KorQuAD Finetuning NLP Challenge
Bert3q KorQuAD Finetuning NLP Challenge
OverDeep
ย 
Enliple BERT-Small์„ ์ด์šฉํ•œ KorQuAD ๋ชจ๋ธ
Enliple BERT-Small์„ ์ด์šฉํ•œ KorQuAD ๋ชจ๋ธEnliple BERT-Small์„ ์ด์šฉํ•œ KorQuAD ๋ชจ๋ธ
Enliple BERT-Small์„ ์ด์šฉํ•œ KorQuAD ๋ชจ๋ธ
KwangHyeonPark
ย 
boosting ๊ธฐ๋ฒ• ์ดํ•ด (bagging vs boosting)
boosting ๊ธฐ๋ฒ• ์ดํ•ด (bagging vs boosting)boosting ๊ธฐ๋ฒ• ์ดํ•ด (bagging vs boosting)
boosting ๊ธฐ๋ฒ• ์ดํ•ด (bagging vs boosting)
SANG WON PARK
ย 
Machine learning boosting 20180424
Machine learning boosting 20180424Machine learning boosting 20180424
Machine learning boosting 20180424
Changwook Jun
ย 
(Nlp)fine tuning ๋Œ€ํšŒ_์ฐธ์—ฌ๊ธฐ
(Nlp)fine tuning ๋Œ€ํšŒ_์ฐธ์—ฌ๊ธฐ(Nlp)fine tuning ๋Œ€ํšŒ_์ฐธ์—ฌ๊ธฐ
(Nlp)fine tuning ๋Œ€ํšŒ_์ฐธ์—ฌ๊ธฐ
OverDeep
ย 
Improving Language Understanding by Generative Pre-Training
Improving Language Understanding by Generative Pre-TrainingImproving Language Understanding by Generative Pre-Training
Improving Language Understanding by Generative Pre-Training
pko89403
ย 
Chapter 6 Deep feedforward networks - 2
Chapter 6 Deep feedforward networks - 2Chapter 6 Deep feedforward networks - 2
Chapter 6 Deep feedforward networks - 2
KyeongUkJang
ย 
Ai nlp challenge
Ai nlp challengeAi nlp challenge
Ai nlp challenge
Young Rok Jang
ย 
Random Forest Intro [๋žœ๋คํฌ๋ ˆ์ŠคํŠธ ์„ค๋ช…]
Random Forest Intro [๋žœ๋คํฌ๋ ˆ์ŠคํŠธ ์„ค๋ช…]Random Forest Intro [๋žœ๋คํฌ๋ ˆ์ŠคํŠธ ์„ค๋ช…]
Random Forest Intro [๋žœ๋คํฌ๋ ˆ์ŠคํŠธ ์„ค๋ช…]
Hyunwoo Kim
ย 
๊ด‘๊ณ  CTR ์˜ˆ์ธก
๊ด‘๊ณ  CTR ์˜ˆ์ธก๊ด‘๊ณ  CTR ์˜ˆ์ธก
๊ด‘๊ณ  CTR ์˜ˆ์ธก
DataScienceLab
ย 
Machine Learning Foundations (a case study approach) ๊ฐ•์˜ ์ •๋ฆฌ
Machine Learning Foundations (a case study approach) ๊ฐ•์˜ ์ •๋ฆฌMachine Learning Foundations (a case study approach) ๊ฐ•์˜ ์ •๋ฆฌ
Machine Learning Foundations (a case study approach) ๊ฐ•์˜ ์ •๋ฆฌ
SANG WON PARK
ย 
Chapter 7 Regularization for deep learning - 2
Chapter 7 Regularization for deep learning - 2Chapter 7 Regularization for deep learning - 2
Chapter 7 Regularization for deep learning - 2
KyeongUkJang
ย 
Sw๊ฐœ๋ฐœ hw์ œ์ž‘์„ค๊ณ„์„œ ์ž„๋ฒ ๋ถ€์Šค๋Ÿฌ๊ธฐ
Sw๊ฐœ๋ฐœ hw์ œ์ž‘์„ค๊ณ„์„œ ์ž„๋ฒ ๋ถ€์Šค๋Ÿฌ๊ธฐSw๊ฐœ๋ฐœ hw์ œ์ž‘์„ค๊ณ„์„œ ์ž„๋ฒ ๋ถ€์Šค๋Ÿฌ๊ธฐ
Sw๊ฐœ๋ฐœ hw์ œ์ž‘์„ค๊ณ„์„œ ์ž„๋ฒ ๋ถ€์Šค๋Ÿฌ๊ธฐ
21HG020
ย 
Ml for ์ •ํ˜•๋ฐ์ดํ„ฐ
Ml for ์ •ํ˜•๋ฐ์ดํ„ฐMl for ์ •ํ˜•๋ฐ์ดํ„ฐ
Ml for ์ •ํ˜•๋ฐ์ดํ„ฐ
JEEHYUN PAIK
ย 
[์•„๊ฟˆ์‚ฌ] The C++ Programming Language 13์žฅ ํ…œํ”Œ๋ฆฟ
[์•„๊ฟˆ์‚ฌ] The C++ Programming Language 13์žฅ ํ…œํ”Œ๋ฆฟ[์•„๊ฟˆ์‚ฌ] The C++ Programming Language 13์žฅ ํ…œํ”Œ๋ฆฟ
[์•„๊ฟˆ์‚ฌ] The C++ Programming Language 13์žฅ ํ…œํ”Œ๋ฆฟํ•ด๊ฐ•
ย 
Metric learning ์„ธ๋ฏธ๋‚˜
Metric learning ์„ธ๋ฏธ๋‚˜Metric learning ์„ธ๋ฏธ๋‚˜
Metric learning ์„ธ๋ฏธ๋‚˜
ssuser198c901
ย 
Chapter 11 Practical Methodology
Chapter 11 Practical MethodologyChapter 11 Practical Methodology
Chapter 11 Practical Methodology
KyeongUkJang
ย 
keras ๋นจ๋ฆฌ ํ›‘์–ด๋ณด๊ธฐ(intro)
keras ๋นจ๋ฆฌ ํ›‘์–ด๋ณด๊ธฐ(intro)keras ๋นจ๋ฆฌ ํ›‘์–ด๋ณด๊ธฐ(intro)
keras ๋นจ๋ฆฌ ํ›‘์–ด๋ณด๊ธฐ(intro)
beom kyun choi
ย 
Time series classification
Time series classificationTime series classification
Time series classification
Sung Kim
ย 

What's hot (20)

Bert3q KorQuAD Finetuning NLP Challenge
Bert3q KorQuAD Finetuning NLP ChallengeBert3q KorQuAD Finetuning NLP Challenge
Bert3q KorQuAD Finetuning NLP Challenge
ย 
Enliple BERT-Small์„ ์ด์šฉํ•œ KorQuAD ๋ชจ๋ธ
Enliple BERT-Small์„ ์ด์šฉํ•œ KorQuAD ๋ชจ๋ธEnliple BERT-Small์„ ์ด์šฉํ•œ KorQuAD ๋ชจ๋ธ
Enliple BERT-Small์„ ์ด์šฉํ•œ KorQuAD ๋ชจ๋ธ
ย 
boosting ๊ธฐ๋ฒ• ์ดํ•ด (bagging vs boosting)
boosting ๊ธฐ๋ฒ• ์ดํ•ด (bagging vs boosting)boosting ๊ธฐ๋ฒ• ์ดํ•ด (bagging vs boosting)
boosting ๊ธฐ๋ฒ• ์ดํ•ด (bagging vs boosting)
ย 
Machine learning boosting 20180424
Machine learning boosting 20180424Machine learning boosting 20180424
Machine learning boosting 20180424
ย 
(Nlp)fine tuning ๋Œ€ํšŒ_์ฐธ์—ฌ๊ธฐ
(Nlp)fine tuning ๋Œ€ํšŒ_์ฐธ์—ฌ๊ธฐ(Nlp)fine tuning ๋Œ€ํšŒ_์ฐธ์—ฌ๊ธฐ
(Nlp)fine tuning ๋Œ€ํšŒ_์ฐธ์—ฌ๊ธฐ
ย 
Improving Language Understanding by Generative Pre-Training
Improving Language Understanding by Generative Pre-TrainingImproving Language Understanding by Generative Pre-Training
Improving Language Understanding by Generative Pre-Training
ย 
Chapter 6 Deep feedforward networks - 2
Chapter 6 Deep feedforward networks - 2Chapter 6 Deep feedforward networks - 2
Chapter 6 Deep feedforward networks - 2
ย 
Ai nlp challenge
Ai nlp challengeAi nlp challenge
Ai nlp challenge
ย 
Random Forest Intro [๋žœ๋คํฌ๋ ˆ์ŠคํŠธ ์„ค๋ช…]
Random Forest Intro [๋žœ๋คํฌ๋ ˆ์ŠคํŠธ ์„ค๋ช…]Random Forest Intro [๋žœ๋คํฌ๋ ˆ์ŠคํŠธ ์„ค๋ช…]
Random Forest Intro [๋žœ๋คํฌ๋ ˆ์ŠคํŠธ ์„ค๋ช…]
ย 
๊ด‘๊ณ  CTR ์˜ˆ์ธก
๊ด‘๊ณ  CTR ์˜ˆ์ธก๊ด‘๊ณ  CTR ์˜ˆ์ธก
๊ด‘๊ณ  CTR ์˜ˆ์ธก
ย 
Machine Learning Foundations (a case study approach) ๊ฐ•์˜ ์ •๋ฆฌ
Machine Learning Foundations (a case study approach) ๊ฐ•์˜ ์ •๋ฆฌMachine Learning Foundations (a case study approach) ๊ฐ•์˜ ์ •๋ฆฌ
Machine Learning Foundations (a case study approach) ๊ฐ•์˜ ์ •๋ฆฌ
ย 
Chapter 7 Regularization for deep learning - 2
Chapter 7 Regularization for deep learning - 2Chapter 7 Regularization for deep learning - 2
Chapter 7 Regularization for deep learning - 2
ย 
Sw๊ฐœ๋ฐœ hw์ œ์ž‘์„ค๊ณ„์„œ ์ž„๋ฒ ๋ถ€์Šค๋Ÿฌ๊ธฐ
Sw๊ฐœ๋ฐœ hw์ œ์ž‘์„ค๊ณ„์„œ ์ž„๋ฒ ๋ถ€์Šค๋Ÿฌ๊ธฐSw๊ฐœ๋ฐœ hw์ œ์ž‘์„ค๊ณ„์„œ ์ž„๋ฒ ๋ถ€์Šค๋Ÿฌ๊ธฐ
Sw๊ฐœ๋ฐœ hw์ œ์ž‘์„ค๊ณ„์„œ ์ž„๋ฒ ๋ถ€์Šค๋Ÿฌ๊ธฐ
ย 
NN and PDF
NN and PDFNN and PDF
NN and PDF
ย 
Ml for ์ •ํ˜•๋ฐ์ดํ„ฐ
Ml for ์ •ํ˜•๋ฐ์ดํ„ฐMl for ์ •ํ˜•๋ฐ์ดํ„ฐ
Ml for ์ •ํ˜•๋ฐ์ดํ„ฐ
ย 
[์•„๊ฟˆ์‚ฌ] The C++ Programming Language 13์žฅ ํ…œํ”Œ๋ฆฟ
[์•„๊ฟˆ์‚ฌ] The C++ Programming Language 13์žฅ ํ…œํ”Œ๋ฆฟ[์•„๊ฟˆ์‚ฌ] The C++ Programming Language 13์žฅ ํ…œํ”Œ๋ฆฟ
[์•„๊ฟˆ์‚ฌ] The C++ Programming Language 13์žฅ ํ…œํ”Œ๋ฆฟ
ย 
Metric learning ์„ธ๋ฏธ๋‚˜
Metric learning ์„ธ๋ฏธ๋‚˜Metric learning ์„ธ๋ฏธ๋‚˜
Metric learning ์„ธ๋ฏธ๋‚˜
ย 
Chapter 11 Practical Methodology
Chapter 11 Practical MethodologyChapter 11 Practical Methodology
Chapter 11 Practical Methodology
ย 
keras ๋นจ๋ฆฌ ํ›‘์–ด๋ณด๊ธฐ(intro)
keras ๋นจ๋ฆฌ ํ›‘์–ด๋ณด๊ธฐ(intro)keras ๋นจ๋ฆฌ ํ›‘์–ด๋ณด๊ธฐ(intro)
keras ๋นจ๋ฆฌ ํ›‘์–ด๋ณด๊ธฐ(intro)
ย 
Time series classification
Time series classificationTime series classification
Time series classification
ย 

Similar to Transfer learning usage

Infra as a model service
Infra as a model serviceInfra as a model service
Infra as a model service
Tae Young Lee
ย 
PaLM Paper Review
PaLM Paper ReviewPaLM Paper Review
PaLM Paper Review
Tae Young Lee
ย 
Deep neural networks for You-Tube recommendations
Deep neural networks for You-Tube recommendationsDeep neural networks for You-Tube recommendations
Deep neural networks for You-Tube recommendations
seungwoo kim
ย 
ํŒŒ์ด์ฌ์œผ๋กœ ์ตํžˆ๋Š” ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ณธ (18๋…„)
ํŒŒ์ด์ฌ์œผ๋กœ ์ตํžˆ๋Š” ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ณธ (18๋…„)ํŒŒ์ด์ฌ์œผ๋กœ ์ตํžˆ๋Š” ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ณธ (18๋…„)
ํŒŒ์ด์ฌ์œผ๋กœ ์ตํžˆ๋Š” ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ณธ (18๋…„)
SK(์ฃผ) C&C - ๊ฐ•๋ณ‘ํ˜ธ
ย 
๋”ฅ๋Ÿฌ๋‹ ์„ธ๊ณ„์— ์ž…๋ฌธํ•˜๊ธฐ ์œ„๋ฐ˜ ๋ถ„ํˆฌ
๋”ฅ๋Ÿฌ๋‹ ์„ธ๊ณ„์— ์ž…๋ฌธํ•˜๊ธฐ ์œ„๋ฐ˜ ๋ถ„ํˆฌ๋”ฅ๋Ÿฌ๋‹ ์„ธ๊ณ„์— ์ž…๋ฌธํ•˜๊ธฐ ์œ„๋ฐ˜ ๋ถ„ํˆฌ
๋”ฅ๋Ÿฌ๋‹ ์„ธ๊ณ„์— ์ž…๋ฌธํ•˜๊ธฐ ์œ„๋ฐ˜ ๋ถ„ํˆฌ
Ubuntu Korea Community
ย 
Bag of Tricks for Image Classification with Convolutional Neural Networks (C...
Bag of Tricks for Image Classification  with Convolutional Neural Networks (C...Bag of Tricks for Image Classification  with Convolutional Neural Networks (C...
Bag of Tricks for Image Classification with Convolutional Neural Networks (C...
gohyunwoong
ย 
๋จธ์‹ ๋Ÿฌ๋‹๊ณผ ์‚ฌ์ดํ‚ท๋Ÿฐ์˜ ์ดํ•ด
๋จธ์‹ ๋Ÿฌ๋‹๊ณผ ์‚ฌ์ดํ‚ท๋Ÿฐ์˜ ์ดํ•ด๋จธ์‹ ๋Ÿฌ๋‹๊ณผ ์‚ฌ์ดํ‚ท๋Ÿฐ์˜ ์ดํ•ด
๋จธ์‹ ๋Ÿฌ๋‹๊ณผ ์‚ฌ์ดํ‚ท๋Ÿฐ์˜ ์ดํ•ด
์ฒ ๋ฏผ ๊ถŒ
ย 
๋งํฌ๋“œ์ธ์˜ Big Data Recommendation Products - ์–ด์ œ์˜ ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ด ๋‚ด์ผ์„ ์˜ˆ์ธกํ•œ๋‹ค
๋งํฌ๋“œ์ธ์˜ Big Data Recommendation Products - ์–ด์ œ์˜ ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ด ๋‚ด์ผ์„ ์˜ˆ์ธกํ•œ๋‹ค๋งํฌ๋“œ์ธ์˜ Big Data Recommendation Products - ์–ด์ œ์˜ ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ด ๋‚ด์ผ์„ ์˜ˆ์ธกํ•œ๋‹ค
๋งํฌ๋“œ์ธ์˜ Big Data Recommendation Products - ์–ด์ œ์˜ ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ด ๋‚ด์ผ์„ ์˜ˆ์ธกํ•œ๋‹ค
Evion Kim
ย 
AI_introduction and requirements(2024.05.12).pdf
AI_introduction and requirements(2024.05.12).pdfAI_introduction and requirements(2024.05.12).pdf
AI_introduction and requirements(2024.05.12).pdf
Lee Chanwoo
ย 
๊ฒ€์ƒ‰์—”์ง„์— ์ ์šฉ๋œ ChatGPT
๊ฒ€์ƒ‰์—”์ง„์— ์ ์šฉ๋œ ChatGPT๊ฒ€์ƒ‰์—”์ง„์— ์ ์šฉ๋œ ChatGPT
๊ฒ€์ƒ‰์—”์ง„์— ์ ์šฉ๋œ ChatGPT
Tae Young Lee
ย 
Chapter 8 - optimization for training deep models
Chapter 8 - optimization for training deep modelsChapter 8 - optimization for training deep models
Chapter 8 - optimization for training deep models
KyeongUkJang
ย 
ํ’€์žŽ์Šค์ฟจ - LIME ๋ฐœํ‘œ์ž๋ฃŒ(์„ค๋ช…๊ฐ€๋Šฅํ•œ ์ธ๊ณต์ง€๋Šฅ ๊ธฐํš!)
ํ’€์žŽ์Šค์ฟจ - LIME ๋ฐœํ‘œ์ž๋ฃŒ(์„ค๋ช…๊ฐ€๋Šฅํ•œ ์ธ๊ณต์ง€๋Šฅ ๊ธฐํš!)ํ’€์žŽ์Šค์ฟจ - LIME ๋ฐœํ‘œ์ž๋ฃŒ(์„ค๋ช…๊ฐ€๋Šฅํ•œ ์ธ๊ณต์ง€๋Šฅ ๊ธฐํš!)
ํ’€์žŽ์Šค์ฟจ - LIME ๋ฐœํ‘œ์ž๋ฃŒ(์„ค๋ช…๊ฐ€๋Šฅํ•œ ์ธ๊ณต์ง€๋Šฅ ๊ธฐํš!)
Bong-Ho Lee
ย 
Understanding MLOps
Understanding MLOpsUnderstanding MLOps
Understanding MLOps
Chun Myung Kyu
ย 
Enhanced ai platform
Enhanced ai platformEnhanced ai platform
Enhanced ai platform
Tae Young Lee
ย 
Image data augmentatiion
Image data augmentatiionImage data augmentatiion
Image data augmentatiion
Subin An
ย 
CS294-112 18
CS294-112 18CS294-112 18
CS294-112 18
Gyubin Son
ย 
Exploring Deep Learning Acceleration Technology Embedded in LLMs
Exploring Deep Learning Acceleration Technology Embedded in LLMsExploring Deep Learning Acceleration Technology Embedded in LLMs
Exploring Deep Learning Acceleration Technology Embedded in LLMs
Tae Young Lee
ย 
์ดํŽ™ํ‹ฐ๋ธŒ C++ (7~9)
์ดํŽ™ํ‹ฐ๋ธŒ C++ (7~9)์ดํŽ™ํ‹ฐ๋ธŒ C++ (7~9)
์ดํŽ™ํ‹ฐ๋ธŒ C++ (7~9)์ต์„ฑ ์กฐ
ย 
Image Deep Learning ์‹ค๋ฌด์ ์šฉ
Image Deep Learning ์‹ค๋ฌด์ ์šฉImage Deep Learning ์‹ค๋ฌด์ ์šฉ
Image Deep Learning ์‹ค๋ฌด์ ์šฉ
Youngjae Kim
ย 

Similar to Transfer learning usage (20)

Infra as a model service
Infra as a model serviceInfra as a model service
Infra as a model service
ย 
PaLM Paper Review
PaLM Paper ReviewPaLM Paper Review
PaLM Paper Review
ย 
Deep neural networks for You-Tube recommendations
Deep neural networks for You-Tube recommendationsDeep neural networks for You-Tube recommendations
Deep neural networks for You-Tube recommendations
ย 
ํŒŒ์ด์ฌ์œผ๋กœ ์ตํžˆ๋Š” ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ณธ (18๋…„)
ํŒŒ์ด์ฌ์œผ๋กœ ์ตํžˆ๋Š” ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ณธ (18๋…„)ํŒŒ์ด์ฌ์œผ๋กœ ์ตํžˆ๋Š” ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ณธ (18๋…„)
ํŒŒ์ด์ฌ์œผ๋กœ ์ตํžˆ๋Š” ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ณธ (18๋…„)
ย 
๋”ฅ๋Ÿฌ๋‹ ์„ธ๊ณ„์— ์ž…๋ฌธํ•˜๊ธฐ ์œ„๋ฐ˜ ๋ถ„ํˆฌ
๋”ฅ๋Ÿฌ๋‹ ์„ธ๊ณ„์— ์ž…๋ฌธํ•˜๊ธฐ ์œ„๋ฐ˜ ๋ถ„ํˆฌ๋”ฅ๋Ÿฌ๋‹ ์„ธ๊ณ„์— ์ž…๋ฌธํ•˜๊ธฐ ์œ„๋ฐ˜ ๋ถ„ํˆฌ
๋”ฅ๋Ÿฌ๋‹ ์„ธ๊ณ„์— ์ž…๋ฌธํ•˜๊ธฐ ์œ„๋ฐ˜ ๋ถ„ํˆฌ
ย 
Bag of Tricks for Image Classification with Convolutional Neural Networks (C...
Bag of Tricks for Image Classification  with Convolutional Neural Networks (C...Bag of Tricks for Image Classification  with Convolutional Neural Networks (C...
Bag of Tricks for Image Classification with Convolutional Neural Networks (C...
ย 
๋จธ์‹ ๋Ÿฌ๋‹๊ณผ ์‚ฌ์ดํ‚ท๋Ÿฐ์˜ ์ดํ•ด
๋จธ์‹ ๋Ÿฌ๋‹๊ณผ ์‚ฌ์ดํ‚ท๋Ÿฐ์˜ ์ดํ•ด๋จธ์‹ ๋Ÿฌ๋‹๊ณผ ์‚ฌ์ดํ‚ท๋Ÿฐ์˜ ์ดํ•ด
๋จธ์‹ ๋Ÿฌ๋‹๊ณผ ์‚ฌ์ดํ‚ท๋Ÿฐ์˜ ์ดํ•ด
ย 
๋งํฌ๋“œ์ธ์˜ Big Data Recommendation Products - ์–ด์ œ์˜ ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ด ๋‚ด์ผ์„ ์˜ˆ์ธกํ•œ๋‹ค
๋งํฌ๋“œ์ธ์˜ Big Data Recommendation Products - ์–ด์ œ์˜ ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ด ๋‚ด์ผ์„ ์˜ˆ์ธกํ•œ๋‹ค๋งํฌ๋“œ์ธ์˜ Big Data Recommendation Products - ์–ด์ œ์˜ ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ด ๋‚ด์ผ์„ ์˜ˆ์ธกํ•œ๋‹ค
๋งํฌ๋“œ์ธ์˜ Big Data Recommendation Products - ์–ด์ œ์˜ ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ด ๋‚ด์ผ์„ ์˜ˆ์ธกํ•œ๋‹ค
ย 
AI_introduction and requirements(2024.05.12).pdf
AI_introduction and requirements(2024.05.12).pdfAI_introduction and requirements(2024.05.12).pdf
AI_introduction and requirements(2024.05.12).pdf
ย 
๊ฒ€์ƒ‰์—”์ง„์— ์ ์šฉ๋œ ChatGPT
๊ฒ€์ƒ‰์—”์ง„์— ์ ์šฉ๋œ ChatGPT๊ฒ€์ƒ‰์—”์ง„์— ์ ์šฉ๋œ ChatGPT
๊ฒ€์ƒ‰์—”์ง„์— ์ ์šฉ๋œ ChatGPT
ย 
Chapter 8 - optimization for training deep models
Chapter 8 - optimization for training deep modelsChapter 8 - optimization for training deep models
Chapter 8 - optimization for training deep models
ย 
ํ’€์žŽ์Šค์ฟจ - LIME ๋ฐœํ‘œ์ž๋ฃŒ(์„ค๋ช…๊ฐ€๋Šฅํ•œ ์ธ๊ณต์ง€๋Šฅ ๊ธฐํš!)
ํ’€์žŽ์Šค์ฟจ - LIME ๋ฐœํ‘œ์ž๋ฃŒ(์„ค๋ช…๊ฐ€๋Šฅํ•œ ์ธ๊ณต์ง€๋Šฅ ๊ธฐํš!)ํ’€์žŽ์Šค์ฟจ - LIME ๋ฐœํ‘œ์ž๋ฃŒ(์„ค๋ช…๊ฐ€๋Šฅํ•œ ์ธ๊ณต์ง€๋Šฅ ๊ธฐํš!)
ํ’€์žŽ์Šค์ฟจ - LIME ๋ฐœํ‘œ์ž๋ฃŒ(์„ค๋ช…๊ฐ€๋Šฅํ•œ ์ธ๊ณต์ง€๋Šฅ ๊ธฐํš!)
ย 
Understanding MLOps
Understanding MLOpsUnderstanding MLOps
Understanding MLOps
ย 
Enhanced ai platform
Enhanced ai platformEnhanced ai platform
Enhanced ai platform
ย 
Image data augmentatiion
Image data augmentatiionImage data augmentatiion
Image data augmentatiion
ย 
CS294-112 18
CS294-112 18CS294-112 18
CS294-112 18
ย 
Exploring Deep Learning Acceleration Technology Embedded in LLMs
Exploring Deep Learning Acceleration Technology Embedded in LLMsExploring Deep Learning Acceleration Technology Embedded in LLMs
Exploring Deep Learning Acceleration Technology Embedded in LLMs
ย 
EC 789
EC 789EC 789
EC 789
ย 
์ดํŽ™ํ‹ฐ๋ธŒ C++ (7~9)
์ดํŽ™ํ‹ฐ๋ธŒ C++ (7~9)์ดํŽ™ํ‹ฐ๋ธŒ C++ (7~9)
์ดํŽ™ํ‹ฐ๋ธŒ C++ (7~9)
ย 
Image Deep Learning ์‹ค๋ฌด์ ์šฉ
Image Deep Learning ์‹ค๋ฌด์ ์šฉImage Deep Learning ์‹ค๋ฌด์ ์šฉ
Image Deep Learning ์‹ค๋ฌด์ ์šฉ
ย 

More from Tae Young Lee

Review of the Paper on Capabilities of Gemini Models in Medicine
Review of the Paper on Capabilities of Gemini Models in MedicineReview of the Paper on Capabilities of Gemini Models in Medicine
Review of the Paper on Capabilities of Gemini Models in Medicine
Tae Young Lee
ย 
A future that integrates LLMs and LAMs (Symposium)
A future that integrates LLMs and LAMs (Symposium)A future that integrates LLMs and LAMs (Symposium)
A future that integrates LLMs and LAMs (Symposium)
Tae Young Lee
ย 
LLMs Service that provides what users want to know
LLMs Service that provides what users want to knowLLMs Service that provides what users want to know
LLMs Service that provides what users want to know
Tae Young Lee
ย 
LLM แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„€แ…ตแ„‡แ…กแ†ซ แ„‰แ…ฅแ„‡แ…ตแ„‰แ…ณ แ„‰แ…ตแ†ฏแ„Œแ…ฅแ†ซ แ„€แ…กแ„‹แ…ตแ„ƒแ…ณ
LLM แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„€แ…ตแ„‡แ…กแ†ซ แ„‰แ…ฅแ„‡แ…ตแ„‰แ…ณ แ„‰แ…ตแ†ฏแ„Œแ…ฅแ†ซ แ„€แ…กแ„‹แ…ตแ„ƒแ…ณLLM แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„€แ…ตแ„‡แ…กแ†ซ แ„‰แ…ฅแ„‡แ…ตแ„‰แ…ณ แ„‰แ…ตแ†ฏแ„Œแ…ฅแ†ซ แ„€แ…กแ„‹แ…ตแ„ƒแ…ณ
LLM แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„€แ…ตแ„‡แ…กแ†ซ แ„‰แ…ฅแ„‡แ…ตแ„‰แ…ณ แ„‰แ…ตแ†ฏแ„Œแ…ฅแ†ซ แ„€แ…กแ„‹แ…ตแ„ƒแ…ณ
Tae Young Lee
ย 
Recommendation System History
Recommendation System HistoryRecommendation System History
Recommendation System History
Tae Young Lee
ย 
แ„€แ…ฅแ†ทแ„‰แ…ขแ†จแ„‹แ…ฆแ†ซแ„Œแ…ตแ†ซแ„‹แ…ฆ แ„Œแ…ฅแ†จแ„‹แ…ญแ†ผแ„ƒแ…ฌแ†ซ แ„ƒแ…ตแ†ธแ„…แ…ฅแ„‚แ…ตแ†ผ แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„‡แ…กแ†ผแ„‡แ…ฅแ†ธแ„…แ…ฉแ†ซ
แ„€แ…ฅแ†ทแ„‰แ…ขแ†จแ„‹แ…ฆแ†ซแ„Œแ…ตแ†ซแ„‹แ…ฆ แ„Œแ…ฅแ†จแ„‹แ…ญแ†ผแ„ƒแ…ฌแ†ซ แ„ƒแ…ตแ†ธแ„…แ…ฅแ„‚แ…ตแ†ผ แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„‡แ…กแ†ผแ„‡แ…ฅแ†ธแ„…แ…ฉแ†ซแ„€แ…ฅแ†ทแ„‰แ…ขแ†จแ„‹แ…ฆแ†ซแ„Œแ…ตแ†ซแ„‹แ…ฆ แ„Œแ…ฅแ†จแ„‹แ…ญแ†ผแ„ƒแ…ฌแ†ซ แ„ƒแ…ตแ†ธแ„…แ…ฅแ„‚แ…ตแ†ผ แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„‡แ…กแ†ผแ„‡แ…ฅแ†ธแ„…แ…ฉแ†ซ
แ„€แ…ฅแ†ทแ„‰แ…ขแ†จแ„‹แ…ฆแ†ซแ„Œแ…ตแ†ซแ„‹แ…ฆ แ„Œแ…ฅแ†จแ„‹แ…ญแ†ผแ„ƒแ…ฌแ†ซ แ„ƒแ…ตแ†ธแ„…แ…ฅแ„‚แ…ตแ†ผ แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„‡แ…กแ†ผแ„‡แ…ฅแ†ธแ„…แ…ฉแ†ซ
Tae Young Lee
ย 
The Deep Learning Compiler
The Deep Learning CompilerThe Deep Learning Compiler
The Deep Learning Compiler
Tae Young Lee
ย 
History of Vision AI
History of Vision AIHistory of Vision AI
History of Vision AI
Tae Young Lee
ย 
Aws architecture
Aws architectureAws architecture
Aws architecture
Tae Young Lee
ย 
Nvidia architecture
Nvidia architectureNvidia architecture
Nvidia architecture
Tae Young Lee
ย 
Ai based on gpu
Ai based on gpuAi based on gpu
Ai based on gpu
Tae Young Lee
ย 
Real estate trust solution
Real estate trust solutionReal estate trust solution
Real estate trust solution
Tae Young Lee
ย 
Bayesian learning
Bayesian learningBayesian learning
Bayesian learning
Tae Young Lee
ย 
Deep learning financial product recommendation model
Deep learning financial product recommendation modelDeep learning financial product recommendation model
Deep learning financial product recommendation model
Tae Young Lee
ย 
Deep Learning Through Various Processes
Deep Learning Through Various ProcessesDeep Learning Through Various Processes
Deep Learning Through Various Processes
Tae Young Lee
ย 
Jenkins in finance
Jenkins in financeJenkins in finance
Jenkins in finance
Tae Young Lee
ย 
HistoryOfCNN
HistoryOfCNNHistoryOfCNN
HistoryOfCNN
Tae Young Lee
ย 
๊ธˆ์œต๋ถ„์•ผ ์ทจ์—…์„ ์œ„ํ•œ ์ปค๋ฆฌ์–ด ์„ค๊ณ„
๊ธˆ์œต๋ถ„์•ผ ์ทจ์—…์„ ์œ„ํ•œ ์ปค๋ฆฌ์–ด ์„ค๊ณ„๊ธˆ์œต๋ถ„์•ผ ์ทจ์—…์„ ์œ„ํ•œ ์ปค๋ฆฌ์–ด ์„ค๊ณ„
๊ธˆ์œต๋ถ„์•ผ ์ทจ์—…์„ ์œ„ํ•œ ์ปค๋ฆฌ์–ด ์„ค๊ณ„
Tae Young Lee
ย 
ํด๋ผ์šฐ๋“œ๋ฅผ ํ™œ์šฉํ•œ ๋”ฅ๋Ÿฌ๋‹ ํ™œ์šฉ ์‚ฌ๋ก€
ํด๋ผ์šฐ๋“œ๋ฅผ ํ™œ์šฉํ•œ ๋”ฅ๋Ÿฌ๋‹ ํ™œ์šฉ ์‚ฌ๋ก€ํด๋ผ์šฐ๋“œ๋ฅผ ํ™œ์šฉํ•œ ๋”ฅ๋Ÿฌ๋‹ ํ™œ์šฉ ์‚ฌ๋ก€
ํด๋ผ์šฐ๋“œ๋ฅผ ํ™œ์šฉํ•œ ๋”ฅ๋Ÿฌ๋‹ ํ™œ์šฉ ์‚ฌ๋ก€
Tae Young Lee
ย 
๊ธˆ์œต์˜ ์—ญ์‚ฌ๋ฅผ ํ†ตํ•ด ๋ณธ ๋”ฅ๋Ÿฌ๋‹์˜ ํ•จ์ •
๊ธˆ์œต์˜ ์—ญ์‚ฌ๋ฅผ ํ†ตํ•ด ๋ณธ ๋”ฅ๋Ÿฌ๋‹์˜ ํ•จ์ •๊ธˆ์œต์˜ ์—ญ์‚ฌ๋ฅผ ํ†ตํ•ด ๋ณธ ๋”ฅ๋Ÿฌ๋‹์˜ ํ•จ์ •
๊ธˆ์œต์˜ ์—ญ์‚ฌ๋ฅผ ํ†ตํ•ด ๋ณธ ๋”ฅ๋Ÿฌ๋‹์˜ ํ•จ์ •
Tae Young Lee
ย 

More from Tae Young Lee (20)

Review of the Paper on Capabilities of Gemini Models in Medicine
Review of the Paper on Capabilities of Gemini Models in MedicineReview of the Paper on Capabilities of Gemini Models in Medicine
Review of the Paper on Capabilities of Gemini Models in Medicine
ย 
A future that integrates LLMs and LAMs (Symposium)
A future that integrates LLMs and LAMs (Symposium)A future that integrates LLMs and LAMs (Symposium)
A future that integrates LLMs and LAMs (Symposium)
ย 
LLMs Service that provides what users want to know
LLMs Service that provides what users want to knowLLMs Service that provides what users want to know
LLMs Service that provides what users want to know
ย 
LLM แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„€แ…ตแ„‡แ…กแ†ซ แ„‰แ…ฅแ„‡แ…ตแ„‰แ…ณ แ„‰แ…ตแ†ฏแ„Œแ…ฅแ†ซ แ„€แ…กแ„‹แ…ตแ„ƒแ…ณ
LLM แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„€แ…ตแ„‡แ…กแ†ซ แ„‰แ…ฅแ„‡แ…ตแ„‰แ…ณ แ„‰แ…ตแ†ฏแ„Œแ…ฅแ†ซ แ„€แ…กแ„‹แ…ตแ„ƒแ…ณLLM แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„€แ…ตแ„‡แ…กแ†ซ แ„‰แ…ฅแ„‡แ…ตแ„‰แ…ณ แ„‰แ…ตแ†ฏแ„Œแ…ฅแ†ซ แ„€แ…กแ„‹แ…ตแ„ƒแ…ณ
LLM แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„€แ…ตแ„‡แ…กแ†ซ แ„‰แ…ฅแ„‡แ…ตแ„‰แ…ณ แ„‰แ…ตแ†ฏแ„Œแ…ฅแ†ซ แ„€แ…กแ„‹แ…ตแ„ƒแ…ณ
ย 
Recommendation System History
Recommendation System HistoryRecommendation System History
Recommendation System History
ย 
แ„€แ…ฅแ†ทแ„‰แ…ขแ†จแ„‹แ…ฆแ†ซแ„Œแ…ตแ†ซแ„‹แ…ฆ แ„Œแ…ฅแ†จแ„‹แ…ญแ†ผแ„ƒแ…ฌแ†ซ แ„ƒแ…ตแ†ธแ„…แ…ฅแ„‚แ…ตแ†ผ แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„‡แ…กแ†ผแ„‡แ…ฅแ†ธแ„…แ…ฉแ†ซ
แ„€แ…ฅแ†ทแ„‰แ…ขแ†จแ„‹แ…ฆแ†ซแ„Œแ…ตแ†ซแ„‹แ…ฆ แ„Œแ…ฅแ†จแ„‹แ…ญแ†ผแ„ƒแ…ฌแ†ซ แ„ƒแ…ตแ†ธแ„…แ…ฅแ„‚แ…ตแ†ผ แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„‡แ…กแ†ผแ„‡แ…ฅแ†ธแ„…แ…ฉแ†ซแ„€แ…ฅแ†ทแ„‰แ…ขแ†จแ„‹แ…ฆแ†ซแ„Œแ…ตแ†ซแ„‹แ…ฆ แ„Œแ…ฅแ†จแ„‹แ…ญแ†ผแ„ƒแ…ฌแ†ซ แ„ƒแ…ตแ†ธแ„…แ…ฅแ„‚แ…ตแ†ผ แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„‡แ…กแ†ผแ„‡แ…ฅแ†ธแ„…แ…ฉแ†ซ
แ„€แ…ฅแ†ทแ„‰แ…ขแ†จแ„‹แ…ฆแ†ซแ„Œแ…ตแ†ซแ„‹แ…ฆ แ„Œแ…ฅแ†จแ„‹แ…ญแ†ผแ„ƒแ…ฌแ†ซ แ„ƒแ…ตแ†ธแ„…แ…ฅแ„‚แ…ตแ†ผ แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„‡แ…กแ†ผแ„‡แ…ฅแ†ธแ„…แ…ฉแ†ซ
ย 
The Deep Learning Compiler
The Deep Learning CompilerThe Deep Learning Compiler
The Deep Learning Compiler
ย 
History of Vision AI
History of Vision AIHistory of Vision AI
History of Vision AI
ย 
Aws architecture
Aws architectureAws architecture
Aws architecture
ย 
Nvidia architecture
Nvidia architectureNvidia architecture
Nvidia architecture
ย 
Ai based on gpu
Ai based on gpuAi based on gpu
Ai based on gpu
ย 
Real estate trust solution
Real estate trust solutionReal estate trust solution
Real estate trust solution
ย 
Bayesian learning
Bayesian learningBayesian learning
Bayesian learning
ย 
Deep learning financial product recommendation model
Deep learning financial product recommendation modelDeep learning financial product recommendation model
Deep learning financial product recommendation model
ย 
Deep Learning Through Various Processes
Deep Learning Through Various ProcessesDeep Learning Through Various Processes
Deep Learning Through Various Processes
ย 
Jenkins in finance
Jenkins in financeJenkins in finance
Jenkins in finance
ย 
HistoryOfCNN
HistoryOfCNNHistoryOfCNN
HistoryOfCNN
ย 
๊ธˆ์œต๋ถ„์•ผ ์ทจ์—…์„ ์œ„ํ•œ ์ปค๋ฆฌ์–ด ์„ค๊ณ„
๊ธˆ์œต๋ถ„์•ผ ์ทจ์—…์„ ์œ„ํ•œ ์ปค๋ฆฌ์–ด ์„ค๊ณ„๊ธˆ์œต๋ถ„์•ผ ์ทจ์—…์„ ์œ„ํ•œ ์ปค๋ฆฌ์–ด ์„ค๊ณ„
๊ธˆ์œต๋ถ„์•ผ ์ทจ์—…์„ ์œ„ํ•œ ์ปค๋ฆฌ์–ด ์„ค๊ณ„
ย 
ํด๋ผ์šฐ๋“œ๋ฅผ ํ™œ์šฉํ•œ ๋”ฅ๋Ÿฌ๋‹ ํ™œ์šฉ ์‚ฌ๋ก€
ํด๋ผ์šฐ๋“œ๋ฅผ ํ™œ์šฉํ•œ ๋”ฅ๋Ÿฌ๋‹ ํ™œ์šฉ ์‚ฌ๋ก€ํด๋ผ์šฐ๋“œ๋ฅผ ํ™œ์šฉํ•œ ๋”ฅ๋Ÿฌ๋‹ ํ™œ์šฉ ์‚ฌ๋ก€
ํด๋ผ์šฐ๋“œ๋ฅผ ํ™œ์šฉํ•œ ๋”ฅ๋Ÿฌ๋‹ ํ™œ์šฉ ์‚ฌ๋ก€
ย 
๊ธˆ์œต์˜ ์—ญ์‚ฌ๋ฅผ ํ†ตํ•ด ๋ณธ ๋”ฅ๋Ÿฌ๋‹์˜ ํ•จ์ •
๊ธˆ์œต์˜ ์—ญ์‚ฌ๋ฅผ ํ†ตํ•ด ๋ณธ ๋”ฅ๋Ÿฌ๋‹์˜ ํ•จ์ •๊ธˆ์œต์˜ ์—ญ์‚ฌ๋ฅผ ํ†ตํ•ด ๋ณธ ๋”ฅ๋Ÿฌ๋‹์˜ ํ•จ์ •
๊ธˆ์œต์˜ ์—ญ์‚ฌ๋ฅผ ํ†ตํ•ด ๋ณธ ๋”ฅ๋Ÿฌ๋‹์˜ ํ•จ์ •
ย 

Transfer learning usage

  • 5. ๋‚ด๊ฐ€ ๊ฐ€์ง€๊ณ  ์žˆ๋Š” ๊ฒƒ์ด ๋ฌด์–ด๋ƒ?
  • 7. Deep Learning Modeling ๊ด€์ ์—์„œ ์žฌํ•ด์„ํ•ด ๋ณด์ž!
  • 10. Training ๊ฐ€๋Šฅํ•œ ๋ฐ์ดํ„ฐ๋Š” ๋งŽ์ง€๊ฐ€ ์•Š๋‹ค. ์—…๋ฌด ์‹œ์Šคํ…œ ์ค‘์‹ฌ์œผ๋กœ ๋ฐ์ดํ„ฐ๊ฐ€ ๋ณด๊ด€๋˜์–ด ์žˆ๋‹ค. ์—…๋ฌด ์‹œ์Šคํ…œ์— ๋ถ€ํ•˜๋ฅผ ์ฃผ์ง€ ์•Š๋Š” ๋ฐ์ดํ„ฐ ํ™•๋ณด ๋ฐฉ์•ˆ ์ˆ˜๋ฆฝ Data Transform์ด ํ•„์š”ํ•˜๊ณ  Augmentation์ด ํ•„์ˆ˜์  ๋ฌด์—‡์„ ๋ชจ๋ธ๋งํ•  ๊ฒƒ์ธ์ง€๋ฅผ ์„ ํ–‰ํ•ด์„œ ํ™•์ธ ํ•„์š”
  • 12. ์ œ์•ฝ์ ์ธ Infra Resource Training ์‹œ ๋ฐ์ดํ„ฐ์— ๋Œ€ํ•œ ๊ณ ๋ ค ํ•„์š” ์ดˆ๊ธฐ์—๋Š” Mini Batch์˜ ํ˜•ํƒœ๋กœ ํ•™์Šต ์„ฑ๋Šฅ์ด ์–ด๋Š์ •๋„ ๋ณด์žฅ๋œ๋‹ค ์‹ถ์œผ๋ฉด Full Batch๋กœ ์ „ํ™˜ ModelCheckPoint์„ค์ •๊ณผ Early Stopping์„ ํ†ตํ•œ Best Model์ถ”์ถœ ์ค‘๊ฐ„์— ํ•™์Šต์ด ๋Š์–ด์งˆ ๊ฒƒ์„ ๋Œ€๋น„ํ•˜์—ฌ ์ค‘๊ฐ„ ์ค‘๊ฐ„ ๊ฒฐ๊ณผ๊ฐ’ ์ €์žฅ ํ•„์š” ( Epoch Number Restoring )
  • 13. Training ํ•  ๋•Œ ๊ณ ๋ ค์‚ฌํ•ญ
  • 14. ์–ผ๋งˆ๋‚˜ ๋งŽ์€ ๋ˆ๊ณผ ์‹œ๊ฐ„์„ ๋“ค์—ฌ์•ผ ํ•˜๋Š๋ƒ?
  • 15. Time Money ๋น ๋ฅธ Training Time ํ•„์š” Why? ๋ชจ๋ธ ๊ฒ€์ฆ์„ ์œ„ํ•ด์„œ ( ๋†’์€ ์ •ํ™•๋„ , ์‹ค ์ ์šฉ ์—ฌ๋ถ€ ํŒ๋‹จ) GPU ๊ฐœ์„ ์„ ์œ„ํ•œ ๋‹ค์–‘ํ•œ ๋ฐฉ์•ˆ ์ˆ˜๋ฆฝ
  • 17. ์ „์ดํ•™์Šต์€ ๋†’์€ ์ •ํ™•๋„๋ฅผ ๋น„๊ต์  ์งง์€ ์‹œ๊ฐ„ ๋‚ด์— ๋‹ฌ์„ฑํ•  ์ˆ˜ ์žˆ๊ธฐ ๋•Œ๋ฌธ์— ์ปดํ“จํ„ฐ ๋น„์ „ ๋ถ„์•ผ์—์„œ ์œ ๋ช…ํ•œ ๋ฐฉ๋ฒ•๋ก  ์ค‘ ํ•˜๋‚˜ (Rawat & Wang 2017). ์ „์ดํ•™์Šต์„ ์ด์šฉํ•˜๋ฉด, ์ด๋ฏธ ํ•™์Šตํ•œ ๋ฌธ์ œ์™€ ๋‹ค๋ฅธ ๋ฌธ์ œ๋ฅผ ํ’€ ๋•Œ์—๋„, ๋ฐ‘๋ฐ”๋‹ฅ์—์„œ๋ถ€ํ„ฐ ๋ชจ๋ธ์„ ์Œ“์•„์˜ฌ๋ฆฌ๋Š” ๋Œ€์‹ ์— ์ด๋ฏธ ํ•™์Šต๋˜์–ด์žˆ๋Š” ํŒจํ„ด๋“ค์„ ํ™œ์šฉํ•ด์„œ ์ ์šฉ์‹œํ‚ฌ ์ˆ˜ ์žˆ์Œ ์ด๋ฅผ ์ƒค๋ฅดํŠธ๋ฅด ์‹์œผ๋กœ ํ‘œํ˜„ํ•˜๋ฉด ๊ฑฐ์ธ์˜ ์–ด๊นจ์— ์„œ์„œ(standing on the soulder of giants) ํ•™์Šตํ•˜๋Š” ๊ฒƒ ์ปดํ“จํ„ฐ ๋น„์ „์—์„œ ๋งํ•˜๋Š” ์ „์ดํ•™์Šต์€ ์ฃผ๋กœ ์‚ฌ์ „ํ•™์Šต ๋œ ๋ชจ๋ธ (pre-trained model) ์„ ์ด์šฉํ•˜๋Š” ๊ฒƒ ์‚ฌ์ „ํ•™์Šต ๋œ ๋ชจ๋ธ์ด๋ž€, ๋‚ด๊ฐ€ ํ’€๊ณ ์ž ํ•˜๋Š” ๋ฌธ์ œ์™€ ๋น„์Šทํ•˜๋ฉด์„œ ์‚ฌ์ด์ฆˆ๊ฐ€ ํฐ ๋ฐ์ดํ„ฐ๋กœ ์ด๋ฏธ ํ•™์Šต์ด ๋˜์–ด ์žˆ๋Š” ๋ชจ๋ธ ๊ทธ๋Ÿฐ ํฐ ๋ฐ์ดํ„ฐ๋กœ ๋ชจ๋ธ์„ ํ•™์Šต์‹œํ‚ค๋Š” ๊ฒƒ์€ ์˜ค๋žœ ์‹œ๊ฐ„๊ณผ ์—ฐ์‚ฐ๋Ÿ‰์ด ํ•„์š”ํ•˜๋ฏ€๋กœ, ๊ด€๋ก€์ ์œผ๋กœ๋Š” ์ด๋ฏธ ๊ณต๊ฐœ๋˜์–ด์žˆ๋Š” ๋ชจ๋ธ๋“ค์„ ๊ทธ์ € importํ•ด์„œ ์‚ฌ์šฉ ์ถœ์ฒ˜ : https://jeinalog.tistory.com/13
  • 18. Pretrained Model์‚ฌ์šฉ ์‹œ ๊ฐ€์žฅ ์ค‘์š”ํ•œ ํฌ์ธํŠธ
  • 19. Fine Tunning Scope ์ •์˜
  • 24. Data Optimization ๋ฐ์ดํ„ฐ ํ™•์ธ ๋ชจ๋ธ ๊ฒ€์ฆ ๋ฐฉ์•ˆ ์ˆ˜๋ฆฝ ambiguity uncertainty
  • 25. Neural Network Architecture ๊ฒ€ํ† 
  • 26. Neural Net Architecture์˜ "๊ณ„์ธต์ ์ธ ํŠน์ง•"์„ ๊ณ ๋ ค ๋…๋ฆฝ์ ์ธ ์ปดํฌ๋„ŒํŠธ์˜ ์ด์งˆ์  ์š”์†Œ ( ๋ถ„๋ฆฌ ํฌ์ธํŠธ๋ฅผ ์ฐพ์ž )
  • 27. ์ถœ์ฒ˜ : https://jeinalog.tistory.com/13 ์ด๋Ÿฌํ•œ ๋”ฅ๋Ÿฌ๋‹ ๋ชจ๋ธ์˜ ์ค‘์š”ํ•œ ์„ฑ๊ฒฉ ์ค‘ ํ•˜๋‚˜๋Š” ๋ฐ”๋กœ "๊ณ„์ธต์ ์ธ ํŠน์ง•"์„ "์Šค์Šค๋กœ" ํ•™์Šตํ•œ๋‹ค๋Š” ์  ๊ณ„์ธต์ ์ธ ํŠน์ง•์„ ํ•™์Šตํ•œ๋‹ค ๋Š” ๋ง์˜ ์˜๋ฏธ ๋ชจ๋ธ์˜ ์ฒซ ๋ฒˆ์งธ ์ธต์€ "์ผ๋ฐ˜์ ์ธ(general)" ํŠน์ง•์„ ์ถ”์ถœํ•˜๋„๋ก ํ•˜๋Š” ํ•™์Šต ๋ชจ๋ธ์˜ ๋งˆ์ง€๋ง‰ ์ธต์— ๊ฐ€๊นŒ์›Œ์งˆ์ˆ˜๋ก ํŠน์ • ๋ฐ์ดํ„ฐ์…‹ ๋˜๋Š” ํŠน์ • ๋ฌธ์ œ์—์„œ๋งŒ ๋‚˜ํƒ€๋‚  ์ˆ˜ ์žˆ๋Š” "๊ตฌ์ฒด์ ์ธ(specific)" ํŠน์ง•์„ ์ถ”์ถœํ•ด๋‚ด๋„๋ก ํ•˜๋Š” ๊ณ ๋„ํ™”๋œ ํ•™์Šต ๋”ฐ๋ผ์„œ ์•ž๋‹จ์— ์žˆ๋Š” ๊ณ„์ธต๋“ค์€ ๋‹ค๋ฅธ ๋ฐ์ดํ„ฐ์…‹์˜ ์ด๋ฏธ์ง€๋“ค์„ ํ•™์Šตํ•  ๋•Œ๋„ ์žฌ์‚ฌ์šฉ ๊ฐ€๋Šฅ ๋’ท๋‹จ์˜ ๊ณ„์ธต๋“ค์€ ์ƒˆ๋กœ์šด ๋ฌธ์ œ๋ฅผ ๋งž์ดํ•  ๋•Œ๋งˆ๋‹ค ์ƒˆ๋กœ ํ•™์Šต์ด ํ•„์š”ํ•จ
  • 28. Yosinski et al. (2014) ๋…ผ๋ฌธ์—์„œ๋Š” ์ด๋Ÿฌํ•œ ๋”ฅ๋Ÿฌ๋‹์˜ ํŠน์„ฑ์— ๋Œ€ํ•ด, '๋งŒ์•ฝ ์ฒซ ๋ฒˆ์งธ ๊ณ„์ธต์—์„œ ์ถ”์ถœ๋œ ํŠน์ง•์ด ์ผ๋ฐ˜์ ์ธ ํŠน์ง•์ด๊ณ  ๋งˆ์ง€๋ง‰ ์ธต์—์„œ ์ถ”์ถœ๋œ ํŠน์ง•์ด ๊ตฌ์ฒด์ ์ธ ํŠน์ง•์ด๋ผ๋ฉด, ๋„คํŠธ์›Œํฌ ๋‚ด์˜ ์–ด๋”˜๊ฐ€์— ์ผ๋ฐ˜์ ์ธ ์ˆ˜์ค€์—์„œ ๊ตฌ์ฒด์ ์ธ ์ˆ˜์ค€์œผ๋กœ ๋„˜์–ด๊ฐ€๋Š” ์ „ํ™˜์ ์ด ๋ถ„๋ช… ์กด์žฌํ•  ๊ฒƒ' ๊ฒฐ๋ก ์ ์œผ๋กœ, ์šฐ๋ฆฌ๊ฐ€ ์‚ดํŽด๋ณธ CNN ๋ชจ๋ธ์˜ Convolutional base ๋ถ€๋ถ„, ๊ทธ ์ค‘์—์„œ๋„ ํŠนํžˆ ๋‚ฎ์€ ๋ ˆ๋ฒจ์˜ ๊ณ„์ธต(input์— ๊ฐ€๊นŒ์šด ๊ณ„์ธต)์ผ์ˆ˜๋ก ์ผ๋ฐ˜์ ์ธ ํŠน์ง•์„ ์ถ”์ถœ ๊ทธ์™€ ๋ฐ˜๋Œ€๋กœ Convolutional base ์˜ ๋†’์€ ๋ ˆ๋ฒจ์˜ ๊ณ„์ธต(output์— ๊ฐ€๊นŒ์šด ๊ณ„์ธต)๊ณผ Classifier ๋ถ€๋ถ„์€ ๋ณด๋‹ค ๊ตฌ์ฒด์ ์ด๊ณ  ํŠน์œ ํ•œ ํŠน์ง•๋“ค์„ ์ถ”์ถœ. ์ถœ์ฒ˜ : https://jeinalog.tistory.com/13
  • 29. ์‚ฌ์ „ ํ•™์Šต๋œ ๋ชจ๋ธ์„ ์ด์ œ ๋‚˜์˜ ํ”„๋กœ์ ํŠธ์— ๋งž๊ฒŒ ์žฌ์ •์˜ํ•œ๋‹ค๋ฉด, ๋จผ์ € ์›๋ž˜ ๋ชจ๋ธ์— ์žˆ๋˜ classifier๋ฅผ ์—†์• ๋Š” ๊ฒƒ์œผ๋กœ ์‹œ์ž‘ํ•ฉ๋‹ˆ๋‹ค. ์›๋ž˜์˜ classifier๋Š” ์‚ญ์ œํ•˜๊ณ , ๋‚ด ๋ชฉ์ ์— ๋งž๋Š” ์ƒˆ๋กœ์šด classifier๋ฅผ ์ถ”๊ฐ€ํ•ฉ๋‹ˆ๋‹ค. ๊ทธ ํ›„ ๋งˆ์ง€๋ง‰์œผ๋กœ๋Š” ์ƒˆ๋กญ๊ฒŒ ๋งŒ๋“ค์–ด์ง„ ๋‚˜์˜ ๋ชจ๋ธ์„ ๋‹ค์Œ ์„ธ ๊ฐ€์ง€ ์ „๋žต ์ค‘ ํ•œ ๊ฐ€์ง€ ๋ฐฉ๋ฒ•์„ ์ด์šฉํ•ด ํŒŒ์ธํŠœ๋‹(fine-tune)์„ ์ง„ํ–‰ ์ถœ์ฒ˜ : https://jeinalog.tistory.com/13
  • 30. ์ „๋žต๋ณ„ ํŠน์ง• ( Fine Tunning Scope ์ •์˜)
  • 31. # ์ „๋žต 1 : ์ „์ฒด ๋ชจ๋ธ์„ ์ƒˆ๋กœ ํ•™์Šต์‹œํ‚ค๊ธฐ ์ด ๋ฐฉ๋ฒ•์€ ์‚ฌ์ „ํ•™์Šต ๋ชจ๋ธ์˜ ๊ตฌ์กฐ๋งŒ ์‚ฌ์šฉํ•˜๋ฉด์„œ, ๋‚ด ๋ฐ์ดํ„ฐ์…‹์— ๋งž๊ฒŒ ์ „๋ถ€ ์ƒˆ๋กœ ํ•™์Šต์‹œํ‚ค๋Š” ๋ฐฉ๋ฒ• ๋ชจ๋ธ์„ ๋ฐ‘๋ฐ”๋‹ฅ์—์„œ๋ถ€ํ„ฐ ์ƒˆ๋กœ ํ•™์Šต์‹œํ‚ค๋Š” ๊ฒƒ์ด๋ฏ€๋กœ, ํฐ ์‚ฌ์ด์ฆˆ์˜ ๋ฐ์ดํ„ฐ์…‹์ด ํ•„์š” (์ปดํ“จํŒ… ์—ฐ์‚ฐ ๋Šฅ๋ ฅ์„ ์œ„ํ•œ ๋งŽ์€ Infra Resource ํ•„์š”) ์ถœ์ฒ˜ : https://jeinalog.tistory.com/13
  • 32. # ์ „๋žต 2 : Convolutional base์˜ ์ผ๋ถ€๋ถ„์€ ๊ณ ์ •์‹œํ‚จ ์ƒํƒœ๋กœ, ๋‚˜๋จธ์ง€ ๊ณ„์ธต๊ณผ classifier๋ฅผ ์ƒˆ๋กœ ํ•™์Šต์‹œํ‚ค๊ธฐ ์•ž์„œ ์–ธ๊ธ‰ํ–ˆ๋“ฏ์ด, ๋‚ฎ์€ ๋ ˆ๋ฒจ์˜ ๊ณ„์ธต์€ ์ผ๋ฐ˜์ ์ธ ํŠน์ง•(์–ด๋–ค ๋ฌธ์ œ๋ฅผ ํ‘ธ๋Š๋ƒ์— ์ƒ๊ด€ ์—†์ด ๋…๋ฆฝ์ ์ธ ํŠน์ง•)์„ ์ถ”์ถœํ•˜๊ณ , ๋†’์€ ๋ ˆ๋ฒจ์˜ ๊ณ„์ธต์€ ๊ตฌ์ฒด์ ์ด๊ณ  ํŠน์œ ํ•œ ํŠน์ง•(๋ฌธ์ œ์— ๋”ฐ๋ผ ๋‹ฌ๋ผ์ง€๋Š” ํŠน์ง•)์„ ์ถ”์ถœํ•ฉ๋‹ˆ๋‹ค. ์ด๋Ÿฐ ํŠน์„ฑ์„ ์ด์šฉํ•ด์„œ, ์šฐ๋ฆฌ๋Š” ์‹ ๊ฒฝ๋ง์˜ ํŒŒ๋ผ๋ฏธํ„ฐ ์ค‘ ์–ด๋Š ์ •๋„๊นŒ์ง€๋ฅผ ์žฌํ•™์Šต์‹œํ‚ฌ์ง€๋ฅผ ์ •ํ•  ์ˆ˜ ์žˆ์Œ ์ฃผ๋กœ, ๋งŒ์•ฝ ๋ฐ์ดํ„ฐ์…‹์ด ์ž‘๊ณ  ๋ชจ๋ธ์˜ ํŒŒ๋ผ๋ฏธํ„ฐ๊ฐ€ ๋งŽ๋‹ค๋ฉด, ์˜ค๋ฒ„ํ”ผํŒ…์ด ๋  ์œ„ํ—˜์ด ์žˆ์œผ๋ฏ€๋กœ ๋” ๋งŽ์€ ๊ณ„์ธต์„ ๊ฑด๋“ค์ง€ ์•Š๊ณ  ๊ทธ๋Œ€๋กœ ๋‘ . ๋ฐ˜๋ฉด์—, ๋ฐ์ดํ„ฐ์…‹์ด ํฌ๊ณ  ๊ทธ์— ๋น„ํ•ด ๋ชจ๋ธ์ด ์ž‘์•„์„œ ํŒŒ๋ผ๋ฏธํ„ฐ๊ฐ€ ์ ๋‹ค๋ฉด, ์˜ค๋ฒ„ํ”ผํŒ…์— ๋Œ€ํ•œ ๊ฑฑ์ •์„ ํ•  ํ•„์š”๊ฐ€ ์—†์œผ๋ฏ€๋กœ ๋” ๋งŽ์€ ๊ณ„์ธต์„ ํ•™์Šต์‹œ์ผœ์„œ ๋‚ด ํ”„๋กœ์ ํŠธ์— ๋” ์ ํ•ฉํ•œ ๋ชจ๋ธ๋กœ ๋ฐœ์ „ ์ถœ์ฒ˜ : https://jeinalog.tistory.com/13
  • 33. # ์ „๋žต 3 : Convloutional base๋Š” ๊ณ ์ •์‹œํ‚ค๊ณ , classifier๋งŒ ์ƒˆ๋กœ ํ•™์Šต์‹œํ‚ค๊ธฐ ์ด ๊ฒฝ์šฐ๋Š” ๋ณด๋‹ค ๊ทน๋‹จ์ ์ธ ์ƒํ™ฉ์ผ ๋•Œ ์ƒ๊ฐํ•  ์ˆ˜ ์žˆ๋Š” ์ผ€์ด์Šค convolutional base๋Š” ๊ฑด๋“ค์ง€ ์•Š๊ณ  ๊ทธ๋Œ€๋กœ ๋‘๋ฉด์„œ ํŠน์ง• ์ถ”์ถœ ๋ฉ”์ปค๋‹ˆ์ฆ˜์œผ๋กœ์จ ํ™œ์šฉํ•˜๊ณ , classifier๋งŒ ์žฌํ•™์Šต์‹œํ‚ค๋Š” ๋ฐฉ๋ฒ• ์ด ๋ฐฉ๋ฒ•์€ ์ปดํ“จํŒ… ์—ฐ์‚ฐ ๋Šฅ๋ ฅ์ด ๋ถ€์กฑํ•˜๊ฑฐ๋‚˜ ๋ฐ์ดํ„ฐ์…‹์ด ๋„ˆ๋ฌด ์ž‘์„๋•Œ, ๊ทธ๋ฆฌ๊ณ /๋˜๋Š” ํ’€๊ณ ์ž ํ•˜๋Š” ๋ฌธ์ œ๊ฐ€ ์‚ฌ์ „ํ•™์Šต๋ชจ๋ธ์ด ์ด๋ฏธ ํ•™์Šตํ•œ ๋ฐ์ดํ„ฐ์…‹๊ณผ ๋งค์šฐ ๋น„์Šทํ•  ๋•Œ ๊ฒ€ํ†  ์ถœ์ฒ˜ : https://jeinalog.tistory.com/13
  • 35. Model ์ƒ์„ฑ ๋ฐ์ดํ„ฐ ๋„๋ฉ”์ธ ํ™•์ธ
  • 36. CNN ๋ฒ ์ด์Šค์˜ ์‚ฌ์ „ํ•™์Šต ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•  ๋•Œ์—๋Š”, ์ด์ „์— ํ•™์Šตํ•œ ๋‚ด์šฉ๋“ค์„ ๋ชจ๋‘ ์žŠ์–ด๋ฒ„๋ฆด ์œ„ํ—˜์ด ์žˆ๊ธฐ ๋•Œ๋ฌธ์— ์ž‘์€ learning rate๋ฅผ ์‚ฌ์šฉ ์‚ฌ์ „ํ•™์Šต ๋ชจ๋ธ์ด ์ž˜ ํ•™์Šต๋˜์—ˆ๋‹ค๋Š” ๊ฐ€์ •ํ•˜์—, ์ž‘์€ learning rate์œผ๋กœ ํ•™์Šต์„ ์‹œํ‚จ๋‹ค๋ฉด CNN ๋ชจ๋ธ์˜ ํŒŒ๋ผ๋ฏธํ„ฐ๋“ค์„ ๋„ˆ๋ฌด ๋น ๋ฅด๊ฒŒ, ํ˜น์€ ๋„ˆ๋ฌด ๋งŽ์ด ์™œ๊ณก์‹œํ‚ค์ง€ ์•Š๊ณ  ์›๋ž˜ ํ•™์Šต๋˜์–ด์žˆ๋˜ ์ง€์‹์„ ์ž˜ ๋ณด์กดํ•˜๋ฉด์„œ ์ถ”๊ฐ€๋กœ ํ•™์Šต์„ ํ•ด์•ผ ํ•จ ์ž‘์€ Learning rate๋ฅผ ์‚ฌ์šฉํ•œ๋‹ค๋Š” ๊ฒƒ์€ Pretrained Model์ด ์ž˜ ํ•™์Šต๋˜์—ˆ๋‹ค๋Š” ๊ฐ€์ •ํ•˜์— ์„ฑ๋ฆฝ๋จ Pretrained Model์„ ์‚ฌ์šฉํ•˜์ง€ ์•Š๋Š” ์ผ๋ฐ˜์ ์ธ Trainingํ™˜๊ฒฝ์—์„œ ์ž‘์€ Learning rate๋Š” ์ผ๋‹จ ์ˆ˜๋ ดํ•˜๋Š” ์†๋„๊ฐ€ ๋„ˆ๋ฌด ๋Š๋ฆฌ๊ณ , local minimum์— ๋น ์งˆ ํ™•๋ฅ ์ด ์ฆ๊ฐ€ ์ถœ์ฒ˜ : https://jeinalog.tistory.com/13
  • 37. [์ฐธ๊ณ ] Pretrained Model์„ ์‚ฌ์šฉํ•˜์ง€ ์•Š๋Š” ์ผ๋ฐ˜์ ์ธ ํ•™์Šต์˜ ๊ฒฝ์šฐ
  • 38. Learning rate๋ฅผ ์„ค์ •ํ•  ๋•Œ ์ฃผ์˜ํ•  ์  1) Learning rate๊ฐ€ ๋„ˆ๋ฌด ํด ๋•Œ, ์ตœ์ ์˜ ๊ฐ’์œผ๋กœ ์ˆ˜๋ ดํ•˜์ง€ ์•Š๊ณ , ๋ฐœ์‚ฐํ•ด๋ฒ„๋ฆฌ๋Š” ๊ฒฝ์šฐ๊ฐ€ ๋ฐœ์ƒ โ†’ OverShooting 2) Learning rate๊ฐ€ ๋„ˆ๋ฌด ์ž‘์„ ๋•Œ, ์ผ๋‹จ ์ˆ˜๋ ดํ•˜๋Š” ์†๋„๊ฐ€ ๋„ˆ๋ฌด ๋Š๋ฆฌ๊ณ , local minimum์— ๋น ์งˆ ํ™•๋ฅ ์ด ์ฆ๊ฐ€ Learning rate๋ฅผ ์ž˜ ์ฐพ๋Š” ๋ฐฉ๋ฒ•์€ ๋”ฐ๋กœ ์—†์ง€๋งŒ, Learning rate๋ฅผ ์ž˜ ์ฐพ๊ธฐ ์œ„ํ•ด ๋„์™€์ค„ ์ˆ˜ ์žˆ๋„๋ก ๋ฐ์ดํ„ฐ๋“ค์„ ์ „์ฒ˜๋ฆฌ(preprocessing)ํ•˜๋Š” ๊ณผ์ •์ด ํ•„์š” ๊ฐ๊ฐ์˜ ๋ฐ์ดํ„ฐ๋“ค์˜ ๊ฐ’์ด ๋„ˆ๋ฌด ๋งŽ์ด ์ฐจ์ด๋‚  ๊ฒฝ์šฐ Learning rate๋ฅผ ์ž˜ ๋ชป์„ค์ •ํ–ˆ์„ ๋•Œ์™€ ๋น„์Šทํ•œ ํ˜„์ƒ์ด ๋ฐœ์ƒ ์ด๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด feature๋“ค์„ Scaling ํ•œ๋‹ค. Feature Scaling์—๋Š” 2๊ฐ€์ง€ ๋ฐฉ๋ฒ•์ด ์žˆ๋Š”๋ฐ, 1) Normalization - ํ‘œ๋ณธ๋“ค์˜ ๊ฐ’์„ ๋ชจ๋‘ 0 ~ 1 ์‚ฌ์ด์˜ ๊ฐ’์œผ๋กœ ๋ณ€ํ™˜ ํ•˜๋Š” ๋ฐฉ๋ฒ•. 2) Standardization - ํ‘œ๋ณธ๋“ค์˜ ๊ฐ’์„ ๊ฐ€์šฐ์‹œ์•ˆ ๋ถ„ํฌ์˜ ๊ฐ’์œผ๋กœ ๋ณ€ํ™˜ ํ•˜๋Š” ๋ฐฉ๋ฒ•.
  • 39. Resnet๊ณผ VGG16์—์„œ Transfer Learning์„ ์‚ฌ์šฉํ•˜๋Š” ์ด์œ 
  • 45. ๋ชจ๋ธ์„ ์„œ๋น„์Šค์— ๋งž์ถ”์–ด ๊ตฌํ˜„ํ•˜๋ ค๋ฉด ํ˜„์‹ค ์„ธ๊ณ„์—์„  Domain๋ณ„ Data Dependency ํ™•์ธ ํ•„์š”
  • 46. # ์ „๋žต 1 : ์ „์ฒด ๋ชจ๋ธ์„ ์ƒˆ๋กœ ํ•™์Šต์‹œํ‚ค๊ธฐ ์ด ๋ฐฉ๋ฒ•์€ ResNet์˜ ๊ตฌ์กฐ๋งŒ ์‚ฌ์šฉํ•˜๋ฉด์„œ, ์‹ค์ œ ๋ฐ์ดํ„ฐ์…‹์— ๋งž๊ฒŒ ์ „๋ถ€ ์ƒˆ๋กœ ํ•™์Šต์‹œํ‚ค๋Š” ๋ฐฉ๋ฒ•์ž…๋‹ˆ๋‹ค. ๋ชจ๋ธ์„ ๋ฐ‘๋ฐ”๋‹ฅ์—์„œ๋ถ€ํ„ฐ ์ƒˆ๋กœ ํ•™์Šต์‹œํ‚ค๋Š” ๊ฒƒ์ด๋ฏ€๋กœ, ํฐ ์‚ฌ์ด์ฆˆ์˜ ๋ฐ์ดํ„ฐ์…‹์ด ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค. (๊ทธ๋ฆฌ๊ณ , ์ข‹์€ ์ปดํ“จํŒ… ์—ฐ์‚ฐ ๋Šฅ๋ ฅ๋„์š”!) ์ถœ์ฒ˜ : https://jeinalog.tistory.com/13
  • 47. ๊ทธ๋ž˜์„œ Annotation Scope์ด ์ค‘์š”ํ•˜๋‹ค.
  • 48. ์ƒ์„ฑ๋œ ์ด๋ฏธ์ง€ ๋ถ„๋ฅ˜ ๋ชจ๋ธ์„ ํ†ตํ•œ ์ž๋™ Batch Annotation ์ˆ˜ํ–‰๋„ ๋ณ‘ํ–‰ํ•œ๋‹ค.
  • 49. ์„ฑ๋Šฅํ–ฅ์ƒ์„ ์œ„ํ•ด Joint Training์„ ํ†ตํ•ด ๋‹ค์ˆ˜์˜ ์ด๋ฏธ์ง€ ๋ชจ๋ธ์˜ ๊ฒฐํ•ฉ์„ ์ˆ˜ํ–‰ํ•œ๋‹ค.
  • 50. ๋‹จ๊ณ„0. ๋ฌด์ž‘์œ„ ์ดˆ๊ธฐํ™” ์ž…๋ ฅ ์ด๋ฏธ์ง€ ๋ชจ๋ธ ๊ฐ€์ค‘์น˜ Forward ์˜ˆ์ธก๊ฐ’ Ground Truth (์ •๋‹ต) Backward ๋‹จ๊ณ„1. ์ž…๋ ฅ์— ๋Œ€ํ•œ ์˜ˆ์ธก ๋‹จ๊ณ„2. ์†์‹ค ํ•จ์ˆ˜ ๊ณ„์‚ฐ ๋‹จ๊ณ„3. ์ตœ์ ํ™” ์•Œ๊ณ ๋ฆฌ์ฆ˜ ์†์‹ค ์—…๋ฐ์ดํŠธ Loss = Cost = Error
  • 51. ๋‹จ๊ณ„2. ์†์‹ค ํ•จ์ˆ˜ ๊ณ„์‚ฐ ( Softmax ์†์‹ค ํ•จ์ˆ˜ ) Loss = Cost = Error ๋งˆ์ง€๋ง‰ ์ถœ๋ ฅ์ธต์€ ์ „ ๋‹จ๊ณ„์—์„œ ์ถ”์ถœ๋œ ํŠน์ง• ๋ฒกํ„ฐ๋ฅผ N๊ฐœ์˜ ๋ฒ”์ฃผ๋กœ ๋ถ„๋ฅ˜ํ•˜๊ธฐ ์œ„ํ•ด ๋ฐฐ์น˜๋œ fully-connected layer์™€ softmaxํ•จ์ˆ˜๋กœ ๊ตฌ์„ฑ๋จ ์ดํ›„ cross entropy๊ฐ€ softmaxํ•จ์ˆ˜๋ฅผ ํ†ตํ•ด ๋‚˜์˜จ ํ™•๋ฅ  ๋ถ„ํฌ์™€ ์ •๋‹ต ๋ถ„ํฌ ์‚ฌ์ด์˜ ์˜ค์ฐจ๋ฅผ ๊ณ„์‚ฐ Softmax ํ™•๋ฅ ๊ฐ’์„ ์ด์šฉํ•œ๋‹ค๋Š” ์ ์—์„œ ์ด ์†์‹ค ํ•จ์ˆ˜๋ฅผ Softmax ์†์‹ค ํ•จ์ˆ˜๋ผ๊ณ  ๋ถ€๋ฅธ๋‹ค. ๋ถ„๋ฅ˜์— ์‚ฌ์šฉ๋˜๋Š” ํ™œ์„ฑํ™” ํ•จ์ˆ˜ โ†’ Softmaxํ•จ์ˆ˜ : ๋ชจ๋“  ์ž…๋ ฅ ์‹ ํ˜ธ๋กœ๋ถ€ํ„ฐ ์˜ํ–ฅ์„ ๋ฐ›์Œ Softmaxํ•จ์ˆ˜ ์ฃผ์˜์  - ์ง€์ˆ˜ ํ•จ์ˆ˜๋กœ ๋˜์–ด์žˆ์–ด ์˜ค๋ฒ„ํ”Œ๋กœ (๋ฌดํ•œ๋Œ€ ๊ฐ’ ๋ฐœ์ƒ) ๋ฌธ์ œ๊ฐ€ ์ƒ๊ธธ ์ˆ˜ ์žˆ์Œ
  • 52. 1๊ฐœ์˜ ๋ชจ๋ธ์—์„œ n๊ฐœ์˜ output์„ ์ถœ๋ ฅํ•œ๋‹ค๋ฉด ์ด์— ๋งž์ถฐ n๊ฐœ์˜ ์„œ๋กœ ๋‹ค๋ฅธ loss๋ฅผ ๋ฝ‘์•„ ๋‚ผ ์ˆ˜ ์žˆ๋‹ค. ์—ฌ๋Ÿฌ๊ฐœ์˜ Loss๋“ค์„ ์–ด๋–ป๊ฒŒ ์ฒ˜๋ฆฌํ•˜๋Š๋ƒ์— ๋”ฐ๋ผ Joint Training๊ณผ Alternate Training๋ฐฉ์‹์œผ๋กœ ๋‚˜๋‰  ์ˆ˜ ์žˆ๋‹ค. Joint Training์ด๋ž€ ์—ฌ๋Ÿฌ ๊ฐœ์˜ loss๋“ค์„ ํ•˜๋‚˜์˜ ๊ฐ’์œผ๋กœ ๋”ํ•ด์„œ ์ตœ์ข… Loss๋กœ ์‚ฌ์šฉํ•˜๋Š” ํ›ˆ๋ จ ๋ฐฉ์‹ Neural Net Architecture ๋งˆ์ง€๋ง‰ ์ธต์˜ output์„ channel๋‹จ์œ„๋กœ ์ž˜๋ผ์„œ ์„œ๋กœ ๋‹ค๋ฅธ ์—ญํ• ์„ ๋ถ€์—ฌํ•œ ๊ตฌ์กฐ๋กœ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์Œ ๊ฐ ์ฑ„๋„์ด ์„œ๋กœ ๋‹ค๋ฅธ ์—ญํ• ์„ ์ˆ˜ํ–‰ํ•˜๊ฒŒ ํ•˜๋ ค๋ฉด ์ด์— ๋งž๊ฒŒ loss function์„ ์ •์˜ํ•ด์ค˜์•ผ ํ•˜๊ธฐ ๋•Œ๋ฌธ์— Channel๋‹จ์œ„ (์„œ๋กœ ๋‹ค๋ฅธ ์—ญํ• ) ๋‹จ์œ„์˜ Loss Function ๋ถ€์—ฌ ํ•„์š” ํ•ต์‹ฌ์€ Total Loss = loss1 + loss2 + loss3 + loss4 + loss5 Joint Training์€ ์—ฌ๋Ÿฌ ๊ฐœ์˜ Loss๋“ค์„ ๋”ํ•ด์„œ ๋ชจ๋“  task๋“ค์„ ํ•œ๋ฒˆ์— ํ•™์Šตํ•˜๋Š” ๋ฐฉ์‹ ๋‹ค์ˆ˜์˜ ์ด๋ฏธ์ง€ ๋ชจ๋ธ์˜ ๊ฒฐํ•ฉ ์‹œ์—๋„ ํ™œ์šฉ ๊ฐ€๋Šฅํ•˜๋‹ค. ์œ„์™€ ๊ฐ™์ด Joint Training์„ ์‚ฌ์šฉํ•˜๋Š” ์ด์œ ๋Š” ๋‹ค์ˆ˜์˜ ๋ชจ๋ธ๋กœ ํ•˜๋‚˜์˜ Task์ฒ˜๋ฆฌ ์‹œ์— ์ตœ์ข… ๋ชจ๋ธ ํ‰๊ฐ€ ๋ฐ ์„ฑ๋Šฅ ์ธก์ • ๊ด€๋ จํ•˜์—ฌ ์žฌ์ •์˜๊ฐ€ ํ•„์š”ํ•˜๋‹ค.
  • 54. OCR์˜ ๊ฒฝ์šฐ ์ด๋ฏธ์ง€ ๋ชจ๋ธ๊ณผ ์–ธ์–ด ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•œ๋‹ค. ์ถœ์ฒ˜ : https://arxiv.org/pdf/1908.05054.pdf
  • 57. Bert ๋ชจ๋ธ์„ ํ•™์Šตํ•œ๋‹ค๋Š” ๊ฑด ์•„๋งˆ์ถ”์–ด
  • 58. Bert Weight Load๋ฅผ ํ™œ์šฉํ•˜์ž
  • 59. Pretrained Bert Model ํ™œ์šฉ
  • 63. Q & A