[DSC Adria 23]Danko_Nikolic - Guided Transfer Learning.pptx

•Download as PPTX, PDF•

0 likes•5 views

DataScienceConferenc1

...

Data & Analytics

Guided Transfer Learning
Danko Nikolić
evocenta, Robots Go Mental

The problem
The great AI solutions today are gigantic.
There is a huge demand for effective algorithms: Lean, fast
learning.
We conquer AI through:
- increasing model sizes,
- applying massive data,
- using excessive compute.

Strong Machine Learning
Weak ML Strong ML
Data Large amounts Small amounts
Parameters Ever more Get away with small
amounts
Compute Super computers Edge devices

Strong
Machine
Learning
Smart learner
Weak
Machine
Learning
Dumb learner

Transfer learning
starts from a point
nearby a minimum. It
reaches it quickly.
Standard deep learning begins training from a distant
point – but uses a large dataset to overcome bumps and
to reach high performance i.e., a deep error minimum.
Classical transfer learning

Transfer learning: Pre-learning weights
Much better: Smart learning algorithms
The network
Learning mechanisms

Learning wisdom:
GTL is a domain expert and
knows in which direction to go
(inductive bias).
GTL guides transfer learning to
overcome a bump and reach a
deeper minimum – a better
solution.
The ordinary transfer learning does
not necessarily find the best solution.
What does GTL bring?

𝑤1,1 ⋯ 𝑤1,𝑛
⋮ ⋱ ⋮
𝑤𝑚,1 ⋯ 𝑤𝑚,𝑛 𝑡+1
=
𝑤1,1 ⋯ 𝑤1,𝑛
⋮ ⋱ ⋮
𝑤𝑚,1 ⋯ 𝑤𝑚,𝑛 𝑡
+
∆𝑤1,1 ⋯ ∆𝑤1,𝑛
⋮ ⋱ ⋮
∆𝑤𝑚,1 ⋯ ∆𝑤𝑚,𝑛
𝑤1,1 ⋯ 𝑤1,𝑛
⋮ ⋱ ⋮
𝑤𝑚,1 ⋯ 𝑤𝑚,𝑛 𝑡+1
=
𝑤1,1 ⋯ 𝑤1,𝑛
⋮ ⋱ ⋮
𝑤𝑚,1 ⋯ 𝑤𝑚,𝑛 𝑡
+
∆𝑤1,1 ⋯ ∆𝑤1,𝑛
⋮ ⋱ ⋮
∆𝑤𝑚,1 ⋯ ∆𝑤𝑚,𝑛
*
𝑔1,1 ⋯ 𝑔1,𝑛
⋮ ⋱ ⋮
𝑔𝑚,1 ⋯ 𝑔𝑚,𝑛
𝑔 𝜖{0 . . 1}
where
1
2
m
1
2
n
𝑤1,1
𝑤𝑚,𝑛

GTL creates inductive biases.
Your standard multi-layer
deep learning neural
network
GTL lays a “expertise
blanked” that imposes
inductive biases over the
network
(Inductive bias ≡ prior ≡ assumption)

Step one
Becoming an expert for a domain.

G = (M - min(M)) / (max(M) - min(M)),
mW = 1/n Σ (wB – wn).
2

…
100 ms 100 ms
A more effective
model
Model adjusts to
new visual
conditions
GTL-assisted

Weak ML has reached its limits.
Strong ML is the future.
We want to be at the forefront of AI.

resource TL GTL Combination
Data Full data set Light data set Full pre-train data
set
Compute Full effort pre-
training
Light effort 20% increase in
compute
Parameters No additional
parameters
2x parameters 2x parameters

Similar to [DSC Adria 23]Danko_Nikolic - Guided Transfer Learning.pptx

DEF CON 24 - Clarence Chio - machine duping 101Felipe Prado

[243] turning data into valueNAVER D2

[系列活動] 手把手的深度學實務台灣資料科學年會

AI and Deep Learning Subrat Panda, PhD

Learn to Build an App to Find Similar Images using Deep Learning- Piotr TeterwakPyData

Baisc Deep Learning HandsOnSean Yu

cs330_2021_lifelong_learning.pdfKuan-Tsae Huang

Deep Learning And Business Models (VNITC 2015-09-13)Ha Phuong

NullEraylsonGaldino1

Neural Networks by Priyanka KasturePriyanka Kasture

Deep learning from scratch Eran Shlomo

Scalable Learning in Computer Visionbutest

Deep learning tutorial 9/2019Amr Rashed

Deep Learning TutorialAmr Rashed

Getting started with Machine LearningGaurav Bhalotia

08 neural networksankit_ppt

Deep learning introductiongiangbui0816

Deep learningMohamed Loey

Muhammad Usman Akhtar | Ph.D Scholar | Wuhan University | School of Co...Wuhan University

Entity embeddings for categorical dataPaul Skeie

Similar to [DSC Adria 23]Danko_Nikolic - Guided Transfer Learning.pptx (20)

DEF CON 24 - Clarence Chio - machine duping 101

[243] turning data into value

[系列活動] 手把手的深度學實務

AI and Deep Learning

Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak

Baisc Deep Learning HandsOn

cs330_2021_lifelong_learning.pdf

Deep Learning And Business Models (VNITC 2015-09-13)

Null

Neural Networks by Priyanka Kasture

Deep learning from scratch

Scalable Learning in Computer Vision

Deep learning tutorial 9/2019

Deep Learning Tutorial

Getting started with Machine Learning

08 neural networks

Deep learning introduction

Deep learning

Muhammad Usman Akhtar | Ph.D Scholar | Wuhan University | School of Co...

Entity embeddings for categorical data

Recently uploaded

CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Onlineanilsa9823

Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila

Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls

CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion

Zuja dropshipping via API with DroFx.pptxolyaivanovalion

Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Riyadh +966572737505 get cytotec

Mature dropshipping via API with DroFx.pptxolyaivanovalion

Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh9953056974 Low Rate Call Girls In Saket, Delhi NCR

Data-Analysis for Chicago Crime Data 2023ymrp368

Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083

Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083

Edukaciniai dropshipping via API with DroFxolyaivanovalion

Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls

Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083

Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten

Smarteg dropshipping via API with DroFx.pptxolyaivanovalion

Sampling (random) method and Non random.pptDr. Soumendra Kumar Patra

ALSO dropshipping via API with DroFx.pptxolyaivanovalion

Carero dropshipping via API with DroFx.pptxolyaivanovalion

Recently uploaded (20)

CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online

Accredited-Transport-Cooperatives-Jan-2021-Web.pdf

Best VIP Call Girls Noida Sector 39 Call Me: 8448380779

CebaBaby dropshipping via API with DroFX.pptx

Zuja dropshipping via API with DroFx.pptx

Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec

Mature dropshipping via API with DroFx.pptx

Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh

Data-Analysis for Chicago Crime Data 2023

Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...

Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call

Edukaciniai dropshipping via API with DroFx

Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...

Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call

Log Analysis using OSSEC sasoasasasas.pptx

Smarteg dropshipping via API with DroFx.pptx

Sampling (random) method and Non random.ppt

ALSO dropshipping via API with DroFx.pptx

Carero dropshipping via API with DroFx.pptx

[DSC Adria 23]Danko_Nikolic - Guided Transfer Learning.pptx

1. Guided Transfer Learning Danko Nikolić evocenta, Robots Go Mental

2. The problem The great AI solutions today are gigantic. There is a huge demand for effective algorithms: Lean, fast learning. We conquer AI through: - increasing model sizes, - applying massive data, - using excessive compute.

3. Strong Machine Learning Weak ML Strong ML Data Large amounts Small amounts Parameters Ever more Get away with small amounts Compute Super computers Edge devices

4. Strong Machine Learning Smart learner Weak Machine Learning Dumb learner

5. Transfer learning starts from a point nearby a minimum. It reaches it quickly. Standard deep learning begins training from a distant point – but uses a large dataset to overcome bumps and to reach high performance i.e., a deep error minimum. Classical transfer learning

7. Transfer learning: Pre-learning weights Much better: Smart learning algorithms The network Learning mechanisms

8. Guided Transfer Learning

9. Learning wisdom: GTL is a domain expert and knows in which direction to go (inductive bias). GTL guides transfer learning to overcome a bump and reach a deeper minimum – a better solution. The ordinary transfer learning does not necessarily find the best solution. What does GTL bring?

10.

11. 𝑤1,1 ⋯ 𝑤1,𝑛 ⋮ ⋱ ⋮ 𝑤𝑚,1 ⋯ 𝑤𝑚,𝑛 𝑡+1 = 𝑤1,1 ⋯ 𝑤1,𝑛 ⋮ ⋱ ⋮ 𝑤𝑚,1 ⋯ 𝑤𝑚,𝑛 𝑡 + ∆𝑤1,1 ⋯ ∆𝑤1,𝑛 ⋮ ⋱ ⋮ ∆𝑤𝑚,1 ⋯ ∆𝑤𝑚,𝑛 𝑤1,1 ⋯ 𝑤1,𝑛 ⋮ ⋱ ⋮ 𝑤𝑚,1 ⋯ 𝑤𝑚,𝑛 𝑡+1 = 𝑤1,1 ⋯ 𝑤1,𝑛 ⋮ ⋱ ⋮ 𝑤𝑚,1 ⋯ 𝑤𝑚,𝑛 𝑡 + ∆𝑤1,1 ⋯ ∆𝑤1,𝑛 ⋮ ⋱ ⋮ ∆𝑤𝑚,1 ⋯ ∆𝑤𝑚,𝑛 * 𝑔1,1 ⋯ 𝑔1,𝑛 ⋮ ⋱ ⋮ 𝑔𝑚,1 ⋯ 𝑔𝑚,𝑛 𝑔 𝜖{0 . . 1} where 1 2 m 1 2 n 𝑤1,1 𝑤𝑚,𝑛

12.

13.

14. GTL creates inductive biases. Your standard multi-layer deep learning neural network GTL lays a “expertise blanked” that imposes inductive biases over the network (Inductive bias ≡ prior ≡ assumption)

15. Step one Becoming an expert for a domain.

16. G = (M - min(M)) / (max(M) - min(M)), mW = 1/n Σ (wB – wn). 2

17.

18.

19.

20.

21.

22. … 100 ms 100 ms A more effective model Model adjusts to new visual conditions GTL-assisted

23. … 100 ms 100 ms

24. Weak ML has reached its limits. Strong ML is the future. We want to be at the forefront of AI.

25. Thank you

26. resource TL GTL Combination Data Full data set Light data set Full pre-train data set Compute Full effort pre- training Light effort 20% increase in compute Parameters No additional parameters 2x parameters 2x parameters

Editor's Notes

RGMs general approach

[DSC Adria 23]Danko_Nikolic - Guided Transfer Learning.pptx

Recommended

Recommended

More Related Content

Similar to [DSC Adria 23]Danko_Nikolic - Guided Transfer Learning.pptx

Similar to [DSC Adria 23]Danko_Nikolic - Guided Transfer Learning.pptx (20)

More from DataScienceConferenc1

More from DataScienceConferenc1 (20)

Recently uploaded

Recently uploaded (20)

[DSC Adria 23]Danko_Nikolic - Guided Transfer Learning.pptx

Editor's Notes