Deep Boosting
ICMLでも読むかい? (ICML paper reading session), 2014/07/23
太田 満久
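In a nutshell
• The paper proposes DeepBoost, a new ensemble learning algorithm
• Complex learners, such as deep decision trees, can be used as the weak learners
• Accuracy improves compared with AdaBoost and Logistic Regression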
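Review
What is ensemble learning?
• A method that combines weak learners (learners that predict more accurately than a predictor that outputs answers at random) into a single high-accuracy learner
Types of ensemble learning
• Bagging
• Random forest
• Boosting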
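Bagging
• A method that trains classifiers on repeated bootstrap samples and combines them into a classifier with higher accuracy.
Training data → sampling with replacement (×3) → weak learners (×3) → strong learner
Classification: majority vote
Regression: average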
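The bagging flow above can be sketched in a few lines of standard-library Python. This is only an illustration under stated assumptions: the weak learner is a hand-written decision stump, and the names `fit_stump`, `bagging_fit`, and `bagging_predict` as well as the toy data are hypothetical, not taken from the slides.

```python
import random
from collections import Counter

def fit_stump(X, y):
    """Return the (feature, threshold, sign) stump with the fewest training errors."""
    best = None
    for j in range(len(X[0])):
        for thr in sorted({x[j] for x in X}):
            for sign in (+1, -1):
                errors = sum(1 for x, label in zip(X, y)
                             if (sign if x[j] > thr else -sign) != label)
                if best is None or errors < best[0]:
                    best = (errors, j, thr, sign)
    return best[1:]

def predict_stump(stump, x):
    j, thr, sign = stump
    return sign if x[j] > thr else -sign

def bagging_fit(X, y, n_learners=25, seed=0):
    """Train one stump per bootstrap sample (sampling with replacement)."""
    rng = random.Random(seed)
    m = len(X)
    learners = []
    for _ in range(n_learners):
        idx = [rng.randrange(m) for _ in range(m)]  # bootstrap sample
        learners.append(fit_stump([X[i] for i in idx], [y[i] for i in idx]))
    return learners

def bagging_predict(learners, x):
    """Classification: majority vote over the weak learners."""
    votes = Counter(predict_stump(s, x) for s in learners)
    return votes.most_common(1)[0][0]

# Hypothetical toy data: one feature, labels in {-1, +1}
X = [[0.0], [1.0], [2.0], [3.0], [4.0], [5.0]]
y = [-1, -1, -1, +1, +1, +1]
model = bagging_fit(X, y)
```

For regression, the majority vote would be replaced by the average of the learners' outputs.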
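Random forest
• Ensemble learning that uses a large number of decision trees
• Unlike bagging, features are also selected at random
Training data → sampling with replacement (×3) → feature selection (×3) → weak learners (×3) → strong learner
Classification: majority vote
Regression: average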
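Boosting
• A method that repeatedly retrains while putting more weight on the misclassified training examples
Training data → weak learner → reweighting (repeated) → strong learner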
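AdaBoost
• A representative boosting algorithm
• Repeatedly retrains while assigning exponential ($\exp$) weights to the training examples
• Outputs a sequence of hypotheses $\{h_t\}$ and combines them in a weighted sum:
  $H(x) = \sum_{t=1}^{T} \alpha_t h_t(x)$
• A commonly used hypothesis (weak learner) is the decision stump (a decision tree of depth 1)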
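A minimal sketch of AdaBoost with decision stumps, written with the standard library only. It follows the usual textbook update (weights multiplied by $\exp(-\alpha_t y_i h_t(x_i))$ and renormalized); the helper names and the toy data are assumptions made for illustration.

```python
import math

def stump_predict(stump, x):
    j, thr, sign = stump
    return sign if x[j] > thr else -sign

def best_stump(X, y, D):
    """Stump with the smallest weighted error under the distribution D."""
    best, best_err = None, float("inf")
    for j in range(len(X[0])):
        for thr in sorted({x[j] for x in X}):
            for sign in (+1, -1):
                stump = (j, thr, sign)
                err = sum(d for x, label, d in zip(X, y, D)
                          if stump_predict(stump, x) != label)
                if err < best_err:
                    best, best_err = stump, err
    return best, best_err

def adaboost(X, y, T):
    m = len(X)
    D = [1.0 / m] * m                      # uniform initial weights
    ensemble = []
    for _ in range(T):
        h, eps = best_stump(X, y, D)
        eps = min(max(eps, 1e-10), 1 - 1e-10)   # guard against log(0)
        alpha = 0.5 * math.log((1 - eps) / eps)
        ensemble.append((alpha, h))
        # Exponential reweighting: misclassified examples gain weight
        D = [d * math.exp(-alpha * label * stump_predict(h, x))
             for x, label, d in zip(X, y, D)]
        Z = sum(D)
        D = [d / Z for d in D]
    return ensemble

def H(ensemble, x):
    """Weighted sum H(x) = sum_t alpha_t h_t(x)."""
    return sum(alpha * stump_predict(h, x) for alpha, h in ensemble)

# Hypothetical 1-D data that no single stump classifies perfectly
X = [[0.0], [1.0], [2.0], [3.0], [4.0], [5.0]]
y = [+1, +1, -1, -1, +1, +1]
ensemble = adaboost(X, y, T=3)
train_errors = sum(1 for x, label in zip(X, y)
                   if (1 if H(ensemble, x) > 0 else -1) != label)
```

On this toy set each single stump misclassifies at least two points, while the weighted combination of three stumps separates the data.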
AdaBoost (pseudocode)
http://www.cs.nyu.edu/~mohri/mls/lecture_6.pdf
AdaBoost (illustration)
http://www.cs.nyu.edu/~mohri/mls/lecture_6.pdf
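Questions
• Instead of weak classifiers such as decision stumps, can we use richer, deeper classifiers?
• Complex tasks such as speech recognition and image recognition need more complex classifiers
• Using rich, deep classifiers carries a risk of overfitting
Rich, deep classifiers: deep decision trees, for example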
Deep Boost
Agenda
• Theory
• Algorithm
• Evaluation
Theory
Setup
• $m$ training examples: $S = ((x_1, y_1), \dots, (x_m, y_m))$
• Binary classification task: $y_i \in \{-1, +1\}$
• The learner is a linear combination of $T$ hypotheses: $f = \sum_{t=1}^{T} \alpha_t h_t$
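Idea
• Consider $p$ hypothesis sets of differing complexity: $H_1, \dots, H_p$ (e.g. decision trees of depth 1, decision trees of depth 2, …)
• Each hypothesis is drawn from one of these sets, $h_t \in H_{k_t}$, and $f = \sum_{t=1}^{T} \alpha_t h_t$
• Tune the weights $\alpha_t$ so that the (generalization) error becomes small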
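Error
• The error of a learner is expressed as follows
• Error: $R(f) = \mathbb{E}_{(x,y)\sim D}\left[1_{y f(x) \le 0}\right]$
• $\rho$-margin error: $R_\rho(f) = \mathbb{E}_{(x,y)\sim D}\left[1_{y f(x) \le \rho}\right]$
• Empirical margin error: $\hat{R}_{S,\rho}(f) = \mathbb{E}_{(x,y)\sim S}\left[1_{y f(x) \le \rho}\right]$
($\rho$: margin; $(x,y)\sim S$: drawn from the empirical distribution of the sample $S$)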
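The empirical margin error is just a count over the sample, which a short sketch makes concrete. The scoring function `f` and the five labeled points are hypothetical values chosen for illustration.

```python
def empirical_margin_error(f, S, rho):
    """Fraction of examples (x, y) in S with y * f(x) <= rho."""
    return sum(1 for x, y in S if y * f(x) <= rho) / len(S)

def f(x):
    # Hypothetical real-valued scoring function
    return 0.5 * x

# Hypothetical sample of (x, y) pairs with y in {-1, +1}
S = [(-2.0, -1), (-0.5, -1), (0.4, +1), (1.0, -1), (3.0, +1)]

zero_one = empirical_margin_error(f, S, rho=0.0)  # plain empirical error
margin = empirical_margin_error(f, S, rho=0.5)    # also counts low-confidence points
```

Raising $\rho$ can only increase the count, because points classified correctly but with a margin of at most $\rho$ are now counted as errors.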
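Rademacher complexity
• One measure of the complexity of a class of functions
• Empirical Rademacher complexity, for a sample $S = (x_1, \dots, x_m)$:
  $\hat{R}_S(H) = \frac{1}{m}\,\mathbb{E}_{\sigma}\left[\sup_{h \in H} \sum_{i=1}^{m} \sigma_i h(x_i)\right]$
  where $\sigma = (\sigma_1, \dots, \sigma_m)$, $\sigma_i \in \{-1, +1\}$ are chosen at random; $\sup$ is the supremum over $H$, and the sum measures the correlation of $h$ with $\sigma$
• The more expressive the class, the larger its Rademacher complexity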
Rademacher complexity
• Rademacher complexity: $R_m(H) = \mathbb{E}_{S \sim D^m}\left[\hat{R}_S(H)\right]$
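For a finite hypothesis class the supremum is a maximum, so the empirical Rademacher complexity can be estimated by sampling random sign vectors. The sketch below is a Monte Carlo estimate under that assumption; the two hypothesis classes and the sample points are hypothetical.

```python
import random

def empirical_rademacher(H, xs, n_draws=2000, seed=0):
    """Monte Carlo estimate of (1/m) E_sigma[ max_{h in H} sum_i sigma_i h(x_i) ]."""
    rng = random.Random(seed)
    m = len(xs)
    outputs = [[h(x) for x in xs] for h in H]   # precompute h(x_i)
    total = 0.0
    for _ in range(n_draws):
        sigma = [rng.choice((-1, 1)) for _ in range(m)]   # random signs
        total += max(sum(s * o for s, o in zip(sigma, out)) for out in outputs)
    return total / (n_draws * m)

xs = [0.5, 1.5, 2.5, 3.5, 4.5, 5.5]

# Small class: the two constant hypotheses
constants = [lambda x: 1, lambda x: -1]

# Richer class: the constants plus threshold stumps of both signs
stumps = constants + [
    (lambda x, t=t, s=s: s if x > t else -s)
    for t in range(1, 6) for s in (+1, -1)
]

r_small = empirical_rademacher(constants, xs)
r_large = empirical_rademacher(stumps, xs)
```

Because the richer class contains the smaller one and both estimates use the same sign draws, `r_large` cannot be smaller than `r_small`, which matches the statement that more expressive classes have larger Rademacher complexity.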
Ensembles-Margin Bound
• The following theorem is known (Koltchinskii and Panchenko, 2002)
• The error is bounded by the empirical margin error and the complexity
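The formula itself did not survive in this transcript. For reference, the ensemble margin bound is usually stated in the following form (given here from the standard textbook statement, so the constants should be checked against the cited reference): with probability at least $1 - \delta$ over the draw of $S$, for all $f$ in the convex hull of $H$,

```latex
R(f) \le \hat{R}_{S,\rho}(f) + \frac{2}{\rho} R_m(H) + \sqrt{\frac{\log(1/\delta)}{2m}}
```

The first term is the empirical margin error and the second is the complexity term; the last term depends only on the sample size and the confidence level.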
Theorem 1
• The new theorem proposed in this paper
• Key term: the complexity of each hypothesis class
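The statement of the theorem is missing from this transcript. As best recalled from the Deep Boosting paper (an assumption to verify against the paper, in particular the constants), its simplified form for $f = \sum_{t=1}^{T} \alpha_t h_t$ with $h_t \in H_{k_t}$ reads:

```latex
R(f) \le \hat{R}_{S,\rho}(f) + \frac{4}{\rho} \sum_{t=1}^{T} \alpha_t R_m(H_{k_t}) + C(m, p),
\qquad
C(m, p) = O\!\left(\sqrt{\frac{\log p}{\rho^2 m} \log\frac{\rho^2 m}{\log p}}\right)
```

The complexity term is a weighted average over the classes actually used, with the ensemble weights $\alpha_t$ as coefficients, and the dependence on the number of classes $p$ is only logarithmic.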
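Advantage of Theorem 1
• The proposed theorem bounds the error more tightly
Existing bound: complexity term $R_m(F)$, governed by the most complex class, $\max_k R_m(H_k)$
Theorem 1: complexity term $\sum_{t=1}^{T} \alpha_t R_m(H_{k_t})$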
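A small numeric illustration of why a weighted average of per-class complexities is no larger than the maximum when the weights are non-negative and sum to 1. All numbers below are hypothetical.

```python
# Hypothetical Rademacher complexities R_m(H_k) for classes k = 1, 2, 3
complexities = {1: 0.05, 2: 0.12, 3: 0.30}

# Hypothetical ensemble: (alpha_t, k_t) pairs, with the alphas summing to 1
ensemble = [(0.6, 1), (0.3, 2), (0.1, 3)]

weighted = sum(alpha * complexities[k] for alpha, k in ensemble)
worst = max(complexities.values())
```

Here most of the weight sits on the simplest class, so the weighted term (about 0.096) is well below the worst-case complexity (0.30). An ensemble that uses a complex class only sparingly pays little for it.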
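Algorithm
Approach
• Tune $\alpha$ so that the right-hand side of Theorem 1 becomes small
$\min_{\alpha}\ [\text{empirical error} + \text{complexity}]$, with $r_t = R_m(H_{k_t})$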
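Loss function
• Rescale by $1/\rho$
• Replace the first term with a differentiable function $\Phi$
• Introduce a variable that balances the complexity term against $\Phi$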
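Method of undetermined multipliers
• Apply the method of Lagrange multipliers
• This introduces the per-hypothesis coefficient $\Lambda_j$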
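How to find the solution
• Coordinate descent
• Find the axis (coordinate) along which the gradient is steepest
• Looking only along that direction, move to the position of the minimum
• Repeat the above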
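The three steps above can be sketched on a simple convex quadratic, where the minimum along one coordinate has a closed form. This is a generic coordinate descent illustration, not the DeepBoost update itself; the function name and the matrix `Q` and vector `b` are hypothetical.

```python
def coordinate_descent(Q, b, n_iters=100):
    """Minimize F(a) = 0.5 * a^T Q a - b^T a for symmetric positive definite Q."""
    n = len(b)
    a = [0.0] * n
    for _ in range(n_iters):
        # Gradient of F at a: Q a - b
        grad = [sum(Q[i][k] * a[k] for k in range(n)) - b[i] for i in range(n)]
        # Step 1: pick the coordinate with the steepest slope
        j = max(range(n), key=lambda i: abs(grad[i]))
        if abs(grad[j]) < 1e-12:
            break
        # Step 2: move to the exact minimum along that coordinate
        a[j] -= grad[j] / Q[j][j]
        # Step 3: repeat
    return a

Q = [[2.0, 0.5], [0.5, 1.0]]
b = [1.0, 1.0]
a_star = coordinate_descent(Q, b)
```

The result approaches the solution of $Qa = b$, here $a = (2/7, 6/7)$. In boosting, each coordinate corresponds to one candidate hypothesis, so only a few coordinates ever become non-zero.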
Pseudocode
$\Lambda_j = \lambda r_j + \beta$
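To make the role of $\Lambda_j$ concrete, the sketch below evaluates an objective of the form $F(\alpha) = \frac{1}{m}\sum_i \Phi\bigl(1 - y_i \sum_j \alpha_j h_j(x_i)\bigr) + \sum_j \Lambda_j |\alpha_j|$ with $\Lambda_j = \lambda r_j + \beta$. This form is written from memory of the Deep Boosting paper and should be treated as an assumption; the function name, argument layout, and example values are hypothetical.

```python
import math

def deepboost_objective(alpha, H_out, y, r, lam, beta, phi=math.exp):
    """Surrogate loss plus complexity penalty (assumed form, see lead-in).

    alpha : weights alpha_j, one per candidate hypothesis
    H_out : H_out[j][i] = h_j(x_i), each in {-1, +1}
    y     : labels y_i in {-1, +1}
    r     : r_j, complexity of the class containing h_j
    """
    m = len(y)
    loss = 0.0
    for i in range(m):
        f_xi = sum(a * H_out[j][i] for j, a in enumerate(alpha))
        loss += phi(1.0 - y[i] * f_xi)
    # Lambda_j = lam * r_j + beta penalizes weight on complex hypotheses more
    penalty = sum((lam * r[j] + beta) * abs(a) for j, a in enumerate(alpha))
    return loss / m + penalty

# Hypothetical example: two hypotheses, three training points
H_out = [[+1, +1, -1], [+1, -1, -1]]
y = [+1, +1, -1]
r = [0.2, 0.4]

at_zero = deepboost_objective([0.0, 0.0], H_out, y, r, lam=0.5, beta=0.1)
no_penalty = deepboost_objective([1.0, 0.0], H_out, y, r, lam=0.0, beta=0.0)
with_penalty = deepboost_objective([1.0, 0.0], H_out, y, r, lam=0.5, beta=0.1)
```

With `lam=0` and `beta=0` the penalty vanishes and only the surrogate loss remains. With positive values, the same weight costs more on a hypothesis from a more complex class.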
Choosing the direction
• "Choosing the direction" corresponds to "selecting the classifier"
Choosing the step size
• Choosing the step size corresponds to updating the "weight" of the classifier
Updating the weights
• "The derivative of $\Phi$" corresponds to "the weights on the data"
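Relation to AdaBoost
• With the following setting, the algorithm is exactly AdaBoost: $\Lambda_j = 0$, $\Phi(-u) = \exp(-u)$
• With the following setting, it uses the additive logistic loss: $\Lambda_j = 0$, $\Phi(-u) = \log_2(1 + \exp(-u))$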
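Both choices of $\Phi$ are differentiable upper bounds on the 0-1 loss $1_{u \le 0}$, which is why they can stand in for the empirical error term. A short sketch comparing them at a few margin values (the function names and margin values are hypothetical):

```python
import math

def exp_loss(u):
    """Exponential loss, Phi(-u) = exp(-u); the AdaBoost case."""
    return math.exp(-u)

def logistic_loss(u):
    """Logistic loss, Phi(-u) = log2(1 + exp(-u))."""
    return math.log2(1.0 + math.exp(-u))

def zero_one_loss(u):
    """Indicator of a non-positive margin."""
    return 1.0 if u <= 0 else 0.0

margins = [-2.0, -0.5, 0.0, 0.5, 2.0]
table = [(u, zero_one_loss(u), exp_loss(u), logistic_loss(u)) for u in margins]
```

Both surrogates equal 1 at $u = 0$ and decrease as the margin grows; the exponential loss grows much faster than the logistic loss for negative margins.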
AdaBoost (pseudocode, repeated)
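Caveats
• The parameters $\lambda, \beta$ must be chosen
• The complexity $R_m$ must be computed
• The paper does not compute the complexity directly but uses an upper bound on it
• Because there are very many candidate learners, the paper narrows them down heuristically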
Evaluation
Datasets
• Various UCI datasets
• breastcancer
• ionosphere
• german
• OCR data
• ocr17, ocr17-mnist
• ocr49, ocr49-mnist
Results
Accuracy improvement
• For ocr17-mnist and ocr49-mnist, a t-test confirmed a statistically significant difference (2%)
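Summary
• The paper proposes Deep Boost, a new ensemble learning method
• It can be viewed as an extension of AdaBoost
• There is a theoretical guarantee on the error
• Accuracy does (indeed?) improve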
