SlideShare a Scribd company logo
Active Learning in
                                            Recommender Systems




http://4.bp.blogspot.com/_qFju91K89HM/SxRpABd1DTI/AAAAAAAABjw/6LaSJfjfk-I/s1600/Unexpected_Guests.jpg




 Neil Rubens
 Active Intelligence Lab
 University of Electro-Communications
http://activeintelligence.org/research/al-rs/
N. Rubens, D. Kaplan, M. Sugiyama.
Recommender Systems Handbook: Active
Learning in Recommender Systems (eds. P.B.
Kantor, F. Ricci, L. Rokach,B. Shapira). Springer,
2011.
!"#$%%&&&'()*+),-'./0%.&12/-%(223410%41(.'!,567   !"#$%%&&&'()*+,'*-.%#!-/-0%.12#0)23%4567884598%:




      Passive Intelligence                                               Active Intelligence
data is given                                        Premise: given info is insufficient
model is given
                                                     active data acquisition
task:                                                self adaptation/reconfiguration
learn model’s parameters
Why Need Useful Data?

“If you put into the machine wrong figures, will the right answers
come out?
I am not able rightly to apprehend the kind of confusion of ideas
that could provoke such a question.”
                                                  Charles Babbage



Garbage In, Garbage Out
(GIGO Principle)

           George Fuechsel
What about Data Mining?

We can sniff through the data and try to find
something of value.

Assumptions
a lot of data is available
some of the data is useful




                                                                                                    !"#$%%&&&'()*+,-./,012-'345%21#-67%*893+12%6:;*893+1'2!-5+<
                                        http://www.qualitydigest.com/sept06/articles/04_article.shtml
Obtaining Data could be “COSTLY”
Medicine:
 diagnosis: pain, time, $
 drug discovery: $$$, time

User Interaction:
 effort, time

Expertise Elicitation:
 $, time

                 Active Learning (AL)
Goal: Estimate ‘Usefulness’ of the data
      before data is acquired
Limitation of Traditional Recommender Systems




 Exploitation                         http://misspinkslip.files.wordpress.com/2009/07/used-car-salesman.jpg




RS often just tries to tell you what you want!!!
Exploration
Find out what your interests are




                                    http://www.flickr.com/photos/luisorlando/2688548978
!"#$%&




                                            5607&"8&.+2329"




#$%&'"(34&1,"-.*&%"/*01.0$2"

                               #$%&'"()*&+,"-.*&%"/*01.0$2"




                                                              !"
What is Useful depends on the Objective
Settings

         )

                  #
!"#$%'




             !

                      "
Not Useful
X2




     X1



          limited information
User Satisfaction
                                                 Ratings
                                                  positive
                                                  negative




                             X2X2
                     X2




                                                             X
                                                             X
             X1                         X1



         user: not much variety, may get bored
Drawback system: limited knowledge
Coverage




                       X2




              X1                          X1




Drawback user: exposed to items of no interest
[Settles, 2009]
                                                                                               Prediction Accuracy
 33                                               333                                                 333

 22                                               222                                                 222

 11                                               111                                                 111

 00                                               000                                                 000

 -1 -1                                            -1-1 -1                                             -1-1 -1

 -2 -2                                            -2-2 -2                                             -2-2 -2

 -3 -3                                            -3-3 -3                                             -3-3 -3
         -4-4 -4   -2-2 -2    000     222   444             -4-4 -4   -2-2 -2    000     222    444             -4-4 -4   -2-2 -2    000     222   444
                             (a)(a)
                              (a)                                               (b)(b)
                                                                                 (b)                                                (c)(c)
                                                                                                                                     (c)
                   Actual Model   Prediction Accuracy          Prediction Accuracy
Figure 2: 2: Anillustrative example(Random Sampling)learning. (a) A Atoydata set of o
 Figure 2: An illustrative exampleof ofpool-basedactive learning. (Active Learning) of
  Figure      An illustrative exampleofpool-based active learning. (a) Atoy data set
                                         pool-based active          (a) toy data set
                     400 instances, evenly sampled from two class Gaussians. The instances are
                      400 instances, evenly sampled from two class Gaussians. The instances are
                        400 instances, evenly sampled from two class Gaussians. The instances ar
                     represented as aspointsin ina2D feature space. (b) A Alogisticregression model
                      represented aspoints ina a2D feature space. (b) Alogistic regression model
                        represented     points      2D feature space. (b) logistic regression mode
                     trained with 3030labeledinstances randomly drawn from the problem domain.
                      trained with 30labeled instances randomly drawn from the problem domain.
                        trained with     labeled instances randomly drawn from the problem domain
                     The line represents the decision boundary of of the classifier (70% accuracy).(c)
                      The line represents the decision boundary ofthe classifier (70% accuracy). (c)
                        The line represents the decision boundary the classifier (70% accuracy). (c
                     A Alogisticregression model trained with 3030activelyqueried instances using
                      Alogistic regression model trained with 30actively queried instances using
                           logistic regression model trained with     actively queried instances using
                     uncertainty sampling (90%).
                      uncertainty sampling (90%).
                        uncertainty sampling (90%).
    Drawback user: exposed to items of no interest
         Figure 11illustrates the pool-based active learning cycle. A Alearnermay begin
          Figure 1illustrates the pool-based active learning cycle. Alearner may begin
           Figure illustrates the pool-based active learning cycle. learner may begin
• allow user to explore his/her interests       Usefulness/
                                                Objectives
• prediction accuracy for (user or item)
• maximize profit
• maximize number of visits / time spent
• minimize acquisition cost (# of ratings, implicit/explicit)
• max system utility
• minimize uncertainty
• make it fun for the user
• etc.
  objectives may overlap
Doesn’t have to Bothersome
Active/Passive Learning



                                     Passive Learning
       training data
          request
                                     Active Learning




                        supervised
user   training data
                         learning     approximated
                                         function
AL Categories



   Item-based AL
analyze items and select items that seem useful


  Model-based AL
analyze model and select items that seem useful
Item-based AL


            3R Properties
                                        )
Represented
by the existing training set?                            #




                              !"#$%'
e.g. (b) is already represented
Representative                              !
of others?
e.g.(a) is not                                               "
                                                !"#$%&
Results in achieving objective?
e.g. (d) -> max coverage
[Rubens & Kaplan, 2010]
Item Properties
• Popular   [Rashid 2002]


  (rated by many users)
• High Variance in ratings           [Rashid 2002]


  item that people either like or hate
• Best/Worst      [Leino & Raiha 2007]


  ask user which items s/he likes most/least
• Influential   [Rubens & Sugiyama 2007]


  items on which ratings of many other items depend
  (Representative + Not Represented)
Model-based AL



     Initial



     Improve Margin



X1   Improve Orientation
1
   Model-error AL
                                                                                 #
                                                                             ##,
                                                                          %-'
                                               3                /)$*"+$,                 . .,/')-'##,#
                                               15               '#"
                                                         (   '%
                                     -         3                      2
                               !"#$"%&'        1(                                        0
                                                                  0$"1
                                          3         3
                                          14        16


g : optimal function (in the sollution                  !"#$%&"'(!)*+,
space)                                   Model Error – C
f : learned function                     constant and is ignored
fi ’s: learned functions from a slightly
different training set.                   Bias – B
EG = B + V + C
                       2                 Hard to estimate, but is assumed
B = Ef (x) − g (x)                       to vanish (assymptotically).
                  2
V = f − Ef (x)
                  2
                                          Variance – V
C = (g (x) − f (x))
                                          Estimate and minize.
                                                                                                   10 / 20
Parameter-Variance AL
Model Complexity




as the number of training points increases
more complex models tend to fit data better
Model Selection




(a) under-fit                    (b) over-fit                 (c) appropriate fit

          Figure 8: Dependence between model complexity and accuracy.
(a) under-fit                           Model-Points Dependency
                                    (b) over-fit         (c) appropriate fit

             Figure 8: Dependence between model complexity and accuracy.




Training input points that are good for learning one model, are not necessary good for t
 Training input points that are good for learning one model,
 are not necessary good for the other.
                                   min G(X (T rain) ).
                                 X (T rain)
Black Box Settings

May not have information/understanding about:


                                                                                        )

                                                                                                                #




                                                                            !"#$%'
                                                                                            !
http://www.sps.ele.tue.nl/members/b.vries/research/research.html
                                                                                                                    "

                                                                                                !"#$%&


                                                                                                         Figure 1: Active Lear

                           Model                                                        Points
                                                                           already possible from the training point in th
ou et al., 2000, Schuurmans, 1997]
                              yx
                                       Black Box Settings
t is [Evgeniou et al., 2000, Schuurmans, 1997]
                    f (x)                   yx
                               yx
                              f (x)
   11101010101111
   01001001010011    x                      yx
   01010110100010           yx = β · x
   10101010011010
   10100101001010               x
                     yx                  yx = β · x
rences
                               yx
niou, M. Pontil,is too complex Regularization networks and su
   The system and T. Poggio.
   Referencesx     y
  machines.constantly in Computational Mathematics, 13(1):1–50,
   (and is   Advances changing)
   T. Evgeniou, M. Pontil, and yx T. Poggio. Regularization netwo
urmans. A new y = β · x
                metric-based approach to model selection. In Procee
      vector machines. Advances in Computational Mathematics, 1
   e.g. RS at Amazon, NetFlix:
                 x
 Fourteenth National Conference on Artificial Intelligence (AAA
     10,000’s lines of codes = β · x
 552–558, 1997.            yx
   D. Schuurmans. A new metric-based approach to model selection
     continuously changed by multiple teams Artificial Intellige
      of the Fourteenth National Conference on
      pages 552–558, 1997.
“Information is a difference which makes a difference”
                           Gregory Bateson (anthropologist)

Select training points based on their expected influence on
the output estimates   Proposed Method Proposed Approach
                    Proposed Method       Proposed Approach


(the only value accessible in Black-Box Settings).
   yt+1 yt+1                           yt+1 yt+1

   yt   yt                             yt   yt
                         input index                          input index
                  input index                          input index




a)a) Adding training point causes many b) Adding training point causes few
  Adding training point causes many b) Adding training point causes few
output estimates toto change.
  output estimates change.            output estimates toto change.
                                        output estimates change.
Validity of Assumptions (is change in the output estimates good?)
Changes in the estimates of the output               [Empirical]
values with regards to a new training
point:                                         0.4


                                              0.35


                                               0.3


    a) the estimate of the true               0.25

    output value deteriorates            P (yt+1 )
                                               0.2
         relatively infrequent (16%,
         expected deterioration is            0.15
         small)
    b) the estimate of the true                0.1

    output value improves
                                              0.05
         most frequent case (84%)
                                                0
    c) the estimate of the true                              y          y
    output value is overshoot                               yt+1    18 / 20
Criterion Accuracy
     10




      8




      6
∆G




      4




                              High values of criterion
      2
                              correspond to high improvements in accuracy

      0




     −2
          0   0.5   1   1.5     2         2.5   3   3.5
                                      2
                          yt − yt+1
(δ ) =            −       +
                                                                                    Interpretation

(δ ) =         ∗
                       β −β      +

                                       ( δ       −
                                                         δ)            ∑    ∈   ∗
                                                                                           δ
                                                                                                −
      =(   δ   −       δ   β )                       −
                                                                  +                     δ
                                                                                            −
                                     ( +     δ               δ)             ( +         δ       δ)

      =     (          +     ),




                                             =(          δ   −    δ   β )


                   δ                                 δ   β .
  δ
Representative


        ∑           ∈   ∗
                               δ           δ
                                                    −
    =                                       −
                    ( +         δ                   δ)



≥   ∑   ∗
                        δ
                                    −

    ∈          δ



≈   ∑   ∗               α           ∑                   δ   ϕ   ϕ   .
    ∈          δ               = +




                                            δ



                                    ∗
                                               δ
Not Represented



                ( δ          −
                                     δ)
            =                    −
              ( +        δ            δ)




    −
δ           δ




δ
    −
            δ   ≈
                    α    ∑                δ   ϕ   .
                        = +


        δ


                        {ϕ } =
9
                                                     Proposed
                                                     A!optimal
                                                     D!optimal
                                                                     Evaluation
                                                     E!optimal
                     8                               Transductive
                                                     Random
                                                     Optimal

                     7
Mean Squared Error




                     6



                     5



                     4



                     3



                     2
                         2   4              6        8          10
                                 Training Set Size


                                      •system needs to be robust with respect to
Limitations outliers
                                      •incremental re-training needs to be fast

More Related Content

Similar to Active Learning in Recommender Systems

22期.百度彭滔 搜索引擎评估与用户行为分析
22期.百度彭滔 搜索引擎评估与用户行为分析22期.百度彭滔 搜索引擎评估与用户行为分析
22期.百度彭滔 搜索引擎评估与用户行为分析
Janwen Lou
 
20190927 generative models_aia
20190927 generative models_aia20190927 generative models_aia
20190927 generative models_aia
Yi-Fan Liou
 
Neural networks across space & time : Deep learning in java
Neural networks across space & time : Deep learning in javaNeural networks across space & time : Deep learning in java
Neural networks across space & time : Deep learning in java
Dave Snowdon
 
Scaling up data science applications
Scaling up data science applicationsScaling up data science applications
Scaling up data science applications
Kexin Xie
 
Scaling Up: How Switching to Apache Spark Improved Performance, Realizability...
Scaling Up: How Switching to Apache Spark Improved Performance, Realizability...Scaling Up: How Switching to Apache Spark Improved Performance, Realizability...
Scaling Up: How Switching to Apache Spark Improved Performance, Realizability...
Databricks
 
Bokeh Tutorial - PyData @ Strata San Jose 2015
Bokeh Tutorial - PyData @ Strata San Jose 2015Bokeh Tutorial - PyData @ Strata San Jose 2015
Bokeh Tutorial - PyData @ Strata San Jose 2015
Peter Wang
 
Introduction to Neural Networks
Introduction to Neural NetworksIntroduction to Neural Networks
Introduction to Neural Networks
Databricks
 
Evaluating Data Freshness in Large Scale Replicated Databases
Evaluating Data Freshness in Large Scale Replicated DatabasesEvaluating Data Freshness in Large Scale Replicated Databases
Evaluating Data Freshness in Large Scale Replicated Databases
Miguel Araújo
 
Estimating ecosystem functional features from intra-specific trait data
Estimating ecosystem functional features from intra-specific trait dataEstimating ecosystem functional features from intra-specific trait data
Estimating ecosystem functional features from intra-specific trait data
Tano Gutiérrez Cánovas
 
C4_W1 (1).pdf
C4_W1 (1).pdfC4_W1 (1).pdf
C4_W1 (1).pdf
machine121
 
nullcon 2011 - Memory analysis – Looking into the eye of the bits
nullcon 2011 - Memory analysis – Looking into the eye of the bitsnullcon 2011 - Memory analysis – Looking into the eye of the bits
nullcon 2011 - Memory analysis – Looking into the eye of the bits
n|u - The Open Security Community
 
Detection of Abandoned Bag
Detection of Abandoned BagDetection of Abandoned Bag
Detection of Abandoned Bag
Saumya Tiwari
 
Sparse representation based human action recognition using an action region-a...
Sparse representation based human action recognition using an action region-a...Sparse representation based human action recognition using an action region-a...
Sparse representation based human action recognition using an action region-a...
Wesley De Neve
 
Visual diagnostics at scale
Visual diagnostics at scaleVisual diagnostics at scale
Visual diagnostics at scale
Rebecca Bilbro
 
Chapter3_Visualizations2.pdf
Chapter3_Visualizations2.pdfChapter3_Visualizations2.pdf
Chapter3_Visualizations2.pdf
MekiyaShigute1
 
2. filtering basics
2. filtering basics2. filtering basics
2. filtering basics
Atul Kumar Jha
 
Django101 geodjango
Django101 geodjangoDjango101 geodjango
Django101 geodjango
Calvin Cheng
 
작은 스타트업에서 머신러닝 맛보기
작은 스타트업에서 머신러닝 맛보기작은 스타트업에서 머신러닝 맛보기
작은 스타트업에서 머신러닝 맛보기
성민 이
 
Deep learning image recognition for autonomous driving(classification, objec...
Deep learning image recognition for  autonomous driving(classification, objec...Deep learning image recognition for  autonomous driving(classification, objec...
Deep learning image recognition for autonomous driving(classification, objec...
동열 이
 

Similar to Active Learning in Recommender Systems (20)

22期.百度彭滔 搜索引擎评估与用户行为分析
22期.百度彭滔 搜索引擎评估与用户行为分析22期.百度彭滔 搜索引擎评估与用户行为分析
22期.百度彭滔 搜索引擎评估与用户行为分析
 
20190927 generative models_aia
20190927 generative models_aia20190927 generative models_aia
20190927 generative models_aia
 
Neural networks across space & time : Deep learning in java
Neural networks across space & time : Deep learning in javaNeural networks across space & time : Deep learning in java
Neural networks across space & time : Deep learning in java
 
Scaling up data science applications
Scaling up data science applicationsScaling up data science applications
Scaling up data science applications
 
Scaling Up: How Switching to Apache Spark Improved Performance, Realizability...
Scaling Up: How Switching to Apache Spark Improved Performance, Realizability...Scaling Up: How Switching to Apache Spark Improved Performance, Realizability...
Scaling Up: How Switching to Apache Spark Improved Performance, Realizability...
 
Bokeh Tutorial - PyData @ Strata San Jose 2015
Bokeh Tutorial - PyData @ Strata San Jose 2015Bokeh Tutorial - PyData @ Strata San Jose 2015
Bokeh Tutorial - PyData @ Strata San Jose 2015
 
Introduction to Neural Networks
Introduction to Neural NetworksIntroduction to Neural Networks
Introduction to Neural Networks
 
Evaluating Data Freshness in Large Scale Replicated Databases
Evaluating Data Freshness in Large Scale Replicated DatabasesEvaluating Data Freshness in Large Scale Replicated Databases
Evaluating Data Freshness in Large Scale Replicated Databases
 
Estimating ecosystem functional features from intra-specific trait data
Estimating ecosystem functional features from intra-specific trait dataEstimating ecosystem functional features from intra-specific trait data
Estimating ecosystem functional features from intra-specific trait data
 
C4_W1 (1).pdf
C4_W1 (1).pdfC4_W1 (1).pdf
C4_W1 (1).pdf
 
CNN
CNNCNN
CNN
 
nullcon 2011 - Memory analysis – Looking into the eye of the bits
nullcon 2011 - Memory analysis – Looking into the eye of the bitsnullcon 2011 - Memory analysis – Looking into the eye of the bits
nullcon 2011 - Memory analysis – Looking into the eye of the bits
 
Detection of Abandoned Bag
Detection of Abandoned BagDetection of Abandoned Bag
Detection of Abandoned Bag
 
Sparse representation based human action recognition using an action region-a...
Sparse representation based human action recognition using an action region-a...Sparse representation based human action recognition using an action region-a...
Sparse representation based human action recognition using an action region-a...
 
Visual diagnostics at scale
Visual diagnostics at scaleVisual diagnostics at scale
Visual diagnostics at scale
 
Chapter3_Visualizations2.pdf
Chapter3_Visualizations2.pdfChapter3_Visualizations2.pdf
Chapter3_Visualizations2.pdf
 
2. filtering basics
2. filtering basics2. filtering basics
2. filtering basics
 
Django101 geodjango
Django101 geodjangoDjango101 geodjango
Django101 geodjango
 
작은 스타트업에서 머신러닝 맛보기
작은 스타트업에서 머신러닝 맛보기작은 스타트업에서 머신러닝 맛보기
작은 스타트업에서 머신러닝 맛보기
 
Deep learning image recognition for autonomous driving(classification, objec...
Deep learning image recognition for  autonomous driving(classification, objec...Deep learning image recognition for  autonomous driving(classification, objec...
Deep learning image recognition for autonomous driving(classification, objec...
 

More from Neil Rubens

Autism: Survey of Emerging Approaches [Clinical]
Autism: Survey of Emerging Approaches [Clinical]Autism: Survey of Emerging Approaches [Clinical]
Autism: Survey of Emerging Approaches [Clinical]
Neil Rubens
 
Collaborative Robotics (CoBot): Opportunities for Corporations
Collaborative Robotics (CoBot): Opportunities for CorporationsCollaborative Robotics (CoBot): Opportunities for Corporations
Collaborative Robotics (CoBot): Opportunities for Corporations
Neil Rubens
 
Autism: Survey of Emerging Approaches [Startups]
Autism: Survey of Emerging Approaches [Startups]Autism: Survey of Emerging Approaches [Startups]
Autism: Survey of Emerging Approaches [Startups]
Neil Rubens
 
Solving the AL Chicken-and-Egg Corpus and Model Problem
Solving the AL Chicken-and-Egg Corpus and Model ProblemSolving the AL Chicken-and-Egg Corpus and Model Problem
Solving the AL Chicken-and-Egg Corpus and Model Problem
Neil Rubens
 
Recommender Systems and Active Learning (for Startups)
Recommender Systems and Active Learning (for Startups)Recommender Systems and Active Learning (for Startups)
Recommender Systems and Active Learning (for Startups)
Neil Rubens
 
Network Learning: AI-driven Connectivist Framework for E-Learning 3.0
Network Learning: AI-driven Connectivist Framework for E-Learning 3.0Network Learning: AI-driven Connectivist Framework for E-Learning 3.0
Network Learning: AI-driven Connectivist Framework for E-Learning 3.0Neil Rubens
 
e-learning 3.0 and AI
e-learning 3.0 and AIe-learning 3.0 and AI
e-learning 3.0 and AINeil Rubens
 
Learning Networks: e-Learning 3.0
Learning Networks: e-Learning 3.0Learning Networks: e-Learning 3.0
Learning Networks: e-Learning 3.0
Neil Rubens
 
Inconsistent Outliers
Inconsistent OutliersInconsistent Outliers
Inconsistent Outliers
Neil Rubens
 
Outliers and Inconsistency
Outliers and InconsistencyOutliers and Inconsistency
Outliers and Inconsistency
Neil Rubens
 
Alumni Network Analysis
Alumni Network AnalysisAlumni Network Analysis
Alumni Network AnalysisNeil Rubens
 
Value Co-Creation in Innovation Ecosystems (Presentation @ Tokyo Institute of...
Value Co-Creation in Innovation Ecosystems (Presentation @ Tokyo Institute of...Value Co-Creation in Innovation Ecosystems (Presentation @ Tokyo Institute of...
Value Co-Creation in Innovation Ecosystems (Presentation @ Tokyo Institute of...
Neil Rubens
 
Value Co-Creation in Innovation Ecosystems (English)
Value Co-Creation in Innovation Ecosystems (English)Value Co-Creation in Innovation Ecosystems (English)
Value Co-Creation in Innovation Ecosystems (English)
Neil Rubens
 
Value Co-Creation in Innovation Ecosystems (Chinese)
Value Co-Creation in Innovation Ecosystems (Chinese)Value Co-Creation in Innovation Ecosystems (Chinese)
Value Co-Creation in Innovation Ecosystems (Chinese)
Neil Rubens
 
Japan Mobile
Japan MobileJapan Mobile
Japan Mobile
Neil Rubens
 

More from Neil Rubens (15)

Autism: Survey of Emerging Approaches [Clinical]
Autism: Survey of Emerging Approaches [Clinical]Autism: Survey of Emerging Approaches [Clinical]
Autism: Survey of Emerging Approaches [Clinical]
 
Collaborative Robotics (CoBot): Opportunities for Corporations
Collaborative Robotics (CoBot): Opportunities for CorporationsCollaborative Robotics (CoBot): Opportunities for Corporations
Collaborative Robotics (CoBot): Opportunities for Corporations
 
Autism: Survey of Emerging Approaches [Startups]
Autism: Survey of Emerging Approaches [Startups]Autism: Survey of Emerging Approaches [Startups]
Autism: Survey of Emerging Approaches [Startups]
 
Solving the AL Chicken-and-Egg Corpus and Model Problem
Solving the AL Chicken-and-Egg Corpus and Model ProblemSolving the AL Chicken-and-Egg Corpus and Model Problem
Solving the AL Chicken-and-Egg Corpus and Model Problem
 
Recommender Systems and Active Learning (for Startups)
Recommender Systems and Active Learning (for Startups)Recommender Systems and Active Learning (for Startups)
Recommender Systems and Active Learning (for Startups)
 
Network Learning: AI-driven Connectivist Framework for E-Learning 3.0
Network Learning: AI-driven Connectivist Framework for E-Learning 3.0Network Learning: AI-driven Connectivist Framework for E-Learning 3.0
Network Learning: AI-driven Connectivist Framework for E-Learning 3.0
 
e-learning 3.0 and AI
e-learning 3.0 and AIe-learning 3.0 and AI
e-learning 3.0 and AI
 
Learning Networks: e-Learning 3.0
Learning Networks: e-Learning 3.0Learning Networks: e-Learning 3.0
Learning Networks: e-Learning 3.0
 
Inconsistent Outliers
Inconsistent OutliersInconsistent Outliers
Inconsistent Outliers
 
Outliers and Inconsistency
Outliers and InconsistencyOutliers and Inconsistency
Outliers and Inconsistency
 
Alumni Network Analysis
Alumni Network AnalysisAlumni Network Analysis
Alumni Network Analysis
 
Value Co-Creation in Innovation Ecosystems (Presentation @ Tokyo Institute of...
Value Co-Creation in Innovation Ecosystems (Presentation @ Tokyo Institute of...Value Co-Creation in Innovation Ecosystems (Presentation @ Tokyo Institute of...
Value Co-Creation in Innovation Ecosystems (Presentation @ Tokyo Institute of...
 
Value Co-Creation in Innovation Ecosystems (English)
Value Co-Creation in Innovation Ecosystems (English)Value Co-Creation in Innovation Ecosystems (English)
Value Co-Creation in Innovation Ecosystems (English)
 
Value Co-Creation in Innovation Ecosystems (Chinese)
Value Co-Creation in Innovation Ecosystems (Chinese)Value Co-Creation in Innovation Ecosystems (Chinese)
Value Co-Creation in Innovation Ecosystems (Chinese)
 
Japan Mobile
Japan MobileJapan Mobile
Japan Mobile
 

Recently uploaded

The Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collectionThe Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collection
Israel Genealogy Research Association
 
Delivering Micro-Credentials in Technical and Vocational Education and Training
Delivering Micro-Credentials in Technical and Vocational Education and TrainingDelivering Micro-Credentials in Technical and Vocational Education and Training
Delivering Micro-Credentials in Technical and Vocational Education and Training
AG2 Design
 
PIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf IslamabadPIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf Islamabad
AyyanKhan40
 
South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)
Academy of Science of South Africa
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Dr. Vinod Kumar Kanvaria
 
Pride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School DistrictPride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School District
David Douglas School District
 
World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024
ak6969907
 
Aficamten in HCM (SEQUOIA HCM TRIAL 2024)
Aficamten in HCM (SEQUOIA HCM TRIAL 2024)Aficamten in HCM (SEQUOIA HCM TRIAL 2024)
Aficamten in HCM (SEQUOIA HCM TRIAL 2024)
Ashish Kohli
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
thanhdowork
 
Advantages and Disadvantages of CMS from an SEO Perspective
Advantages and Disadvantages of CMS from an SEO PerspectiveAdvantages and Disadvantages of CMS from an SEO Perspective
Advantages and Disadvantages of CMS from an SEO Perspective
Krisztián Száraz
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
TechSoup
 
How to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP ModuleHow to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP Module
Celine George
 
Landownership in the Philippines under the Americans-2-pptx.pptx
Landownership in the Philippines under the Americans-2-pptx.pptxLandownership in the Philippines under the Americans-2-pptx.pptx
Landownership in the Philippines under the Americans-2-pptx.pptx
JezreelCabil2
 
Group Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana BuscigliopptxGroup Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana Buscigliopptx
ArianaBusciglio
 
DRUGS AND ITS classification slide share
DRUGS AND ITS classification slide shareDRUGS AND ITS classification slide share
DRUGS AND ITS classification slide share
taiba qazi
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
Levi Shapiro
 
MATATAG CURRICULUM: ASSESSING THE READINESS OF ELEM. PUBLIC SCHOOL TEACHERS I...
MATATAG CURRICULUM: ASSESSING THE READINESS OF ELEM. PUBLIC SCHOOL TEACHERS I...MATATAG CURRICULUM: ASSESSING THE READINESS OF ELEM. PUBLIC SCHOOL TEACHERS I...
MATATAG CURRICULUM: ASSESSING THE READINESS OF ELEM. PUBLIC SCHOOL TEACHERS I...
NelTorrente
 

Recently uploaded (20)

The Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collectionThe Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collection
 
Delivering Micro-Credentials in Technical and Vocational Education and Training
Delivering Micro-Credentials in Technical and Vocational Education and TrainingDelivering Micro-Credentials in Technical and Vocational Education and Training
Delivering Micro-Credentials in Technical and Vocational Education and Training
 
PIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf IslamabadPIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf Islamabad
 
South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
 
Pride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School DistrictPride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School District
 
World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024
 
Aficamten in HCM (SEQUOIA HCM TRIAL 2024)
Aficamten in HCM (SEQUOIA HCM TRIAL 2024)Aficamten in HCM (SEQUOIA HCM TRIAL 2024)
Aficamten in HCM (SEQUOIA HCM TRIAL 2024)
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
 
Advantages and Disadvantages of CMS from an SEO Perspective
Advantages and Disadvantages of CMS from an SEO PerspectiveAdvantages and Disadvantages of CMS from an SEO Perspective
Advantages and Disadvantages of CMS from an SEO Perspective
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
 
How to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP ModuleHow to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP Module
 
Landownership in the Philippines under the Americans-2-pptx.pptx
Landownership in the Philippines under the Americans-2-pptx.pptxLandownership in the Philippines under the Americans-2-pptx.pptx
Landownership in the Philippines under the Americans-2-pptx.pptx
 
Group Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana BuscigliopptxGroup Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana Buscigliopptx
 
DRUGS AND ITS classification slide share
DRUGS AND ITS classification slide shareDRUGS AND ITS classification slide share
DRUGS AND ITS classification slide share
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
 
MATATAG CURRICULUM: ASSESSING THE READINESS OF ELEM. PUBLIC SCHOOL TEACHERS I...
MATATAG CURRICULUM: ASSESSING THE READINESS OF ELEM. PUBLIC SCHOOL TEACHERS I...MATATAG CURRICULUM: ASSESSING THE READINESS OF ELEM. PUBLIC SCHOOL TEACHERS I...
MATATAG CURRICULUM: ASSESSING THE READINESS OF ELEM. PUBLIC SCHOOL TEACHERS I...
 

Active Learning in Recommender Systems

  • 1. Active Learning in Recommender Systems http://4.bp.blogspot.com/_qFju91K89HM/SxRpABd1DTI/AAAAAAAABjw/6LaSJfjfk-I/s1600/Unexpected_Guests.jpg Neil Rubens Active Intelligence Lab University of Electro-Communications
  • 2. http://activeintelligence.org/research/al-rs/ N. Rubens, D. Kaplan, M. Sugiyama. Recommender Systems Handbook: Active Learning in Recommender Systems (eds. P.B. Kantor, F. Ricci, L. Rokach,B. Shapira). Springer, 2011.
  • 3. !"#$%%&&&'()*+),-'./0%.&12/-%(223410%41(.'!,567 !"#$%%&&&'()*+,'*-.%#!-/-0%.12#0)23%4567884598%: Passive Intelligence Active Intelligence data is given Premise: given info is insufficient model is given active data acquisition task: self adaptation/reconfiguration learn model’s parameters
  • 4. Why Need Useful Data? “If you put into the machine wrong figures, will the right answers come out? I am not able rightly to apprehend the kind of confusion of ideas that could provoke such a question.” Charles Babbage Garbage In, Garbage Out (GIGO Principle) George Fuechsel
  • 5. What about Data Mining? We can sniff through the data and try to find something of value. Assumptions a lot of data is available some of the data is useful !"#$%%&&&'()*+,-./,012-'345%21#-67%*893+12%6:;*893+1'2!-5+< http://www.qualitydigest.com/sept06/articles/04_article.shtml
  • 6. Obtaining Data could be “COSTLY” Medicine: diagnosis: pain, time, $ drug discovery: $$$, time User Interaction: effort, time Expertise Elicitation: $, time Active Learning (AL) Goal: Estimate ‘Usefulness’ of the data before data is acquired
  • 7. Limitation of Traditional Recommender Systems Exploitation http://misspinkslip.files.wordpress.com/2009/07/used-car-salesman.jpg RS often just tries to tell you what you want!!!
  • 8.
  • 9. Exploration Find out what your interests are http://www.flickr.com/photos/luisorlando/2688548978
  • 10. !"#$%& 5607&"8&.+2329" #$%&'"(34&1,"-.*&%"/*01.0$2" #$%&'"()*&+,"-.*&%"/*01.0$2" !"
  • 11. What is Useful depends on the Objective
  • 12. Settings ) # !"#$%' ! "
  • 13. Not Useful X2 X1 limited information
  • 14. User Satisfaction Ratings positive negative X2X2 X2 X X X1 X1 user: not much variety, may get bored Drawback system: limited knowledge
  • 15. Coverage X2 X1 X1 Drawback user: exposed to items of no interest
  • 16. [Settles, 2009] Prediction Accuracy 33 333 333 22 222 222 11 111 111 00 000 000 -1 -1 -1-1 -1 -1-1 -1 -2 -2 -2-2 -2 -2-2 -2 -3 -3 -3-3 -3 -3-3 -3 -4-4 -4 -2-2 -2 000 222 444 -4-4 -4 -2-2 -2 000 222 444 -4-4 -4 -2-2 -2 000 222 444 (a)(a) (a) (b)(b) (b) (c)(c) (c) Actual Model Prediction Accuracy Prediction Accuracy Figure 2: 2: Anillustrative example(Random Sampling)learning. (a) A Atoydata set of o Figure 2: An illustrative exampleof ofpool-basedactive learning. (Active Learning) of Figure An illustrative exampleofpool-based active learning. (a) Atoy data set pool-based active (a) toy data set 400 instances, evenly sampled from two class Gaussians. The instances are 400 instances, evenly sampled from two class Gaussians. The instances are 400 instances, evenly sampled from two class Gaussians. The instances ar represented as aspointsin ina2D feature space. (b) A Alogisticregression model represented aspoints ina a2D feature space. (b) Alogistic regression model represented points 2D feature space. (b) logistic regression mode trained with 3030labeledinstances randomly drawn from the problem domain. trained with 30labeled instances randomly drawn from the problem domain. trained with labeled instances randomly drawn from the problem domain The line represents the decision boundary of of the classifier (70% accuracy).(c) The line represents the decision boundary ofthe classifier (70% accuracy). (c) The line represents the decision boundary the classifier (70% accuracy). (c A Alogisticregression model trained with 3030activelyqueried instances using Alogistic regression model trained with 30actively queried instances using logistic regression model trained with actively queried instances using uncertainty sampling (90%). uncertainty sampling (90%). uncertainty sampling (90%). Drawback user: exposed to items of no interest Figure 11illustrates the pool-based active learning cycle. A Alearnermay begin Figure 1illustrates the pool-based active learning cycle. Alearner may begin Figure illustrates the pool-based active learning cycle. learner may begin
  • 17. • allow user to explore his/her interests Usefulness/ Objectives • prediction accuracy for (user or item) • maximize profit • maximize number of visits / time spent • minimize acquisition cost (# of ratings, implicit/explicit) • max system utility • minimize uncertainty • make it fun for the user • etc. objectives may overlap
  • 18. Doesn’t have to Bothersome
  • 19. Active/Passive Learning Passive Learning training data request Active Learning supervised user training data learning approximated function
  • 20. AL Categories Item-based AL analyze items and select items that seem useful Model-based AL analyze model and select items that seem useful
  • 21. Item-based AL 3R Properties ) Represented by the existing training set? # !"#$%' e.g. (b) is already represented Representative ! of others? e.g.(a) is not " !"#$%& Results in achieving objective? e.g. (d) -> max coverage [Rubens & Kaplan, 2010]
  • 22. Item Properties • Popular [Rashid 2002] (rated by many users) • High Variance in ratings [Rashid 2002] item that people either like or hate • Best/Worst [Leino & Raiha 2007] ask user which items s/he likes most/least • Influential [Rubens & Sugiyama 2007] items on which ratings of many other items depend (Representative + Not Represented)
  • 23. Model-based AL Initial Improve Margin X1 Improve Orientation
  • 24. 1 Model-error AL # ##, %-' 3 /)$*"+$, . .,/')-'##,# 15 '#" ( '% - 3 2 !"#$"%&' 1( 0 0$"1 3 3 14 16 g : optimal function (in the sollution !"#$%&"'(!)*+, space) Model Error – C f : learned function constant and is ignored fi ’s: learned functions from a slightly different training set. Bias – B EG = B + V + C 2 Hard to estimate, but is assumed B = Ef (x) − g (x) to vanish (assymptotically). 2 V = f − Ef (x) 2 Variance – V C = (g (x) − f (x)) Estimate and minize. 10 / 20
  • 26. Model Complexity as the number of training points increases more complex models tend to fit data better
  • 27. Model Selection (a) under-fit (b) over-fit (c) appropriate fit Figure 8: Dependence between model complexity and accuracy.
  • 28. (a) under-fit Model-Points Dependency (b) over-fit (c) appropriate fit Figure 8: Dependence between model complexity and accuracy. Training input points that are good for learning one model, are not necessary good for t Training input points that are good for learning one model, are not necessary good for the other. min G(X (T rain) ). X (T rain)
  • 29. Black Box Settings May not have information/understanding about: ) # !"#$%' ! http://www.sps.ele.tue.nl/members/b.vries/research/research.html " !"#$%& Figure 1: Active Lear Model Points already possible from the training point in th
  • 30. ou et al., 2000, Schuurmans, 1997] yx Black Box Settings t is [Evgeniou et al., 2000, Schuurmans, 1997] f (x) yx yx f (x) 11101010101111 01001001010011 x yx 01010110100010 yx = β · x 10101010011010 10100101001010 x yx yx = β · x rences yx niou, M. Pontil,is too complex Regularization networks and su The system and T. Poggio. Referencesx y machines.constantly in Computational Mathematics, 13(1):1–50, (and is Advances changing) T. Evgeniou, M. Pontil, and yx T. Poggio. Regularization netwo urmans. A new y = β · x metric-based approach to model selection. In Procee vector machines. Advances in Computational Mathematics, 1 e.g. RS at Amazon, NetFlix: x Fourteenth National Conference on Artificial Intelligence (AAA 10,000’s lines of codes = β · x 552–558, 1997. yx D. Schuurmans. A new metric-based approach to model selection continuously changed by multiple teams Artificial Intellige of the Fourteenth National Conference on pages 552–558, 1997.
  • 31. “Information is a difference which makes a difference” Gregory Bateson (anthropologist) Select training points based on their expected influence on the output estimates Proposed Method Proposed Approach Proposed Method Proposed Approach (the only value accessible in Black-Box Settings). yt+1 yt+1 yt+1 yt+1 yt yt yt yt input index input index input index input index a)a) Adding training point causes many b) Adding training point causes few Adding training point causes many b) Adding training point causes few output estimates toto change. output estimates change. output estimates toto change. output estimates change.
  • 32. Validity of Assumptions (is change in the output estimates good?) Changes in the estimates of the output [Empirical] values with regards to a new training point: 0.4 0.35 0.3 a) the estimate of the true 0.25 output value deteriorates P (yt+1 ) 0.2 relatively infrequent (16%, expected deterioration is 0.15 small) b) the estimate of the true 0.1 output value improves 0.05 most frequent case (84%) 0 c) the estimate of the true y y output value is overshoot yt+1 18 / 20
  • 33. Criterion Accuracy 10 8 6 ∆G 4 High values of criterion 2 correspond to high improvements in accuracy 0 −2 0 0.5 1 1.5 2 2.5 3 3.5 2 yt − yt+1
  • 34. (δ ) = − + Interpretation (δ ) = ∗ β −β + ( δ − δ) ∑ ∈ ∗ δ − =( δ − δ β ) − + δ − ( + δ δ) ( + δ δ) = ( + ), =( δ − δ β ) δ δ β . δ
  • 35. Representative ∑ ∈ ∗ δ δ − = − ( + δ δ) ≥ ∑ ∗ δ − ∈ δ ≈ ∑ ∗ α ∑ δ ϕ ϕ . ∈ δ = + δ ∗ δ
  • 36. Not Represented ( δ − δ) = − ( + δ δ) − δ δ δ − δ ≈ α ∑ δ ϕ . = + δ {ϕ } =
  • 37. 9 Proposed A!optimal D!optimal Evaluation E!optimal 8 Transductive Random Optimal 7 Mean Squared Error 6 5 4 3 2 2 4 6 8 10 Training Set Size •system needs to be robust with respect to Limitations outliers •incremental re-training needs to be fast

Editor's Notes

  1. Thank you for Prof. Ricci for his kind invitation.\nToday I would like to connivence you that Active Learning is something of value, and that is very well suited for recommender systems in particular.\n\n\n\n\n\n
  2. \n
  3. \n
  4. \n
  5. \n
  6. It seems that DM may offers some relief, so why do we need to care about obtaining data of high quality?\n
  7. \n\nDeath Of A Pushy Salesman, Business Week, 2006\nhttp://www.businessweek.com/magazine/content/06_27/b3991084.htm\n
  8. Often it tries to sell you something, w/o trying to find out what you like.\nIt is a rather greedy approach trying to optimize immediate payoff.\nsome people may get turned off by bad recommendations and never come back to the system.\n\n\nWell, unless I am into cross dressing; these items are not of much use to me.\nAlthough, RS may have 50% success rate with the above strategy.\n\n
  9. The goal of recommender systems is to personalize recommendations.\nSo it really would not hurt to spend some time on trying to find out what your interests are. It may not pay off in the short term; but may pay off quite well in the long term.\n
  10. Luckily RS are starting to trying to learn more about their users.\n
  11. \n
  12. we consider overexagerated example, in which we can ask user to watch a movie and rate it\n
  13. let me start by giving an example of something that is not useful\n
  14. This strategy may be efficient in the short term; but may be not so much in the long term\n
  15. \n
  16. \n
  17. \n
  18. \n
  19. \n
  20. \n
  21. \n
  22. \n
  23. \n
  24. \n
  25. \n
  26. \n
  27. \n
  28. \n
  29. \n
  30. \n
  31. \n
  32. \n
  33. \n
  34. \n
  35. \n
  36. \n
  37. \n