SlideShare a Scribd company logo
1 of 42
Download to read offline
An Efficient Collaborative Recommender System
                   based on k -separability

       Georgios Alexandridis                       Georgios Siolas                  Andreas Stafylopatis

                               Department of Electrical and Computer Engineering
                                    National Technical University of Athens


            20th International Conference on Artificial Neural Networks
                                  (ICANN 2010)




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender           ICANN’10   1 / 16
Outline

  1    Current Trends in Recommender Systems
         Recommender Systems
         Design Issues

  2    Theoretical & Practical Aspects of our Contribution
         k-Separability
         System Architecture

  3    Evaluating our System
         Experiment
         Results
         Conclusions



Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   2 / 16
What are the Recommender Systems?


          Recommender Systems attempt to present information items (e.g.
          movies, music, books, news stories) that are likely to be of interest
          to the user.




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   3 / 16
What are the Recommender Systems?


          Recommender Systems attempt to present information items (e.g.
          movies, music, books, news stories) that are likely to be of interest
          to the user.
          Some implementations
                  Amazon
                          "Customers Who Bought This Item Also Bought"
                  Google News
                          "Recommended Stories"
                  Online Audio Broadcasters
                          last.fm
                          Pandora




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   3 / 16
Taxonomy of Recommender Systems

          Criterion: How are the predictions made?
                  Content-Based Recommenders
                          Locate "similar" items
                  Collaborative Recommenders
                          Find "like-minded" users
                  Hybrid Recommenders
                          Combination of the two




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   4 / 16
Taxonomy of Recommender Systems

          Criterion: How are the predictions made?
                  Content-Based Recommenders
                          Locate "similar" items
                  Collaborative Recommenders
                          Find "like-minded" users
                  Hybrid Recommenders
                          Combination of the two
          Which method is the best?
                  Open academic subject
                  Highly dependent on the application domain
                  We followed the Collaborative Recommender approach
                          Computationally simpler than the Hybrid approach
                          A user rating is more than a mere number; it is an aggregation of
                          various characteristics




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   4 / 16
Collaborative Recommender Systems

          Key Component: The User Ratings’ Matrix




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   5 / 16
Collaborative Recommender Systems

          Key Component: The User Ratings’ Matrix
          Ratings
                  Indicate how much a user likes an item
                          "like" "dislike"
                          1-star up to 5-stars




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   5 / 16
Collaborative Recommender Systems

          Key Component: The User Ratings’ Matrix
          Ratings
                  Indicate how much a user likes an item
                          "like" "dislike"
                          1-star up to 5-stars


                                                        I1      I2     I3     I4
                                               U1       5       3      2
                                               U2       3       5              2
                                               U3       1              2
                                               U4       2                      3




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   5 / 16
Collaborative Recommender Systems

          Key Component: The User Ratings’ Matrix
          Ratings
                  Indicate how much a user likes an item
                          "like" "dislike"
                          1-star up to 5-stars


                                                        I1      I2     I3     I4
                                               U1       5       3      2
                                               U2       3       5              2
                                               U3       1              2
                                               U4       2                      3

          Users become each other’s predictor
                  By locating positive and negative correlations among them.


Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   5 / 16
Challanges in Collaborative Recommender System
 Design
     1    The cold-start problem




     2    The sparsity problem




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   6 / 16
Challanges in Collaborative Recommender System
 Design
     1    The cold-start problem
                  Recommendations cannot be made unless a user has provided
                  some ratings
                  Solutions:
                          Recommend the most popular items
                          Explicity ask the user to rate some items prior to making
                          recommendations
     2    The sparsity problem




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   6 / 16
Challanges in Collaborative Recommender System
 Design
     1    The cold-start problem
                  Recommendations cannot be made unless a user has provided
                  some ratings
                  Solutions:
                          Recommend the most popular items
                          Explicity ask the user to rate some items prior to making
                          recommendations
     2    The sparsity problem
                  The ratings matrix is sparse
                          Empty elements: More than 90%
                  Solution: Dimensionality Reduction techniques
                          Singular Value Decomposition (SVD) yields good results
                  Pros: The resultant matrix is substantially smaller & densier
                  Cons: The dataset becomes very "noisy"
                          Most elements assume values that are marginally larger than zero
                  Conclusion: We are in need of techniques that can "learn" noisy
                  datasets!
Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   6 / 16
"Noisy" Datasets
          The added noise in the dataset hinders the discovery of patterns
          in data
                  Data clusters become difficult to separate




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   7 / 16
"Noisy" Datasets
          The added noise in the dataset hinders the discovery of patterns
          in data
                  Data clusters become difficult to separate
          Machine Learning techniques for highly non-separable datasets
                  Support Vector Machines, Radial Basis Functions


                  Evolutionary Algorithms




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   7 / 16
"Noisy" Datasets
          The added noise in the dataset hinders the discovery of patterns
          in data
                  Data clusters become difficult to separate
          Machine Learning techniques for highly non-separable datasets
                  Support Vector Machines, Radial Basis Functions
                          Computing the support vector (or estimating the surface . . . ) can be a
                          computationally intensive task
                  Evolutionary Algorithms
                          Meaningful Recommendations are not always guaranteed
                          (evolutionary dead-ends)




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender         ICANN’10   7 / 16
"Noisy" Datasets
            The added noise in the dataset hinders the discovery of patterns
            in data
                   Data clusters become difficult to separate
            Machine Learning techniques for highly non-separable datasets
                   Support Vector Machines, Radial Basis Functions
                           Computing the support vector (or estimating the surface . . . ) can be a
                           computationally intensive task
                   Evolutionary Algorithms
                           Meaningful Recommendations are not always guaranteed
                           (evolutionary dead-ends)
                   Our approach: Use k -separability!
                           Originally proposed by W. Duch1
                           Special case of the more general method of Projection Pursuit
                           Application to Feed-Forward ANNs
                           Extends linear separability of data clusters into k > 2 segments on
                           the discriminating hyperplane

      1
          W. Duch, K-separability. Lecture Notes in Computer Science 4131 (2006) 188-197
Alexandridis, Siolas, Stafylopatis (NTUA)    k -separability Collaborative Recommender         ICANN’10   7 / 16
Extending linear separability to 3-separability
 The 2-bit XOR problem
          A highly non-separable dataset
          It can be learned by a 2-layered perceptron, or ...
          ...by a single layer percpetron that implements k -separability!




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   8 / 16
Extending linear separability to 3-separability
 The 2-bit XOR problem
          A highly non-separable dataset
          It can be learned by a 2-layered perceptron, or ...
          ...by a single layer percpetron that implements k -separability!
          The activation function must partition the input space into 3
          distinct areas


                   1.2


                    1


                   0.8


                   0.6


                   0.4


                   0.2


                    0


                  −0.2
                    −0.2   0   0.2   0.4    0.6      0.8   1    1.2




                   (a) Input Space Partitioning

Alexandridis, Siolas, Stafylopatis (NTUA)         k -separability Collaborative Recommender   ICANN’10   8 / 16
Extending linear separability to 3-separability
 The 2-bit XOR problem
          A highly non-separable dataset
          It can be learned by a 2-layered perceptron, or ...
          ...by a single layer percpetron that implements k -separability!
          The activation function must partition the input space into 3
          distinct areas
                  Soft-Windowed Activation Functions

                   1.2

                                                                            1
                    1


                   0.8                                                     0.8



                   0.6
                                                                           0.6


                   0.4
                                                                           0.4
                   0.2

                                                                           0.2
                    0


                  −0.2                                                      0
                    −0.2   0   0.2   0.4    0.6      0.8   1    1.2         −2    −1     0    1   2    3    4




                   (a) Input Space Partitioning                        (b) Soft-Windowed              Activation
                                                                       Function
Alexandridis, Siolas, Stafylopatis (NTUA)         k -separability Collaborative Recommender                ICANN’10   8 / 16
Generalizing to k -separability

          Complex Datasets
                  Combine the output of two neurons (or more . . . )
                  e.g. A 5-separable dataset can be learned by the combined output
                  of 2 neurons




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   9 / 16
Generalizing to k -separability

          Complex Datasets
                  Combine the output of two neurons (or more . . . )
                  e.g. A 5-separable dataset can be learned by the combined output
                  of 2 neurons
          Generalization by Induction
                  m-neuron output ⇒ 2m + 1 regions on the discriminating line
                  ⇒ k = 2m + 1-separable dataset




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   9 / 16
Generalizing to k -separability

          Complex Datasets
                  Combine the output of two neurons (or more . . . )
                  e.g. A 5-separable dataset can be learned by the combined output
                  of 2 neurons
          Generalization by Induction
                  m-neuron output ⇒ 2m + 1 regions on the discriminating line
                  ⇒ k = 2m + 1-separable dataset
          Use in a Recommendation Engine
                  Create a 2-layered perceptron
                          n-sized input vector, m-sized hidden layer, single output layer
                          Overall, an n → m → 1 projection
                  Build a model (NN) for each user
                          Input: The ratings of the n "neighbors" of the target user on an item
                          he hasn’t evaluated
                          Output: A "score" for the unseen item



Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   9 / 16
Implementation Details


          The index of separability (k ) is not known a-priori
                  Setting k to a fixed value is of little help
                  It can lead to either overspecialization or to large training errors




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   10 / 16
Implementation Details


          The index of separability (k ) is not known a-priori
                  Setting k to a fixed value is of little help
                  It can lead to either overspecialization or to large training errors
          Therefore, k is a problem parameter: it has to be estimated




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   10 / 16
Implementation Details


          The index of separability (k ) is not known a-priori
                  Setting k to a fixed value is of little help
                  It can lead to either overspecialization or to large training errors
          Therefore, k is a problem parameter: it has to be estimated
          Dynamic Network Architecture




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   10 / 16
Implementation Details


          The index of separability (k ) is not known a-priori
                  Setting k to a fixed value is of little help
                  It can lead to either overspecialization or to large training errors
          Therefore, k is a problem parameter: it has to be estimated
          Dynamic Network Architecture
          Sparse user ratings’ matrix ⇒ small overall network size ⇒
          Constructive Network Algorithm




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   10 / 16
Implementation Details


          The index of separability (k ) is not known a-priori
                  Setting k to a fixed value is of little help
                  It can lead to either overspecialization or to large training errors
          Therefore, k is a problem parameter: it has to be estimated
          Dynamic Network Architecture
          Sparse user ratings’ matrix ⇒ small overall network size ⇒
          Constructive Network Algorithm
          Our constructive network algorithm was derived from the New
          Constructive Algorithm2



     2
       Islam MM et al. A new constructive algorithm for architectural and functional adaptation of artificial neural
 networks.
 IEEE Trans Syst Man Cybern B Cybern. 2009 Dec;39(6):1590-605
Alexandridis, Siolas, Stafylopatis (NTUA)    k -separability Collaborative Recommender                          ICANN’10   10 / 16
Constructive Network Algorithm
     1    Create a minimal architecture
     2    Train the network in two phases on the whole Training Set
     3    Iteratively add neurons in the hidden layer
                  Create new Training Sets based on the Classification Error
                  (Boosting Algorithm)
                  Only the newly added neuron’s weights are adapted; all other
                  remain "frozen"
     4    Stop network construction when the Classification Error stabilizes




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   11 / 16
Constructive Network Algorithm
     1    Create a minimal architecture
     2    Train the network in two phases on the whole Training Set
     3    Iteratively add neurons in the hidden layer
                  Create new Training Sets based on the Classification Error
                  (Boosting Algorithm)
                  Only the newly added neuron’s weights are adapted; all other
                  remain "frozen"
     4    Stop network construction when the Classification Error stabilizes

 Boosting Algorithm
          Inspired from AdaBoost and used in Network Training as a way of
          avoiding local minima
          Functionality
                  Unlearned samples ⇒ New neurons in the hidden layer ⇒ New
                  clusters discovered

Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   11 / 16
Our Collaborative Recommender System



          Input: The user ratings’ matrix and the target user




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   12 / 16
Our Collaborative Recommender System



          Input: The user ratings’ matrix and the target user
          Output: A model (NN) for the target user




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   12 / 16
Our Collaborative Recommender System



          Input: The user ratings’ matrix and the target user
          Output: A model (NN) for the target user
          Steps
             1    Pick from the user ratings’ matrix all the co-raters of the target user
             2    Compute the SVD of the co-raters matrix, retaining only the
                  non-zero Singular Values
             3    Partition the resultant matrix in 3 different sets; the Training Set, the
                  Validation Set and the Test Set
             4    Train a Constructive ANN Architecture (as discussed previously...)
             5    Compute the Performance Metrics on the Test Set




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   12 / 16
Experiment
 The MovieLens Database
        Contains the ratings of 943 users on
        1682 movies
        Sparse matrix (6.3% of non-zero
        elements)
                                                                                  140

        Each user has rated at least 20                                           120


        movies (106 on average), but. . .                                         100



        Discrete Exponential Distribution                                          80



                60% of all users have rated 100                                    60



                movies or less                                                     40



                40% of all users have rated 50                                     20



                movies or less                                                      0
                                                                                        0     100   200   300   400   500   600   700   800




        We followed a purely Collaborative                                                  (a) Rated items per user
        Strategy
                Taking into account only the user
                ratings’ and not any other
                demographic information
Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender                                 ICANN’10          13 / 16
Experiment
 Test Sets & Metrics


          Many users rate only a few movies. How would our system
          perform?


          How would our system perform on the average case?




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   14 / 16
Experiment
 Test Sets & Metrics


          Many users rate only a few movies. How would our system
          perform?
                  Group A: The few raters user group.
                          Contains all users who have rated 20-50 movies
          How would our system perform on the average case?




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   14 / 16
Experiment
 Test Sets & Metrics


          Many users rate only a few movies. How would our system
          perform?
                  Group A: The few raters user group.
                          Contains all users who have rated 20-50 movies
          How would our system perform on the average case?
                  Group B: The moderate raters user group.
                          Contains all users who have rated 51-100 movies
                          May be used in comparisons to other implementations




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   14 / 16
Experiment
 Test Sets & Metrics


          Many users rate only a few movies. How would our system
          perform?
                  Group A: The few raters user group.
                          Contains all users who have rated 20-50 movies
          How would our system perform on the average case?
                  Group B: The moderate raters user group.
                          Contains all users who have rated 51-100 movies
                          May be used in comparisons to other implementations
          We randomly picked 20 users from each group (40 users in total).
          The results were averaged for each group




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   14 / 16
Experiment
 Test Sets & Metrics


          Many users rate only a few movies. How would our system
          perform?
                  Group A: The few raters user group.
                          Contains all users who have rated 20-50 movies
          How would our system perform on the average case?
                  Group B: The moderate raters user group.
                          Contains all users who have rated 51-100 movies
                          May be used in comparisons to other implementations
          We randomly picked 20 users from each group (40 users in total).
          The results were averaged for each group
          Metrics
             1    Precision
             2    Recall
             3    F-measure


Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   14 / 16
Results

                                            Table: Performance Results
                      Methodology                                     Precision     Recall   F-measure
                      OurSystem: User Group B (moderate ratings)      75.38%        82.21%   79.37%
                      OurSystem: User Group A (few ratings)           74.07%        88.86%   78.97%
                      MovieMagician Clique-based                      74%           73%      74%
                      Movielens                                       66%           74%      70%
                      SVD/ANN                                         67.9%         69.7%    68.8%
                      MovieMagician Feature-based                     61%           75%      67%
                      MovieMagician Hybrid                            73%           56%      63%
                      Correlation                                     64.4%         46.8%    54.2%




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender                    ICANN’10   15 / 16
Results

                                            Table: Performance Results
                      Methodology                                     Precision     Recall   F-measure
                      OurSystem: User Group B (moderate ratings)      75.38%        82.21%   79.37%
                      OurSystem: User Group A (few ratings)           74.07%        88.86%   78.97%
                      MovieMagician Clique-based                      74%           73%      74%
                      Movielens                                       66%           74%      70%
                      SVD/ANN                                         67.9%         69.7%    68.8%
                      MovieMagician Feature-based                     61%           75%      67%
                      MovieMagician Hybrid                            73%           56%      63%
                      Correlation                                     64.4%         46.8%    54.2%




          Observations
                  Our system achieves good results in both usergroups and
                  outperforms the other approaches
                  Recall is higher in the few raters group because they seem to rate
                  only the movies they like
                          Therefore, the recommender cannot generalize



Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender                    ICANN’10   15 / 16
Conclusions


          We have presented a complete Collaborative Recommender
          System that is specifically fit for those cases where information is
          limited
          Our system achieves a good trade-off between Precision and
          Recall, a basic requirement for Recommenders
          This is due to the fact that k -separability is able to uncover
          complex statistical dependencies (positive and negative)
          We don’t need to filter the neighborhood of the target user as other
          systems do (e.g. by using the Pearson Correlation Formula).
                  All "neighbors" are considered
                  Extremely useful in cases of sparse datasets




Alexandridis, Siolas, Stafylopatis (NTUA)   k -separability Collaborative Recommender   ICANN’10   16 / 16

More Related Content

Viewers also liked

Experimentació peces d'escuma
Experimentació peces d'escumaExperimentació peces d'escuma
Experimentació peces d'escumanortiz3
 
Racons de llengua p4 b
Racons de llengua p4 bRacons de llengua p4 b
Racons de llengua p4 bnortiz3
 
Racons de mates p4 b
Racons de mates p4 bRacons de mates p4 b
Racons de mates p4 bnortiz3
 
Improving Social Recommendations by applying a Personalized Item Clustering P...
Improving Social Recommendations by applying a Personalized Item Clustering P...Improving Social Recommendations by applying a Personalized Item Clustering P...
Improving Social Recommendations by applying a Personalized Item Clustering P...Γιώργος Αλεξανδρίδης
 
Racons de mates
Racons de matesRacons de mates
Racons de matesnortiz3
 
Racons de llengua
Racons de llenguaRacons de llengua
Racons de llenguanortiz3
 

Viewers also liked (8)

Experimentació peces d'escuma
Experimentació peces d'escumaExperimentació peces d'escuma
Experimentació peces d'escuma
 
Power point
Power pointPower point
Power point
 
Racons de llengua p4 b
Racons de llengua p4 bRacons de llengua p4 b
Racons de llengua p4 b
 
Daily routine
Daily routineDaily routine
Daily routine
 
Racons de mates p4 b
Racons de mates p4 bRacons de mates p4 b
Racons de mates p4 b
 
Improving Social Recommendations by applying a Personalized Item Clustering P...
Improving Social Recommendations by applying a Personalized Item Clustering P...Improving Social Recommendations by applying a Personalized Item Clustering P...
Improving Social Recommendations by applying a Personalized Item Clustering P...
 
Racons de mates
Racons de matesRacons de mates
Racons de mates
 
Racons de llengua
Racons de llenguaRacons de llengua
Racons de llengua
 

Similar to k-Separability Presentation

Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender SystemsMarcel Kurovski
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systemsinovex GmbH
 
8 better ways of doing your engineering project
8 better ways of doing your engineering project8 better ways of doing your engineering project
8 better ways of doing your engineering projecttalkingkarthik
 
Software Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceSoftware Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceCarole Goble
 
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsAnita de Waard
 
ScienceSoft: Open Software for Open Science
ScienceSoft: Open Software for Open ScienceScienceSoft: Open Software for Open Science
ScienceSoft: Open Software for Open ScienceSoftwarePractice
 
Designing an effective information architecture
Designing an effective information architectureDesigning an effective information architecture
Designing an effective information architectureoptimalworkshop
 
Immersive Recommendation Workshop, NYC Media Lab'17
Immersive Recommendation Workshop, NYC Media Lab'17Immersive Recommendation Workshop, NYC Media Lab'17
Immersive Recommendation Workshop, NYC Media Lab'17Longqi Yang
 
20120419 linkedopendataandteamsciencemcguinnesschicago
20120419 linkedopendataandteamsciencemcguinnesschicago20120419 linkedopendataandteamsciencemcguinnesschicago
20120419 linkedopendataandteamsciencemcguinnesschicagoDeborah McGuinness
 
Evaluation Insights to Key Processes of Digital Repositories
Evaluation Insights to Key Processes of Digital RepositoriesEvaluation Insights to Key Processes of Digital Repositories
Evaluation Insights to Key Processes of Digital RepositoriesGiannis Tsakonas
 
Cat Herding and Community Gardens: Practical e-Science Project Management
Cat Herding and Community Gardens: Practical e-Science Project ManagementCat Herding and Community Gardens: Practical e-Science Project Management
Cat Herding and Community Gardens: Practical e-Science Project ManagementNeil Chue Hong
 
AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...
AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...
AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...EarthCube
 
Facets and Pivoting for Flexible and Usable Linked Data Exploration
Facets and Pivoting for Flexible and Usable Linked Data ExplorationFacets and Pivoting for Flexible and Usable Linked Data Exploration
Facets and Pivoting for Flexible and Usable Linked Data ExplorationRoberto García
 
FAIR Assessment for Repositories and Researchers
FAIR Assessment for Repositories and Researchers FAIR Assessment for Repositories and Researchers
FAIR Assessment for Repositories and Researchers EOSCpilot .eu
 
Permissionless Innovation BIM Trichy 2 Feb 2012
Permissionless Innovation BIM Trichy 2 Feb 2012Permissionless Innovation BIM Trichy 2 Feb 2012
Permissionless Innovation BIM Trichy 2 Feb 2012Olivier MJ Crépin-Leblond
 
MONAI and Open Science for Medical Imaging Deep Learning: SIPAIM 2020
MONAI and Open Science for Medical Imaging Deep Learning: SIPAIM 2020MONAI and Open Science for Medical Imaging Deep Learning: SIPAIM 2020
MONAI and Open Science for Medical Imaging Deep Learning: SIPAIM 2020Stephen Aylward
 

Similar to k-Separability Presentation (20)

Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
 
8 better ways of doing your engineering project
8 better ways of doing your engineering project8 better ways of doing your engineering project
8 better ways of doing your engineering project
 
Software Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceSoftware Sustainability: Better Software Better Science
Software Sustainability: Better Software Better Science
 
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
 
ScienceSoft: Open Software for Open Science
ScienceSoft: Open Software for Open ScienceScienceSoft: Open Software for Open Science
ScienceSoft: Open Software for Open Science
 
Designing an effective information architecture
Designing an effective information architectureDesigning an effective information architecture
Designing an effective information architecture
 
Immersive Recommendation Workshop, NYC Media Lab'17
Immersive Recommendation Workshop, NYC Media Lab'17Immersive Recommendation Workshop, NYC Media Lab'17
Immersive Recommendation Workshop, NYC Media Lab'17
 
20120419 linkedopendataandteamsciencemcguinnesschicago
20120419 linkedopendataandteamsciencemcguinnesschicago20120419 linkedopendataandteamsciencemcguinnesschicago
20120419 linkedopendataandteamsciencemcguinnesschicago
 
A View on eScience
A View on eScienceA View on eScience
A View on eScience
 
Evaluation Insights to Key Processes of Digital Repositories
Evaluation Insights to Key Processes of Digital RepositoriesEvaluation Insights to Key Processes of Digital Repositories
Evaluation Insights to Key Processes of Digital Repositories
 
Cat Herding and Community Gardens: Practical e-Science Project Management
Cat Herding and Community Gardens: Practical e-Science Project ManagementCat Herding and Community Gardens: Practical e-Science Project Management
Cat Herding and Community Gardens: Practical e-Science Project Management
 
AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...
AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...
AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...
 
TVOT June 2012
TVOT June 2012TVOT June 2012
TVOT June 2012
 
Bruce, "Investing in a Time of Disruptive Change"
Bruce, "Investing in a Time of Disruptive Change"Bruce, "Investing in a Time of Disruptive Change"
Bruce, "Investing in a Time of Disruptive Change"
 
Facets and Pivoting for Flexible and Usable Linked Data Exploration
Facets and Pivoting for Flexible and Usable Linked Data ExplorationFacets and Pivoting for Flexible and Usable Linked Data Exploration
Facets and Pivoting for Flexible and Usable Linked Data Exploration
 
Data mining weka
Data mining wekaData mining weka
Data mining weka
 
FAIR Assessment for Repositories and Researchers
FAIR Assessment for Repositories and Researchers FAIR Assessment for Repositories and Researchers
FAIR Assessment for Repositories and Researchers
 
Permissionless Innovation BIM Trichy 2 Feb 2012
Permissionless Innovation BIM Trichy 2 Feb 2012Permissionless Innovation BIM Trichy 2 Feb 2012
Permissionless Innovation BIM Trichy 2 Feb 2012
 
MONAI and Open Science for Medical Imaging Deep Learning: SIPAIM 2020
MONAI and Open Science for Medical Imaging Deep Learning: SIPAIM 2020MONAI and Open Science for Medical Imaging Deep Learning: SIPAIM 2020
MONAI and Open Science for Medical Imaging Deep Learning: SIPAIM 2020
 

Recently uploaded

THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxChelloAnnAsuncion2
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxnelietumpap1
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 

Recently uploaded (20)

THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptx
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 

k-Separability Presentation

  • 1. An Efficient Collaborative Recommender System based on k -separability Georgios Alexandridis Georgios Siolas Andreas Stafylopatis Department of Electrical and Computer Engineering National Technical University of Athens 20th International Conference on Artificial Neural Networks (ICANN 2010) Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 1 / 16
  • 2. Outline 1 Current Trends in Recommender Systems Recommender Systems Design Issues 2 Theoretical & Practical Aspects of our Contribution k-Separability System Architecture 3 Evaluating our System Experiment Results Conclusions Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 2 / 16
  • 3. What are the Recommender Systems? Recommender Systems attempt to present information items (e.g. movies, music, books, news stories) that are likely to be of interest to the user. Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 3 / 16
  • 4. What are the Recommender Systems? Recommender Systems attempt to present information items (e.g. movies, music, books, news stories) that are likely to be of interest to the user. Some implementations Amazon "Customers Who Bought This Item Also Bought" Google News "Recommended Stories" Online Audio Broadcasters last.fm Pandora Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 3 / 16
  • 5. Taxonomy of Recommender Systems Criterion: How are the predictions made? Content-Based Recommenders Locate "similar" items Collaborative Recommenders Find "like-minded" users Hybrid Recommenders Combination of the two Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 4 / 16
  • 6. Taxonomy of Recommender Systems Criterion: How are the predictions made? Content-Based Recommenders Locate "similar" items Collaborative Recommenders Find "like-minded" users Hybrid Recommenders Combination of the two Which method is the best? Open academic subject Highly dependent on the application domain We followed the Collaborative Recommender approach Computationally simpler than the Hybrid approach A user rating is more than a mere number; it is an aggregation of various characteristics Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 4 / 16
  • 7. Collaborative Recommender Systems Key Component: The User Ratings’ Matrix Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 5 / 16
  • 8. Collaborative Recommender Systems Key Component: The User Ratings’ Matrix Ratings Indicate how much a user likes an item "like" "dislike" 1-star up to 5-stars Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 5 / 16
  • 9. Collaborative Recommender Systems Key Component: The User Ratings’ Matrix Ratings Indicate how much a user likes an item "like" "dislike" 1-star up to 5-stars I1 I2 I3 I4 U1 5 3 2 U2 3 5 2 U3 1 2 U4 2 3 Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 5 / 16
  • 10. Collaborative Recommender Systems Key Component: The User Ratings’ Matrix Ratings Indicate how much a user likes an item "like" "dislike" 1-star up to 5-stars I1 I2 I3 I4 U1 5 3 2 U2 3 5 2 U3 1 2 U4 2 3 Users become each other’s predictor By locating positive and negative correlations among them. Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 5 / 16
  • 11. Challanges in Collaborative Recommender System Design 1 The cold-start problem 2 The sparsity problem Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 6 / 16
  • 12. Challanges in Collaborative Recommender System Design 1 The cold-start problem Recommendations cannot be made unless a user has provided some ratings Solutions: Recommend the most popular items Explicity ask the user to rate some items prior to making recommendations 2 The sparsity problem Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 6 / 16
  • 13. Challanges in Collaborative Recommender System Design 1 The cold-start problem Recommendations cannot be made unless a user has provided some ratings Solutions: Recommend the most popular items Explicity ask the user to rate some items prior to making recommendations 2 The sparsity problem The ratings matrix is sparse Empty elements: More than 90% Solution: Dimensionality Reduction techniques Singular Value Decomposition (SVD) yields good results Pros: The resultant matrix is substantially smaller & densier Cons: The dataset becomes very "noisy" Most elements assume values that are marginally larger than zero Conclusion: We are in need of techniques that can "learn" noisy datasets! Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 6 / 16
  • 14. "Noisy" Datasets The added noise in the dataset hinders the discovery of patterns in data Data clusters become difficult to separate Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 7 / 16
  • 15. "Noisy" Datasets The added noise in the dataset hinders the discovery of patterns in data Data clusters become difficult to separate Machine Learning techniques for highly non-separable datasets Support Vector Machines, Radial Basis Functions Evolutionary Algorithms Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 7 / 16
  • 16. "Noisy" Datasets The added noise in the dataset hinders the discovery of patterns in data Data clusters become difficult to separate Machine Learning techniques for highly non-separable datasets Support Vector Machines, Radial Basis Functions Computing the support vector (or estimating the surface . . . ) can be a computationally intensive task Evolutionary Algorithms Meaningful Recommendations are not always guaranteed (evolutionary dead-ends) Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 7 / 16
  • 17. "Noisy" Datasets The added noise in the dataset hinders the discovery of patterns in data Data clusters become difficult to separate Machine Learning techniques for highly non-separable datasets Support Vector Machines, Radial Basis Functions Computing the support vector (or estimating the surface . . . ) can be a computationally intensive task Evolutionary Algorithms Meaningful Recommendations are not always guaranteed (evolutionary dead-ends) Our approach: Use k -separability! Originally proposed by W. Duch1 Special case of the more general method of Projection Pursuit Application to Feed-Forward ANNs Extends linear separability of data clusters into k > 2 segments on the discriminating hyperplane 1 W. Duch, K-separability. Lecture Notes in Computer Science 4131 (2006) 188-197 Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 7 / 16
  • 18. Extending linear separability to 3-separability The 2-bit XOR problem A highly non-separable dataset It can be learned by a 2-layered perceptron, or ... ...by a single layer percpetron that implements k -separability! Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 8 / 16
  • 19. Extending linear separability to 3-separability The 2-bit XOR problem A highly non-separable dataset It can be learned by a 2-layered perceptron, or ... ...by a single layer percpetron that implements k -separability! The activation function must partition the input space into 3 distinct areas 1.2 1 0.8 0.6 0.4 0.2 0 −0.2 −0.2 0 0.2 0.4 0.6 0.8 1 1.2 (a) Input Space Partitioning Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 8 / 16
  • 20. Extending linear separability to 3-separability The 2-bit XOR problem A highly non-separable dataset It can be learned by a 2-layered perceptron, or ... ...by a single layer percpetron that implements k -separability! The activation function must partition the input space into 3 distinct areas Soft-Windowed Activation Functions 1.2 1 1 0.8 0.8 0.6 0.6 0.4 0.4 0.2 0.2 0 −0.2 0 −0.2 0 0.2 0.4 0.6 0.8 1 1.2 −2 −1 0 1 2 3 4 (a) Input Space Partitioning (b) Soft-Windowed Activation Function Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 8 / 16
  • 21. Generalizing to k -separability Complex Datasets Combine the output of two neurons (or more . . . ) e.g. A 5-separable dataset can be learned by the combined output of 2 neurons Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 9 / 16
  • 22. Generalizing to k -separability Complex Datasets Combine the output of two neurons (or more . . . ) e.g. A 5-separable dataset can be learned by the combined output of 2 neurons Generalization by Induction m-neuron output ⇒ 2m + 1 regions on the discriminating line ⇒ k = 2m + 1-separable dataset Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 9 / 16
  • 23. Generalizing to k -separability Complex Datasets Combine the output of two neurons (or more . . . ) e.g. A 5-separable dataset can be learned by the combined output of 2 neurons Generalization by Induction m-neuron output ⇒ 2m + 1 regions on the discriminating line ⇒ k = 2m + 1-separable dataset Use in a Recommendation Engine Create a 2-layered perceptron n-sized input vector, m-sized hidden layer, single output layer Overall, an n → m → 1 projection Build a model (NN) for each user Input: The ratings of the n "neighbors" of the target user on an item he hasn’t evaluated Output: A "score" for the unseen item Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 9 / 16
  • 24. Implementation Details The index of separability (k ) is not known a-priori Setting k to a fixed value is of little help It can lead to either overspecialization or to large training errors Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 10 / 16
  • 25. Implementation Details The index of separability (k ) is not known a-priori Setting k to a fixed value is of little help It can lead to either overspecialization or to large training errors Therefore, k is a problem parameter: it has to be estimated Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 10 / 16
  • 26. Implementation Details The index of separability (k ) is not known a-priori Setting k to a fixed value is of little help It can lead to either overspecialization or to large training errors Therefore, k is a problem parameter: it has to be estimated Dynamic Network Architecture Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 10 / 16
  • 27. Implementation Details The index of separability (k ) is not known a-priori Setting k to a fixed value is of little help It can lead to either overspecialization or to large training errors Therefore, k is a problem parameter: it has to be estimated Dynamic Network Architecture Sparse user ratings’ matrix ⇒ small overall network size ⇒ Constructive Network Algorithm Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 10 / 16
  • 28. Implementation Details The index of separability (k ) is not known a-priori Setting k to a fixed value is of little help It can lead to either overspecialization or to large training errors Therefore, k is a problem parameter: it has to be estimated Dynamic Network Architecture Sparse user ratings’ matrix ⇒ small overall network size ⇒ Constructive Network Algorithm Our constructive network algorithm was derived from the New Constructive Algorithm2 2 Islam MM et al. A new constructive algorithm for architectural and functional adaptation of artificial neural networks. IEEE Trans Syst Man Cybern B Cybern. 2009 Dec;39(6):1590-605 Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 10 / 16
  • 29. Constructive Network Algorithm 1 Create a minimal architecture 2 Train the network in two phases on the whole Training Set 3 Iteratively add neurons in the hidden layer Create new Training Sets based on the Classification Error (Boosting Algorithm) Only the newly added neuron’s weights are adapted; all other remain "frozen" 4 Stop network construction when the Classification Error stabilizes Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 11 / 16
  • 30. Constructive Network Algorithm 1 Create a minimal architecture 2 Train the network in two phases on the whole Training Set 3 Iteratively add neurons in the hidden layer Create new Training Sets based on the Classification Error (Boosting Algorithm) Only the newly added neuron’s weights are adapted; all other remain "frozen" 4 Stop network construction when the Classification Error stabilizes Boosting Algorithm Inspired from AdaBoost and used in Network Training as a way of avoiding local minima Functionality Unlearned samples ⇒ New neurons in the hidden layer ⇒ New clusters discovered Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 11 / 16
  • 31. Our Collaborative Recommender System Input: The user ratings’ matrix and the target user Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 12 / 16
  • 32. Our Collaborative Recommender System Input: The user ratings’ matrix and the target user Output: A model (NN) for the target user Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 12 / 16
  • 33. Our Collaborative Recommender System Input: The user ratings’ matrix and the target user Output: A model (NN) for the target user Steps 1 Pick from the user ratings’ matrix all the co-raters of the target user 2 Compute the SVD of the co-raters matrix, retaining only the non-zero Singular Values 3 Partition the resultant matrix in 3 different sets; the Training Set, the Validation Set and the Test Set 4 Train a Constructive ANN Architecture (as discussed previously...) 5 Compute the Performance Metrics on the Test Set Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 12 / 16
  • 34. Experiment The MovieLens Database Contains the ratings of 943 users on 1682 movies Sparse matrix (6.3% of non-zero elements) 140 Each user has rated at least 20 120 movies (106 on average), but. . . 100 Discrete Exponential Distribution 80 60% of all users have rated 100 60 movies or less 40 40% of all users have rated 50 20 movies or less 0 0 100 200 300 400 500 600 700 800 We followed a purely Collaborative (a) Rated items per user Strategy Taking into account only the user ratings’ and not any other demographic information Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 13 / 16
  • 35. Experiment Test Sets & Metrics Many users rate only a few movies. How would our system perform? How would our system perform on the average case? Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 14 / 16
  • 36. Experiment Test Sets & Metrics Many users rate only a few movies. How would our system perform? Group A: The few raters user group. Contains all users who have rated 20-50 movies How would our system perform on the average case? Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 14 / 16
  • 37. Experiment Test Sets & Metrics Many users rate only a few movies. How would our system perform? Group A: The few raters user group. Contains all users who have rated 20-50 movies How would our system perform on the average case? Group B: The moderate raters user group. Contains all users who have rated 51-100 movies May be used in comparisons to other implementations Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 14 / 16
  • 38. Experiment Test Sets & Metrics Many users rate only a few movies. How would our system perform? Group A: The few raters user group. Contains all users who have rated 20-50 movies How would our system perform on the average case? Group B: The moderate raters user group. Contains all users who have rated 51-100 movies May be used in comparisons to other implementations We randomly picked 20 users from each group (40 users in total). The results were averaged for each group Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 14 / 16
  • 39. Experiment Test Sets & Metrics Many users rate only a few movies. How would our system perform? Group A: The few raters user group. Contains all users who have rated 20-50 movies How would our system perform on the average case? Group B: The moderate raters user group. Contains all users who have rated 51-100 movies May be used in comparisons to other implementations We randomly picked 20 users from each group (40 users in total). The results were averaged for each group Metrics 1 Precision 2 Recall 3 F-measure Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 14 / 16
  • 40. Results Table: Performance Results Methodology Precision Recall F-measure OurSystem: User Group B (moderate ratings) 75.38% 82.21% 79.37% OurSystem: User Group A (few ratings) 74.07% 88.86% 78.97% MovieMagician Clique-based 74% 73% 74% Movielens 66% 74% 70% SVD/ANN 67.9% 69.7% 68.8% MovieMagician Feature-based 61% 75% 67% MovieMagician Hybrid 73% 56% 63% Correlation 64.4% 46.8% 54.2% Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 15 / 16
  • 41. Results Table: Performance Results Methodology Precision Recall F-measure OurSystem: User Group B (moderate ratings) 75.38% 82.21% 79.37% OurSystem: User Group A (few ratings) 74.07% 88.86% 78.97% MovieMagician Clique-based 74% 73% 74% Movielens 66% 74% 70% SVD/ANN 67.9% 69.7% 68.8% MovieMagician Feature-based 61% 75% 67% MovieMagician Hybrid 73% 56% 63% Correlation 64.4% 46.8% 54.2% Observations Our system achieves good results in both usergroups and outperforms the other approaches Recall is higher in the few raters group because they seem to rate only the movies they like Therefore, the recommender cannot generalize Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 15 / 16
  • 42. Conclusions We have presented a complete Collaborative Recommender System that is specifically fit for those cases where information is limited Our system achieves a good trade-off between Precision and Recall, a basic requirement for Recommenders This is due to the fact that k -separability is able to uncover complex statistical dependencies (positive and negative) We don’t need to filter the neighborhood of the target user as other systems do (e.g. by using the Pearson Correlation Formula). All "neighbors" are considered Extremely useful in cases of sparse datasets Alexandridis, Siolas, Stafylopatis (NTUA) k -separability Collaborative Recommender ICANN’10 16 / 16