A presentation about scientific recommender systems from the seminarphase of the project group PUSHPIN at the University of Paderborn
http://pgpushpin.wordpress.com/
Recommendation Engines for Scientific LiteratureKris Jack
I gave this talk at the Workshop on Recommender Enginer@TUG (http://bit.ly/yuxrAM) on 2012/12/19.
It presents a selection of algorithms and experimental data that are commonly used in recommending scientific literature. Real-world results from Mendeley's article recommendation system are also presented.
The work presented here has been partially funded by the European Commission as part of the TEAM IAPP project (grant no. 251514) within the FP7 People Programme (Marie Curie).
Mendeley: Recommendation Systems for Academic LiteratureKris Jack
I gave this talk to an MSc class about Semantic Technologies at the Technical University of Graz (TUG) on 2012/01/12.
It presents what recommendation systems are and how they are often used before delving into how they are used at Mendeley. Real-world results from Mendeley’s article recommendation system are also presented.
The work presented here has been partially funded by the European Commission as part of the TEAM IAPP project (grant no. 251514) within the FP7 People Programme (Marie Curie).
We looked at the data. Here’s a breakdown of some key statistics about the nation’s incoming presidents’ addresses, how long they spoke, how well, and more.
This document discusses how emojis, emoticons, and text speak can be used to teach students. It provides background on the origins of emoticons in 1982 as ways to convey tone and feelings in text communications. It then suggests that with text speak and emojis, students can translate, decode, summarize, play with language, and add emotion to language. A number of websites and apps that can be used for emoji-related activities, lessons, and discussions are also listed.
Study: The Future of VR, AR and Self-Driving CarsLinkedIn
We asked LinkedIn members worldwide about their levels of interest in the latest wave of technology: whether they’re using wearables, and whether they intend to buy self-driving cars and VR headsets as they become available. We asked them too about their attitudes to technology and to the growing role of Artificial Intelligence (AI) in the devices that they use. The answers were fascinating – and in many cases, surprising.
This SlideShare explores the full results of this study, including detailed market-by-market breakdowns of intention levels for each technology – and how attitudes change with age, location and seniority level. If you’re marketing a tech brand – or planning to use VR and wearables to reach a professional audience – then these are insights you won’t want to miss.
Artificial intelligence (AI) is everywhere, promising self-driving cars, medical breakthroughs, and new ways of working. But how do you separate hype from reality? How can your company apply AI to solve real business problems?
Here’s what AI learnings your business should keep in mind for 2017.
Information filtering is the process of monitoring large amounts of dynamically generated information and identifying the subset of information likely to be of interest to a user based on their information needs. It represents the user's interests and identifies only pieces of information they would find interesting. There are three main categories of information filtering: collaborative filtering which uses recommendations from other users; content-based filtering which uses a comparison between item content and user profiles; and hybrid filtering which combines aspects of collaborative and content-based filtering. Feedback techniques can also be used to continually update and improve filtering.
Recommendation Engines for Scientific LiteratureKris Jack
I gave this talk at the Workshop on Recommender Enginer@TUG (http://bit.ly/yuxrAM) on 2012/12/19.
It presents a selection of algorithms and experimental data that are commonly used in recommending scientific literature. Real-world results from Mendeley's article recommendation system are also presented.
The work presented here has been partially funded by the European Commission as part of the TEAM IAPP project (grant no. 251514) within the FP7 People Programme (Marie Curie).
Mendeley: Recommendation Systems for Academic LiteratureKris Jack
I gave this talk to an MSc class about Semantic Technologies at the Technical University of Graz (TUG) on 2012/01/12.
It presents what recommendation systems are and how they are often used before delving into how they are used at Mendeley. Real-world results from Mendeley’s article recommendation system are also presented.
The work presented here has been partially funded by the European Commission as part of the TEAM IAPP project (grant no. 251514) within the FP7 People Programme (Marie Curie).
We looked at the data. Here’s a breakdown of some key statistics about the nation’s incoming presidents’ addresses, how long they spoke, how well, and more.
This document discusses how emojis, emoticons, and text speak can be used to teach students. It provides background on the origins of emoticons in 1982 as ways to convey tone and feelings in text communications. It then suggests that with text speak and emojis, students can translate, decode, summarize, play with language, and add emotion to language. A number of websites and apps that can be used for emoji-related activities, lessons, and discussions are also listed.
Study: The Future of VR, AR and Self-Driving CarsLinkedIn
We asked LinkedIn members worldwide about their levels of interest in the latest wave of technology: whether they’re using wearables, and whether they intend to buy self-driving cars and VR headsets as they become available. We asked them too about their attitudes to technology and to the growing role of Artificial Intelligence (AI) in the devices that they use. The answers were fascinating – and in many cases, surprising.
This SlideShare explores the full results of this study, including detailed market-by-market breakdowns of intention levels for each technology – and how attitudes change with age, location and seniority level. If you’re marketing a tech brand – or planning to use VR and wearables to reach a professional audience – then these are insights you won’t want to miss.
Artificial intelligence (AI) is everywhere, promising self-driving cars, medical breakthroughs, and new ways of working. But how do you separate hype from reality? How can your company apply AI to solve real business problems?
Here’s what AI learnings your business should keep in mind for 2017.
Information filtering is the process of monitoring large amounts of dynamically generated information and identifying the subset of information likely to be of interest to a user based on their information needs. It represents the user's interests and identifies only pieces of information they would find interesting. There are three main categories of information filtering: collaborative filtering which uses recommendations from other users; content-based filtering which uses a comparison between item content and user profiles; and hybrid filtering which combines aspects of collaborative and content-based filtering. Feedback techniques can also be used to continually update and improve filtering.
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...IJCSEA Journal
This document compares collaborative filtering algorithms with various similarity measures for movie recommendations. It summarizes User-based and Item-based collaborative filtering algorithms implemented in the Apache Mahout framework. Various similarity measures used in collaborative filtering are discussed, including Euclidean distance, Log Likelihood Ratio, Pearson correlation, Tanimoto coefficient, Uncentered Cosine, and Spearman correlation. The document concludes that Item-based algorithms typically provide better results than User-based algorithms for movie recommendations.
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...IJCSEA Journal
Collaborative Filtering is generally used as a recommender system. There is enormous growth in the amount of data in web. These recommender systems help users to select products on the web, which is the most suitable for them. Collaborative filtering-systems collect user’s previous information about an item such as movies, music, ideas, and so on. For recommending the best item, there are many algorithms, which are based on different approaches. The most known algorithms are User-based and Item-based algorithms. Experiments show that Item-based algorithms give better results than User-based algorithms. The aim of this paper isto compare User-based and Item-based Collaborative Filtering Algorithms with many different similarity indexes with their accuracy and performance. We provide an approach to determine the best algorithm, which give the most accurate recommendation by using statistical accuracy metrics. The results are compared the User-based and Item-based algorithms with movie recommendation data set.
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...PyData
This document discusses building a hybrid recommendation engine using Python to recommend Pubmed documents. It begins with an introduction to predictive analytics and recommender systems. Different types of recommender systems are described, including knowledge-based, content-based, collaborative filtering, and hybrid models. The document then outlines a hybrid model that performs content-based filtering on Pubmed documents using vector space modeling and weights documents, before applying collaborative filtering using the Python-recsys library to filter and recommend documents. Finally, it demonstrates the hybrid model on a Pubmed dataset and compares its performance to using Python-recsys alone.
The document discusses a content-based recommendation system with sentiment analysis. It provides an overview of recommendation systems and their importance. The objectives are to provide personalized recommendations to users based on their preferences using information filtering techniques. Existing systems faced issues like scalability, sparsity, and cold starts. The proposed system is a hybrid approach that combines item-based collaborative filtering with user clustering to make predictions. It will be scalable while addressing cold starts. Tools like Flask, JavaScript, Python are used. Cosine similarity and sentiment analysis techniques are also discussed. The conclusion is that the proposed system can recommend less popular items and future work could include other factors in recommendations.
Develop a robust and effective book recommendation system that provides personalized suggestions to users, enhancing their reading experience and promoting diverse literary exploration.
International Journal of Engineering Research and DevelopmentIJERD Editor
This document summarizes and compares different recommender system techniques and graph processing platforms. It discusses five main recommender system categories: collaborative filtering, content-based, demographic, utility-based, and knowledge-based. It also outlines six popular graph processing platforms: Hadoop, YARN, Stratosphere, Giraph, GraphLab, and Neo4j. The document provides an overview of the programming models used by these platforms, particularly MapReduce.
DCCR is a deep collaborative conjunctive recommender model for rating prediction tasks. It is a hybrid neural network architecture consisting of an embedding system and neural network. The embedding system extracts latent features of users and items from raw ratings data. The neural network then merges the user and item features and extracts higher-level interaction features for rating prediction. Experiments on two datasets show DCCR achieves better accuracy than other methods by leveraging deep feature extraction and fusion while only using raw rating data. Future work includes exploring other similarity measures to address issues with sparse rating data.
Nesta palestra no evento GDG DataFest, apresentei uma introdução prática sobre as principais técnicas de sistemas de recomendação, incluindo arquiteturas recentes baseadas em Deep Learning. Foram apresentados exemplos utilizando Python, TensorFlow e Google ML Engine, e fornecidos datasets para exercitarmos um cenário de recomendação de artigos e notícias.
This document provides an overview of a module on recommender systems for a digital library curriculum. The module aims to teach students about different recommender system approaches, including content-based, collaborative filtering, and hybrid systems. It also covers challenges in recommender system design and the use of user profiles. The key topics covered include recommender system types and techniques, challenges in collaborative filtering, and modeling user profiles both explicitly and implicitly.
Digital Trails Dave King 1 5 10 Part 2 D3Dave King
Collective intelligence is defined as the intelligence that emerges from the interactions and contributions of users. It can be harnessed through allowing user interactions and contributions, aggregating what is learned about users through models, and using those models to recommend relevant content. Collective intelligence comes from both structured data like ratings and purchases, as well as unstructured data like reviews and forum posts, which are often transformed into structured data. Recommender systems are classified as collaborative filtering, content-based, or hybrid approaches. Collaborative filtering relies on user-item correlations or ratings to make recommendations, while content-based filtering analyzes item attributes.
The document discusses content-based recommendation systems. It describes how these systems work by analyzing item content and user preferences to recommend items that match a user's interests. It covers techniques like using TF-IDF to represent items as vectors, calculating similarity between items and profiles to find nearest neighbors, and using machine learning methods like naive Bayes classification. Feature selection methods are also discussed to choose the most useful terms for modeling items and users.
This document presents a project proposal for a Recommendation System for Technical Learning. It includes:
1. The names of the team members and project guide.
2. The objectives are to create a recommendation system to recommend relevant courses and books to users based on popularity and interests using collaborative and content-based filtering.
3. The literature review discusses previous recommendation system problems and solutions using collaborative filtering on Hadoop and considering location as an attribute.
4. The solution approach uses two types of filtering - collaborative and content-based - to build the recommendation system and analyze user ratings to train an ML model to make recommendations.
This document discusses information technology and database concepts. It covers relational, hierarchical, and network database models. It also discusses two-tier and three-tier architecture. The document then discusses system analysis and design, including defining a system, the software development life cycle, and the different phases of system analysis, design, coding, testing, implementation, and maintenance.
This document provides an overview of information retrieval and extraction systems. It discusses how information retrieval systems work by generating representations of documents and queries/profiles, comparing the representations, and returning relevant results. It also outlines the generic modules that comprise information extraction systems, including their inputs, outputs, functions, and rule-based operations.
Mendeley: crowdsourcing and recommending research on a large scaleKris Jack
I was invited to be the keynote speaker at a special track on Recommendation; Data Sharing and Research Practices in Science 2.0 at the I-KNOW 2011 conference (http://i-know.tugraz.at/) on 2011/09/07.
It presents the challanges involved in crowdsourcing the world's largest research catalogue and then building a recommendation service on top of them that scales to serve millions of users.
Expert systems in artificial intelegenceAnna Aquarian
An expert system is a computer system that uses knowledge and inference rules to solve complex problems in a manner similar to a human expert. It consists of a knowledge base containing facts and rules about a problem domain, a working memory that stores facts about the current problem, an inference engine that applies rules to derive new facts and solve problems, and a user interface for communicating with users. Expert systems are designed to emulate the decision-making of human experts and provide consistent, fast solutions to problems in a domain.
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...IJCSEA Journal
This document compares collaborative filtering algorithms with various similarity measures for movie recommendations. It summarizes User-based and Item-based collaborative filtering algorithms implemented in the Apache Mahout framework. Various similarity measures used in collaborative filtering are discussed, including Euclidean distance, Log Likelihood Ratio, Pearson correlation, Tanimoto coefficient, Uncentered Cosine, and Spearman correlation. The document concludes that Item-based algorithms typically provide better results than User-based algorithms for movie recommendations.
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...IJCSEA Journal
Collaborative Filtering is generally used as a recommender system. There is enormous growth in the amount of data in web. These recommender systems help users to select products on the web, which is the most suitable for them. Collaborative filtering-systems collect user’s previous information about an item such as movies, music, ideas, and so on. For recommending the best item, there are many algorithms, which are based on different approaches. The most known algorithms are User-based and Item-based algorithms. Experiments show that Item-based algorithms give better results than User-based algorithms. The aim of this paper isto compare User-based and Item-based Collaborative Filtering Algorithms with many different similarity indexes with their accuracy and performance. We provide an approach to determine the best algorithm, which give the most accurate recommendation by using statistical accuracy metrics. The results are compared the User-based and Item-based algorithms with movie recommendation data set.
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...PyData
This document discusses building a hybrid recommendation engine using Python to recommend Pubmed documents. It begins with an introduction to predictive analytics and recommender systems. Different types of recommender systems are described, including knowledge-based, content-based, collaborative filtering, and hybrid models. The document then outlines a hybrid model that performs content-based filtering on Pubmed documents using vector space modeling and weights documents, before applying collaborative filtering using the Python-recsys library to filter and recommend documents. Finally, it demonstrates the hybrid model on a Pubmed dataset and compares its performance to using Python-recsys alone.
The document discusses a content-based recommendation system with sentiment analysis. It provides an overview of recommendation systems and their importance. The objectives are to provide personalized recommendations to users based on their preferences using information filtering techniques. Existing systems faced issues like scalability, sparsity, and cold starts. The proposed system is a hybrid approach that combines item-based collaborative filtering with user clustering to make predictions. It will be scalable while addressing cold starts. Tools like Flask, JavaScript, Python are used. Cosine similarity and sentiment analysis techniques are also discussed. The conclusion is that the proposed system can recommend less popular items and future work could include other factors in recommendations.
Develop a robust and effective book recommendation system that provides personalized suggestions to users, enhancing their reading experience and promoting diverse literary exploration.
International Journal of Engineering Research and DevelopmentIJERD Editor
This document summarizes and compares different recommender system techniques and graph processing platforms. It discusses five main recommender system categories: collaborative filtering, content-based, demographic, utility-based, and knowledge-based. It also outlines six popular graph processing platforms: Hadoop, YARN, Stratosphere, Giraph, GraphLab, and Neo4j. The document provides an overview of the programming models used by these platforms, particularly MapReduce.
DCCR is a deep collaborative conjunctive recommender model for rating prediction tasks. It is a hybrid neural network architecture consisting of an embedding system and neural network. The embedding system extracts latent features of users and items from raw ratings data. The neural network then merges the user and item features and extracts higher-level interaction features for rating prediction. Experiments on two datasets show DCCR achieves better accuracy than other methods by leveraging deep feature extraction and fusion while only using raw rating data. Future work includes exploring other similarity measures to address issues with sparse rating data.
Nesta palestra no evento GDG DataFest, apresentei uma introdução prática sobre as principais técnicas de sistemas de recomendação, incluindo arquiteturas recentes baseadas em Deep Learning. Foram apresentados exemplos utilizando Python, TensorFlow e Google ML Engine, e fornecidos datasets para exercitarmos um cenário de recomendação de artigos e notícias.
This document provides an overview of a module on recommender systems for a digital library curriculum. The module aims to teach students about different recommender system approaches, including content-based, collaborative filtering, and hybrid systems. It also covers challenges in recommender system design and the use of user profiles. The key topics covered include recommender system types and techniques, challenges in collaborative filtering, and modeling user profiles both explicitly and implicitly.
Digital Trails Dave King 1 5 10 Part 2 D3Dave King
Collective intelligence is defined as the intelligence that emerges from the interactions and contributions of users. It can be harnessed through allowing user interactions and contributions, aggregating what is learned about users through models, and using those models to recommend relevant content. Collective intelligence comes from both structured data like ratings and purchases, as well as unstructured data like reviews and forum posts, which are often transformed into structured data. Recommender systems are classified as collaborative filtering, content-based, or hybrid approaches. Collaborative filtering relies on user-item correlations or ratings to make recommendations, while content-based filtering analyzes item attributes.
The document discusses content-based recommendation systems. It describes how these systems work by analyzing item content and user preferences to recommend items that match a user's interests. It covers techniques like using TF-IDF to represent items as vectors, calculating similarity between items and profiles to find nearest neighbors, and using machine learning methods like naive Bayes classification. Feature selection methods are also discussed to choose the most useful terms for modeling items and users.
This document presents a project proposal for a Recommendation System for Technical Learning. It includes:
1. The names of the team members and project guide.
2. The objectives are to create a recommendation system to recommend relevant courses and books to users based on popularity and interests using collaborative and content-based filtering.
3. The literature review discusses previous recommendation system problems and solutions using collaborative filtering on Hadoop and considering location as an attribute.
4. The solution approach uses two types of filtering - collaborative and content-based - to build the recommendation system and analyze user ratings to train an ML model to make recommendations.
This document discusses information technology and database concepts. It covers relational, hierarchical, and network database models. It also discusses two-tier and three-tier architecture. The document then discusses system analysis and design, including defining a system, the software development life cycle, and the different phases of system analysis, design, coding, testing, implementation, and maintenance.
This document provides an overview of information retrieval and extraction systems. It discusses how information retrieval systems work by generating representations of documents and queries/profiles, comparing the representations, and returning relevant results. It also outlines the generic modules that comprise information extraction systems, including their inputs, outputs, functions, and rule-based operations.
Mendeley: crowdsourcing and recommending research on a large scaleKris Jack
I was invited to be the keynote speaker at a special track on Recommendation; Data Sharing and Research Practices in Science 2.0 at the I-KNOW 2011 conference (http://i-know.tugraz.at/) on 2011/09/07.
It presents the challanges involved in crowdsourcing the world's largest research catalogue and then building a recommendation service on top of them that scales to serve millions of users.
Expert systems in artificial intelegenceAnna Aquarian
An expert system is a computer system that uses knowledge and inference rules to solve complex problems in a manner similar to a human expert. It consists of a knowledge base containing facts and rules about a problem domain, a working memory that stores facts about the current problem, an inference engine that applies rules to derive new facts and solve problems, and a user interface for communicating with users. Expert systems are designed to emulate the decision-making of human experts and provide consistent, fast solutions to problems in a domain.
Similar to Scientific Recommender Systems - PG PUSHPIN (20)
This presentation was provided by Steph Pollock of The American Psychological Association’s Journals Program, and Damita Snow, of The American Society of Civil Engineers (ASCE), for the initial session of NISO's 2024 Training Series "DEIA in the Scholarly Landscape." Session One: 'Setting Expectations: a DEIA Primer,' was held June 6, 2024.
This presentation includes basic of PCOS their pathology and treatment and also Ayurveda correlation of PCOS and Ayurvedic line of treatment mentioned in classics.
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Dr. Vinod Kumar Kanvaria
Exploiting Artificial Intelligence for Empowering Researchers and Faculty,
International FDP on Fundamentals of Research in Social Sciences
at Integral University, Lucknow, 06.06.2024
By Dr. Vinod Kumar Kanvaria
Main Java[All of the Base Concepts}.docxadhitya5119
This is part 1 of my Java Learning Journey. This Contains Custom methods, classes, constructors, packages, multithreading , try- catch block, finally block and more.
Executive Directors Chat Leveraging AI for Diversity, Equity, and InclusionTechSoup
Let’s explore the intersection of technology and equity in the final session of our DEI series. Discover how AI tools, like ChatGPT, can be used to support and enhance your nonprofit's DEI initiatives. Participants will gain insights into practical AI applications and get tips for leveraging technology to advance their DEI goals.
The simplified electron and muon model, Oscillating Spacetime: The Foundation...RitikBhardwaj56
Discover the Simplified Electron and Muon Model: A New Wave-Based Approach to Understanding Particles delves into a groundbreaking theory that presents electrons and muons as rotating soliton waves within oscillating spacetime. Geared towards students, researchers, and science buffs, this book breaks down complex ideas into simple explanations. It covers topics such as electron waves, temporal dynamics, and the implications of this model on particle physics. With clear illustrations and easy-to-follow explanations, readers will gain a new outlook on the universe's fundamental nature.
it describes the bony anatomy including the femoral head , acetabulum, labrum . also discusses the capsule , ligaments . muscle that act on the hip joint and the range of motion are outlined. factors affecting hip joint stability and weight transmission through the joint are summarized.
How to Setup Warehouse & Location in Odoo 17 InventoryCeline George
In this slide, we'll explore how to set up warehouses and locations in Odoo 17 Inventory. This will help us manage our stock effectively, track inventory levels, and streamline warehouse operations.
Strategies for Effective Upskilling is a presentation by Chinwendu Peace in a Your Skill Boost Masterclass organisation by the Excellence Foundation for South Sudan on 08th and 09th June 2024 from 1 PM to 3 PM on each day.
5. Recommender Systems
Recommender Systems
u :C ×S →R
C - set of all users
S - set of all items
R - totally ordered set, which describes the usefulness of the
items to the respective user
Scientific Recommender Systems 5
6. Categories of Recommender Systems
Categories of Recommender Systems
content-based: items are recommended that are similar to
items the user liked in the past
collaborative: items are recommended that people liked that
are similar to the user (similar taste/preferences)
hybrid: a combination of content-based and collaborative
recommendation approaches
Scientific Recommender Systems 6
7. Categories of Recommender Systems
Content-based Recommender Systems
utility u(c, s) of an item s is estimated with the help of the
utilities u(c, si ) of all items si ∈ S that user c already rated
that are similar to item s
similarity between items is calculated according to their
attributes
user and item profiles
common problems
limited content analysis
overspecialization
new user problem
Scientific Recommender Systems 7
8. Categories of Recommender Systems
Content-based Recommender: TF-IDF
N - total number of documents in the system
keyword ki appears in ni of the documents
fi,j denotes the number of times a certain keyword ki appears
in a document dj
Scientific Recommender Systems 8
9. Categories of Recommender Systems
Content-based Recommender: TF-IDF
N - total number of documents in the system
keyword ki appears in ni of the documents
fi,j denotes the number of times a certain keyword ki appears
in a document dj
Term Frequency
fi,j
TFi,j = maxz fz,j
maximum in the denominator calculated over the frequencies
of all keywords kz that appear in document dj
Scientific Recommender Systems 8
10. Categories of Recommender Systems
Content-based Recommender: TF-IDF
N - total number of documents in the system
keyword ki appears in ni of the documents
fi,j denotes the number of times a certain keyword ki appears
in a document dj
Term Frequency
fi,j
TFi,j = maxz fz,j
maximum in the denominator calculated over the frequencies
of all keywords kz that appear in document dj
Inverse Document Frequency
N
for a keyword ki : IDFi = log ni
Scientific Recommender Systems 8
11. Categories of Recommender Systems
Content-based Recommender: TF-IDF
N - total number of documents in the system
keyword ki appears in ni of the documents
fi,j denotes the number of times a certain keyword ki appears
in a document dj
Term Frequency
fi,j
TFi,j = maxz fz,j
maximum in the denominator calculated over the frequencies
of all keywords kz that appear in document dj
Inverse Document Frequency
N
for a keyword ki : IDFi = log ni
TF-IDF
wi,j = TFi,j × IDFi
Scientific Recommender Systems 8
12. Categories of Recommender Systems
Collaborative Recommender Systems
utility u(c, s) of an item s is estimated with the help of the
utilities u(ci , s) assigned by users ci ∈ C that are similar to
user c.
common problems
new user/item problem
cold start
sparsity
scalability
Scientific Recommender Systems 9
13. Categories of Recommender Systems
Collaborative Recommender: Apache Mahout (1)
provides a ”toolbox” to create collaborative recommender
systems
input
user (long), item (long), preference (double)
1, 111, 2.5
data model
input from different file formats, database
increase performance with specific data structures
Scientific Recommender Systems 10
14. Categories of Recommender Systems
Collaborative Recommender: Apache Mahout (2)
user-based recommender
Scientific Recommender Systems 11
15. Categories of Recommender Systems
Collaborative Recommender: Apache Mahout (2)
user-based recommender
item-based recommender
Scientific Recommender Systems 11
16. Categories of Recommender Systems
Collaborative Recommender: Apache Mahout (3)
similarity measures
pearson correlation (cosine similarity)
euclidean distance
spearman correlation
log-likelihood
...
slope-one recommender
other experimental recommender implementations
e.g. cluster-based
Scientific Recommender Systems 12
17. Categories of Recommender Systems
Hybrid Recommender Systems
combination of content-based and collaborative methods
seperate content-based and collaborative recommender
systems; results get combined somehow
collaborative recommender system with some added aspects of
content-based methods
content-based recommender system with some added aspects
of collaborative methods
a single recommender system which unifies content-based and
collaborative methods from the beginning
Scientific Recommender Systems 13
18. Categories of Recommender Systems
Hybrid Recommender: SciPlore
SciPlore Overview
Scientific Recommender Systems 14
20. Conclusion
Summary
utility function
categories of recommender systems
content-based
collaborative
hybrid
implementation with Apache Mahout
possible visualizations
Scientific Recommender Systems 16
21. Conclusion
Questions?
Scientific Recommender Systems 17
22. References
References
Apache Mahout: Scalable machine learning and data mining.
http://mahout.apache.org/ - accessed on 6th January 2012
SciPlore: Exploring Science. http://www.sciplore.org -
accessed on 6th January 2012
G Adomavicius and A Tuzhilin. Toward the next generation of
recommender systems: a survey of the state-of-the-art and
possible extensions. IEEE Transactions on Knowledge and
Data Engineering, 17(6):734-749, 2005
B Gipp, J Beel and C Hentschel. Scienstein: A research paper
recommender system, volume 301, pages 309-315. IEEE, 2009
Sean Owen, Robin Anil, Ted Dunning and Ellen Friedman.
Mahout in Action, 2011
Scientific Recommender Systems 18