Slides from the paper
Ladislav Peska, Peter Vojtas:Using Implicit Preference Relations to Improve Content-based Recommendations.
on EC_WEB 2015 Conference (DEXA event), Valencia, Spain
An overview of the current functionality of the FAIR Evaluator - a framework for automating the evaluation of FAIRness of digital resources. The screenshots here are of the early strawman prototype, which is only available for use by the FAIR Metrics Authoring group at this time. Nevertheless, feedback on the functionality of the Evaluator would be welcome! We anticipate having a fully public version before August 2018.
This work is supported, in part, by the Ministerio de Economía y Competitividad grant number TIN2014-55993-RM
BESDUI: Benchmark for End-User Structured Data User InterfacesRoberto García
BESDUI is a first proposal to establish an accepted benchmark to measure the performance, from an end-user perspective, of tools for structured data exploration and search. This includes relational and semantic data. More details: http://w3id.org/BESDUI
This is a short presentation about the FAIR Metrics Evaluator - software that automates the application of FAIR Metrics against a given resource, in order to determine its degree of "FAIRness"
This tutorial gives an overview of how search engines and machine learning techniques can be tightly coupled to address the need for building scalable recommender or other prediction based systems. Typically, most of them architect retrieval and prediction in two phases. In Phase I, a search engine returns the top-k results based on constraints expressed as a query. In Phase II, the top-k results are re-ranked in another system according to an optimization function that uses a supervised trained model. However this approach presents several issues, such as the possibility of returning sub-optimal results due to the top-k limits during query, as well as the prescence of some inefficiencies in the system due to the decoupling of retrieval and ranking.
To address this issue the authors created ML-Scoring, an open source framework that tightly integrates machine learning models into Elasticsearch, a popular search engine. ML-Scoring replaces the default information retrieval ranking function with a custom supervised model that is trained through Spark, Weka, or R that is loaded as a plugin in Elasticsearch. This tutorial will not only review basic methods in information retrieval and machine learning, but it will also walk through practical examples from loading a dataset into Elasticsearch to training a model in Spark, Weka, or R, to creating the ML-Scoring plugin for Elasticsearch. No prior experience is required in any system listed (Elasticsearch, Spark, Weka, R), though some programming experience is recommended.
An overview of the current functionality of the FAIR Evaluator - a framework for automating the evaluation of FAIRness of digital resources. The screenshots here are of the early strawman prototype, which is only available for use by the FAIR Metrics Authoring group at this time. Nevertheless, feedback on the functionality of the Evaluator would be welcome! We anticipate having a fully public version before August 2018.
This work is supported, in part, by the Ministerio de Economía y Competitividad grant number TIN2014-55993-RM
BESDUI: Benchmark for End-User Structured Data User InterfacesRoberto García
BESDUI is a first proposal to establish an accepted benchmark to measure the performance, from an end-user perspective, of tools for structured data exploration and search. This includes relational and semantic data. More details: http://w3id.org/BESDUI
This is a short presentation about the FAIR Metrics Evaluator - software that automates the application of FAIR Metrics against a given resource, in order to determine its degree of "FAIRness"
This tutorial gives an overview of how search engines and machine learning techniques can be tightly coupled to address the need for building scalable recommender or other prediction based systems. Typically, most of them architect retrieval and prediction in two phases. In Phase I, a search engine returns the top-k results based on constraints expressed as a query. In Phase II, the top-k results are re-ranked in another system according to an optimization function that uses a supervised trained model. However this approach presents several issues, such as the possibility of returning sub-optimal results due to the top-k limits during query, as well as the prescence of some inefficiencies in the system due to the decoupling of retrieval and ranking.
To address this issue the authors created ML-Scoring, an open source framework that tightly integrates machine learning models into Elasticsearch, a popular search engine. ML-Scoring replaces the default information retrieval ranking function with a custom supervised model that is trained through Spark, Weka, or R that is loaded as a plugin in Elasticsearch. This tutorial will not only review basic methods in information retrieval and machine learning, but it will also walk through practical examples from loading a dataset into Elasticsearch to training a model in Spark, Weka, or R, to creating the ML-Scoring plugin for Elasticsearch. No prior experience is required in any system listed (Elasticsearch, Spark, Weka, R), though some programming experience is recommended.
In questo articolo, voglio analizzare gli algoritmi dei motori
di ricerca in ottica SEO. Google e gli altri “search
engine” utilizzano algoritmi per classificare le informazioni
archiviate sulla base di determinati parametri, non tutti noti.
Questo processo permette di assegnare un valore ad ogni
pagina web, valore che determina la posizione della pagina
nella lista dei risultati (SERP) dei motori di ricerca.
L’obiettivo primario è quello di fornire risultati che
soddisfino pienamente le query effettuate dagli utenti.
Un errore è però credere che Google, Yahoo e Bing utilizzino
un solo algoritmo, quando in realtà sono diversi. Ognuno di
essi assolve ad uno specifico compito e per questo è
importante conoscerne almeno le principali caratteristiche e
sfruttarle in ottica SEO.
Creare un sito web di successo che ci permette di generare traffico è fondamentale per ottenere un ritorno economico. Gli stessi inserzionisti saranno sicuramente più propensi ad investire sulle nostre pagine in termini di pubblicità se il numero di visitatori è elevato, omogeneamente distribuito su tutte le pagine del sito e realmente interessato al contenuto. Ma come facciamo ad ottenere questo risultato? Come facciamo a portare i visitatori sul nostro sito e a catturare il loro interesse ? Il passaggio preliminare, obbligatorio, è ovviamente quello di indicizzare il sito sui principali motori di ricerca. Secondo step, quello di riuscire ad ottenere un buon risultato in termini di visibilità, ovvero di posizionare il sito nella prima pagina dei risultati (SERP) dei motori di ricerca, utilizzando tecniche SEO & SEM.
Towards Complex User Feedback and Presentation Context in Recommender SystemsLadislav Peska
We present our work in progress towards employing complex user feedback and its context in recommender systems. Our work is generally focused on small or medium-sized e-commerce portals. Due to the nature of such enterprises, explicit feedback is unavailable, but implicit feedback can be collected in both large amount and rich variety. However, some perceived values of implicit feedback may depend on the context of the page or user’s device (further denoted as presentation context). In this paper, we present an extended model of presentation context, propose methods integrating it into the set of implicit feedback features and evaluate these on the dataset of real e-commerce users. The evaluation corroborated the importance of leveraging presentation context in recommender systems.
presents the foundational aspects of web analytics and some specifics such as the hotel problem. Discusses trace data, behaviorism, and other cool web analytics stuff
In questo articolo, voglio analizzare gli algoritmi dei motori
di ricerca in ottica SEO. Google e gli altri “search
engine” utilizzano algoritmi per classificare le informazioni
archiviate sulla base di determinati parametri, non tutti noti.
Questo processo permette di assegnare un valore ad ogni
pagina web, valore che determina la posizione della pagina
nella lista dei risultati (SERP) dei motori di ricerca.
L’obiettivo primario è quello di fornire risultati che
soddisfino pienamente le query effettuate dagli utenti.
Un errore è però credere che Google, Yahoo e Bing utilizzino
un solo algoritmo, quando in realtà sono diversi. Ognuno di
essi assolve ad uno specifico compito e per questo è
importante conoscerne almeno le principali caratteristiche e
sfruttarle in ottica SEO.
Creare un sito web di successo che ci permette di generare traffico è fondamentale per ottenere un ritorno economico. Gli stessi inserzionisti saranno sicuramente più propensi ad investire sulle nostre pagine in termini di pubblicità se il numero di visitatori è elevato, omogeneamente distribuito su tutte le pagine del sito e realmente interessato al contenuto. Ma come facciamo ad ottenere questo risultato? Come facciamo a portare i visitatori sul nostro sito e a catturare il loro interesse ? Il passaggio preliminare, obbligatorio, è ovviamente quello di indicizzare il sito sui principali motori di ricerca. Secondo step, quello di riuscire ad ottenere un buon risultato in termini di visibilità, ovvero di posizionare il sito nella prima pagina dei risultati (SERP) dei motori di ricerca, utilizzando tecniche SEO & SEM.
Towards Complex User Feedback and Presentation Context in Recommender SystemsLadislav Peska
We present our work in progress towards employing complex user feedback and its context in recommender systems. Our work is generally focused on small or medium-sized e-commerce portals. Due to the nature of such enterprises, explicit feedback is unavailable, but implicit feedback can be collected in both large amount and rich variety. However, some perceived values of implicit feedback may depend on the context of the page or user’s device (further denoted as presentation context). In this paper, we present an extended model of presentation context, propose methods integrating it into the set of implicit feedback features and evaluate these on the dataset of real e-commerce users. The evaluation corroborated the importance of leveraging presentation context in recommender systems.
presents the foundational aspects of web analytics and some specifics such as the hotel problem. Discusses trace data, behaviorism, and other cool web analytics stuff
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...inventionjournals
Information is overloaded in the Internet due to the unstable growth of information and it makes information search as complicate process. Recommendation System (RS) is the tool and largely used nowadays in many areas to generate interest items to users. With the development of e-commerce and information access, recommender systems have become a popular technique to prune large information spaces so that users are directed toward those items that best meet their needs and preferences. As the exponential explosion of various contents generated on the Web, Recommendation techniques have become increasingly indispensable. Web recommendation systems assist the users to get the exact information and facilitate the information search easier. Web recommendation is one of the techniques of web personalization, which recommends web pages or items to the user based on the previous browsing history. But the tremendous growth in the amount of the available information and the number of visitors to web sites in recent years places some key challenges for recommender system. The recent recommender systems stuck with producing high quality recommendation with large information, resulting unwanted item instead of targeted item or product, and performing many recommendations per second for millions of user and items. To avoid these challenges a new recommender system technologies are needed that can quickly produce high quality recommendation, even for a very large scale problems. To address these issues we use two recommender system process using fuzzy clustering and collaborative filtering algorithms. Fuzzy clustering is used to predict the items or product that will be accessed in the future based on the previous action of user browsers behavior. Collaborative filtering recommendation process is used to produce the user expects result from the result of fuzzy clustering and collection of Web Database data items. Using this new recommendation system, it results the user expected product or item with minimum time. This system reduces the result of unrelated and unwanted item to user and provides the results with user interested domain.
An open source, scalable queuing solution on top of apache kafka 2019Yaniv Bronhaim
Apache Kafka is an open-source stream-processing software platform. It aims to provide a unified, high-throughput, low-latency platform for handling real-time data feeds.
In this session, we’ll cover basic Kafka terminology: Topics, Groups, Partitions and Offsets. We will dive deeper into Kafka’s ACK policies, discuss their advantages and challenges, as well as how we built a queuing system using Kafka in order to support Yotpo’s architecture and vision of breaking the monolith. During the session, we’ll share real-life production use-cases and present the ecosystem and open-source tools we implemented at Yotpo to support message queuing requirements.
Yaniv Bronheim
Cloud Platform Group
Making IA Real: Planning an Information Architecture StrategyChiara Fox Ogan
Presented at Internet Librarian conference in 2001. Provides an introduction to what information architecture is and how you can use the methods to develop a good website.
Develop a robust and effective book recommendation system that provides personalized suggestions to users, enhancing their reading experience and promoting diverse literary exploration.
This is part 1 of the tutorial Xavier and Deepak gave at Recsys 2016 this year. You can find the second part http://www.slideshare.net/xamat/recsys-2016-tutorial-lessons-learned-from-building-reallife-recommender-systems
Similar to Using Implicit Preference Relations to Improve Content-based Recommendations, EC-WEB 2015 (20)
Towards Recommender Systems for Police Photo LineupLadislav Peska
Photo lineups play a significant role in the eyewitness identifica-tion process. This method is used to provide evidence in the prosecution and subsequent conviction of suspects. Unfortu-nately, there are many cases where lineups have led to the con-viction of an innocent suspect. One of the key factors affecting the incorrect identification of a suspect is the lack of lineup fair-ness, i.e. that the suspect differs significantly from all other candidates. Although the process of assembling fair lineup is both highly important and time-consuming, only a handful of tools are available to simplify the task.
In these slides, we describe our work towards using recommend-er systems for the photo lineup assembling task. We propose and evaluate two complementary methods for item-based rec-ommendation: one based on the visual descriptors of the deep neural network, the other based on the content-based attrib-utes of persons.
The initial evaluation made by forensic technicians shows that although results favored visual descriptors over attribute-based similarity, both approaches are functional and highly diverse in terms of recommended objects. Thus, future work should in-volve incorporating both approaches in a single prediction method, preference learning based on the feedback from forensic technicians and recommendation of assembled lineups instead of single candidates.
Linking Content Information with Bayesian Personalized Ranking via Multiple C...Ladislav Peska
In this paper, we propose a multiple content alignments extension to the Bayesian Personalized Ranking Matrix Factorization (BPR-MCA). The proposed method incorporates multiple sources of content information in the form of user-to-user or object-to-object similarity matrices and aligns users’ and items’ latent factors ac-cording to these similarities. During the training phase, BPR-MCA also learns the relevance weight of each similarity matrix.
BPR-MCA was evaluated on the MovieLens 1M dataset, extended by the content information from IMDB, DBTropes and ZIP code statistics. The experiment shows that BPR-MCA can help to significantly improve recommendation w.r.t. nDCG and AUPR over standard BPR under several cold-start scenarios.
How Recreation Management Software Can Streamline Your Operations.pptxwottaspaceseo
Recreation management software streamlines operations by automating key tasks such as scheduling, registration, and payment processing, reducing manual workload and errors. It provides centralized management of facilities, classes, and events, ensuring efficient resource allocation and facility usage. The software offers user-friendly online portals for easy access to bookings and program information, enhancing customer experience. Real-time reporting and data analytics deliver insights into attendance and preferences, aiding in strategic decision-making. Additionally, effective communication tools keep participants and staff informed with timely updates. Overall, recreation management software enhances efficiency, improves service delivery, and boosts customer satisfaction.
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptxrickgrimesss22
Discover the essential features to incorporate in your Winzo clone app to boost business growth, enhance user engagement, and drive revenue. Learn how to create a compelling gaming experience that stands out in the competitive market.
May Marketo Masterclass, London MUG May 22 2024.pdfAdele Miller
Can't make Adobe Summit in Vegas? No sweat because the EMEA Marketo Engage Champions are coming to London to share their Summit sessions, insights and more!
This is a MUG with a twist you don't want to miss.
Launch Your Streaming Platforms in MinutesRoshan Dwivedi
The claim of launching a streaming platform in minutes might be a bit of an exaggeration, but there are services that can significantly streamline the process. Here's a breakdown:
Pros of Speedy Streaming Platform Launch Services:
No coding required: These services often use drag-and-drop interfaces or pre-built templates, eliminating the need for programming knowledge.
Faster setup: Compared to building from scratch, these platforms can get you up and running much quicker.
All-in-one solutions: Many services offer features like content management systems (CMS), video players, and monetization tools, reducing the need for multiple integrations.
Things to Consider:
Limited customization: These platforms may offer less flexibility in design and functionality compared to custom-built solutions.
Scalability: As your audience grows, you might need to upgrade to a more robust platform or encounter limitations with the "quick launch" option.
Features: Carefully evaluate which features are included and if they meet your specific needs (e.g., live streaming, subscription options).
Examples of Services for Launching Streaming Platforms:
Muvi [muvi com]
Uscreen [usencreen tv]
Alternatives to Consider:
Existing Streaming platforms: Platforms like YouTube or Twitch might be suitable for basic streaming needs, though monetization options might be limited.
Custom Development: While more time-consuming, custom development offers the most control and flexibility for your platform.
Overall, launching a streaming platform in minutes might not be entirely realistic, but these services can significantly speed up the process compared to building from scratch. Carefully consider your needs and budget when choosing the best option for you.
Utilocate offers a comprehensive solution for locate ticket management by automating and streamlining the entire process. By integrating with Geospatial Information Systems (GIS), it provides accurate mapping and visualization of utility locations, enhancing decision-making and reducing the risk of errors. The system's advanced data analytics tools help identify trends, predict potential issues, and optimize resource allocation, making the locate ticket management process smarter and more efficient. Additionally, automated ticket management ensures consistency and reduces human error, while real-time notifications keep all relevant personnel informed and ready to respond promptly.
The system's ability to streamline workflows and automate ticket routing significantly reduces the time taken to process each ticket, making the process faster and more efficient. Mobile access allows field technicians to update ticket information on the go, ensuring that the latest information is always available and accelerating the locate process. Overall, Utilocate not only enhances the efficiency and accuracy of locate ticket management but also improves safety by minimizing the risk of utility damage through precise and timely locates.
First Steps with Globus Compute Multi-User EndpointsGlobus
In this presentation we will share our experiences around getting started with the Globus Compute multi-user endpoint. Working with the Pharmacology group at the University of Auckland, we have previously written an application using Globus Compute that can offload computationally expensive steps in the researcher's workflows, which they wish to manage from their familiar Windows environments, onto the NeSI (New Zealand eScience Infrastructure) cluster. Some of the challenges we have encountered were that each researcher had to set up and manage their own single-user globus compute endpoint and that the workloads had varying resource requirements (CPUs, memory and wall time) between different runs. We hope that the multi-user endpoint will help to address these challenges and share an update on our progress here.
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...Shahin Sheidaei
Games are powerful teaching tools, fostering hands-on engagement and fun. But they require careful consideration to succeed. Join me to explore factors in running and selecting games, ensuring they serve as effective teaching tools. Learn to maintain focus on learning objectives while playing, and how to measure the ROI of gaming in education. Discover strategies for pitching gaming to leadership. This session offers insights, tips, and examples for coaches, team leads, and enterprise leaders seeking to teach from simple to complex concepts.
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...informapgpstrackings
Keep tabs on your field staff effortlessly with Informap Technology Centre LLC. Real-time tracking, task assignment, and smart features for efficient management. Request a live demo today!
For more details, visit us : https://informapuae.com/field-staff-tracking/
Enterprise Resource Planning System includes various modules that reduce any business's workload. Additionally, it organizes the workflows, which drives towards enhancing productivity. Here are a detailed explanation of the ERP modules. Going through the points will help you understand how the software is changing the work dynamics.
To know more details here: https://blogs.nyggs.com/nyggs/enterprise-resource-planning-erp-system-modules/
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...Globus
Large Language Models (LLMs) are currently the center of attention in the tech world, particularly for their potential to advance research. In this presentation, we'll explore a straightforward and effective method for quickly initiating inference runs on supercomputers using the vLLM tool with Globus Compute, specifically on the Polaris system at ALCF. We'll begin by briefly discussing the popularity and applications of LLMs in various fields. Following this, we will introduce the vLLM tool, and explain how it integrates with Globus Compute to efficiently manage LLM operations on Polaris. Attendees will learn the practical aspects of setting up and remotely triggering LLMs from local machines, focusing on ease of use and efficiency. This talk is ideal for researchers and practitioners looking to leverage the power of LLMs in their work, offering a clear guide to harnessing supercomputing resources for quick and effective LLM inference.
Developing Distributed High-performance Computing Capabilities of an Open Sci...Globus
COVID-19 had an unprecedented impact on scientific collaboration. The pandemic and its broad response from the scientific community has forged new relationships among public health practitioners, mathematical modelers, and scientific computing specialists, while revealing critical gaps in exploiting advanced computing systems to support urgent decision making. Informed by our team’s work in applying high-performance computing in support of public health decision makers during the COVID-19 pandemic, we present how Globus technologies are enabling the development of an open science platform for robust epidemic analysis, with the goal of collaborative, secure, distributed, on-demand, and fast time-to-solution analyses to support public health.
Software Engineering, Software Consulting, Tech Lead.
Spring Boot, Spring Cloud, Spring Core, Spring JDBC, Spring Security,
Spring Transaction, Spring MVC,
Log4j, REST/SOAP WEB-SERVICES.
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...Mind IT Systems
Healthcare providers often struggle with the complexities of chronic conditions and remote patient monitoring, as each patient requires personalized care and ongoing monitoring. Off-the-shelf solutions may not meet these diverse needs, leading to inefficiencies and gaps in care. It’s here, custom healthcare software offers a tailored solution, ensuring improved care and effectiveness.
GraphSummit Paris - The art of the possible with Graph TechnologyNeo4j
Sudhir Hasbe, Chief Product Officer, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
GraphSummit Paris - The art of the possible with Graph Technology
Using Implicit Preference Relations to Improve Content-based Recommendations, EC-WEB 2015
1. Using Implicit Preference
Relations to Improve Content
Based Recommending
Ladislav Peška and Peter Vojtáš
Department of Software Engineering,
Charles University in Prague,
Czech Republic
2. Peska, Vojtas. Using IPR to Improve Content-
Based Recommending
2
Recommender Systems
Propose relevant items to the right persons at the right
time
Machine learning application
Expose otherwise hard to find, uknown items
Complementary to the catalogues, search engines etc.
„Win-win strategy“
EC-WEB 2015, Valencia
User Feedback
rating, clickstream,
time on page, buys…
User, Object Profiles
Object attributes
(Context)
Time, location,
Possible choices…
RECOMMENDER
SYSTEM
Top-K Recommended objects
3. Peska, Vojtas. Using IPR to Improve Content-
Based Recommending
3
Recommender Systems
User feedback
Explicit feedback (rating)
Implicit feedback (user behavior)
Dwell time, clickstream, scrolling, mouse moves etc.
Often used as a proxy to the user rating
Recommending algorithms
Collaborative filtering
(Users A and B were similar so far, the should like similar things in the future too)
Cold start problem
Content-based filtering
(User A should like similar items to the ones he liked so far)
Overspecialization, lack of diversity, obvious recommendations…
EC-WEB 2015, Valencia
4. Peska, Vojtas. Using IPR to Improve Content-
Based Recommending
4
Challenge
Recommending for small e-commerce websites
Tens of similar vendors, user can choose whichever she likes
(Almost) no explicit feedback
(No incentives for users)
Few visited pages
(Often usage of external search engines & landing on object details)
Low user loyalty
(New vs. Returning visitors ratio 80:20)
Not enough data for collaborative filtering
Focus on Implicit Feedback & Content-based recommendations
Gather as much as possible user feedback; the sooner the better
Gather external content to improve CB recommendations (other papers)
EC-WEB 2015, Valencia
5. Peska, Vojtas. Using IPR to Improve Content-
Based Recommending
5
User Feedback
Explicit feedback (provided via website GUI)
Rating an object via Likert Scale
Comparing objects explicitly is not so common
Implicit feedback (Virtually any JS event could be used)
Actions related to evaluation of a single object
Dwell time on the object detail page
Number of page views
Scrolling, mouse events
Select / copy text, printing, purchase process etc.
Actions related to evaluation of a list of objects
Analyze user behavior on the category pages,
search results etc.
Search related actions etc.
EC-WEB 2015, Valencia
A Bor
Results
Selected object IDs:
1,4
Ignored object IDs:
2,3,5,6,7,8
6. Peska, Vojtas. Using IPR to Improve Content-
Based Recommending
6
Our Working Hypothesis
Users are often evaluating lists of objects
Search results, category pages, recommended items etc.
If user selects some objects from the list, we take it as an
evidence of his/her positive preference.
User prefers selected object(s) more, than other displayed &
ignored objects
We can form preference relations:
IPRrel (selected obj. > ignored obj.)
We can extend such relations along the content-based
similarity of objects
Some objects could be ignored, because user was not
aware of them, not becouse he/she did not like them
E.g. they were displayed below the visible area
EC-WEB 2015, Valencia
>
>
7. Peska, Vojtas. Using IPR to Improve Content-
Based Recommending
7
Outline of Our Approach
EC-WEB 2015, Valencia
8. Peska, Vojtas. Using IPR to Improve Content-
Based Recommending
8
Collecting User Behavior
IPIget component for collecting user behavior
Browser visible area size
List of all objects and its positions on the page
Listener on Scrolling events
Compute visible time for each displayed object, use it as a proxy to
the level of user evaluation
Some more refined approaches are possible (e.g. registering mouse moves or
visual focus for different quadrants)
Listener on Clicking events (which object(s) were selected by the user)
IPIget component download: http://ksi.mff.cuni.cz/~peska/ipiget.zip
EC-WEB 2015, Valencia
9. Peska, Vojtas. Using IPR to Improve Content-
Based Recommending
9
Collecting User Behavior – Example
EC-WEB 2015, Valencia
10. Peska, Vojtas. Using IPR to Improve Content-
Based Recommending
10
Extending IPR Relations
IPR(Ox,Oy,intx,y)
EC-WEB 2015, Valencia
11. Peska, Vojtas. Using IPR to Improve Content-
Based Recommending
11
Using IPR to Reranking List of
Objects
EC-WEB 2015, Valencia
12. Peska, Vojtas. Using IPR to Improve Content-
Based Recommending
12
Using IPR to Reranking List of
Objects - Algorithm
EC-WEB 2015, Valencia
13. Peska, Vojtas. Using IPR to Improve Content-
Based Recommending
13
Using IPR to Reranking List of
Objects – Conflict Strategies
IPR(O4>O2): O4 is better than O2
Forward:
Move O4 just before O2
Do not miss relevant objects
Backward:
Move O2 just after O4
Do not show irrelevant objects
Swap:
Change positions of O4 and O2
Keep objects well separated
EC-WEB 2015, Valencia
O1
O2
O3
O4
O5
O6
+ IPR(O4,O2,int)
O1
O4
O2
O3
O5
O6
O1
O3
O4
O2
O5
O6
O1
O4
O3
O2
O5
O6
Forward Backward Swap
14. Peska, Vojtas. Using IPR to Improve Content-
Based Recommending
14
Our Approach - Example
EC-WEB 2015, Valencia
15. Peska, Vojtas. Using IPR to Improve Content-
Based Recommending
15
Experiments
Off-line experiments on Czech secondhand bookshop dataset
1760 users, train set (2/3 of user data), test set (1/3)
Recommender systems tries to predict visited objects
Vector Space Model (VSM) with TF-IDF & Cosine similarity
SimCat (recommending similar categories based on Collaborative Filtering)
Stochastic Gradient Descent Matrix Factorization (SGD MF)
nDCG and Presence@top-k metrics
EC-WEB 2015, Valencia
Method nDCG p@5 p@10 p@50
VSM + best IPR-rerank (sim:0.5, int:0.1, swap) 0.475 13.6% 15.7% 20.7%
VSM 0.464 13.2% 15.1% 19.6%
Best IPR-rank (sim:0.5, int:0.1, swap) 0.247 7.1% 7.7% 8.5%
SimCat + best IPR-rerank (sim:0.01, int:0.1, forward) 0.219 4.7% 6.3% 10.0%
SimCat 0.136 0.9% 1.5% 5.4%
SGD MF (500 lat. factors, max 500 iterations) 0.126 0.89% 1.2% 3.3%
Random recommendations 0.085 0.09% 0.14% 0.27%
MinSimilarity threshold, VSM
0.2 0.3 0.5 0.8
0.465 0.470 0.473 0.472
Conflict resolving, VSM
Forward Backward Swap
0.465 0.460 0.466
16. Conclusions, Future Work
Implicit feedback could be more than just a substitution for user rating
Collecting feedback on list of objects could give us insight about user decision
proces
We used user behavior on list of objects to create Implicit
Preference Relations (IPR) between selected and ignored objects
IPR can be extended along the object similarity axis
We shown algorithm to update linear list of objects with IPRs
IPR re-ranked recommendations outperformed original ones in an off-line
experiment
Open Problems, Challenges
How much was object really evaluated by the user? (Going beyond visibility)
Which object features makes it desirable for the user? (Tailored object similarities)
On-line deployment
EC-WEB 2015, Valencia Peska, Vojtas. Using IPR to Improve Content-
Based Recommending
16
17. EC-WEB 2015, Valencia Peska, Vojtas. Using IPR to Improve Content-
Based Recommending
17
Thank you!
Questions, comments?
18. Peska, Vojtas. Using IPR to Improve Content-
Based Recommending
18
Recommending in Czech
Second-hand Bookshop
Mostly single item in stock
Few content-based attributes (low information value)
- Title, author, price, category, textual description
- Hard to define informative attributes
- Title (and author name) in Czech
- No common book identifier
(ISBN mostly inapplicable)
No explicit feedback
Page-view, time on page, buys…
Users identified through cookies
Approx. 9500 active books
50-100 visitors / day
2-4 purchases
EC-WEB 2015, Valencia
RECOMMENDED
OBJECTS
CATEGORIES
Attributes
search
CATALOGUE