SlideShare a Scribd company logo
http://Learning-Layers-euhttp://Learning-Layers-eu
Learning Layers
Scaling up Technologies for Informal Learning in SME Clusters
Towards a Scalable Social Recommender
Engine for Online Marketplaces
The Case of Apache Solr
Emanuel Lacic, Dominik Kowald, Denis Parra, Christoph Trattner
1
http://Learning-Layers-eu
Many thanks to
Emanuel Lacic
elacic@know-center.at
Graz University of Technology
Austria
2
Denis Parra
dparra@ing.puc.cl
Pontificia Universidad
Catolica
Chile
Christoph Trattner
ctrattner@know-center.at
Know-Center Graz
Austria
http://Learning-Layers-eu
What will this talk be about?
• (Real-time) product recommendations for
online marketplaces
• Scalability of recommender systems
• Utilizing social network data for the
recommendations of products to people
3
http://Learning-Layers-eu
How did this work start?
• Joint project with the Austrian start-up Blanc-Noir
• Personalized product recommender for online
marketplaces based on
– Actions in the marketplaces (e.g., Ebay, Amazon)
– Product information
– Social network data (e.g., Facebook, G+)
– Filter criteria
• Provided at (near) real-time!
… especially if there is a lot of data
… together with many data updates
4
http://Learning-Layers-eu
So now, how we have solved
that issue?
5
http://Learning-Layers-eu
What‘s available out there?
• Frameworks/approaches for scalable
recommendations
– Distributed data processing
• Apache Hadoop / Mahout (map/reduce paradigm)
– Relational databases
• MySQL, PostgreSQL (e.g., RecDB project)
– Collaborative Filtering improvements
• Matrix factorization
• Lack of a framework / approach that combines all
things we need
6
http://Learning-Layers-eu 7
http://Learning-Layers-eu
Why Solr?
• „High-performance, full-featured text search engine library“
… but more precise …
• „High-performance, fully-featured token matching and scoring
library“ [Grainger, 2012]
… which provides ….
– full-text searches (content-based)
– powerful queries (e.g., MoreLikeThis or Facets)
– (near) real-time data updates (no pre/re-calculations)
– easy schema updates (social data integration)
• Established open-source software (Apache license) with big
community
8
http://Learning-Layers-eu 9
Our framework
https://github.com/learning-layers/SocRec
http://Learning-Layers-eu
How does the thing perform?
• Dataset of virtual world SecondLife
– Marketplace and social data
10
http://Learning-Layers-eu
What‘s about the marketplace
and social data features?
11
0
0.05
0.1
0.15
0.2
Purchases Categories Title Description Interests Groups Likes Comments Interactions
nDCG
Data Features
0
0.1
0.2
0.3
0.4
0.5
Purchases Categories Title Description Interests Groups Likes Comments Interactions
UserCoverage
Data Features
• Both types of data are important for the recommender quality and
user coverage
http://Learning-Layers-eu
What‘s about the hybrids?
12
0
0.01
0.02
0.03
0.04
0.05
MP CCFm CFs ALL
nDCG
Recommendation Algorithms
0
0.2
0.4
0.6
0.8
1
MP CCFm CFs ALL
UserCoverage
Data Features
• The hybrid approach provides a good trade-off of recommender
quality and user coverage
http://Learning-Layers-eu
What‘s about the scalability?
13
• Recommendations can be provided in (near) real-time in both cases
(with and without data update)
http://Learning-Layers-eu
What we have shown!
• Apache Solr is more than a search engine!
• Actually it is a great framework to implement a scalable
recommender engine for online marketplaces
• Near real-time recommendations through build-in query-functions
• Near real-time data updates
• Easy integration of social data
+ a high-performance full-text search engine for free!
• Evaluation on dataset gathered from SecondLife
• Different marketplace and social data features are important
• Hybrid approaches produce more robust recommendations
• It scales!
14
http://Learning-Layers-eu
What do we want to do in the
future?
15
• Online study together with BlancNoir with “real” data
• Impact of geo-spatial data
• Impact of temporal data (see WebScience track)
• Comparative study with other backend solutions (e.g.,
ElasticSearch)
http://Learning-Layers-eu
Thank you for your attention!
Code and framework:
https://github.com/learning-layers/SocRec
Questions?
Dominik Kowald
dkowald@know-center.at
Know-Center
Graz University of Technology (Austria)
16
http://Learning-Layers-eu
Backup
17
http://Learning-Layers-eu
Short hands-on session
• Collaborative Filtering
• Content-Based
18
http://Learning-Layers-eu
SecondLife dataset
19
http://Learning-Layers-eu
How to Use the Engine?
• Implement and run a new recommender
20
http://Learning-Layers-eu
Recommendation Algorithms
implemented in the Engine
• MostPopular (MP)
– Recommends for any user the most purchased items
• Collaborative Filtering (CF)
– Find similar users (k nearest neighbors) and recommend novel items of
those users [Schafer et al., 2007]
– In Solr: select queries and facet counts
• Content-Based (C)
– Analyze item meta-data to find similar items [Pazzani et al., 2007]
– In Solr: MoreLikeThis function
• Hybrid (CCF)
– Combine different algorithms to overcome their individual limitations
[Burke et al., 2002]
– Each algorithm can be weighted / tuned according to its performance
21

More Related Content

What's hot

A Short Swim through the Personal Learning Pool
A Short Swim through the Personal Learning PoolA Short Swim through the Personal Learning Pool
A Short Swim through the Personal Learning Pool
Ralf Klamma
 
Keynote Learning Layers Developer Camp 2013
Keynote Learning Layers Developer Camp 2013Keynote Learning Layers Developer Camp 2013
Keynote Learning Layers Developer Camp 2013
Ralf Klamma
 
A Methodology and Tool Support for Widget-based Web Application Development
A Methodology and Tool Support for Widget-based Web Application DevelopmentA Methodology and Tool Support for Widget-based Web Application Development
A Methodology and Tool Support for Widget-based Web Application Development
Nicolaescu Petru
 
View Based Near Real Time Collaborative Modeling for Information Systems Engi...
View Based Near Real Time Collaborative Modeling for Information Systems Engi...View Based Near Real Time Collaborative Modeling for Information Systems Engi...
View Based Near Real Time Collaborative Modeling for Information Systems Engi...
Nicolaescu Petru
 
Learning Analytics in a Mobile World - A Community Information Systems Perspe...
Learning Analytics in a Mobile World - A Community Information Systems Perspe...Learning Analytics in a Mobile World - A Community Information Systems Perspe...
Learning Analytics in a Mobile World - A Community Information Systems Perspe...
Ralf Klamma
 
OGC Interoperability Experiments and Authentication
OGC Interoperability Experiments and AuthenticationOGC Interoperability Experiments and Authentication
OGC Interoperability Experiments and Authentication
EDINA, University of Edinburgh
 
SyncMeta: Near Real-time Collaborative Conceptual Modeling on the Web
SyncMeta: Near Real-time Collaborative Conceptual Modeling on the WebSyncMeta: Near Real-time Collaborative Conceptual Modeling on the Web
SyncMeta: Near Real-time Collaborative Conceptual Modeling on the Web
Nicolaescu Petru
 
Some Academic Sector/NMCA outcomes from the OGC Web Service Shibboleth Intero...
Some Academic Sector/NMCA outcomes from the OGC Web Service Shibboleth Intero...Some Academic Sector/NMCA outcomes from the OGC Web Service Shibboleth Intero...
Some Academic Sector/NMCA outcomes from the OGC Web Service Shibboleth Intero...
EDINA, University of Edinburgh
 
Strategic Developments in Digital Initiatives at Academic Libraries
Strategic Developments in Digital Initiatives at Academic LibrariesStrategic Developments in Digital Initiatives at Academic Libraries
Strategic Developments in Digital Initiatives at Academic Libraries
Hong (Jenny) Jing
 
Enabling Data-Intensive Science Through Data Infrastructures
Enabling Data-Intensive Science Through Data InfrastructuresEnabling Data-Intensive Science Through Data Infrastructures
Enabling Data-Intensive Science Through Data Infrastructures
LIBER Europe
 
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
LIBER Europe
 
Digital asset management (dam) systems used in Libraries
Digital asset management (dam) systems used in LibrariesDigital asset management (dam) systems used in Libraries
Digital asset management (dam) systems used in Libraries
Hong (Jenny) Jing
 
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
LIBER Europe
 
The  Selection Between An Open Source And Vended Software in Libraries: Oppor...
The  Selection Between An Open Source And Vended Software in Libraries:Oppor...The  Selection Between An Open Source And Vended Software in Libraries:Oppor...
The  Selection Between An Open Source And Vended Software in Libraries: Oppor...
Hong (Jenny) Jing
 
Building and using ontologies (2015)
Building and using ontologies (2015)Building and using ontologies (2015)
Building and using ontologies (2015)
Elena Simperl
 
Shibboleth Access Management Federations and Secure SDI: ESDIN Experience
Shibboleth Access Management Federations and Secure SDI: ESDIN Experience Shibboleth Access Management Federations and Secure SDI: ESDIN Experience
Shibboleth Access Management Federations and Secure SDI: ESDIN Experience
EDINA, University of Edinburgh
 
Yjs: A Framework for Near Real-time P2P Shared Editing on Arbitrary Data Types
Yjs: A Framework for Near Real-time P2P Shared Editing on Arbitrary Data TypesYjs: A Framework for Near Real-time P2P Shared Editing on Arbitrary Data Types
Yjs: A Framework for Near Real-time P2P Shared Editing on Arbitrary Data Types
Nicolaescu Petru
 
E Infrastructure for OA
E Infrastructure for OAE Infrastructure for OA
E Infrastructure for OA
Knowledge Exchange
 
The role of virtual research environments (VRE's) within the context of an e-...
The role of virtual research environments (VRE's) within the context of an e-...The role of virtual research environments (VRE's) within the context of an e-...
The role of virtual research environments (VRE's) within the context of an e-...
heila1
 
Demand Analysis
Demand AnalysisDemand Analysis
Demand Analysis
EDSA project
 

What's hot (20)

A Short Swim through the Personal Learning Pool
A Short Swim through the Personal Learning PoolA Short Swim through the Personal Learning Pool
A Short Swim through the Personal Learning Pool
 
Keynote Learning Layers Developer Camp 2013
Keynote Learning Layers Developer Camp 2013Keynote Learning Layers Developer Camp 2013
Keynote Learning Layers Developer Camp 2013
 
A Methodology and Tool Support for Widget-based Web Application Development
A Methodology and Tool Support for Widget-based Web Application DevelopmentA Methodology and Tool Support for Widget-based Web Application Development
A Methodology and Tool Support for Widget-based Web Application Development
 
View Based Near Real Time Collaborative Modeling for Information Systems Engi...
View Based Near Real Time Collaborative Modeling for Information Systems Engi...View Based Near Real Time Collaborative Modeling for Information Systems Engi...
View Based Near Real Time Collaborative Modeling for Information Systems Engi...
 
Learning Analytics in a Mobile World - A Community Information Systems Perspe...
Learning Analytics in a Mobile World - A Community Information Systems Perspe...Learning Analytics in a Mobile World - A Community Information Systems Perspe...
Learning Analytics in a Mobile World - A Community Information Systems Perspe...
 
OGC Interoperability Experiments and Authentication
OGC Interoperability Experiments and AuthenticationOGC Interoperability Experiments and Authentication
OGC Interoperability Experiments and Authentication
 
SyncMeta: Near Real-time Collaborative Conceptual Modeling on the Web
SyncMeta: Near Real-time Collaborative Conceptual Modeling on the WebSyncMeta: Near Real-time Collaborative Conceptual Modeling on the Web
SyncMeta: Near Real-time Collaborative Conceptual Modeling on the Web
 
Some Academic Sector/NMCA outcomes from the OGC Web Service Shibboleth Intero...
Some Academic Sector/NMCA outcomes from the OGC Web Service Shibboleth Intero...Some Academic Sector/NMCA outcomes from the OGC Web Service Shibboleth Intero...
Some Academic Sector/NMCA outcomes from the OGC Web Service Shibboleth Intero...
 
Strategic Developments in Digital Initiatives at Academic Libraries
Strategic Developments in Digital Initiatives at Academic LibrariesStrategic Developments in Digital Initiatives at Academic Libraries
Strategic Developments in Digital Initiatives at Academic Libraries
 
Enabling Data-Intensive Science Through Data Infrastructures
Enabling Data-Intensive Science Through Data InfrastructuresEnabling Data-Intensive Science Through Data Infrastructures
Enabling Data-Intensive Science Through Data Infrastructures
 
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
 
Digital asset management (dam) systems used in Libraries
Digital asset management (dam) systems used in LibrariesDigital asset management (dam) systems used in Libraries
Digital asset management (dam) systems used in Libraries
 
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
A Revolution in Open Science: Open Data and the Role of Libraries (Professor ...
 
The  Selection Between An Open Source And Vended Software in Libraries: Oppor...
The  Selection Between An Open Source And Vended Software in Libraries:Oppor...The  Selection Between An Open Source And Vended Software in Libraries:Oppor...
The  Selection Between An Open Source And Vended Software in Libraries: Oppor...
 
Building and using ontologies (2015)
Building and using ontologies (2015)Building and using ontologies (2015)
Building and using ontologies (2015)
 
Shibboleth Access Management Federations and Secure SDI: ESDIN Experience
Shibboleth Access Management Federations and Secure SDI: ESDIN Experience Shibboleth Access Management Federations and Secure SDI: ESDIN Experience
Shibboleth Access Management Federations and Secure SDI: ESDIN Experience
 
Yjs: A Framework for Near Real-time P2P Shared Editing on Arbitrary Data Types
Yjs: A Framework for Near Real-time P2P Shared Editing on Arbitrary Data TypesYjs: A Framework for Near Real-time P2P Shared Editing on Arbitrary Data Types
Yjs: A Framework for Near Real-time P2P Shared Editing on Arbitrary Data Types
 
E Infrastructure for OA
E Infrastructure for OAE Infrastructure for OA
E Infrastructure for OA
 
The role of virtual research environments (VRE's) within the context of an e-...
The role of virtual research environments (VRE's) within the context of an e-...The role of virtual research environments (VRE's) within the context of an e-...
The role of virtual research environments (VRE's) within the context of an e-...
 
Demand Analysis
Demand AnalysisDemand Analysis
Demand Analysis
 

Viewers also liked

HT2016: Influence of Frequency, Recency and Semantic Context on Tag Reuse
HT2016: Influence of Frequency, Recency and Semantic Context on Tag ReuseHT2016: Influence of Frequency, Recency and Semantic Context on Tag Reuse
HT2016: Influence of Frequency, Recency and Semantic Context on Tag Reuse
Dominik Kowald
 
WWW2015 PHD Symposium
WWW2015 PHD SymposiumWWW2015 PHD Symposium
WWW2015 PHD Symposium
Dominik Kowald
 
Contextualized versus Structural Overlapping Communities in Social Media.
Contextualized versus Structural Overlapping Communities in Social Media. Contextualized versus Structural Overlapping Communities in Social Media.
Contextualized versus Structural Overlapping Communities in Social Media.
Mohsen Shahriari
 
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
Nicolaescu Petru
 
A Microservice Approach for Near Real-time Collaborative 3D Object Annotation...
A Microservice Approach for Near Real-time Collaborative 3D Object Annotation...A Microservice Approach for Near Real-time Collaborative 3D Object Annotation...
A Microservice Approach for Near Real-time Collaborative 3D Object Annotation...
Nicolaescu Petru
 
A Community Information System for Ubiquitous Informal Learning Support
A Community Information System for Ubiquitous Informal Learning SupportA Community Information System for Ubiquitous Informal Learning Support
A Community Information System for Ubiquitous Informal Learning Support
Nicolaescu Petru
 
Analysis of Overlapping Communities in Signed Complex Networks
Analysis of Overlapping Communities in Signed Complex NetworksAnalysis of Overlapping Communities in Signed Complex Networks
Analysis of Overlapping Communities in Signed Complex Networks
Mohsen Shahriari
 
EC-TEL 2016: Which Algorithms Suit Which Learning Environments?
EC-TEL 2016: Which Algorithms Suit Which Learning Environments?EC-TEL 2016: Which Algorithms Suit Which Learning Environments?
EC-TEL 2016: Which Algorithms Suit Which Learning Environments?
Simone Kopeinik
 
SeViAnno 2.0: Web-Enabled Collaborative Semantic Video Annotation Beyond the ...
SeViAnno 2.0: Web-Enabled CollaborativeSemantic Video Annotation Beyond the ...SeViAnno 2.0: Web-Enabled CollaborativeSemantic Video Annotation Beyond the ...
SeViAnno 2.0: Web-Enabled Collaborative Semantic Video Annotation Beyond the ...
Nicolaescu Petru
 
DireWolf Goes Pack Hunting: A Peer-to-Peer Approach for Secure Low Latency Wi...
DireWolf Goes Pack Hunting: A Peer-to-Peer Approach for Secure Low Latency Wi...DireWolf Goes Pack Hunting: A Peer-to-Peer Approach for Secure Low Latency Wi...
DireWolf Goes Pack Hunting: A Peer-to-Peer Approach for Secure Low Latency Wi...
IstvanKoren
 
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
Dejan Kovachev
 
Informal Learning at the Workplace via Adaptive Video
Informal Learning at the Workplace via Adaptive VideoInformal Learning at the Workplace via Adaptive Video
Informal Learning at the Workplace via Adaptive Video
Nicolaescu Petru
 
Prototyping Apps for Informal Learning: A Semantic Video Annotation Scenario
Prototyping Apps for Informal Learning: A Semantic Video Annotation ScenarioPrototyping Apps for Informal Learning: A Semantic Video Annotation Scenario
Prototyping Apps for Informal Learning: A Semantic Video Annotation Scenario
Nicolaescu Petru
 
Unpacking the Layers Box
Unpacking the Layers BoxUnpacking the Layers Box
Unpacking the Layers Box
IstvanKoren
 
Requirements Bazaar - Meet your Users on the Web
Requirements Bazaar - Meet your Users on the WebRequirements Bazaar - Meet your Users on the Web
Requirements Bazaar - Meet your Users on the WebIstvanKoren
 
Maximising the opportunities of the online marketplace (South Australia)
Maximising the opportunities of the online marketplace (South Australia)Maximising the opportunities of the online marketplace (South Australia)
Maximising the opportunities of the online marketplace (South Australia)
Zumio
 
Triptourage - online marketplace for tailored trips on-demand
Triptourage - online marketplace for tailored trips on-demandTriptourage - online marketplace for tailored trips on-demand
Triptourage - online marketplace for tailored trips on-demand
Darja Gutnick
 

Viewers also liked (17)

HT2016: Influence of Frequency, Recency and Semantic Context on Tag Reuse
HT2016: Influence of Frequency, Recency and Semantic Context on Tag ReuseHT2016: Influence of Frequency, Recency and Semantic Context on Tag Reuse
HT2016: Influence of Frequency, Recency and Semantic Context on Tag Reuse
 
WWW2015 PHD Symposium
WWW2015 PHD SymposiumWWW2015 PHD Symposium
WWW2015 PHD Symposium
 
Contextualized versus Structural Overlapping Communities in Social Media.
Contextualized versus Structural Overlapping Communities in Social Media. Contextualized versus Structural Overlapping Communities in Social Media.
Contextualized versus Structural Overlapping Communities in Social Media.
 
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
 
A Microservice Approach for Near Real-time Collaborative 3D Object Annotation...
A Microservice Approach for Near Real-time Collaborative 3D Object Annotation...A Microservice Approach for Near Real-time Collaborative 3D Object Annotation...
A Microservice Approach for Near Real-time Collaborative 3D Object Annotation...
 
A Community Information System for Ubiquitous Informal Learning Support
A Community Information System for Ubiquitous Informal Learning SupportA Community Information System for Ubiquitous Informal Learning Support
A Community Information System for Ubiquitous Informal Learning Support
 
Analysis of Overlapping Communities in Signed Complex Networks
Analysis of Overlapping Communities in Signed Complex NetworksAnalysis of Overlapping Communities in Signed Complex Networks
Analysis of Overlapping Communities in Signed Complex Networks
 
EC-TEL 2016: Which Algorithms Suit Which Learning Environments?
EC-TEL 2016: Which Algorithms Suit Which Learning Environments?EC-TEL 2016: Which Algorithms Suit Which Learning Environments?
EC-TEL 2016: Which Algorithms Suit Which Learning Environments?
 
SeViAnno 2.0: Web-Enabled Collaborative Semantic Video Annotation Beyond the ...
SeViAnno 2.0: Web-Enabled CollaborativeSemantic Video Annotation Beyond the ...SeViAnno 2.0: Web-Enabled CollaborativeSemantic Video Annotation Beyond the ...
SeViAnno 2.0: Web-Enabled Collaborative Semantic Video Annotation Beyond the ...
 
DireWolf Goes Pack Hunting: A Peer-to-Peer Approach for Secure Low Latency Wi...
DireWolf Goes Pack Hunting: A Peer-to-Peer Approach for Secure Low Latency Wi...DireWolf Goes Pack Hunting: A Peer-to-Peer Approach for Secure Low Latency Wi...
DireWolf Goes Pack Hunting: A Peer-to-Peer Approach for Secure Low Latency Wi...
 
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
DireWolf - Distributing and Migrating User Interfaces for Widget-based Web Ap...
 
Informal Learning at the Workplace via Adaptive Video
Informal Learning at the Workplace via Adaptive VideoInformal Learning at the Workplace via Adaptive Video
Informal Learning at the Workplace via Adaptive Video
 
Prototyping Apps for Informal Learning: A Semantic Video Annotation Scenario
Prototyping Apps for Informal Learning: A Semantic Video Annotation ScenarioPrototyping Apps for Informal Learning: A Semantic Video Annotation Scenario
Prototyping Apps for Informal Learning: A Semantic Video Annotation Scenario
 
Unpacking the Layers Box
Unpacking the Layers BoxUnpacking the Layers Box
Unpacking the Layers Box
 
Requirements Bazaar - Meet your Users on the Web
Requirements Bazaar - Meet your Users on the WebRequirements Bazaar - Meet your Users on the Web
Requirements Bazaar - Meet your Users on the Web
 
Maximising the opportunities of the online marketplace (South Australia)
Maximising the opportunities of the online marketplace (South Australia)Maximising the opportunities of the online marketplace (South Australia)
Maximising the opportunities of the online marketplace (South Australia)
 
Triptourage - online marketplace for tailored trips on-demand
Triptourage - online marketplace for tailored trips on-demandTriptourage - online marketplace for tailored trips on-demand
Triptourage - online marketplace for tailored trips on-demand
 

Similar to SRS2014: Towards a Scalable Recommender Engine for Online Marketplaces

C19013010 the tutorial to build shared ai services session 1
C19013010  the tutorial to build shared ai services session 1C19013010  the tutorial to build shared ai services session 1
C19013010 the tutorial to build shared ai services session 1
Bill Liu
 
Architectural Considerations for Startups
Architectural Considerations for StartupsArchitectural Considerations for Startups
Architectural Considerations for Startups
Niall Roche
 
Practical model management in the age of Data science and ML
Practical model management in the age of Data science and MLPractical model management in the age of Data science and ML
Practical model management in the age of Data science and ML
QuantUniversity
 
ECIR Recommendation Challenges
ECIR Recommendation ChallengesECIR Recommendation Challenges
ECIR Recommendation Challenges
Daniel Kohlsdorf
 
GraphTalks Hamburg - Einführung in Graphdatenbanken
GraphTalks Hamburg - Einführung in GraphdatenbankenGraphTalks Hamburg - Einführung in Graphdatenbanken
GraphTalks Hamburg - Einführung in Graphdatenbanken
Neo4j
 
Neo4j GraphTalks - Einführung in Graphdatenbanken
Neo4j GraphTalks - Einführung in GraphdatenbankenNeo4j GraphTalks - Einführung in Graphdatenbanken
Neo4j GraphTalks - Einführung in Graphdatenbanken
Neo4j
 
Productionising Machine Learning Models
Productionising Machine Learning ModelsProductionising Machine Learning Models
Productionising Machine Learning Models
Tash Bickley
 
Invento Labs Web 2.0
Invento Labs Web 2.0Invento Labs Web 2.0
Invento Labs Web 2.0
Invento Labs
 
Harnessing Big Data_UCLA
Harnessing Big Data_UCLAHarnessing Big Data_UCLA
Harnessing Big Data_UCLAPaul Barsch
 
Recommendation engines : Matching items to users
Recommendation engines : Matching items to usersRecommendation engines : Matching items to users
Recommendation engines : Matching items to usersjobinwilson
 
Recommendation engines matching items to users
Recommendation engines matching items to usersRecommendation engines matching items to users
Recommendation engines matching items to users
Flytxt
 
Science Gateway Canvas
Science Gateway CanvasScience Gateway Canvas
Science Gateway Canvas
Shayan Shahand
 
Software Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software EngineeringSoftware Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software Engineering
Tao Xie
 
Neo4j GraphDay Tel Aviv - Graphs in Action
Neo4j GraphDay Tel Aviv - Graphs in ActionNeo4j GraphDay Tel Aviv - Graphs in Action
Neo4j GraphDay Tel Aviv - Graphs in Action
Neo4j
 
Agile data science
Agile data scienceAgile data science
Agile data science
Joel Horwitz
 
Vertica Analytics Database general overview
Vertica Analytics Database general overviewVertica Analytics Database general overview
Vertica Analytics Database general overview
Stratebi
 
Tech essentials for Product managers
Tech essentials for Product managersTech essentials for Product managers
Tech essentials for Product managers
Nitin T Bhat
 
Liberating data power of APIs
Liberating data power of APIsLiberating data power of APIs
Liberating data power of APIs
Bala Iyer
 
Product Lines and Ecosystems: from customization to configuration
Product Lines and Ecosystems: from customization to configurationProduct Lines and Ecosystems: from customization to configuration
Product Lines and Ecosystems: from customization to configuration
AdaCore
 
Algorithm Marketplace and the new "Algorithm Economy"
Algorithm Marketplace and the new "Algorithm Economy"Algorithm Marketplace and the new "Algorithm Economy"
Algorithm Marketplace and the new "Algorithm Economy"
Diego Oppenheimer
 

Similar to SRS2014: Towards a Scalable Recommender Engine for Online Marketplaces (20)

C19013010 the tutorial to build shared ai services session 1
C19013010  the tutorial to build shared ai services session 1C19013010  the tutorial to build shared ai services session 1
C19013010 the tutorial to build shared ai services session 1
 
Architectural Considerations for Startups
Architectural Considerations for StartupsArchitectural Considerations for Startups
Architectural Considerations for Startups
 
Practical model management in the age of Data science and ML
Practical model management in the age of Data science and MLPractical model management in the age of Data science and ML
Practical model management in the age of Data science and ML
 
ECIR Recommendation Challenges
ECIR Recommendation ChallengesECIR Recommendation Challenges
ECIR Recommendation Challenges
 
GraphTalks Hamburg - Einführung in Graphdatenbanken
GraphTalks Hamburg - Einführung in GraphdatenbankenGraphTalks Hamburg - Einführung in Graphdatenbanken
GraphTalks Hamburg - Einführung in Graphdatenbanken
 
Neo4j GraphTalks - Einführung in Graphdatenbanken
Neo4j GraphTalks - Einführung in GraphdatenbankenNeo4j GraphTalks - Einführung in Graphdatenbanken
Neo4j GraphTalks - Einführung in Graphdatenbanken
 
Productionising Machine Learning Models
Productionising Machine Learning ModelsProductionising Machine Learning Models
Productionising Machine Learning Models
 
Invento Labs Web 2.0
Invento Labs Web 2.0Invento Labs Web 2.0
Invento Labs Web 2.0
 
Harnessing Big Data_UCLA
Harnessing Big Data_UCLAHarnessing Big Data_UCLA
Harnessing Big Data_UCLA
 
Recommendation engines : Matching items to users
Recommendation engines : Matching items to usersRecommendation engines : Matching items to users
Recommendation engines : Matching items to users
 
Recommendation engines matching items to users
Recommendation engines matching items to usersRecommendation engines matching items to users
Recommendation engines matching items to users
 
Science Gateway Canvas
Science Gateway CanvasScience Gateway Canvas
Science Gateway Canvas
 
Software Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software EngineeringSoftware Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software Engineering
 
Neo4j GraphDay Tel Aviv - Graphs in Action
Neo4j GraphDay Tel Aviv - Graphs in ActionNeo4j GraphDay Tel Aviv - Graphs in Action
Neo4j GraphDay Tel Aviv - Graphs in Action
 
Agile data science
Agile data scienceAgile data science
Agile data science
 
Vertica Analytics Database general overview
Vertica Analytics Database general overviewVertica Analytics Database general overview
Vertica Analytics Database general overview
 
Tech essentials for Product managers
Tech essentials for Product managersTech essentials for Product managers
Tech essentials for Product managers
 
Liberating data power of APIs
Liberating data power of APIsLiberating data power of APIs
Liberating data power of APIs
 
Product Lines and Ecosystems: from customization to configuration
Product Lines and Ecosystems: from customization to configurationProduct Lines and Ecosystems: from customization to configuration
Product Lines and Ecosystems: from customization to configuration
 
Algorithm Marketplace and the new "Algorithm Economy"
Algorithm Marketplace and the new "Algorithm Economy"Algorithm Marketplace and the new "Algorithm Economy"
Algorithm Marketplace and the new "Algorithm Economy"
 

More from Dominik Kowald

AFEL: TagRec Framework
AFEL: TagRec FrameworkAFEL: TagRec Framework
AFEL: TagRec Framework
Dominik Kowald
 
AFEL: Online Study of Tag Recommendations
AFEL: Online Study of Tag RecommendationsAFEL: Online Study of Tag Recommendations
AFEL: Online Study of Tag Recommendations
Dominik Kowald
 
AFEL: Imbalance of Tag Recommendations
AFEL: Imbalance of Tag RecommendationsAFEL: Imbalance of Tag Recommendations
AFEL: Imbalance of Tag Recommendations
Dominik Kowald
 
AFEL-REC: A Recommender System for Providing Learning Resource Recommendation...
AFEL-REC: A Recommender System for Providing Learning Resource Recommendation...AFEL-REC: A Recommender System for Providing Learning Resource Recommendation...
AFEL-REC: A Recommender System for Providing Learning Resource Recommendation...
Dominik Kowald
 
Dominik Kowald PhD Defense Recommender Systems
Dominik Kowald PhD Defense Recommender SystemsDominik Kowald PhD Defense Recommender Systems
Dominik Kowald PhD Defense Recommender Systems
Dominik Kowald
 
Temporal Effects on Hashtag Reuse in Twitter
Temporal Effects on Hashtag Reuse in TwitterTemporal Effects on Hashtag Reuse in Twitter
Temporal Effects on Hashtag Reuse in Twitter
Dominik Kowald
 

More from Dominik Kowald (6)

AFEL: TagRec Framework
AFEL: TagRec FrameworkAFEL: TagRec Framework
AFEL: TagRec Framework
 
AFEL: Online Study of Tag Recommendations
AFEL: Online Study of Tag RecommendationsAFEL: Online Study of Tag Recommendations
AFEL: Online Study of Tag Recommendations
 
AFEL: Imbalance of Tag Recommendations
AFEL: Imbalance of Tag RecommendationsAFEL: Imbalance of Tag Recommendations
AFEL: Imbalance of Tag Recommendations
 
AFEL-REC: A Recommender System for Providing Learning Resource Recommendation...
AFEL-REC: A Recommender System for Providing Learning Resource Recommendation...AFEL-REC: A Recommender System for Providing Learning Resource Recommendation...
AFEL-REC: A Recommender System for Providing Learning Resource Recommendation...
 
Dominik Kowald PhD Defense Recommender Systems
Dominik Kowald PhD Defense Recommender SystemsDominik Kowald PhD Defense Recommender Systems
Dominik Kowald PhD Defense Recommender Systems
 
Temporal Effects on Hashtag Reuse in Twitter
Temporal Effects on Hashtag Reuse in TwitterTemporal Effects on Hashtag Reuse in Twitter
Temporal Effects on Hashtag Reuse in Twitter
 

Recently uploaded

Buy Pinterest Followers, Reactions & Repins Go Viral on Pinterest with Socio...
Buy Pinterest Followers, Reactions & Repins  Go Viral on Pinterest with Socio...Buy Pinterest Followers, Reactions & Repins  Go Viral on Pinterest with Socio...
Buy Pinterest Followers, Reactions & Repins Go Viral on Pinterest with Socio...
SocioCosmos
 
Multilingual SEO Services | Multilingual Keyword Research | Filose
Multilingual SEO Services |  Multilingual Keyword Research | FiloseMultilingual SEO Services |  Multilingual Keyword Research | Filose
Multilingual SEO Services | Multilingual Keyword Research | Filose
madisonsmith478075
 
Unlock TikTok Success with Sociocosmos..
Unlock TikTok Success with Sociocosmos..Unlock TikTok Success with Sociocosmos..
Unlock TikTok Success with Sociocosmos..
SocioCosmos
 
“To be integrated is to feel secure, to feel connected.” The views and experi...
“To be integrated is to feel secure, to feel connected.” The views and experi...“To be integrated is to feel secure, to feel connected.” The views and experi...
“To be integrated is to feel secure, to feel connected.” The views and experi...
AJHSSR Journal
 
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TRELLO
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TRELLOLORRAINE ANDREI_LEQUIGAN_HOW TO USE TRELLO
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TRELLO
lorraineandreiamcidl
 
SluggerPunk Final Angel Investor Proposal
SluggerPunk Final Angel Investor ProposalSluggerPunk Final Angel Investor Proposal
SluggerPunk Final Angel Investor Proposal
grogshiregames
 
7 Tips on Social Media Marketing strategy
7 Tips on Social Media Marketing strategy7 Tips on Social Media Marketing strategy
7 Tips on Social Media Marketing strategy
Digital Marketing Lab
 
SluggerPunk Angel Investor Final Proposal
SluggerPunk Angel Investor Final ProposalSluggerPunk Angel Investor Final Proposal
SluggerPunk Angel Investor Final Proposal
grogshiregames
 
Grow Your Reddit Community Fast.........
Grow Your Reddit Community Fast.........Grow Your Reddit Community Fast.........
Grow Your Reddit Community Fast.........
SocioCosmos
 
Social Media Marketing Strategies .
Social Media Marketing Strategies                     .Social Media Marketing Strategies                     .
Social Media Marketing Strategies .
Virtual Real Design
 
Surat Digital Marketing School - course curriculum
Surat Digital Marketing School - course curriculumSurat Digital Marketing School - course curriculum
Surat Digital Marketing School - course curriculum
digitalcourseshop4
 
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...
AJHSSR Journal
 
Your Path to YouTube Stardom Starts Here
Your Path to YouTube Stardom Starts HereYour Path to YouTube Stardom Starts Here
Your Path to YouTube Stardom Starts Here
SocioCosmos
 

Recently uploaded (13)

Buy Pinterest Followers, Reactions & Repins Go Viral on Pinterest with Socio...
Buy Pinterest Followers, Reactions & Repins  Go Viral on Pinterest with Socio...Buy Pinterest Followers, Reactions & Repins  Go Viral on Pinterest with Socio...
Buy Pinterest Followers, Reactions & Repins Go Viral on Pinterest with Socio...
 
Multilingual SEO Services | Multilingual Keyword Research | Filose
Multilingual SEO Services |  Multilingual Keyword Research | FiloseMultilingual SEO Services |  Multilingual Keyword Research | Filose
Multilingual SEO Services | Multilingual Keyword Research | Filose
 
Unlock TikTok Success with Sociocosmos..
Unlock TikTok Success with Sociocosmos..Unlock TikTok Success with Sociocosmos..
Unlock TikTok Success with Sociocosmos..
 
“To be integrated is to feel secure, to feel connected.” The views and experi...
“To be integrated is to feel secure, to feel connected.” The views and experi...“To be integrated is to feel secure, to feel connected.” The views and experi...
“To be integrated is to feel secure, to feel connected.” The views and experi...
 
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TRELLO
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TRELLOLORRAINE ANDREI_LEQUIGAN_HOW TO USE TRELLO
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TRELLO
 
SluggerPunk Final Angel Investor Proposal
SluggerPunk Final Angel Investor ProposalSluggerPunk Final Angel Investor Proposal
SluggerPunk Final Angel Investor Proposal
 
7 Tips on Social Media Marketing strategy
7 Tips on Social Media Marketing strategy7 Tips on Social Media Marketing strategy
7 Tips on Social Media Marketing strategy
 
SluggerPunk Angel Investor Final Proposal
SluggerPunk Angel Investor Final ProposalSluggerPunk Angel Investor Final Proposal
SluggerPunk Angel Investor Final Proposal
 
Grow Your Reddit Community Fast.........
Grow Your Reddit Community Fast.........Grow Your Reddit Community Fast.........
Grow Your Reddit Community Fast.........
 
Social Media Marketing Strategies .
Social Media Marketing Strategies                     .Social Media Marketing Strategies                     .
Social Media Marketing Strategies .
 
Surat Digital Marketing School - course curriculum
Surat Digital Marketing School - course curriculumSurat Digital Marketing School - course curriculum
Surat Digital Marketing School - course curriculum
 
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...
 
Your Path to YouTube Stardom Starts Here
Your Path to YouTube Stardom Starts HereYour Path to YouTube Stardom Starts Here
Your Path to YouTube Stardom Starts Here
 

SRS2014: Towards a Scalable Recommender Engine for Online Marketplaces

  • 1. http://Learning-Layers-euhttp://Learning-Layers-eu Learning Layers Scaling up Technologies for Informal Learning in SME Clusters Towards a Scalable Social Recommender Engine for Online Marketplaces The Case of Apache Solr Emanuel Lacic, Dominik Kowald, Denis Parra, Christoph Trattner 1
  • 2. http://Learning-Layers-eu Many thanks to Emanuel Lacic elacic@know-center.at Graz University of Technology Austria 2 Denis Parra dparra@ing.puc.cl Pontificia Universidad Catolica Chile Christoph Trattner ctrattner@know-center.at Know-Center Graz Austria
  • 3. http://Learning-Layers-eu What will this talk be about? • (Real-time) product recommendations for online marketplaces • Scalability of recommender systems • Utilizing social network data for the recommendations of products to people 3
  • 4. http://Learning-Layers-eu How did this work start? • Joint project with the Austrian start-up Blanc-Noir • Personalized product recommender for online marketplaces based on – Actions in the marketplaces (e.g., Ebay, Amazon) – Product information – Social network data (e.g., Facebook, G+) – Filter criteria • Provided at (near) real-time! … especially if there is a lot of data … together with many data updates 4
  • 5. http://Learning-Layers-eu So now, how we have solved that issue? 5
  • 6. http://Learning-Layers-eu What‘s available out there? • Frameworks/approaches for scalable recommendations – Distributed data processing • Apache Hadoop / Mahout (map/reduce paradigm) – Relational databases • MySQL, PostgreSQL (e.g., RecDB project) – Collaborative Filtering improvements • Matrix factorization • Lack of a framework / approach that combines all things we need 6
  • 8. http://Learning-Layers-eu Why Solr? • „High-performance, full-featured text search engine library“ … but more precise … • „High-performance, fully-featured token matching and scoring library“ [Grainger, 2012] … which provides …. – full-text searches (content-based) – powerful queries (e.g., MoreLikeThis or Facets) – (near) real-time data updates (no pre/re-calculations) – easy schema updates (social data integration) • Established open-source software (Apache license) with big community 8
  • 10. http://Learning-Layers-eu How does the thing perform? • Dataset of virtual world SecondLife – Marketplace and social data 10
  • 11. http://Learning-Layers-eu What‘s about the marketplace and social data features? 11 0 0.05 0.1 0.15 0.2 Purchases Categories Title Description Interests Groups Likes Comments Interactions nDCG Data Features 0 0.1 0.2 0.3 0.4 0.5 Purchases Categories Title Description Interests Groups Likes Comments Interactions UserCoverage Data Features • Both types of data are important for the recommender quality and user coverage
  • 12. http://Learning-Layers-eu What‘s about the hybrids? 12 0 0.01 0.02 0.03 0.04 0.05 MP CCFm CFs ALL nDCG Recommendation Algorithms 0 0.2 0.4 0.6 0.8 1 MP CCFm CFs ALL UserCoverage Data Features • The hybrid approach provides a good trade-off of recommender quality and user coverage
  • 13. http://Learning-Layers-eu What‘s about the scalability? 13 • Recommendations can be provided in (near) real-time in both cases (with and without data update)
  • 14. http://Learning-Layers-eu What we have shown! • Apache Solr is more than a search engine! • Actually it is a great framework to implement a scalable recommender engine for online marketplaces • Near real-time recommendations through build-in query-functions • Near real-time data updates • Easy integration of social data + a high-performance full-text search engine for free! • Evaluation on dataset gathered from SecondLife • Different marketplace and social data features are important • Hybrid approaches produce more robust recommendations • It scales! 14
  • 15. http://Learning-Layers-eu What do we want to do in the future? 15 • Online study together with BlancNoir with “real” data • Impact of geo-spatial data • Impact of temporal data (see WebScience track) • Comparative study with other backend solutions (e.g., ElasticSearch)
  • 16. http://Learning-Layers-eu Thank you for your attention! Code and framework: https://github.com/learning-layers/SocRec Questions? Dominik Kowald dkowald@know-center.at Know-Center Graz University of Technology (Austria) 16
  • 18. http://Learning-Layers-eu Short hands-on session • Collaborative Filtering • Content-Based 18
  • 20. http://Learning-Layers-eu How to Use the Engine? • Implement and run a new recommender 20
  • 21. http://Learning-Layers-eu Recommendation Algorithms implemented in the Engine • MostPopular (MP) – Recommends for any user the most purchased items • Collaborative Filtering (CF) – Find similar users (k nearest neighbors) and recommend novel items of those users [Schafer et al., 2007] – In Solr: select queries and facet counts • Content-Based (C) – Analyze item meta-data to find similar items [Pazzani et al., 2007] – In Solr: MoreLikeThis function • Hybrid (CCF) – Combine different algorithms to overcome their individual limitations [Burke et al., 2002] – Each algorithm can be weighted / tuned according to its performance 21