SlideShare a Scribd company logo
1 of 1
Hybrid Biased k-NN to Predict Movie Tweets Popularity 
Ladislav Peška 
Department of Software Engineering 
Charles University in Prague 
Malostranske namesti 25, Prague, Czech Republic 
Peter Vojtáš 
Department of Software Engineering 
Charles University in Prague 
Malostranske namesti 25, Prague, Czech Republic 
ABSTRACT 
In this paper we describe approach of our SemWexMFF group to 
the RecSys Challenge 2014. Target of the challenge was to predict 
level of user engagement on tweets generated automatically from 
IMDB. 
During experiments we have tested several state-of-the-art 
prediction techniques and proposed a variant of item based k-NN 
algorithm, which better reflects user engagement and nature of 
the movie domain content-based attributes. 
Our final solution (placed in the midfield of the challenge leader 
board) is an aggregation of several runs of this algorithm. 
OUR APPROACH 
Our approach follows two main hypothesis: 
1. Engagement of similar objects should be similar 
2. Engagement depends on neighborhood (friends and 
followers) of current user 
Similarity of objects is based on their content-based attributes: 
Numeric attributes: normalized distance 
푠푖푚푥,푦 ,푚푎푥퐷푖푠푡 = max⁡ 0, 
푚푎푥퐷푖푠푡 − 푥 − 푦 
푚푎푥퐷푖푠푡 
String attributes: relative Levenshtein distance 
푠푖푚푥,푦 = 1 −⁡ 
푙푒푣푒푛푠ℎ푡푒푖푛(푥, 푦) 
max⁡(푙푒푛푔ℎ푡 푥 , 푙푒푛푔ℎ푡(푦)) 
Nominal/Set attributes: Jaccard similarity 
푠푖푚퐱,퐲 = | 퐱 ∩ 퐲 | | 퐱 ∪ 퐲 | 
Effect of user’s friends and followers was approximated by user 
bias. 
Movie attributes were queried from OMDB API: 
- average rating, number of awards, IMDB metascore 
- number of ratings 
- movie name, release date, genre, country, language, director, actors 
RESULTS 
HYBRID BIASED k-NN 
For tweet tID, its movie mID and fixed k, the algorithm first 
compute similarities to other movies and selects k most 
similar movies. Then for each tweet about the movie the 
predicted ranking 푟 is increased according to similarity 푠 , 
user engagement r and bias of the tweeting user. The bias 
of the current movie is added in the final 푟 prediction too. 
function HybridBiasedKNN(tID, mID , k){ 
푟 = 0; 
/*compute similarity for all movies */ 
foreach(mID ϵ TrainSet){ 
S[mID ] = similarity(mID , mID ); 
} 
S = getKMostSimilar(S,k); 
/*get all tweets about movies in S */ 
foreach({uID, mID, r, 푠 }: 
{uID, mID, r} ϵ TrainSet && S [mID]= 푠 ){ 
푟 += 푠 * r / (bias(uID) + ε ); 
} 
푟 = bias(mID ) + (푟 / sum(푠 )) 
return 푟 ; 
} 
Braveheart 
TID: 421065455743541248 
UID: 25813709 
The Patriot 
8.4 
7.1 
1995 
2000 
68 
63 
Action, Biography, 
Drama 
Action, Drama, 
War 
Mel Gibson; James Robinson; Sean 
Lawlor; Sandy Nelson; James Cosmo 
Mel Gibson; Heath Ledger; Joely 
Richardson; Jason Isaacs 
Rating Year 
IMDB 
metascore 
Genre Actors 
TID: 410808483345465344 
UID: 307867510 
Engagement: 0 
TID: 421040870931320833 
UID: 296041028 
Engagement: 3 … 
AVG Eng.: 0.0 
AVG Eng.: 0.3250 
AVG Eng.: 0.024 
AVG Eng.: 0.001 
Results of state-of-the-art methods 
Method nDCG 
Random predictions 0.7482 
Bi-Polar Slope One 0.7652 
Factor Wise Matrix Factorization 0.7556 
Item-Item k-NN 0.7604 
Decision Tree 0.7494 
Support Vector Machines (SVM) 0.8057 
Results of Hybrid k-NN using only one attribute 
Method nDCG Method nDCG 
AVG rating 0.7918 Genres 0.7919 
Awards 0.7652 Countries 0.7984 
IMDB Metascore 0.8057 Languages 0.8005 
Number of ratings 0.7964 Director 0.8029 
Movie name 0.7947 Actors 0.7930 
Release year 0.7962 
Results of Hybrid k-NN combining more attributes 
Method nDCG 
Hybrid k-nn (Metascore, Language, Director, Country, 
Date, # of ratings) 
0.7927 
Hybrid k-nn(Metascore, Language, Director, Country, 
Date, # of ratings), no bias 
0.7792 
Linear Regression (Metascore, Language, Director, 
Country, Date, # of ratings) 
0.7913 
AVG (Metascore, Language, Director, Country, Date, 
# of ratings), omit best and worst prediction 
0.8134 
LESSONS LEARNED and POSSIBLE EXTENSIONS 
Both hypothesis on which we based our solution seems to be confirmed. Omitting user bias lead to severe decrease of algorithm success 
metrics and almost all content-based attributes proved to be quite good measure of movie similarity. 
Hybrid biased k-NN outperformed all considered state-of-the-art machine learning methods, however our results were placed in lower 
midfield of the challenge. Several extensions to the current approach is possible, namely: 
-Considering temporal dependance in the dataset 
-Using some of the tweet characteristics, additional content-based attributes e.g. from DBPedia or some other meta-learning methods

More Related Content

What's hot

An Slight Overview of the Critical Elements of Spatial Statistics
An Slight Overview of the Critical Elements of Spatial StatisticsAn Slight Overview of the Critical Elements of Spatial Statistics
An Slight Overview of the Critical Elements of Spatial StatisticsTony Fast
 
Object Segmentation (D2L7 Insight@DCU Machine Learning Workshop 2017)
Object Segmentation (D2L7 Insight@DCU Machine Learning Workshop 2017)Object Segmentation (D2L7 Insight@DCU Machine Learning Workshop 2017)
Object Segmentation (D2L7 Insight@DCU Machine Learning Workshop 2017)Universitat Politècnica de Catalunya
 
Content-based image retrieval using a mobile device as a novel interface
Content-based image retrieval using a mobile device as a novel interfaceContent-based image retrieval using a mobile device as a novel interface
Content-based image retrieval using a mobile device as a novel interfaceJonathon Hare
 
LIMSI @ MediaEval SED 2014
LIMSI @ MediaEval SED 2014LIMSI @ MediaEval SED 2014
LIMSI @ MediaEval SED 2014multimediaeval
 
Interpretability of Convolutional Neural Networks - Eva Mohedano - UPC Barcel...
Interpretability of Convolutional Neural Networks - Eva Mohedano - UPC Barcel...Interpretability of Convolutional Neural Networks - Eva Mohedano - UPC Barcel...
Interpretability of Convolutional Neural Networks - Eva Mohedano - UPC Barcel...Universitat Politècnica de Catalunya
 
A Scalable Dataflow Implementation of Curran's Approximation Algorithm
A Scalable Dataflow Implementation of Curran's Approximation AlgorithmA Scalable Dataflow Implementation of Curran's Approximation Algorithm
A Scalable Dataflow Implementation of Curran's Approximation AlgorithmNECST Lab @ Politecnico di Milano
 
Generative Adversarial Networks and Their Applications
Generative Adversarial Networks and Their ApplicationsGenerative Adversarial Networks and Their Applications
Generative Adversarial Networks and Their ApplicationsArtifacia
 
Object Detection Using R-CNN Deep Learning Framework
Object Detection Using R-CNN Deep Learning FrameworkObject Detection Using R-CNN Deep Learning Framework
Object Detection Using R-CNN Deep Learning FrameworkNader Karimi
 
Deep Learning for Graphs
Deep Learning for GraphsDeep Learning for Graphs
Deep Learning for GraphsDeepLearningBlr
 
Algorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyAlgorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyChris Johnson
 
Introduction to object detection
Introduction to object detectionIntroduction to object detection
Introduction to object detectionBrodmann17
 
Introduction to Generative Adversarial Networks
Introduction to Generative Adversarial NetworksIntroduction to Generative Adversarial Networks
Introduction to Generative Adversarial NetworksBennoG1
 
DiscoGAN - Learning to Discover Cross-Domain Relations with Generative Advers...
DiscoGAN - Learning to Discover Cross-Domain Relations with Generative Advers...DiscoGAN - Learning to Discover Cross-Domain Relations with Generative Advers...
DiscoGAN - Learning to Discover Cross-Domain Relations with Generative Advers...Taeksoo Kim
 

What's hot (20)

An Slight Overview of the Critical Elements of Spatial Statistics
An Slight Overview of the Critical Elements of Spatial StatisticsAn Slight Overview of the Critical Elements of Spatial Statistics
An Slight Overview of the Critical Elements of Spatial Statistics
 
SEGAN: Speech Enhancement Generative Adversarial Network
SEGAN: Speech Enhancement Generative Adversarial NetworkSEGAN: Speech Enhancement Generative Adversarial Network
SEGAN: Speech Enhancement Generative Adversarial Network
 
Object Segmentation (D2L7 Insight@DCU Machine Learning Workshop 2017)
Object Segmentation (D2L7 Insight@DCU Machine Learning Workshop 2017)Object Segmentation (D2L7 Insight@DCU Machine Learning Workshop 2017)
Object Segmentation (D2L7 Insight@DCU Machine Learning Workshop 2017)
 
Content-based image retrieval using a mobile device as a novel interface
Content-based image retrieval using a mobile device as a novel interfaceContent-based image retrieval using a mobile device as a novel interface
Content-based image retrieval using a mobile device as a novel interface
 
LIMSI @ MediaEval SED 2014
LIMSI @ MediaEval SED 2014LIMSI @ MediaEval SED 2014
LIMSI @ MediaEval SED 2014
 
Interpretability of Convolutional Neural Networks - Eva Mohedano - UPC Barcel...
Interpretability of Convolutional Neural Networks - Eva Mohedano - UPC Barcel...Interpretability of Convolutional Neural Networks - Eva Mohedano - UPC Barcel...
Interpretability of Convolutional Neural Networks - Eva Mohedano - UPC Barcel...
 
A Scalable Dataflow Implementation of Curran's Approximation Algorithm
A Scalable Dataflow Implementation of Curran's Approximation AlgorithmA Scalable Dataflow Implementation of Curran's Approximation Algorithm
A Scalable Dataflow Implementation of Curran's Approximation Algorithm
 
Video Object Segmentation - Laura Leal-Taixé - UPC Barcelona 2018
Video Object Segmentation - Laura Leal-Taixé - UPC Barcelona 2018Video Object Segmentation - Laura Leal-Taixé - UPC Barcelona 2018
Video Object Segmentation - Laura Leal-Taixé - UPC Barcelona 2018
 
Generative Adversarial Networks and Their Applications
Generative Adversarial Networks and Their ApplicationsGenerative Adversarial Networks and Their Applications
Generative Adversarial Networks and Their Applications
 
Object Detection - Míriam Bellver - UPC Barcelona 2018
Object Detection - Míriam Bellver - UPC Barcelona 2018Object Detection - Míriam Bellver - UPC Barcelona 2018
Object Detection - Míriam Bellver - UPC Barcelona 2018
 
Object Detection Using R-CNN Deep Learning Framework
Object Detection Using R-CNN Deep Learning FrameworkObject Detection Using R-CNN Deep Learning Framework
Object Detection Using R-CNN Deep Learning Framework
 
Semantic Segmentation - Míriam Bellver - UPC Barcelona 2018
Semantic Segmentation - Míriam Bellver - UPC Barcelona 2018Semantic Segmentation - Míriam Bellver - UPC Barcelona 2018
Semantic Segmentation - Míriam Bellver - UPC Barcelona 2018
 
Deep Learning for Graphs
Deep Learning for GraphsDeep Learning for Graphs
Deep Learning for Graphs
 
Deep 3D Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2018
Deep 3D Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2018Deep 3D Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2018
Deep 3D Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2018
 
Algorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyAlgorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at Spotify
 
Introduction to object detection
Introduction to object detectionIntroduction to object detection
Introduction to object detection
 
Introduction to Generative Adversarial Networks
Introduction to Generative Adversarial NetworksIntroduction to Generative Adversarial Networks
Introduction to Generative Adversarial Networks
 
Instance Segmentation - Míriam Bellver - UPC Barcelona 2018
Instance Segmentation - Míriam Bellver - UPC Barcelona 2018Instance Segmentation - Míriam Bellver - UPC Barcelona 2018
Instance Segmentation - Míriam Bellver - UPC Barcelona 2018
 
Generative adversarial text to image synthesis
Generative adversarial text to image synthesisGenerative adversarial text to image synthesis
Generative adversarial text to image synthesis
 
DiscoGAN - Learning to Discover Cross-Domain Relations with Generative Advers...
DiscoGAN - Learning to Discover Cross-Domain Relations with Generative Advers...DiscoGAN - Learning to Discover Cross-Domain Relations with Generative Advers...
DiscoGAN - Learning to Discover Cross-Domain Relations with Generative Advers...
 

Viewers also liked

A Comparison of Propositionalization Strategies for Creating Features from Li...
A Comparison of Propositionalization Strategies for Creating Features from Li...A Comparison of Propositionalization Strategies for Creating Features from Li...
A Comparison of Propositionalization Strategies for Creating Features from Li...Petar Ristoski
 
A novel architecture of rns based
A novel architecture of rns basedA novel architecture of rns based
A novel architecture of rns basedVLSICS Design
 
Essentials Every Non-Technical Person Need To Know To Build The Best Tech-Tea...
Essentials Every Non-Technical Person Need To Know To Build The Best Tech-Tea...Essentials Every Non-Technical Person Need To Know To Build The Best Tech-Tea...
Essentials Every Non-Technical Person Need To Know To Build The Best Tech-Tea...itnig
 
Autotransplantation of Tooth in Children
Autotransplantation of Tooth in Children Autotransplantation of Tooth in Children
Autotransplantation of Tooth in Children Abu-Hussein Muhamad
 
MW 2012: Online Scholarly Catalogues at the Art Institute of Chicago
MW 2012: Online Scholarly Catalogues at the Art Institute of ChicagoMW 2012: Online Scholarly Catalogues at the Art Institute of Chicago
MW 2012: Online Scholarly Catalogues at the Art Institute of ChicagoElizabeth Neely
 
Jazmin Dervishali - Ravensbourne School
Jazmin Dervishali - Ravensbourne SchoolJazmin Dervishali - Ravensbourne School
Jazmin Dervishali - Ravensbourne SchoolBritish Council
 
1 25 What Is P H
1 25 What Is P H1 25 What Is P H
1 25 What Is P Hmrheffner
 
Final coer presentation for progressive hall greenbank
Final coer presentation for progressive hall greenbankFinal coer presentation for progressive hall greenbank
Final coer presentation for progressive hall greenbankCitizensoftheReserve
 
Rb vs wp 로드타임 성능비교
Rb vs wp 로드타임 성능비교Rb vs wp 로드타임 성능비교
Rb vs wp 로드타임 성능비교Gitaek kwon
 
멀티플랫폼 앱 개발과 테스팅
멀티플랫폼 앱 개발과 테스팅멀티플랫폼 앱 개발과 테스팅
멀티플랫폼 앱 개발과 테스팅WooKyoung Noh
 
urgent sale orris carnation sector-85 gurgaon ,size-1485@4000 per sq.ft more ...
urgent sale orris carnation sector-85 gurgaon ,size-1485@4000 per sq.ft more ...urgent sale orris carnation sector-85 gurgaon ,size-1485@4000 per sq.ft more ...
urgent sale orris carnation sector-85 gurgaon ,size-1485@4000 per sq.ft more ...AADHAR HOMES
 
[하나투어] 골프투어 상품안내 동남아
[하나투어] 골프투어 상품안내 동남아[하나투어] 골프투어 상품안내 동남아
[하나투어] 골프투어 상품안내 동남아Mal-Yong Yoon
 
터키여행 제안자료
터키여행 제안자료터키여행 제안자료
터키여행 제안자료Mal-Yong Yoon
 
잡코리아 글로벌 프런티어 6기_WHISPER_탐방 보고서
잡코리아 글로벌 프런티어 6기_WHISPER_탐방 보고서잡코리아 글로벌 프런티어 6기_WHISPER_탐방 보고서
잡코리아 글로벌 프런티어 6기_WHISPER_탐방 보고서잡코리아 글로벌 프런티어
 
[NDC14] 라이브중인 2D게임에 시스템 변경 없이 본 애니메이션 도입하기[던전앤파이터]
[NDC14] 라이브중인 2D게임에 시스템 변경 없이 본 애니메이션 도입하기[던전앤파이터][NDC14] 라이브중인 2D게임에 시스템 변경 없이 본 애니메이션 도입하기[던전앤파이터]
[NDC14] 라이브중인 2D게임에 시스템 변경 없이 본 애니메이션 도입하기[던전앤파이터]SeungWon Lee
 
User Engagement as Evaluation: a Ranking or a Regression Problem?
User Engagement as Evaluation: a Ranking or a Regression Problem?User Engagement as Evaluation: a Ranking or a Regression Problem?
User Engagement as Evaluation: a Ranking or a Regression Problem?Frédéric Guillou
 
YNT@Ardis Parent meeting 9/1/2010
YNT@Ardis Parent meeting 9/1/2010YNT@Ardis Parent meeting 9/1/2010
YNT@Ardis Parent meeting 9/1/2010msalzer
 
The unstoppable stallionx
The unstoppable stallionxThe unstoppable stallionx
The unstoppable stallionxKids Work
 
Sumida Wu Lehrer Jbc 283 2008 6728
Sumida Wu Lehrer Jbc 283 2008 6728Sumida Wu Lehrer Jbc 283 2008 6728
Sumida Wu Lehrer Jbc 283 2008 6728jpsumida
 

Viewers also liked (20)

A Comparison of Propositionalization Strategies for Creating Features from Li...
A Comparison of Propositionalization Strategies for Creating Features from Li...A Comparison of Propositionalization Strategies for Creating Features from Li...
A Comparison of Propositionalization Strategies for Creating Features from Li...
 
A novel architecture of rns based
A novel architecture of rns basedA novel architecture of rns based
A novel architecture of rns based
 
Essentials Every Non-Technical Person Need To Know To Build The Best Tech-Tea...
Essentials Every Non-Technical Person Need To Know To Build The Best Tech-Tea...Essentials Every Non-Technical Person Need To Know To Build The Best Tech-Tea...
Essentials Every Non-Technical Person Need To Know To Build The Best Tech-Tea...
 
Autotransplantation of Tooth in Children
Autotransplantation of Tooth in Children Autotransplantation of Tooth in Children
Autotransplantation of Tooth in Children
 
MW 2012: Online Scholarly Catalogues at the Art Institute of Chicago
MW 2012: Online Scholarly Catalogues at the Art Institute of ChicagoMW 2012: Online Scholarly Catalogues at the Art Institute of Chicago
MW 2012: Online Scholarly Catalogues at the Art Institute of Chicago
 
Jazmin Dervishali - Ravensbourne School
Jazmin Dervishali - Ravensbourne SchoolJazmin Dervishali - Ravensbourne School
Jazmin Dervishali - Ravensbourne School
 
1 25 What Is P H
1 25 What Is P H1 25 What Is P H
1 25 What Is P H
 
Final coer presentation for progressive hall greenbank
Final coer presentation for progressive hall greenbankFinal coer presentation for progressive hall greenbank
Final coer presentation for progressive hall greenbank
 
Rb vs wp 로드타임 성능비교
Rb vs wp 로드타임 성능비교Rb vs wp 로드타임 성능비교
Rb vs wp 로드타임 성능비교
 
거의모든It 3장
거의모든It 3장거의모든It 3장
거의모든It 3장
 
멀티플랫폼 앱 개발과 테스팅
멀티플랫폼 앱 개발과 테스팅멀티플랫폼 앱 개발과 테스팅
멀티플랫폼 앱 개발과 테스팅
 
urgent sale orris carnation sector-85 gurgaon ,size-1485@4000 per sq.ft more ...
urgent sale orris carnation sector-85 gurgaon ,size-1485@4000 per sq.ft more ...urgent sale orris carnation sector-85 gurgaon ,size-1485@4000 per sq.ft more ...
urgent sale orris carnation sector-85 gurgaon ,size-1485@4000 per sq.ft more ...
 
[하나투어] 골프투어 상품안내 동남아
[하나투어] 골프투어 상품안내 동남아[하나투어] 골프투어 상품안내 동남아
[하나투어] 골프투어 상품안내 동남아
 
터키여행 제안자료
터키여행 제안자료터키여행 제안자료
터키여행 제안자료
 
잡코리아 글로벌 프런티어 6기_WHISPER_탐방 보고서
잡코리아 글로벌 프런티어 6기_WHISPER_탐방 보고서잡코리아 글로벌 프런티어 6기_WHISPER_탐방 보고서
잡코리아 글로벌 프런티어 6기_WHISPER_탐방 보고서
 
[NDC14] 라이브중인 2D게임에 시스템 변경 없이 본 애니메이션 도입하기[던전앤파이터]
[NDC14] 라이브중인 2D게임에 시스템 변경 없이 본 애니메이션 도입하기[던전앤파이터][NDC14] 라이브중인 2D게임에 시스템 변경 없이 본 애니메이션 도입하기[던전앤파이터]
[NDC14] 라이브중인 2D게임에 시스템 변경 없이 본 애니메이션 도입하기[던전앤파이터]
 
User Engagement as Evaluation: a Ranking or a Regression Problem?
User Engagement as Evaluation: a Ranking or a Regression Problem?User Engagement as Evaluation: a Ranking or a Regression Problem?
User Engagement as Evaluation: a Ranking or a Regression Problem?
 
YNT@Ardis Parent meeting 9/1/2010
YNT@Ardis Parent meeting 9/1/2010YNT@Ardis Parent meeting 9/1/2010
YNT@Ardis Parent meeting 9/1/2010
 
The unstoppable stallionx
The unstoppable stallionxThe unstoppable stallionx
The unstoppable stallionx
 
Sumida Wu Lehrer Jbc 283 2008 6728
Sumida Wu Lehrer Jbc 283 2008 6728Sumida Wu Lehrer Jbc 283 2008 6728
Sumida Wu Lehrer Jbc 283 2008 6728
 

Similar to RecSys Challenge 2014, SemWexMFF group

Real-world News Recommender Systems
Real-world News Recommender SystemsReal-world News Recommender Systems
Real-world News Recommender Systemskib_83
 
Download
DownloadDownload
Downloadbutest
 
Download
DownloadDownload
Downloadbutest
 
Neo4j graphdatabaseforrecommendations-130531021030-phpapp02-converted
Neo4j graphdatabaseforrecommendations-130531021030-phpapp02-convertedNeo4j graphdatabaseforrecommendations-130531021030-phpapp02-converted
Neo4j graphdatabaseforrecommendations-130531021030-phpapp02-convertedsnehapandey01
 
movie recommender system using vectorization and SVD tech
movie recommender system using vectorization and SVD techmovie recommender system using vectorization and SVD tech
movie recommender system using vectorization and SVD techUddeshBhagat
 
R markup code to create Regression Model
R markup code to create Regression ModelR markup code to create Regression Model
R markup code to create Regression ModelMohit Rajput
 
Neo4j - graph database for recommendations
Neo4j - graph database for recommendationsNeo4j - graph database for recommendations
Neo4j - graph database for recommendationsproksik
 
Hands on Training – Graph Database with Neo4j
Hands on Training – Graph Database with Neo4jHands on Training – Graph Database with Neo4j
Hands on Training – Graph Database with Neo4jSerendio Inc.
 
Rokach-GomaxSlides.pptx
Rokach-GomaxSlides.pptxRokach-GomaxSlides.pptx
Rokach-GomaxSlides.pptxJadna Almeida
 
Rokach-GomaxSlides (1).pptx
Rokach-GomaxSlides (1).pptxRokach-GomaxSlides (1).pptx
Rokach-GomaxSlides (1).pptxJadna Almeida
 
[한국어] Safe Multi-Agent Reinforcement Learning for Autonomous Driving
[한국어] Safe Multi-Agent Reinforcement Learning for Autonomous Driving[한국어] Safe Multi-Agent Reinforcement Learning for Autonomous Driving
[한국어] Safe Multi-Agent Reinforcement Learning for Autonomous DrivingKiho Suh
 
SVD and the Netflix Dataset
SVD and the Netflix DatasetSVD and the Netflix Dataset
SVD and the Netflix DatasetBen Mabey
 
JPEG XR objective and subjective evaluations
JPEG XR objective and subjective evaluationsJPEG XR objective and subjective evaluations
JPEG XR objective and subjective evaluationsTouradj Ebrahimi
 
Mahout Taste Engine
Mahout Taste EngineMahout Taste Engine
Mahout Taste Enginegirisv
 
MOVIE RECOMMENDATION SYSTEM.pptx
MOVIE RECOMMENDATION SYSTEM.pptxMOVIE RECOMMENDATION SYSTEM.pptx
MOVIE RECOMMENDATION SYSTEM.pptxAyushkumar417871
 
acmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptxacmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptxdongchangim30
 
Movie lens movie recommendation system
Movie lens movie recommendation systemMovie lens movie recommendation system
Movie lens movie recommendation systemGaurav Sawant
 

Similar to RecSys Challenge 2014, SemWexMFF group (20)

Domainspecificsubgraph extraction ieee-bigdata2016
Domainspecificsubgraph extraction ieee-bigdata2016Domainspecificsubgraph extraction ieee-bigdata2016
Domainspecificsubgraph extraction ieee-bigdata2016
 
Real-world News Recommender Systems
Real-world News Recommender SystemsReal-world News Recommender Systems
Real-world News Recommender Systems
 
Download
DownloadDownload
Download
 
Download
DownloadDownload
Download
 
Neo4j graphdatabaseforrecommendations-130531021030-phpapp02-converted
Neo4j graphdatabaseforrecommendations-130531021030-phpapp02-convertedNeo4j graphdatabaseforrecommendations-130531021030-phpapp02-converted
Neo4j graphdatabaseforrecommendations-130531021030-phpapp02-converted
 
movie recommender system using vectorization and SVD tech
movie recommender system using vectorization and SVD techmovie recommender system using vectorization and SVD tech
movie recommender system using vectorization and SVD tech
 
R markup code to create Regression Model
R markup code to create Regression ModelR markup code to create Regression Model
R markup code to create Regression Model
 
Neo4j - graph database for recommendations
Neo4j - graph database for recommendationsNeo4j - graph database for recommendations
Neo4j - graph database for recommendations
 
Hands on Training – Graph Database with Neo4j
Hands on Training – Graph Database with Neo4jHands on Training – Graph Database with Neo4j
Hands on Training – Graph Database with Neo4j
 
Rokach-GomaxSlides.pptx
Rokach-GomaxSlides.pptxRokach-GomaxSlides.pptx
Rokach-GomaxSlides.pptx
 
Rokach-GomaxSlides (1).pptx
Rokach-GomaxSlides (1).pptxRokach-GomaxSlides (1).pptx
Rokach-GomaxSlides (1).pptx
 
Cs583 recommender-systems
Cs583 recommender-systemsCs583 recommender-systems
Cs583 recommender-systems
 
[한국어] Safe Multi-Agent Reinforcement Learning for Autonomous Driving
[한국어] Safe Multi-Agent Reinforcement Learning for Autonomous Driving[한국어] Safe Multi-Agent Reinforcement Learning for Autonomous Driving
[한국어] Safe Multi-Agent Reinforcement Learning for Autonomous Driving
 
SSD: Single Shot MultiBox Detector (UPC Reading Group)
SSD: Single Shot MultiBox Detector (UPC Reading Group)SSD: Single Shot MultiBox Detector (UPC Reading Group)
SSD: Single Shot MultiBox Detector (UPC Reading Group)
 
SVD and the Netflix Dataset
SVD and the Netflix DatasetSVD and the Netflix Dataset
SVD and the Netflix Dataset
 
JPEG XR objective and subjective evaluations
JPEG XR objective and subjective evaluationsJPEG XR objective and subjective evaluations
JPEG XR objective and subjective evaluations
 
Mahout Taste Engine
Mahout Taste EngineMahout Taste Engine
Mahout Taste Engine
 
MOVIE RECOMMENDATION SYSTEM.pptx
MOVIE RECOMMENDATION SYSTEM.pptxMOVIE RECOMMENDATION SYSTEM.pptx
MOVIE RECOMMENDATION SYSTEM.pptx
 
acmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptxacmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptx
 
Movie lens movie recommendation system
Movie lens movie recommendation systemMovie lens movie recommendation system
Movie lens movie recommendation system
 

More from Ladislav Peska

Fuzzy D’Hondt’s Algorithm for On-line Recommendations Aggregation
Fuzzy D’Hondt’s Algorithm for On-line Recommendations AggregationFuzzy D’Hondt’s Algorithm for On-line Recommendations Aggregation
Fuzzy D’Hondt’s Algorithm for On-line Recommendations AggregationLadislav Peska
 
LineIT: Similarity search and recommendations for photo lineup assembling
LineIT: Similarity search and recommendations for photo lineup assemblingLineIT: Similarity search and recommendations for photo lineup assembling
LineIT: Similarity search and recommendations for photo lineup assemblingLadislav Peska
 
Towards Similarity Models in Police Photo Lineup Assembling Tasks
Towards Similarity Models in Police Photo Lineup Assembling TasksTowards Similarity Models in Police Photo Lineup Assembling Tasks
Towards Similarity Models in Police Photo Lineup Assembling TasksLadislav Peska
 
Off-line vs. On-line Evaluation of Recommender Systems in Small E-commerce
Off-line vs. On-line Evaluation of Recommender Systems in Small E-commerceOff-line vs. On-line Evaluation of Recommender Systems in Small E-commerce
Off-line vs. On-line Evaluation of Recommender Systems in Small E-commerceLadislav Peska
 
Towards Recommender Systems for Police Photo Lineup
Towards Recommender Systems for Police Photo LineupTowards Recommender Systems for Police Photo Lineup
Towards Recommender Systems for Police Photo LineupLadislav Peska
 
Linking Content Information with Bayesian Personalized Ranking via Multiple C...
Linking Content Information with Bayesian Personalized Ranking via Multiple C...Linking Content Information with Bayesian Personalized Ranking via Multiple C...
Linking Content Information with Bayesian Personalized Ranking via Multiple C...Ladislav Peska
 
Towards Complex User Feedback and Presentation Context in Recommender Systems
Towards Complex User Feedback and Presentation Context in Recommender SystemsTowards Complex User Feedback and Presentation Context in Recommender Systems
Towards Complex User Feedback and Presentation Context in Recommender SystemsLadislav Peska
 
Using the Context of User Feedback in Recommender Systems
Using the Context of User Feedback in Recommender SystemsUsing the Context of User Feedback in Recommender Systems
Using the Context of User Feedback in Recommender SystemsLadislav Peska
 
How to Interpret Implicit User Feedback
How to Interpret Implicit User FeedbackHow to Interpret Implicit User Feedback
How to Interpret Implicit User FeedbackLadislav Peska
 
Using Implicit Preference Relations to Improve Content-based Recommendations,...
Using Implicit Preference Relations to Improve Content-based Recommendations,...Using Implicit Preference Relations to Improve Content-based Recommendations,...
Using Implicit Preference Relations to Improve Content-based Recommendations,...Ladislav Peska
 

More from Ladislav Peska (10)

Fuzzy D’Hondt’s Algorithm for On-line Recommendations Aggregation
Fuzzy D’Hondt’s Algorithm for On-line Recommendations AggregationFuzzy D’Hondt’s Algorithm for On-line Recommendations Aggregation
Fuzzy D’Hondt’s Algorithm for On-line Recommendations Aggregation
 
LineIT: Similarity search and recommendations for photo lineup assembling
LineIT: Similarity search and recommendations for photo lineup assemblingLineIT: Similarity search and recommendations for photo lineup assembling
LineIT: Similarity search and recommendations for photo lineup assembling
 
Towards Similarity Models in Police Photo Lineup Assembling Tasks
Towards Similarity Models in Police Photo Lineup Assembling TasksTowards Similarity Models in Police Photo Lineup Assembling Tasks
Towards Similarity Models in Police Photo Lineup Assembling Tasks
 
Off-line vs. On-line Evaluation of Recommender Systems in Small E-commerce
Off-line vs. On-line Evaluation of Recommender Systems in Small E-commerceOff-line vs. On-line Evaluation of Recommender Systems in Small E-commerce
Off-line vs. On-line Evaluation of Recommender Systems in Small E-commerce
 
Towards Recommender Systems for Police Photo Lineup
Towards Recommender Systems for Police Photo LineupTowards Recommender Systems for Police Photo Lineup
Towards Recommender Systems for Police Photo Lineup
 
Linking Content Information with Bayesian Personalized Ranking via Multiple C...
Linking Content Information with Bayesian Personalized Ranking via Multiple C...Linking Content Information with Bayesian Personalized Ranking via Multiple C...
Linking Content Information with Bayesian Personalized Ranking via Multiple C...
 
Towards Complex User Feedback and Presentation Context in Recommender Systems
Towards Complex User Feedback and Presentation Context in Recommender SystemsTowards Complex User Feedback and Presentation Context in Recommender Systems
Towards Complex User Feedback and Presentation Context in Recommender Systems
 
Using the Context of User Feedback in Recommender Systems
Using the Context of User Feedback in Recommender SystemsUsing the Context of User Feedback in Recommender Systems
Using the Context of User Feedback in Recommender Systems
 
How to Interpret Implicit User Feedback
How to Interpret Implicit User FeedbackHow to Interpret Implicit User Feedback
How to Interpret Implicit User Feedback
 
Using Implicit Preference Relations to Improve Content-based Recommendations,...
Using Implicit Preference Relations to Improve Content-based Recommendations,...Using Implicit Preference Relations to Improve Content-based Recommendations,...
Using Implicit Preference Relations to Improve Content-based Recommendations,...
 

Recently uploaded

PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyDrAnita Sharma
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 

Recently uploaded (20)

PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomology
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 

RecSys Challenge 2014, SemWexMFF group

  • 1. Hybrid Biased k-NN to Predict Movie Tweets Popularity Ladislav Peška Department of Software Engineering Charles University in Prague Malostranske namesti 25, Prague, Czech Republic Peter Vojtáš Department of Software Engineering Charles University in Prague Malostranske namesti 25, Prague, Czech Republic ABSTRACT In this paper we describe approach of our SemWexMFF group to the RecSys Challenge 2014. Target of the challenge was to predict level of user engagement on tweets generated automatically from IMDB. During experiments we have tested several state-of-the-art prediction techniques and proposed a variant of item based k-NN algorithm, which better reflects user engagement and nature of the movie domain content-based attributes. Our final solution (placed in the midfield of the challenge leader board) is an aggregation of several runs of this algorithm. OUR APPROACH Our approach follows two main hypothesis: 1. Engagement of similar objects should be similar 2. Engagement depends on neighborhood (friends and followers) of current user Similarity of objects is based on their content-based attributes: Numeric attributes: normalized distance 푠푖푚푥,푦 ,푚푎푥퐷푖푠푡 = max⁡ 0, 푚푎푥퐷푖푠푡 − 푥 − 푦 푚푎푥퐷푖푠푡 String attributes: relative Levenshtein distance 푠푖푚푥,푦 = 1 −⁡ 푙푒푣푒푛푠ℎ푡푒푖푛(푥, 푦) max⁡(푙푒푛푔ℎ푡 푥 , 푙푒푛푔ℎ푡(푦)) Nominal/Set attributes: Jaccard similarity 푠푖푚퐱,퐲 = | 퐱 ∩ 퐲 | | 퐱 ∪ 퐲 | Effect of user’s friends and followers was approximated by user bias. Movie attributes were queried from OMDB API: - average rating, number of awards, IMDB metascore - number of ratings - movie name, release date, genre, country, language, director, actors RESULTS HYBRID BIASED k-NN For tweet tID, its movie mID and fixed k, the algorithm first compute similarities to other movies and selects k most similar movies. Then for each tweet about the movie the predicted ranking 푟 is increased according to similarity 푠 , user engagement r and bias of the tweeting user. The bias of the current movie is added in the final 푟 prediction too. function HybridBiasedKNN(tID, mID , k){ 푟 = 0; /*compute similarity for all movies */ foreach(mID ϵ TrainSet){ S[mID ] = similarity(mID , mID ); } S = getKMostSimilar(S,k); /*get all tweets about movies in S */ foreach({uID, mID, r, 푠 }: {uID, mID, r} ϵ TrainSet && S [mID]= 푠 ){ 푟 += 푠 * r / (bias(uID) + ε ); } 푟 = bias(mID ) + (푟 / sum(푠 )) return 푟 ; } Braveheart TID: 421065455743541248 UID: 25813709 The Patriot 8.4 7.1 1995 2000 68 63 Action, Biography, Drama Action, Drama, War Mel Gibson; James Robinson; Sean Lawlor; Sandy Nelson; James Cosmo Mel Gibson; Heath Ledger; Joely Richardson; Jason Isaacs Rating Year IMDB metascore Genre Actors TID: 410808483345465344 UID: 307867510 Engagement: 0 TID: 421040870931320833 UID: 296041028 Engagement: 3 … AVG Eng.: 0.0 AVG Eng.: 0.3250 AVG Eng.: 0.024 AVG Eng.: 0.001 Results of state-of-the-art methods Method nDCG Random predictions 0.7482 Bi-Polar Slope One 0.7652 Factor Wise Matrix Factorization 0.7556 Item-Item k-NN 0.7604 Decision Tree 0.7494 Support Vector Machines (SVM) 0.8057 Results of Hybrid k-NN using only one attribute Method nDCG Method nDCG AVG rating 0.7918 Genres 0.7919 Awards 0.7652 Countries 0.7984 IMDB Metascore 0.8057 Languages 0.8005 Number of ratings 0.7964 Director 0.8029 Movie name 0.7947 Actors 0.7930 Release year 0.7962 Results of Hybrid k-NN combining more attributes Method nDCG Hybrid k-nn (Metascore, Language, Director, Country, Date, # of ratings) 0.7927 Hybrid k-nn(Metascore, Language, Director, Country, Date, # of ratings), no bias 0.7792 Linear Regression (Metascore, Language, Director, Country, Date, # of ratings) 0.7913 AVG (Metascore, Language, Director, Country, Date, # of ratings), omit best and worst prediction 0.8134 LESSONS LEARNED and POSSIBLE EXTENSIONS Both hypothesis on which we based our solution seems to be confirmed. Omitting user bias lead to severe decrease of algorithm success metrics and almost all content-based attributes proved to be quite good measure of movie similarity. Hybrid biased k-NN outperformed all considered state-of-the-art machine learning methods, however our results were placed in lower midfield of the challenge. Several extensions to the current approach is possible, namely: -Considering temporal dependance in the dataset -Using some of the tweet characteristics, additional content-based attributes e.g. from DBPedia or some other meta-learning methods