SlideShare a Scribd company logo
1 of 17
Download to read offline
Ðàçðàáîòêà äàííûõ è ìàøèííîå îáó÷åíèå
Ââåäåíèå â ðåêîìåíäàòåëüíûå ñèñòåìû
Èãíàòîâ Äìèòðèé Èãîðåâè÷
♦
¾Big Data Startup Accelerator Program¿  ðàçâèòèå êîìïåòåíòíîñòåé â ñîçäàíèè
èííîâàöèîííûõ ïðîäóêòîâ è áèçíåñîâ â ñôåðå Áîëüøèõ Äàííûõ
Ñîâìåñòíàÿ èíèöèàòèâà êîðïîðàöèè SAP è innovationStudio MSU FE
♦
ÍÈÓ ÂØÝ
Ôàêóëüòåò êîìïüþòåðíûõ íàóê
Äåïàðòàìåíò àíàëèçà äàííûõ è èñêóññòâåííîãî èíòåëëåêòà
07 ìàðòà 2015
(SAP  innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 1 / 17
Ïëàí çàíÿòèÿ
Íà ýòîé âñòðå÷å:
1 Ââåäåíèå â ðåêîìåíäàòåëüíûå ñèñòåìû (PC). Ïîäõîäû íà îñíîâå
ñõîäñòâà ïî ïîëüçîâàòåëÿìè è íà îñíîâå ñõîäñòâà ïî îáúåêòàì
ðåêîìåíäàöèè. Îöåíêà êà÷åñòâà ðåêîìåíäàòåëüíûõ ñèñòåì (Òî÷íîñòü,
ïîëíîòà, F-ìåðà. Áèìîäàëüíàÿ êðîññâàëèäàöèÿ).
2 Áóëåâà ìàòðè÷íàÿ ôàêòîðèçàöèÿ è ñèíãóëÿðíîå ðàçëîæåíèå (SVD).
Ñðàâíåíèå ðåçóëüòàòîâ íà îñíîâå ìåòîäà áëèæàéøåãî ñîñåäà ïî MAE íà
ïðèìåðå ðåêîìåíäàöèè ôèëüìîâ.
3 Ãèáðèäíûå ðåêîìåíäàöèè íà ïðèìåðå ñåðâèñà îíëàéí-ðàäèîñòàíöèé.
Äîïîëíèòåëüíûå òåìû.
1 Ìîäåëè SVD, SVD++ è time-SVD. Ãðàäèåíòíûé ñïóñê.
2 Êîíòåêñòíûå ðåêîìåíäàöèè.
3 Ñâîáîäíî-ðàñïðîñòðàíÿåìûå áèáëèîòåêè ÐÑ.
4 Ïðèìåð PC äëÿ êðàóäñîðñèíãà íà îñíîâå áèêëàñòåðèçàöèè.
5 ÐÑ íà îñíîâå áèêëàñòåðèçàöèè è àññîöèàòèâíûõ ïðàâèë.
6 Ñèñòåìû ñîâìåñòíîãî ïîëüçîâàíèÿ ðåñóðñàìè. ÐÑ äëÿ ôîëêñîíîìèé.
(SAP  innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 2 / 17
Îãëàâëåíèå
1 Case-study
Ââåäåíèå. User-based è item-based ïîäõîäû. Îöåíêà êà÷åñòâà.
BMF  SVD
Ðåêîìåíäàöèÿ ðàäèîñòàíöèé
2 Ïîëåçíûå ññûëêè
×òî ïî÷èòàòü?
Ñîîáùåñòâî, êîíôåðåíöèè, êóðñû
Ïðèìåðû êîìïàíèé
Èñòî÷íèêè äàííûõ
Ñâîáîäíî-ðàñïðîñòðàíÿåìûå ÐÑ
3 Äîïîëíèòåëüíûå òåìû
Ìîäåëü íà îñíîâå èäåè SVD. Ãðàäèåíòíûé ñïóñê.
Êîíòåêñòíûå ðåêîìåíäàöèè
(SAP  innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 3 / 17
Case-study 1: Ââåäåíèå
Ââåäåíèå â ðåêîìåíäàòåëüíûå ñèñòåìû (PC). Ïîäõîäû íà îñíîâå
ñõîäñòâà ïî ïîëüçîâàòåëÿìè è íà îñíîâå ñõîäñòâà ïî îáúåêòàì
ðåêîìåíäàöèè. Îöåíêà êà÷åñòâà ðåêîìåíäàòåëüíûõ ñèñòåì (Òî÷íîñòü,
ïîëíîòà, F-ìåðà. Áèìîäàëüíàÿ êðîññâàëèäàöèÿ).
Dmitry I. Ignatov, Jonas Poelmans, Guido Dedene, Stijn Viaene: A New
Cross-Validation Technique to Evaluate Quality of Recommender Systems.
PerMIn 2012: 195-202.
Èãíàòîâ Ä. È., Êàìèíñêàÿ À. Þ., Ìàãèçîâ Ð. À. Ìåòîä ñêîëüçÿùåãî
êîíòðîëÿ äëÿ îöåíêè êà÷åñòâà ðåêîìåíäàòåëüíûõ èíòåðíåò-ñåðâèñîâ //
ÊÈÈ-2010, Ò. 1. Ì. : Ôèçìàòëèò, 2010. Ñ. 175-182.
Ñëàéäû ïî çàïðîñó.
(SAP  innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 4 / 17
Case-study 2: BMF  SVD
Áóëåâà ìàòðè÷íàÿ ôàêòîðèçàöèÿ è ñèíãóëÿðíîå ðàçëîæåíèå (SVD).
Ñðàâíåíèå ðåçóëüòàòîâ íà îñíîâå ìåòîäà áëèæàéøåãî ñîñåäà ïî MAE íà
ïðèìåðå ðåêîìåíäàöèè ôèëüìîâ.
Dmitry I. Ignatov, Elena Nenova, Natalia Konstantinova, Andrey V.
Konstantinov: Boolean Matrix Factorisation for Collaborative Filtering: An
FCA-Based Approach. AIMSA 2014: 47-58
Elena Nenova, Dmitry I. Ignatov, Andrey V. Konstantinov: An FCA-based
Boolean Matrix Factorisation for Collaborative Filtering. FCA Meets IR 2013:
P. 57-73
Ñëàéäû
(SAP  innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 5 / 17
Case-study 3: Ðåêîìåíäàöèÿ ðàäèîñòàíöèé
Ãèáðèäíûå ðåêîìåíäàöèè íà ïðèìåðå ñåðâèñà îíëàéí-ðàäèîñòàíöèé.
Dmitry I. Ignatov, Sergey I. Nikolenko, Taimuraz Abaev, Natalia
Konstantinova: Online Recommender System for Radio Station Hosting:
Experimental Results Revisited. WI-IAT (2) 2014: 229-236
Dmitry I. Ignatov, Andrey V. Konstantinov, Sergey I. Nikolenko, Jonas
Poelmans, Vasily Zaharchuk: Online Recommender System for Radio Station
Hosting. BIR 2012: 1-12
Ñëàéäû
(SAP  innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 6 / 17
Îãëàâëåíèå
1 Case-study
Ââåäåíèå. User-based è item-based ïîäõîäû. Îöåíêà êà÷åñòâà.
BMF  SVD
Ðåêîìåíäàöèÿ ðàäèîñòàíöèé
2 Ïîëåçíûå ññûëêè
×òî ïî÷èòàòü?
Ñîîáùåñòâî, êîíôåðåíöèè, êóðñû
Ïðèìåðû êîìïàíèé
Èñòî÷íèêè äàííûõ
Ñâîáîäíî-ðàñïðîñòðàíÿåìûå ÐÑ
3 Äîïîëíèòåëüíûå òåìû
Ìîäåëü íà îñíîâå èäåè SVD. Ãðàäèåíòíûé ñïóñê.
Êîíòåêñòíûå ðåêîìåíäàöèè
(SAP  innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 7 / 17
×òî ïî÷èòàòü?
Recommender Systems Handbook, 2011  First comprehensive handbook
dedicated entirely to the eld of recommender systems.
O.Celma. Music Recommendation and Discovery, 2010   . . . this book
presents the state of the art of all the dierent techniques used to recommend
items, focusing on the music domain as the underlying application.
Òîáè Ñåãàðàí. Ïðîãðàììèðóåì êîëëåêòèâíûé ðàçóì. Ãëàâà 2, 2012(2008)
Greg Linden et al. Amazon.com Recommendations: Item-to-Item
Collaborative Filtering. Industry Report. 2003
Y. Koren, Matrix Factorization Techniques for Recommender Systems, IEEE
Computer, 2009
Ââîäíûå ñòàòüè Ñåðãåÿ Íèêîëåíêî íà Habrahabr
. . .
(SAP  innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 8 / 17
Ñîîáùåñòâî, êîíôåðåíöèè, êóðñû
Ñåðèÿ êîíôåðåíöèé RecSys
RecSys Wiki
International Society for Music Information Retrieval (ISMIR)
J.A. Konstan, M.D. Ekstrand. Introduction to Recommender Systems.
Coursera
Ñ.Íèêîëåíêî. Ðåêîìåíäàòåëüíûå ñèñòåìû. Ëåêöèè ïî ìàøèííîìó
îáó÷åíèþ â ÊÔÓ, 2014
Ê.Â. Âîðîíöîâ Êîëëàáîðàòèâíàÿ ôèëüòðàöèÿ. Êóðñ ëåêöèé ïî ìàøèííîå
îáó÷åíèþ
. . .
(SAP  innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 9 / 17
Ïðèìåðû êîìïàíèé
ÈÌÕÎÍÅÒ  ðåêîìåíäàòåëüíûé ñåðâèñ
Retail Rocket  ïëàòôîðìà äëÿ ìóëüòèêàíàëüíîé ïåðñîíàëèçàöèè
èíòåðíåò-ìàãàçèíà íà îñíîâå big data, ñîçäàííàÿ ðàçðàáîò÷èêàìè
ðåêîìåíäàòåëüíûõ ñèñòåì Ozon.ru è Wikimart.ru 
Surngbird  ðåêîìåíäàòåëüíàÿ ñèñòåìà âåá-ñòðàíèö
Gravity  Rock Solid Recommendations is a fast-growing company that
helps customers with its state-of-the-art recommendation solutions.
. . .
(SAP  innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 10 / 17
Èñòî÷íèêè äàííûõ
RecSys Challenge 2015
Group Lens Datasets
ECML/PKDD Discovery Challenge 2011
BibSonomy :: dumps for research purposes
Million Song Dataset Challenge
Job Recommendation Challenge at Kaggle
. . .
(SAP  innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 11 / 17
Ñâîáîäíî-ðàñïðîñòðàíÿåìûå ÐÑ
MyMediaLite
Easyrec
Python-recsys
LibRec
LensKit
MRec
SVDFeature
. . .
(SAP  innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 12 / 17
Îãëàâëåíèå
1 Case-study
Ââåäåíèå. User-based è item-based ïîäõîäû. Îöåíêà êà÷åñòâà.
BMF  SVD
Ðåêîìåíäàöèÿ ðàäèîñòàíöèé
2 Ïîëåçíûå ññûëêè
×òî ïî÷èòàòü?
Ñîîáùåñòâî, êîíôåðåíöèè, êóðñû
Ïðèìåðû êîìïàíèé
Èñòî÷íèêè äàííûõ
Ñâîáîäíî-ðàñïðîñòðàíÿåìûå ÐÑ
3 Äîïîëíèòåëüíûå òåìû
Ìîäåëü íà îñíîâå èäåè SVD. Ãðàäèåíòíûé ñïóñê.
Êîíòåêñòíûå ðåêîìåíäàöèè
(SAP  innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 13 / 17
Ìîäåëü íà îñíîâå èäåè SVD. Ãðàäèåíòíûé ñïóñê.
Yehuda Koren, Robert M. Bell: Advances in Collaborative Filtering. Recommender
Systems Handbook 2011: 145-186
Áàçîâàÿ ìîäåëü:
rui = µ + bi + bu + qT
i pu
Öåëåâàÿ ôóíêöèÿ:
min
b,q,p
=
(u,i)∈R
(rui − µ − bi − bu − qT
i pu)2
+ λ(b2
i + b2
u + ||qi ||2
+ ||pu||2
), ãäå
R = {(u, i)| îöåíêà rui îïðåäåëåíà}
Ìåòîä ãðàäèåíòíîãî ñïóñêà:
eui = rui − ˆrui
bu ← bu + γ · (eui − λ · bu)
bi ← bi + γ · (eui − λ · bi )
qi ← qi + γ · (eui · pu − λ · qi )
pu ← pu + γ · (eui · qi − λ · pu)
(SAP  innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 14 / 17
Êîíòåêñòíûå ðåêîìåíäàöèè
Gediminas Adomavicius, Alexander Tuzhilin: Context-Aware Recommender
Systems. Recommender Systems Handbook 2011: 217-253
Ìíîãîìåðíàÿ ìîäåëü User × Item × Time ðåêîìåíäàòåëüíîãî ïðîñòðàíñòâà
(êîíòåêñòíàÿ èíôîðìàöèÿ  âðåìÿ)
(SAP  innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 15 / 17
Êîíòåêñòíûå ðåêîìåíäàöèè
G. Adomavicius, A. Tuzhilin, 2011(2005)
Ñïîñîáû âñòðàèâàíèÿ êîíòåêñòà â ïðîñòðàíñòâî ðåêîìåíäàöèé
(SAP  innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 16 / 17
Âîïðîñû è êîíòàêòû
www.hse.ru/staff/dima
Ñïàñèáî!
dmitrii.ignatov[at]gmail.com
dignatov[at]hse.ru
(SAP  innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 17 / 17

More Related Content

Viewers also liked

On the Family of Concept Forming Operators in Polyadic FCA
On the Family of Concept Forming Operators in Polyadic FCAOn the Family of Concept Forming Operators in Polyadic FCA
On the Family of Concept Forming Operators in Polyadic FCADmitrii Ignatov
 
Поиск частых множеств признаков (товаров) и ассоциативные правила
Поиск частых множеств признаков (товаров) и ассоциативные правилаПоиск частых множеств признаков (товаров) и ассоциативные правила
Поиск частых множеств признаков (товаров) и ассоциативные правилаDmitrii Ignatov
 
A lattice-based consensus clustering
A lattice-based consensus clusteringA lattice-based consensus clustering
A lattice-based consensus clusteringDmitrii Ignatov
 
Experimental Economics and Machine Learning workshop
Experimental Economics and Machine Learning workshopExperimental Economics and Machine Learning workshop
Experimental Economics and Machine Learning workshopDmitrii Ignatov
 
A One-Pass Triclustering Approach: Is There any Room for Big Data?
A One-Pass Triclustering Approach: Is There any Room for Big Data?A One-Pass Triclustering Approach: Is There any Room for Big Data?
A One-Pass Triclustering Approach: Is There any Room for Big Data?Dmitrii Ignatov
 
AIST 2016 Opening Slides
AIST 2016 Opening SlidesAIST 2016 Opening Slides
AIST 2016 Opening SlidesDmitrii Ignatov
 
Putting OAC-triclustering on MapReduce
Putting OAC-triclustering on MapReducePutting OAC-triclustering on MapReduce
Putting OAC-triclustering on MapReduceDmitrii Ignatov
 
A lattice-based consensus clustering
A lattice-based consensus clusteringA lattice-based consensus clustering
A lattice-based consensus clusteringDmitrii Ignatov
 
Boolean matrix factorisation for collaborative filtering
Boolean matrix factorisation for collaborative filteringBoolean matrix factorisation for collaborative filtering
Boolean matrix factorisation for collaborative filteringDmitrii Ignatov
 
NIPS 2016, Tensor-Learn@NIPS, and IEEE ICDM 2016
NIPS 2016, Tensor-Learn@NIPS, and IEEE ICDM 2016NIPS 2016, Tensor-Learn@NIPS, and IEEE ICDM 2016
NIPS 2016, Tensor-Learn@NIPS, and IEEE ICDM 2016Dmitrii Ignatov
 
Intro to Data Mining and Machine Learning
Intro to Data Mining and Machine LearningIntro to Data Mining and Machine Learning
Intro to Data Mining and Machine LearningDmitrii Ignatov
 
Pattern-based classification of demographic sequences
Pattern-based classification of demographic sequencesPattern-based classification of demographic sequences
Pattern-based classification of demographic sequencesDmitrii Ignatov
 
Context-Aware Recommender System Based on Boolean Matrix Factorisation
Context-Aware Recommender System Based on Boolean Matrix FactorisationContext-Aware Recommender System Based on Boolean Matrix Factorisation
Context-Aware Recommender System Based on Boolean Matrix FactorisationDmitrii Ignatov
 
RAPS: A Recommender Algorithm Based on Pattern Structures
RAPS: A Recommender Algorithm Based on Pattern StructuresRAPS: A Recommender Algorithm Based on Pattern Structures
RAPS: A Recommender Algorithm Based on Pattern StructuresDmitrii Ignatov
 
Pattern Mining and Machine Learning for Demographic Sequences
Pattern Mining and Machine Learning for Demographic SequencesPattern Mining and Machine Learning for Demographic Sequences
Pattern Mining and Machine Learning for Demographic SequencesDmitrii Ignatov
 
Online Recommender System for Radio Station Hosting: Experimental Results Rev...
Online Recommender System for Radio Station Hosting: Experimental Results Rev...Online Recommender System for Radio Station Hosting: Experimental Results Rev...
Online Recommender System for Radio Station Hosting: Experimental Results Rev...Dmitrii Ignatov
 
Searching for optimal patterns in Boolean tensors
Searching for optimal patterns in Boolean tensorsSearching for optimal patterns in Boolean tensors
Searching for optimal patterns in Boolean tensorsDmitrii Ignatov
 

Viewers also liked (18)

On the Family of Concept Forming Operators in Polyadic FCA
On the Family of Concept Forming Operators in Polyadic FCAOn the Family of Concept Forming Operators in Polyadic FCA
On the Family of Concept Forming Operators in Polyadic FCA
 
Поиск частых множеств признаков (товаров) и ассоциативные правила
Поиск частых множеств признаков (товаров) и ассоциативные правилаПоиск частых множеств признаков (товаров) и ассоциативные правила
Поиск частых множеств признаков (товаров) и ассоциативные правила
 
A lattice-based consensus clustering
A lattice-based consensus clusteringA lattice-based consensus clustering
A lattice-based consensus clustering
 
Experimental Economics and Machine Learning workshop
Experimental Economics and Machine Learning workshopExperimental Economics and Machine Learning workshop
Experimental Economics and Machine Learning workshop
 
A One-Pass Triclustering Approach: Is There any Room for Big Data?
A One-Pass Triclustering Approach: Is There any Room for Big Data?A One-Pass Triclustering Approach: Is There any Room for Big Data?
A One-Pass Triclustering Approach: Is There any Room for Big Data?
 
AIST 2016 Opening Slides
AIST 2016 Opening SlidesAIST 2016 Opening Slides
AIST 2016 Opening Slides
 
Putting OAC-triclustering on MapReduce
Putting OAC-triclustering on MapReducePutting OAC-triclustering on MapReduce
Putting OAC-triclustering on MapReduce
 
A lattice-based consensus clustering
A lattice-based consensus clusteringA lattice-based consensus clustering
A lattice-based consensus clustering
 
Boolean matrix factorisation for collaborative filtering
Boolean matrix factorisation for collaborative filteringBoolean matrix factorisation for collaborative filtering
Boolean matrix factorisation for collaborative filtering
 
NIPS 2016, Tensor-Learn@NIPS, and IEEE ICDM 2016
NIPS 2016, Tensor-Learn@NIPS, and IEEE ICDM 2016NIPS 2016, Tensor-Learn@NIPS, and IEEE ICDM 2016
NIPS 2016, Tensor-Learn@NIPS, and IEEE ICDM 2016
 
Intro to Data Mining and Machine Learning
Intro to Data Mining and Machine LearningIntro to Data Mining and Machine Learning
Intro to Data Mining and Machine Learning
 
Pattern-based classification of demographic sequences
Pattern-based classification of demographic sequencesPattern-based classification of demographic sequences
Pattern-based classification of demographic sequences
 
Sequence mining
Sequence miningSequence mining
Sequence mining
 
Context-Aware Recommender System Based on Boolean Matrix Factorisation
Context-Aware Recommender System Based on Boolean Matrix FactorisationContext-Aware Recommender System Based on Boolean Matrix Factorisation
Context-Aware Recommender System Based on Boolean Matrix Factorisation
 
RAPS: A Recommender Algorithm Based on Pattern Structures
RAPS: A Recommender Algorithm Based on Pattern StructuresRAPS: A Recommender Algorithm Based on Pattern Structures
RAPS: A Recommender Algorithm Based on Pattern Structures
 
Pattern Mining and Machine Learning for Demographic Sequences
Pattern Mining and Machine Learning for Demographic SequencesPattern Mining and Machine Learning for Demographic Sequences
Pattern Mining and Machine Learning for Demographic Sequences
 
Online Recommender System for Radio Station Hosting: Experimental Results Rev...
Online Recommender System for Radio Station Hosting: Experimental Results Rev...Online Recommender System for Radio Station Hosting: Experimental Results Rev...
Online Recommender System for Radio Station Hosting: Experimental Results Rev...
 
Searching for optimal patterns in Boolean tensors
Searching for optimal patterns in Boolean tensorsSearching for optimal patterns in Boolean tensors
Searching for optimal patterns in Boolean tensors
 

More from Dmitrii Ignatov

Interpretable Concept-Based Classification with Shapley Values
Interpretable Concept-Based Classification with Shapley ValuesInterpretable Concept-Based Classification with Shapley Values
Interpretable Concept-Based Classification with Shapley ValuesDmitrii Ignatov
 
AIST2019 – opening slides
AIST2019 – opening slidesAIST2019 – opening slides
AIST2019 – opening slidesDmitrii Ignatov
 
Turning Krimp into a Triclustering Technique on Sets of Attribute-Condition P...
Turning Krimp into a Triclustering Technique on Sets of Attribute-Condition P...Turning Krimp into a Triclustering Technique on Sets of Attribute-Condition P...
Turning Krimp into a Triclustering Technique on Sets of Attribute-Condition P...Dmitrii Ignatov
 
Personal Experiences of Publishing with Springer from both Editor and Author ...
Personal Experiences of Publishing with Springer from both Editor and Author ...Personal Experiences of Publishing with Springer from both Editor and Author ...
Personal Experiences of Publishing with Springer from both Editor and Author ...Dmitrii Ignatov
 
Social Learning in Networks: Extraction Deterministic Rules
Social Learning in Networks: Extraction Deterministic RulesSocial Learning in Networks: Extraction Deterministic Rules
Social Learning in Networks: Extraction Deterministic RulesDmitrii Ignatov
 
Orpailleur -- triclustering talk
Orpailleur -- triclustering talkOrpailleur -- triclustering talk
Orpailleur -- triclustering talkDmitrii Ignatov
 
CoClus ICDM Workshop talk
CoClus ICDM Workshop talkCoClus ICDM Workshop talk
CoClus ICDM Workshop talkDmitrii Ignatov
 
Radio recommender system for FMHost
Radio recommender system for FMHostRadio recommender system for FMHost
Radio recommender system for FMHostDmitrii Ignatov
 

More from Dmitrii Ignatov (11)

Interpretable Concept-Based Classification with Shapley Values
Interpretable Concept-Based Classification with Shapley ValuesInterpretable Concept-Based Classification with Shapley Values
Interpretable Concept-Based Classification with Shapley Values
 
AIST2019 – opening slides
AIST2019 – opening slidesAIST2019 – opening slides
AIST2019 – opening slides
 
Turning Krimp into a Triclustering Technique on Sets of Attribute-Condition P...
Turning Krimp into a Triclustering Technique on Sets of Attribute-Condition P...Turning Krimp into a Triclustering Technique on Sets of Attribute-Condition P...
Turning Krimp into a Triclustering Technique on Sets of Attribute-Condition P...
 
Personal Experiences of Publishing with Springer from both Editor and Author ...
Personal Experiences of Publishing with Springer from both Editor and Author ...Personal Experiences of Publishing with Springer from both Editor and Author ...
Personal Experiences of Publishing with Springer from both Editor and Author ...
 
Aist2014
Aist2014Aist2014
Aist2014
 
Social Learning in Networks: Extraction Deterministic Rules
Social Learning in Networks: Extraction Deterministic RulesSocial Learning in Networks: Extraction Deterministic Rules
Social Learning in Networks: Extraction Deterministic Rules
 
Orpailleur -- triclustering talk
Orpailleur -- triclustering talkOrpailleur -- triclustering talk
Orpailleur -- triclustering talk
 
CoClus ICDM Workshop talk
CoClus ICDM Workshop talkCoClus ICDM Workshop talk
CoClus ICDM Workshop talk
 
Pseudo-triclustering
Pseudo-triclusteringPseudo-triclustering
Pseudo-triclustering
 
Radio recommender system for FMHost
Radio recommender system for FMHostRadio recommender system for FMHost
Radio recommender system for FMHost
 
CrowDM system
CrowDM systemCrowDM system
CrowDM system
 

Введение в рекомендательные системы. 3 case-study без NetFlix.

  • 1. Ðàçðàáîòêà äàííûõ è ìàøèííîå îáó÷åíèå Ââåäåíèå â ðåêîìåíäàòåëüíûå ñèñòåìû Èãíàòîâ Äìèòðèé Èãîðåâè÷ ♦ ¾Big Data Startup Accelerator Program¿ ðàçâèòèå êîìïåòåíòíîñòåé â ñîçäàíèè èííîâàöèîííûõ ïðîäóêòîâ è áèçíåñîâ â ñôåðå Áîëüøèõ Äàííûõ Ñîâìåñòíàÿ èíèöèàòèâà êîðïîðàöèè SAP è innovationStudio MSU FE ♦ ÍÈÓ ÂØÝ Ôàêóëüòåò êîìïüþòåðíûõ íàóê Äåïàðòàìåíò àíàëèçà äàííûõ è èñêóññòâåííîãî èíòåëëåêòà 07 ìàðòà 2015 (SAP innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 1 / 17
  • 2. Ïëàí çàíÿòèÿ Íà ýòîé âñòðå÷å: 1 Ââåäåíèå â ðåêîìåíäàòåëüíûå ñèñòåìû (PC). Ïîäõîäû íà îñíîâå ñõîäñòâà ïî ïîëüçîâàòåëÿìè è íà îñíîâå ñõîäñòâà ïî îáúåêòàì ðåêîìåíäàöèè. Îöåíêà êà÷åñòâà ðåêîìåíäàòåëüíûõ ñèñòåì (Òî÷íîñòü, ïîëíîòà, F-ìåðà. Áèìîäàëüíàÿ êðîññâàëèäàöèÿ). 2 Áóëåâà ìàòðè÷íàÿ ôàêòîðèçàöèÿ è ñèíãóëÿðíîå ðàçëîæåíèå (SVD). Ñðàâíåíèå ðåçóëüòàòîâ íà îñíîâå ìåòîäà áëèæàéøåãî ñîñåäà ïî MAE íà ïðèìåðå ðåêîìåíäàöèè ôèëüìîâ. 3 Ãèáðèäíûå ðåêîìåíäàöèè íà ïðèìåðå ñåðâèñà îíëàéí-ðàäèîñòàíöèé. Äîïîëíèòåëüíûå òåìû. 1 Ìîäåëè SVD, SVD++ è time-SVD. Ãðàäèåíòíûé ñïóñê. 2 Êîíòåêñòíûå ðåêîìåíäàöèè. 3 Ñâîáîäíî-ðàñïðîñòðàíÿåìûå áèáëèîòåêè ÐÑ. 4 Ïðèìåð PC äëÿ êðàóäñîðñèíãà íà îñíîâå áèêëàñòåðèçàöèè. 5 ÐÑ íà îñíîâå áèêëàñòåðèçàöèè è àññîöèàòèâíûõ ïðàâèë. 6 Ñèñòåìû ñîâìåñòíîãî ïîëüçîâàíèÿ ðåñóðñàìè. ÐÑ äëÿ ôîëêñîíîìèé. (SAP innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 2 / 17
  • 3. Îãëàâëåíèå 1 Case-study Ââåäåíèå. User-based è item-based ïîäõîäû. Îöåíêà êà÷åñòâà. BMF SVD Ðåêîìåíäàöèÿ ðàäèîñòàíöèé 2 Ïîëåçíûå ññûëêè ×òî ïî÷èòàòü? Ñîîáùåñòâî, êîíôåðåíöèè, êóðñû Ïðèìåðû êîìïàíèé Èñòî÷íèêè äàííûõ Ñâîáîäíî-ðàñïðîñòðàíÿåìûå ÐÑ 3 Äîïîëíèòåëüíûå òåìû Ìîäåëü íà îñíîâå èäåè SVD. Ãðàäèåíòíûé ñïóñê. Êîíòåêñòíûå ðåêîìåíäàöèè (SAP innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 3 / 17
  • 4. Case-study 1: Ââåäåíèå Ââåäåíèå â ðåêîìåíäàòåëüíûå ñèñòåìû (PC). Ïîäõîäû íà îñíîâå ñõîäñòâà ïî ïîëüçîâàòåëÿìè è íà îñíîâå ñõîäñòâà ïî îáúåêòàì ðåêîìåíäàöèè. Îöåíêà êà÷åñòâà ðåêîìåíäàòåëüíûõ ñèñòåì (Òî÷íîñòü, ïîëíîòà, F-ìåðà. Áèìîäàëüíàÿ êðîññâàëèäàöèÿ). Dmitry I. Ignatov, Jonas Poelmans, Guido Dedene, Stijn Viaene: A New Cross-Validation Technique to Evaluate Quality of Recommender Systems. PerMIn 2012: 195-202. Èãíàòîâ Ä. È., Êàìèíñêàÿ À. Þ., Ìàãèçîâ Ð. À. Ìåòîä ñêîëüçÿùåãî êîíòðîëÿ äëÿ îöåíêè êà÷åñòâà ðåêîìåíäàòåëüíûõ èíòåðíåò-ñåðâèñîâ // ÊÈÈ-2010, Ò. 1. Ì. : Ôèçìàòëèò, 2010. Ñ. 175-182. Ñëàéäû ïî çàïðîñó. (SAP innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 4 / 17
  • 5. Case-study 2: BMF SVD Áóëåâà ìàòðè÷íàÿ ôàêòîðèçàöèÿ è ñèíãóëÿðíîå ðàçëîæåíèå (SVD). Ñðàâíåíèå ðåçóëüòàòîâ íà îñíîâå ìåòîäà áëèæàéøåãî ñîñåäà ïî MAE íà ïðèìåðå ðåêîìåíäàöèè ôèëüìîâ. Dmitry I. Ignatov, Elena Nenova, Natalia Konstantinova, Andrey V. Konstantinov: Boolean Matrix Factorisation for Collaborative Filtering: An FCA-Based Approach. AIMSA 2014: 47-58 Elena Nenova, Dmitry I. Ignatov, Andrey V. Konstantinov: An FCA-based Boolean Matrix Factorisation for Collaborative Filtering. FCA Meets IR 2013: P. 57-73 Ñëàéäû (SAP innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 5 / 17
  • 6. Case-study 3: Ðåêîìåíäàöèÿ ðàäèîñòàíöèé Ãèáðèäíûå ðåêîìåíäàöèè íà ïðèìåðå ñåðâèñà îíëàéí-ðàäèîñòàíöèé. Dmitry I. Ignatov, Sergey I. Nikolenko, Taimuraz Abaev, Natalia Konstantinova: Online Recommender System for Radio Station Hosting: Experimental Results Revisited. WI-IAT (2) 2014: 229-236 Dmitry I. Ignatov, Andrey V. Konstantinov, Sergey I. Nikolenko, Jonas Poelmans, Vasily Zaharchuk: Online Recommender System for Radio Station Hosting. BIR 2012: 1-12 Ñëàéäû (SAP innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 6 / 17
  • 7. Îãëàâëåíèå 1 Case-study Ââåäåíèå. User-based è item-based ïîäõîäû. Îöåíêà êà÷åñòâà. BMF SVD Ðåêîìåíäàöèÿ ðàäèîñòàíöèé 2 Ïîëåçíûå ññûëêè ×òî ïî÷èòàòü? Ñîîáùåñòâî, êîíôåðåíöèè, êóðñû Ïðèìåðû êîìïàíèé Èñòî÷íèêè äàííûõ Ñâîáîäíî-ðàñïðîñòðàíÿåìûå ÐÑ 3 Äîïîëíèòåëüíûå òåìû Ìîäåëü íà îñíîâå èäåè SVD. Ãðàäèåíòíûé ñïóñê. Êîíòåêñòíûå ðåêîìåíäàöèè (SAP innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 7 / 17
  • 8. ×òî ïî÷èòàòü? Recommender Systems Handbook, 2011 First comprehensive handbook dedicated entirely to the eld of recommender systems. O.Celma. Music Recommendation and Discovery, 2010 . . . this book presents the state of the art of all the dierent techniques used to recommend items, focusing on the music domain as the underlying application. Òîáè Ñåãàðàí. Ïðîãðàììèðóåì êîëëåêòèâíûé ðàçóì. Ãëàâà 2, 2012(2008) Greg Linden et al. Amazon.com Recommendations: Item-to-Item Collaborative Filtering. Industry Report. 2003 Y. Koren, Matrix Factorization Techniques for Recommender Systems, IEEE Computer, 2009 Ââîäíûå ñòàòüè Ñåðãåÿ Íèêîëåíêî íà Habrahabr . . . (SAP innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 8 / 17
  • 9. Ñîîáùåñòâî, êîíôåðåíöèè, êóðñû Ñåðèÿ êîíôåðåíöèé RecSys RecSys Wiki International Society for Music Information Retrieval (ISMIR) J.A. Konstan, M.D. Ekstrand. Introduction to Recommender Systems. Coursera Ñ.Íèêîëåíêî. Ðåêîìåíäàòåëüíûå ñèñòåìû. Ëåêöèè ïî ìàøèííîìó îáó÷åíèþ â ÊÔÓ, 2014 Ê.Â. Âîðîíöîâ Êîëëàáîðàòèâíàÿ ôèëüòðàöèÿ. Êóðñ ëåêöèé ïî ìàøèííîå îáó÷åíèþ . . . (SAP innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 9 / 17
  • 10. Ïðèìåðû êîìïàíèé ÈÌÕÎÍÅÒ ðåêîìåíäàòåëüíûé ñåðâèñ Retail Rocket ïëàòôîðìà äëÿ ìóëüòèêàíàëüíîé ïåðñîíàëèçàöèè èíòåðíåò-ìàãàçèíà íà îñíîâå big data, ñîçäàííàÿ ðàçðàáîò÷èêàìè ðåêîìåíäàòåëüíûõ ñèñòåì Ozon.ru è Wikimart.ru Surngbird ðåêîìåíäàòåëüíàÿ ñèñòåìà âåá-ñòðàíèö Gravity Rock Solid Recommendations is a fast-growing company that helps customers with its state-of-the-art recommendation solutions. . . . (SAP innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 10 / 17
  • 11. Èñòî÷íèêè äàííûõ RecSys Challenge 2015 Group Lens Datasets ECML/PKDD Discovery Challenge 2011 BibSonomy :: dumps for research purposes Million Song Dataset Challenge Job Recommendation Challenge at Kaggle . . . (SAP innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 11 / 17
  • 12. Ñâîáîäíî-ðàñïðîñòðàíÿåìûå ÐÑ MyMediaLite Easyrec Python-recsys LibRec LensKit MRec SVDFeature . . . (SAP innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 12 / 17
  • 13. Îãëàâëåíèå 1 Case-study Ââåäåíèå. User-based è item-based ïîäõîäû. Îöåíêà êà÷åñòâà. BMF SVD Ðåêîìåíäàöèÿ ðàäèîñòàíöèé 2 Ïîëåçíûå ññûëêè ×òî ïî÷èòàòü? Ñîîáùåñòâî, êîíôåðåíöèè, êóðñû Ïðèìåðû êîìïàíèé Èñòî÷íèêè äàííûõ Ñâîáîäíî-ðàñïðîñòðàíÿåìûå ÐÑ 3 Äîïîëíèòåëüíûå òåìû Ìîäåëü íà îñíîâå èäåè SVD. Ãðàäèåíòíûé ñïóñê. Êîíòåêñòíûå ðåêîìåíäàöèè (SAP innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 13 / 17
  • 14. Ìîäåëü íà îñíîâå èäåè SVD. Ãðàäèåíòíûé ñïóñê. Yehuda Koren, Robert M. Bell: Advances in Collaborative Filtering. Recommender Systems Handbook 2011: 145-186 Áàçîâàÿ ìîäåëü: rui = µ + bi + bu + qT i pu Öåëåâàÿ ôóíêöèÿ: min b,q,p = (u,i)∈R (rui − µ − bi − bu − qT i pu)2 + λ(b2 i + b2 u + ||qi ||2 + ||pu||2 ), ãäå R = {(u, i)| îöåíêà rui îïðåäåëåíà} Ìåòîä ãðàäèåíòíîãî ñïóñêà: eui = rui − ˆrui bu ← bu + γ · (eui − λ · bu) bi ← bi + γ · (eui − λ · bi ) qi ← qi + γ · (eui · pu − λ · qi ) pu ← pu + γ · (eui · qi − λ · pu) (SAP innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 14 / 17
  • 15. Êîíòåêñòíûå ðåêîìåíäàöèè Gediminas Adomavicius, Alexander Tuzhilin: Context-Aware Recommender Systems. Recommender Systems Handbook 2011: 217-253 Ìíîãîìåðíàÿ ìîäåëü User × Item × Time ðåêîìåíäàòåëüíîãî ïðîñòðàíñòâà (êîíòåêñòíàÿ èíôîðìàöèÿ âðåìÿ) (SAP innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 15 / 17
  • 16. Êîíòåêñòíûå ðåêîìåíäàöèè G. Adomavicius, A. Tuzhilin, 2011(2005) Ñïîñîáû âñòðàèâàíèÿ êîíòåêñòà â ïðîñòðàíñòâî ðåêîìåíäàöèé (SAP innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 16 / 17
  • 17. Âîïðîñû è êîíòàêòû www.hse.ru/staff/dima Ñïàñèáî! dmitrii.ignatov[at]gmail.com dignatov[at]hse.ru (SAP innovationStudio MSU FE) Big Data Startup Accelerator Program 07.03.2015 17 / 17