SlideShare a Scribd company logo
Seite 1
Page 1
The University of Innsbruck was founded in 1669 and is one of Austria’s oldest universities. Today, with over 28.000 students and 4.000 staff, it is
western Austria’s largest institution of higher education and research. For further information visit: www.uibk.ac.at.
#nowplaying on #Spotify: Leveraging Spotify
Information on Twitter for Artist Recommendations
Martin Pichl, Eva Zangerle and Günther Specht
Seite 2
Page 2
Agenda
• Why Music Recommendations?
• Dataset Creation & Recommendation Approach
• Discussion and Future Work
Seite 3
Page 3
Recent Trends
• Rise of the web enabled new distribution channels
• Online Stores
• Music Streaming Platforms
• …
• These new distribution channels
– Exploit a word-wide market
– Virtually no inventory costs
→ More and more dives music is available
Seite 4
Page 4
Why (Music) Recommender Systems?
• The user is confronted with more and more diverse music
– on streaming platforms
– in online stores
– on mobile devices
• and has a free choice
• Users often do not know what to listen to
→ Information Overload
• Recommender Systems
– Helps users finding music they like
→ Increase usability
Seite 5
Page 5
Research on Music Recommender System
• Publicly available data necessary
• Twitter
– People share what they are listening at the moment
• Get additional information from Spotify
– Additional listening events
– Additional information about the tracks and the artists
– Additional information about the listening context
• The additional information is necessary to build a more
specialized recommender system
Seite 6
Page 6
Example Tweets
Seite 7
Page 7
The Dataset
• Generated dataset based Tweets that contains
– <UserID, ArtistID, TrackID>-triples
– Boolean preferences (listened/not listened)
• Cleaning
– Removed duplicates
– Removed certain accounts i.e. @SpotifyNowPlaying
– Removed “Various Artists”
Seite 8
Page 8
Dataset Snapshot
• Dataset contains
– 513,489 listening events
– by 68,045 unique users
– listening to 97,586 unique tracks
– by unique 40,593 artists
• Distribution
– In average 4.77 tweets per user (SD= 30.02)
– Median of 2
Seite 9
Page 9
Artist Recommendations using this Dataset
• No content based information
– Recommendations are computed using collaborative filtering
• Collaborative Filtering (CF)
– CF recommends items that the most similar users of a user
listened to (and are new to the user)
• CF relies on
– A user similarity measure
– A number of nearest neighbors 𝑘
Seite 10
Page 10
User Similarity
• Boolean Preferences
– Jaccard Coefficient is suitable
– 𝐽𝑎𝑐𝑐𝑎𝑟𝑑𝑖,𝑗 =
𝑆 𝑖 ∩ 𝑆 𝑗
𝑆 𝑖 ∪ 𝑆 𝑗
• Include all the information available
– Compute Jaccard Coefficient using the artist listening history
– Compute Jaccard Coefficient using the track listening history
– Combined using an weighted average
• 𝑢𝑠𝑒𝑟𝑆𝑖𝑚 = 𝑤 𝑎 ∗ 𝑎𝑟𝑡𝑖𝑠𝑡𝑆𝑖𝑚 + 𝑤𝑡 ∗ 𝑡𝑟𝑎𝑐𝑘𝑆𝑖𝑚
Seite 11
Page 11
Parameter Tuning
• Input Parameters
– 𝑤 𝑎, 𝑤𝑡, 𝑘
– Optimized using a Genetic Algorithm (GA)
– Fitness = Precision of the recommender system
– In average a good solution was found after 4.14 iterations
(SD=2.27)
Seite 12
Page 12
Genetic Algorithm
• 𝑤 𝑎,𝑤𝑡, 𝑘 are float point genes between 0 and 1 and form a
individual
• Random initial distribution
• The fitness of each individual is measured using the
precision
• Crossover and mutations of the best individual
• Terminate if the precision is 1 or a certain number of
generations is reach
Seite 13
Page 13
The Big Picture
Seite 14
Page 14
Evaluation Setup
• Offline Evaluation
– From each user we removed 1/3 of the listening events for
testing
– Recommended 𝑝 ∗ 𝑆𝑖𝑧𝑒 𝑜𝑓 𝑡ℎ𝑒 𝑇𝑒𝑠𝑡𝑠𝑒𝑡 items
– Varied 𝑝 between 0 and 1
– Computed precision and recall for each 𝑝
• Parameters used for the Evaluation
– 𝑤 𝑎 = 0.21
– 𝑤𝑡 = 0.94
– 𝑘 = 59
Seite 15
Page 15
Evaluation Metrics
• Hit: Item found in the testset
• 𝑝𝑟𝑒𝑐𝑖𝑠𝑜𝑛 =
ℎ𝑖𝑡𝑠
𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑟𝑒𝑐𝑜𝑚𝑚𝑒𝑛𝑑𝑎𝑡𝑖𝑜𝑛𝑠
• Relevant Items: All items in the testset
• 𝑟𝑒𝑐𝑎𝑙𝑙 =
ℎ𝑖𝑡𝑠
𝑠𝑖𝑧𝑒 𝑜𝑓 𝑡ℎ𝑒 𝑡𝑒𝑠𝑡𝑠𝑒𝑡
Seite 16
Page 16
Performance of the optimized Recommender
System n Precision Recall
1 0.4656 0.0228
2 0.3622 0.0547
3 0.3137 0.0782
4 0.2812 0.1003
5 0.2531 0.1195
6 0.2315 0.1286
7 0.2170 0.1396
8 0.2170 0.1396
9 0.1871 0.1583
10 0.1871 0,1583
0
0,1
0,2
0,3
0,4
0,5
Precision/Recall
Number of Recommendations (% of the Testset)
Precision
Recall
Seite 17
Page 17
Discussion
• Heading into the right direction
• Performance is limited for a high number of
recommendations
– Data sparsity
– Too general approach
• Performance improvements with
– Reducing data sparsity
– Specialized algorithm that fits more to music
recommendation
Seite 18
Page 18
Next Steps towards a more specialized RS
• Match Spotify and Twitter Users
– Early experiments show that we can match ~ 10% of the
dataset
– Better matching than using the username and played tracks?
• Extract listening context from playlist names, i.e.
– Christmas
– Workout, training
– Driving
– …
Seite 19
Page 19
Next Steps towards a more specialized RS
• The offline evaluation is rather limited
• Create an intuitive webinterface
• Conduct a live user experiment
Seite 20
Page 20
Acknowledgments

More Related Content

Similar to (SoWeMine Workshop) "#nowplaying on #Spotify: Leveraging Spotify Information on Twitter for Artist Recommendations" - Martin Pichl, Eva Zangerle and Günther SpechtPresentation

A Unified Music Recommender System Using Listening Habits and Semantics of Tags
A Unified Music Recommender System Using Listening Habits and Semantics of TagsA Unified Music Recommender System Using Listening Habits and Semantics of Tags
A Unified Music Recommender System Using Listening Habits and Semantics of Tags
datasciencekorea
 
Data science-2013-heekim
Data science-2013-heekimData science-2013-heekim
Data science-2013-heekimHaklae Kim
 
Luis Aguiar: Platforms, Promotion, and Product Discovery: Evidence from Spoti...
Luis Aguiar: Platforms, Promotion, and Product Discovery: Evidence from Spoti...Luis Aguiar: Platforms, Promotion, and Product Discovery: Evidence from Spoti...
Luis Aguiar: Platforms, Promotion, and Product Discovery: Evidence from Spoti...
Marius Miron
 
Deezer - Big data as a streaming service
Deezer - Big data as a streaming serviceDeezer - Big data as a streaming service
Deezer - Big data as a streaming service
Julie Knibbe
 
IntroductionRecommenderSystems_Petroni.pdf
IntroductionRecommenderSystems_Petroni.pdfIntroductionRecommenderSystems_Petroni.pdf
IntroductionRecommenderSystems_Petroni.pdf
AlphaIssaghaDiallo
 
Algorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyAlgorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at Spotify
Chris Johnson
 
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
Timo van Niedek
 
Building Data Pipelines for Music Recommendations at Spotify
Building Data Pipelines for Music Recommendations at SpotifyBuilding Data Pipelines for Music Recommendations at Spotify
Building Data Pipelines for Music Recommendations at Spotify
Vidhya Murali
 
IBANK - Big data www.ibank.uk.com 07474222079
IBANK - Big data www.ibank.uk.com 07474222079IBANK - Big data www.ibank.uk.com 07474222079
IBANK - Big data www.ibank.uk.com 07474222079
ibankuk
 
Big data
Big dataBig data
Big data
Zeeshan Khan
 
Designing recommender system for your application
Designing  recommender system for  your applicationDesigning  recommender system for  your application
Designing recommender system for your application
孜羲 顏
 
Online Recommender System for Radio Station Hosting: Experimental Results Rev...
Online Recommender System for Radio Station Hosting: Experimental Results Rev...Online Recommender System for Radio Station Hosting: Experimental Results Rev...
Online Recommender System for Radio Station Hosting: Experimental Results Rev...
Dmitrii Ignatov
 
Recommendations 101
Recommendations 101 Recommendations 101
Recommendations 101
Esh Vckay
 
Sobers "At Risk of Losing Everything: Advocating for Audiovisual Materials in...
Sobers "At Risk of Losing Everything: Advocating for Audiovisual Materials in...Sobers "At Risk of Losing Everything: Advocating for Audiovisual Materials in...
Sobers "At Risk of Losing Everything: Advocating for Audiovisual Materials in...
National Information Standards Organization (NISO)
 
Thesis-Defense-YuhuiWang-small
Thesis-Defense-YuhuiWang-smallThesis-Defense-YuhuiWang-small
Thesis-Defense-YuhuiWang-smallYuhui Wang
 
Recommendation Systems Roadtrip
Recommendation Systems RoadtripRecommendation Systems Roadtrip
Recommendation Systems Roadtrip
The Real Dyl
 
EMOS 2018 Big Data methods and techniques
EMOS 2018 Big Data methods and techniquesEMOS 2018 Big Data methods and techniques
EMOS 2018 Big Data methods and techniques
Piet J.H. Daas
 
Big data
Big dataBig data
Big data
Big dataBig data
Big data
Harshit Namdev
 
Hadoop PDF
Hadoop PDFHadoop PDF
Hadoop PDF
1904saikrishna
 

Similar to (SoWeMine Workshop) "#nowplaying on #Spotify: Leveraging Spotify Information on Twitter for Artist Recommendations" - Martin Pichl, Eva Zangerle and Günther SpechtPresentation (20)

A Unified Music Recommender System Using Listening Habits and Semantics of Tags
A Unified Music Recommender System Using Listening Habits and Semantics of TagsA Unified Music Recommender System Using Listening Habits and Semantics of Tags
A Unified Music Recommender System Using Listening Habits and Semantics of Tags
 
Data science-2013-heekim
Data science-2013-heekimData science-2013-heekim
Data science-2013-heekim
 
Luis Aguiar: Platforms, Promotion, and Product Discovery: Evidence from Spoti...
Luis Aguiar: Platforms, Promotion, and Product Discovery: Evidence from Spoti...Luis Aguiar: Platforms, Promotion, and Product Discovery: Evidence from Spoti...
Luis Aguiar: Platforms, Promotion, and Product Discovery: Evidence from Spoti...
 
Deezer - Big data as a streaming service
Deezer - Big data as a streaming serviceDeezer - Big data as a streaming service
Deezer - Big data as a streaming service
 
IntroductionRecommenderSystems_Petroni.pdf
IntroductionRecommenderSystems_Petroni.pdfIntroductionRecommenderSystems_Petroni.pdf
IntroductionRecommenderSystems_Petroni.pdf
 
Algorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at SpotifyAlgorithmic Music Recommendations at Spotify
Algorithmic Music Recommendations at Spotify
 
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
 
Building Data Pipelines for Music Recommendations at Spotify
Building Data Pipelines for Music Recommendations at SpotifyBuilding Data Pipelines for Music Recommendations at Spotify
Building Data Pipelines for Music Recommendations at Spotify
 
IBANK - Big data www.ibank.uk.com 07474222079
IBANK - Big data www.ibank.uk.com 07474222079IBANK - Big data www.ibank.uk.com 07474222079
IBANK - Big data www.ibank.uk.com 07474222079
 
Big data
Big dataBig data
Big data
 
Designing recommender system for your application
Designing  recommender system for  your applicationDesigning  recommender system for  your application
Designing recommender system for your application
 
Online Recommender System for Radio Station Hosting: Experimental Results Rev...
Online Recommender System for Radio Station Hosting: Experimental Results Rev...Online Recommender System for Radio Station Hosting: Experimental Results Rev...
Online Recommender System for Radio Station Hosting: Experimental Results Rev...
 
Recommendations 101
Recommendations 101 Recommendations 101
Recommendations 101
 
Sobers "At Risk of Losing Everything: Advocating for Audiovisual Materials in...
Sobers "At Risk of Losing Everything: Advocating for Audiovisual Materials in...Sobers "At Risk of Losing Everything: Advocating for Audiovisual Materials in...
Sobers "At Risk of Losing Everything: Advocating for Audiovisual Materials in...
 
Thesis-Defense-YuhuiWang-small
Thesis-Defense-YuhuiWang-smallThesis-Defense-YuhuiWang-small
Thesis-Defense-YuhuiWang-small
 
Recommendation Systems Roadtrip
Recommendation Systems RoadtripRecommendation Systems Roadtrip
Recommendation Systems Roadtrip
 
EMOS 2018 Big Data methods and techniques
EMOS 2018 Big Data methods and techniquesEMOS 2018 Big Data methods and techniques
EMOS 2018 Big Data methods and techniques
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Hadoop PDF
Hadoop PDFHadoop PDF
Hadoop PDF
 

More from icwe2015

Mikkonen liquid-sw-icwe2015
Mikkonen liquid-sw-icwe2015Mikkonen liquid-sw-icwe2015
Mikkonen liquid-sw-icwe2015
icwe2015
 
(Web User Interfaces track) "Getting the Query Right: User Interface Design o...
(Web User Interfaces track) "Getting the Query Right: User Interface Design o...(Web User Interfaces track) "Getting the Query Right: User Interface Design o...
(Web User Interfaces track) "Getting the Query Right: User Interface Design o...
icwe2015
 
(Web Application Design track) "Two Factor Authentication Made Easy" - Alex Q...
(Web Application Design track) "Two Factor Authentication Made Easy" - Alex Q...(Web Application Design track) "Two Factor Authentication Made Easy" - Alex Q...
(Web Application Design track) "Two Factor Authentication Made Easy" - Alex Q...
icwe2015
 
(Semantic Web Technologies and Applications track) "MIRROR: Automatic R2RML M...
(Semantic Web Technologies and Applications track) "MIRROR: Automatic R2RML M...(Semantic Web Technologies and Applications track) "MIRROR: Automatic R2RML M...
(Semantic Web Technologies and Applications track) "MIRROR: Automatic R2RML M...
icwe2015
 
(Linked Data Development and Exploitation track) "YQL as a Platform for Linke...
(Linked Data Development and Exploitation track) "YQL as a Platform for Linke...(Linked Data Development and Exploitation track) "YQL as a Platform for Linke...
(Linked Data Development and Exploitation track) "YQL as a Platform for Linke...icwe2015
 
(Linked Data Interfaces and Querying track) "SUMMA: A Common API for Linked D...
(Linked Data Interfaces and Querying track) "SUMMA: A Common API for Linked D...(Linked Data Interfaces and Querying track) "SUMMA: A Common API for Linked D...
(Linked Data Interfaces and Querying track) "SUMMA: A Common API for Linked D...
icwe2015
 
(Linked Data Development and Exploitation track) "Generating the Semantic Sna...
(Linked Data Development and Exploitation track) "Generating the Semantic Sna...(Linked Data Development and Exploitation track) "Generating the Semantic Sna...
(Linked Data Development and Exploitation track) "Generating the Semantic Sna...icwe2015
 
(Industry track) "Interactive networks for digital cultural heritage collecti...
(Industry track) "Interactive networks for digital cultural heritage collecti...(Industry track) "Interactive networks for digital cultural heritage collecti...
(Industry track) "Interactive networks for digital cultural heritage collecti...icwe2015
 
(Mobile Web Applications track) "Mobile-IDM: A Design Method for Modeling the...
(Mobile Web Applications track) "Mobile-IDM: A Design Method for Modeling the...(Mobile Web Applications track) "Mobile-IDM: A Design Method for Modeling the...
(Mobile Web Applications track) "Mobile-IDM: A Design Method for Modeling the...icwe2015
 
(Linked Data Development and Exploitation track) "Curtains Up! Lights, Camera...
(Linked Data Development and Exploitation track) "Curtains Up! Lights, Camera...(Linked Data Development and Exploitation track) "Curtains Up! Lights, Camera...
(Linked Data Development and Exploitation track) "Curtains Up! Lights, Camera...
icwe2015
 
(Mobile Web Applications track) "Profiling User Activities with Minimal Traff...
(Mobile Web Applications track) "Profiling User Activities with Minimal Traff...(Mobile Web Applications track) "Profiling User Activities with Minimal Traff...
(Mobile Web Applications track) "Profiling User Activities with Minimal Traff...
icwe2015
 
(NLPIT Workshop) (Keynote) Nathan Schneider - “Hacking a Way Through the Twit...
(NLPIT Workshop) (Keynote) Nathan Schneider - “Hacking a Way Through the Twit...(NLPIT Workshop) (Keynote) Nathan Schneider - “Hacking a Way Through the Twit...
(NLPIT Workshop) (Keynote) Nathan Schneider - “Hacking a Way Through the Twit...icwe2015
 
(PEWET Workshop) (Keynote) Vincenzo De Florio - “Fractally-organized Connecti...
(PEWET Workshop) (Keynote) Vincenzo De Florio - “Fractally-organized Connecti...(PEWET Workshop) (Keynote) Vincenzo De Florio - “Fractally-organized Connecti...
(PEWET Workshop) (Keynote) Vincenzo De Florio - “Fractally-organized Connecti...icwe2015
 
(Web Application Design track) "Liquid Stream Processing across Web Browsers ...
(Web Application Design track) "Liquid Stream Processing across Web Browsers ...(Web Application Design track) "Liquid Stream Processing across Web Browsers ...
(Web Application Design track) "Liquid Stream Processing across Web Browsers ...icwe2015
 
(Web Composition and Mashups track) "REST Web Service Description for Graph-B...
(Web Composition and Mashups track) "REST Web Service Description for Graph-B...(Web Composition and Mashups track) "REST Web Service Description for Graph-B...
(Web Composition and Mashups track) "REST Web Service Description for Graph-B...icwe2015
 
(Semantic Web Technologies and Applications track) "A Quantitative Comparison...
(Semantic Web Technologies and Applications track) "A Quantitative Comparison...(Semantic Web Technologies and Applications track) "A Quantitative Comparison...
(Semantic Web Technologies and Applications track) "A Quantitative Comparison...icwe2015
 
(Keynote) Peter Mika - “Making the Web Searchable”
(Keynote) Peter Mika - “Making the Web Searchable”(Keynote) Peter Mika - “Making the Web Searchable”
(Keynote) Peter Mika - “Making the Web Searchable”icwe2015
 
(Keynote) Mike Thelwall - “Sentiment Strength Detection for Social Media Text...
(Keynote) Mike Thelwall - “Sentiment Strength Detection for Social Media Text...(Keynote) Mike Thelwall - “Sentiment Strength Detection for Social Media Text...
(Keynote) Mike Thelwall - “Sentiment Strength Detection for Social Media Text...icwe2015
 

More from icwe2015 (18)

Mikkonen liquid-sw-icwe2015
Mikkonen liquid-sw-icwe2015Mikkonen liquid-sw-icwe2015
Mikkonen liquid-sw-icwe2015
 
(Web User Interfaces track) "Getting the Query Right: User Interface Design o...
(Web User Interfaces track) "Getting the Query Right: User Interface Design o...(Web User Interfaces track) "Getting the Query Right: User Interface Design o...
(Web User Interfaces track) "Getting the Query Right: User Interface Design o...
 
(Web Application Design track) "Two Factor Authentication Made Easy" - Alex Q...
(Web Application Design track) "Two Factor Authentication Made Easy" - Alex Q...(Web Application Design track) "Two Factor Authentication Made Easy" - Alex Q...
(Web Application Design track) "Two Factor Authentication Made Easy" - Alex Q...
 
(Semantic Web Technologies and Applications track) "MIRROR: Automatic R2RML M...
(Semantic Web Technologies and Applications track) "MIRROR: Automatic R2RML M...(Semantic Web Technologies and Applications track) "MIRROR: Automatic R2RML M...
(Semantic Web Technologies and Applications track) "MIRROR: Automatic R2RML M...
 
(Linked Data Development and Exploitation track) "YQL as a Platform for Linke...
(Linked Data Development and Exploitation track) "YQL as a Platform for Linke...(Linked Data Development and Exploitation track) "YQL as a Platform for Linke...
(Linked Data Development and Exploitation track) "YQL as a Platform for Linke...
 
(Linked Data Interfaces and Querying track) "SUMMA: A Common API for Linked D...
(Linked Data Interfaces and Querying track) "SUMMA: A Common API for Linked D...(Linked Data Interfaces and Querying track) "SUMMA: A Common API for Linked D...
(Linked Data Interfaces and Querying track) "SUMMA: A Common API for Linked D...
 
(Linked Data Development and Exploitation track) "Generating the Semantic Sna...
(Linked Data Development and Exploitation track) "Generating the Semantic Sna...(Linked Data Development and Exploitation track) "Generating the Semantic Sna...
(Linked Data Development and Exploitation track) "Generating the Semantic Sna...
 
(Industry track) "Interactive networks for digital cultural heritage collecti...
(Industry track) "Interactive networks for digital cultural heritage collecti...(Industry track) "Interactive networks for digital cultural heritage collecti...
(Industry track) "Interactive networks for digital cultural heritage collecti...
 
(Mobile Web Applications track) "Mobile-IDM: A Design Method for Modeling the...
(Mobile Web Applications track) "Mobile-IDM: A Design Method for Modeling the...(Mobile Web Applications track) "Mobile-IDM: A Design Method for Modeling the...
(Mobile Web Applications track) "Mobile-IDM: A Design Method for Modeling the...
 
(Linked Data Development and Exploitation track) "Curtains Up! Lights, Camera...
(Linked Data Development and Exploitation track) "Curtains Up! Lights, Camera...(Linked Data Development and Exploitation track) "Curtains Up! Lights, Camera...
(Linked Data Development and Exploitation track) "Curtains Up! Lights, Camera...
 
(Mobile Web Applications track) "Profiling User Activities with Minimal Traff...
(Mobile Web Applications track) "Profiling User Activities with Minimal Traff...(Mobile Web Applications track) "Profiling User Activities with Minimal Traff...
(Mobile Web Applications track) "Profiling User Activities with Minimal Traff...
 
(NLPIT Workshop) (Keynote) Nathan Schneider - “Hacking a Way Through the Twit...
(NLPIT Workshop) (Keynote) Nathan Schneider - “Hacking a Way Through the Twit...(NLPIT Workshop) (Keynote) Nathan Schneider - “Hacking a Way Through the Twit...
(NLPIT Workshop) (Keynote) Nathan Schneider - “Hacking a Way Through the Twit...
 
(PEWET Workshop) (Keynote) Vincenzo De Florio - “Fractally-organized Connecti...
(PEWET Workshop) (Keynote) Vincenzo De Florio - “Fractally-organized Connecti...(PEWET Workshop) (Keynote) Vincenzo De Florio - “Fractally-organized Connecti...
(PEWET Workshop) (Keynote) Vincenzo De Florio - “Fractally-organized Connecti...
 
(Web Application Design track) "Liquid Stream Processing across Web Browsers ...
(Web Application Design track) "Liquid Stream Processing across Web Browsers ...(Web Application Design track) "Liquid Stream Processing across Web Browsers ...
(Web Application Design track) "Liquid Stream Processing across Web Browsers ...
 
(Web Composition and Mashups track) "REST Web Service Description for Graph-B...
(Web Composition and Mashups track) "REST Web Service Description for Graph-B...(Web Composition and Mashups track) "REST Web Service Description for Graph-B...
(Web Composition and Mashups track) "REST Web Service Description for Graph-B...
 
(Semantic Web Technologies and Applications track) "A Quantitative Comparison...
(Semantic Web Technologies and Applications track) "A Quantitative Comparison...(Semantic Web Technologies and Applications track) "A Quantitative Comparison...
(Semantic Web Technologies and Applications track) "A Quantitative Comparison...
 
(Keynote) Peter Mika - “Making the Web Searchable”
(Keynote) Peter Mika - “Making the Web Searchable”(Keynote) Peter Mika - “Making the Web Searchable”
(Keynote) Peter Mika - “Making the Web Searchable”
 
(Keynote) Mike Thelwall - “Sentiment Strength Detection for Social Media Text...
(Keynote) Mike Thelwall - “Sentiment Strength Detection for Social Media Text...(Keynote) Mike Thelwall - “Sentiment Strength Detection for Social Media Text...
(Keynote) Mike Thelwall - “Sentiment Strength Detection for Social Media Text...
 

Recently uploaded

1.Wireless Communication System_Wireless communication is a broad term that i...
1.Wireless Communication System_Wireless communication is a broad term that i...1.Wireless Communication System_Wireless communication is a broad term that i...
1.Wireless Communication System_Wireless communication is a broad term that i...
JeyaPerumal1
 
急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样
急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样
急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样
3ipehhoa
 
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
3ipehhoa
 
History+of+E-commerce+Development+in+China-www.cfye-commerce.shop
History+of+E-commerce+Development+in+China-www.cfye-commerce.shopHistory+of+E-commerce+Development+in+China-www.cfye-commerce.shop
History+of+E-commerce+Development+in+China-www.cfye-commerce.shop
laozhuseo02
 
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and GuidelinesMulti-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Sanjeev Rampal
 
Living-in-IT-era-Module-7-Imaging-and-Design-for-Social-Impact.pptx
Living-in-IT-era-Module-7-Imaging-and-Design-for-Social-Impact.pptxLiving-in-IT-era-Module-7-Imaging-and-Design-for-Social-Impact.pptx
Living-in-IT-era-Module-7-Imaging-and-Design-for-Social-Impact.pptx
TristanJasperRamos
 
guildmasters guide to ravnica Dungeons & Dragons 5...
guildmasters guide to ravnica Dungeons & Dragons 5...guildmasters guide to ravnica Dungeons & Dragons 5...
guildmasters guide to ravnica Dungeons & Dragons 5...
Rogerio Filho
 
test test test test testtest test testtest test testtest test testtest test ...
test test  test test testtest test testtest test testtest test testtest test ...test test  test test testtest test testtest test testtest test testtest test ...
test test test test testtest test testtest test testtest test testtest test ...
Arif0071
 
How to Use Contact Form 7 Like a Pro.pptx
How to Use Contact Form 7 Like a Pro.pptxHow to Use Contact Form 7 Like a Pro.pptx
How to Use Contact Form 7 Like a Pro.pptx
Gal Baras
 
The+Prospects+of+E-Commerce+in+China.pptx
The+Prospects+of+E-Commerce+in+China.pptxThe+Prospects+of+E-Commerce+in+China.pptx
The+Prospects+of+E-Commerce+in+China.pptx
laozhuseo02
 
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
3ipehhoa
 
ER(Entity Relationship) Diagram for online shopping - TAE
ER(Entity Relationship) Diagram for online shopping - TAEER(Entity Relationship) Diagram for online shopping - TAE
ER(Entity Relationship) Diagram for online shopping - TAE
Himani415946
 
BASIC C++ lecture NOTE C++ lecture 3.pptx
BASIC C++ lecture NOTE C++ lecture 3.pptxBASIC C++ lecture NOTE C++ lecture 3.pptx
BASIC C++ lecture NOTE C++ lecture 3.pptx
natyesu
 
Latest trends in computer networking.pptx
Latest trends in computer networking.pptxLatest trends in computer networking.pptx
Latest trends in computer networking.pptx
JungkooksNonexistent
 
Output determination SAP S4 HANA SAP SD CC
Output determination SAP S4 HANA SAP SD CCOutput determination SAP S4 HANA SAP SD CC
Output determination SAP S4 HANA SAP SD CC
ShahulHameed54211
 
This 7-second Brain Wave Ritual Attracts Money To You.!
This 7-second Brain Wave Ritual Attracts Money To You.!This 7-second Brain Wave Ritual Attracts Money To You.!
This 7-second Brain Wave Ritual Attracts Money To You.!
nirahealhty
 

Recently uploaded (16)

1.Wireless Communication System_Wireless communication is a broad term that i...
1.Wireless Communication System_Wireless communication is a broad term that i...1.Wireless Communication System_Wireless communication is a broad term that i...
1.Wireless Communication System_Wireless communication is a broad term that i...
 
急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样
急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样
急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样
 
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
 
History+of+E-commerce+Development+in+China-www.cfye-commerce.shop
History+of+E-commerce+Development+in+China-www.cfye-commerce.shopHistory+of+E-commerce+Development+in+China-www.cfye-commerce.shop
History+of+E-commerce+Development+in+China-www.cfye-commerce.shop
 
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and GuidelinesMulti-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
 
Living-in-IT-era-Module-7-Imaging-and-Design-for-Social-Impact.pptx
Living-in-IT-era-Module-7-Imaging-and-Design-for-Social-Impact.pptxLiving-in-IT-era-Module-7-Imaging-and-Design-for-Social-Impact.pptx
Living-in-IT-era-Module-7-Imaging-and-Design-for-Social-Impact.pptx
 
guildmasters guide to ravnica Dungeons & Dragons 5...
guildmasters guide to ravnica Dungeons & Dragons 5...guildmasters guide to ravnica Dungeons & Dragons 5...
guildmasters guide to ravnica Dungeons & Dragons 5...
 
test test test test testtest test testtest test testtest test testtest test ...
test test  test test testtest test testtest test testtest test testtest test ...test test  test test testtest test testtest test testtest test testtest test ...
test test test test testtest test testtest test testtest test testtest test ...
 
How to Use Contact Form 7 Like a Pro.pptx
How to Use Contact Form 7 Like a Pro.pptxHow to Use Contact Form 7 Like a Pro.pptx
How to Use Contact Form 7 Like a Pro.pptx
 
The+Prospects+of+E-Commerce+in+China.pptx
The+Prospects+of+E-Commerce+in+China.pptxThe+Prospects+of+E-Commerce+in+China.pptx
The+Prospects+of+E-Commerce+in+China.pptx
 
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
 
ER(Entity Relationship) Diagram for online shopping - TAE
ER(Entity Relationship) Diagram for online shopping - TAEER(Entity Relationship) Diagram for online shopping - TAE
ER(Entity Relationship) Diagram for online shopping - TAE
 
BASIC C++ lecture NOTE C++ lecture 3.pptx
BASIC C++ lecture NOTE C++ lecture 3.pptxBASIC C++ lecture NOTE C++ lecture 3.pptx
BASIC C++ lecture NOTE C++ lecture 3.pptx
 
Latest trends in computer networking.pptx
Latest trends in computer networking.pptxLatest trends in computer networking.pptx
Latest trends in computer networking.pptx
 
Output determination SAP S4 HANA SAP SD CC
Output determination SAP S4 HANA SAP SD CCOutput determination SAP S4 HANA SAP SD CC
Output determination SAP S4 HANA SAP SD CC
 
This 7-second Brain Wave Ritual Attracts Money To You.!
This 7-second Brain Wave Ritual Attracts Money To You.!This 7-second Brain Wave Ritual Attracts Money To You.!
This 7-second Brain Wave Ritual Attracts Money To You.!
 

(SoWeMine Workshop) "#nowplaying on #Spotify: Leveraging Spotify Information on Twitter for Artist Recommendations" - Martin Pichl, Eva Zangerle and Günther SpechtPresentation

  • 1. Seite 1 Page 1 The University of Innsbruck was founded in 1669 and is one of Austria’s oldest universities. Today, with over 28.000 students and 4.000 staff, it is western Austria’s largest institution of higher education and research. For further information visit: www.uibk.ac.at. #nowplaying on #Spotify: Leveraging Spotify Information on Twitter for Artist Recommendations Martin Pichl, Eva Zangerle and Günther Specht
  • 2. Seite 2 Page 2 Agenda • Why Music Recommendations? • Dataset Creation & Recommendation Approach • Discussion and Future Work
  • 3. Seite 3 Page 3 Recent Trends • Rise of the web enabled new distribution channels • Online Stores • Music Streaming Platforms • … • These new distribution channels – Exploit a word-wide market – Virtually no inventory costs → More and more dives music is available
  • 4. Seite 4 Page 4 Why (Music) Recommender Systems? • The user is confronted with more and more diverse music – on streaming platforms – in online stores – on mobile devices • and has a free choice • Users often do not know what to listen to → Information Overload • Recommender Systems – Helps users finding music they like → Increase usability
  • 5. Seite 5 Page 5 Research on Music Recommender System • Publicly available data necessary • Twitter – People share what they are listening at the moment • Get additional information from Spotify – Additional listening events – Additional information about the tracks and the artists – Additional information about the listening context • The additional information is necessary to build a more specialized recommender system
  • 7. Seite 7 Page 7 The Dataset • Generated dataset based Tweets that contains – <UserID, ArtistID, TrackID>-triples – Boolean preferences (listened/not listened) • Cleaning – Removed duplicates – Removed certain accounts i.e. @SpotifyNowPlaying – Removed “Various Artists”
  • 8. Seite 8 Page 8 Dataset Snapshot • Dataset contains – 513,489 listening events – by 68,045 unique users – listening to 97,586 unique tracks – by unique 40,593 artists • Distribution – In average 4.77 tweets per user (SD= 30.02) – Median of 2
  • 9. Seite 9 Page 9 Artist Recommendations using this Dataset • No content based information – Recommendations are computed using collaborative filtering • Collaborative Filtering (CF) – CF recommends items that the most similar users of a user listened to (and are new to the user) • CF relies on – A user similarity measure – A number of nearest neighbors 𝑘
  • 10. Seite 10 Page 10 User Similarity • Boolean Preferences – Jaccard Coefficient is suitable – 𝐽𝑎𝑐𝑐𝑎𝑟𝑑𝑖,𝑗 = 𝑆 𝑖 ∩ 𝑆 𝑗 𝑆 𝑖 ∪ 𝑆 𝑗 • Include all the information available – Compute Jaccard Coefficient using the artist listening history – Compute Jaccard Coefficient using the track listening history – Combined using an weighted average • 𝑢𝑠𝑒𝑟𝑆𝑖𝑚 = 𝑤 𝑎 ∗ 𝑎𝑟𝑡𝑖𝑠𝑡𝑆𝑖𝑚 + 𝑤𝑡 ∗ 𝑡𝑟𝑎𝑐𝑘𝑆𝑖𝑚
  • 11. Seite 11 Page 11 Parameter Tuning • Input Parameters – 𝑤 𝑎, 𝑤𝑡, 𝑘 – Optimized using a Genetic Algorithm (GA) – Fitness = Precision of the recommender system – In average a good solution was found after 4.14 iterations (SD=2.27)
  • 12. Seite 12 Page 12 Genetic Algorithm • 𝑤 𝑎,𝑤𝑡, 𝑘 are float point genes between 0 and 1 and form a individual • Random initial distribution • The fitness of each individual is measured using the precision • Crossover and mutations of the best individual • Terminate if the precision is 1 or a certain number of generations is reach
  • 13. Seite 13 Page 13 The Big Picture
  • 14. Seite 14 Page 14 Evaluation Setup • Offline Evaluation – From each user we removed 1/3 of the listening events for testing – Recommended 𝑝 ∗ 𝑆𝑖𝑧𝑒 𝑜𝑓 𝑡ℎ𝑒 𝑇𝑒𝑠𝑡𝑠𝑒𝑡 items – Varied 𝑝 between 0 and 1 – Computed precision and recall for each 𝑝 • Parameters used for the Evaluation – 𝑤 𝑎 = 0.21 – 𝑤𝑡 = 0.94 – 𝑘 = 59
  • 15. Seite 15 Page 15 Evaluation Metrics • Hit: Item found in the testset • 𝑝𝑟𝑒𝑐𝑖𝑠𝑜𝑛 = ℎ𝑖𝑡𝑠 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑟𝑒𝑐𝑜𝑚𝑚𝑒𝑛𝑑𝑎𝑡𝑖𝑜𝑛𝑠 • Relevant Items: All items in the testset • 𝑟𝑒𝑐𝑎𝑙𝑙 = ℎ𝑖𝑡𝑠 𝑠𝑖𝑧𝑒 𝑜𝑓 𝑡ℎ𝑒 𝑡𝑒𝑠𝑡𝑠𝑒𝑡
  • 16. Seite 16 Page 16 Performance of the optimized Recommender System n Precision Recall 1 0.4656 0.0228 2 0.3622 0.0547 3 0.3137 0.0782 4 0.2812 0.1003 5 0.2531 0.1195 6 0.2315 0.1286 7 0.2170 0.1396 8 0.2170 0.1396 9 0.1871 0.1583 10 0.1871 0,1583 0 0,1 0,2 0,3 0,4 0,5 Precision/Recall Number of Recommendations (% of the Testset) Precision Recall
  • 17. Seite 17 Page 17 Discussion • Heading into the right direction • Performance is limited for a high number of recommendations – Data sparsity – Too general approach • Performance improvements with – Reducing data sparsity – Specialized algorithm that fits more to music recommendation
  • 18. Seite 18 Page 18 Next Steps towards a more specialized RS • Match Spotify and Twitter Users – Early experiments show that we can match ~ 10% of the dataset – Better matching than using the username and played tracks? • Extract listening context from playlist names, i.e. – Christmas – Workout, training – Driving – …
  • 19. Seite 19 Page 19 Next Steps towards a more specialized RS • The offline evaluation is rather limited • Create an intuitive webinterface • Conduct a live user experiment