SlideShare a Scribd company logo
1 of 18
Download to read offline
BLUE MANGOES
Piyush Khemka, Prachi Poddar, Snigdha
Kamal, Sharang Bhat
MUSIC RECOMMENDATION
SYSTEM
Based on Million Song Dataset*
*labrosa.ee.columbia.edu/millionsong/
Background and Motivation
Growth in Music streaming consumption among consumers
Plethora of options available - Spotify, Pandora, 8tracks
*http://www.riaa.com/reports/riaa-2015-year-end-sales-shipments-data-report-riaa/
RIAA - Recording Industry Association of America
Background and Motivation
Music Recommendations - excellent feature for any music application.
Better Recommendations - Better Conversions, More engagement
Develop a music recommendation system based on the Million Song
Dataset using various recommendation methodologies and draw a
comparative analysis between them
Dataset Description
The Million Song Dataset User Taste Profile
Dataset
Freely available collection of audio features and metadata 48 million triplets(User Id, Song ID, count)
for a million contemporary popular music tracks ( 280 GB) Gathered from 1 million users
1 M songs.
Subset: 2.8 GB (compressed) -> 10 GB (CSV) Size: 500 MB (compressed) -> 3 GB (.txt
file)
Project Pipeline
Collaborative Filtering
The Collaborative Filtering method uses previous user choices, and
choices of similar users to predict the possible future song
selections
Data was first fed through a data cleaning module, which removed
erroneous entries, such as missing values for both Song and User
ID’s.
ID’s were then mapped to a unique integer via a dictionary as Spark
functions require numeric values
Collaborative Filtering
Used Matrix Factorization instead of the conventional distance
function
Latent factor methods were used to train on some known data
K User Features (latent) were extracted , which in this case was the
song count
Assumption-Song Count represents all the factors that could have
contributed to a user choosing to listen to a song
Results - Collaborative Filtering
Content Based Recommendation
We extracted certain features from the dataset, which describes
features of a song
Normalized those features by taking its product with its confidence to
get a final value, such as mode and mode confidence to get a final
mode estimate
Removed features not related to audio features
Clustered songs in a higher dimensional space, and found similar
songs within each cluster
Content Based Recommendation
Training Method- Cross Validation
User-triples dataset was split in a ratio of 80:20.
80% of the dataset was used to train a clustering model
Created a profile for each user by merging the user-song dataset
Each user profile consisted of a “mean” of all songs heard by a user in
his lifetime (as per the dataset)
Clustered using K-means
Content Based Recommendation
Testing
Generated 10 nearest neighbours for each user
Evaluated by comparing recommendations to actual values present in
the dataset
Results - Content Based Filtering
Precision = True Positives / (True Positive + False Positive)
Final Precision- 6.1% on entire MSD subset
Performance for Content Based decreases with increase in size of data
*
* http://www-
personal.umich.edu/~yjli/content/pro
jectreport.pdf
Visualizations and Trend Analysis
Visualizations and Trend Analysis
Team Work
- Scoping the project, data extraction, data cleaning - All
- Collaborative Filtering - Sharang
- Content Based Filtering - Piyush
- Trend Analysis & Visualizations - Prachi, Snigdha
- Conclusion, Report, Presentation - All
- Asking questions on Piazza - Anonymous
Conclusion
Eye opener on big data and its difficulties
We were fairly satisfied with our results, we managed to reduce the
RMSE by almost 1, and virtually predicted how many times a user
will play a song
We fairly satisfied with a precision of 6.1 % considering we tried a new
recommendation methodology
Collaborative Filtering was easier to implement and evaluate
THANK YOU

More Related Content

Similar to music recommendation system ,Based on Million Song Dataset

Deep Learning Based Music Recommendation System
Deep Learning Based Music Recommendation SystemDeep Learning Based Music Recommendation System
Deep Learning Based Music Recommendation SystemIRJET Journal
 
Personalizing the listening experience
Personalizing the listening experiencePersonalizing the listening experience
Personalizing the listening experienceMounia Lalmas-Roelleke
 
Music Recommendation System
Music Recommendation SystemMusic Recommendation System
Music Recommendation SystemIRJET Journal
 
Mehfil : Song Recommendation System Using Sentiment Detected
Mehfil : Song Recommendation System Using Sentiment DetectedMehfil : Song Recommendation System Using Sentiment Detected
Mehfil : Song Recommendation System Using Sentiment DetectedIRJET Journal
 
Igor Kostiuk “Как приручить музыкальную рекомендательную систему”
Igor Kostiuk “Как приручить музыкальную рекомендательную систему”Igor Kostiuk “Как приручить музыкальную рекомендательную систему”
Igor Kostiuk “Как приручить музыкальную рекомендательную систему”Dakiry
 
A Review on Song Recommendation Techniques
A Review on Song Recommendation TechniquesA Review on Song Recommendation Techniques
A Review on Song Recommendation TechniquesIRJET Journal
 
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...Timo van Niedek
 
Deezer - Big data as a streaming service
Deezer - Big data as a streaming serviceDeezer - Big data as a streaming service
Deezer - Big data as a streaming serviceJulie Knibbe
 
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...IRJET Journal
 
Music Recommendation system Project PPT.pptx
Music Recommendation system Project PPT.pptxMusic Recommendation system Project PPT.pptx
Music Recommendation system Project PPT.pptx2k22csds2212634
 
Music Personalization At Spotify
Music Personalization At SpotifyMusic Personalization At Spotify
Music Personalization At SpotifyVidhya Murali
 
IRJET- Feeling based Music Recommnendation System using Sensors
IRJET- Feeling based Music Recommnendation System using SensorsIRJET- Feeling based Music Recommnendation System using Sensors
IRJET- Feeling based Music Recommnendation System using SensorsIRJET Journal
 
IRJET- Analysis of Music Recommendation System using Machine Learning Alg...
IRJET-  	  Analysis of Music Recommendation System using Machine Learning Alg...IRJET-  	  Analysis of Music Recommendation System using Machine Learning Alg...
IRJET- Analysis of Music Recommendation System using Machine Learning Alg...IRJET Journal
 
Recsys 2018 overview and highlights
Recsys 2018 overview and highlightsRecsys 2018 overview and highlights
Recsys 2018 overview and highlightsSandra Garcia
 
Twente ir-course 20-10-2010
Twente ir-course 20-10-2010Twente ir-course 20-10-2010
Twente ir-course 20-10-2010Arjen de Vries
 
NewCo Discovery
NewCo DiscoveryNewCo Discovery
NewCo DiscoveryDavid Cho
 
genetic algorithm based music recommender system
genetic algorithm based music recommender systemgenetic algorithm based music recommender system
genetic algorithm based music recommender systemneha pevekar
 

Similar to music recommendation system ,Based on Million Song Dataset (20)

Deep Learning Based Music Recommendation System
Deep Learning Based Music Recommendation SystemDeep Learning Based Music Recommendation System
Deep Learning Based Music Recommendation System
 
Personalizing the listening experience
Personalizing the listening experiencePersonalizing the listening experience
Personalizing the listening experience
 
Music Recommendation System
Music Recommendation SystemMusic Recommendation System
Music Recommendation System
 
Song Proffer
Song ProfferSong Proffer
Song Proffer
 
About
AboutAbout
About
 
Mehfil : Song Recommendation System Using Sentiment Detected
Mehfil : Song Recommendation System Using Sentiment DetectedMehfil : Song Recommendation System Using Sentiment Detected
Mehfil : Song Recommendation System Using Sentiment Detected
 
Igor Kostiuk “Как приручить музыкальную рекомендательную систему”
Igor Kostiuk “Как приручить музыкальную рекомендательную систему”Igor Kostiuk “Как приручить музыкальную рекомендательную систему”
Igor Kostiuk “Как приручить музыкальную рекомендательную систему”
 
A Review on Song Recommendation Techniques
A Review on Song Recommendation TechniquesA Review on Song Recommendation Techniques
A Review on Song Recommendation Techniques
 
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
Random Walk with Restart for Automatic Playlist Continuation and Query-specif...
 
Deezer - Big data as a streaming service
Deezer - Big data as a streaming serviceDeezer - Big data as a streaming service
Deezer - Big data as a streaming service
 
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
 
Music Recommendation system Project PPT.pptx
Music Recommendation system Project PPT.pptxMusic Recommendation system Project PPT.pptx
Music Recommendation system Project PPT.pptx
 
C018211723
C018211723C018211723
C018211723
 
Music Personalization At Spotify
Music Personalization At SpotifyMusic Personalization At Spotify
Music Personalization At Spotify
 
IRJET- Feeling based Music Recommnendation System using Sensors
IRJET- Feeling based Music Recommnendation System using SensorsIRJET- Feeling based Music Recommnendation System using Sensors
IRJET- Feeling based Music Recommnendation System using Sensors
 
IRJET- Analysis of Music Recommendation System using Machine Learning Alg...
IRJET-  	  Analysis of Music Recommendation System using Machine Learning Alg...IRJET-  	  Analysis of Music Recommendation System using Machine Learning Alg...
IRJET- Analysis of Music Recommendation System using Machine Learning Alg...
 
Recsys 2018 overview and highlights
Recsys 2018 overview and highlightsRecsys 2018 overview and highlights
Recsys 2018 overview and highlights
 
Twente ir-course 20-10-2010
Twente ir-course 20-10-2010Twente ir-course 20-10-2010
Twente ir-course 20-10-2010
 
NewCo Discovery
NewCo DiscoveryNewCo Discovery
NewCo Discovery
 
genetic algorithm based music recommender system
genetic algorithm based music recommender systemgenetic algorithm based music recommender system
genetic algorithm based music recommender system
 

Recently uploaded

Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 

Recently uploaded (20)

Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 

music recommendation system ,Based on Million Song Dataset

  • 1. BLUE MANGOES Piyush Khemka, Prachi Poddar, Snigdha Kamal, Sharang Bhat
  • 2. MUSIC RECOMMENDATION SYSTEM Based on Million Song Dataset* *labrosa.ee.columbia.edu/millionsong/
  • 3. Background and Motivation Growth in Music streaming consumption among consumers Plethora of options available - Spotify, Pandora, 8tracks *http://www.riaa.com/reports/riaa-2015-year-end-sales-shipments-data-report-riaa/ RIAA - Recording Industry Association of America
  • 4. Background and Motivation Music Recommendations - excellent feature for any music application. Better Recommendations - Better Conversions, More engagement Develop a music recommendation system based on the Million Song Dataset using various recommendation methodologies and draw a comparative analysis between them
  • 5. Dataset Description The Million Song Dataset User Taste Profile Dataset Freely available collection of audio features and metadata 48 million triplets(User Id, Song ID, count) for a million contemporary popular music tracks ( 280 GB) Gathered from 1 million users 1 M songs. Subset: 2.8 GB (compressed) -> 10 GB (CSV) Size: 500 MB (compressed) -> 3 GB (.txt file)
  • 7. Collaborative Filtering The Collaborative Filtering method uses previous user choices, and choices of similar users to predict the possible future song selections Data was first fed through a data cleaning module, which removed erroneous entries, such as missing values for both Song and User ID’s. ID’s were then mapped to a unique integer via a dictionary as Spark functions require numeric values
  • 8. Collaborative Filtering Used Matrix Factorization instead of the conventional distance function Latent factor methods were used to train on some known data K User Features (latent) were extracted , which in this case was the song count Assumption-Song Count represents all the factors that could have contributed to a user choosing to listen to a song
  • 10. Content Based Recommendation We extracted certain features from the dataset, which describes features of a song Normalized those features by taking its product with its confidence to get a final value, such as mode and mode confidence to get a final mode estimate Removed features not related to audio features Clustered songs in a higher dimensional space, and found similar songs within each cluster
  • 11. Content Based Recommendation Training Method- Cross Validation User-triples dataset was split in a ratio of 80:20. 80% of the dataset was used to train a clustering model Created a profile for each user by merging the user-song dataset Each user profile consisted of a “mean” of all songs heard by a user in his lifetime (as per the dataset) Clustered using K-means
  • 12. Content Based Recommendation Testing Generated 10 nearest neighbours for each user Evaluated by comparing recommendations to actual values present in the dataset
  • 13. Results - Content Based Filtering Precision = True Positives / (True Positive + False Positive) Final Precision- 6.1% on entire MSD subset Performance for Content Based decreases with increase in size of data * * http://www- personal.umich.edu/~yjli/content/pro jectreport.pdf
  • 16. Team Work - Scoping the project, data extraction, data cleaning - All - Collaborative Filtering - Sharang - Content Based Filtering - Piyush - Trend Analysis & Visualizations - Prachi, Snigdha - Conclusion, Report, Presentation - All - Asking questions on Piazza - Anonymous
  • 17. Conclusion Eye opener on big data and its difficulties We were fairly satisfied with our results, we managed to reduce the RMSE by almost 1, and virtually predicted how many times a user will play a song We fairly satisfied with a precision of 6.1 % considering we tried a new recommendation methodology Collaborative Filtering was easier to implement and evaluate