Created a Classification Machine Learning model to predict the popularity of songs on Spotify for the recommendation engine, which will recommend songs to its listeners based on popular songs and their taste.
Project
Basic Univariate Statistics, Graphical Methods, and Communication of
Data
Overview and Rationale
This assignment is designed to provide you with hands-on experience in performing
descriptive statistical methods on a data set. The data set is provided in an Excel workbook
and contains a wide range to data types that you will need to work with.
Course Outcomes
This assignment is directly linked to the following key learning outcomes from the course
syllabus:
CO1: Explore the use of statistical software in data analysis through hands-on applications,
CO2: Create distributions and graphical representation based on given data and identify
which distributions best fit the data using the Chi-squared goodness of fit test.
CO7: Interpret meaningful relationships and patterns in the data in relation to a given
business question
Assignment Summary
Using the data provided in the attached Excel workbook provided, apply the methods of
graphical and numerical descriptive statistics to both the categorical and numerical data as
described in the project document.
Follow the instructions in the project document to analyze the data presented in the Excel
workbook. Then complete a report summarizing the results in your Excel workbook.
Submit both the report and the Excel workbook.
The Excel workbook contains all statistical work. The report should include all your
findings along with important statisticalissues.
Format & Guidelines
The report should follow the following format:
(i) Introduction
(ii) Analysis
(iii) Conclusion
And be 800-900 words and be presented in the APA format.
The Excel spreadsheet must be completed as outlined in the assignment.
Project Instructions:
Using the Excel workbook provided:
1. Create a histogram of the top 10 districts with the highest average number
of vehicles per household in 2016. (10%)
2. Create a relative frequency histogram from the “2016 percentage of households
without vehicles” for the entire country. Comment on the shape of the
distribution. (10%)
3. Create a box plot of the “2016 percentage of households without vehicles”
for the entire country. (5%)
4. Perform numerical descriptive statistics for the “2016 Vehicles per Household” for
the entire country. (5%)
5. For the “2016 percentage of households without vehicle” for the entire country,
determine whether there are any outliers. Comment on the districts to which
the outliers belong. (10%)
6. Repeat tasks 2-5 above for the “2015 percentage of households without vehicle”.
(30%) Describe whether there are any similarities or dissimilarities in the
distributions of the two variables that you have analyzed (the two variables are:
“2016 percentage of households without vehicles” and “2015 percentage of
households without vehicles”) (10%).
7. Create a table that shows the average number of vehicles per household by state in
2016 (org ...
Project
Basic Univariate Statistics, Graphical Methods, and Communication of
Data
Overview and Rationale
This assignment is designed to provide you with hands-on experience in performing
descriptive statistical methods on a data set. The data set is provided in an Excel workbook
and contains a wide range to data types that you will need to work with.
Course Outcomes
This assignment is directly linked to the following key learning outcomes from the course
syllabus:
CO1: Explore the use of statistical software in data analysis through hands-on applications,
CO2: Create distributions and graphical representation based on given data and identify
which distributions best fit the data using the Chi-squared goodness of fit test.
CO7: Interpret meaningful relationships and patterns in the data in relation to a given
business question
Assignment Summary
Using the data provided in the attached Excel workbook provided, apply the methods of
graphical and numerical descriptive statistics to both the categorical and numerical data as
described in the project document.
Follow the instructions in the project document to analyze the data presented in the Excel
workbook. Then complete a report summarizing the results in your Excel workbook.
Submit both the report and the Excel workbook.
The Excel workbook contains all statistical work. The report should include all your
findings along with important statisticalissues.
Format & Guidelines
The report should follow the following format:
(i) Introduction
(ii) Analysis
(iii) Conclusion
And be 800-900 words and be presented in the APA format.
The Excel spreadsheet must be completed as outlined in the assignment.
Project Instructions:
Using the Excel workbook provided:
1. Create a histogram of the top 10 districts with the highest average number
of vehicles per household in 2016. (10%)
2. Create a relative frequency histogram from the “2016 percentage of households
without vehicles” for the entire country. Comment on the shape of the
distribution. (10%)
3. Create a box plot of the “2016 percentage of households without vehicles”
for the entire country. (5%)
4. Perform numerical descriptive statistics for the “2016 Vehicles per Household” for
the entire country. (5%)
5. For the “2016 percentage of households without vehicle” for the entire country,
determine whether there are any outliers. Comment on the districts to which
the outliers belong. (10%)
6. Repeat tasks 2-5 above for the “2015 percentage of households without vehicle”.
(30%) Describe whether there are any similarities or dissimilarities in the
distributions of the two variables that you have analyzed (the two variables are:
“2016 percentage of households without vehicles” and “2015 percentage of
households without vehicles”) (10%).
7. Create a table that shows the average number of vehicles per household by state in
2016 (org ...
survey on Hybrid recommendation mechanism to get effective ranking results fo...Suraj Ligade
These days clients are having exclusive
requirements towards advancements, they need to hunt tunes
in such circumstances where they are not ready to recall tunes
title or melody related points of interest. Recovery of music or
melodies substance is one of the hardest errands and testing
work in the field of Music Information Retrieval (MIR). There
are different looking techniques created and executed, yet
these seeking strategies are no more ready to inquiry tunes
which required by the clients and confronting different issues
like programmed playlist creation, music suggestion or music
pursuit are connected issues. In past framework client seek
the tune with the assistance of tune title, craftsman name and
whatever other related points of interest so this strategy is
exceptionally tedious. To beat this issue singing so as to look
tune or murmuring a segment of it is the most regular
approach to seek the tune. This hunt strategy is the most
helpful when client don't have entry to sound gadget or client
can't review the traits of the tune such as tune title, name of
craftsman, name of collection. In proposed framework client
have not stress over recalling the tune data and this technique
is not tedious. In this strategy we utilize the data from a
client's hunt history and in addition the normal properties of
client's comparative foundations. Cross breed proposal
component utilizes the substance construct recovery
framework situated in light of utilization of the sound data
such as tone, pitch, mood. This component used to get exact
result to the client. The more imperative idea is clients ready
to work their gadgets without manual information orders by
hand. It is simple and basic system to perform music look.
In October 2015, we conducted two separate surveys about music consumption habits, and specifically Apple Music. This reports shares the findings of those two surveys along with key messages for readers.
Competitor analysis of Music Streaming ServicesTiffany Sam
As a personal winter break project, this deck is a compilation of various trends I have identified in the music streaming space both locally and globally with special attention paid to the rising competition between Apple Music and Spotify.
Assignment Grading Rubric
Course: MT460 Unit: 9 Points: 45
_________________________________________________
Unit 9: Leadership, Policy, and Culture
Case Study Analysis Paper:
Prepare a case study analysis of Case 9, Defender Direct, Inc.: A Business of Growing Leaders found in the Cases
section of your digital book.
Closely follow the Case Study Analysis Template by clicking on the hyperlink. Please utilize this template format for this
Assignment. Use titles and subtitles per the format for readability purposes. Focus upon the idea of the company’s
strategy-culture relationship and which of the four strategy-culture situations should it implement in order to help move
Defender Direct forward. Please include the SWOT Analysis with the four quadrants in the appendix of your paper (after
the References page). You can find the SWOT analysis template in Doc Sharing.
Assignment Checklist:
• Conduct a SWOT Analysis on the case study company’s current strategy-culture relationship.
• Create a case study analysis focusing on which of the four strategy-culture situations it should implement in order
to help move the company forward.
Format
The case analysis should be 2--3 written pages in length (not including the formal title page and References page),
double-spaced. Ensure that you use correct spelling, grammar, punctuation, mechanics, and usage. The citing of sources
(text and list references) should use the current APA format and style.
http://extmedia.kaplan.edu/business/MT460/MT460_1403C/MT460_Case_Study_Analysis_Format_and_Style_Template.docx
For assistance with APA format and citation style visit the Kaplan Writing Center.
Directions for Submitting Your Project
• Before you submit your project, you should save your work on your computer in a location and with a name that
you will remember.
• Make sure your Assignment is in the correct file format (Microsoft Word .doc or .docx).
• Submit y your completed document to the Unit 9 Assignment Dropbox.
Need help with the Dropbox? Click on the Dropbox Guide link under Academic Tools tab.
MT460 Unit 9 Assignment Grading Rubric
Maximum
Percent
Criteria
Maximum
Points
50%
Content
22
Answer provides correct and complete
information demonstrating critical
thinking:
• Conduct a SWOT Analysis on
the case study company’s
current strategy-culture
relationship.
• Create a case study analysis
focusing on which of the four
strategy-culture situations it
should implement in order to
help move the company
forward.
30%
Analysis and Critical Thinking
• Synopsis of situation
• Key issues
• Problem definition
• Alternative solutions
• Select a solution
• Implementation
• Recommendations
14
15% Writing Style, Grammar 6
5% APA Format and Citation Style 3
100% Total 45
Assignment Grading RubricCourse: MT460 Unit: 9 Points: 45________ ...
Music Recommendation System is a cutting-edge technology that uses algorithms to analyze user preferences and behavior in order to suggest personalized music recommendations.
A music recommendation system project is a fascinating endeavor that combines elements of data analysis, machine learning, and user experience design to create personalized music suggestions for users. Here's a breakdown of what such a project might entail:
Data Collection: The first step is gathering data. This could include information about songs, albums, artists, genres, user preferences, listening history, and more. APIs from music streaming platforms like Spotify or Last.fm are often used to access this data.
Data Preprocessing: Once the data is collected, it needs to be cleaned and preprocessed. This involves tasks like removing duplicates, handling missing values, and converting data into a suitable format for analysis.
Feature Engineering: Features are attributes or characteristics of the data that can be used to make predictions or recommendations. In the context of music recommendation, features could include song tempo, genre, artist popularity, user listening history, and so on. Feature engineering involves selecting and creating relevant features from the data.
Model Selection: There are various machine learning algorithms that can be used for recommendation systems, such as collaborative filtering, content-based filtering, matrix factorization, and deep learning models. The choice of model depends on factors like the type of data available and the specific requirements of the project.
Training the Model: Once a model is selected, it needs to be trained on the preprocessed data. During training, the model learns patterns and relationships in the data that enable it to make accurate recommendations.
Evaluation: After training the model, it's important to evaluate its performance. This involves using metrics like precision, recall, and accuracy to assess how well the model predicts user preferences and provides relevant recommendations.
Deployment: Once the model is trained and evaluated, it can be deployed into a production environment where users can interact with it. This might involve integrating the recommendation system into a music streaming platform or building a standalone application.
Feedback Loop: A good recommendation system should be able to adapt to changing user preferences over time. This requires implementing a feedback loop where user interactions with the system are continuously monitored and used to update the model and improve the recommendations.
Throughout the development process, it's important to consider factors like scalability, usability, and privacy to ensure that the recommendation system is both effective and user-friendly. Additionally, incorporating techniques like A/B testing can help optimize the system and refine its recommendations based on real-world user feedback.
Understanding ai music discovery and recommendation systemsValerio Velardo
Thanks to Spotify, you’ve got millions of songs at your fingertips, ready to be discovered. And because of its AI-powered recommendation system, you’re more likely to find new music you’re already interested in. Learning what’s behind this technology can assist music business leaders and entrepreneurs to provide a more meaningful, tailored discovery experience for their users. Not only will you gain insight that most music leaders don’t yet have, but also become aware of how to apply this to your business.
In this one-hour webinar, you’ll:
- Learn how music recommendation systems work
- Understand how to leverage these systems to add value to
specific music business applications
- Learn how listeners discover music according to our research
findings
Incorporating data from the Spotify Web API, I expanded on the research for the June 2018 RTG presentation to study submitted memories as links between emotions, activities, and musical qualities using python instead of R.
survey on Hybrid recommendation mechanism to get effective ranking results fo...Suraj Ligade
These days clients are having exclusive
requirements towards advancements, they need to hunt tunes
in such circumstances where they are not ready to recall tunes
title or melody related points of interest. Recovery of music or
melodies substance is one of the hardest errands and testing
work in the field of Music Information Retrieval (MIR). There
are different looking techniques created and executed, yet
these seeking strategies are no more ready to inquiry tunes
which required by the clients and confronting different issues
like programmed playlist creation, music suggestion or music
pursuit are connected issues. In past framework client seek
the tune with the assistance of tune title, craftsman name and
whatever other related points of interest so this strategy is
exceptionally tedious. To beat this issue singing so as to look
tune or murmuring a segment of it is the most regular
approach to seek the tune. This hunt strategy is the most
helpful when client don't have entry to sound gadget or client
can't review the traits of the tune such as tune title, name of
craftsman, name of collection. In proposed framework client
have not stress over recalling the tune data and this technique
is not tedious. In this strategy we utilize the data from a
client's hunt history and in addition the normal properties of
client's comparative foundations. Cross breed proposal
component utilizes the substance construct recovery
framework situated in light of utilization of the sound data
such as tone, pitch, mood. This component used to get exact
result to the client. The more imperative idea is clients ready
to work their gadgets without manual information orders by
hand. It is simple and basic system to perform music look.
In October 2015, we conducted two separate surveys about music consumption habits, and specifically Apple Music. This reports shares the findings of those two surveys along with key messages for readers.
Competitor analysis of Music Streaming ServicesTiffany Sam
As a personal winter break project, this deck is a compilation of various trends I have identified in the music streaming space both locally and globally with special attention paid to the rising competition between Apple Music and Spotify.
Assignment Grading Rubric
Course: MT460 Unit: 9 Points: 45
_________________________________________________
Unit 9: Leadership, Policy, and Culture
Case Study Analysis Paper:
Prepare a case study analysis of Case 9, Defender Direct, Inc.: A Business of Growing Leaders found in the Cases
section of your digital book.
Closely follow the Case Study Analysis Template by clicking on the hyperlink. Please utilize this template format for this
Assignment. Use titles and subtitles per the format for readability purposes. Focus upon the idea of the company’s
strategy-culture relationship and which of the four strategy-culture situations should it implement in order to help move
Defender Direct forward. Please include the SWOT Analysis with the four quadrants in the appendix of your paper (after
the References page). You can find the SWOT analysis template in Doc Sharing.
Assignment Checklist:
• Conduct a SWOT Analysis on the case study company’s current strategy-culture relationship.
• Create a case study analysis focusing on which of the four strategy-culture situations it should implement in order
to help move the company forward.
Format
The case analysis should be 2--3 written pages in length (not including the formal title page and References page),
double-spaced. Ensure that you use correct spelling, grammar, punctuation, mechanics, and usage. The citing of sources
(text and list references) should use the current APA format and style.
http://extmedia.kaplan.edu/business/MT460/MT460_1403C/MT460_Case_Study_Analysis_Format_and_Style_Template.docx
For assistance with APA format and citation style visit the Kaplan Writing Center.
Directions for Submitting Your Project
• Before you submit your project, you should save your work on your computer in a location and with a name that
you will remember.
• Make sure your Assignment is in the correct file format (Microsoft Word .doc or .docx).
• Submit y your completed document to the Unit 9 Assignment Dropbox.
Need help with the Dropbox? Click on the Dropbox Guide link under Academic Tools tab.
MT460 Unit 9 Assignment Grading Rubric
Maximum
Percent
Criteria
Maximum
Points
50%
Content
22
Answer provides correct and complete
information demonstrating critical
thinking:
• Conduct a SWOT Analysis on
the case study company’s
current strategy-culture
relationship.
• Create a case study analysis
focusing on which of the four
strategy-culture situations it
should implement in order to
help move the company
forward.
30%
Analysis and Critical Thinking
• Synopsis of situation
• Key issues
• Problem definition
• Alternative solutions
• Select a solution
• Implementation
• Recommendations
14
15% Writing Style, Grammar 6
5% APA Format and Citation Style 3
100% Total 45
Assignment Grading RubricCourse: MT460 Unit: 9 Points: 45________ ...
Music Recommendation System is a cutting-edge technology that uses algorithms to analyze user preferences and behavior in order to suggest personalized music recommendations.
A music recommendation system project is a fascinating endeavor that combines elements of data analysis, machine learning, and user experience design to create personalized music suggestions for users. Here's a breakdown of what such a project might entail:
Data Collection: The first step is gathering data. This could include information about songs, albums, artists, genres, user preferences, listening history, and more. APIs from music streaming platforms like Spotify or Last.fm are often used to access this data.
Data Preprocessing: Once the data is collected, it needs to be cleaned and preprocessed. This involves tasks like removing duplicates, handling missing values, and converting data into a suitable format for analysis.
Feature Engineering: Features are attributes or characteristics of the data that can be used to make predictions or recommendations. In the context of music recommendation, features could include song tempo, genre, artist popularity, user listening history, and so on. Feature engineering involves selecting and creating relevant features from the data.
Model Selection: There are various machine learning algorithms that can be used for recommendation systems, such as collaborative filtering, content-based filtering, matrix factorization, and deep learning models. The choice of model depends on factors like the type of data available and the specific requirements of the project.
Training the Model: Once a model is selected, it needs to be trained on the preprocessed data. During training, the model learns patterns and relationships in the data that enable it to make accurate recommendations.
Evaluation: After training the model, it's important to evaluate its performance. This involves using metrics like precision, recall, and accuracy to assess how well the model predicts user preferences and provides relevant recommendations.
Deployment: Once the model is trained and evaluated, it can be deployed into a production environment where users can interact with it. This might involve integrating the recommendation system into a music streaming platform or building a standalone application.
Feedback Loop: A good recommendation system should be able to adapt to changing user preferences over time. This requires implementing a feedback loop where user interactions with the system are continuously monitored and used to update the model and improve the recommendations.
Throughout the development process, it's important to consider factors like scalability, usability, and privacy to ensure that the recommendation system is both effective and user-friendly. Additionally, incorporating techniques like A/B testing can help optimize the system and refine its recommendations based on real-world user feedback.
Understanding ai music discovery and recommendation systemsValerio Velardo
Thanks to Spotify, you’ve got millions of songs at your fingertips, ready to be discovered. And because of its AI-powered recommendation system, you’re more likely to find new music you’re already interested in. Learning what’s behind this technology can assist music business leaders and entrepreneurs to provide a more meaningful, tailored discovery experience for their users. Not only will you gain insight that most music leaders don’t yet have, but also become aware of how to apply this to your business.
In this one-hour webinar, you’ll:
- Learn how music recommendation systems work
- Understand how to leverage these systems to add value to
specific music business applications
- Learn how listeners discover music according to our research
findings
Incorporating data from the Spotify Web API, I expanded on the research for the June 2018 RTG presentation to study submitted memories as links between emotions, activities, and musical qualities using python instead of R.
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Round table discussion of vector databases, unstructured data, ai, big data, real-time, robots and Milvus.
A lively discussion with NJ Gen AI Meetup Lead, Prasad and Procure.FYI's Co-Found
3. Problem Statement
Spotify, one of the most popular platform, used by the listeners for songs and
podcasts.
Spotify uses recommendation engine to recommend tracks to the listener in the
discover weekly section according to listener's preference and popularity.
Out of the two factors, popularity of the track is the most important factor, used
by the recommendation engine because it also tells about the popular
preferences of the people, based upon various variables and the heyday period
of the track.
4. Dataset Description
114000 rows and 21 columns
Target Variable
Popularity – This measure vary with past released song with present
released songs because Spotify reshuffles according to monthly
listeners. It is a multiclass variable consisting of 3 categories.
In this, only those variables are taken which are affecting the target
variable. Other variable are not taken into consideration.
5. Key variables
Independent Variable
Continuous:
Categorical:
•Valence% – It is the positiveness of the song. Higher the value is cheerful
and euphoric, lower the value depressing and sad.
•Danceability% – How much the song can be used for dance purpose.
•Energy% – It is the amount of energy a song have
•Acoustic Ness% – It measures the use of natural instruments or
electronically made music.
•Key – It the musical notes which is used in the track, such as 0=C, 1=C#,
and so on. There are total of 12 keys present.
•Tempo – It represents the speed of the song. Higher the tempo higher
faster the song and vice-versa.
•Duration – It represents the length of the song in seconds.
•Speech ness – It represents the amount of vocals/voices present in the
song.
7. EDA Report
• Duplicate values, null values and typo error were present in the data.
• There are huge outliers present in the data, which is treated by converting
them into categories maintaining the balance in the classes.
• Did some Feature engineering such as clubbing, binning and rounding the
data to reduce the classes in the data.
• The target variable “Popularity” was initially in percentage 0-100%.
However, the original data description says that it is classification problem.
So, the target variable is converted from regression to multi-classification.
• The target variable was not-balanced. Oversampling technique was used to
balance the classes.
8. Conversion of target variable percentile
into three categories.
In the histogram below we can see than the target column has a peak at 0, which is represents no
popularity of the tracks, so, it is assigned an independent class of the variable because it will impact
the accuracy of the model. The new classes are ‘zero popularity’, ‘low popularity’, ‘high popularity’
9. Algorithms report
With the different algorithms, the accuracy
is not fluctuation much, represent the
stability in the prediction.
Highest Accuracy = 85.94
Lowest Accuracy = 79.7
Algorithm wise accuracy:
• Random Forest Classifier = 85.94
• Decision Tree Classifier = 79.7
• Cat Boost Classifier = 80.3
• XG Boost Classifier = 82.28
10. Key Findings
• There were 20 independent variables present in the data but only 8 variables were
affecting the popularity of the song.
• Valence, danceability and energy are affecting almost 50% to the popularity.
• Song Genre is one of the most important factor when comes to individual's preference or
taste of music, that recommendation engine considers. The most popular genre is
Country-Specific which consist of Country Wise language songs, indicates people love
mother tongue when it comes to songs. Apart from that most popular genre is EDM
(Electronic Dance Music) because high valence, danceability and energy.
• Medium tempo is 2x popular than any other tempo range which is between 100-140
bpm. This tempo is used in EDM, Rock and Pop music, are the most popular genres.
11. The importance of each column related
to the popularity
The figure in the left shows how much each feature
is affecting the target column.
Valence + danceability + energy
16.7% + 16.0% + 15.5% = 48.2%
First 8 columns or all the 20 columns is giving the
same accuracy.
12. Conclusion
The overall dataset was little complicated because of the difficulty of establishing the
relationship between the target variable and the independent variables. However, with
some cleaning and feature engineering, the final model was stable with high accuracy.
The most difficult differentiation was that, the popularity was getting affecting by the
release date of the song and the release date was not available in the data, so it seemed
like the case of Endogeneity. Nonetheless, After separating the target variable, it got
sorted.
As a Data Scientist, I can conclude that this trained model with the following dataset is
predicting accurately and is ready for deployment in the Spotify recommendation Engine,
to predict the right popularity in future recommending the right tracks to the listeners .