2. PROJECT DESCRIPTION
• Project mainly focused on analyzing random 1650 YouTube videos for
their comments
• Performing sentiment analysis on positive and negative comments to
determine the most popular words used in those comments
• Determining the most popular emojis in comments section
3. PACKAGES USED FOR ANALYSIS
• The analysis was done using following packages in python:
1. Pandas – For data manipulation and exploratory data analysis (EDA)
2. Plotly – For data visualization
3. Regular expression – For cleaning the noise in the data
4. TextBlob – For sentiment analysis (Determining the polarity of comments)
5. WordCloud – For determining the most frequent word in comments
6. Emoji – For separating the emojis from the comments
7. Plotly – To determine the most popular emojis in comments
5. DATA LOADING
• The data was collected from Kaggle
• The pandas library was used to load the data and it was stored
in variable called data
6. SENTIMENT ANALYSIS
• The comments column comment_text was used from data
• This column was divided into two comments based on the polarity of comments
1. Positive comments – Comments with +1 polarity
2. Negative comments – Comments with -1 polarity