SlideShare a Scribd company logo
1 of 22
BOOK RECOMMENDATIONS SYSTEM
CONTENTS
• PROJECTARCHITECTURE
• INTRODUCTION TO RECOMMENDATION SYSTEM
• DATA SET DETAILS
• DATA PREPROCESSING AND EDA
• VISUALIZATION
• DETAILS ABOUT RECOMMENDATION TECHNIQUES
• MODEL SELECTION
• DEPLOYMENT
PROBLEM STATEMENT
During the last few decades, with the rise of YouTube, Amazon, Netflix, and many other such web
services, recommender systems have become much more important in our lives in terms of
providing highly personalized and relevant content. The main objective is to create a
recommendation system to recommend relevant books to users based on popularity and user
interests.
In today's digital age, people have access to a vast amount of information and entertainment. Books,
being a significant part of this content, come in various genres, styles, and languages. However, the
abundance of choices can often be overwhelming for readers. A personalized book recommendation
system can help users discover new and relevant books based on their preferences, reading history,
and interests.
ABSTRACT OF THE PROJECT
• This project focuses on developing an intelligent book recommendation system utilizing machine learning
techniques.
• By analyzing diverse book attributes and user preferences, the system accurately predicts personalized
recommendations.
• Through data preprocessing, feature extraction, and representation methods, the system employs
collaborative and content-based filtering algorithms.
• Additional techniques address the cold-start problem and integrate user feedback for continuous
improvement.
• Evaluation metrics demonstrate the system's effectiveness in generating high-quality and diverse book
recommendations.
• This study offers valuable insights for researchers and practitioners, revolutionizing readers' exploration of
new literary works.
DATA PREPROCESSING
 In Books Dataset Checking of null values and missing data. Removal of two columns of
small image URL and large image URL. Changing column names for easy recognition.
In the publisher column missing value with others. In the Year Of Publication column,
we have two object data DK Publishing Inc replaced this with 2000 and Gallimard
replace it with 2003.
 In Users Dataset In the Users dataset in the Age column we find unique values and
with that, we calculate the mean age. In the Location columns, we have combined
information about the city, state, and country we split this Information into three different
columns.
 In Rating Dataset In this data set we check Book-rating and User-Id are columns that
are numerical type In the ISBN column we remove extra characters.
PROJECTARCHITECTURE
Datasets
• Import Libraries
• Load Data-sets
Data Cleaning
• Missing Value Treatment
• Checking duplicates
Data Preprocessing / EDA
• Rename column names
• Construct an extra column for the location
• Changes in the inappropriate blanks values in columns
Data Visualization
• Numerical Data Visualization
• Barh Chart, Pie-Chart, Bar Graph, Histogram
• Outliers Detection through Boxplot
Model Selection
Model Deployment
2.INTRODUCTIONTO RECOMMENDATION
 Recommendation systems involve
predicting user preferences for unseen items.
 Recommendation systems have
become very popular with the
increasing availability of millions of
products online
 Recommending relevant products
increases the customer’s interest and sales of the
company.
 Examples:-
 Facebook-” People You May Know”
 Netflix-” Other Movies You May Enjoy”
 Amazon-” Customers Who brought this item
also brought…”
3.DATASETSDETAILS
Book Dataset
• No. of rows:-271360
• No. of columns:-8
Users Dataset
• No. of rows:-278858
• No. of columns:-3
Rating Dataset
• No. of rows:-1149780
• No. of columns:-3
4.DATAPREPROCESSING
In Books Dataset
 Checking of null values and
missing data.
 Removal of two columns of
small image URL and large
image URL.
 Changing column names for
easy recognition.
 In the publisher column
missing value with others.
 In the Year Of Publication
column we have two object
data DK Publishing Inc
replaced this with 2000 and
Gallimard replace it with
2003.
In Users Dataset
 In the Users dataset in the
Age column we find unique
values and with that, we
calculate the mean age.
 In the Location columns, we
have combined information
about the city, state, and
country we split this
information into three different
columns.
In Rating Dataset
 In this data set we check
Book-rating and User-Id are
columns that are numerical
type.
 In the ISBN column we
remove extra characters.
FINALDATASETDETAILS
Datasets
• Final Dataset
• After merging
of all three
preprocessed
datasets
Rows And
Columns
• 50815 Rows
• 8 Columns
No. Of Unique
User-ID
• 95513
Null Values
• Total null
values of all
3 datasets
are6
Data Types
• Int(32) 1
columns
• Object 8
columns
After applying preprocessing on all three datasets. We merge all and made the final
data set for visualization and model building.
5.VISUALIZATION
 Outliers
 There are many outliers in age columns
 Outliers are treated with mean values.
 Graphical representation of top
10 books
 In Histogram represents
the Year Of Publication
• From 1990 to 2005 we saw
there are many publishers.
• Top7 PublishersWiththeMost
Books
Top 7 Countries With the Most
Users
Divide the complete dataset based on implicit
and explicit ratings.
• In the Explicit dataset we get a rating above zero
• In the Implicit dataset we get a rating of zero.
• So, we select the Explicit dataset.
• In Explicit rating we find that more people rated
above 6 and most of the people rated 8.
 Top 15 Highest Reader From Countries
 Top 20 Publisher With The Most Book
 Below Histogram
Represents the age of
users.
• Aged between 30 to 40
most users read books.
6. RECOMMENDATIONTECHNIQUES
6.1 Popularity-Based Recommendation System :-
It is a type of recommendation system which works on the principle of popularity and or anything which is
in trend. These systems check the product, movies, or books that are in trend or are most popular among
the users and directly recommend those.
 Advantage of popularity-based recommendation system
 There is no need for the user’s historical data.
 Disadvantage popularity-based recommendation system
 The system would recommend the same sort of products/books which are solely based on popularity to every other
user.
POPULARITY-BASEDRECOMMENDATIONSYSTEMDATAFRAME
6.2 CONTENT-BASED FILTERING:-
A content-based recommender works with data that the user provides, either explicitly
(rating) or implicitly (clicking on a link). Based on that data, a user profile is generated,
which is then used to make suggestions to the user.
 Advantage of content-based recommendation system
 Able to recommend users with unique tastes.
 Can explain the recommendation.
 Disadvantage of content-based recommendation system
 Data should be in a structured format.
 Unable to use quality judgments from other users.
CONTENT-BASE FILTERING RESULT
6.3COLLABORATIVEFILTERING:-
Collaborative filtering is used by most recommendation systems to find similar patterns or information of
the users, this technique can filter out items that users like on the basis of the ratings or reactions by
similar users.
 Advantages of collaborative filtering
 Other user scores are used.
 No deterministic result since chance is involved in the system.
 Disadvantages of collaborative filtering
 Needs more data.
 Problems with new users and new products.
RESULTOFCOLLABORATIVEFILTERING
CHALLENGESINPROJECT
 In Start, we face difficulty with the dataset we have three datasets in that multiple columns are interlinked
with each other. In that pre-processed data and finds the relationship between variables.
 EDA is an interesting part but the selection of variables and making more effective visualization is quite a tough task
 A most difficult task for the team to build an accurate model, we made 5-6 models and selected only three that show accurate
recommendations.
 In deployment, we learned streamlit and HTML for making a good interface. It takes time and continuous
discussion in the team and we did make a great app page.
REFERENCES
THANK YOU

More Related Content

Similar to BOOK RECALL TYPPPPPPBBBBBBBBBHHHHHHHHHHHHY

classIX_DS_Teacher_Presentation.pptx
classIX_DS_Teacher_Presentation.pptxclassIX_DS_Teacher_Presentation.pptx
classIX_DS_Teacher_Presentation.pptxXICSStudents
 
Putting "Value" in Evaluation: Building Relevant, Dynamic Statistical Analysis
Putting "Value" in Evaluation: Building Relevant, Dynamic Statistical AnalysisPutting "Value" in Evaluation: Building Relevant, Dynamic Statistical Analysis
Putting "Value" in Evaluation: Building Relevant, Dynamic Statistical AnalysisJoshua Johnson
 
IRJET- Hybrid Book Recommendation System
IRJET- Hybrid Book Recommendation SystemIRJET- Hybrid Book Recommendation System
IRJET- Hybrid Book Recommendation SystemIRJET Journal
 
The Research specifically DataAnalysis.pptx
The Research specifically DataAnalysis.pptxThe Research specifically DataAnalysis.pptx
The Research specifically DataAnalysis.pptxCasylouMendozaBorqui
 
Webinar - Harness the Power of Data with Tableau - 2016-02-18
Webinar - Harness the Power of Data with Tableau - 2016-02-18Webinar - Harness the Power of Data with Tableau - 2016-02-18
Webinar - Harness the Power of Data with Tableau - 2016-02-18TechSoup
 
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AIData Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AIData Con LA
 
The Economics of Recommender Systems
The Economics of Recommender SystemsThe Economics of Recommender Systems
The Economics of Recommender SystemsKonstantin Savenkov
 
Statistical Learning - Introduction.pptx
Statistical Learning - Introduction.pptxStatistical Learning - Introduction.pptx
Statistical Learning - Introduction.pptxJayaprakashGururaj
 
Mod5_Recommendation Systems.pptx
Mod5_Recommendation Systems.pptxMod5_Recommendation Systems.pptx
Mod5_Recommendation Systems.pptxdivyammo
 
RecSys 2015 posters
RecSys 2015 postersRecSys 2015 posters
RecSys 2015 postersJMoralesBSU
 
Advertising using big data
Advertising using big dataAdvertising using big data
Advertising using big dataRajathMk1
 

Similar to BOOK RECALL TYPPPPPPBBBBBBBBBHHHHHHHHHHHHY (20)

classIX_DS_Teacher_Presentation.pptx
classIX_DS_Teacher_Presentation.pptxclassIX_DS_Teacher_Presentation.pptx
classIX_DS_Teacher_Presentation.pptx
 
Putting "Value" in Evaluation: Building Relevant, Dynamic Statistical Analysis
Putting "Value" in Evaluation: Building Relevant, Dynamic Statistical AnalysisPutting "Value" in Evaluation: Building Relevant, Dynamic Statistical Analysis
Putting "Value" in Evaluation: Building Relevant, Dynamic Statistical Analysis
 
IRJET- Hybrid Book Recommendation System
IRJET- Hybrid Book Recommendation SystemIRJET- Hybrid Book Recommendation System
IRJET- Hybrid Book Recommendation System
 
unit 1.pptx
unit 1.pptxunit 1.pptx
unit 1.pptx
 
Isabelle Reiss - Interoperability, visibility, credibility
Isabelle Reiss - Interoperability, visibility, credibilityIsabelle Reiss - Interoperability, visibility, credibility
Isabelle Reiss - Interoperability, visibility, credibility
 
Data Analysis
Data AnalysisData Analysis
Data Analysis
 
Data mining
Data miningData mining
Data mining
 
Data mining
Data miningData mining
Data mining
 
The Research specifically DataAnalysis.pptx
The Research specifically DataAnalysis.pptxThe Research specifically DataAnalysis.pptx
The Research specifically DataAnalysis.pptx
 
Visualization ppt
Visualization pptVisualization ppt
Visualization ppt
 
Webinar - Harness the Power of Data with Tableau - 2016-02-18
Webinar - Harness the Power of Data with Tableau - 2016-02-18Webinar - Harness the Power of Data with Tableau - 2016-02-18
Webinar - Harness the Power of Data with Tableau - 2016-02-18
 
Data analysis
Data analysisData analysis
Data analysis
 
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AIData Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
 
The Economics of Recommender Systems
The Economics of Recommender SystemsThe Economics of Recommender Systems
The Economics of Recommender Systems
 
Statistical Learning - Introduction.pptx
Statistical Learning - Introduction.pptxStatistical Learning - Introduction.pptx
Statistical Learning - Introduction.pptx
 
Mod5_Recommendation Systems.pptx
Mod5_Recommendation Systems.pptxMod5_Recommendation Systems.pptx
Mod5_Recommendation Systems.pptx
 
RecSys 2015 posters
RecSys 2015 postersRecSys 2015 posters
RecSys 2015 posters
 
Data Analysis, Intepretation
Data Analysis, IntepretationData Analysis, Intepretation
Data Analysis, Intepretation
 
Advertising using big data
Advertising using big dataAdvertising using big data
Advertising using big data
 
Qs1 group a
Qs1 group a Qs1 group a
Qs1 group a
 

Recently uploaded

Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 

Recently uploaded (20)

Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Decoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in ActionDecoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in Action
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 

BOOK RECALL TYPPPPPPBBBBBBBBBHHHHHHHHHHHHY

  • 2. CONTENTS • PROJECTARCHITECTURE • INTRODUCTION TO RECOMMENDATION SYSTEM • DATA SET DETAILS • DATA PREPROCESSING AND EDA • VISUALIZATION • DETAILS ABOUT RECOMMENDATION TECHNIQUES • MODEL SELECTION • DEPLOYMENT
  • 3. PROBLEM STATEMENT During the last few decades, with the rise of YouTube, Amazon, Netflix, and many other such web services, recommender systems have become much more important in our lives in terms of providing highly personalized and relevant content. The main objective is to create a recommendation system to recommend relevant books to users based on popularity and user interests. In today's digital age, people have access to a vast amount of information and entertainment. Books, being a significant part of this content, come in various genres, styles, and languages. However, the abundance of choices can often be overwhelming for readers. A personalized book recommendation system can help users discover new and relevant books based on their preferences, reading history, and interests.
  • 4. ABSTRACT OF THE PROJECT • This project focuses on developing an intelligent book recommendation system utilizing machine learning techniques. • By analyzing diverse book attributes and user preferences, the system accurately predicts personalized recommendations. • Through data preprocessing, feature extraction, and representation methods, the system employs collaborative and content-based filtering algorithms. • Additional techniques address the cold-start problem and integrate user feedback for continuous improvement. • Evaluation metrics demonstrate the system's effectiveness in generating high-quality and diverse book recommendations. • This study offers valuable insights for researchers and practitioners, revolutionizing readers' exploration of new literary works.
  • 5. DATA PREPROCESSING  In Books Dataset Checking of null values and missing data. Removal of two columns of small image URL and large image URL. Changing column names for easy recognition. In the publisher column missing value with others. In the Year Of Publication column, we have two object data DK Publishing Inc replaced this with 2000 and Gallimard replace it with 2003.  In Users Dataset In the Users dataset in the Age column we find unique values and with that, we calculate the mean age. In the Location columns, we have combined information about the city, state, and country we split this Information into three different columns.  In Rating Dataset In this data set we check Book-rating and User-Id are columns that are numerical type In the ISBN column we remove extra characters.
  • 6. PROJECTARCHITECTURE Datasets • Import Libraries • Load Data-sets Data Cleaning • Missing Value Treatment • Checking duplicates Data Preprocessing / EDA • Rename column names • Construct an extra column for the location • Changes in the inappropriate blanks values in columns Data Visualization • Numerical Data Visualization • Barh Chart, Pie-Chart, Bar Graph, Histogram • Outliers Detection through Boxplot Model Selection Model Deployment
  • 7. 2.INTRODUCTIONTO RECOMMENDATION  Recommendation systems involve predicting user preferences for unseen items.  Recommendation systems have become very popular with the increasing availability of millions of products online  Recommending relevant products increases the customer’s interest and sales of the company.  Examples:-  Facebook-” People You May Know”  Netflix-” Other Movies You May Enjoy”  Amazon-” Customers Who brought this item also brought…”
  • 8. 3.DATASETSDETAILS Book Dataset • No. of rows:-271360 • No. of columns:-8 Users Dataset • No. of rows:-278858 • No. of columns:-3 Rating Dataset • No. of rows:-1149780 • No. of columns:-3
  • 9. 4.DATAPREPROCESSING In Books Dataset  Checking of null values and missing data.  Removal of two columns of small image URL and large image URL.  Changing column names for easy recognition.  In the publisher column missing value with others.  In the Year Of Publication column we have two object data DK Publishing Inc replaced this with 2000 and Gallimard replace it with 2003. In Users Dataset  In the Users dataset in the Age column we find unique values and with that, we calculate the mean age.  In the Location columns, we have combined information about the city, state, and country we split this information into three different columns. In Rating Dataset  In this data set we check Book-rating and User-Id are columns that are numerical type.  In the ISBN column we remove extra characters.
  • 10. FINALDATASETDETAILS Datasets • Final Dataset • After merging of all three preprocessed datasets Rows And Columns • 50815 Rows • 8 Columns No. Of Unique User-ID • 95513 Null Values • Total null values of all 3 datasets are6 Data Types • Int(32) 1 columns • Object 8 columns After applying preprocessing on all three datasets. We merge all and made the final data set for visualization and model building.
  • 11. 5.VISUALIZATION  Outliers  There are many outliers in age columns  Outliers are treated with mean values.  Graphical representation of top 10 books  In Histogram represents the Year Of Publication • From 1990 to 2005 we saw there are many publishers.
  • 12. • Top7 PublishersWiththeMost Books Top 7 Countries With the Most Users Divide the complete dataset based on implicit and explicit ratings. • In the Explicit dataset we get a rating above zero • In the Implicit dataset we get a rating of zero. • So, we select the Explicit dataset. • In Explicit rating we find that more people rated above 6 and most of the people rated 8.
  • 13.  Top 15 Highest Reader From Countries  Top 20 Publisher With The Most Book  Below Histogram Represents the age of users. • Aged between 30 to 40 most users read books.
  • 14. 6. RECOMMENDATIONTECHNIQUES 6.1 Popularity-Based Recommendation System :- It is a type of recommendation system which works on the principle of popularity and or anything which is in trend. These systems check the product, movies, or books that are in trend or are most popular among the users and directly recommend those.  Advantage of popularity-based recommendation system  There is no need for the user’s historical data.  Disadvantage popularity-based recommendation system  The system would recommend the same sort of products/books which are solely based on popularity to every other user.
  • 16. 6.2 CONTENT-BASED FILTERING:- A content-based recommender works with data that the user provides, either explicitly (rating) or implicitly (clicking on a link). Based on that data, a user profile is generated, which is then used to make suggestions to the user.  Advantage of content-based recommendation system  Able to recommend users with unique tastes.  Can explain the recommendation.  Disadvantage of content-based recommendation system  Data should be in a structured format.  Unable to use quality judgments from other users.
  • 18. 6.3COLLABORATIVEFILTERING:- Collaborative filtering is used by most recommendation systems to find similar patterns or information of the users, this technique can filter out items that users like on the basis of the ratings or reactions by similar users.  Advantages of collaborative filtering  Other user scores are used.  No deterministic result since chance is involved in the system.  Disadvantages of collaborative filtering  Needs more data.  Problems with new users and new products.
  • 20. CHALLENGESINPROJECT  In Start, we face difficulty with the dataset we have three datasets in that multiple columns are interlinked with each other. In that pre-processed data and finds the relationship between variables.  EDA is an interesting part but the selection of variables and making more effective visualization is quite a tough task  A most difficult task for the team to build an accurate model, we made 5-6 models and selected only three that show accurate recommendations.  In deployment, we learned streamlit and HTML for making a good interface. It takes time and continuous discussion in the team and we did make a great app page.