SlideShare a Scribd company logo
CONTENT
▪ INTRODUCTION
▪ OBJECTIVE
▪ DATA DESCRIPTION
▪ PROPOSED METHODOLOGY
▪ MODEL DESCRIPTION
▫ ARCHITECTURAL MODEL
▫ CART
▫ RANDOM FOREST TREE
▫ KNN CLASSIFIERS
▪ ARCHITECTURAL MODEL
▫ ALGORITHM COMPUTATION
▪ APPLICATION
▪ REFRENCES
2
INTRODUCTION
• About wine:
• Wine is a beverage made from fermented grape and other fruit juices with lower amount
of alcohol content.
• Quality of wine is graded based on the taste of wine and vintage. This process is time
taking, costly and not efficient.
• A wine itself includes different parameters like fixed acidity, volatile acidity, citric acid,
residual sugar, chlorides, free sulphur dioxide, total sulphur dioxide, density, pH,
sulphates, alcohol and quality.
• Problem statement:
• In industries, understanding the demands of wine safety testing can be a complex task
for the laboratory with numerous analytes and residues to monitor.
• But, our application’s prediction, provide ideal solutions for the analysis of wine, which
will make this whole process efficient and cheaper with less human interaction.
3
OBJECTIVE
• Our main objective is to predict the wine quality using machine learning through Python
programming language
• A large dataset is considered and wine quality is modelled to analyse the quality of wine
through different parameters like fixed acidity, volatile acidity etc.
• All these parameters will be analysed through Machine Learning algorithms like random forest
classifier algorithm which will helps to rate the wine on scale 1 - 10 or bad - good.
• Output obtained would further be checked for correctness and model will be optimized
accordingly.
• It can support the wine expert evaluations and ultimately improve the production.
4
DATA DESCRIPTION
5
• The dataset contains chemical descriptions of 6499
Portuguese “Vinho Verde” wines.
• There are 4899 entries for white wine, and 1600 entries for
red wines.
• The source of the data is taken from the UCI Machine
Learning Repository, provided by Paulo Cortez, from the
University of Minho, Portugal.
5
DATA
DESCRIPTION
6
Attributes Description
pH To measure ripeness
Density Density in gram per cm3
Alcohol Volume of alcohol in %
Fixed Acidity Impart sourness and resist microbial infection, measured in no. of
grams of tartaric acid per dm3
Volatile Acidity no. of grams of acetic acid per dm3 of wine
Citric Acid no. of grams of citric acid per dm3 of wine
Residual Sugar Remaining sugar after fermentation stops
Chlorides no. of grams of sodium chloride per dm3 of wine
Free Sulfur
dioxide
no. of grams of free sulphites per dm3 of wine
Total Sulfur
dioxide
no. of grams of total sulfite (free sulphite+ bound)
Sulphates no. of grams of potassium sulphate per dm3 of wine
Quality Target variable, 1-10 value6
CONCLUSION OF DATA ANALYSIS
FIRST
The two most important features among
all 12 attributes are Sulphur dioxide (both
free and total) and Alcohol.
LAST
Volatile acidity contributes to
acidic tastes and have negative
correlation to wine quality.
SECOND
The most important factor to decide the quality of wine is alcohol,
higher concentration of alcohol leads to better quality of wine and
lower density of wine.
7
PROPOSED METHODOLOGY
▪ It gives insights of the dependency of target variables on independent variables using
machine learning techniques to determine the quality of wine because it gives the best
outcome for the assurance of quality of wine.
▪ The dependent variable is “quality rating” whereas other variables i.e. alcohol,
sulphur etc. are assumed to be predictors or independent variables.
▪ While hindering the effectiveness of the data model, various types of errors have
occurred like over fitting, introduced from having too large of a training set and bias
occur due to too small of a test set.
8
MODEL DESCRIPTION
9
“
10
CLASSIFICATION AND REGRESSION TREE
(CART)
▪ CART is a decision tree used for analysing both datasets (red and white wine).
▪ The decision trees produced by CART are always binary, containing exactly two
branches for each decision node.
▪ The CART algorithm grows the tree by conducting for each decision node, an
exhaustive search of all available variables. All possible splitting values, selecting the
optimal split
11
RANDOM FOREST
▪ Random forest is a method of classification, regression and other tasks, that operate
by constructing a multitude of decision trees at training time and outputting the class
that is the mode of the classes (classification) or mean prediction (regression) of the
individual trees.
▪ Following are some of the features of random forest algorithm:
1. It runs efficiently on large databases.
2. It gives estimates of what variables are important in the classification.
3. It generates an internal unbiased estimate of generalization error as the forest
building progresses occur.
12
“
13
K-Nearest-Neighbourhood Classifiers
▪ This classifier technique is depended on learning by analogy; this means a comparison
between a test tuple with similar training tuples.
▪ The training tuples are described by n attributes. Each tuple corresponds a point in an n-
dimensional space. All the training tuples are stocked in an n-dimensional pattern space. For
an unknown tuple, a k-nearest-neighbourhood classifier searches the pattern space for the k
training tuples that are closest to the unknown tuple. k training tuples are called as the k
“nearest neighbours” of the unknown tuple.
▪ “Closeness” is a metric distance, likewise Euclidean distance between two points or tuples,
say, 𝑋1 = (𝑥11, 𝑥12……….. 𝑥1𝑛) and 𝑋2 = (𝑥21, 𝑥22……….. 𝑥2𝑛), is:
dist (X1, X2) = 𝑖−1
𝑛
𝑥1𝑖 − 𝑥2𝑖
2
14
ARCHITECTURE MODEL
A flow chart that briefly describes processing of our
software application.
15
16
16
“
Enter all 12 data elements of newly developed wine.
• Entered wine data is further divided into two sets (white wine datasets or red wine datasets) based on alcohol value.
• Based on respective datasets a binary tree is formed ( using random forests algorithm)
• Random forest algorithm
• recode(quality([3,4,5]=0,[6,7,8]=1))
• RandomForestClassifier(estimator=25)
• return prediction, confusion matrix, accuracy
• Total value is evaluated which is root value (using KNN)
• knn
• classify(X,Y,x)
• for i=1 to m do
• compute distance d(X,x)
• compute set I containing indices for the k smallestdistance d(X,x)
• cluster(X)
• return majority label for {Y,where i E I}
• Evaluated root value is actually our required parameter value i.e. depended variable or quality of wine (on scale of 1 to 10).
• Later on this estimated value is compared with datasets value and our algorithm is trained our defined datasets so as to for
correctness and model will be optimized accordingly.
ALGORITHM COMPUTATION
17
APPLICATION
▪ Results will be used by the wine
manufacturers to improve the quality of the
future wines.
▪ Certification bodies can also use the result
for quality control.
▪ Results can be used to make wine selection
guides for wine magazines.
▪ Results can be used by consumers for wine
selection
18
REFRENCES
▪ [1] Y. Er, “The Classification of White Wine and Red Wine According to Their Physicochemical
Qualities,” Int. J. Intell. Syst. Appl. Eng., vol. 4, no. SpecialIssue-1, pp. 23–26, 2016.
▪ [2] E. Summary, W. P. Monitoring, W. Quality, W. Safety, and W. Complexity, “Wine Analysis :
from ‘Grape to Glass’ An analytical testing digest of the wine manufacturing process,” 2016.
▪ [3] A. Ghosh, “Project Report : -Red Wine Quality Analysis Final 3 . An empirical Red Wine
Quality Analysis of the Portuguese ‘ Vinho Verde ’ wine,” no. December 2017, 2018.
▪ [4] Y. Gupta, “Selection of important features and predicting wine quality using machine
learning techniques,” Procedia Comput. Sci., vol. 125, pp. 305–312, 2018.
▪ [5] P. Model, L. Regression, and R. Studio, “Building and Evaluating a Predictive Model w/
Linear Regression in RapidMiner Studio,” 2018.
▪ [6] B. Tajini and O. C. Paris, “Badr Tajini – On Campus Paris – DSTI 2017,” vol. 47, no. 4, pp.
547–553, 2017.
19
19
20 20

More Related Content

What's hot

Introduction to Neural Networks with Python
Introduction to Neural Networks with PythonIntroduction to Neural Networks with Python
Introduction to Neural Networks with Python
dataHacker. rs
 
Housing price prediction
Housing price predictionHousing price prediction
Housing price prediction
Abhimanyu Dwivedi
 
WATER QUALITY PREDICTION
WATER QUALITY PREDICTIONWATER QUALITY PREDICTION
WATER QUALITY PREDICTION
Fasil47
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
Makrand Patil
 
Driver Drowsiness Detection Review
Driver Drowsiness Detection ReviewDriver Drowsiness Detection Review
Driver Drowsiness Detection Review
Asaad Waqar
 
House price prediction
House price predictionHouse price prediction
House price prediction
AdityaKumar1505
 
Random forest algorithm
Random forest algorithmRandom forest algorithm
Random forest algorithm
Rashid Ansari
 
Data science presentation
Data science presentationData science presentation
Data science presentation
MSDEVMTL
 
Feature Engineering in Machine Learning
Feature Engineering in Machine LearningFeature Engineering in Machine Learning
Feature Engineering in Machine Learning
Knoldus Inc.
 
House Price Prediction An AI Approach.
House Price Prediction An AI Approach.House Price Prediction An AI Approach.
House Price Prediction An AI Approach.
Nahian Ahmed
 
Machine Learning Project
Machine Learning ProjectMachine Learning Project
Machine Learning ProjectAbhishek Singh
 
Neural Network Architectures
Neural Network ArchitecturesNeural Network Architectures
Neural Network Architectures
Martin Ockajak
 
Driver drowsiness monitoring system using visual behavior and Machine Learning.
Driver drowsiness monitoring system using visual behavior and Machine Learning.Driver drowsiness monitoring system using visual behavior and Machine Learning.
Driver drowsiness monitoring system using visual behavior and Machine Learning.
AasimAhmedKhanJawaad
 
Machine learning ppt
Machine learning pptMachine learning ppt
Machine learning ppt
Rajat Sharma
 
Big Data Science - hype?
Big Data Science - hype?Big Data Science - hype?
Big Data Science - hype?
BalaBit
 
Presentation on Sentiment Analysis
Presentation on Sentiment AnalysisPresentation on Sentiment Analysis
Presentation on Sentiment Analysis
Rebecca Williams
 
Data science
Data scienceData science
Data science
SwapnilDahake2
 
Attribute oriented analysis
Attribute oriented analysisAttribute oriented analysis
Attribute oriented analysis
Hirra Sultan
 
Training Neural Networks
Training Neural NetworksTraining Neural Networks
Training Neural Networks
Databricks
 

What's hot (20)

Introduction to Neural Networks with Python
Introduction to Neural Networks with PythonIntroduction to Neural Networks with Python
Introduction to Neural Networks with Python
 
Housing price prediction
Housing price predictionHousing price prediction
Housing price prediction
 
WATER QUALITY PREDICTION
WATER QUALITY PREDICTIONWATER QUALITY PREDICTION
WATER QUALITY PREDICTION
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
 
Driver Drowsiness Detection Review
Driver Drowsiness Detection ReviewDriver Drowsiness Detection Review
Driver Drowsiness Detection Review
 
Decision tree
Decision treeDecision tree
Decision tree
 
House price prediction
House price predictionHouse price prediction
House price prediction
 
Random forest algorithm
Random forest algorithmRandom forest algorithm
Random forest algorithm
 
Data science presentation
Data science presentationData science presentation
Data science presentation
 
Feature Engineering in Machine Learning
Feature Engineering in Machine LearningFeature Engineering in Machine Learning
Feature Engineering in Machine Learning
 
House Price Prediction An AI Approach.
House Price Prediction An AI Approach.House Price Prediction An AI Approach.
House Price Prediction An AI Approach.
 
Machine Learning Project
Machine Learning ProjectMachine Learning Project
Machine Learning Project
 
Neural Network Architectures
Neural Network ArchitecturesNeural Network Architectures
Neural Network Architectures
 
Driver drowsiness monitoring system using visual behavior and Machine Learning.
Driver drowsiness monitoring system using visual behavior and Machine Learning.Driver drowsiness monitoring system using visual behavior and Machine Learning.
Driver drowsiness monitoring system using visual behavior and Machine Learning.
 
Machine learning ppt
Machine learning pptMachine learning ppt
Machine learning ppt
 
Big Data Science - hype?
Big Data Science - hype?Big Data Science - hype?
Big Data Science - hype?
 
Presentation on Sentiment Analysis
Presentation on Sentiment AnalysisPresentation on Sentiment Analysis
Presentation on Sentiment Analysis
 
Data science
Data scienceData science
Data science
 
Attribute oriented analysis
Attribute oriented analysisAttribute oriented analysis
Attribute oriented analysis
 
Training Neural Networks
Training Neural NetworksTraining Neural Networks
Training Neural Networks
 

Similar to Wine Quality Analysis Using Machine Learning

Assignment - 03Model Building, Selection, & Prediction.docx
Assignment - 03Model Building, Selection, & Prediction.docxAssignment - 03Model Building, Selection, & Prediction.docx
Assignment - 03Model Building, Selection, & Prediction.docx
jane3dyson92312
 
Assignment - 03Model Building, Selection, & Prediction.docx
Assignment - 03Model Building, Selection, & Prediction.docxAssignment - 03Model Building, Selection, & Prediction.docx
Assignment - 03Model Building, Selection, & Prediction.docx
festockton
 
Wine Taste Preference Modeling Based On Physicochemical Tests_ShuaiWei
Wine Taste Preference Modeling Based On Physicochemical Tests_ShuaiWeiWine Taste Preference Modeling Based On Physicochemical Tests_ShuaiWei
Wine Taste Preference Modeling Based On Physicochemical Tests_ShuaiWeiShuai Wei
 
pdf.pdf
pdf.pdfpdf.pdf
pdf.pdf
mohan raj
 
Using Adaboost Algorithm to Enhance the Prediction Accuracy of Decision Trees
Using Adaboost Algorithm to Enhance the Prediction Accuracy of Decision TreesUsing Adaboost Algorithm to Enhance the Prediction Accuracy of Decision Trees
Using Adaboost Algorithm to Enhance the Prediction Accuracy of Decision Trees
Mohammed Al Hamadi
 
21 Cost efficient QM in microbreweries agk
21 Cost efficient QM in microbreweries agk21 Cost efficient QM in microbreweries agk
21 Cost efficient QM in microbreweries agkAxel Kristiansen
 
Presentation of CDR WineLab®, Wine Analysis System
Presentation of CDR WineLab®, Wine Analysis SystemPresentation of CDR WineLab®, Wine Analysis System
Presentation of CDR WineLab®, Wine Analysis System
CDR S.r.l.
 
WBC 2012 Poster P-86 Optical vs. D and SV
WBC 2012 Poster P-86 Optical vs. D and SVWBC 2012 Poster P-86 Optical vs. D and SV
WBC 2012 Poster P-86 Optical vs. D and SVDaniel Gore
 
Quality Control of RNA Samples - For Gene-Expression Results you Can Rely on
Quality Control of RNA Samples - For Gene-Expression Results you Can Rely onQuality Control of RNA Samples - For Gene-Expression Results you Can Rely on
Quality Control of RNA Samples - For Gene-Expression Results you Can Rely on
QIAGEN
 
Determination of Wine Color and Total Phenol Content using the LAMBDA PDA UV/...
Determination of Wine Color and Total Phenol Content using the LAMBDA PDA UV/...Determination of Wine Color and Total Phenol Content using the LAMBDA PDA UV/...
Determination of Wine Color and Total Phenol Content using the LAMBDA PDA UV/...
PerkinElmer, Inc.
 
PANIC 2014 Poster PNA-MNova - Aloe - Beer - Small Mixture qNMR
PANIC 2014 Poster  PNA-MNova - Aloe - Beer - Small Mixture qNMRPANIC 2014 Poster  PNA-MNova - Aloe - Beer - Small Mixture qNMR
PANIC 2014 Poster PNA-MNova - Aloe - Beer - Small Mixture qNMRJohn Edwards
 
Air pollution meter
Air pollution meterAir pollution meter
Air pollution meter
AyuenNyiel
 
IU Applied Machine Learning Class Final Project: ML Methods for Predicting Wi...
IU Applied Machine Learning Class Final Project: ML Methods for Predicting Wi...IU Applied Machine Learning Class Final Project: ML Methods for Predicting Wi...
IU Applied Machine Learning Class Final Project: ML Methods for Predicting Wi...
James Nelson
 
SULA PPT 2022 Shubham.pptx
SULA PPT 2022 Shubham.pptxSULA PPT 2022 Shubham.pptx
SULA PPT 2022 Shubham.pptx
ShubhamShivajiShejwa
 
Kx for wine tasting
Kx for wine tastingKx for wine tasting
Kx for wine tasting
Mark Lefevre, CQF
 
Wine Data Analysis using R, SQL and TABLEAU
Wine Data Analysis using R, SQL and TABLEAUWine Data Analysis using R, SQL and TABLEAU
Wine Data Analysis using R, SQL and TABLEAU
Sarita Maharia
 
Quality Control of RNA Samples — For Gene Expression Results You Can Rely On
Quality Control of RNA Samples — For Gene Expression Results You Can Rely OnQuality Control of RNA Samples — For Gene Expression Results You Can Rely On
Quality Control of RNA Samples — For Gene Expression Results You Can Rely On
QIAGEN
 

Similar to Wine Quality Analysis Using Machine Learning (20)

Assignment - 03Model Building, Selection, & Prediction.docx
Assignment - 03Model Building, Selection, & Prediction.docxAssignment - 03Model Building, Selection, & Prediction.docx
Assignment - 03Model Building, Selection, & Prediction.docx
 
Assignment - 03Model Building, Selection, & Prediction.docx
Assignment - 03Model Building, Selection, & Prediction.docxAssignment - 03Model Building, Selection, & Prediction.docx
Assignment - 03Model Building, Selection, & Prediction.docx
 
Team_Random
Team_RandomTeam_Random
Team_Random
 
Wine Taste Preference Modeling Based On Physicochemical Tests_ShuaiWei
Wine Taste Preference Modeling Based On Physicochemical Tests_ShuaiWeiWine Taste Preference Modeling Based On Physicochemical Tests_ShuaiWei
Wine Taste Preference Modeling Based On Physicochemical Tests_ShuaiWei
 
pdf.pdf
pdf.pdfpdf.pdf
pdf.pdf
 
ProjectReport
ProjectReportProjectReport
ProjectReport
 
Using Adaboost Algorithm to Enhance the Prediction Accuracy of Decision Trees
Using Adaboost Algorithm to Enhance the Prediction Accuracy of Decision TreesUsing Adaboost Algorithm to Enhance the Prediction Accuracy of Decision Trees
Using Adaboost Algorithm to Enhance the Prediction Accuracy of Decision Trees
 
21 Cost efficient QM in microbreweries agk
21 Cost efficient QM in microbreweries agk21 Cost efficient QM in microbreweries agk
21 Cost efficient QM in microbreweries agk
 
Presentation of CDR WineLab®, Wine Analysis System
Presentation of CDR WineLab®, Wine Analysis SystemPresentation of CDR WineLab®, Wine Analysis System
Presentation of CDR WineLab®, Wine Analysis System
 
WBC 2012 Poster P-86 Optical vs. D and SV
WBC 2012 Poster P-86 Optical vs. D and SVWBC 2012 Poster P-86 Optical vs. D and SV
WBC 2012 Poster P-86 Optical vs. D and SV
 
P-225_Gore
P-225_GoreP-225_Gore
P-225_Gore
 
Quality Control of RNA Samples - For Gene-Expression Results you Can Rely on
Quality Control of RNA Samples - For Gene-Expression Results you Can Rely onQuality Control of RNA Samples - For Gene-Expression Results you Can Rely on
Quality Control of RNA Samples - For Gene-Expression Results you Can Rely on
 
Determination of Wine Color and Total Phenol Content using the LAMBDA PDA UV/...
Determination of Wine Color and Total Phenol Content using the LAMBDA PDA UV/...Determination of Wine Color and Total Phenol Content using the LAMBDA PDA UV/...
Determination of Wine Color and Total Phenol Content using the LAMBDA PDA UV/...
 
PANIC 2014 Poster PNA-MNova - Aloe - Beer - Small Mixture qNMR
PANIC 2014 Poster  PNA-MNova - Aloe - Beer - Small Mixture qNMRPANIC 2014 Poster  PNA-MNova - Aloe - Beer - Small Mixture qNMR
PANIC 2014 Poster PNA-MNova - Aloe - Beer - Small Mixture qNMR
 
Air pollution meter
Air pollution meterAir pollution meter
Air pollution meter
 
IU Applied Machine Learning Class Final Project: ML Methods for Predicting Wi...
IU Applied Machine Learning Class Final Project: ML Methods for Predicting Wi...IU Applied Machine Learning Class Final Project: ML Methods for Predicting Wi...
IU Applied Machine Learning Class Final Project: ML Methods for Predicting Wi...
 
SULA PPT 2022 Shubham.pptx
SULA PPT 2022 Shubham.pptxSULA PPT 2022 Shubham.pptx
SULA PPT 2022 Shubham.pptx
 
Kx for wine tasting
Kx for wine tastingKx for wine tasting
Kx for wine tasting
 
Wine Data Analysis using R, SQL and TABLEAU
Wine Data Analysis using R, SQL and TABLEAUWine Data Analysis using R, SQL and TABLEAU
Wine Data Analysis using R, SQL and TABLEAU
 
Quality Control of RNA Samples — For Gene Expression Results You Can Rely On
Quality Control of RNA Samples — For Gene Expression Results You Can Rely OnQuality Control of RNA Samples — For Gene Expression Results You Can Rely On
Quality Control of RNA Samples — For Gene Expression Results You Can Rely On
 

Recently uploaded

NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
Amil Baba Dawood bangali
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
SupreethSP4
 
block diagram and signal flow graph representation
block diagram and signal flow graph representationblock diagram and signal flow graph representation
block diagram and signal flow graph representation
Divya Somashekar
 
DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
FluxPrime1
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Dr.Costas Sachpazis
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
JoytuBarua2
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
thanhdowork
 
English lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdfEnglish lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdf
BrazilAccount1
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
TeeVichai
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
seandesed
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
BrazilAccount1
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
Osamah Alsalih
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
obonagu
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
Kamal Acharya
 
HYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generationHYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generation
Robbie Edward Sayers
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
Kamal Acharya
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
R&R Consult
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
Vijay Dialani, PhD
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
Pipe Restoration Solutions
 
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
MLILAB
 

Recently uploaded (20)

NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
 
block diagram and signal flow graph representation
block diagram and signal flow graph representationblock diagram and signal flow graph representation
block diagram and signal flow graph representation
 
DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
 
English lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdfEnglish lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdf
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
 
HYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generationHYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generation
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
 
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptxCFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
CFD Simulation of By-pass Flow in a HRSG module by R&R Consult.pptx
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
 
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
 

Wine Quality Analysis Using Machine Learning

  • 1.
  • 2. CONTENT ▪ INTRODUCTION ▪ OBJECTIVE ▪ DATA DESCRIPTION ▪ PROPOSED METHODOLOGY ▪ MODEL DESCRIPTION ▫ ARCHITECTURAL MODEL ▫ CART ▫ RANDOM FOREST TREE ▫ KNN CLASSIFIERS ▪ ARCHITECTURAL MODEL ▫ ALGORITHM COMPUTATION ▪ APPLICATION ▪ REFRENCES 2
  • 3. INTRODUCTION • About wine: • Wine is a beverage made from fermented grape and other fruit juices with lower amount of alcohol content. • Quality of wine is graded based on the taste of wine and vintage. This process is time taking, costly and not efficient. • A wine itself includes different parameters like fixed acidity, volatile acidity, citric acid, residual sugar, chlorides, free sulphur dioxide, total sulphur dioxide, density, pH, sulphates, alcohol and quality. • Problem statement: • In industries, understanding the demands of wine safety testing can be a complex task for the laboratory with numerous analytes and residues to monitor. • But, our application’s prediction, provide ideal solutions for the analysis of wine, which will make this whole process efficient and cheaper with less human interaction. 3
  • 4. OBJECTIVE • Our main objective is to predict the wine quality using machine learning through Python programming language • A large dataset is considered and wine quality is modelled to analyse the quality of wine through different parameters like fixed acidity, volatile acidity etc. • All these parameters will be analysed through Machine Learning algorithms like random forest classifier algorithm which will helps to rate the wine on scale 1 - 10 or bad - good. • Output obtained would further be checked for correctness and model will be optimized accordingly. • It can support the wine expert evaluations and ultimately improve the production. 4
  • 5. DATA DESCRIPTION 5 • The dataset contains chemical descriptions of 6499 Portuguese “Vinho Verde” wines. • There are 4899 entries for white wine, and 1600 entries for red wines. • The source of the data is taken from the UCI Machine Learning Repository, provided by Paulo Cortez, from the University of Minho, Portugal. 5
  • 6. DATA DESCRIPTION 6 Attributes Description pH To measure ripeness Density Density in gram per cm3 Alcohol Volume of alcohol in % Fixed Acidity Impart sourness and resist microbial infection, measured in no. of grams of tartaric acid per dm3 Volatile Acidity no. of grams of acetic acid per dm3 of wine Citric Acid no. of grams of citric acid per dm3 of wine Residual Sugar Remaining sugar after fermentation stops Chlorides no. of grams of sodium chloride per dm3 of wine Free Sulfur dioxide no. of grams of free sulphites per dm3 of wine Total Sulfur dioxide no. of grams of total sulfite (free sulphite+ bound) Sulphates no. of grams of potassium sulphate per dm3 of wine Quality Target variable, 1-10 value6
  • 7. CONCLUSION OF DATA ANALYSIS FIRST The two most important features among all 12 attributes are Sulphur dioxide (both free and total) and Alcohol. LAST Volatile acidity contributes to acidic tastes and have negative correlation to wine quality. SECOND The most important factor to decide the quality of wine is alcohol, higher concentration of alcohol leads to better quality of wine and lower density of wine. 7
  • 8. PROPOSED METHODOLOGY ▪ It gives insights of the dependency of target variables on independent variables using machine learning techniques to determine the quality of wine because it gives the best outcome for the assurance of quality of wine. ▪ The dependent variable is “quality rating” whereas other variables i.e. alcohol, sulphur etc. are assumed to be predictors or independent variables. ▪ While hindering the effectiveness of the data model, various types of errors have occurred like over fitting, introduced from having too large of a training set and bias occur due to too small of a test set. 8
  • 11. CLASSIFICATION AND REGRESSION TREE (CART) ▪ CART is a decision tree used for analysing both datasets (red and white wine). ▪ The decision trees produced by CART are always binary, containing exactly two branches for each decision node. ▪ The CART algorithm grows the tree by conducting for each decision node, an exhaustive search of all available variables. All possible splitting values, selecting the optimal split 11
  • 12. RANDOM FOREST ▪ Random forest is a method of classification, regression and other tasks, that operate by constructing a multitude of decision trees at training time and outputting the class that is the mode of the classes (classification) or mean prediction (regression) of the individual trees. ▪ Following are some of the features of random forest algorithm: 1. It runs efficiently on large databases. 2. It gives estimates of what variables are important in the classification. 3. It generates an internal unbiased estimate of generalization error as the forest building progresses occur. 12
  • 14. K-Nearest-Neighbourhood Classifiers ▪ This classifier technique is depended on learning by analogy; this means a comparison between a test tuple with similar training tuples. ▪ The training tuples are described by n attributes. Each tuple corresponds a point in an n- dimensional space. All the training tuples are stocked in an n-dimensional pattern space. For an unknown tuple, a k-nearest-neighbourhood classifier searches the pattern space for the k training tuples that are closest to the unknown tuple. k training tuples are called as the k “nearest neighbours” of the unknown tuple. ▪ “Closeness” is a metric distance, likewise Euclidean distance between two points or tuples, say, 𝑋1 = (𝑥11, 𝑥12……….. 𝑥1𝑛) and 𝑋2 = (𝑥21, 𝑥22……….. 𝑥2𝑛), is: dist (X1, X2) = 𝑖−1 𝑛 𝑥1𝑖 − 𝑥2𝑖 2 14
  • 15. ARCHITECTURE MODEL A flow chart that briefly describes processing of our software application. 15
  • 16. 16 16
  • 17. “ Enter all 12 data elements of newly developed wine. • Entered wine data is further divided into two sets (white wine datasets or red wine datasets) based on alcohol value. • Based on respective datasets a binary tree is formed ( using random forests algorithm) • Random forest algorithm • recode(quality([3,4,5]=0,[6,7,8]=1)) • RandomForestClassifier(estimator=25) • return prediction, confusion matrix, accuracy • Total value is evaluated which is root value (using KNN) • knn • classify(X,Y,x) • for i=1 to m do • compute distance d(X,x) • compute set I containing indices for the k smallestdistance d(X,x) • cluster(X) • return majority label for {Y,where i E I} • Evaluated root value is actually our required parameter value i.e. depended variable or quality of wine (on scale of 1 to 10). • Later on this estimated value is compared with datasets value and our algorithm is trained our defined datasets so as to for correctness and model will be optimized accordingly. ALGORITHM COMPUTATION 17
  • 18. APPLICATION ▪ Results will be used by the wine manufacturers to improve the quality of the future wines. ▪ Certification bodies can also use the result for quality control. ▪ Results can be used to make wine selection guides for wine magazines. ▪ Results can be used by consumers for wine selection 18
  • 19. REFRENCES ▪ [1] Y. Er, “The Classification of White Wine and Red Wine According to Their Physicochemical Qualities,” Int. J. Intell. Syst. Appl. Eng., vol. 4, no. SpecialIssue-1, pp. 23–26, 2016. ▪ [2] E. Summary, W. P. Monitoring, W. Quality, W. Safety, and W. Complexity, “Wine Analysis : from ‘Grape to Glass’ An analytical testing digest of the wine manufacturing process,” 2016. ▪ [3] A. Ghosh, “Project Report : -Red Wine Quality Analysis Final 3 . An empirical Red Wine Quality Analysis of the Portuguese ‘ Vinho Verde ’ wine,” no. December 2017, 2018. ▪ [4] Y. Gupta, “Selection of important features and predicting wine quality using machine learning techniques,” Procedia Comput. Sci., vol. 125, pp. 305–312, 2018. ▪ [5] P. Model, L. Regression, and R. Studio, “Building and Evaluating a Predictive Model w/ Linear Regression in RapidMiner Studio,” 2018. ▪ [6] B. Tajini and O. C. Paris, “Badr Tajini – On Campus Paris – DSTI 2017,” vol. 47, no. 4, pp. 547–553, 2017. 19 19
  • 20. 20 20