SlideShare a Scribd company logo
1 of 13
Ensemble Learning
By – Vidhu Jha
Roll No. – 19MT10048
Introduction to Ensemble Learning
• Ensemble Learning is a combination of several machine learning
models (weak learners) in 1 problem. The intuition is that when you
comebine several weak learners, they can become strong learners.
• Each weak learner is fitted on the training set and provides
predictions obtained. The final prediction is computed by combining
the results from all the weak learners. In this manner, it usually
produces more accurate solutions than a single model would.
Basic Ensemble Learning Techniques
1. Max Voting: Prediction from each model = 1 vote. Final prediction
comes from the prediction with the most votes.
2. Averaging: Final prediction is the average of all predictions. This
goes for regression problems, example – random forest regression,
where the final result is the average of the predictions from
individual decision trees.
3. Weighted Average: The base model with higher predictive power is
more important and thus, given more weight – the sum of the
weights would equal 1.
Advanced Ensemble Learning Techniques
1. Stacking
• Here, various estimators are combined to
reduce their biases.
• Predictions from each estimator are stacked
together and used as input to a final
estimator (meta-model).
• Training of the final estimator happens via
cross-validation.
Stacking - Methodology
1.Divide the training set into K folds, for example 10.
2.Train a base model (say decision tree) on 9 folds and
make predictions on the 10th fold.
3.Repeat until you have a prediction for each fold.
4.Fit the base model on the whole training set.
5.Use the model to make predictions on the test set.
6.Repeat step 2-5 for other base models (for example
knn).
7.Use predictions from the test set as features to build a
new model (meta-model).
8. Make final predictions on the
test set using the meta model.
Advanced Ensemble Learning Techniques
2. Blending
• Uses a holdout/validation set from the training set to make
predictions, i.e., predictions are made on the holdout set only.
• The holdout set and the predictions are used to build a model which
is run on the test set.
• It is simpler than stacking and prevents leakage of information in the
model.
• However, blending uses less data, may lead to overfitting.
Blending - Methodology
1. The training set is split into training and
validation sets.
2. Models are fitted on the training set.
3. Make predictions on the validation and
training set.
4. Use validation set and its predictions to build
a final mode.
5. Make final predictions using this model
Advanced Ensemble Learning Techniques
3. Bagging
• Takes random samples of data, builds learning algorithms and uses
the mean to find bagging probabilities.
• Also known as bootstrap aggregating.
• Aggregates the results from several models in order to obtain a
generalized result.
Bagging - Methodology
1. Create multiple subsets from original
dataset with replacement.
2. Build a base model for each of the
subsets.
3. Running all the models in parallel.
4. Combining predictions from all models
to obtain final predictions.
Advanced Ensemble Learning Techniques
3. Boosting
• If a data point is incorrectly predicted by the first model, and then the
next (probably all models), will combining the predictions provide
better results? Such situations are taken care of by boosting.
• Boosting is a sequential process, where each subsequent model
attempts to correct the errors of the previous model.
• Reduces bias and variance by converting weak learners into strong
learners.
Boosting - Methodology
1. Create a subset from the original data.
2. Build an initial model with this data.
3. Run predictions on the whole data set,.
4. Calculate the error using the predictions
and the actual values.
5. Assign more weight to the incorrect
predictions.
6. Create another model that attempts to fix
errors from the last model.
7. Run predictions on the entire dataset with
the new model.
8. Create several models with each model
aiming at correcting the errors generated by
the previous one.
9. Obtain the final model by weighting the
mean of all the models.
…and many more methods as well.
Libraries for Ensemble Learning
References
• https://www.analyticsvidhya.com/blog/2018/06/comprehensive-
guide-for-ensemble-models/
• Hands-On Machine Learning with Scikit-Learn and TensorFlow:
Concepts, Tools, and Techniques to Build Intelligent Systems, by
Aurélien Géron.
THE END.

More Related Content

What's hot

Ensemble Method (Bagging Boosting)
Ensemble Method (Bagging Boosting)Ensemble Method (Bagging Boosting)
Ensemble Method (Bagging Boosting)Abdullah al Mamun
 
Understanding random forests
Understanding random forestsUnderstanding random forests
Understanding random forestsMarc Garcia
 
2.5 backpropagation
2.5 backpropagation2.5 backpropagation
2.5 backpropagationKrish_ver2
 
07 dimensionality reduction
07 dimensionality reduction07 dimensionality reduction
07 dimensionality reductionMarco Quartulli
 
Ensemble hybrid learning technique
Ensemble hybrid learning techniqueEnsemble hybrid learning technique
Ensemble hybrid learning techniqueDishaSinha9
 
Data Science - Part V - Decision Trees & Random Forests
Data Science - Part V - Decision Trees & Random Forests Data Science - Part V - Decision Trees & Random Forests
Data Science - Part V - Decision Trees & Random Forests Derek Kane
 
Hands on machine learning with scikit-learn and tensor flow by ahmed yousry
Hands on machine learning with scikit-learn and tensor flow by ahmed yousryHands on machine learning with scikit-learn and tensor flow by ahmed yousry
Hands on machine learning with scikit-learn and tensor flow by ahmed yousryAhmed Yousry
 
Stacking ensemble
Stacking ensembleStacking ensemble
Stacking ensemblekalung0313
 
Decision Tree - C4.5&CART
Decision Tree - C4.5&CARTDecision Tree - C4.5&CART
Decision Tree - C4.5&CARTXueping Peng
 
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...butest
 
2.2 decision tree
2.2 decision tree2.2 decision tree
2.2 decision treeKrish_ver2
 
Restricted Boltzmann Machine | Neural Network Tutorial | Deep Learning Tutori...
Restricted Boltzmann Machine | Neural Network Tutorial | Deep Learning Tutori...Restricted Boltzmann Machine | Neural Network Tutorial | Deep Learning Tutori...
Restricted Boltzmann Machine | Neural Network Tutorial | Deep Learning Tutori...Edureka!
 
Introduction to random forest and gradient boosting methods a lecture
Introduction to random forest and gradient boosting methods   a lectureIntroduction to random forest and gradient boosting methods   a lecture
Introduction to random forest and gradient boosting methods a lectureShreyas S K
 
Association Rule Learning Part 1: Frequent Itemset Generation
Association Rule Learning Part 1: Frequent Itemset GenerationAssociation Rule Learning Part 1: Frequent Itemset Generation
Association Rule Learning Part 1: Frequent Itemset GenerationKnoldus Inc.
 

What's hot (20)

Decision tree and random forest
Decision tree and random forestDecision tree and random forest
Decision tree and random forest
 
Ensemble Method (Bagging Boosting)
Ensemble Method (Bagging Boosting)Ensemble Method (Bagging Boosting)
Ensemble Method (Bagging Boosting)
 
Understanding random forests
Understanding random forestsUnderstanding random forests
Understanding random forests
 
2.5 backpropagation
2.5 backpropagation2.5 backpropagation
2.5 backpropagation
 
07 dimensionality reduction
07 dimensionality reduction07 dimensionality reduction
07 dimensionality reduction
 
Ensemble hybrid learning technique
Ensemble hybrid learning techniqueEnsemble hybrid learning technique
Ensemble hybrid learning technique
 
Data Science - Part V - Decision Trees & Random Forests
Data Science - Part V - Decision Trees & Random Forests Data Science - Part V - Decision Trees & Random Forests
Data Science - Part V - Decision Trees & Random Forests
 
Support Vector Machine
Support Vector MachineSupport Vector Machine
Support Vector Machine
 
Hands on machine learning with scikit-learn and tensor flow by ahmed yousry
Hands on machine learning with scikit-learn and tensor flow by ahmed yousryHands on machine learning with scikit-learn and tensor flow by ahmed yousry
Hands on machine learning with scikit-learn and tensor flow by ahmed yousry
 
Stacking ensemble
Stacking ensembleStacking ensemble
Stacking ensemble
 
Arboles de decision
Arboles de decisionArboles de decision
Arboles de decision
 
Decision Tree - C4.5&CART
Decision Tree - C4.5&CARTDecision Tree - C4.5&CART
Decision Tree - C4.5&CART
 
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
CC282 Unsupervised Learning (Clustering) Lecture 7 slides for ...
 
2.2 decision tree
2.2 decision tree2.2 decision tree
2.2 decision tree
 
Decision tree
Decision treeDecision tree
Decision tree
 
Restricted Boltzmann Machine | Neural Network Tutorial | Deep Learning Tutori...
Restricted Boltzmann Machine | Neural Network Tutorial | Deep Learning Tutori...Restricted Boltzmann Machine | Neural Network Tutorial | Deep Learning Tutori...
Restricted Boltzmann Machine | Neural Network Tutorial | Deep Learning Tutori...
 
Introduction to random forest and gradient boosting methods a lecture
Introduction to random forest and gradient boosting methods   a lectureIntroduction to random forest and gradient boosting methods   a lecture
Introduction to random forest and gradient boosting methods a lecture
 
Ensemble methods
Ensemble methodsEnsemble methods
Ensemble methods
 
Support vector machines
Support vector machinesSupport vector machines
Support vector machines
 
Association Rule Learning Part 1: Frequent Itemset Generation
Association Rule Learning Part 1: Frequent Itemset GenerationAssociation Rule Learning Part 1: Frequent Itemset Generation
Association Rule Learning Part 1: Frequent Itemset Generation
 

Similar to Ensemble Learning.pptx

Random Forest.pptx
Random Forest.pptxRandom Forest.pptx
Random Forest.pptxSPIDERSRSTV
 
BaggingBoosting.pdf
BaggingBoosting.pdfBaggingBoosting.pdf
BaggingBoosting.pdfDynamicPitch
 
Ensemble methods in Machine learning technology
Ensemble methods in Machine learning technologyEnsemble methods in Machine learning technology
Ensemble methods in Machine learning technologysikethatsarightemail
 
Aaa ped-14-Ensemble Learning: About Ensemble Learning
Aaa ped-14-Ensemble Learning: About Ensemble LearningAaa ped-14-Ensemble Learning: About Ensemble Learning
Aaa ped-14-Ensemble Learning: About Ensemble LearningAminaRepo
 
A Multiple Classifiers System For Solving The Character Recognition Problem I...
A Multiple Classifiers System For Solving The Character Recognition Problem I...A Multiple Classifiers System For Solving The Character Recognition Problem I...
A Multiple Classifiers System For Solving The Character Recognition Problem I...Randa Elanwar
 
Statistical Learning and Model Selection module 2.pptx
Statistical Learning and Model Selection module 2.pptxStatistical Learning and Model Selection module 2.pptx
Statistical Learning and Model Selection module 2.pptxnagarajan740445
 
Cross-validation aggregation for forecasting
Cross-validation aggregation for forecastingCross-validation aggregation for forecasting
Cross-validation aggregation for forecastingDevon Barrow
 
RapidMiner: Learning Schemes In Rapid Miner
RapidMiner:  Learning Schemes In Rapid MinerRapidMiner:  Learning Schemes In Rapid Miner
RapidMiner: Learning Schemes In Rapid MinerDataminingTools Inc
 
RapidMiner: Learning Schemes In Rapid Miner5
RapidMiner:   Learning Schemes In Rapid Miner5RapidMiner:   Learning Schemes In Rapid Miner5
RapidMiner: Learning Schemes In Rapid Miner5Rapidmining Content
 
Simple Ensemble Learning
Simple Ensemble LearningSimple Ensemble Learning
Simple Ensemble LearningMushfiq18
 
Hyperparameter Tuning
Hyperparameter TuningHyperparameter Tuning
Hyperparameter TuningJon Lederman
 
Machine learning session6(decision trees random forrest)
Machine learning   session6(decision trees random forrest)Machine learning   session6(decision trees random forrest)
Machine learning session6(decision trees random forrest)Abhimanyu Dwivedi
 
NITW_Improving Deep Neural Networks (1).pptx
NITW_Improving Deep Neural Networks (1).pptxNITW_Improving Deep Neural Networks (1).pptx
NITW_Improving Deep Neural Networks (1).pptxDrKBManwade
 
NITW_Improving Deep Neural Networks.pptx
NITW_Improving Deep Neural Networks.pptxNITW_Improving Deep Neural Networks.pptx
NITW_Improving Deep Neural Networks.pptxssuserd23711
 
GA.-.Presentation
GA.-.PresentationGA.-.Presentation
GA.-.Presentationoldmanpat
 

Similar to Ensemble Learning.pptx (20)

Ensemble learning
Ensemble learningEnsemble learning
Ensemble learning
 
Random Forest.pptx
Random Forest.pptxRandom Forest.pptx
Random Forest.pptx
 
BaggingBoosting.pdf
BaggingBoosting.pdfBaggingBoosting.pdf
BaggingBoosting.pdf
 
Ensemble methods in Machine learning technology
Ensemble methods in Machine learning technologyEnsemble methods in Machine learning technology
Ensemble methods in Machine learning technology
 
Aaa ped-14-Ensemble Learning: About Ensemble Learning
Aaa ped-14-Ensemble Learning: About Ensemble LearningAaa ped-14-Ensemble Learning: About Ensemble Learning
Aaa ped-14-Ensemble Learning: About Ensemble Learning
 
A Multiple Classifiers System For Solving The Character Recognition Problem I...
A Multiple Classifiers System For Solving The Character Recognition Problem I...A Multiple Classifiers System For Solving The Character Recognition Problem I...
A Multiple Classifiers System For Solving The Character Recognition Problem I...
 
Statistical Learning and Model Selection module 2.pptx
Statistical Learning and Model Selection module 2.pptxStatistical Learning and Model Selection module 2.pptx
Statistical Learning and Model Selection module 2.pptx
 
ANN - UNIT 3.pptx
ANN - UNIT 3.pptxANN - UNIT 3.pptx
ANN - UNIT 3.pptx
 
ANN - UNIT 3.pptx
ANN - UNIT 3.pptxANN - UNIT 3.pptx
ANN - UNIT 3.pptx
 
Cross-validation aggregation for forecasting
Cross-validation aggregation for forecastingCross-validation aggregation for forecasting
Cross-validation aggregation for forecasting
 
RapidMiner: Learning Schemes In Rapid Miner
RapidMiner:  Learning Schemes In Rapid MinerRapidMiner:  Learning Schemes In Rapid Miner
RapidMiner: Learning Schemes In Rapid Miner
 
RapidMiner: Learning Schemes In Rapid Miner5
RapidMiner:   Learning Schemes In Rapid Miner5RapidMiner:   Learning Schemes In Rapid Miner5
RapidMiner: Learning Schemes In Rapid Miner5
 
Simple Ensemble Learning
Simple Ensemble LearningSimple Ensemble Learning
Simple Ensemble Learning
 
Hyperparameter Tuning
Hyperparameter TuningHyperparameter Tuning
Hyperparameter Tuning
 
Machine learning session6(decision trees random forrest)
Machine learning   session6(decision trees random forrest)Machine learning   session6(decision trees random forrest)
Machine learning session6(decision trees random forrest)
 
Maths Behind Models.pptx
Maths Behind Models.pptxMaths Behind Models.pptx
Maths Behind Models.pptx
 
NITW_Improving Deep Neural Networks (1).pptx
NITW_Improving Deep Neural Networks (1).pptxNITW_Improving Deep Neural Networks (1).pptx
NITW_Improving Deep Neural Networks (1).pptx
 
NITW_Improving Deep Neural Networks.pptx
NITW_Improving Deep Neural Networks.pptxNITW_Improving Deep Neural Networks.pptx
NITW_Improving Deep Neural Networks.pptx
 
Ensemble learning
Ensemble learningEnsemble learning
Ensemble learning
 
GA.-.Presentation
GA.-.PresentationGA.-.Presentation
GA.-.Presentation
 

Recently uploaded

Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZTE
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learningmisbanausheenparvam
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
 
power system scada applications and uses
power system scada applications and usespower system scada applications and uses
power system scada applications and usesDevarapalliHaritha
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxPoojaBan
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLDeelipZope
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 

Recently uploaded (20)

Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
 
power system scada applications and uses
power system scada applications and usespower system scada applications and uses
power system scada applications and uses
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptx
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCL
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 

Ensemble Learning.pptx

  • 1. Ensemble Learning By – Vidhu Jha Roll No. – 19MT10048
  • 2. Introduction to Ensemble Learning • Ensemble Learning is a combination of several machine learning models (weak learners) in 1 problem. The intuition is that when you comebine several weak learners, they can become strong learners. • Each weak learner is fitted on the training set and provides predictions obtained. The final prediction is computed by combining the results from all the weak learners. In this manner, it usually produces more accurate solutions than a single model would.
  • 3. Basic Ensemble Learning Techniques 1. Max Voting: Prediction from each model = 1 vote. Final prediction comes from the prediction with the most votes. 2. Averaging: Final prediction is the average of all predictions. This goes for regression problems, example – random forest regression, where the final result is the average of the predictions from individual decision trees. 3. Weighted Average: The base model with higher predictive power is more important and thus, given more weight – the sum of the weights would equal 1.
  • 4. Advanced Ensemble Learning Techniques 1. Stacking • Here, various estimators are combined to reduce their biases. • Predictions from each estimator are stacked together and used as input to a final estimator (meta-model). • Training of the final estimator happens via cross-validation.
  • 5. Stacking - Methodology 1.Divide the training set into K folds, for example 10. 2.Train a base model (say decision tree) on 9 folds and make predictions on the 10th fold. 3.Repeat until you have a prediction for each fold. 4.Fit the base model on the whole training set. 5.Use the model to make predictions on the test set. 6.Repeat step 2-5 for other base models (for example knn). 7.Use predictions from the test set as features to build a new model (meta-model). 8. Make final predictions on the test set using the meta model.
  • 6. Advanced Ensemble Learning Techniques 2. Blending • Uses a holdout/validation set from the training set to make predictions, i.e., predictions are made on the holdout set only. • The holdout set and the predictions are used to build a model which is run on the test set. • It is simpler than stacking and prevents leakage of information in the model. • However, blending uses less data, may lead to overfitting.
  • 7. Blending - Methodology 1. The training set is split into training and validation sets. 2. Models are fitted on the training set. 3. Make predictions on the validation and training set. 4. Use validation set and its predictions to build a final mode. 5. Make final predictions using this model
  • 8. Advanced Ensemble Learning Techniques 3. Bagging • Takes random samples of data, builds learning algorithms and uses the mean to find bagging probabilities. • Also known as bootstrap aggregating. • Aggregates the results from several models in order to obtain a generalized result.
  • 9. Bagging - Methodology 1. Create multiple subsets from original dataset with replacement. 2. Build a base model for each of the subsets. 3. Running all the models in parallel. 4. Combining predictions from all models to obtain final predictions.
  • 10. Advanced Ensemble Learning Techniques 3. Boosting • If a data point is incorrectly predicted by the first model, and then the next (probably all models), will combining the predictions provide better results? Such situations are taken care of by boosting. • Boosting is a sequential process, where each subsequent model attempts to correct the errors of the previous model. • Reduces bias and variance by converting weak learners into strong learners.
  • 11. Boosting - Methodology 1. Create a subset from the original data. 2. Build an initial model with this data. 3. Run predictions on the whole data set,. 4. Calculate the error using the predictions and the actual values. 5. Assign more weight to the incorrect predictions. 6. Create another model that attempts to fix errors from the last model. 7. Run predictions on the entire dataset with the new model. 8. Create several models with each model aiming at correcting the errors generated by the previous one. 9. Obtain the final model by weighting the mean of all the models.
  • 12. …and many more methods as well. Libraries for Ensemble Learning
  • 13. References • https://www.analyticsvidhya.com/blog/2018/06/comprehensive- guide-for-ensemble-models/ • Hands-On Machine Learning with Scikit-Learn and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems, by Aurélien Géron. THE END.