SlideShare a Scribd company logo
Ways of Attaining Optimal Model
Complexity
Author:
Sudi Murindanyi
2022/HD05/5583X
Presented by - Sudi
Affiliation: Makerere University
Uganda
2
Important words
❑ Machine Learning(ML): Is the use and development of computer
systems that are able to learn and adapt without following explicit
instructions, by using algorithms and statistical models to analyse
and draw inferences from patterns in data.
❑ Model training: In ML, It is the process of feeding an ML algorithm
with data to help identify and learn good values for all attributes
involved.
❑ Model Complexity: In ML, Model Complexity usually refers to the
number of degrees of freedom in a learned model, often measured as
the number of adjustable weights or parameters in the architecture
doing the learning.
❑ Optimization: In ML, It is the process where we train the model
iteratively that results in a maximum and minimum function
evaluation.
3
Background
❑ ML has grown at a remarkable rate, attracting a great
number of researchers and practitioners. It has become
one of the most popular research directions and plays a
significant role in many fields, such as machine
translation, speech recognition, image recognition,
recommendation systems, etc.
❑ Optimization is one of the core components of machine-
learning. The essence of most machine learning
algorithms is to build an optimization model and learn the
parameters in the objective function from the given data.
❑ In order to promote the development of machine
learning, a series of effective optimal model complexity
was put forward, which have improved the performance
and efficiency of the machine-learning methods.
4
5 way of attaining optimal model complexity
1. Feature Engineering and Selection
2. Data Augmentation
3. Dimensionality Reduction
4. Active Learning
5. Ensemble Models
5
1. Feature Engineering and Selection
Feature engineering refers to a process of selecting and transforming
variables/features in your dataset when creating a predictive model using machine
learning while, Feature selection is the process where you automatically or manually
select the features that contribute the most to your prediction variable or output.
Feature Engineering techniques
❑ Handling missing Data(N/A, null, NaN, none, etc):
❑ Variable deletion: Involve dropping variables with missing values on a case-by-
basis
❑ Mean or medium imputation: Uses the mean & median of the non-missing
observation .(Numerical value)
❑ Most common value: replace the missing values with maximum occurred value in
a column/feature (categorical column/features)
❑ Handling continuous features (Difference range of values age, salary)
❑ Min-Max Normalization: It shrink all values in a fixed range between 0 and 1.
subtracts the minimum value in the feature and then diodes by its range.
❑ standardization: its ensures that each feature has a mean of 0 and a standard
deviation of 1.bringing all features to the some magnitude.
6
con’t
❑ Handling categorical features(Divide into groups eg:gender and
education )
❑ label encoding: simply converting each categorical value in a column
to a number.
❑ one-hot-encoding(dummy variable): help to replace aa categorical
variable with one or more features that can have the value 0 and 1.
Features selection techniques
❑ univariate selection: help select independent features that have the
strongest relationship with the target feature in your dataset.
❑ feature importance: give you a secure for each feature of your data.
the higher the scare, the important or relevant that feature is to your
target feature.
❑ correlation matrix heatmap: shows how the feature are related to
each other or the target features.
7
2. Data Augmentation
Data Augmentation is the process of modifying, or augmenting a dataset with
additional dataset.
Many model need a huge amount of data, so they benefit more form data
augmentation techniques.
Types of data augmentation
❑ Real data augmentation: Is is when you add real, additional data to a dataset.
adding a real dataset can be done in two ways, merge and fuzzy merge.
❑ synthetic data augmentation: This is when you add synthetic data, or fake data
that simply looks real.
Challenges in augmentation methods
The are a number of challenges that need to be solved in order to create effective data
augmentation methods to be solved in order to create effective data augmentation
methods, including scalability, heterogeneous datasets, relevance, data duplication,
and validation
8
3. Dimensionality Reduction
When dealing with high dimensional data, it is often useful to reduce the
dimensionality by projecting the data to a lower dimensional subspace
with captures the essence of the data. this called dimensionality
reduction.
Techniques for dimensionality reduction
❑ feature selection methods: uses scoring or statistical methods to
select which features to keep and which features to delete.
❑ matrix factorization: It uses linear algebra for example matrix
factorization methods to reduce dataset matrix into its constituent
parts, for dimensionality reduction eg: PCA
❑ manifold learning: They are used to create a low-dimensional
projection of high dimension data, often for the purpose of data
visualization(Multidimensional scaling).
❑ autoencoder methods:It involves framing a self supervised learning
problem where a model must reproduce the input correctly.
9
4. Active Learning
Active learning is the subset of machine learning in which a
learning algorithm can query a user interactively to label
data with the desired outputs. In active learning, the
algorithm proactively selects the subset of examples to be
labeled next from the pool of unlabeled data.
They are three categories of active learning:
❑ Stream-based selective sampling
❑ Pool-based sampling
❑ Membership query synthesis
10
5. Ensemble Models
Ensemble learning helps improve machine learning results by combining
several models. This approach allows the production of better predictive
performance compared to a single model. Basic idea is to learn a set of
classifiers (experts) and to allow them to vote.
11
Con’t
Methods for Independently Constructing Ensembles –
❏ Majority Vote
❏ Bagging and Random Forest
❏ Randomness Injection
❏ Feature-Selection Ensembles
❏ Error-Correcting Output Coding
Methods for Coordinated Construction of Ensembles –
❏ Boosting
❏ Stacking
12
References
1. S. Sun, Z. Cao, H. Zhu and J. Zhao, "A Survey of Optimization
Methods From a Machine Learning Perspective," in IEEE
Transactions on Cybernetics, vol. 50, no. 8, pp. 3668-3681, Aug.
2020, doi: 10.1109/TCYB.2019.2950779.
2. Machine Learning Tutorial – Feature Engineering and Feature
Selection For Beginners (freecodecamp.org)
3. Augmentation for Machine Learning (akkio.com)
4. Introduction to Dimensionality Reduction for Machine Learning -
MachineLearningMastery.com
5. Active learning machine learning: What it is and how it works -
DataRobot AI Cloud
6. Ensemble Classifier | Data Mining - GeeksforGeeks
7. Best Practices for Improving Your Machine Learning and Deep
Learning Models - neptune.ai
8. (17) Controlling Model Complexity in Machine Learning | LinkedIn

More Related Content

Similar to Optimal Model Complexity (1).pptx

ML_Module_1.pdf
ML_Module_1.pdfML_Module_1.pdf
ML_Module_1.pdf
JafarHussain48
 
Tech meetup Data Driven - Codemotion
Tech meetup Data Driven - Codemotion Tech meetup Data Driven - Codemotion
Tech meetup Data Driven - Codemotion
antimo musone
 
Presentation1.pptx
Presentation1.pptxPresentation1.pptx
Presentation1.pptx
narmeen11
 
It's Machine Learning Basics -- For You!
It's Machine Learning Basics -- For You!It's Machine Learning Basics -- For You!
It's Machine Learning Basics -- For You!
To Sum It Up
 
Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)
SwatiTripathi44
 
cyberbullyingdetectionusingmachinelearning-11-220913143556-fec10e26.pptx
cyberbullyingdetectionusingmachinelearning-11-220913143556-fec10e26.pptxcyberbullyingdetectionusingmachinelearning-11-220913143556-fec10e26.pptx
cyberbullyingdetectionusingmachinelearning-11-220913143556-fec10e26.pptx
SaiKiran101146
 
Machine Learning - Lecture1.pptx.pdf
Machine Learning - Lecture1.pptx.pdfMachine Learning - Lecture1.pptx.pdf
Machine Learning - Lecture1.pptx.pdf
NsitTech
 
Deep Learning Vocabulary.docx
Deep Learning Vocabulary.docxDeep Learning Vocabulary.docx
Deep Learning Vocabulary.docx
jaffarbikat
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
Knowledge And Skill Forum
 
Dive into Machine Learning Event MUGDSC.pptx
Dive into Machine Learning Event MUGDSC.pptxDive into Machine Learning Event MUGDSC.pptx
Dive into Machine Learning Event MUGDSC.pptx
RakshaAgrawal21
 
Dive into Machine Learning Event--MUGDSC
Dive into Machine Learning Event--MUGDSCDive into Machine Learning Event--MUGDSC
Dive into Machine Learning Event--MUGDSC
RakshaAgrawal21
 
introduction to Statistical Theory.pptx
 introduction to Statistical Theory.pptx introduction to Statistical Theory.pptx
introduction to Statistical Theory.pptx
Dr.Shweta
 
A Survey on Machine Learning Algorithms
A Survey on Machine Learning AlgorithmsA Survey on Machine Learning Algorithms
A Survey on Machine Learning Algorithms
AM Publications
 
Machine Learning - Deep Learning
Machine Learning - Deep LearningMachine Learning - Deep Learning
Machine Learning - Deep Learning
Adetimehin Oluwasegun Matthew
 
Lecture-1-Introduction to Deep learning.pptx
Lecture-1-Introduction to Deep learning.pptxLecture-1-Introduction to Deep learning.pptx
Lecture-1-Introduction to Deep learning.pptx
JayChauhan100
 
Lesson 1 - Overview of Machine Learning and Data Analysis.pptx
Lesson 1 - Overview of Machine Learning and Data Analysis.pptxLesson 1 - Overview of Machine Learning and Data Analysis.pptx
Lesson 1 - Overview of Machine Learning and Data Analysis.pptx
cloudserviceuit
 
Chapter 05 Machine Learning.pptx
Chapter 05 Machine Learning.pptxChapter 05 Machine Learning.pptx
Chapter 05 Machine Learning.pptx
ssuser957b41
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
Adetimehin Oluwasegun Matthew
 
Machine Learning Interview Questions and Answers
Machine Learning Interview Questions and AnswersMachine Learning Interview Questions and Answers
Machine Learning Interview Questions and Answers
Satyam Jaiswal
 
Machine Learning Terminologies
Machine Learning TerminologiesMachine Learning Terminologies
Machine Learning Terminologies
Ajitesh Kumar
 

Similar to Optimal Model Complexity (1).pptx (20)

ML_Module_1.pdf
ML_Module_1.pdfML_Module_1.pdf
ML_Module_1.pdf
 
Tech meetup Data Driven - Codemotion
Tech meetup Data Driven - Codemotion Tech meetup Data Driven - Codemotion
Tech meetup Data Driven - Codemotion
 
Presentation1.pptx
Presentation1.pptxPresentation1.pptx
Presentation1.pptx
 
It's Machine Learning Basics -- For You!
It's Machine Learning Basics -- For You!It's Machine Learning Basics -- For You!
It's Machine Learning Basics -- For You!
 
Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)
 
cyberbullyingdetectionusingmachinelearning-11-220913143556-fec10e26.pptx
cyberbullyingdetectionusingmachinelearning-11-220913143556-fec10e26.pptxcyberbullyingdetectionusingmachinelearning-11-220913143556-fec10e26.pptx
cyberbullyingdetectionusingmachinelearning-11-220913143556-fec10e26.pptx
 
Machine Learning - Lecture1.pptx.pdf
Machine Learning - Lecture1.pptx.pdfMachine Learning - Lecture1.pptx.pdf
Machine Learning - Lecture1.pptx.pdf
 
Deep Learning Vocabulary.docx
Deep Learning Vocabulary.docxDeep Learning Vocabulary.docx
Deep Learning Vocabulary.docx
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Dive into Machine Learning Event MUGDSC.pptx
Dive into Machine Learning Event MUGDSC.pptxDive into Machine Learning Event MUGDSC.pptx
Dive into Machine Learning Event MUGDSC.pptx
 
Dive into Machine Learning Event--MUGDSC
Dive into Machine Learning Event--MUGDSCDive into Machine Learning Event--MUGDSC
Dive into Machine Learning Event--MUGDSC
 
introduction to Statistical Theory.pptx
 introduction to Statistical Theory.pptx introduction to Statistical Theory.pptx
introduction to Statistical Theory.pptx
 
A Survey on Machine Learning Algorithms
A Survey on Machine Learning AlgorithmsA Survey on Machine Learning Algorithms
A Survey on Machine Learning Algorithms
 
Machine Learning - Deep Learning
Machine Learning - Deep LearningMachine Learning - Deep Learning
Machine Learning - Deep Learning
 
Lecture-1-Introduction to Deep learning.pptx
Lecture-1-Introduction to Deep learning.pptxLecture-1-Introduction to Deep learning.pptx
Lecture-1-Introduction to Deep learning.pptx
 
Lesson 1 - Overview of Machine Learning and Data Analysis.pptx
Lesson 1 - Overview of Machine Learning and Data Analysis.pptxLesson 1 - Overview of Machine Learning and Data Analysis.pptx
Lesson 1 - Overview of Machine Learning and Data Analysis.pptx
 
Chapter 05 Machine Learning.pptx
Chapter 05 Machine Learning.pptxChapter 05 Machine Learning.pptx
Chapter 05 Machine Learning.pptx
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
 
Machine Learning Interview Questions and Answers
Machine Learning Interview Questions and AnswersMachine Learning Interview Questions and Answers
Machine Learning Interview Questions and Answers
 
Machine Learning Terminologies
Machine Learning TerminologiesMachine Learning Terminologies
Machine Learning Terminologies
 

Recently uploaded

End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
Social Samosa
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
zsjl4mimo
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
74nqk8xf
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
74nqk8xf
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
g4dpvqap0
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
soxrziqu
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
A presentation that explain the Power BI Licensing
A presentation that explain the Power BI LicensingA presentation that explain the Power BI Licensing
A presentation that explain the Power BI Licensing
AlessioFois2
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
nuttdpt
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
aqzctr7x
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
apvysm8
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 

Recently uploaded (20)

End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
A presentation that explain the Power BI Licensing
A presentation that explain the Power BI LicensingA presentation that explain the Power BI Licensing
A presentation that explain the Power BI Licensing
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 

Optimal Model Complexity (1).pptx

  • 1. Ways of Attaining Optimal Model Complexity Author: Sudi Murindanyi 2022/HD05/5583X Presented by - Sudi Affiliation: Makerere University Uganda
  • 2. 2 Important words ❑ Machine Learning(ML): Is the use and development of computer systems that are able to learn and adapt without following explicit instructions, by using algorithms and statistical models to analyse and draw inferences from patterns in data. ❑ Model training: In ML, It is the process of feeding an ML algorithm with data to help identify and learn good values for all attributes involved. ❑ Model Complexity: In ML, Model Complexity usually refers to the number of degrees of freedom in a learned model, often measured as the number of adjustable weights or parameters in the architecture doing the learning. ❑ Optimization: In ML, It is the process where we train the model iteratively that results in a maximum and minimum function evaluation.
  • 3. 3 Background ❑ ML has grown at a remarkable rate, attracting a great number of researchers and practitioners. It has become one of the most popular research directions and plays a significant role in many fields, such as machine translation, speech recognition, image recognition, recommendation systems, etc. ❑ Optimization is one of the core components of machine- learning. The essence of most machine learning algorithms is to build an optimization model and learn the parameters in the objective function from the given data. ❑ In order to promote the development of machine learning, a series of effective optimal model complexity was put forward, which have improved the performance and efficiency of the machine-learning methods.
  • 4. 4 5 way of attaining optimal model complexity 1. Feature Engineering and Selection 2. Data Augmentation 3. Dimensionality Reduction 4. Active Learning 5. Ensemble Models
  • 5. 5 1. Feature Engineering and Selection Feature engineering refers to a process of selecting and transforming variables/features in your dataset when creating a predictive model using machine learning while, Feature selection is the process where you automatically or manually select the features that contribute the most to your prediction variable or output. Feature Engineering techniques ❑ Handling missing Data(N/A, null, NaN, none, etc): ❑ Variable deletion: Involve dropping variables with missing values on a case-by- basis ❑ Mean or medium imputation: Uses the mean & median of the non-missing observation .(Numerical value) ❑ Most common value: replace the missing values with maximum occurred value in a column/feature (categorical column/features) ❑ Handling continuous features (Difference range of values age, salary) ❑ Min-Max Normalization: It shrink all values in a fixed range between 0 and 1. subtracts the minimum value in the feature and then diodes by its range. ❑ standardization: its ensures that each feature has a mean of 0 and a standard deviation of 1.bringing all features to the some magnitude.
  • 6. 6 con’t ❑ Handling categorical features(Divide into groups eg:gender and education ) ❑ label encoding: simply converting each categorical value in a column to a number. ❑ one-hot-encoding(dummy variable): help to replace aa categorical variable with one or more features that can have the value 0 and 1. Features selection techniques ❑ univariate selection: help select independent features that have the strongest relationship with the target feature in your dataset. ❑ feature importance: give you a secure for each feature of your data. the higher the scare, the important or relevant that feature is to your target feature. ❑ correlation matrix heatmap: shows how the feature are related to each other or the target features.
  • 7. 7 2. Data Augmentation Data Augmentation is the process of modifying, or augmenting a dataset with additional dataset. Many model need a huge amount of data, so they benefit more form data augmentation techniques. Types of data augmentation ❑ Real data augmentation: Is is when you add real, additional data to a dataset. adding a real dataset can be done in two ways, merge and fuzzy merge. ❑ synthetic data augmentation: This is when you add synthetic data, or fake data that simply looks real. Challenges in augmentation methods The are a number of challenges that need to be solved in order to create effective data augmentation methods to be solved in order to create effective data augmentation methods, including scalability, heterogeneous datasets, relevance, data duplication, and validation
  • 8. 8 3. Dimensionality Reduction When dealing with high dimensional data, it is often useful to reduce the dimensionality by projecting the data to a lower dimensional subspace with captures the essence of the data. this called dimensionality reduction. Techniques for dimensionality reduction ❑ feature selection methods: uses scoring or statistical methods to select which features to keep and which features to delete. ❑ matrix factorization: It uses linear algebra for example matrix factorization methods to reduce dataset matrix into its constituent parts, for dimensionality reduction eg: PCA ❑ manifold learning: They are used to create a low-dimensional projection of high dimension data, often for the purpose of data visualization(Multidimensional scaling). ❑ autoencoder methods:It involves framing a self supervised learning problem where a model must reproduce the input correctly.
  • 9. 9 4. Active Learning Active learning is the subset of machine learning in which a learning algorithm can query a user interactively to label data with the desired outputs. In active learning, the algorithm proactively selects the subset of examples to be labeled next from the pool of unlabeled data. They are three categories of active learning: ❑ Stream-based selective sampling ❑ Pool-based sampling ❑ Membership query synthesis
  • 10. 10 5. Ensemble Models Ensemble learning helps improve machine learning results by combining several models. This approach allows the production of better predictive performance compared to a single model. Basic idea is to learn a set of classifiers (experts) and to allow them to vote.
  • 11. 11 Con’t Methods for Independently Constructing Ensembles – ❏ Majority Vote ❏ Bagging and Random Forest ❏ Randomness Injection ❏ Feature-Selection Ensembles ❏ Error-Correcting Output Coding Methods for Coordinated Construction of Ensembles – ❏ Boosting ❏ Stacking
  • 12. 12 References 1. S. Sun, Z. Cao, H. Zhu and J. Zhao, "A Survey of Optimization Methods From a Machine Learning Perspective," in IEEE Transactions on Cybernetics, vol. 50, no. 8, pp. 3668-3681, Aug. 2020, doi: 10.1109/TCYB.2019.2950779. 2. Machine Learning Tutorial – Feature Engineering and Feature Selection For Beginners (freecodecamp.org) 3. Augmentation for Machine Learning (akkio.com) 4. Introduction to Dimensionality Reduction for Machine Learning - MachineLearningMastery.com 5. Active learning machine learning: What it is and how it works - DataRobot AI Cloud 6. Ensemble Classifier | Data Mining - GeeksforGeeks 7. Best Practices for Improving Your Machine Learning and Deep Learning Models - neptune.ai 8. (17) Controlling Model Complexity in Machine Learning | LinkedIn