SlideShare a Scribd company logo
Advance regression
model
&
K neighbour
algorithm
Submitted by
M.Neya shree
M.sc cs
What Is a Regression Model?
A regression model provides a function that
describes the relationship between one or more
independent variables and a response, dependent, or target
variable.
For example, the relationship between height and weight
may be described by a linear regression model.
Types of Regression
1. Linear
A linear regression is a model where the relationship between
inputs and outputs is a straight line. This is the easiest to
conceptualize and even observe in the real world. Even when a
relationship isn’t very linear, our brains try to see the pattern
and attach a rudimentary linear model to that relationship.
One example may be around the number of responses to a
marketing campaign. If we send 1,000 emails, we may get five
responses. If this relationship can be modeled using a linear
regression, we would expect to get ten responses when we
send 2,000 emails. Your chart may vary, but the general idea is
that we associate a predictor and a target, and we assume a
relationship between the two.
2. Multiple
Multiple regression indicates that there are more than one input
variables that may affect the outcome, or target variable. For
our email campaign example, you may include an additional
variable with the number of emails sent in the last month.
By looking at both input variables, a clearer picture starts to
emerge about what drives users to respond to a campaign and
how to optimize email timing and frequency. While
conceptualizing the model becomes more complex with more
inputs, the relationship may continue to be linear.
For these models, it is important to understand exactly what
effect each input has and how they combine to produce the
final target variable results.
3. Non-Linear
A linear model may work for some parts of the marketing
example above. However, we know that as we keep increasing
the number of emails in a particular campaign, the number of
responses starts to decline vs the number of emails sent. To
model this, we need a non-linear regression model.
This model will initially show a positive relationship between
number of emails and the response, but as the number of
emails increases, the model will flatten out and become almost
constant. The important takeaway here is that it is important to
understand when a model could potentially be non-linear. If you
make assumptions based on a linear model, you could get
results that are very different than expectations.
4. Stepwise Regression Modeling
While the other items we have talked about until now are specific
types of models, stepwise regression is more of a technique. If a
model involves many potential inputs, the analyst may start with the
most directly correlated input variable to build a model. Once that is
accomplished, the next step is to make the model more accurate.
To do that, additional input variables can be added to the model one
at a time in order of significance of the results. Using our email
marketing example, the initial model may be based on just the number
of emails sent. Then we would add something like the average age of
the email recipient. After that, we would add the average number of
emails each recipient has received from us. Each additional variable
would add a small amount of additional accuracy to the model.
Applications of k-NN in machine learning
The k-NN algorithm has been utilized within a variety of
applications, largely within classification. Some of these use cases
include:
- Data preprocessing: Datasets frequently have missing values,
but the KNN algorithm can estimate for those values in a process
known as missing data imputation.
- Recommendation Engines: Using clickstream data from
websites, the KNN algorithm has been used to provide automatic
recommendations to users on additional content. This research
(link resides outside of ibm.com) shows that the a user is assigned
to a particular group, and based on that group’s user behavior,
they are given a recommendation. However, given the scaling
issues with KNN, this approach may not be optimal for larger
datasets.
Advantages
- Easy to implement: Given the algorithm’s simplicity and
accuracy, it is one of the first classifiers that a new data
scientist will learn.
- Adapts easily: As new training samples are added, the
algorithm adjusts to account for any new data since all training
data is stored into memory.
- Few hyperparameters: KNN only requires a k value and a
distance metric, which is low when compared to other machine
learning algorithms.
Disadvantages
● Does not scale well
● Curse of dimensionality
● Prone to overfitting
Thank you

More Related Content

Similar to AI and ML.pptx

Top-15-Best-Practices-for-Email-Marketers-with-AI.pdf
Top-15-Best-Practices-for-Email-Marketers-with-AI.pdfTop-15-Best-Practices-for-Email-Marketers-with-AI.pdf
Top-15-Best-Practices-for-Email-Marketers-with-AI.pdf
LolaNikulshina
 
Regresión
RegresiónRegresión
Conversion Prediction for Advertisement Recommendation using Expectation Maxi...
Conversion Prediction for Advertisement Recommendation using Expectation Maxi...Conversion Prediction for Advertisement Recommendation using Expectation Maxi...
Conversion Prediction for Advertisement Recommendation using Expectation Maxi...
IJCSIS Research Publications
 
Instance Selection and Optimization of Neural Networks
Instance Selection and Optimization of Neural NetworksInstance Selection and Optimization of Neural Networks
Instance Selection and Optimization of Neural Networks
ITIIIndustries
 
Market Research using SPSS _ Edu4Sure Sept 2023.ppt
Market Research using SPSS _ Edu4Sure Sept 2023.pptMarket Research using SPSS _ Edu4Sure Sept 2023.ppt
Market Research using SPSS _ Edu4Sure Sept 2023.ppt
Edu4Sure
 
M08 BiasVarianceTradeoff
M08 BiasVarianceTradeoffM08 BiasVarianceTradeoff
M08 BiasVarianceTradeoff
Raman Kannan
 
TEXT GENERATION WITH GAN NETWORKS USING FEEDBACK SCORE
TEXT GENERATION WITH GAN NETWORKS USING FEEDBACK SCORETEXT GENERATION WITH GAN NETWORKS USING FEEDBACK SCORE
TEXT GENERATION WITH GAN NETWORKS USING FEEDBACK SCORE
IJCI JOURNAL
 
Regularization_BY_MOHAMED_ESSAM.pptx
Regularization_BY_MOHAMED_ESSAM.pptxRegularization_BY_MOHAMED_ESSAM.pptx
Regularization_BY_MOHAMED_ESSAM.pptx
Mohamed Essam
 
AWS Certified Machine Learning Specialty
AWS Certified Machine Learning Specialty AWS Certified Machine Learning Specialty
AWS Certified Machine Learning Specialty
Adnan Rashid
 
Machine Learning.pptx
Machine Learning.pptxMachine Learning.pptx
Machine Learning.pptx
NitinSharma134320
 
Employee mode of commuting
Employee mode of commutingEmployee mode of commuting
Employee mode of commuting
Saleesh Satheeshchandran
 
Dimd_m_004 DL.pdf
Dimd_m_004 DL.pdfDimd_m_004 DL.pdf
Dimd_m_004 DL.pdf
juan631
 
Machine-Learning-with-Ridge-and-Lasso-Regression.pdf
Machine-Learning-with-Ridge-and-Lasso-Regression.pdfMachine-Learning-with-Ridge-and-Lasso-Regression.pdf
Machine-Learning-with-Ridge-and-Lasso-Regression.pdf
AyadIliass
 
Feature selection
Feature selectionFeature selection
Feature selection
Abhinav Katoch
 
Chapter 05 Machine Learning.pptx
Chapter 05 Machine Learning.pptxChapter 05 Machine Learning.pptx
Chapter 05 Machine Learning.pptx
ssuser957b41
 
Campaign response modeling
Campaign response modelingCampaign response modeling
Campaign response modeling
Esteban Ribero
 
Top Machine Learning Algorithms Used By AI Professionals ARTiBA.pdf
Top Machine Learning Algorithms Used By AI Professionals ARTiBA.pdfTop Machine Learning Algorithms Used By AI Professionals ARTiBA.pdf
Top Machine Learning Algorithms Used By AI Professionals ARTiBA.pdf
Artificial Intelligence Board of America
 
Module 4: Model Selection and Evaluation
Module 4: Model Selection and EvaluationModule 4: Model Selection and Evaluation
Module 4: Model Selection and Evaluation
Sara Hooker
 
Final Report
Final ReportFinal Report
Final ReportAman Soni
 

Similar to AI and ML.pptx (20)

Top-15-Best-Practices-for-Email-Marketers-with-AI.pdf
Top-15-Best-Practices-for-Email-Marketers-with-AI.pdfTop-15-Best-Practices-for-Email-Marketers-with-AI.pdf
Top-15-Best-Practices-for-Email-Marketers-with-AI.pdf
 
Regresión
RegresiónRegresión
Regresión
 
Conversion Prediction for Advertisement Recommendation using Expectation Maxi...
Conversion Prediction for Advertisement Recommendation using Expectation Maxi...Conversion Prediction for Advertisement Recommendation using Expectation Maxi...
Conversion Prediction for Advertisement Recommendation using Expectation Maxi...
 
SAS Text Mining
SAS Text MiningSAS Text Mining
SAS Text Mining
 
Instance Selection and Optimization of Neural Networks
Instance Selection and Optimization of Neural NetworksInstance Selection and Optimization of Neural Networks
Instance Selection and Optimization of Neural Networks
 
Market Research using SPSS _ Edu4Sure Sept 2023.ppt
Market Research using SPSS _ Edu4Sure Sept 2023.pptMarket Research using SPSS _ Edu4Sure Sept 2023.ppt
Market Research using SPSS _ Edu4Sure Sept 2023.ppt
 
M08 BiasVarianceTradeoff
M08 BiasVarianceTradeoffM08 BiasVarianceTradeoff
M08 BiasVarianceTradeoff
 
TEXT GENERATION WITH GAN NETWORKS USING FEEDBACK SCORE
TEXT GENERATION WITH GAN NETWORKS USING FEEDBACK SCORETEXT GENERATION WITH GAN NETWORKS USING FEEDBACK SCORE
TEXT GENERATION WITH GAN NETWORKS USING FEEDBACK SCORE
 
Regularization_BY_MOHAMED_ESSAM.pptx
Regularization_BY_MOHAMED_ESSAM.pptxRegularization_BY_MOHAMED_ESSAM.pptx
Regularization_BY_MOHAMED_ESSAM.pptx
 
AWS Certified Machine Learning Specialty
AWS Certified Machine Learning Specialty AWS Certified Machine Learning Specialty
AWS Certified Machine Learning Specialty
 
Machine Learning.pptx
Machine Learning.pptxMachine Learning.pptx
Machine Learning.pptx
 
Employee mode of commuting
Employee mode of commutingEmployee mode of commuting
Employee mode of commuting
 
Dimd_m_004 DL.pdf
Dimd_m_004 DL.pdfDimd_m_004 DL.pdf
Dimd_m_004 DL.pdf
 
Machine-Learning-with-Ridge-and-Lasso-Regression.pdf
Machine-Learning-with-Ridge-and-Lasso-Regression.pdfMachine-Learning-with-Ridge-and-Lasso-Regression.pdf
Machine-Learning-with-Ridge-and-Lasso-Regression.pdf
 
Feature selection
Feature selectionFeature selection
Feature selection
 
Chapter 05 Machine Learning.pptx
Chapter 05 Machine Learning.pptxChapter 05 Machine Learning.pptx
Chapter 05 Machine Learning.pptx
 
Campaign response modeling
Campaign response modelingCampaign response modeling
Campaign response modeling
 
Top Machine Learning Algorithms Used By AI Professionals ARTiBA.pdf
Top Machine Learning Algorithms Used By AI Professionals ARTiBA.pdfTop Machine Learning Algorithms Used By AI Professionals ARTiBA.pdf
Top Machine Learning Algorithms Used By AI Professionals ARTiBA.pdf
 
Module 4: Model Selection and Evaluation
Module 4: Model Selection and EvaluationModule 4: Model Selection and Evaluation
Module 4: Model Selection and Evaluation
 
Final Report
Final ReportFinal Report
Final Report
 

More from NeyaShree1

WE PPT.pptx
WE PPT.pptxWE PPT.pptx
WE PPT.pptx
NeyaShree1
 
HDI vs GDI-1.pptx
HDI vs GDI-1.pptxHDI vs GDI-1.pptx
HDI vs GDI-1.pptx
NeyaShree1
 
Security overview of cloud computing.pptx
Security overview of cloud computing.pptxSecurity overview of cloud computing.pptx
Security overview of cloud computing.pptx
NeyaShree1
 
IOT.pptx
IOT.pptxIOT.pptx
IOT.pptx
NeyaShree1
 
Edge detection.pptx
Edge detection.pptxEdge detection.pptx
Edge detection.pptx
NeyaShree1
 
Nosql .pptx
Nosql .pptxNosql .pptx
Nosql .pptx
NeyaShree1
 
Cryptography .pptx
Cryptography .pptxCryptography .pptx
Cryptography .pptx
NeyaShree1
 
Dos.pptx
Dos.pptxDos.pptx
Dos.pptx
NeyaShree1
 
Python .pptx
Python .pptxPython .pptx
Python .pptx
NeyaShree1
 
Dsa .pdf
Dsa .pdfDsa .pdf
Dsa .pdf
NeyaShree1
 
Dsa .pdf
Dsa .pdfDsa .pdf
Dsa .pdf
NeyaShree1
 
CD.pptx
CD.pptxCD.pptx
CD.pptx
NeyaShree1
 

More from NeyaShree1 (12)

WE PPT.pptx
WE PPT.pptxWE PPT.pptx
WE PPT.pptx
 
HDI vs GDI-1.pptx
HDI vs GDI-1.pptxHDI vs GDI-1.pptx
HDI vs GDI-1.pptx
 
Security overview of cloud computing.pptx
Security overview of cloud computing.pptxSecurity overview of cloud computing.pptx
Security overview of cloud computing.pptx
 
IOT.pptx
IOT.pptxIOT.pptx
IOT.pptx
 
Edge detection.pptx
Edge detection.pptxEdge detection.pptx
Edge detection.pptx
 
Nosql .pptx
Nosql .pptxNosql .pptx
Nosql .pptx
 
Cryptography .pptx
Cryptography .pptxCryptography .pptx
Cryptography .pptx
 
Dos.pptx
Dos.pptxDos.pptx
Dos.pptx
 
Python .pptx
Python .pptxPython .pptx
Python .pptx
 
Dsa .pdf
Dsa .pdfDsa .pdf
Dsa .pdf
 
Dsa .pdf
Dsa .pdfDsa .pdf
Dsa .pdf
 
CD.pptx
CD.pptxCD.pptx
CD.pptx
 

Recently uploaded

ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
Priyankaranawat4
 
Advantages and Disadvantages of CMS from an SEO Perspective
Advantages and Disadvantages of CMS from an SEO PerspectiveAdvantages and Disadvantages of CMS from an SEO Perspective
Advantages and Disadvantages of CMS from an SEO Perspective
Krisztián Száraz
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
Scholarhat
 
Landownership in the Philippines under the Americans-2-pptx.pptx
Landownership in the Philippines under the Americans-2-pptx.pptxLandownership in the Philippines under the Americans-2-pptx.pptx
Landownership in the Philippines under the Americans-2-pptx.pptx
JezreelCabil2
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
vaibhavrinwa19
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
Top five deadliest dog breeds in America
Top five deadliest dog breeds in AmericaTop five deadliest dog breeds in America
Top five deadliest dog breeds in America
Bisnar Chase Personal Injury Attorneys
 
Best Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDABest Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDA
deeptiverma2406
 
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
RitikBhardwaj56
 
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
tarandeep35
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
heathfieldcps1
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
thanhdowork
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
Delivering Micro-Credentials in Technical and Vocational Education and Training
Delivering Micro-Credentials in Technical and Vocational Education and TrainingDelivering Micro-Credentials in Technical and Vocational Education and Training
Delivering Micro-Credentials in Technical and Vocational Education and Training
AG2 Design
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 

Recently uploaded (20)

ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
 
Advantages and Disadvantages of CMS from an SEO Perspective
Advantages and Disadvantages of CMS from an SEO PerspectiveAdvantages and Disadvantages of CMS from an SEO Perspective
Advantages and Disadvantages of CMS from an SEO Perspective
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
 
Landownership in the Philippines under the Americans-2-pptx.pptx
Landownership in the Philippines under the Americans-2-pptx.pptxLandownership in the Philippines under the Americans-2-pptx.pptx
Landownership in the Philippines under the Americans-2-pptx.pptx
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
Top five deadliest dog breeds in America
Top five deadliest dog breeds in AmericaTop five deadliest dog breeds in America
Top five deadliest dog breeds in America
 
Best Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDABest Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDA
 
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
 
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
Delivering Micro-Credentials in Technical and Vocational Education and Training
Delivering Micro-Credentials in Technical and Vocational Education and TrainingDelivering Micro-Credentials in Technical and Vocational Education and Training
Delivering Micro-Credentials in Technical and Vocational Education and Training
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 

AI and ML.pptx

  • 2. What Is a Regression Model? A regression model provides a function that describes the relationship between one or more independent variables and a response, dependent, or target variable. For example, the relationship between height and weight may be described by a linear regression model.
  • 3. Types of Regression 1. Linear A linear regression is a model where the relationship between inputs and outputs is a straight line. This is the easiest to conceptualize and even observe in the real world. Even when a relationship isn’t very linear, our brains try to see the pattern and attach a rudimentary linear model to that relationship. One example may be around the number of responses to a marketing campaign. If we send 1,000 emails, we may get five responses. If this relationship can be modeled using a linear regression, we would expect to get ten responses when we send 2,000 emails. Your chart may vary, but the general idea is that we associate a predictor and a target, and we assume a relationship between the two.
  • 4. 2. Multiple Multiple regression indicates that there are more than one input variables that may affect the outcome, or target variable. For our email campaign example, you may include an additional variable with the number of emails sent in the last month. By looking at both input variables, a clearer picture starts to emerge about what drives users to respond to a campaign and how to optimize email timing and frequency. While conceptualizing the model becomes more complex with more inputs, the relationship may continue to be linear. For these models, it is important to understand exactly what effect each input has and how they combine to produce the final target variable results.
  • 5. 3. Non-Linear A linear model may work for some parts of the marketing example above. However, we know that as we keep increasing the number of emails in a particular campaign, the number of responses starts to decline vs the number of emails sent. To model this, we need a non-linear regression model. This model will initially show a positive relationship between number of emails and the response, but as the number of emails increases, the model will flatten out and become almost constant. The important takeaway here is that it is important to understand when a model could potentially be non-linear. If you make assumptions based on a linear model, you could get results that are very different than expectations.
  • 6. 4. Stepwise Regression Modeling While the other items we have talked about until now are specific types of models, stepwise regression is more of a technique. If a model involves many potential inputs, the analyst may start with the most directly correlated input variable to build a model. Once that is accomplished, the next step is to make the model more accurate. To do that, additional input variables can be added to the model one at a time in order of significance of the results. Using our email marketing example, the initial model may be based on just the number of emails sent. Then we would add something like the average age of the email recipient. After that, we would add the average number of emails each recipient has received from us. Each additional variable would add a small amount of additional accuracy to the model.
  • 7.
  • 8. Applications of k-NN in machine learning The k-NN algorithm has been utilized within a variety of applications, largely within classification. Some of these use cases include: - Data preprocessing: Datasets frequently have missing values, but the KNN algorithm can estimate for those values in a process known as missing data imputation. - Recommendation Engines: Using clickstream data from websites, the KNN algorithm has been used to provide automatic recommendations to users on additional content. This research (link resides outside of ibm.com) shows that the a user is assigned to a particular group, and based on that group’s user behavior, they are given a recommendation. However, given the scaling issues with KNN, this approach may not be optimal for larger datasets.
  • 9. Advantages - Easy to implement: Given the algorithm’s simplicity and accuracy, it is one of the first classifiers that a new data scientist will learn. - Adapts easily: As new training samples are added, the algorithm adjusts to account for any new data since all training data is stored into memory. - Few hyperparameters: KNN only requires a k value and a distance metric, which is low when compared to other machine learning algorithms.
  • 10. Disadvantages ● Does not scale well ● Curse of dimensionality ● Prone to overfitting