SlideShare a Scribd company logo
1 of 25
Predict Purchasing Intentions
of Online Shoppers
Sean Robertson. Helen Phan
Goals
Predict the shopping intent of
website visitors
๏‚ง Purchase
๏‚ง No purchase
๏‚ง โ€œOnline Shoppers Purchasing Intentionโ€ retrieved from UCI machine learning
repository, data is from Columbia Sportswear Company
๏‚ง No missing data
๏‚ง 50% for training, 50% for validation
Online shopping
sessions
Dataset
12,330
Numerical
variables
10
Categorical
variables
8
Administrative pages
Time spent on
administrative pages
Dataset / 10 numerical variables
Informational pages
Time spent on informational pages
Product related pages
Time spent on product related
pages
Bounce rate
Exit rate
Dataset / 10 numerical variables
Page value
Special day
Dataset / 10 numerical variables
=
๐‘’ โˆ’ ๐‘๐‘œ๐‘š๐‘š๐‘’๐‘Ÿ๐‘๐‘’ ๐‘Ÿ๐‘’๐‘ฃ๐‘’๐‘›๐‘ข๐‘’ + ๐‘ก๐‘œ๐‘ก๐‘Ž๐‘™ ๐‘”๐‘œ๐‘Ž๐‘™ ๐‘ฃ๐‘Ž๐‘™๐‘ข๐‘’
# ๐‘ข๐‘›๐‘–๐‘ž๐‘ข๐‘’ ๐‘๐‘Ž๐‘”๐‘’๐‘ฃ๐‘–๐‘’๐‘ค๐‘  ๐‘“๐‘œ๐‘Ÿ ๐‘”๐‘–๐‘ฃ๐‘’๐‘› ๐‘๐‘Ž๐‘”๐‘’
Dataset / 8 categorical variables
Browser
Region
Operating system Month
Traffic Source
Dataset / 8 categorical variables
84.53%
15.47%
Revenue
No Purchase Purchase
13.74%
85.57%
Visitor Type
New Returning Others
76.74%
23.26%
Weekend
No Yes
Predict purchasing intention
1. Decision Tree
2. Logistic Regression
3. Neural Networks
4. Random Forest
DecisionTree
โ€ข Obtain the maximal decision tree
โ€ข Model overfitting
Optimal: 8-leaf tree
Subtree Assessment Plot
๏‚ง 41 nodes and 8 leaves
๏‚ง Classification conditions
โƒ Page value
โƒ Bounce rate
โƒ Administrative page
โƒ Month
โƒ Time on product related pages
๏‚ง Misclassification rate
โƒ Train data: 9.312%
โƒ Validation data: 9.828%
The optimal tree
Leaf Classification Rules
% of train data % of validation data Purchasing
Intention
Y = False Y = True Y = False Y = True
1 Page value < 0.9805 96.03 3.97 95.96 4.04 No
2
Page value >= 0.9805
Bounce Rates < 0.0001 Administrative <3.5
14.55 85.45 19.72 80.28 Yes
3
Page value >= 0.9805
Bounce Rates < 0.0001
in Mar, Nov
43.49 56.51 38.59 61.41 Yes
4
Page value >= 0.9805 or <21.9121
Bounce Rates < 0.0001
Administrative >=3.5
59.66 40.34 52.63 47.37 No
5
Page value >= 21.9121
Bounce Rates < 0.0001 Administrative >=3.5
26.04 73.96 28.07 71.93 Yes
The optimal tree
The optimal tree
Leaf Classification Rules
% of train data % of validation data
Purchasing
Intention
Y = False Y = True Y = False Y = True
6
Page value >= 0.9805
Bounce Rates < 0.0001
In between May and Oct
Product Related Duration >= 505.4
72.14 27.86 68.06 31.94 No
7
Page value >= 0.9805 or <12.174
Bounce Rates < 0.0001
In between May and Oct
Product Related Duration < 505.4
71.43 28.57 64.29 35.71 No
8
Page value >= 12.174
Bounce Rates < 0.0001
In between May and Oct
Product Related Duration < 505.4
11.54 88.46 28.57 71.43 Yes
Logistic Regression
โ€ข Regression
โ€ข Regression after stepwise variable selection
โ€ข Regression after transformation
โ€ข Regression after transformation, categorial
variables recoded and stepwise variable
selection
Train Validation
11.162
%
11.531%
11.291
%
11.434%
10.188
%
10.282%
10.399
%
10.282%
Time spent on
administrative pages
Time spent on informational
pages
Product related
pages
Time spent on product related
pages
Page value
Log Transformation
Logistic Regression
= -.336 โ€“ 16.86 (ExitRates) โ€“ .339 (Aug) โ€“ .419(Sept)โ€“ .235(Oct) โ€“ .63 (Nov) โ€“ .068(log_administrative duration) + 1.136
(log_pagevalues) โ€“ .258 (New Visitor)
โ€“ .302(traffic type_10) + .359 (traffic type_13) โ€“ .388(traffic type_20) โ€“ .417(traffic type_8)
Logistic Regression
l๐‘œ๐‘”(
๐‘
1 โˆ’ ๐‘
)
Variable Odd ratio % Change
ExitRates 0.001 -100%
Indicator_Aug 0.508 -49%
Indicator_Nov 0.283 -72%
Indicator_Oct 0.625 -38%
Indicator_Sept 0.432 -57%
LOG_Administrative_Duration 0.934 -7%
Variable Odd ratio % Change
LOG_PageValues 3.114 211%
New_Visitor 0.596 -40%
TrafficType_10 0.546 -45%
TrafficType_13 2.05 105%
TrafficType_20 0.46 -54%
TrafficType_8 0.434 -57%
Neural Networks
โ€ข Auto Neural Network after stepwise
โ€ข Auto Neural Network after variable
transformation and stepwise
โ€ข Auto Neural Network after variable
transformation, recoding categorical variables
and stepwise
Train Validation
9.71% 10.08%
9.79% 9.87%
10.61% 10.23%
Neural Networks
โ€ข Neural Network (top) had 199 weights/parameters and
19 iterations to optimize Average Squared Error and
Misclassification Rate
โ€ข Auto-Neural Network (bottom) had 100 weights and 3
hidden units
โ€ข Both models are large as a result of 18 variables in the
model
Neural Networks
Distribution of Weights
Random Forest
โ€ข Random Forest
โ€ข Random Forest after stepwise
โ€ข Random Forest after variable transformation
โ€ข Random Forest after variable transformation and
stepwise
โ€ข Random Forest after variable transformation,
recoding categorical variables and stepwise
Train Validation
9.61% 10.88%
9.08% 10.1%
9.94% 10.88%
9.68% 10.78%
8.19% 10.5%
Random Forest
โ€ข Misclassification Rate of training data converged to
9.94%
โ€ข Misclassification Rate of validation data converged to
10.9%
โ€ข No significant difference when using a forest larger than
the default of 100 trees
Conclusion
# Model MISCTrain MISCValidation
1 DecisionTree
9.312% 9.828%
2
Regression after variable transformation, recoding categorical
variables and stepwise 10.399% 10.282%
3
Auto Neural Network after variable transformation and
stepwise 9.798% 9.876%
4 Random Forest after stepwise
9.085% 10.104%
Conclusion
When do people purchase?
โ€ข Page value >= 0.9805 and Bounce Rates < 0.0001 and Administrative <3.5
โ€ข Page value >= 0.9805 and Bounce Rates < 0.0001 and in Mar, Nov
โ€ข Page value >= 21.9121 and Bounce Rates < 0.0001 and Administrative >=3.5
โ€ข Page value >= 12.174 and Bounce Rates < 0.0001 and In between May and Oct and Product Related Duration < 505.
โ€ข Visit the page that had high average value (Page value >= 0.9805)
โ€ข When they did not bounce from the site (Bounce Rates < 0.0001)
โ€ข Visit to their account management pages was low (<4), or if high was also paired with even higher average value (Page value
>= 21.9805)
Thank you!

More Related Content

What's hot

Module 4: Model Selection and Evaluation
Module 4: Model Selection and EvaluationModule 4: Model Selection and Evaluation
Module 4: Model Selection and EvaluationSara Hooker
ย 
Data visualization
Data visualizationData visualization
Data visualizationHoang Nguyen
ย 
Machine learning in retail
Machine learning in retail Machine learning in retail
Machine learning in retail Capgemini
ย 
Credit card payment_fraud_detection
Credit card payment_fraud_detectionCredit card payment_fraud_detection
Credit card payment_fraud_detectionPEIPEI HAN
ย 
Logistic regression
Logistic regressionLogistic regression
Logistic regressionYashwantGahlot1
ย 
datamarts.ppt
datamarts.pptdatamarts.ppt
datamarts.pptbhavyag24
ย 
Vehicle Routing Problem.ppt
Vehicle Routing Problem.pptVehicle Routing Problem.ppt
Vehicle Routing Problem.pptHassanHani5
ย 
Predicting Employee Attrition
Predicting Employee AttritionPredicting Employee Attrition
Predicting Employee AttritionMohamad Sahil
ย 
Exploratory data analysis data visualization
Exploratory data analysis data visualizationExploratory data analysis data visualization
Exploratory data analysis data visualizationDr. Hamdan Al-Sabri
ย 
Credit Card Fraudulent Transaction Detection Research Paper
Credit Card Fraudulent Transaction Detection Research PaperCredit Card Fraudulent Transaction Detection Research Paper
Credit Card Fraudulent Transaction Detection Research PaperGarvit Burad
ย 
Confusion Matrix
Confusion MatrixConfusion Matrix
Confusion MatrixRajat Gupta
ย 
Data analytics telecom churn final ppt
Data analytics telecom churn final ppt Data analytics telecom churn final ppt
Data analytics telecom churn final ppt Gunvansh Khanna
ย 
Machine Learning Overview.pptx
Machine Learning Overview.pptxMachine Learning Overview.pptx
Machine Learning Overview.pptxRushikeshChikane2
ย 
02 data
02 data02 data
02 dataphakhwan22
ย 
Flight Delay Prediction
Flight Delay PredictionFlight Delay Prediction
Flight Delay PredictionVyshak Srishylappa
ย 
K-Folds Cross Validation Method
K-Folds Cross Validation MethodK-Folds Cross Validation Method
K-Folds Cross Validation MethodSHUBHAM GUPTA
ย 
Machine learning interview questions and answers
Machine learning interview questions and answersMachine learning interview questions and answers
Machine learning interview questions and answerskavinilavuG
ย 
End-to-End Machine Learning Project
End-to-End Machine Learning ProjectEnd-to-End Machine Learning Project
End-to-End Machine Learning ProjectEng Teong Cheah
ย 

What's hot (20)

Module 4: Model Selection and Evaluation
Module 4: Model Selection and EvaluationModule 4: Model Selection and Evaluation
Module 4: Model Selection and Evaluation
ย 
Data visualization
Data visualizationData visualization
Data visualization
ย 
Machine learning in retail
Machine learning in retail Machine learning in retail
Machine learning in retail
ย 
Credit card payment_fraud_detection
Credit card payment_fraud_detectionCredit card payment_fraud_detection
Credit card payment_fraud_detection
ย 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
ย 
datamarts.ppt
datamarts.pptdatamarts.ppt
datamarts.ppt
ย 
Vehicle Routing Problem.ppt
Vehicle Routing Problem.pptVehicle Routing Problem.ppt
Vehicle Routing Problem.ppt
ย 
Predicting Employee Attrition
Predicting Employee AttritionPredicting Employee Attrition
Predicting Employee Attrition
ย 
Exploratory data analysis data visualization
Exploratory data analysis data visualizationExploratory data analysis data visualization
Exploratory data analysis data visualization
ย 
Credit Card Fraudulent Transaction Detection Research Paper
Credit Card Fraudulent Transaction Detection Research PaperCredit Card Fraudulent Transaction Detection Research Paper
Credit Card Fraudulent Transaction Detection Research Paper
ย 
Confusion Matrix
Confusion MatrixConfusion Matrix
Confusion Matrix
ย 
Customer churn prediction in banking
Customer churn prediction in bankingCustomer churn prediction in banking
Customer churn prediction in banking
ย 
Data analytics telecom churn final ppt
Data analytics telecom churn final ppt Data analytics telecom churn final ppt
Data analytics telecom churn final ppt
ย 
Machine Learning Overview.pptx
Machine Learning Overview.pptxMachine Learning Overview.pptx
Machine Learning Overview.pptx
ย 
Auxis Webinar: Diving into RPA
Auxis Webinar: Diving into RPAAuxis Webinar: Diving into RPA
Auxis Webinar: Diving into RPA
ย 
02 data
02 data02 data
02 data
ย 
Flight Delay Prediction
Flight Delay PredictionFlight Delay Prediction
Flight Delay Prediction
ย 
K-Folds Cross Validation Method
K-Folds Cross Validation MethodK-Folds Cross Validation Method
K-Folds Cross Validation Method
ย 
Machine learning interview questions and answers
Machine learning interview questions and answersMachine learning interview questions and answers
Machine learning interview questions and answers
ย 
End-to-End Machine Learning Project
End-to-End Machine Learning ProjectEnd-to-End Machine Learning Project
End-to-End Machine Learning Project
ย 

Similar to Predict online shoppers' intentions

Presentation top tips for getting optimal sql execution
Presentation    top tips for getting optimal sql executionPresentation    top tips for getting optimal sql execution
Presentation top tips for getting optimal sql executionxKinAnx
ย 
Smarter Supplier Management โ€“ Improving Supplier Performance Through Better C...
Smarter Supplier Management โ€“ Improving Supplier Performance Through Better C...Smarter Supplier Management โ€“ Improving Supplier Performance Through Better C...
Smarter Supplier Management โ€“ Improving Supplier Performance Through Better C...SAP Ariba
ย 
Predictive Conversion Modeling - Lifting Web Analytics to the next level
Predictive Conversion Modeling - Lifting Web Analytics to the next levelPredictive Conversion Modeling - Lifting Web Analytics to the next level
Predictive Conversion Modeling - Lifting Web Analytics to the next levelPetri Mertanen
ย 
Cost-per-use vs. hours-per-report: usage data collection and the value of sta...
Cost-per-use vs. hours-per-report: usage data collection and the value of sta...Cost-per-use vs. hours-per-report: usage data collection and the value of sta...
Cost-per-use vs. hours-per-report: usage data collection and the value of sta...NASIG
ย 
Randleman Presentation
Randleman PresentationRandleman Presentation
Randleman Presentationbrandleman
ย 
Common Errors in ML
Common Errors in MLCommon Errors in ML
Common Errors in MLKyle Polich
ย 
Common Errors in ML
Common Errors in MLCommon Errors in ML
Common Errors in MLKyle Polich
ย 
Digitalization in electronics manufacturing
Digitalization in electronics manufacturingDigitalization in electronics manufacturing
Digitalization in electronics manufacturingTom Arne Danielsen
ย 
Predictive Analysis
Predictive AnalysisPredictive Analysis
Predictive AnalysisMichael Bystry
ย 
Business and Data Analytics Collaborative April Meetup
Business and Data Analytics Collaborative April MeetupBusiness and Data Analytics Collaborative April Meetup
Business and Data Analytics Collaborative April MeetupKen Tucker
ย 
Measuring the Health of XaaS Subscription Renewals
Measuring the Health of XaaS Subscription RenewalsMeasuring the Health of XaaS Subscription Renewals
Measuring the Health of XaaS Subscription RenewalsTSIA
ย 
1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptopRising Media, Inc.
ย 
How to Enter the Data Analytics Industry?
How to Enter the Data Analytics Industry?How to Enter the Data Analytics Industry?
How to Enter the Data Analytics Industry?Ganes Kesari
ย 
Landing Page Optimization: 6 Years of Testing Distilled Into 5 Critical Lessons
Landing Page Optimization: 6 Years of Testing Distilled Into 5 Critical LessonsLanding Page Optimization: 6 Years of Testing Distilled Into 5 Critical Lessons
Landing Page Optimization: 6 Years of Testing Distilled Into 5 Critical LessonsTaboola
ย 
customer_profiling_based_on_fuzzy_principals_linkedin
customer_profiling_based_on_fuzzy_principals_linkedincustomer_profiling_based_on_fuzzy_principals_linkedin
customer_profiling_based_on_fuzzy_principals_linkedinAsoka Korale
ย 
Predicting Helpfulness of User-Generated Product Reviews Through Analytical M...
Predicting Helpfulness of User-Generated Product Reviews Through Analytical M...Predicting Helpfulness of User-Generated Product Reviews Through Analytical M...
Predicting Helpfulness of User-Generated Product Reviews Through Analytical M...Ankita Kaul
ย 
Logistics Mgt Webcast
Logistics Mgt WebcastLogistics Mgt Webcast
Logistics Mgt Webcastbrandleman
ย 
BDW Chicago 2016 - Don Deloach, CEO and President, Infobright - Rethinking Ar...
BDW Chicago 2016 - Don Deloach, CEO and President, Infobright - Rethinking Ar...BDW Chicago 2016 - Don Deloach, CEO and President, Infobright - Rethinking Ar...
BDW Chicago 2016 - Don Deloach, CEO and President, Infobright - Rethinking Ar...Big Data Week
ย 

Similar to Predict online shoppers' intentions (20)

Presentation top tips for getting optimal sql execution
Presentation    top tips for getting optimal sql executionPresentation    top tips for getting optimal sql execution
Presentation top tips for getting optimal sql execution
ย 
Smarter Supplier Management โ€“ Improving Supplier Performance Through Better C...
Smarter Supplier Management โ€“ Improving Supplier Performance Through Better C...Smarter Supplier Management โ€“ Improving Supplier Performance Through Better C...
Smarter Supplier Management โ€“ Improving Supplier Performance Through Better C...
ย 
Predictive Conversion Modeling - Lifting Web Analytics to the next level
Predictive Conversion Modeling - Lifting Web Analytics to the next levelPredictive Conversion Modeling - Lifting Web Analytics to the next level
Predictive Conversion Modeling - Lifting Web Analytics to the next level
ย 
Cost-per-use vs. hours-per-report: usage data collection and the value of sta...
Cost-per-use vs. hours-per-report: usage data collection and the value of sta...Cost-per-use vs. hours-per-report: usage data collection and the value of sta...
Cost-per-use vs. hours-per-report: usage data collection and the value of sta...
ย 
Randleman Presentation
Randleman PresentationRandleman Presentation
Randleman Presentation
ย 
Pricing strategies 2017 nz
Pricing strategies 2017 nzPricing strategies 2017 nz
Pricing strategies 2017 nz
ย 
Common Errors in ML
Common Errors in MLCommon Errors in ML
Common Errors in ML
ย 
Common Errors in ML
Common Errors in MLCommon Errors in ML
Common Errors in ML
ย 
Digitalization in electronics manufacturing
Digitalization in electronics manufacturingDigitalization in electronics manufacturing
Digitalization in electronics manufacturing
ย 
Predictive Analysis
Predictive AnalysisPredictive Analysis
Predictive Analysis
ย 
Business and Data Analytics Collaborative April Meetup
Business and Data Analytics Collaborative April MeetupBusiness and Data Analytics Collaborative April Meetup
Business and Data Analytics Collaborative April Meetup
ย 
Modul 1.3: Goals & Benefits
Modul 1.3: Goals & BenefitsModul 1.3: Goals & Benefits
Modul 1.3: Goals & Benefits
ย 
Measuring the Health of XaaS Subscription Renewals
Measuring the Health of XaaS Subscription RenewalsMeasuring the Health of XaaS Subscription Renewals
Measuring the Health of XaaS Subscription Renewals
ย 
1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop
ย 
How to Enter the Data Analytics Industry?
How to Enter the Data Analytics Industry?How to Enter the Data Analytics Industry?
How to Enter the Data Analytics Industry?
ย 
Landing Page Optimization: 6 Years of Testing Distilled Into 5 Critical Lessons
Landing Page Optimization: 6 Years of Testing Distilled Into 5 Critical LessonsLanding Page Optimization: 6 Years of Testing Distilled Into 5 Critical Lessons
Landing Page Optimization: 6 Years of Testing Distilled Into 5 Critical Lessons
ย 
customer_profiling_based_on_fuzzy_principals_linkedin
customer_profiling_based_on_fuzzy_principals_linkedincustomer_profiling_based_on_fuzzy_principals_linkedin
customer_profiling_based_on_fuzzy_principals_linkedin
ย 
Predicting Helpfulness of User-Generated Product Reviews Through Analytical M...
Predicting Helpfulness of User-Generated Product Reviews Through Analytical M...Predicting Helpfulness of User-Generated Product Reviews Through Analytical M...
Predicting Helpfulness of User-Generated Product Reviews Through Analytical M...
ย 
Logistics Mgt Webcast
Logistics Mgt WebcastLogistics Mgt Webcast
Logistics Mgt Webcast
ย 
BDW Chicago 2016 - Don Deloach, CEO and President, Infobright - Rethinking Ar...
BDW Chicago 2016 - Don Deloach, CEO and President, Infobright - Rethinking Ar...BDW Chicago 2016 - Don Deloach, CEO and President, Infobright - Rethinking Ar...
BDW Chicago 2016 - Don Deloach, CEO and President, Infobright - Rethinking Ar...
ย 

Recently uploaded

Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
ย 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
ย 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
ย 
Delhi Call Girls CP 9711199171 โ˜Žโœ”๐Ÿ‘Œโœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 โ˜Žโœ”๐Ÿ‘Œโœ” Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 โ˜Žโœ”๐Ÿ‘Œโœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 โ˜Žโœ”๐Ÿ‘Œโœ” Whatsapp Hard And Sexy Vip Callshivangimorya083
ย 
Data Warehouse , Data Cube Computation
Data Warehouse   , Data Cube ComputationData Warehouse   , Data Cube Computation
Data Warehouse , Data Cube Computationsit20ad004
ย 
Call Girls in Defence Colony Delhi ๐Ÿ’ฏCall Us ๐Ÿ”8264348440๐Ÿ”
Call Girls in Defence Colony Delhi ๐Ÿ’ฏCall Us ๐Ÿ”8264348440๐Ÿ”Call Girls in Defence Colony Delhi ๐Ÿ’ฏCall Us ๐Ÿ”8264348440๐Ÿ”
Call Girls in Defence Colony Delhi ๐Ÿ’ฏCall Us ๐Ÿ”8264348440๐Ÿ”soniya singh
ย 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
ย 
๊งโค Aerocity Call Girls Service Aerocity Delhi โค๊ง‚ 9999965857 โ˜Ž๏ธ Hard And Sexy ...
๊งโค Aerocity Call Girls Service Aerocity Delhi โค๊ง‚ 9999965857 โ˜Ž๏ธ Hard And Sexy ...๊งโค Aerocity Call Girls Service Aerocity Delhi โค๊ง‚ 9999965857 โ˜Ž๏ธ Hard And Sexy ...
๊งโค Aerocity Call Girls Service Aerocity Delhi โค๊ง‚ 9999965857 โ˜Ž๏ธ Hard And Sexy ...Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure
ย 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
ย 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
ย 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
ย 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...Suhani Kapoor
ย 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
ย 
ไปฃๅŠžๅ›ฝๅค–ๅคงๅญฆๆ–‡ๅ‡ญใ€ŠๅŽŸ็‰ˆ็พŽๅ›ฝUCLAๆ–‡ๅ‡ญ่ฏไนฆใ€‹ๅŠ ๅทžๅคงๅญฆๆด›ๆ‰็Ÿถๅˆ†ๆ กๆฏ•ไธš่ฏๅˆถไฝœๆˆ็ปฉๅ•ไฟฎๆ”น
ไปฃๅŠžๅ›ฝๅค–ๅคงๅญฆๆ–‡ๅ‡ญใ€ŠๅŽŸ็‰ˆ็พŽๅ›ฝUCLAๆ–‡ๅ‡ญ่ฏไนฆใ€‹ๅŠ ๅทžๅคงๅญฆๆด›ๆ‰็Ÿถๅˆ†ๆ กๆฏ•ไธš่ฏๅˆถไฝœๆˆ็ปฉๅ•ไฟฎๆ”นไปฃๅŠžๅ›ฝๅค–ๅคงๅญฆๆ–‡ๅ‡ญใ€ŠๅŽŸ็‰ˆ็พŽๅ›ฝUCLAๆ–‡ๅ‡ญ่ฏไนฆใ€‹ๅŠ ๅทžๅคงๅญฆๆด›ๆ‰็Ÿถๅˆ†ๆ กๆฏ•ไธš่ฏๅˆถไฝœๆˆ็ปฉๅ•ไฟฎๆ”น
ไปฃๅŠžๅ›ฝๅค–ๅคงๅญฆๆ–‡ๅ‡ญใ€ŠๅŽŸ็‰ˆ็พŽๅ›ฝUCLAๆ–‡ๅ‡ญ่ฏไนฆใ€‹ๅŠ ๅทžๅคงๅญฆๆด›ๆ‰็Ÿถๅˆ†ๆ กๆฏ•ไธš่ฏๅˆถไฝœๆˆ็ปฉๅ•ไฟฎๆ”นatducpo
ย 
Full night ๐Ÿฅต Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy โœŒ๏ธo...
Full night ๐Ÿฅต Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy โœŒ๏ธo...Full night ๐Ÿฅต Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy โœŒ๏ธo...
Full night ๐Ÿฅต Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy โœŒ๏ธo...shivangimorya083
ย 
Delhi Call Girls Punjabi Bagh 9711199171 โ˜Žโœ”๐Ÿ‘Œโœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 โ˜Žโœ”๐Ÿ‘Œโœ” Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 โ˜Žโœ”๐Ÿ‘Œโœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 โ˜Žโœ”๐Ÿ‘Œโœ” Whatsapp Hard And Sexy Vip Callshivangimorya083
ย 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
ย 

Recently uploaded (20)

Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
ย 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
ย 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
ย 
Delhi Call Girls CP 9711199171 โ˜Žโœ”๐Ÿ‘Œโœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 โ˜Žโœ”๐Ÿ‘Œโœ” Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 โ˜Žโœ”๐Ÿ‘Œโœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 โ˜Žโœ”๐Ÿ‘Œโœ” Whatsapp Hard And Sexy Vip Call
ย 
Data Warehouse , Data Cube Computation
Data Warehouse   , Data Cube ComputationData Warehouse   , Data Cube Computation
Data Warehouse , Data Cube Computation
ย 
Russian Call Girls Dwarka Sector 15 ๐Ÿ’“ Delhi 9999965857 @Sabina Modi VVIP MODE...
Russian Call Girls Dwarka Sector 15 ๐Ÿ’“ Delhi 9999965857 @Sabina Modi VVIP MODE...Russian Call Girls Dwarka Sector 15 ๐Ÿ’“ Delhi 9999965857 @Sabina Modi VVIP MODE...
Russian Call Girls Dwarka Sector 15 ๐Ÿ’“ Delhi 9999965857 @Sabina Modi VVIP MODE...
ย 
Call Girls in Defence Colony Delhi ๐Ÿ’ฏCall Us ๐Ÿ”8264348440๐Ÿ”
Call Girls in Defence Colony Delhi ๐Ÿ’ฏCall Us ๐Ÿ”8264348440๐Ÿ”Call Girls in Defence Colony Delhi ๐Ÿ’ฏCall Us ๐Ÿ”8264348440๐Ÿ”
Call Girls in Defence Colony Delhi ๐Ÿ’ฏCall Us ๐Ÿ”8264348440๐Ÿ”
ย 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
ย 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
ย 
๊งโค Aerocity Call Girls Service Aerocity Delhi โค๊ง‚ 9999965857 โ˜Ž๏ธ Hard And Sexy ...
๊งโค Aerocity Call Girls Service Aerocity Delhi โค๊ง‚ 9999965857 โ˜Ž๏ธ Hard And Sexy ...๊งโค Aerocity Call Girls Service Aerocity Delhi โค๊ง‚ 9999965857 โ˜Ž๏ธ Hard And Sexy ...
๊งโค Aerocity Call Girls Service Aerocity Delhi โค๊ง‚ 9999965857 โ˜Ž๏ธ Hard And Sexy ...
ย 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
ย 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
ย 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
ย 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
ย 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
ย 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
ย 
ไปฃๅŠžๅ›ฝๅค–ๅคงๅญฆๆ–‡ๅ‡ญใ€ŠๅŽŸ็‰ˆ็พŽๅ›ฝUCLAๆ–‡ๅ‡ญ่ฏไนฆใ€‹ๅŠ ๅทžๅคงๅญฆๆด›ๆ‰็Ÿถๅˆ†ๆ กๆฏ•ไธš่ฏๅˆถไฝœๆˆ็ปฉๅ•ไฟฎๆ”น
ไปฃๅŠžๅ›ฝๅค–ๅคงๅญฆๆ–‡ๅ‡ญใ€ŠๅŽŸ็‰ˆ็พŽๅ›ฝUCLAๆ–‡ๅ‡ญ่ฏไนฆใ€‹ๅŠ ๅทžๅคงๅญฆๆด›ๆ‰็Ÿถๅˆ†ๆ กๆฏ•ไธš่ฏๅˆถไฝœๆˆ็ปฉๅ•ไฟฎๆ”นไปฃๅŠžๅ›ฝๅค–ๅคงๅญฆๆ–‡ๅ‡ญใ€ŠๅŽŸ็‰ˆ็พŽๅ›ฝUCLAๆ–‡ๅ‡ญ่ฏไนฆใ€‹ๅŠ ๅทžๅคงๅญฆๆด›ๆ‰็Ÿถๅˆ†ๆ กๆฏ•ไธš่ฏๅˆถไฝœๆˆ็ปฉๅ•ไฟฎๆ”น
ไปฃๅŠžๅ›ฝๅค–ๅคงๅญฆๆ–‡ๅ‡ญใ€ŠๅŽŸ็‰ˆ็พŽๅ›ฝUCLAๆ–‡ๅ‡ญ่ฏไนฆใ€‹ๅŠ ๅทžๅคงๅญฆๆด›ๆ‰็Ÿถๅˆ†ๆ กๆฏ•ไธš่ฏๅˆถไฝœๆˆ็ปฉๅ•ไฟฎๆ”น
ย 
Full night ๐Ÿฅต Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy โœŒ๏ธo...
Full night ๐Ÿฅต Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy โœŒ๏ธo...Full night ๐Ÿฅต Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy โœŒ๏ธo...
Full night ๐Ÿฅต Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy โœŒ๏ธo...
ย 
Delhi Call Girls Punjabi Bagh 9711199171 โ˜Žโœ”๐Ÿ‘Œโœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 โ˜Žโœ”๐Ÿ‘Œโœ” Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 โ˜Žโœ”๐Ÿ‘Œโœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 โ˜Žโœ”๐Ÿ‘Œโœ” Whatsapp Hard And Sexy Vip Call
ย 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
ย 

Predict online shoppers' intentions

  • 1. Predict Purchasing Intentions of Online Shoppers Sean Robertson. Helen Phan
  • 2. Goals Predict the shopping intent of website visitors ๏‚ง Purchase ๏‚ง No purchase
  • 3. ๏‚ง โ€œOnline Shoppers Purchasing Intentionโ€ retrieved from UCI machine learning repository, data is from Columbia Sportswear Company ๏‚ง No missing data ๏‚ง 50% for training, 50% for validation Online shopping sessions Dataset 12,330 Numerical variables 10 Categorical variables 8
  • 4. Administrative pages Time spent on administrative pages Dataset / 10 numerical variables Informational pages Time spent on informational pages Product related pages Time spent on product related pages
  • 5. Bounce rate Exit rate Dataset / 10 numerical variables
  • 6. Page value Special day Dataset / 10 numerical variables = ๐‘’ โˆ’ ๐‘๐‘œ๐‘š๐‘š๐‘’๐‘Ÿ๐‘๐‘’ ๐‘Ÿ๐‘’๐‘ฃ๐‘’๐‘›๐‘ข๐‘’ + ๐‘ก๐‘œ๐‘ก๐‘Ž๐‘™ ๐‘”๐‘œ๐‘Ž๐‘™ ๐‘ฃ๐‘Ž๐‘™๐‘ข๐‘’ # ๐‘ข๐‘›๐‘–๐‘ž๐‘ข๐‘’ ๐‘๐‘Ž๐‘”๐‘’๐‘ฃ๐‘–๐‘’๐‘ค๐‘  ๐‘“๐‘œ๐‘Ÿ ๐‘”๐‘–๐‘ฃ๐‘’๐‘› ๐‘๐‘Ž๐‘”๐‘’
  • 7. Dataset / 8 categorical variables Browser Region Operating system Month Traffic Source
  • 8. Dataset / 8 categorical variables 84.53% 15.47% Revenue No Purchase Purchase 13.74% 85.57% Visitor Type New Returning Others 76.74% 23.26% Weekend No Yes
  • 9. Predict purchasing intention 1. Decision Tree 2. Logistic Regression 3. Neural Networks 4. Random Forest
  • 10. DecisionTree โ€ข Obtain the maximal decision tree โ€ข Model overfitting Optimal: 8-leaf tree Subtree Assessment Plot
  • 11. ๏‚ง 41 nodes and 8 leaves ๏‚ง Classification conditions โƒ Page value โƒ Bounce rate โƒ Administrative page โƒ Month โƒ Time on product related pages ๏‚ง Misclassification rate โƒ Train data: 9.312% โƒ Validation data: 9.828% The optimal tree
  • 12. Leaf Classification Rules % of train data % of validation data Purchasing Intention Y = False Y = True Y = False Y = True 1 Page value < 0.9805 96.03 3.97 95.96 4.04 No 2 Page value >= 0.9805 Bounce Rates < 0.0001 Administrative <3.5 14.55 85.45 19.72 80.28 Yes 3 Page value >= 0.9805 Bounce Rates < 0.0001 in Mar, Nov 43.49 56.51 38.59 61.41 Yes 4 Page value >= 0.9805 or <21.9121 Bounce Rates < 0.0001 Administrative >=3.5 59.66 40.34 52.63 47.37 No 5 Page value >= 21.9121 Bounce Rates < 0.0001 Administrative >=3.5 26.04 73.96 28.07 71.93 Yes The optimal tree
  • 13. The optimal tree Leaf Classification Rules % of train data % of validation data Purchasing Intention Y = False Y = True Y = False Y = True 6 Page value >= 0.9805 Bounce Rates < 0.0001 In between May and Oct Product Related Duration >= 505.4 72.14 27.86 68.06 31.94 No 7 Page value >= 0.9805 or <12.174 Bounce Rates < 0.0001 In between May and Oct Product Related Duration < 505.4 71.43 28.57 64.29 35.71 No 8 Page value >= 12.174 Bounce Rates < 0.0001 In between May and Oct Product Related Duration < 505.4 11.54 88.46 28.57 71.43 Yes
  • 14. Logistic Regression โ€ข Regression โ€ข Regression after stepwise variable selection โ€ข Regression after transformation โ€ข Regression after transformation, categorial variables recoded and stepwise variable selection Train Validation 11.162 % 11.531% 11.291 % 11.434% 10.188 % 10.282% 10.399 % 10.282%
  • 15. Time spent on administrative pages Time spent on informational pages Product related pages Time spent on product related pages Page value Log Transformation
  • 17. = -.336 โ€“ 16.86 (ExitRates) โ€“ .339 (Aug) โ€“ .419(Sept)โ€“ .235(Oct) โ€“ .63 (Nov) โ€“ .068(log_administrative duration) + 1.136 (log_pagevalues) โ€“ .258 (New Visitor) โ€“ .302(traffic type_10) + .359 (traffic type_13) โ€“ .388(traffic type_20) โ€“ .417(traffic type_8) Logistic Regression l๐‘œ๐‘”( ๐‘ 1 โˆ’ ๐‘ ) Variable Odd ratio % Change ExitRates 0.001 -100% Indicator_Aug 0.508 -49% Indicator_Nov 0.283 -72% Indicator_Oct 0.625 -38% Indicator_Sept 0.432 -57% LOG_Administrative_Duration 0.934 -7% Variable Odd ratio % Change LOG_PageValues 3.114 211% New_Visitor 0.596 -40% TrafficType_10 0.546 -45% TrafficType_13 2.05 105% TrafficType_20 0.46 -54% TrafficType_8 0.434 -57%
  • 18. Neural Networks โ€ข Auto Neural Network after stepwise โ€ข Auto Neural Network after variable transformation and stepwise โ€ข Auto Neural Network after variable transformation, recoding categorical variables and stepwise Train Validation 9.71% 10.08% 9.79% 9.87% 10.61% 10.23%
  • 19. Neural Networks โ€ข Neural Network (top) had 199 weights/parameters and 19 iterations to optimize Average Squared Error and Misclassification Rate โ€ข Auto-Neural Network (bottom) had 100 weights and 3 hidden units โ€ข Both models are large as a result of 18 variables in the model
  • 21. Random Forest โ€ข Random Forest โ€ข Random Forest after stepwise โ€ข Random Forest after variable transformation โ€ข Random Forest after variable transformation and stepwise โ€ข Random Forest after variable transformation, recoding categorical variables and stepwise Train Validation 9.61% 10.88% 9.08% 10.1% 9.94% 10.88% 9.68% 10.78% 8.19% 10.5%
  • 22. Random Forest โ€ข Misclassification Rate of training data converged to 9.94% โ€ข Misclassification Rate of validation data converged to 10.9% โ€ข No significant difference when using a forest larger than the default of 100 trees
  • 23. Conclusion # Model MISCTrain MISCValidation 1 DecisionTree 9.312% 9.828% 2 Regression after variable transformation, recoding categorical variables and stepwise 10.399% 10.282% 3 Auto Neural Network after variable transformation and stepwise 9.798% 9.876% 4 Random Forest after stepwise 9.085% 10.104%
  • 24. Conclusion When do people purchase? โ€ข Page value >= 0.9805 and Bounce Rates < 0.0001 and Administrative <3.5 โ€ข Page value >= 0.9805 and Bounce Rates < 0.0001 and in Mar, Nov โ€ข Page value >= 21.9121 and Bounce Rates < 0.0001 and Administrative >=3.5 โ€ข Page value >= 12.174 and Bounce Rates < 0.0001 and In between May and Oct and Product Related Duration < 505. โ€ข Visit the page that had high average value (Page value >= 0.9805) โ€ข When they did not bounce from the site (Bounce Rates < 0.0001) โ€ข Visit to their account management pages was low (<4), or if high was also paired with even higher average value (Page value >= 21.9805)