SlideShare a Scribd company logo
Maisam Shahid Wasti and Dennis Hartono
Biking wrong way at Trousdale

We found that it is possible to predict wrong way
violations at Trousdale Parkway, USC Campus
Data Collection

Collected 14 hours of data with total sample size of 2837
Decision Rule to classify violators

Established a consistent decision rule for sample
validation
Overview of the observation site

Five minutes slot following class ending times
Observed higher proportion of violators for few minutes
after the classes end
Interpretation of important variables
Name: 5min_after

Five minutes slot following class ending times

Name
Type

'5min_after'
Binary

Description

Counted '1' if sample observed within 5 minutes slot following the class ending times
Interpretation of important variables

Used ‘bodyweight’ as a binary measure for speed
Approach to model selection
All non-interaction terms
Backward-LR

LOGISTIC REGRESSION

Initial Model
with selective non interaction terms
Backward-LR

+ (n,C,2) second order
terms

Intermediate Model
Filtration

Refined our model
in three stages

Final Model
after removing terms causing
Multicollinearity
Variables in final model
Variables

Significance

Gender

.003

Bag * Sportswear

.010

Bag * Bodyweight

.002

Bag * 5min_after

.001

Non-Interaction Terms
Interaction Terms

Found significant independent variables
We interpret bag as an indicator for student on campus
Model evaluation
Test

Statistic

Significance

Omnibus

33.518

0.000

Cox and Snell R2

0.015

Nagelkerke R2

0.024

• Observed significant improvement in Log-Likelihood
through Omnibus test
• Model suffered from low R2 values
Correlation Matrix

Gender

Bag * Sportswear

Bag * Body_weight

Bag * 5min_after

Multicollinearity Test 1

Gender
Bag * Sportswear
Bag * Body_weight
Bag * 5min_after

1.000
-.085
-.186
.041

-.085
1.000
-.028
-.015

-.186
-.028
1.000
-.007

.041
-.015
-.007
1.000

Found no serious multi-collinearity issues (>0.3)
with highest correlation coefficient of magnitude 0.186
Multicollinearity Test 2
Variables

B

S.E.

Wald

Sig.

Exp(B)

Gender

-.335

.112

8.936

.003

.715

Bag * Sportswear

.667

.258

6.671

.010

1.949

Bag * Bodyweight

.798

.258

9.605

.002

2.222

Bag * 5min_after

.409

.120

11.533

.001

1.506

Observed Standard Errors to be bounded by maximum
of 0.258
Residual Analysis

Observed no residuals lying above 2 standard deviation
Challenges with Classification Accuracy
- Have a skewed class distribution
2500

81.7 %

2000

2315

1500
1000

500

522

18.3 %

0
Violators

Non-Violators

- Resulting in high baseline accuracy
- Difficult to improve much from the high baseline accuracy
Predicted Probabilities Histograms

Violators

Non-Violators

- Observed significant overlap
- The default 0.5 gave a bad cut-off threshold
Classification Tables
Ground Truth

Baseline Classification Table
Prediction
Wrong Way Violation
Not Violating Violating

Percentage
Correct

Not Violating

465

0

100.0

Violating

107

0

0.0

Total

572

0

81.3

Ground Truth

Classification Table with 0.35 Cut-off
Prediction
Wrong Way Violation
Not Violating Violating

Percentage
Correct

Not Violating

463

2

99.6

Violating

105

2

1.9

Total

568

4

81.3

The overall classification accuracy remains the same with
increased prediction power for violations
The ROC Graph

Observed to be better at predicting violations than the
baseline at Cut-off = 0.35
Multiway Cross-tabulation tests
Wrong_Way_Violation * Formal_Dressing Crosstabulation
Count
Formal_Dressing

Wrong_Way_Violation

Not Violating
Violating

47

Total
2315

510

12

522

59

2837

16

Total
2315

Wrong_Way_Violation * Helmet Crosstabulation

Count

Count

Wrong_Way_Violation

Not Violating
Violating

Total

In formal
dress

2778

Total

Wrong_Way_Violation * Food_or_Beverages Crosstabulation

Not in formal
dress
2268

Food_or_Beverages
Without food
With food or
or beverage
beverage
2278
37
511
2789

11
48

Helmet

Total
2315

Wrong_Way_Violation

Violating

Wearing
helmet

Total

520

2

522

2819

522
2837

Not Violating

Not wearing
helmet
2299

18

2837

29

Total
2315

Wrong_Way_Violation * Hoodie Crosstabulation
Count
Hoodie

Wrong_Way_Violation

Not Violating
Violating

Total

Not wearing
hoodie
2286

Wearing
hoodie

517

5

522

2803

34

2837

Lacking significant number of violators for few cases
Other classifiers
Classifier

Accuracy %

Baseline

81.3

Logistic Regression

81.3

Parzen windows

81.64

Linear Perceptron

81.29

K-Nearest Neighbors

81.29

Experimented other classifiers to achieve a slight increase
in overall accuracy
Logistic Regression vs. Parzen Windows
Ground Truth

Parzen Window
Prediction
Wrong Way Violation
Not Violating Violating

Percentage
Correct

Not Violating

463

0

100

Violating

105

2

1..9

Total

570

2

81.64

Ground Truth

Logistic Regression (0.35)
Prediction
Wrong Way Violation
Not Violating Violating

Percentage
Correct

Not Violating

463

2

99.6

Violating

105

2

1.9

Total

568

4

81.30

Achieved slightly improved TPR/FPR and overall
classification accuracy using Parzen Windows
Questions…

More Related Content

Recently uploaded

TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Tatiana Kojar
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
Intelisync
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
Hiike
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
DanBrown980551
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
alexjohnson7307
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
Miro Wengner
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 

Recently uploaded (20)

TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 

Featured

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
Marius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
Expeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
Pixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
marketingartwork
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
Skeleton Technologies
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
SpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Lily Ray
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
Rajiv Jayarajah, MAppComm, ACC
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Christy Abraham Joy
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
Vit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
MindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
RachelPearson36
 

Featured (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Prediction of wrong way bike violators at USC using binary logistic regression by recording sample size of 2800+ on campus, Statistics for Engineers, USC Spring 2013

  • 1. Maisam Shahid Wasti and Dennis Hartono
  • 2. Biking wrong way at Trousdale We found that it is possible to predict wrong way violations at Trousdale Parkway, USC Campus
  • 3. Data Collection Collected 14 hours of data with total sample size of 2837
  • 4. Decision Rule to classify violators Established a consistent decision rule for sample validation
  • 5. Overview of the observation site Five minutes slot following class ending times Observed higher proportion of violators for few minutes after the classes end
  • 6. Interpretation of important variables Name: 5min_after Five minutes slot following class ending times Name Type '5min_after' Binary Description Counted '1' if sample observed within 5 minutes slot following the class ending times
  • 7. Interpretation of important variables Used ‘bodyweight’ as a binary measure for speed
  • 8. Approach to model selection All non-interaction terms Backward-LR LOGISTIC REGRESSION Initial Model with selective non interaction terms Backward-LR + (n,C,2) second order terms Intermediate Model Filtration Refined our model in three stages Final Model after removing terms causing Multicollinearity
  • 9. Variables in final model Variables Significance Gender .003 Bag * Sportswear .010 Bag * Bodyweight .002 Bag * 5min_after .001 Non-Interaction Terms Interaction Terms Found significant independent variables We interpret bag as an indicator for student on campus
  • 10. Model evaluation Test Statistic Significance Omnibus 33.518 0.000 Cox and Snell R2 0.015 Nagelkerke R2 0.024 • Observed significant improvement in Log-Likelihood through Omnibus test • Model suffered from low R2 values
  • 11. Correlation Matrix Gender Bag * Sportswear Bag * Body_weight Bag * 5min_after Multicollinearity Test 1 Gender Bag * Sportswear Bag * Body_weight Bag * 5min_after 1.000 -.085 -.186 .041 -.085 1.000 -.028 -.015 -.186 -.028 1.000 -.007 .041 -.015 -.007 1.000 Found no serious multi-collinearity issues (>0.3) with highest correlation coefficient of magnitude 0.186
  • 12. Multicollinearity Test 2 Variables B S.E. Wald Sig. Exp(B) Gender -.335 .112 8.936 .003 .715 Bag * Sportswear .667 .258 6.671 .010 1.949 Bag * Bodyweight .798 .258 9.605 .002 2.222 Bag * 5min_after .409 .120 11.533 .001 1.506 Observed Standard Errors to be bounded by maximum of 0.258
  • 13. Residual Analysis Observed no residuals lying above 2 standard deviation
  • 14. Challenges with Classification Accuracy - Have a skewed class distribution 2500 81.7 % 2000 2315 1500 1000 500 522 18.3 % 0 Violators Non-Violators - Resulting in high baseline accuracy - Difficult to improve much from the high baseline accuracy
  • 15. Predicted Probabilities Histograms Violators Non-Violators - Observed significant overlap - The default 0.5 gave a bad cut-off threshold
  • 16. Classification Tables Ground Truth Baseline Classification Table Prediction Wrong Way Violation Not Violating Violating Percentage Correct Not Violating 465 0 100.0 Violating 107 0 0.0 Total 572 0 81.3 Ground Truth Classification Table with 0.35 Cut-off Prediction Wrong Way Violation Not Violating Violating Percentage Correct Not Violating 463 2 99.6 Violating 105 2 1.9 Total 568 4 81.3 The overall classification accuracy remains the same with increased prediction power for violations
  • 17. The ROC Graph Observed to be better at predicting violations than the baseline at Cut-off = 0.35
  • 18. Multiway Cross-tabulation tests Wrong_Way_Violation * Formal_Dressing Crosstabulation Count Formal_Dressing Wrong_Way_Violation Not Violating Violating 47 Total 2315 510 12 522 59 2837 16 Total 2315 Wrong_Way_Violation * Helmet Crosstabulation Count Count Wrong_Way_Violation Not Violating Violating Total In formal dress 2778 Total Wrong_Way_Violation * Food_or_Beverages Crosstabulation Not in formal dress 2268 Food_or_Beverages Without food With food or or beverage beverage 2278 37 511 2789 11 48 Helmet Total 2315 Wrong_Way_Violation Violating Wearing helmet Total 520 2 522 2819 522 2837 Not Violating Not wearing helmet 2299 18 2837 29 Total 2315 Wrong_Way_Violation * Hoodie Crosstabulation Count Hoodie Wrong_Way_Violation Not Violating Violating Total Not wearing hoodie 2286 Wearing hoodie 517 5 522 2803 34 2837 Lacking significant number of violators for few cases
  • 19. Other classifiers Classifier Accuracy % Baseline 81.3 Logistic Regression 81.3 Parzen windows 81.64 Linear Perceptron 81.29 K-Nearest Neighbors 81.29 Experimented other classifiers to achieve a slight increase in overall accuracy
  • 20. Logistic Regression vs. Parzen Windows Ground Truth Parzen Window Prediction Wrong Way Violation Not Violating Violating Percentage Correct Not Violating 463 0 100 Violating 105 2 1..9 Total 570 2 81.64 Ground Truth Logistic Regression (0.35) Prediction Wrong Way Violation Not Violating Violating Percentage Correct Not Violating 463 2 99.6 Violating 105 2 1.9 Total 568 4 81.30 Achieved slightly improved TPR/FPR and overall classification accuracy using Parzen Windows

Editor's Notes

  1. 141.85