SlideShare a Scribd company logo
1 of 15
6.53
E-COMMERCE
FRAUD-MACHINE
LEARNING
MODELS
Ximena Bustamante
INTRODUCTION
According to Statista “e-commerce losses to online payment fraud were
estimated at 41 billion U.S. dollars globally in 2022, up from the previous
year. The figure is expected to grow further to 48 billion U.S. dollars by
2023” (Statista, “Value of e-commerce losses to online payment fraud
worldwide from 2020 to 2023”)
Machine learning algorithms are often used to identify potentially
fraudulent transactions
Come explore with me two models, logistic regression and decision trees,
that were used to identify variables significantly correlated with fraud
2023 E-Commerce Fraud Machine Learning Models-Ximena
Bustamante
2
DATASET
2023 E-Commerce Fraud Machine Learning Models-Ximena
Bustamante
3
Variables
customerEmail
Multiple
Duplicated
customerPhone
customerDevice
customerIPAddress
customerBillingAddress
No_Transactions
No_Orders
No_Payments
transactionId
orderId
paymentMethodId
paymentMethodRegistrationFailure
paymentMethodType
paymentMethodProvider
transactionAmount
transactionFailed
orderState
Fraud
KEY INSIGHTS
KEY INSIGHTS
SIGNIFICANT VARIABLES
2023 E-Commerce Fraud Machine Learning Models-Ximena
Bustamante
5
• The dataset consisted of 19 variable
• Out of the 18 independent variables—1 dependent variable—only 7 were found to be significant and the algorithms were run on
these.
KEY INSIGHTS
LOGISTIC REGRESSION
2023 E-Commerce Fraud Machine Learning Models-Ximena
Bustamante
6
• A logistic regression model was created with one dependent variable (fraud: Y/N) and 7 independent variables
• It resulted in a highly accurate model according to the confusion matrix used to measure its precision
• As see on the image on the right, it resulted in an 88% accuracy, 85% sensitivity
91% specificity, 90% precision and 87% negative predictive value
• Out of 65 non-fraud transactions in the test data, it correctly identified 59
• Out of 64 fraud transactions in the test data, it correctly identified 55
KEY INSIGHTS
DECISION TREES
2023 E-Commerce Fraud Machine Learning Models-Ximena
Bustamante
7
• A decision tree model was also created with the same dependent and independent
• It also resulted in a highly accurate algorithm according to the confusion matrix used to measure its precision
• This model resulted in a 96% sensitivity
83% specificity, 85% pos predictive and 95% negative predictive value
• Out of a total of 260 non-fraud transactions, it correctly identified 249
• Out of a total of 257 fraud transactions, it correctly identified 213
DATA PROCESS-ACQUISITION,
PREPARATION, ANALYSIS AND
VISUALIZATION
DATA ACQUISITION, PREPARATION AND
ANALYSIS
EXCEL & ACCES
2023 E-Commerce Fraud Machine Learning Models-Ximena
Bustamante
9
• Data was acquired from Kaggle and analysis was conducted with inspiration from University of Illinois –Urbana Champaign
Professor Hudson (Machine Learning Algorithms with R in Business Analytics)
• Tables with transaction data and customer data were initially joined in Acces and then explored in Excel
• Initial exploration of the data led to the identification of multiple customer e-mails associated to one customer
• This led to a new variable of binomial values being created to reflect transactions for customers with MULTIPLE emails
DATA ACQUISITION, PREPARATION AND
ANALYSIS
POWER BI
2023 E-Commerce Fraud Machine Learning Models-Ximena
Bustamante
10
• Power BI-Power Query was used to conduct more in-depth analysis of the variables
• Based on “Column Distribution”, it was evident that some IP addresses, devices and billing addresses were being used by multiple
customers (DUPLICATED)
• Thus, a new “Duplicated” column was created to reflect these transactions
DATA ACQUISITION, PREPARATION AND
ANALYSIS
R STUDIO
2023 E-Commerce Fraud Machine Learning Models-Ximena
Bustamante
11
• R Studio was used to create the 2 Machine Learning (ML) algorithms
• For the complete code, please visit my GitHub repository
• To create both ML models, I uploaded the necessary libraries, converted strings to factors, created confusion matrix, visualized the
balance of the dataset, split the data into training and testing sets, trained the models and the evaluated them on the test data,
made predictions, and finally used confusion matrix to measure accuracy
DATA VISUALIZATION
POWER BI
2023 E-Commerce Fraud Machine Learning Models-Ximena
Bustamante
12
• Power BI was used to create a map to show the geographical location of all transactions, color coded by fraud and non-fraud
CHALLENGES AND COOL TECHNIQUES
CHALLENGES AND COOL TECHNIQUES
2023 E-Commerce Fraud Machine Learning Models-Ximena
Bustamante
14
• Challenge: High number of correlated variables
• Cool Technique: Feature engineering--created two columns (with binomial values) to reflect transactions that had
duplicated/multiple addresses, phone numbers and Ip addresses, instead of creating one column for
• Challenge: Unbalanced dataset
• Cool Technique: Balanced it using RUS (random under sampling) to create a dataset with roughly the same amount of fraud/non-
fraud transactions
What If I had More Time?
• If I had more time, I would have done social networking to see how transactions may associate to one another
THANK YOU FOR
CHECKING OUT MY
PROJECT!
 Follow me for more project ideas
 If you have any questions, comments, feedback, JOB OFFERS , feel free to DM me
2023 E-Commerce Fraud Machine Learning
Models-Ximena Bustamante
15

More Related Content

Similar to E-Commerce Fraud Machine Learning Models.pptx

CREDIT CARD FRAUD DETECTION
CREDIT CARD FRAUD DETECTION CREDIT CARD FRAUD DETECTION
CREDIT CARD FRAUD DETECTION K Srinivas Rao
 
Share Credit_Card_Fraud_Detection_ML_MP (1).pptx
Share Credit_Card_Fraud_Detection_ML_MP (1).pptxShare Credit_Card_Fraud_Detection_ML_MP (1).pptx
Share Credit_Card_Fraud_Detection_ML_MP (1).pptxyatintaneja6
 
A Review of deep learning techniques in detection of anomaly incredit card tr...
A Review of deep learning techniques in detection of anomaly incredit card tr...A Review of deep learning techniques in detection of anomaly incredit card tr...
A Review of deep learning techniques in detection of anomaly incredit card tr...IRJET Journal
 
Sharing Microsoft RMS Data with QuickBooks
Sharing Microsoft RMS Data with QuickBooksSharing Microsoft RMS Data with QuickBooks
Sharing Microsoft RMS Data with QuickBooksDawn Scranton
 
Global Dynamics 365 Bootcamp London 2018
Global Dynamics 365 Bootcamp London 2018Global Dynamics 365 Bootcamp London 2018
Global Dynamics 365 Bootcamp London 2018Stefano Tempesta
 
A Novel Framework for Credit Card.
A Novel Framework for Credit Card.A Novel Framework for Credit Card.
A Novel Framework for Credit Card.Shakas Technologies
 
Certus Accelerate - Building the business case for why you need to invest in ...
Certus Accelerate - Building the business case for why you need to invest in ...Certus Accelerate - Building the business case for why you need to invest in ...
Certus Accelerate - Building the business case for why you need to invest in ...Certus Solutions
 
Online Transaction Fraud Detection System Based on Machine Learning
Online Transaction Fraud Detection System Based on Machine LearningOnline Transaction Fraud Detection System Based on Machine Learning
Online Transaction Fraud Detection System Based on Machine LearningIRJET Journal
 
TELECOM SERVICES: I.T. & ANALYTICS
TELECOM SERVICES: I.T. & ANALYTICSTELECOM SERVICES: I.T. & ANALYTICS
TELECOM SERVICES: I.T. & ANALYTICSGeorge Krasadakis
 
ATM fraud detection system using machine learning algorithms
ATM fraud detection system using machine learning algorithmsATM fraud detection system using machine learning algorithms
ATM fraud detection system using machine learning algorithmsIRJET Journal
 
Banking Circle: Money Laundering Beware: A Modern Approach to AML with Machin...
Banking Circle: Money Laundering Beware: A Modern Approach to AML with Machin...Banking Circle: Money Laundering Beware: A Modern Approach to AML with Machin...
Banking Circle: Money Laundering Beware: A Modern Approach to AML with Machin...Neo4j
 
FRAUD DETECTION IN CREDIT CARD TRANSACTIONS
FRAUD DETECTION IN CREDIT CARD TRANSACTIONSFRAUD DETECTION IN CREDIT CARD TRANSACTIONS
FRAUD DETECTION IN CREDIT CARD TRANSACTIONSIRJET Journal
 
IRJET- Survey on Credit Card Fraud Detection
IRJET- Survey on Credit Card Fraud DetectionIRJET- Survey on Credit Card Fraud Detection
IRJET- Survey on Credit Card Fraud DetectionIRJET Journal
 
Machine Learning in Cyber Security
Machine Learning in Cyber SecurityMachine Learning in Cyber Security
Machine Learning in Cyber SecurityRishi Kant
 
Are Merchants Losing The CNP Fraud Battle - A QPS Whitepaper
Are Merchants Losing The CNP Fraud Battle - A QPS WhitepaperAre Merchants Losing The CNP Fraud Battle - A QPS Whitepaper
Are Merchants Losing The CNP Fraud Battle - A QPS WhitepaperQuatrro Processing Services (QPS)
 
Automated cheque recognition
Automated cheque recognitionAutomated cheque recognition
Automated cheque recognitioninfo_jojo
 
IRJET - Fraud Detection in Credit Card using Machine Learning Techniques
IRJET -  	  Fraud Detection in Credit Card using Machine Learning TechniquesIRJET -  	  Fraud Detection in Credit Card using Machine Learning Techniques
IRJET - Fraud Detection in Credit Card using Machine Learning TechniquesIRJET Journal
 

Similar to E-Commerce Fraud Machine Learning Models.pptx (20)

CREDIT CARD FRAUD DETECTION
CREDIT CARD FRAUD DETECTION CREDIT CARD FRAUD DETECTION
CREDIT CARD FRAUD DETECTION
 
Share Credit_Card_Fraud_Detection_ML_MP (1).pptx
Share Credit_Card_Fraud_Detection_ML_MP (1).pptxShare Credit_Card_Fraud_Detection_ML_MP (1).pptx
Share Credit_Card_Fraud_Detection_ML_MP (1).pptx
 
A Review of deep learning techniques in detection of anomaly incredit card tr...
A Review of deep learning techniques in detection of anomaly incredit card tr...A Review of deep learning techniques in detection of anomaly incredit card tr...
A Review of deep learning techniques in detection of anomaly incredit card tr...
 
Sharing Microsoft RMS Data with QuickBooks
Sharing Microsoft RMS Data with QuickBooksSharing Microsoft RMS Data with QuickBooks
Sharing Microsoft RMS Data with QuickBooks
 
Global Dynamics 365 Bootcamp London 2018
Global Dynamics 365 Bootcamp London 2018Global Dynamics 365 Bootcamp London 2018
Global Dynamics 365 Bootcamp London 2018
 
A Novel Framework for Credit Card.
A Novel Framework for Credit Card.A Novel Framework for Credit Card.
A Novel Framework for Credit Card.
 
Certus Accelerate - Building the business case for why you need to invest in ...
Certus Accelerate - Building the business case for why you need to invest in ...Certus Accelerate - Building the business case for why you need to invest in ...
Certus Accelerate - Building the business case for why you need to invest in ...
 
Online Transaction Fraud Detection System Based on Machine Learning
Online Transaction Fraud Detection System Based on Machine LearningOnline Transaction Fraud Detection System Based on Machine Learning
Online Transaction Fraud Detection System Based on Machine Learning
 
TELECOM SERVICES: I.T. & ANALYTICS
TELECOM SERVICES: I.T. & ANALYTICSTELECOM SERVICES: I.T. & ANALYTICS
TELECOM SERVICES: I.T. & ANALYTICS
 
ATM fraud detection system using machine learning algorithms
ATM fraud detection system using machine learning algorithmsATM fraud detection system using machine learning algorithms
ATM fraud detection system using machine learning algorithms
 
Project PPT sem 2.pptx
Project PPT sem 2.pptxProject PPT sem 2.pptx
Project PPT sem 2.pptx
 
Banking Circle: Money Laundering Beware: A Modern Approach to AML with Machin...
Banking Circle: Money Laundering Beware: A Modern Approach to AML with Machin...Banking Circle: Money Laundering Beware: A Modern Approach to AML with Machin...
Banking Circle: Money Laundering Beware: A Modern Approach to AML with Machin...
 
CREDIT_CARD.ppt
CREDIT_CARD.pptCREDIT_CARD.ppt
CREDIT_CARD.ppt
 
FRAUD DETECTION IN CREDIT CARD TRANSACTIONS
FRAUD DETECTION IN CREDIT CARD TRANSACTIONSFRAUD DETECTION IN CREDIT CARD TRANSACTIONS
FRAUD DETECTION IN CREDIT CARD TRANSACTIONS
 
IRJET- Survey on Credit Card Fraud Detection
IRJET- Survey on Credit Card Fraud DetectionIRJET- Survey on Credit Card Fraud Detection
IRJET- Survey on Credit Card Fraud Detection
 
The Role of Generative AI and LLMs in Accounts Payable Automation1.pdf
The Role of Generative AI and LLMs in Accounts Payable Automation1.pdfThe Role of Generative AI and LLMs in Accounts Payable Automation1.pdf
The Role of Generative AI and LLMs in Accounts Payable Automation1.pdf
 
Machine Learning in Cyber Security
Machine Learning in Cyber SecurityMachine Learning in Cyber Security
Machine Learning in Cyber Security
 
Are Merchants Losing The CNP Fraud Battle - A QPS Whitepaper
Are Merchants Losing The CNP Fraud Battle - A QPS WhitepaperAre Merchants Losing The CNP Fraud Battle - A QPS Whitepaper
Are Merchants Losing The CNP Fraud Battle - A QPS Whitepaper
 
Automated cheque recognition
Automated cheque recognitionAutomated cheque recognition
Automated cheque recognition
 
IRJET - Fraud Detection in Credit Card using Machine Learning Techniques
IRJET -  	  Fraud Detection in Credit Card using Machine Learning TechniquesIRJET -  	  Fraud Detection in Credit Card using Machine Learning Techniques
IRJET - Fraud Detection in Credit Card using Machine Learning Techniques
 

Recently uploaded

Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlkumarajju5765
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 

Recently uploaded (20)

Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 

E-Commerce Fraud Machine Learning Models.pptx

  • 2. INTRODUCTION According to Statista “e-commerce losses to online payment fraud were estimated at 41 billion U.S. dollars globally in 2022, up from the previous year. The figure is expected to grow further to 48 billion U.S. dollars by 2023” (Statista, “Value of e-commerce losses to online payment fraud worldwide from 2020 to 2023”) Machine learning algorithms are often used to identify potentially fraudulent transactions Come explore with me two models, logistic regression and decision trees, that were used to identify variables significantly correlated with fraud 2023 E-Commerce Fraud Machine Learning Models-Ximena Bustamante 2
  • 3. DATASET 2023 E-Commerce Fraud Machine Learning Models-Ximena Bustamante 3 Variables customerEmail Multiple Duplicated customerPhone customerDevice customerIPAddress customerBillingAddress No_Transactions No_Orders No_Payments transactionId orderId paymentMethodId paymentMethodRegistrationFailure paymentMethodType paymentMethodProvider transactionAmount transactionFailed orderState Fraud
  • 5. KEY INSIGHTS SIGNIFICANT VARIABLES 2023 E-Commerce Fraud Machine Learning Models-Ximena Bustamante 5 • The dataset consisted of 19 variable • Out of the 18 independent variables—1 dependent variable—only 7 were found to be significant and the algorithms were run on these.
  • 6. KEY INSIGHTS LOGISTIC REGRESSION 2023 E-Commerce Fraud Machine Learning Models-Ximena Bustamante 6 • A logistic regression model was created with one dependent variable (fraud: Y/N) and 7 independent variables • It resulted in a highly accurate model according to the confusion matrix used to measure its precision • As see on the image on the right, it resulted in an 88% accuracy, 85% sensitivity 91% specificity, 90% precision and 87% negative predictive value • Out of 65 non-fraud transactions in the test data, it correctly identified 59 • Out of 64 fraud transactions in the test data, it correctly identified 55
  • 7. KEY INSIGHTS DECISION TREES 2023 E-Commerce Fraud Machine Learning Models-Ximena Bustamante 7 • A decision tree model was also created with the same dependent and independent • It also resulted in a highly accurate algorithm according to the confusion matrix used to measure its precision • This model resulted in a 96% sensitivity 83% specificity, 85% pos predictive and 95% negative predictive value • Out of a total of 260 non-fraud transactions, it correctly identified 249 • Out of a total of 257 fraud transactions, it correctly identified 213
  • 9. DATA ACQUISITION, PREPARATION AND ANALYSIS EXCEL & ACCES 2023 E-Commerce Fraud Machine Learning Models-Ximena Bustamante 9 • Data was acquired from Kaggle and analysis was conducted with inspiration from University of Illinois –Urbana Champaign Professor Hudson (Machine Learning Algorithms with R in Business Analytics) • Tables with transaction data and customer data were initially joined in Acces and then explored in Excel • Initial exploration of the data led to the identification of multiple customer e-mails associated to one customer • This led to a new variable of binomial values being created to reflect transactions for customers with MULTIPLE emails
  • 10. DATA ACQUISITION, PREPARATION AND ANALYSIS POWER BI 2023 E-Commerce Fraud Machine Learning Models-Ximena Bustamante 10 • Power BI-Power Query was used to conduct more in-depth analysis of the variables • Based on “Column Distribution”, it was evident that some IP addresses, devices and billing addresses were being used by multiple customers (DUPLICATED) • Thus, a new “Duplicated” column was created to reflect these transactions
  • 11. DATA ACQUISITION, PREPARATION AND ANALYSIS R STUDIO 2023 E-Commerce Fraud Machine Learning Models-Ximena Bustamante 11 • R Studio was used to create the 2 Machine Learning (ML) algorithms • For the complete code, please visit my GitHub repository • To create both ML models, I uploaded the necessary libraries, converted strings to factors, created confusion matrix, visualized the balance of the dataset, split the data into training and testing sets, trained the models and the evaluated them on the test data, made predictions, and finally used confusion matrix to measure accuracy
  • 12. DATA VISUALIZATION POWER BI 2023 E-Commerce Fraud Machine Learning Models-Ximena Bustamante 12 • Power BI was used to create a map to show the geographical location of all transactions, color coded by fraud and non-fraud
  • 13. CHALLENGES AND COOL TECHNIQUES
  • 14. CHALLENGES AND COOL TECHNIQUES 2023 E-Commerce Fraud Machine Learning Models-Ximena Bustamante 14 • Challenge: High number of correlated variables • Cool Technique: Feature engineering--created two columns (with binomial values) to reflect transactions that had duplicated/multiple addresses, phone numbers and Ip addresses, instead of creating one column for • Challenge: Unbalanced dataset • Cool Technique: Balanced it using RUS (random under sampling) to create a dataset with roughly the same amount of fraud/non- fraud transactions What If I had More Time? • If I had more time, I would have done social networking to see how transactions may associate to one another
  • 15. THANK YOU FOR CHECKING OUT MY PROJECT!  Follow me for more project ideas  If you have any questions, comments, feedback, JOB OFFERS , feel free to DM me 2023 E-Commerce Fraud Machine Learning Models-Ximena Bustamante 15