SlideShare a Scribd company logo
1 of 33
+




    Predictive Analytics for Cars.com
        Directed by Professor Martin Bariff

                Xiaofei Yang, Joyce
                     Yi Shi, Ivy
                Yingqiu Zhu, Chris
   Introduction
   Data Preparation

          +
    Data Mining Methods
   Better Cars.com




                          2
+




    Introduction
    Yingqiu Zhu, Chris



                         3
+                  4

    INTRODUCTION
    SPSS MODELER
+




    Data Preparation
        Yingqiu Zhu, Chris
+
                       6
        DATA
 PREPARATION
      KEY VARIABLES
a. Basic Information

b. Ratings

c. Indicators
+
                            7

       DATA
PREPARATION
                 CLEANING


                Shopping
    Quality     for Used
      of
    Repair

              Overall
              Rating




Ready-to-Use Data
+                                                                 8

    DATA PREPARATION
    BALANCING

             Unbalanced Data         Balanced Data



     16.43%                                30.21%
      3.01%                       5.54%
     2.11%                       3.88%
        10.01%                            18.40%
                        68.45%                      41.96%
                                                             27
                                                             %
+




    Data Mining Methods
             Yingqiu Zhu, Chris
          & Xiaofei Yang, Joyce
+                                                10

    METHODS
                                     • K-Means
         Cluster Analysis




                              • Decision Tree
        Predictive Analysis




          Text Mining
+                                                    11

    CLUSTER ANALYSIS
    K-MEANS


                      K-Means is the most popular
                       nonhierarchical clustering
                       method.

                      K-Means is based on geometric
                       notions of similarity
+                                        12

    CLUSTER ANALYSIS
    K-MEANS
        Model Summary   Model Accuracy
R13

                                     ANALYSI
                                          S
                      Yes       No



                        5   1
    Importance 100%




                                       K-Means

                        5   1


                        5   1


+                     Yes       No
        Importance
           11%
+                                                      14

    CLUSTER ANALYSIS
    K-MEANS

              Customers
              who purchase used cars from a dealer

                 rating high for the dealer
                 recommending the dealer to others.
+                                                                                     15

    PREDICTIVE ANALYSIS
    INTRODUCTION OF DECISION TREE MODEL
       A tree-building algorithm

       Recursively splitting the data into smaller and smaller groups based on the
        fields that provides the maximum information gain

       Used C5.0 model in our research
+                                                                       16

    PREDICTIVE ANALYSIS
    DECISION TREE

                     Decision Tree with Balanced Data
                                     60%                   40%




       Evaluation Standard
        Overall accuracy: > 80%
        Favorability between training & testing samples: The results
         with the testing sample compare favorably to the training
         sample.
+                                               17

    PREDICTIVE ANALYSIS
    DECISION TREE




                    Rating - Customer Service
+                                                                       18

    PREDICTIVE ANALYSIS
    INTRODUCTION OF TEXT MINING MODEL

       Extract key concepts from the text and create categories with
        these concepts by using linguistic and frequency techniques

       Explore more information and context included in the data

       Better compare the results with and without text data and see
        the difference
+                                                                   19

    PREDICTIVE ANALYSIS
    DECISION TREE MODEL WITH TEXT DATA

    Model with Only Numeric Data   Model with Text & Numeric Data
+                                                        20

    PREDICTIVE ANALYSIS
    Interactive Workbench Categories and Concepts View
21




                        Trustful & Knowledgeable

                        Extended warranty

                        Relatively consistent pricing



buying/buying process
                        Nearest location < Service & Credibility
+                                                          22

    PREDICTIVE ANALYSIS
    TEXT MINING
               Rating = 5                  Rating = 1
          High Overall Rating         Low Overall Rating
    100
     90                         100
     80                          90
                                 80
     70                          70
     60                          60
     50                          50
     40                          40
     30                          30
     20                          20
     10                          10
      0                           0




                    %                           %
+




    Better Cars.Com
            Yi Shi (Ivy)
What we get from our analysis…                             24




     Less influential dealer’s   Purchase behavior
     information                 =>Higher overall rating




                         Overall Rating



     Higher rating for
                                 Text Mining
     buying process &
     customer service            =>New Hint for
                                 Consumer behavior
      =>Higher Overall Rating
+                                         25

    Stimulus – Influence Overall Rating
    NEW HINT

       Buying Process (experience)

        -Sales people

        -Service

        -Facility (Eg: Internet)

        -Family’s feeling



       Price sensitivity

       Finance issue
+                                                           26

    Motivation
    STEP 1: Customers have more rights on review sheet

     New   Review Sheet


                                              Personal
                                              Information
                                 Additional
                                 Questions
                     Review
       Additional
       Rating
27

             Additional Rating




Description Matching Level
28
                    Additional Questions
 Are you satisfied with this purchase?
      Yes                No

Please tell us WHY (Mark all that apply)
   Quality of in-store service
   Quality of vehicle
   Price
   Financial Service (Insurance, cash deal, warranty cost, monthly payment)
   Location of dealer
   Sales People
   In-store facility
   In-store environment
   Can/Cannot get vehicle in quickly enough
   Service Department hours
   Alternative transportation not available (rental car, shuttle, etc.)
   Other______________
+                                                            29

    Motivation
    STEP 2: More visual hint for customers
                               Dealer Rating   Best Seller



                               Extended Warranty Label




            Dealer:




            Dealer:
+                                                                    30

    Motivation
    STEP 3: New Dealers Resume

                                          • Lounge with snacks and
                     Facilities:            drink
                                          • Free Wi-Fi

                                          •   Insurance
                                          •   Deal/Gift
                      Services:
                                          •   Extended Warranty
                                          •   Maintenance
                                          •   Free certification

                  More space for
                 Dealer Description
                   3 others viewing this vehicle right now

                   15 others bought vehicles from this
                   dealer in the last month
+                                                         31

    Motivation
    STEP 4: Monthly rating list

       Monthly highest rating dealer

       • Sub-rating list
       • Monthly Star Dealer (label in the search page)

       Monthly recommended dealer


       Monthly best seller


       The most favorable dealer
+                                                           32


    Better Cars.Com
       Combine Cars review sheet and Dealer review sheet
           Continue to review dealer?
             Yes              No
            Dealers Review sheet………

       Add schedule plug-in

       Online voicemail

       Add to Favorite

       Relevant vehicles to consider
           You Viewed
           Customer who view this also viewed
           Recommended base on your browsing history
+




    Predictive Analytics for Cars.com         Xiaofei Yang, Joyce
        Directed by Professor Martin Bariff        Yi Shi, Ivy
                                              Yingqiu Zhu, Chris

Thank Questions???
      You for Your Listening

More Related Content

Similar to Predictive Analytics for Cars.com

Predictive analytics for cars.com
Predictive analytics for cars.comPredictive analytics for cars.com
Predictive analytics for cars.comYI SHI
 
Health meeting v2_20120529_kp
Health meeting v2_20120529_kpHealth meeting v2_20120529_kp
Health meeting v2_20120529_kpKevin Pledge
 
Customer Insights Summit Toronto 2012
Customer Insights Summit Toronto 2012Customer Insights Summit Toronto 2012
Customer Insights Summit Toronto 2012Fabiana Pereira
 
201206 IASA Session 408 - Applied Analytics
201206 IASA Session 408 - Applied Analytics201206 IASA Session 408 - Applied Analytics
201206 IASA Session 408 - Applied AnalyticsSteven Callahan
 
Emetrics - Oct 19 2011 - New York - X channel optimisation
Emetrics - Oct 19 2011 - New York - X channel optimisationEmetrics - Oct 19 2011 - New York - X channel optimisation
Emetrics - Oct 19 2011 - New York - X channel optimisationCraig Sullivan
 
Leveraging Analytics To Improve Your Marketing Efforts:
Leveraging Analytics To Improve Your Marketing Efforts:Leveraging Analytics To Improve Your Marketing Efforts:
Leveraging Analytics To Improve Your Marketing Efforts:BusinessOnline
 
Ibc biological assay development & validation 2011 gra presentation
Ibc biological assay development & validation 2011 gra presentationIbc biological assay development & validation 2011 gra presentation
Ibc biological assay development & validation 2011 gra presentationGreyRigge Associates Ltd
 
Metric Arts Presentation
Metric Arts PresentationMetric Arts Presentation
Metric Arts PresentationMetric Arts
 
Data analytics and visualization
Data analytics and visualizationData analytics and visualization
Data analytics and visualizationVini Vasundharan
 
Conversion management conference 2013
Conversion management   conference 2013Conversion management   conference 2013
Conversion management conference 2013Scot McAllan
 
Conversion Rate Optimisation - Perth Conference - March 2013
Conversion Rate Optimisation - Perth Conference - March 2013Conversion Rate Optimisation - Perth Conference - March 2013
Conversion Rate Optimisation - Perth Conference - March 2013Scot McAllan
 
Greg Kelton - "Multivariate Testing at Orange"
Greg Kelton - "Multivariate Testing at Orange"Greg Kelton - "Multivariate Testing at Orange"
Greg Kelton - "Multivariate Testing at Orange"Constantin Cocioaba
 
Predictive analytics km chicago
Predictive analytics km chicagoPredictive analytics km chicago
Predictive analytics km chicagoKM Chicago
 
A brief introduction to Six Sigma
A brief introduction to Six SigmaA brief introduction to Six Sigma
A brief introduction to Six SigmaVillads Jakobsen
 
Discovering Your Value Proposition
Discovering Your Value Proposition Discovering Your Value Proposition
Discovering Your Value Proposition MarketingExperiments
 
Business Intelligence Symposium Presentation
Business Intelligence Symposium PresentationBusiness Intelligence Symposium Presentation
Business Intelligence Symposium PresentationBill Cassill
 
Lecture 22
Lecture 22Lecture 22
Lecture 22Shani729
 

Similar to Predictive Analytics for Cars.com (20)

Predictive analytics for cars.com
Predictive analytics for cars.comPredictive analytics for cars.com
Predictive analytics for cars.com
 
Health meeting v2_20120529_kp
Health meeting v2_20120529_kpHealth meeting v2_20120529_kp
Health meeting v2_20120529_kp
 
Customer Insights Summit Toronto 2012
Customer Insights Summit Toronto 2012Customer Insights Summit Toronto 2012
Customer Insights Summit Toronto 2012
 
201206 IASA Session 408 - Applied Analytics
201206 IASA Session 408 - Applied Analytics201206 IASA Session 408 - Applied Analytics
201206 IASA Session 408 - Applied Analytics
 
Emetrics - Oct 19 2011 - New York - X channel optimisation
Emetrics - Oct 19 2011 - New York - X channel optimisationEmetrics - Oct 19 2011 - New York - X channel optimisation
Emetrics - Oct 19 2011 - New York - X channel optimisation
 
Leveraging Analytics To Improve Your Marketing Efforts:
Leveraging Analytics To Improve Your Marketing Efforts:Leveraging Analytics To Improve Your Marketing Efforts:
Leveraging Analytics To Improve Your Marketing Efforts:
 
Bebpa Nice 29 Sept 2011
Bebpa Nice 29 Sept 2011Bebpa Nice 29 Sept 2011
Bebpa Nice 29 Sept 2011
 
Ibc biological assay development & validation 2011 gra presentation
Ibc biological assay development & validation 2011 gra presentationIbc biological assay development & validation 2011 gra presentation
Ibc biological assay development & validation 2011 gra presentation
 
Metric Arts Presentation
Metric Arts PresentationMetric Arts Presentation
Metric Arts Presentation
 
Data analytics and visualization
Data analytics and visualizationData analytics and visualization
Data analytics and visualization
 
Conversion management conference 2013
Conversion management   conference 2013Conversion management   conference 2013
Conversion management conference 2013
 
Conversion Rate Optimisation - Perth Conference - March 2013
Conversion Rate Optimisation - Perth Conference - March 2013Conversion Rate Optimisation - Perth Conference - March 2013
Conversion Rate Optimisation - Perth Conference - March 2013
 
Greg Kelton Multivariate Testing
Greg Kelton Multivariate TestingGreg Kelton Multivariate Testing
Greg Kelton Multivariate Testing
 
Greg Kelton - "Multivariate Testing at Orange"
Greg Kelton - "Multivariate Testing at Orange"Greg Kelton - "Multivariate Testing at Orange"
Greg Kelton - "Multivariate Testing at Orange"
 
Predictive analytics km chicago
Predictive analytics km chicagoPredictive analytics km chicago
Predictive analytics km chicago
 
Six Sigma
Six SigmaSix Sigma
Six Sigma
 
A brief introduction to Six Sigma
A brief introduction to Six SigmaA brief introduction to Six Sigma
A brief introduction to Six Sigma
 
Discovering Your Value Proposition
Discovering Your Value Proposition Discovering Your Value Proposition
Discovering Your Value Proposition
 
Business Intelligence Symposium Presentation
Business Intelligence Symposium PresentationBusiness Intelligence Symposium Presentation
Business Intelligence Symposium Presentation
 
Lecture 22
Lecture 22Lecture 22
Lecture 22
 

Predictive Analytics for Cars.com

  • 1. + Predictive Analytics for Cars.com Directed by Professor Martin Bariff Xiaofei Yang, Joyce Yi Shi, Ivy Yingqiu Zhu, Chris
  • 2. Introduction  Data Preparation  + Data Mining Methods  Better Cars.com 2
  • 3. + Introduction Yingqiu Zhu, Chris 3
  • 4. + 4 INTRODUCTION SPSS MODELER
  • 5. + Data Preparation Yingqiu Zhu, Chris
  • 6. + 6 DATA PREPARATION KEY VARIABLES a. Basic Information b. Ratings c. Indicators
  • 7. + 7 DATA PREPARATION CLEANING Shopping Quality for Used of Repair Overall Rating Ready-to-Use Data
  • 8. + 8 DATA PREPARATION BALANCING Unbalanced Data Balanced Data 16.43% 30.21% 3.01% 5.54% 2.11% 3.88% 10.01% 18.40% 68.45% 41.96% 27 %
  • 9. + Data Mining Methods Yingqiu Zhu, Chris & Xiaofei Yang, Joyce
  • 10. + 10 METHODS • K-Means Cluster Analysis • Decision Tree Predictive Analysis Text Mining
  • 11. + 11 CLUSTER ANALYSIS K-MEANS  K-Means is the most popular nonhierarchical clustering method.  K-Means is based on geometric notions of similarity
  • 12. + 12 CLUSTER ANALYSIS K-MEANS Model Summary Model Accuracy
  • 13. R13 ANALYSI S Yes No 5 1 Importance 100% K-Means 5 1 5 1 + Yes No Importance 11%
  • 14. + 14 CLUSTER ANALYSIS K-MEANS Customers who purchase used cars from a dealer  rating high for the dealer  recommending the dealer to others.
  • 15. + 15 PREDICTIVE ANALYSIS INTRODUCTION OF DECISION TREE MODEL  A tree-building algorithm  Recursively splitting the data into smaller and smaller groups based on the fields that provides the maximum information gain  Used C5.0 model in our research
  • 16. + 16 PREDICTIVE ANALYSIS DECISION TREE Decision Tree with Balanced Data 60% 40% Evaluation Standard  Overall accuracy: > 80%  Favorability between training & testing samples: The results with the testing sample compare favorably to the training sample.
  • 17. + 17 PREDICTIVE ANALYSIS DECISION TREE Rating - Customer Service
  • 18. + 18 PREDICTIVE ANALYSIS INTRODUCTION OF TEXT MINING MODEL  Extract key concepts from the text and create categories with these concepts by using linguistic and frequency techniques  Explore more information and context included in the data  Better compare the results with and without text data and see the difference
  • 19. + 19 PREDICTIVE ANALYSIS DECISION TREE MODEL WITH TEXT DATA Model with Only Numeric Data Model with Text & Numeric Data
  • 20. + 20 PREDICTIVE ANALYSIS Interactive Workbench Categories and Concepts View
  • 21. 21 Trustful & Knowledgeable Extended warranty Relatively consistent pricing buying/buying process Nearest location < Service & Credibility
  • 22. + 22 PREDICTIVE ANALYSIS TEXT MINING Rating = 5 Rating = 1 High Overall Rating Low Overall Rating 100 90 100 80 90 80 70 70 60 60 50 50 40 40 30 30 20 20 10 10 0 0 % %
  • 23. + Better Cars.Com Yi Shi (Ivy)
  • 24. What we get from our analysis… 24 Less influential dealer’s Purchase behavior information =>Higher overall rating Overall Rating Higher rating for Text Mining buying process & customer service =>New Hint for Consumer behavior =>Higher Overall Rating
  • 25. + 25 Stimulus – Influence Overall Rating NEW HINT  Buying Process (experience) -Sales people -Service -Facility (Eg: Internet) -Family’s feeling  Price sensitivity  Finance issue
  • 26. + 26 Motivation STEP 1: Customers have more rights on review sheet  New Review Sheet Personal Information Additional Questions Review Additional Rating
  • 27. 27 Additional Rating Description Matching Level
  • 28. 28 Additional Questions  Are you satisfied with this purchase?  Yes  No Please tell us WHY (Mark all that apply)  Quality of in-store service  Quality of vehicle  Price  Financial Service (Insurance, cash deal, warranty cost, monthly payment)  Location of dealer  Sales People  In-store facility  In-store environment  Can/Cannot get vehicle in quickly enough  Service Department hours  Alternative transportation not available (rental car, shuttle, etc.)  Other______________
  • 29. + 29 Motivation STEP 2: More visual hint for customers Dealer Rating Best Seller Extended Warranty Label Dealer: Dealer:
  • 30. + 30 Motivation STEP 3: New Dealers Resume • Lounge with snacks and Facilities: drink • Free Wi-Fi • Insurance • Deal/Gift Services: • Extended Warranty • Maintenance • Free certification More space for Dealer Description 3 others viewing this vehicle right now 15 others bought vehicles from this dealer in the last month
  • 31. + 31 Motivation STEP 4: Monthly rating list Monthly highest rating dealer • Sub-rating list • Monthly Star Dealer (label in the search page) Monthly recommended dealer Monthly best seller The most favorable dealer
  • 32. + 32 Better Cars.Com  Combine Cars review sheet and Dealer review sheet  Continue to review dealer?  Yes  No Dealers Review sheet………  Add schedule plug-in  Online voicemail  Add to Favorite  Relevant vehicles to consider  You Viewed  Customer who view this also viewed  Recommended base on your browsing history
  • 33. + Predictive Analytics for Cars.com Xiaofei Yang, Joyce Directed by Professor Martin Bariff Yi Shi, Ivy Yingqiu Zhu, Chris Thank Questions??? You for Your Listening

Editor's Notes

  1. Now we have general ideas about why people give dealers low or high rating? What do they really care about during purchase? Then how to help dealers improve that? Before talking about that. We should first stand in dealers&apos; place, thinking about what do dealer want? More customers, high ROI and better SRP. Dealers are always try to topped the list. Rather than just provide them lot of tips in dealers section, how about do something to motivate them. No one want to lose in competition. Step 1: give customers more rights on review sheet, not just simple three rating. Customers could evaluate more things about dealer
  2. In order to enable customers find more dealer information in most efficient way. Cars. Com could encourage dealer list more information on their resume. Like if they provide free WIFI or not. What kind of particular do they have, do they provide insurance service, extended warranty service, or maintenance service. Do they have any seasonal deal? Do they provide free certification service. Give dealer more space to descript themselves. Another way to encourage dealers is plug-in real time label in their page, tell customers how many people are viewing this vehicle right now and the dealers’ sales volume in the last month.
  3. The last efficient things to motivate dealers is post some rating list on the home page month by month. like….this could give customers more hint to search as well as encourage dealers to be better.