Data Mining Example ‘Tips’ dataset Irish Centre for High End Computing Dr. Eoin Brazil, Technology Transfer
Outline of Presentation <ul><li>Tips in a US restaurant for 2 1/2 months </li></ul><ul><li>244 cases, 8 variables </li></u...
Exploring ‘tips’ dataset <ul><li>Q:  What are the factors that affect tipping behaviour ? </li></ul><ul><li>Data restructu...
Irish Centre for High End Computing (ICHEC) - Data Mining Example Large bins show global features, tips fall off quickly s...
Irish Centre for High End Computing (ICHEC) - Data Mining Example Smaller bins show local features, tips tend to be rounde...
Irish Centre for High End Computing (ICHEC) - Data Mining Example Looking at the correlation between variables (r) and che...
Irish Centre for High End Computing (ICHEC) - Data Mining Example ‘ Conditioning’ or ‘Drilling down’ to explore more compl...
Irish Centre for High End Computing (ICHEC) - Data Mining Example ‘ Conditioning’ or ‘Drilling down’ to explore more compl...
Beyond ‘toy’ datasets <ul><li>More complex methods </li></ul><ul><li>Classification (Supervised/Unsupervised) </li></ul><u...
Acknowledgements Supported by Science Foundation Ireland under grant 08/HEC/I1450 and by HEA’s PRTLI-C4.
Upcoming SlideShare
Loading in...5
×

An example of discovering simple patterns using basic data mining

2,858

Published on

A short overview on how simple data mining techniques can be applied to a dataset to identify patterns relevant to business problems.

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
2,858
On Slideshare
0
From Embeds
0
Number of Embeds
5
Actions
Shares
0
Downloads
0
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

An example of discovering simple patterns using basic data mining

  1. 1. Data Mining Example ‘Tips’ dataset Irish Centre for High End Computing Dr. Eoin Brazil, Technology Transfer
  2. 2. Outline of Presentation <ul><li>Tips in a US restaurant for 2 1/2 months </li></ul><ul><li>244 cases, 8 variables </li></ul><ul><li>Obs num, totbill, tip, sex, smoker, day (Thur-Sun), time, size of party </li></ul><ul><li>Source: Bryant, P. G. and Smith, M. A. (1995), Practical Data Analysis: Case Studies in Business Statistics </li></ul>Irish Centre for High End Computing (ICHEC) - Data Mining Example
  3. 3. Exploring ‘tips’ dataset <ul><li>Q: What are the factors that affect tipping behaviour ? </li></ul><ul><li>Data restructuring: </li></ul><ul><li>Calculate tiprate = tip/totbill </li></ul><ul><li>Avenues to explore: Regression modeling </li></ul>Irish Centre for High End Computing (ICHEC) - Data Mining Example
  4. 4. Irish Centre for High End Computing (ICHEC) - Data Mining Example Large bins show global features, tips fall off quickly so perhaps this is not a very expensive restaurant.
  5. 5. Irish Centre for High End Computing (ICHEC) - Data Mining Example Smaller bins show local features, tips tend to be rounded to the nearest 50 cents or dollar.
  6. 6. Irish Centre for High End Computing (ICHEC) - Data Mining Example Looking at the correlation between variables (r) and checking to see who gives more or less than the ‘average’ tip of 18% of the bill.
  7. 7. Irish Centre for High End Computing (ICHEC) - Data Mining Example ‘ Conditioning’ or ‘Drilling down’ to explore more complex relationships within the data.
  8. 8. Irish Centre for High End Computing (ICHEC) - Data Mining Example ‘ Conditioning’ or ‘Drilling down’ to explore more complex relationships within the data, smokers round their tip more.
  9. 9. Beyond ‘toy’ datasets <ul><li>More complex methods </li></ul><ul><li>Classification (Supervised/Unsupervised) </li></ul><ul><li>Machine Learning </li></ul><ul><li>Statistical techniques </li></ul><ul><li>Visualisation </li></ul>Irish Centre for High End Computing (ICHEC) - Data Mining Example
  10. 10. Acknowledgements Supported by Science Foundation Ireland under grant 08/HEC/I1450 and by HEA’s PRTLI-C4.

×