An example of discovering simple patterns using basic data mining

A short overview of how simple data mining techniques can be applied to a dataset to identify patterns relevant to business problems.

Published in: Technology

Transcript

  • 1. Data Mining Example: ‘Tips’ dataset. Irish Centre for High End Computing. Dr. Eoin Brazil, Technology Transfer
  • 2. Outline of Presentation
    • Tips recorded in a US restaurant over 2 1/2 months
    • 244 cases, 8 variables
    • Obs num, totbill, tip, sex, smoker, day (Thur-Sun), time, size of party
    • Source: Bryant, P. G. and Smith, M. A. (1995), Practical Data Analysis: Case Studies in Business Statistics
    Irish Centre for High End Computing (ICHEC) - Data Mining Example
  • 3. Exploring ‘tips’ dataset
    • Q: What are the factors that affect tipping behaviour?
    • Data restructuring:
    • Calculate tiprate = tip/totbill
    • Avenues to explore: Regression modeling
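The restructuring step on this slide can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the presenter's code: the rows are made-up values rather than records from the real dataset, and `add_tiprate` and `fit_line` are hypothetical helper names. It computes tiprate = tip/totbill and then fits a straight line of tip against totbill by ordinary least squares, which is one simple reading of ‘regression modeling’.

```python
# Made-up rows for illustration only (not taken from the real 'tips' data).
rows = [
    {"totbill": 10.0, "tip": 2.0},
    {"totbill": 20.0, "tip": 3.0},
    {"totbill": 30.0, "tip": 4.0},
    {"totbill": 40.0, "tip": 5.0},
]


def add_tiprate(rows):
    """Return copies of the rows with tiprate = tip / totbill added."""
    return [{**r, "tiprate": r["tip"] / r["totbill"]} for r in rows]


def fit_line(xs, ys):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


with_rate = add_tiprate(rows)
intercept, slope = fit_line(
    [r["totbill"] for r in rows],
    [r["tip"] for r in rows],
)

for r in with_rate:
    print(f'bill={r["totbill"]:.2f} tip={r["tip"]:.2f} tiprate={r["tiprate"]:.3f}')
print(f"tip = {intercept:.2f} + {slope:.2f} * totbill")
```

In this toy data the tip rate falls as the bill grows even though the tip itself rises, which is the kind of pattern the derived tiprate variable is meant to expose.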
  • 4. Large bins show global features: tips fall off quickly, so perhaps this is not a very expensive restaurant.
  • 5. Smaller bins show local features: tips tend to be rounded to the nearest 50 cents or dollar.
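The effect of bin width described on slides 4 and 5 can be reproduced with a short sketch. The tip values below are made up for illustration, amounts are held in integer cents to avoid floating-point edge cases, and `bin_counts` is a hypothetical helper name. Wide bins summarise the overall shape, while narrow bins reveal spikes at round amounts.

```python
from collections import Counter

# Made-up tip amounts in cents (illustrative, not from the real dataset).
tips_cents = [100, 150, 200, 200, 250, 300, 300, 348, 400, 500]


def bin_counts(values_cents, width_cents):
    """Count values per bin; each bin is labelled by its lower edge in cents."""
    counts = Counter((v // width_cents) * width_cents for v in values_cents)
    return dict(sorted(counts.items()))


wide = bin_counts(tips_cents, 100)   # $1 bins: global shape
narrow = bin_counts(tips_cents, 10)  # 10-cent bins: local detail

# Share of tips that land exactly on a multiple of 50 cents.
round_share = sum(v % 50 == 0 for v in tips_cents) / len(tips_cents)

for label, hist in (("wide", wide), ("narrow", narrow)):
    print(label)
    for edge, count in hist.items():
        print(f"  ${edge / 100:.2f} {'#' * count}")
print(f"share on a 50-cent multiple: {round_share:.0%}")
```

With the narrow bins, most occupied bins sit on 50-cent multiples, and the single odd amount stands out on its own.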
  • 6. Looking at the correlation between variables (r) and checking to see who gives more or less than the ‘average’ tip of 18% of the bill.
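A sketch of the two checks named on slide 6, under stated assumptions: the bills and tips are made-up values, `pearson_r` is a hypothetical helper that implements the standard Pearson correlation coefficient, and the 18% reference rate is the figure quoted on the slide.

```python
import math

# Made-up bills and tips in dollars (illustrative only).
bills = [10.0, 20.0, 30.0, 40.0]
tips = [2.0, 3.0, 6.0, 7.0]

REFERENCE_RATE = 0.18  # the 'average' tip rate quoted on the slide


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


r = pearson_r(bills, tips)

# Label each party by whether its tip exceeds 18% of its bill.
labels = [
    "above" if tip > REFERENCE_RATE * bill else "below"
    for bill, tip in zip(bills, tips)
]

print(f"r = {r:.3f}")
for bill, tip, label in zip(bills, tips, labels):
    print(f"bill={bill:.2f} tip={tip:.2f} -> {label} 18%")
```

On a scatterplot of tip against bill, the same comparison corresponds to drawing the line tip = 0.18 × totbill and seeing which points fall on either side of it.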
  • 7. ‘Conditioning’ or ‘drilling down’ to explore more complex relationships within the data.
  • 8. ‘Conditioning’ or ‘drilling down’ to explore more complex relationships within the data: smokers round their tips more.
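Conditioning can be sketched as splitting the records by one or more categorical variables and recomputing a summary within each group. The example below is illustrative only: the rows are made up, the choice of sex and smoker as conditioning variables is an assumption about what the slides plotted, and `rounded_share_by` is a hypothetical helper. It measures, per group, the share of tips that fall on a multiple of 50 cents.

```python
from collections import defaultdict

# Made-up records; tips are held in integer cents. These values are
# illustrative and do not reproduce the real dataset or its findings.
rows = [
    {"sex": "F", "smoker": "No", "tip_cents": 348},
    {"sex": "F", "smoker": "No", "tip_cents": 300},
    {"sex": "M", "smoker": "No", "tip_cents": 215},
    {"sex": "M", "smoker": "No", "tip_cents": 400},
    {"sex": "F", "smoker": "Yes", "tip_cents": 300},
    {"sex": "F", "smoker": "Yes", "tip_cents": 250},
    {"sex": "M", "smoker": "Yes", "tip_cents": 500},
    {"sex": "M", "smoker": "Yes", "tip_cents": 310},
]


def rounded_share_by(rows, keys):
    """Share of tips on a 50-cent multiple within each group defined by keys."""
    groups = defaultdict(list)
    for r in rows:
        group = tuple(r[k] for k in keys)
        groups[group].append(r["tip_cents"] % 50 == 0)
    return {g: sum(flags) / len(flags) for g, flags in groups.items()}


by_smoker = rounded_share_by(rows, ["smoker"])
by_both = rounded_share_by(rows, ["sex", "smoker"])

for group, share in sorted(by_both.items()):
    print(group, f"{share:.0%}")
```

Adding a key to `keys` drills down one level further, at the cost of fewer records per group, so small groups should be read with caution.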
  • 9. Beyond ‘toy’ datasets
    • More complex methods
    • Classification (Supervised/Unsupervised)
    • Machine Learning
    • Statistical techniques
    • Visualisation
  • 10. Acknowledgements Supported by Science Foundation Ireland under grant 08/HEC/I1450 and by HEA’s PRTLI-C4.
