Your SlideShare is downloading. ×
  • Like
R & Data mining in action
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×

Now you can save presentations on your phone or tablet

Available for both IPhone and Android

Text the download link to your phone

Standard text messaging rates apply

R & Data mining in action

  • 197 views
Published

Presentation from workshop "R & Data mining in action" given at JDD 2013. …

Presentation from workshop "R & Data mining in action" given at JDD 2013.
Code samples with description (in Polish): https://gist.github.com/kmrowca/public

Published in Technology , Education
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
    Be the first to like this
No Downloads

Views

Total Views
197
On SlideShare
0
From Embeds
0
Number of Embeds
1

Actions

Shares
Downloads
10
Comments
0
Likes
0

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide
  • Przykład z kodem pocztowym i numerem telefonu

Transcript

  • 1. R & data mining in action Katarzyna Mrowca
  • 2. Sztuka czytania między wierszami czyli język R i Data Mining w akcji
  • 3. <me> Katarzyna Mrowca </me>
  • 4. The deal 
  • 5. Agenda • Quick glance on theory - Data mining • Exercises on… paper • Quick glance on tool – R console • Exercises – became friend with R •…
  • 6. Agenda • Quick glance on theory - Data mining • Exercises on… paper • Quick glance on tool – R console • Exercises – became friend with R •… Theory Exercise
  • 7. Agenda • Quick glance on theory - Data preparation • Exercises • Regression • Time series • Decision trees • Cluser analysis Theory • Text mining •… Exercise
  • 8. Quick glance on theory!
  • 9. What data mining is?
  • 10. What „google” says?
  • 11. What „google” says? Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science,
  • 12. What „google” says? Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science, is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics.
  • 13. What „google” says? Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science, is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics.
  • 14. What „google” says? Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science, is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics.
  • 15. What „google” says? Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science, is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics.
  • 16. What „google” says? Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science, is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics.
  • 17. What „google” says? The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use.
  • 18. What „google” says? The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use.
  • 19. What „google” says? The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use.
  • 20. What „google” says? Aside from the raw analysis step, it involves database and data management aspects, data pre-processing, model and inference considerations, interestingness metrics, complexity considerations, post-processing of discovered structures, visualization, and online updating. Source: wikipedia
  • 21. Data mining – what is „inside” • Predictive • Regression • Classification • Collaborative Filtering • Descriptive • Clustering / similarity matching • Association rules and variants • Deviation detection
  • 22. Data mining – what is „inside” • Predictive: • Regression • Classification • Collaborative Filtering • Descriptive: • Clustering / similarity matching • Association rules and variants • Deviation detection
  • 23. Data mining – what is „inside” • Predictive: • Regression • Classification • Collaborative Filtering • Descriptive: • Clustering / similarity matching • Association rules and variants • Deviation detection
  • 24. What data mining is not?
  • 25. Why Data Mining is so popular?
  • 26. What is a difference between statistics and data mining?
  • 27. Data preparation
  • 28. Variables
  • 29. Qualitative & Quantitative
  • 30. Tame R console!
  • 31. NetBeans + R Source: https://blogs.oracle.com/geertjan/entry/r_plugin_for_netbeans_ide
  • 32. RHIPE <– R+ Hadoop Find out more: http://www.datadr.org/
  • 33. Revolution Analytics <- R + Hadoop + Enterprise Find out more: http://www.revolutionanalytics.com
  • 34. Take a break 
  • 35. Regression
  • 36. Time series
  • 37. Decision trees
  • 38. Regression trees
  • 39. Classification trees
  • 40. K means
  • 41. Text mining
  • 42. Thank you!