Visual data mining with HeatMiner

2,058 views

Published on

Large data sets comprising multiple correlating attributes may include phenomena hard to identify and understand using traditional data analysis and visualization methods. HeatMiner is a new visual data mining technology which visualizes the data as three-dimensional heatmaps. Even complex patterns missed by other methods are easy to recognize from 3D-heatmaps with a single glance. Go and try HeatMiner with your own data at the Cloud’N’Sci.fi Algorithms-as-a-Service marketplace!

Published in: Technology
0 Comments
2 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
2,058
On SlideShare
0
From Embeds
0
Number of Embeds
28
Actions
Shares
0
Downloads
0
Comments
0
Likes
2
Embeds 0
No embeds

No notes for slide

Visual data mining with HeatMiner

  1. 1. CLOUD ’ .fi Algorithms-as-a-Service cloudnsci.fi N’ HeatMiner® is a registered trademark of Agience Oy Ltd by Pauli Misikangas, the CEO of Cloud’N’Sci Ltd & Agience Oy Ltd
  2. 2. CLOUD ’ .fi Algorithms-as-a-Service cloudnsci.fi N’ These two images are identical. Copyright © CloudNSci Ltd 2011 2
  3. 3. CLOUD ’ .fi Algorithms-as-a-Service cloudnsci.fi N’ These histograms prove it! 80000 80000 100000 Number of Pixels Number of Pixels Number of Pixels 80000 60000 60000 60000 40000 40000 40000 20000 20000 20000 0 0 0 Red Color Intensity Green Color Intensity Blue Color IntensityDATA 1 ANALYSIS 1DATA 2 ANALYSIS 2 80000 80000 100000 Number of Pixels Number of Pixels Number of Pixels 80000 60000 60000 60000 40000 40000 40000 20000 20000 20000 0 0 0 Red Color Intensity Green Color Intensity Blue Color Intensity Copyright © CloudNSci Ltd 2011 3
  4. 4. CLOUD ’ .fi Algorithms-as-a-Service cloudnsci.fi N’ It could be Your Business Data... 80000 80000 100000 Q1 60000 60000 80000 2011 Count 60000 Count Count 40000 40000 40000 20000 20000 20000 0 0 0 Business Attribute 1 Business Attribute 2 Business Attribute 3DATA 1 ANALYSIS 1DATA 2 ANALYSIS 2 80000 80000 100000 80000 Q2 60000 60000 Count 60000 Count Count 2011 40000 40000 40000 20000 20000 20000 0 0 0 Business Attribute 1 Business Attribute 2 Business Attribute 3 Copyright © CloudNSci Ltd 2011 4
  5. 5. CLOUD’ .fi Algorithms-as-a-Service cloudnsci.fi N’ Invalid View → Wrong Conclusions• Over-simplified view to complex data can lead to wrong conclusions and crucial mistakes – Nearly all real-world data is complex! – Most business reports are too simple • Histograms, pie charts, average trends, ... – Should be used only for 1-dimensional data • Scatter plots, bubble diagrams, ... – Can view 2-3 dimensional data, but often messy• What if my data has 3+ columns? Copyright © CloudNSci Ltd 2011 5
  6. 6. CLOUD ’ .fi Algorithms-as-a-Service cloudnsci.fi N’ Visual Data Mining with HeatMiner® Data as it really is Colors can be used as the fourth dimension The 3D shape or to ease indicates interpretationfrequent value combinations Three selected attributes define the point of view Copyright © CloudNSci Ltd 2011 6
  7. 7. CLOUD ’ .fi Algorithms-as-a-Service cloudnsci.fi N’ Let’s try again...DATA 1 ANALYSIS 1DATA 2 ANALYSIS 2 Copyright © CloudNSci Ltd 2011 7
  8. 8. CLOUD ’ .fi Algorithms-as-a-Service cloudnsci.fi N’ (x,y,z) 4D Heatmap Colors indicates the average of observations The target column defining heatmap colors Copyright © CloudNSci Ltd 2011 8
  9. 9. CLOUD’ .fi Algorithms-as-a-Service cloudnsci.fi N’ Customer Survey Heatmap Average=7 Average=1 HeatMiner automatically selects 3 columns as the optimal view to explain the target Copyright © CloudNSci Ltd 2011 9
  10. 10. CLOUD ’ .fi Algorithms-as-a-Service cloudnsci.fi N’ Cluster Heatmap The selected view separates groups clearly HeatMiner selects the optimal view Copyright © CloudNSci Ltd 2011 10
  11. 11. CLOUD ’ .fi Algorithms-as-a-Service cloudnsci.fi N’ SWOT Heatmap Opportunity Strength Protect this! Threat Develop this! Weakness HeatMiner selects the optimal view Copyright © CloudNSci Ltd 2011 11
  12. 12. CLOUD ’ .fi Algorithms-as-a-Service cloudnsci.fi N’ Difference Heatmap Red points dominate More blue here points than red points What is the difference between these two data sets? (red vs blue) Copyright © CloudNSci Ltd 2011 12
  13. 13. CLOUD ’ .fi Algorithms-as-a-Service cloudnsci.fi N’ Click Heatmap The more clicks, the deeper hole! Copyright © CloudNSci Ltd 2011 13
  14. 14. CLOUD ’ .fi Algorithms-as-a-Service cloudnsci.fi N’ Geographic Heatmap Smoothing makes even sparse data look good! Copyright © CloudNSci Ltd 2011 14
  15. 15. CLOUD ’ .fi Algorithms-as-a-Service cloudnsci.fi N’ Confidence Heatmaps Average is only part of the Truth – check also variance to reveal the risks Height indicates the average of Z(X,Y)→ ZAVERAGE &VARIANCE Color indicates the variance of Z Copyright © CloudNSci Ltd 2011 15
  16. 16. CLOUD ’ .fi Algorithms-as-a-Service cloudnsci.fi N’ Available at the Cloud’N’Sci.fiAlgorithms-as-a-Service Marketplace! Try HeatMiner for free! Full list of available solutions Copyright © CloudNSci Ltd 2011 16
  17. 17. CLOUD ’ .fi Algorithms-as-a-Service cloudnsci.fi N’http://cloudnsci.fi/wiki Case stories and 3D applet demos Copyright © CloudNSci Ltd 2011 17
  18. 18. CLOUD ’ .fi Algorithms-as-a-Service cloudnsci.fi N’ Copyright © CloudNSci Ltd 2011 18

×