Advanced Visualizations, Bijilash Babu


Published on

Marlabs, a Bangalore-based provider of IT services, is sponsoring a ‘Business Intelligence Technology’ conference at the Thiruvananthapuram Technopark on Friday.

The event will focus on emerging trends in Business Intelligence (BI) Technology, a Marlabs spokesman said.

It will feature eminent speakers from leading information technology companies including Marlabs, Infosys, UST Global, NeST and Kreara.

The conference will discuss latest developments in emerging BI areas such as predictive analytics, Big Data, mobile BI, social BI and advanced visualisations. It will also highlight the growing job opportunities for newly graduated software professionals in the Tier II and Tier III cities.

  • Be the first to comment

  • Be the first to like this

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

Advanced Visualizations, Bijilash Babu

  1. 1. Advanced VisualizationBijilash BabuTechnical ArchitectTechnology Development CentreNeST Softwarebijilash.babu@nestgroup.netSession on Emerging trends inBusiness Intelligence20 July 2012: Zenith HallBhavani, Technopark, Trivandrum
  2. 2. Roadmap• Big Data• Dimensionality• Current trends• Ordinary analytics• Applied mathematics• Advanced Technology20-Jul-12 NeST Controlled2
  3. 3. Big Data, It’s Visualization• Gartner’s definition of big data refers to high-volume, high-velocity and high-variety information assets that demand cost-effective, innovative forms ofinformation processing for enhanced insight and decision making.• Big Data is the convergence of three v’s: volume, velocity and variety .• Internet of things (with different sensors), CRM, social media, etc..• Improved use of Big Data could add X to the economy and create N jobs.• Volume of data keeps creeping, Decision makers would struggle..• Data visualisation is the key for better perception of the underlying facts.20-Jul-12 NeST Controlled/Confidential3
  4. 4. When big wasn’t that big20-Jul-12 NeST Controlled/Confidential4• Line graph• Stack graph• Categories Stack graphTrack rises and falls over time• Scatterplot• Matrix chart• Network DiagramSee relationships among data points• Bar chart• Block histogram• Bubble chartCompare a set of values• Pie Chart• Tree Map• Analyze a text• Word tree• WordleSee parts of a whole• MappingSee the world
  5. 5. Better Representation9876546765 987-654-676520-Jul-12 NeST Controlled/Confidential5Source:
  6. 6. Timeline20-Jul-12 NeST Controlled/Confidential6Source: The Economist
  7. 7. Better Representation20-Jul-12 NeST Controlled/Confidential7Source: The New York Times
  8. 8. The volcano20-Jul-12 NeST Controlled/Confidential8
  9. 9. Create your own visual20-Jul-12 NeST Controlled/Confidential9Source: www.wordle.netGeorge K. Thiruvathukal,Associate Editor in ChiefComputing in Science & EngineeringTag Cloud, NSF proposals
  10. 10. Create your own visual20-Jul-12 NeST Controlled/Confidential10Created in R with word-cloud package. Data from country population. Note that the proportional sizes of Chinaand India were reduced in half. Source:
  11. 11. What’s up..• Need to bring in better methods for extracting a smallerset of relevant data.• In fact, Big Data isn’t just about numbers or volume, butthe trends – how they change over time.• Visualization is an invaluable tool in identifying trendswithin massive data sets.• Spotting anomalies as well as outliers.20-Jul-12 NeST Controlled/Confidential11
  12. 12. Going around Big Data• With the exponential growth in data acquisition andgeneration.• High-resolution sensors, automated accumulation...• More disk space and more CPU cycles!• But, there are couple of walls around the CPU,• and GPUs come into picture!20-Jul-12 NeST Controlled/Confidential12
  13. 13. Calling in Maths• Scientific data analysis techniques• Numerical Linear Algebra• SVD - The prize, compression• PCA/ NLPCA – to reduce the dimensionality, feature extractionLatent Semantic Indexing (LSI)• SVM - classification, regression, and anomaly detection.• SOM - neural network algorithm based on unsupervised learning20-Jul-12 NeST Controlled/Confidential13
  14. 14. Log plots• Response to skewness towards large values; i.e., cases in which one or afew points are much larger than the bulk of the data.• To show percent change or multiplicative factors.• Base of ten is useful when the data range over several orders of magnitude,a base of two is useful when the data have a smaller range20-Jul-12 NeST Controlled/Confidential14Source: Forbes
  15. 15. 20-Jul-12 NeST Controlled/Confidential15
  16. 16. Better Mixing20-Jul-12 NeST Controlled/Confidential16+ = ?
  17. 17. Visual data Mining20-Jul-12 NeST Controlled/Confidential17Source: S.J. Simoff et al. (Eds.): Visual Data Mining, LNCS 4404
  18. 18. Advanced technology20-Jul-12 NeST Controlled/Confidential18
  19. 19. Hans Rosling...20-Jul-12 NeST Controlled/Confidential19...What’s next?
  20. 20. Thank you!!!20-Jul-12 NeST Controlled/Confidential20