S ANAND   DATA SCIENTIST   GRAMENER.COMDATA VISUALISATIONPICTURES USING NUMBERS
You will be shown a set of numbersalong with a summary (average, etc)Can you make sense of the figures?                WHY...
DO THESE FOUR CITIES LOOK IDENTICAL TO YOU?Take a look at the sales          2010       Bangalore      Delhi       Hyderab...
ARE THEY REALLY IDENTICAL? CHECK AGAIN…But in fact, the four cities aretotally different in behaviour.Bangalore       sale...
DETECTING FRAUD                 “                     We know meter readings are                     incorrect, for variou...
This plot shows the frequency of all meter readings from  Why would                                                    Apr...
PREDICTING MARKS            What determines a child’s marks?            Do girls score better than boys?            Does t...
… and peaksBased on the results of the 20 lakh                                      for Sep-bornsstudents taking the Class...
FASTEST SCORERS          “              I’ve always been curious… who              among India’s prolific one-day         ...
INDIAN ODI BATTING
http://gramener.com/cricket
http://gramener.com/cricket
SECURITIES   FINDING PATTERNS             Which securities move together?             How should I diversify?             ...
68% correlation              between AUD & EURPlot of 6 month daily AUD - EUR values                    … that move       ...
VISUALISING CHANGE            What was the weather in India like…EDUCATION WEATHER     THE LAST 100 YEARS?
DASHBOARDS                “                    Today, we use a 40-page weekly                    report summarise our onli...
ASSET MANAGEMENT
COMPUTER USAGE
EXPLORING RELATIONS           This is the social network of programmers           across various Indian cities, using the ...
Bangalore           Chennai       PuneHyderabad           Mumbai        Delhi            http://gramener.com/codersearch
SIMPLEREDESIGNS
TIMING
GramenerA data analytics and visualisation companyWe handle terabyte-size data   via non-traditional analytics and visuali...
Pictures through Numbers, OpenDataCamp 2012 Bangalore
Pictures through Numbers, OpenDataCamp 2012 Bangalore
Pictures through Numbers, OpenDataCamp 2012 Bangalore
Pictures through Numbers, OpenDataCamp 2012 Bangalore
Pictures through Numbers, OpenDataCamp 2012 Bangalore
Pictures through Numbers, OpenDataCamp 2012 Bangalore
Pictures through Numbers, OpenDataCamp 2012 Bangalore
Upcoming SlideShare
Loading in …5
×

Pictures through Numbers, OpenDataCamp 2012 Bangalore

2,312 views

Published on

Published in: Design, Business, Travel
0 Comments
2 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
2,312
On SlideShare
0
From Embeds
0
Number of Embeds
570
Actions
Shares
0
Downloads
35
Comments
0
Likes
2
Embeds 0
No embeds

No notes for slide
  • Good evening. My name is Anand, and you can find more about me by googling for “S Anand”. My site is the first hit.I’ll be talking about recent trends in technology, and how you can leverage them.
  • Who’s the best Indian one-daybatsman? The size represents every run ever scored. The colour represents speed. Red is slow, green is fast.Sehwag’s very fast – but so was Kapil, especially for his time.
  • This is a drilldown, showing every single match they played.With this, you’ll be able to see who the consistent players are, and where exactly their runs came from.You can also click to see that particular match statistics.
  • We plotted that across a number of cities.Bangalore – dense network with a central connected component.Chennai and Pune – not so well connected, but not too bad either.The other cities barely have a network.And you can FEEL this if you visit the cities and talk to geeks.
  • Pictures through Numbers, OpenDataCamp 2012 Bangalore

    1. 1. S ANAND DATA SCIENTIST GRAMENER.COMDATA VISUALISATIONPICTURES USING NUMBERS
    2. 2. You will be shown a set of numbersalong with a summary (average, etc)Can you make sense of the figures? WHY VISUALISE?
    3. 3. DO THESE FOUR CITIES LOOK IDENTICAL TO YOU?Take a look at the sales 2010 Bangalore Delhi Hyderabad Mumbaireport alongside. A company Month Price Sales Price Sales Price Sales Price Saleshas branches in 4 cities, and Jan 10.0 8.04 10.0 9.14 10.0 7.46 8.0 6.58each branch changes the Feb 8.0 6.95 8.0 8.14 8.0 6.77 8.0 5.76product price every month. Mar 13.0 7.58 13.0 8.74 13.0 12.74 8.0 7.71This leads to a corresponding Apr 9.0 8.81 9.0 8.77 9.0 7.11 8.0 8.84change in the sales. May 11.0 8.33 11.0 9.26 11.0 7.81 8.0 8.47 Jun 14.0 9.96 14.0 8.10 14.0 8.84 8.0 7.04Here is the performance of Jul 6.0 7.24 6.0 6.13 6.0 6.08 8.0 5.25the 4 branches with their Aug 4.0 4.26 4.0 3.10 4.0 5.39 19.0 12.50monthly price and sales for Sep 12.0 10.84 12.0 9.13 12.0 8.15 8.0 5.56each month. Oct 7.0 4.82 7.0 7.26 7.0 6.42 8.0 7.91 Nov 5.0 5.68 5.0 4.74 5.0 5.73 8.0 6.89Looking at the average, the Average 9.0 7.50 9.0 7.50 9.0 7.50 9.0 7.50four branches have an Variance 10.0 3.75 10.0 3.75 10.0 3.75 10.0 3.75identical performance. The average price is the same. The average sales is the same too.DO YOU AGREE? The variance in price is the same. is the variance in sales. So
    4. 4. ARE THEY REALLY IDENTICAL? CHECK AGAIN…But in fact, the four cities aretotally different in behaviour.Bangalore sales hasgenerally increased withprice.Hyderabad has a nearlyperfect increase in sales withprice, except for oneaberration.Delhi shows a decline insales beyond a price of 10.Mumbai’s sales fluctuatesdespite a nearly constantprice.
    5. 5. DETECTING FRAUD “ We know meter readings are incorrect, for various reasons. We don’t, however, have the concrete proof we need to start the process of meter readingENERGY UTILITY automation. Part of our problem is the volume of data that needs to be analysed. The other is the inexperience in tools or analyses to identify such patterns.
    6. 6. This plot shows the frequency of all meter readings from Why would Apr-2010 to Mar-2011. An unusually large number ofthese happen? readings are aligned with the tariff slab boundaries.This clearly shows Apr-10 May-10 Jun-10 Jul-10 Aug-10 Sep-10 Oct-10 Nov-10 Dec-10 Jan-11 Feb-11 Mar-11collusion of some form 217 219 200 200 200 200 200 200 200 350 200 200with the customers. 250 200 200 200 201 200 200 200 250 200 200 150 250 150 150 200 200 200 200 200 200 200 200 150This happens with specific 150 200 200 200 200 200 200 200 200 200 200 50customers, not randomly. 200 200 200 150 180 150 50 100 50 70 100 100Here are such customers’ 100 100 100 100 100 100 100 100 100 100 110 100 100 150 123 123 50 100 50 100 100 100 100 100meter readings. 0 111 100 100 100 100 100 100 100 100 50 50 0 100 27 100 50 100 100 100 100 100 70 100If we define the “extent of 1 1 1 100 99 50 100 100 100 100 100 100fraud” as the percentageexcess of the 100 unitmeter reading, Section Apr-10 May-10 Jun-10 Jul-10 Aug-10 Sep-10 Oct-10 Nov-10 Dec-10 Jan-11 Feb-11 Mar-11the value varies Section 1 70% 97% 136% 65% 110% 116% 121% 107% 114% 88% 74% 109%considerably Section 2 66% 92% New section 66% 87% 70% 64% is … and 63% 50% 58% 38% 41% 54% manager arrives transferred50% outacross sections, Section 3 90% 46% 47% 43% 28% 31% 32% 19% 38% 8% 34% Section 4 44% 24% 36% 39% 21% 18% 24% 49% 56% 44% 31% 14%and time Section 5 4% 63% -27% 20% 41% 82% 26% 34% 43% 2% 37% 15% Section 6 18% 23% 30% 21% 28% 33% 39% 41% 39% 18% 0% 33%… with some Section 7 36% 51% 33% 33% 27% 35% 10% 39% 12% 5% 15% 14%explainable Section 8 22% 21% 28% 12% 24% 27% 10% 31% 13% 11% 22% 17%anomalies. Section 9 19% 35% 14% 9% 16% 32% 37% 12% 9% 5% -3% 11%
    7. 7. PREDICTING MARKS What determines a child’s marks? Do girls score better than boys? Does the choice of subject matter?EDUCATION Does the medium of instruction matter? Does community or religion matter? Does their birthday matter? Does the first letter of their name matter?
    8. 8. … and peaksBased on the results of the 20 lakh for Sep-bornsstudents taking the Class XII exams The marksat Tamil Nadu over the last 3 years, shoot up for Aug bornsit appears that the month you wereborn in can make a difference of asmuch as 120 marks out of 1,200. 120 marks out of 1200 explainable by month of birth June borns score the lowest An identical pattern was observed in 2009 and 2010…“It’s simply that in Canada the eligibilitycutoff for age-class hockey is January 1. Aboy who turns ten on January 2, then,could be playing alongside someone whodoesn’t turn ten until the end of the year—and at that age, in preadolescence, atwelve-month gap in age represents anenormous difference in physical maturity.” -- Malcolm Gladwell, Outliers … and across districts, gender, subjects, and class X & XII.
    9. 9. FASTEST SCORERS “ I’ve always been curious… who among India’s prolific one-day run-getters had the best strike rate? Sachin?CRICKET Sehwag? What about the rest of the world?
    10. 10. INDIAN ODI BATTING
    11. 11. http://gramener.com/cricket
    12. 12. http://gramener.com/cricket
    13. 13. SECURITIES FINDING PATTERNS Which securities move together? How should I diversify? What should I sell to reduce risk? What’s a reliable predictor of a security?
    14. 14. 68% correlation between AUD & EURPlot of 6 month daily AUD - EUR values … that move counter-cyclically to indices Block of correlated currencies … clustered hierarchically
    15. 15. VISUALISING CHANGE What was the weather in India like…EDUCATION WEATHER THE LAST 100 YEARS?
    16. 16. DASHBOARDS “ Today, we use a 40-page weekly report summarise our online operations. This is prepared by a team of 6 analysts pulling data from multiple sources – both online and offline. We distribute it to the entire senior management team. I’m fairly sure they don’t read it.WEB ANALYTICS
    17. 17. ASSET MANAGEMENT
    18. 18. COMPUTER USAGE
    19. 19. EXPLORING RELATIONS This is the social network of programmers across various Indian cities, using the follower network at Github.com – a Facebook for developers. Each circle represents a coder. The size shows their number of followers. The colour shows the language they develop in.NETWORKS The lines show whom they follow.
    20. 20. Bangalore Chennai PuneHyderabad Mumbai Delhi http://gramener.com/codersearch
    21. 21. SIMPLEREDESIGNS
    22. 22. TIMING
    23. 23. GramenerA data analytics and visualisation companyWe handle terabyte-size data via non-traditional analytics and visualise it in real-time. Gramener visualises Gramener transforms your data into concise dashboards that make your business problem & solution visually obvious. your data We help you find insights quickly, based on cognitive research, and our visualisations guide you towards actionable decisions.

    ×