The document presents a data visualization challenge that asks the user 3 questions about a dataset within time limits, then repeats the challenge with simple visual cues to answer more quickly. It demonstrates how visualizing data can help identify patterns and insights more easily and quickly than just looking at the raw numbers. Visualizing data allows for consistent interpretation and conclusions to be drawn from the same dataset.
How do you cut the Big Data clutter and tell interesting, insightful and impacting stories? This session talks about the need for Data Visualization & how Visual stories can come to the aid of the Big Data problem associated with meaningful consumption. The point is illustrated by leveraging several industry case studies.
Introduction to Data Storytelling | Rasagy Sharma - GramenerGramener
Rasagy Sharma, Gramener's Principal Information Designer, created this PPT to introduce data storytelling to the students at Symbiosis Institute of Business Management (SIBM).
The slide talks about how to create data stories and the impact of data stories on businesses.
Rasagy Sharma also teaches data storytelling to analysts and data scientists. Check out more about Gramener's data storytelling workshop at https://gramener.com/data-storytelling-workshop
Check out more solutions on data storytelling at https://gramener.com
Connect with Rasagy on:
LinkedIn: https://in.linkedin.com/in/rasagy
Twitter: @rasagy
How do you cut the Big Data clutter and tell interesting, insightful and impacting stories? This session talks about the need for Data Visualization & how Visual stories can come to the aid of the Big Data problem associated with meaningful consumption. The point is illustrated by leveraging several industry case studies.
Introduction to Data Storytelling | Rasagy Sharma - GramenerGramener
Rasagy Sharma, Gramener's Principal Information Designer, created this PPT to introduce data storytelling to the students at Symbiosis Institute of Business Management (SIBM).
The slide talks about how to create data stories and the impact of data stories on businesses.
Rasagy Sharma also teaches data storytelling to analysts and data scientists. Check out more about Gramener's data storytelling workshop at https://gramener.com/data-storytelling-workshop
Check out more solutions on data storytelling at https://gramener.com
Connect with Rasagy on:
LinkedIn: https://in.linkedin.com/in/rasagy
Twitter: @rasagy
"It is frequently called the gatekeeper subject. It is used by professionals ranging from electricians to architects to computer scientists. It is no less than a civil right." - Robert Moses, founder of the Algebra Project.
O que é multiplicação?
Multiplicação é a forma usada pela matemática para expressar aumento de quantidades dobradas, triplicadas, quadruplicadas e assim por diante.
Duas vezes quatro caixas de maçãs 2x4= 8
Horses for Courses: Deep Learning Beyond Niche ApplicationsNikita Johnson
Machine learning in general and deep learning in particular are driving major advances for a wide range of specific finance use cases. This talk will outline how enterprise-wide learning loops will extend these point success to a coherent AI strategy and also show what other elements are required for success, using real-world examples at Prudential plc.
Watch the presentation here: http://videos.re-work.co/events/36-deep-learning-in-finance-summit-london-2018
A tool-agnostic overview of how to analyse and explore data in a systematic way. This talk covers metadata generation, univariate analysis, and the basics of bivariate analysis.
The talk also provides examples of natural power law distributions (scale-free networks.)
"It is frequently called the gatekeeper subject. It is used by professionals ranging from electricians to architects to computer scientists. It is no less than a civil right." - Robert Moses, founder of the Algebra Project.
O que é multiplicação?
Multiplicação é a forma usada pela matemática para expressar aumento de quantidades dobradas, triplicadas, quadruplicadas e assim por diante.
Duas vezes quatro caixas de maçãs 2x4= 8
Horses for Courses: Deep Learning Beyond Niche ApplicationsNikita Johnson
Machine learning in general and deep learning in particular are driving major advances for a wide range of specific finance use cases. This talk will outline how enterprise-wide learning loops will extend these point success to a coherent AI strategy and also show what other elements are required for success, using real-world examples at Prudential plc.
Watch the presentation here: http://videos.re-work.co/events/36-deep-learning-in-finance-summit-london-2018
A tool-agnostic overview of how to analyse and explore data in a systematic way. This talk covers metadata generation, univariate analysis, and the basics of bivariate analysis.
The talk also provides examples of natural power law distributions (scale-free networks.)
1.HISTOGRAM & FREQUENCY CURVE
2.HISTOGRAM & FREQUENCY POLYGON
3.ARITHMETIC MEAN
4.GEOMETRIC MEAN
5.HARMONIC MEAN
6.MEDIAN
7.MODE
8.QAURTILE
9.DECILE
10.PERCENTILE
11.OGIVE : LESS THAN
12.OGIVE : MORE THAN
13.QUARTILE DEVIATION
14.MEAN DEVIATION
15.MEDIAN DEVIATION
This slide file was presented in ISSCC 2015. This paper shows a padless chip concept which enables CMOS chip to be tested in bare chip state.
This padless concept can be applied to sensors, memories, etc to reduce the extra system cost.
Cryptography and network Security
The Data Encryption Standard (DES) is a symmetric-key block cipher published by the National Institute of Standards and Technology (NIST).
6 Methods to Improve Your Manufacturing Process with Computer VisionGramener
Computer vision is a technology that enables computers to interpret and comprehend visual information from their surroundings, and it has the potential to transform the manufacturing industry. Manufacturers can improve their processes in a variety of ways by using computer vision, from ensuring quality control and optimizing production to inspecting and measuring products and monitoring machinery.
In this presentation you will find out 6 methods how you can improve your manufacturing process with computer vision.
Download our E-book
bit.ly/ebookcomputervision
Detecting Manufacturing Defects with Computer VisionGramener
Computer vision is the field of artificial intelligence that deals with the ability of computers to interpret and understand visual data from the world around them. In the manufacturing industry, computer vision can be used to detect defects in products as they are being produced. This can help to improve the quality of the final product and reduce the cost of rework or recalls.
In this presentation you will find out the use of computer vision for defect detection in manufacturing which aids in improving the efficiency and effectiveness of the production process, leading to higher quality products and lower costs.
Book a discovery call
https://reachus.gramener.com/damage-detection/
How to Identify the Right Key Opinion Leaders (KOLs) in Pharma & HealthcareGramener
Find out the importance of KOLs (Key Opinion leaders) in the Pharma industry and everything you need to know about them.
In the presentation, we will show you who is a KOL in the Pharmaceutical Industry, what role they play and how to identify the right KOLs.
Book a free demo
https://gramener.com/demorequest/
Automated Barcode Generation System in ManufacturingGramener
Find out how automating barcode generation can improve the efficiency of your company's operations.
In the presentation, we will show you how barcodes play a significant role in enabling accurate inventory control and real-time stock information and how businesses can reduce 67% of their time in handling label standards.
Get a Free BarGen Demo.
https://gramener.com/demorequest/
#barcode #lowcode
The Role of Technology to Save BiodiversityGramener
Find out what are the major challenges biodiversity is facing such as deforestation, species endangerment, and poaching.
In the presentation, we will show you how some of the major technology and nature conservation organizations are building innovative solutions to protect our biodiversity.
Download this E-book to know how geospatial AI is impacting biodiversity conservation and sustainable development.
https://info.gramener.com/geospatial-analytics-ai-solutions-esg-sector-ebook
Enable Storytelling with Power BI & Comicgen PluginGramener
Gramener’s Lead Data Consultant Mrinal Ghosh and Principal Information Designer Richie Lionell conducted an exciting webinar on Power BI Comicgen.
In this webinar, they talk about the Comicgen Power BI plugin and how to use it to generate compelling comic data stories.
Who should watch: If you're a Power BI Developer, Consultant, or anyone who often develops data graphics on the Power BI dashboard.
Full webinar link: https://info.gramener.com/storytelling-with-power-bi-and-comicgen-plugin
Would you like to learn more about our Power BI capabilities? Check out: https://gramener.com/power-bi-consulting/
The Most Effective Method For Selecting Data Science ProjectsGramener
Ganes Kesari, Gramener's Head of Analytics & Co-Founder gives his insights on how to craft a data science roadmap that maximizes ROI.
The biggest reason why 80% of analytics projects fail is that they don’t solve the right problem. Asking analytics or data-related question is the worst way to initiate a data analytics project.
This webinar will walk you through how to get started in the most efficient way possible. You'll discover a straightforward step-by-step strategy to unlocking corporate value through industry examples.
Things you will learn from this webinar:
-The most common reasons for the failure of data science initiatives
-Identifying projects and prioritizing them
-Building a data science strategy in three easy steps
-Real-life examples are used to explain the approach
Watch this full webinar on: https://info.gramener.com/data-science-roadmap
To know more from our industry experts book a free demo at: https://gramener.com/demorequest/
Low Code Platform To Build Data & AI ProductsGramener
Gramener's CEO, Anand S conducted this webinar where he explained how to build Data and AI products using a low-code platform in less than two weeks.
Few takeaways:
-How low-code approaches can be tailored to your data/digital needs?
-Decisions on Building vs. Buying
-Production-ready use cases to stimulate your thinking
Who should watch?
You will find this webinar to be valuable if you're a CPO, VP IT, handling product development, or building analytical solutions for your company.
Watch this full webinar on: https://info.gramener.com/low-code-platform-to-build-process-optimization-solutions?
Want to know more about our low-code platform, Gramex?
Visit: https://gramener.com/gramex/
5 Key Foundations To Build An Effective CX ProgramGramener
Gramener's VP of Analytics Amit Garg hosted this webinar and talked about what are the principles of a good customer experience program, and why is it important.
This webinar will be beneficial to leaders in the CMO, CCO, Customer Service, and any other customer-facing departments within a firm.
Pain points discussed:
-You'll be able to assess the level of CX maturity in your company.
-You'll learn the high-level steps to creating a successful CX program.
-You'll figure out what tools you'll need to improve your talents.
To watch the full webinar visit: https://info.gramener.com/5-key-foundations-effective-cx-program
Learn more about CX Analytics: https://gramener.com/customer-experience-analytics/
Using Power BI To Improve Media Buying & Ad PerformanceGramener
Gramener's Senior Lead Data Consultant, Sidharth Parameswaran, and Navya Sri Channamsetty, Gramener's Associate Lead Data Science Engineer conducted this joint webinar session.
Pain points discussed:
-Actual vs. planned results of a campaign
-Competitor Evaluation & Comparison
-Modeling of Media Mix
-Metrics assessed across the Agency, Client, and Brand levels
- Genre/Channel Performance Evaluation
Things you will learn:
1. Power BI may be used in a variety of ways to investigate findings.
2. Various dashboards would be used to analyze ad/program performance.
3. How can you help your clients obtain higher ROI and acquire a competitive advantage?
Do join us if you are a:
Power BI Developer, Media buyer, Campaign Manager, Brand Manager, Consultant, etc.
To watch this webinar visit: https://info.gramener.com/power-bi-media-buying-ad-performance
Learn more about Gramener: https://gramener.com/
This webinar was hosted by Gramener's CEO/Co-Founder, Anand S, and Ganes Kesari, Head of Analytics/Co-Founder on how data can help firms recover quickly throughout the recession and recovery period.
Who should watch this webinar :
Analytics Leaders, Business Leaders, CDOs, CTOs, etc.
Few takeaways :
-Which aspects of your company could benefit the most from a data-driven response?
-A strategy for identifying use cases that will provide the most value for the money.
How to use data in creative ways to uncover new market opportunities and customers.
Objectives :
-Data's utility in COVID situation
-How data science may assist you in navigating the recession
-Gramener's industry case studies to assist businesses in responding to COVID-19
Full Webinar: https://info.gramener.com/recession-proofing-your-business-with-data
To know more from industry leaders visit our official website: https://gramener.com/
Engage Your Audience With PowerPoint Decks: WebinarGramener
Gramener's CEO and Co-Founder Anand S hosted a webinar on how interactive PowerPoint decks can engage your audiences.
Pain points discussed in this webinar :
-How to utilize interactive slides to answer business questions like "Where is the problem?" and "What created this problem?"
-What forms of interactivity does PowerPoint offer, and when should you utilize each?
-What tools and plug-ins can aid in the creation of interactive presentations?
Watch the full webinar on: https://info.gramener.com/interactive-powerpoint-for-operations
Book a free demo to know more about Gramener's solutions: https://gramener.com/demorequest/
Structure Your Data Science Teams For Best OutcomesGramener
Gramener's Head of Analytics, Ganes Kesari conducted this webinar and discussed the following points :
-Why do data analytics and visualization initiatives require teams to work in silos?
-What are the best organizational structures for data science?
-As your data journey progresses, how should the organizational structure evolve?
-Best methods for encouraging team collaboration in data projects
This is a unique webinar designed for Executives, Chief Analytics Officers, Heads of Analytics, Directors, Technology Leaders, and Managers that work with data science teams on a daily basis.
To check out the full webinar visit: https://info.gramener.com/data-science-teams-structure-for-best-outcomes
To contact us & book a free demo visit: https://gramener.com/demorequest/
Gramener's Lead Data Scientist Soumya Ranjan and Senior Data Science Engineer Sumedh Ghatage conducted a webinar on Geospatial AI.
In this webinar, they discussed the technical know-how to get started, as well as some strategies for navigating this fascinating realm of Geospatial Analytics.
Pain points covered :
-How to begin with Geospatial Analytics in Python
-How can large-scale geospatial datasets be cleaned and analyzed?
-What is the best way to design geospatial workflows?
-How to use Geospatial Datasets for Deep Learning?
No matter whatever industry you're in, Geospatial Analytics will provide you with a wealth of unique solutions.
To watch the full webinar visit: https://info.gramener.com/geospatial-ai-technical-sneak-peek
To know more about Gramener's Geospatial AI solutions book a free demo on: https://gramener.com/demorequest/
5 Steps To Become A Data-Driven Organization : WebinarGramener
Gramener's Chief Data Scientist and Co-founder Ganes Kesari conducted an interesting webinar that will give you an idea of how to analyze your data maturity and plan the five steps to transforming your business using data.
Who should watch this webinar?
Executives, Chief Data/Analytics Officers, Technology leaders, Business heads, Directors, and Managers.
Important points discussed on the webinar:
-The majority of businesses reach a halt in the middle of their data journey.
-According to Gartner, approximately 87% of companies in the business have a poor degree of data maturity (levels 1 and 2 on a scale of 5).
-Adding more data science projects to your portfolio will not boost your talents or results. The truth is that CDOs' primary issues are divided into five categories.
Learnings from this webinar:
-Data Science Maturity. What is it and why is it important?
-How can you determine the maturity of data science and its limitations?
-How does data science maturity (described with an example) assist your business in progressing?
Watch the full webinar on:
https://info.gramener.com/5-steps-to-transform-into-data-driven-organization
To know more about Data Maturity visit:
https://gramener.com/data-maturity/#
5 Steps To Measure ROI On Your Data Science Initiatives - WebinarGramener
Gramener's Chief Decision Scientist & Co-Founder Ganes Kesari conducted an exciting webinar on how to measure ROI on your data science initiatives.
In this webinar people from the C-suite level CEO, COO, Directors, Managers across various industries joined.
Ganes Kesari covered the following points with industry examples:
-Identifying business use cases with a high impact
-Choosing effective success indicators
-Ascertaining that the consequences may be traced back to your data project
The attendees had a good time. Learnings from the webinar:
-Why do businesses struggle to get a return on their data investments?
-A straightforward framework for calculating the return on investment from your data projects
-Benchmarking of typical payback from data initiatives in the industry
To check out the complete recording of the webinar please visit:
https://info.gramener.com/5-steps-to-measure-roi-on-your-data-science-initiatives
To know more about data advisory check out:
https://gramener.com/advisory-consulting/
Saving Lives with Geospatial AI - Pycon Indonesia 2020Gramener
There’s a powerful way to fight dengue. Infect a mosquito with Wolbachia, release it in highly populated regions, and wait for it to infect all mosquitoes in the region.
But this process is expensive, and we need to release it in the most densely populated regions in a city.
And no one really knows what population density is at a 100m x 100m level.
Can we use satellite imagery and use this to identify building density?
Driving Transformation in Industries with Artificial Intelligence (AI)Gramener
Gramener's Director of Delivery, Priyaranjan Mohanty, delivered a virtual session at IIM Nagpur and talked about how organizations are moving towards digital transformation by leveraging advanced technologies such as Artificial Intelligence (AI), Machine Learning (ML) and Deep Learning.
Starting from what the industry leaders think about digital transformation to how it can shape the global economy, this presentation explores the scope of AI in the digital world.
The Art of Storytelling Using Data ScienceGramener
Gramener's VP - Sales, APAC Region, Vijayam Sirikonda interacted with the students of IIM Raipur and talked about the importance of data storytelling for business users.
Storyfying your Data: How to go from Data to Insights to StoriesGramener
Gramener's Director - Client success, Shravan Kumar A, delivered an online session to the students of Praxis Business School.
In his session he talked about how converting data into stories can benefit businesses and enable quick decision making. Furthermore, he shared approaches to create data stories along with some use cases and case studies we solved at Gramener to benefit our clients.
Check out our initiative to teach data storytelling to data scientists and analysts so that they can think out of the box and create wonderful data stories for their stakeholders: https://gramener.com/data-storytelling-workshop
As Europe's leading economic powerhouse and the fourth-largest hashtag#economy globally, Germany stands at the forefront of innovation and industrial might. Renowned for its precision engineering and high-tech sectors, Germany's economic structure is heavily supported by a robust service industry, accounting for approximately 68% of its GDP. This economic clout and strategic geopolitical stance position Germany as a focal point in the global cyber threat landscape.
In the face of escalating global tensions, particularly those emanating from geopolitical disputes with nations like hashtag#Russia and hashtag#China, hashtag#Germany has witnessed a significant uptick in targeted cyber operations. Our analysis indicates a marked increase in hashtag#cyberattack sophistication aimed at critical infrastructure and key industrial sectors. These attacks range from ransomware campaigns to hashtag#AdvancedPersistentThreats (hashtag#APTs), threatening national security and business integrity.
🔑 Key findings include:
🔍 Increased frequency and complexity of cyber threats.
🔍 Escalation of state-sponsored and criminally motivated cyber operations.
🔍 Active dark web exchanges of malicious tools and tactics.
Our comprehensive report delves into these challenges, using a blend of open-source and proprietary data collection techniques. By monitoring activity on critical networks and analyzing attack patterns, our team provides a detailed overview of the threats facing German entities.
This report aims to equip stakeholders across public and private sectors with the knowledge to enhance their defensive strategies, reduce exposure to cyber risks, and reinforce Germany's resilience against cyber threats.
Explore our comprehensive data analysis project presentation on predicting product ad campaign performance. Learn how data-driven insights can optimize your marketing strategies and enhance campaign effectiveness. Perfect for professionals and students looking to understand the power of data analysis in advertising. for more details visit: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
10. DRAW FOCUS TO PRIORITIES
THIS IS ONE OF THE REASONS TO VISUALIZE DATA
11. 11
CRICKET
FASTEST SCORERS
“
I’ve always been curious… who
among India’s prolific one-day
run-getters had the best strike
rate?
Sachin?
Sehwag?
What about the rest of the world?
12. 12
LET’S TAKE ONE DAY CRICKET DATA
Country Player Runs ScoreRate MatchDate Ground Versus
Australia Michael J Clarke 99* 93.39 30-06-2010The Oval England
Australia Dean M Jones 99* 128.57 28-01-1985Adelaide Oval Sri Lanka
Australia Bradley J Hodge 99* 115.11 04-02-2007Melbourne Cricket Ground New Zealand
India Virender Sehwag 99* 99 16-08-2010Rangiri Dambulla International Stad. Sri Lanka
New Zealand Bruce A Edgar 99* 72.79 14-02-1981Eden Park India
Pakistan Mohammad Yousuf 99* 95.19 15-11-2007Captain Roop Singh Stadium India
West Indies Richard B Richardson 99* 70.21 15-11-1985Sharjah CA Stadium Pakistan
West Indies Ramnaresh R Sarwan 99* 95.19 15-11-2002Sardar Patel Stadium India
Zimbabwe Andrew Flower 99* 89.18 24-10-1999Harare Sports Club Australia
Zimbabwe Alistair D R Campbell 99* 79.83 01-10-2000Queens Sports Club New Zealand
Zimbabwe Malcolm N Waller 99* 133.78 25-10-2011Queens Sports Club New Zealand
Australia David C Boon 98* 82.35 08-12-1994Bellerive Oval Zimbabwe
Australia Graeme M Wood 98* 63.22 11-01-1981Melbourne Cricket Ground India
England Ian J L Trott 98* 84.48 20-10-2011Punjab Cricket Association Stadium India
India Yuvraj Singh 98* 89.09 01-08-2001Sinhalese Sports Club Ground Sri Lanka
Ireland Kevin J O'Brien 98* 94.23 10-07-2010VRA Ground Scotland
Kenya Collins O Obuya 98* 75.96 13-03-2011M.Chinnaswamy Stadium Australia
Netherlands Ryan N ten Doeschate 98* 73.68 01-09-2009VRA Ground Afghanistan
New Zealand James E C Franklin 98* 142.02 07-12-2010M.Chinnaswamy Stadium India
Pakistan Ijaz Ahmed 98* 112.64 28-10-1994Iqbal Stadium South Africa
South Africa Jacques H Kallis 98* 74.24 06-02-2000St George's Park Zimbabwe
14. 14
Which player scores the
most per ball?
The player with the highest strike
rate is an obscure South African
whose name most of us have never
heard of.
In fact, this list is filled with players
we have never heard of.
15. 15
ODI STRIKE RATES OF THE WORLD
We want to see the
prioritised performance.
That is, what is the strike
rate of the established
players?
LINK
20. 20
RESTAURANT FOUND AN UNUSUAL DIP IN SALES
A restaurant chain had data for every single
transaction made over a few years. Plotting
this as a time series showed them nothing
unusual.
However, the same data on a calendar map
reveals a very different story.
Specifically, at the bottom left point-of-sale terminal, sales dips on every
Wednesday. At the bottom right point-of-sale terminal, sales rises on
every Wednesday (almost as if to compensate for the loss.)
It turns out that the manager closes the bottom-left counter every
Wednesday afternoon due to shortage of staff, assuming that it results in
no loss of sales. There is, however, a net loss every Wednesday.
A similar visual helped a telecom company identify specific days on which their competitors’ market
share rose significantly, enabling them to negate the strategy.
Communicating data visually is the most effective way to a shared understanding
24. 24
CONSISTENT CONCLUSIONS FROM DATA
Stock market crash?
Doesn’t look so bad.. This gives the right perspective
Source: http://www.cc.gatech.edu/~stasko/7450/index.html
The same dataset can lead to very
different conclusions.
Visualizations freeze the
rendering of data, allowing a
consistent (and hopefully correct)
interpretation.
25. 25
WINNING PARTIES
In the 2004 election to Lok
Sabha there were 1,351
candidates from 6 National
parties, 801 candidates from
36 State parties, 898
candidates from officially
recognised parties and 2385
Independent candidates.
The Congress (INC) won 145
seats in the 2004 elections.
BJP won 138, coming a close
second.
The constituencies where
each party won is shown
here.
Party BJP BSP CPM INC RJD SP
26. 26
Party BJP BSP CPM INC RJD SPWINNING PARTIES
In the 2004 election to Lok
Sabha there were 1,351
candidates from 6 National
parties, 801 candidates from
36 State parties, 898
candidates from officially
recognised parties and 2385
Independent candidates.
The Congress (INC) won 145
seats in the 2004 elections.
BJP won 138, coming a close
second.
The constituencies where
each party won is shown
here.
27. WHAT SHOULD I TALK ABOUT NOW?
I’VE ALWAYS HAD A PROBLEM DETERMINING AUDIENCE INTEREST
28. We have internal
information. Getting
information from outside is
our challenge. There’s no way
of doing that.
– Senior Editor
Leading Media Company
“
32. 32
WHAT DO PEOPLE LOOKING FOR IN VISUALIZATION?
USA India
data visualization tools
data visualization software
data visualization examples
data visualization jobs
data visualization tools
data visualization techniques
data visualization examples
data visualization software
Tools &
Software
Techniques &
Examples
33. WHAT TOOLS SHOULD YOU USE?
THIS IS ONE OF THE MOST FREQUENT QUESTIONS I’M ASKED
34. 34
DATA SCIENCE TOOLS
Alteryx
Amazon EC2
Azure ML
BigQuery
Birst
Caffe
Cassandra
Cloud Compute
Cloudera
Cognos
CouchDB
D3
Decision tree
ElasticSearch
Excel
Gephi
ggplot2
Hadoop
HP Vertica
IBM Watson
Impala
Julia
Jupyter Notebook
Kafka
Kibana
Kinesis
Lambda
Logstash
MapR
MapReduce
Matplotlib
Microstrategy
MongoDB
NodeXL
Pandas
Pentaho
Pivotal
PowerPoint
Qlikview
R
R Studio
Random Forest
Redis
Redshift
Regression
Revolution R
S3
SAP Hana
SAS
Spark
Spotfire
SPSS
SQL Server
Stanford NLP
Storm
SVM
Tableau
TensorFlow
Teradata
Theano
Thrift
Torch
Weka
Word2Vec
The tool does not matter. A person’s skill with the tool does.
Pick the person. Let them pick the tool.
35. I’M FAMILIAR WITH EXCEL
I TURN TO IT AS A FIRST CHOICE FOR ALMOST EVERYTHING
39. 39
Profits Made: Over the last 6
years, you would have beaten a 10%
Inflation about 82% of the time and lost out
about 18% of the time. So, mostly, you would
have made money on Cipla with an average
return of 14.9%.
Highest Returns: An average return of 14.1%
has been observed when held for a period of one year.
with a maximum of 79.6% if sold in Dec 2009, after being
held for a year. And a maximum of 486.9% if sold at the end
of Nov 2007 after holding for a month. The highest stock price
was Rs 414 in Nov/Dec 2012.
-50% +50%returns
WHEN TO
INVEST
This visual shows the
returns from buying
Cipla’s stock on any
given month, and
selling it in another.
The color of each cell is
the return (red is low,
green is high) if you
had invested in the
stock in a given month
and sold it on another.
For example this mild
red is the slightly
negative return if you
had bought Cipla stock
in Mar 2011 (the row)
and sold it in Jun 2011
(the column).
Link
45. 45
PORTFOLIO PERFORMANCE
VISUAL
Worldwide$288.0mn
A: Accelerate$68.9mn
B: Build$77.2mn
C: Cut down$141.9mn
Worldwide:
$288 mn
The visualization shows the market
opportunities across various countries to
identify areas of focus. This chart has
been built as an interactive-app to
present the key findings, while letting
user click-through and drill-down to a
custom view across 4 different levels.
LINK
47. TOOLS DO HELP, OF COURSE
FOR SOME THINGS, YOU NEED THE RIGHT PLATFORM
48. 48
How does Mahabharata, one of the largest epics with 1.8
million words lend itself to text analytics?
Can this ‘unstructured data’ be processed to extract
analytical insights?
What does sentiment analysis of this tome convey?
Is there a better way to explore relations between
characters?
How can closeness of characters be analyzed & visualized?
VISUALISING THE MAHABHARATA
49. 49
Recruiting top quality developers is always a problem. We decided to use an
algorithmic approach and pulled out the social network of developers on
Github (a social network for open source code).
In this visualization, each circle is a person. The size of the circle
represents the number of followers. Larger circles have more
followers (but not in proportion – it’s a log scale.)
The circle’s color represents the city the
programmer’s live in. This visual is a slice showing the
tale of two cities: Bangalore and Singapore
Two people are connected if one
follows the other. This leads to a
clustering of people in the form of a
network.
Here, you can see that Bangalore and
Singapore are reasonably well
connected cities. Bangalore has more
developers, but Singapore has more
popular ones (larger circles).
However, the interaction between
Bangalore and Singapore are few and
far between. But for a few people
across both cities, like:
… etc.
Sudar, Yahoo!
Anand C, Consultant
Kiran, Hasgeek
Anand S, Gramener
Mugunth, Steinlogic
Honcheng, buUuk
Sau Sheong, HP Labs
Lim Chee Aung
Bangalore
Singapore
1 follower
100 followers
A follows B (or)
B follows A
Most followed in
Bangalore
Most followed in
Singapore
Ciju Cherian
Lin Junjie
Amudhi Sebastian
There are, of course, a number of smaller
independent circles – people who are not connected
to others in the same city. (They may be connected to
people in other cities.)
Apart from this, there are a few small networks of
connected people – often people within the same
company or start-up – who form a community of their
own.
THE SOCIAL TALE OF TWO CITIES: BANGALORE & SINGAPORE
59. 59
VIJAY KARNATAKA’S PUBLICATION ON CANDIDATE WEALTH LINK
Media
Based on candidate declarations, Karnataka 2013
Continued… Microsoft
60. 60
IMPACT OF THE BUDGET ON STOCK PRICES LINK
Financial ServicesNarrativesMediaPublic SectorFinancePlatform
61. 61
WORLD BANK: INNOVATION, TECHNOLOGY & ENTREPRENEURSHIP
Does access to new Technology facilitate Innovation? Does it
facilitate Entrepreneurship? The Global Information Technology
Report findings tell us that "innovation is increasingly based on
digital technologies and business models, which can drive economic
and social gains from ICTs...".
We were curious about whether the data on TCData360 could tell a
story about influential factors on innovation and entrepreneurship.
With over 1800 indicators, we focused on the Networked Readiness
Index, as it has indicators on entrepreneurship, technology, and
innovation.
LINK
SocietyPlatform
62. … BUT CONTENT IS KING
KEEP THE STORY AT THE FOREFRONT
63. 63
PREDICTING MARKS
EDUCATION
“
What determines a child’s marks?
Do girls score better than boys?
Does the choice of subject
matter?
Does the medium of instruction
matter?
Does community or religion
matter?
Does their birthday matter?
Does the first letter of their name
matter?
68. 68
PERFORMANCE
DRIVERS
Do girls score more than
boys, or is it the other way
around?
Gender is a known driver of
performance. Girls generally
score higher. There is
considerable variation across
subjects, however. The
differences in sciences is
minimal. But languages,
commerce and economics
give girls a significant edge.
There is also a correlation
between girls’ dropout ratio
and their over-performance
– indicating perhaps that the
smarter girls tend to stay
back in school.
Subject Girs higher by Girls Boys
Physics 0 119 119
Chemistry 1 123 122
English 4 130 126
Computers 6 137 131
Biology 6 129 123
Mathematics 11 123 112
Language 11 152 141
Accounting 12 138 126
Commerce 13 127 114
Economics 16 142 126
WHO SCORES MORE? BOYS OR
GIRLS?
69. 69
The marks shoot
up for Aug borns
… and peaks for
Sep-borns
120 marks out of
1200 explainable
by month of birth
An identical pattern was observed in 2009 and 2010…
… and across districts, gender, subjects, and class X & XII.
“It’s simply that in Canada the eligibility
cut-off for age-class hockey is January
1. A boy who turns ten on January 2,
then, could be playing alongside
someone who doesn’t turn ten until the
end of the year—and at that age, in
preadolescence, a twelve-month gap in
age represents an enormous difference
in physical maturity.”
-- Malcolm Gladwell, Outliers
SUN SIGNS
Based on the results of the
20 lakh students taking the
Class XII exams at Tamil
Nadu over the last 3 years, it
appears that the month you
were born in can make a
difference of as much as 120
marks out of 1,200.
June borns
score the lowest
70. 70
This is a dataset (1975 – 1990) that has
been around for several years, and has
been studied extensively. Yet, a
visualization can reveal patterns that
are neither obvious nor well known.
For example,
• Are birthdays uniformly distributed?
• Do doctors or parents exercise the C-section option to move dates?
• Is there any day of the month that has unusually high or low births?
• Are there any months with relatively high or low births?
Very high births in September.
But this is fairly well known. Most
conceptions happen during the
winter holiday season
Relatively few births during the
Christmas and Thanksgiving
holidays, as well as New Year and
Independence Day.
Most people prefer not
to have children on the
13th of any month, given
that it’s an unlucky day
Some special days like April
Fool’s day are avoided, but
Valentine’s Day is quite
popular
More births Fewer births … on average, for each day of the year (from 1975 to 1990)
LET’S LOOK AT 15 YEARS OF US BIRTH DATA
71. 71
THE PATTERN IN INDIA IS QUITE DIFFERENT
This is a birth date dataset that’s
obtained from school admission data
for over 10 million children. When we
compare this with births in the US, we
see none of the same patterns.
For example,
• Is there an aversion to the 13th or is there a local cultural nuance?
• Are holidays avoided for births?
• Which months have a higher propensity for births, and why?
• Are there any patterns not found in the US data?
Very few children are born in the
month of August, and thereafter.
Most births are concentrated in
the first half of the year
We see a large number of
children born on the 5th, 10th,
15th, 20th and 25th of each month
– that is, round numbered dates
Such round numbered patterns a
typical indication of fraud. Here,
birthdates are brought forward to
aid early school admission
More births Fewer births … on average, for each day of the year (from 2007 to 2013)
72. 72
THIS ADVERSELY IMPACTS CHILDREN’S MARKS
It’s a well established fact that older
children tend to do better at school in
most activities. Since many children
have had their birth dates brought
forward, these younger children suffer.
The average marks of children “born” on the 1st, 5th, 10th, 15th etc. of the
month tend to score lower marks.
• Are holidays avoided for births?
• Which months have a higher propensity for births, and why?
• Are there any patterns not found in the US data?
Higher marks Lower marks … on average, for children born on a given day of the year (from 2007 to 2013)
Children “born” on round numbered days score lower marks on average,
due to a higher proportion of younger children
74. 74
Source: Designing Data Visualizations by Noah Iliinsky and Julie Steele (O’Reilly).
Copyright 2011 Julie Steele and Noah Iliinsky, 978-1-449-31228-2.
Position is the most powerful encoding.
The eye and brain are naturally wired to detect mis-alignment of
the smallest order
1
Colour, when used in context, is powerful.
We can detect miniscule changes or variations in colour when
comparing an element with neighbouring elements. This is what
makes true colour (32-pixel colour, i.e. 4 billion) a necessity in
computer graphics
2
Size is a useful differentiator.
The eye can detect moderate size variations at
moderate distances. Size also has a natural
interpretation: that of priority.
3
Several other encodings are possible
Aesthetics such as angle, shadows, shapes, patterns,
density, labelling, enclosures, etc. can each be used to
map data.
4
VISUAL ENCODINGS VARY IN THEIR EFFECTIVENESS
75. 75
POSITION IS EVERYTHING
Absolute & relative departure time (continuous)
Absolute & relative arrival time (continuous)
Absolute & relative length of trip (continuous)
Stopovers (binary)
Absolute & relative stopover duration (continuous)
Absolute & relative stopover start & stop time
(continuous)
Sort order (ranked)
Source: http://hipmunk.com
Let’s take a small test. We’ll show a table of numbers on the screen, and ask 3 questions about those numbers. You have 30 seconds to answer these. You can just write down the answers or remember them – there’s no need to say the answer out aloud.
Your timer starts now.
What answers did you get?
How many numbers were above 100?
How many were below 10?
Which quadrant had the highest total?
[Typically, there will be a lot of variance in these answers]
So there’s considerable variation in the answers you get.
Now, let’s do the same exercise again, but with some extremely simple highlighting. It’s the same questions. You have 30 seconds. This time, you can say the answer out aloud if you like.
Your time starts now.
We were also interested in applying these rich visualisations to sports. One question we had was, for example, “Who’s the fastest one day international player?”
The trouble with that is, depending on when you measure it and how you measure it, the results could be very different. For example, if we take strike rate as a metric, it turned out (when we did it) that it was a South African who had the highest strike rate – of 200%. He played one match, hit a four, and got out the next ball.
Clearly, that’s not what we’re looking for. We could, perhaps, take a minimum number of runs as a cut-off. But the question is, what should that be? 100? 1000? 5000? Where does one draw the line, and why is that the right one? If you don’t know the domain, answering this is difficult.
Like with the contract farming example before, we need a way of looking at performance combined with scale or importance.
For the same chain, we also looked at the daily sales across restaurants. Here are a series of calendar maps showing the daily sales for four different points of sale terminals at one restaurant. Each calendar map shows a calendar for 7 months. Each day is coloured based on the value of sales on that day. Red indicates low sales, green indicates high sales.
For the two terminals at the front (i.e. the ones you see on top), sales was relatively low during the first two months, but picked up steadily thereafter. It’s easy to spot the exceptions among this. For example, the 30th and 31st of January were good days for both terminals.
Interestingly, when you look at the terminal at the bottom left, there is a red bar indicating consistent dip in sales every Wednesday. Almost as if to compensate, the terminal at the bottom right has an increase in sales every Wednesday – but not as significant as the dip.
We did not have an explanation for this, though our client did a few weeks later. It turned out that the person manning the bottom left counter takes half-day off every Wednesday, and was not being replaced by the manager. The queue naturally shifts over to the other terminal, increasing the sales. But this restaurant is in an area where there are many other food outlets. Once the queue reaches a certain size, people drop off, resulting in a net loss in sales every Wednesday – a loss that had gone unobserved for at least 7 months.
So, what we did was put a variant of this visual together. On the right, you have a series of currencies like the Australian dollar, the Euro, the British pound, etc; some commodities like silver and gold; and some stock indices like Sensex, FTSE, and S&P.
The cells here have a number inside that indicates the pairwise correlation between a pair of securities. For example, the number 68 on the top left indicates a 68% correlation between the Australian dollar and the Euro. To the left of the Euro and just below the dollar (diagonally opposite to the 68), there’s a scatter plot that shows the daily prices of both these currencies. Each dot is one day’s data. The x-axis shows the Australian dollar value. The y-axis shows the Euro value. This helps identify what the pattern of movements of any two currencies is. From this, you can easily see visually that the Australian dollar and the Euro both tend to move together. Or, where there are strong correlations like the FTSE & S&P, the pattern is almost a straight line.
In some cases there are negative correlations. For instance, if you take the Sensex against the Japanese Yen, the correlation is -79%. The cells are coloured based on their correlation values. Greens indicate strong positive correlation. Reds indicate strong negative correlation.
These are also grouped hierarchically. On the left, we have a series of lines indicating clusters. The most similar securities are grouped together. So FTSE and S&P with a 98% correlation are very close. The ones that are less correlated are kept further away based on a tree-structure.
This leads to clustering of securities. For example, there is a green block in the center which has SGD, JPY, XAU, CHF and CNY. All of these are fairly well correlated. When any one currency in this block goes up, all the others go up as well. When any one goes down, all others go down as well.
Similarly, you have another block to its top left: S&P, FTSE, Sensex and to a certain extent, the Pakistani Rupee. These move together as a block as well.
But when this block goes up, all the currencies in the other block go down, as indicated by the red negative correlations between these two blocks.
This can be used very easily for decision making. For example, one client who was trading with Singapore and Japan looked at the strong correlation and decided to consolidate their holdings in Japanese Yen. They then moved up and down this column to find a good hedge. FTSE looked like a good hedge – it was the most negatively correlated with JPY at that time -- and they decided to place a third of their portfolio in FTSE.
A sheet like this improves people’s understanding of relatively complex data, and results in significantly increased trade volumes.
This example illustrates the point. Charts 2 and 3 show a much better representation as opposed to the first one.
Amongst the key attributes that the brain instantly recognizes, the chief one is position. The proximity of items, distance of separation and layout is instantly processed and relationships assigned by the brain accordingly. This must be leveraged for an effective visualization, as shown in the example here.
Colour is amongst the most powerful, but oft mis-used set of attributes. A basic principle of colours is that the brain does not naturally order them i.e. blue is not intuitively ranked higher than, say green or red. This is the prime reason why this chart is unreadable.
However, different shades of colour can be better ranked. As shown in the example here, darker shades can be intuitively mapped to higher altitude regions and its easy to spot patterns.
This table shows the complete set of encodings ranked in the same order as the ease of processing by the brain. Their applicability across the different data properties of Quantitative, Ordinal, Categorical and Relational is shown.