SlideShare a Scribd company logo
Data Visualization tools &
techniques
K Sravan Kumar
Outline
 Different visualizations
 How to draw in R
 How to draw in MS Excel
3 Stages of Understanding
Perceiving Interpreting Comprehending
What does it show ?
Where is big, medium, small ?
How do things compare?
What relationships exist?
What does it mean?
What is good and bad?
Is it meaningful or insignificant?
Unusual or expected?
What does it mean to me?
What are the main messages?
What have I learnt?
Any actions to take?
3 Principles of Good Visualization design
Principle 1
Good data visualization
is TRUSTWORTHY
Principle 2
Good data visualization
is ACCESSIBLE
Principle 3
Good data visualization is
ELEGANT
Visualization Workflow
 Formulating brief
 Working with data
 Establishing editorial thinking
 Developing design solution
Hidden
Thinking stages
Production Cycle
Formulating brief
 Curiosity: Why are we doing it ?
 Personal Intrigue : ‘I wonder what…..’
 Stakeholder Intrigue : ‘He/She needs to know …..;
 Audience Intrigue : ‘They need to know ……..’
 Anticipated Intrigue : ‘They might be interested in knowing …’
 Potential Intrigue : ‘They should be interested in knowing …’
Purpose Map
EXPLANATORY EXHIBITORY EXPLORATORY
FEELINGREADING
sequence | drama annotate | describe display manipulate | interrogate participate | contribute
emotive|drama|big-pictureutilitarian|efficient|precision
Working with data
 Types of data
 Textual(Qualitative)
 Nominal (Qualitative)
 Ordinal (Qualitative)
 Interval (quantitative)
 Ratio (quantitative)
Working with data : steps
 Acquire
 Examine
 Transform
 Explore
Exploratory data analysis
 Addressing of unknowns and substantiating knowns.
The things we are
aware of knowing
Beware complacency
The things we are
aware of not knowing
Deductive reasoning
The things we are
unaware of knowing
Acquire and review
The things we are
unaware of not
knowing
Inductive reasoning
KNOWN UNKNOWN
KNOWNUNKNOWN ACQUIRED
AWARENESS
Reasoning
 Deductive reasoning
Hypothesis framed by subject knowledge, interrogate the
data to find evidence of relevance or interest in concluding
the finding. (Sherlock Holmes)
 Inductive reasoning
Play around with data, based on sense or instinct and wait
and see what emerges.
Establishing editorial thinking
 Angle
 Relevant views to the potential interest of audience
 Sufficient to cover all relevant views
 Framing
 Apply filters to determine inclusion and exclusion criteria.
 Provide access to most salient content but also avoid
any distortion of data
 Focus
 Features of display to draw particular attention
 Organize visibility and hierarchy
Developing design solution
 Steps of production cycle:
 Conceiving ideas across 5 layers of visual design
 Wireframing & storyboarding designs
Create low fidelity illustration and weave the illustrations to create sequenced view
 Developing prototypes
Develop first working version/ blueprints
 Testing
Test ,evaluate and collect feedback on trustworthiness, accessibility and elegancy.
 Refining & completing
Incorporate feedback, correct and double check.
 Launching the solution
5 layers of visual design
 Data representation
 Interactivity
 Annotation
 Color
 Composition
Chart Types
 Categorical
Comparing categories and distributions of data
 Hierarchical
Charting part to whole relationships and hierarchies
 Relational
Graphing relationships to explore correlations and
connections
 Temporal
Showing trends and activities over time
 Spatial
Mapping spatial patterns through overlays and distortions
Bar Chart
R Code:-
library(MASS)
school = painters$School
school.freq = table(school)
barplot(school.freq)
title("School wise number of painters")
Tips & Tricks
• Quantitative axis should start
always from 0
• Make the categorical sorting
meaningful (X-axis).
• If you have axis labels, don’t
label each bar with values.
• Used for comparing C H R T S
Clustered Bar Chart
R Code:-
counts <- table(mtcars$cyl, mtcars$gear)
barplot(counts, main="Car Distribution by Gears
and Cylinders", xlab = "Number of Gears", col =
c("grey","lightblue","orange") , legend =
rownames(counts), beside=TRUE)
C H R T S
Tips & Tricks
• Quantitative axis should start
always from 0
• Make the categorical sorting
meaningful (X-axis).
• If you have axis labels, don’t
label each bar with values.
• Used for comparing within and
across clusters
Dot Plot
R Code:-
tt <- read.csv("test.csv")
ggplot(data = tt, aes(x=Percentage, y=Country,
color = Gender)) + geom_point(aes(size = Count))
+ xlim(0,100)
Tips & Tricks
• Quantitative axis can start from 0.
Otherwise label axis values clearly
• Make the categorical sorting
meaningful (Y-axis).
• Position of the point indicates
quantitative value of each category
• Size of the data can also be used to
indicate quantitative value.
C H R T S
Connected Dot Plot (barbell/dumb-bell
chart)
C H R T S
R Code:-
tt <- read.csv("test.csv")
ggplot(data = tt, aes(x=Year2000, xend=Year2012,
y=Country, group=Country)) + geom_dumbbell(
color="orange", size=0.75, point.colour.l = "#0e668b“ )
+ xlim(0,1000000) +labs(x=NULL, y=NULL, title
="OECD 2000 vs 2012")
Tips & Tricks
• Quantitative axis can start from 0.
Otherwise label axis values clearly
• Make the categorical sorting meaningful
(Y-axis).
• Position of the point indicates quantitative
value of each category
• Size of the data can also be used to
indicate quantitative value.
Pictogram
R Code:-
man<-readPNG("man.png")
pictogram(icon=man, n=c(12,35,52),
grouplabels=c("dudes","chaps","lads"))
Tips & Tricks
• Quantitative axis can start from 0.
Otherwise label axis values clearly
• Make the categorical sorting meaningful
(Y-axis).
• Position of the point indicates quantitative
value of each category
• Size of the data can also be used to
indicate quantitative value.
Bubble chart
C H R T S
R Code:-
g <- ggplot(dt, aes(x= xlab, y = alphabet)) + labs(title
="State wise public spending") + geom_jitter
(aes(col=alphabet, size=FY.11)) + geom_text
(aes(label=State), size=3) + guides(colour=FALSE,
size = FALSE, x = FALSE, y = FALSE) +
theme(axis.title.x=element_blank(),axis.text.x=element
_blank(),axis.ticks.x=element_blank(),axis.title.y=elem
ent_blank(),axis.text.y=element_blank(),axis.ticks.y=el
ement_blank()) + scale_size_continuous(range = c(0,
50)) Tips & Tricks
• Interactive features can be added
• Colors can be used to make quantitative
sizes more distinguishable
Polar Chart
R Code:-
plot <- ggplot(DF, aes(variable, value, fill = variable)) + geom_bar(width
= 1, stat = "identity", color = "white") + scale_y_continuous(breaks =
0:10) + coord_polar()
plot
Tips & Tricks
• Filled with colors with a degree of
transparency to allow background to be
partially visible
• Grid lines are relevant if there are
common scales across quantitative
variables
C H R T S

More Related Content

What's hot

Missing data handling
Missing data handlingMissing data handling
Missing data handling
QuantUniversity
 
Data preprocessing in Data Mining
Data preprocessing  in Data MiningData preprocessing  in Data Mining
Data preprocessing in Data Mining
Samad Baseer Khan
 
Machine learning session4(linear regression)
Machine learning   session4(linear regression)Machine learning   session4(linear regression)
Machine learning session4(linear regression)
Abhimanyu Dwivedi
 
Classification Based Machine Learning Algorithms
Classification Based Machine Learning AlgorithmsClassification Based Machine Learning Algorithms
Classification Based Machine Learning Algorithms
Md. Main Uddin Rony
 
Data analytics vs. Data analysis
Data analytics vs. Data analysisData analytics vs. Data analysis
Data analytics vs. Data analysis
Dr. C.V. Suresh Babu
 
Introduction to Data Science and Analytics
Introduction to Data Science and AnalyticsIntroduction to Data Science and Analytics
Introduction to Data Science and Analytics
Srinath Perera
 
Data Visualisation.pdf
Data Visualisation.pdfData Visualisation.pdf
Data Visualisation.pdf
Thiyagu K
 
Data Types and Structures in R
Data Types and Structures in RData Types and Structures in R
Data Types and Structures in R
Rupak Roy
 
Data Visualization Techniques
Data Visualization TechniquesData Visualization Techniques
Data Visualization TechniquesAllAnalytics
 
Dimensionality Reduction
Dimensionality ReductionDimensionality Reduction
Dimensionality Reduction
mrizwan969
 
01 Data Mining: Concepts and Techniques, 2nd ed.
01 Data Mining: Concepts and Techniques, 2nd ed.01 Data Mining: Concepts and Techniques, 2nd ed.
01 Data Mining: Concepts and Techniques, 2nd ed.
Institute of Technology Telkom
 
Data preprocessing in Data Mining
Data preprocessing in Data MiningData preprocessing in Data Mining
Data preprocessing in Data Mining
DHIVYADEVAKI
 
Data Visualization
Data VisualizationData Visualization
Data Visualization
javaidsameer123
 
Data preprocessing PPT
Data preprocessing PPTData preprocessing PPT
Data preprocessing PPT
ANUSUYA T K
 
Model selection and cross validation techniques
Model selection and cross validation techniquesModel selection and cross validation techniques
Model selection and cross validation techniques
Venkata Reddy Konasani
 
Logistic Regression | Logistic Regression In Python | Machine Learning Algori...
Logistic Regression | Logistic Regression In Python | Machine Learning Algori...Logistic Regression | Logistic Regression In Python | Machine Learning Algori...
Logistic Regression | Logistic Regression In Python | Machine Learning Algori...
Simplilearn
 
Data Science - Part VII - Cluster Analysis
Data Science - Part VII -  Cluster AnalysisData Science - Part VII -  Cluster Analysis
Data Science - Part VII - Cluster Analysis
Derek Kane
 
Data Visualization - A Brief Overview
Data Visualization - A Brief OverviewData Visualization - A Brief Overview
Data Visualization - A Brief Overview
Rotary Club of North Raleigh
 

What's hot (20)

Missing data handling
Missing data handlingMissing data handling
Missing data handling
 
Data preprocessing in Data Mining
Data preprocessing  in Data MiningData preprocessing  in Data Mining
Data preprocessing in Data Mining
 
An introduction to R
An introduction to RAn introduction to R
An introduction to R
 
Machine learning session4(linear regression)
Machine learning   session4(linear regression)Machine learning   session4(linear regression)
Machine learning session4(linear regression)
 
Classification Based Machine Learning Algorithms
Classification Based Machine Learning AlgorithmsClassification Based Machine Learning Algorithms
Classification Based Machine Learning Algorithms
 
Data analytics vs. Data analysis
Data analytics vs. Data analysisData analytics vs. Data analysis
Data analytics vs. Data analysis
 
Introduction to Data Science and Analytics
Introduction to Data Science and AnalyticsIntroduction to Data Science and Analytics
Introduction to Data Science and Analytics
 
Data Visualisation.pdf
Data Visualisation.pdfData Visualisation.pdf
Data Visualisation.pdf
 
Data Types and Structures in R
Data Types and Structures in RData Types and Structures in R
Data Types and Structures in R
 
Data Visualization Techniques
Data Visualization TechniquesData Visualization Techniques
Data Visualization Techniques
 
Dimensionality Reduction
Dimensionality ReductionDimensionality Reduction
Dimensionality Reduction
 
01 Data Mining: Concepts and Techniques, 2nd ed.
01 Data Mining: Concepts and Techniques, 2nd ed.01 Data Mining: Concepts and Techniques, 2nd ed.
01 Data Mining: Concepts and Techniques, 2nd ed.
 
Data preprocessing in Data Mining
Data preprocessing in Data MiningData preprocessing in Data Mining
Data preprocessing in Data Mining
 
Data Visualization
Data VisualizationData Visualization
Data Visualization
 
Data preprocessing PPT
Data preprocessing PPTData preprocessing PPT
Data preprocessing PPT
 
Model selection and cross validation techniques
Model selection and cross validation techniquesModel selection and cross validation techniques
Model selection and cross validation techniques
 
Logistic Regression | Logistic Regression In Python | Machine Learning Algori...
Logistic Regression | Logistic Regression In Python | Machine Learning Algori...Logistic Regression | Logistic Regression In Python | Machine Learning Algori...
Logistic Regression | Logistic Regression In Python | Machine Learning Algori...
 
Data Science - Part VII - Cluster Analysis
Data Science - Part VII -  Cluster AnalysisData Science - Part VII -  Cluster Analysis
Data Science - Part VII - Cluster Analysis
 
OLAP
OLAPOLAP
OLAP
 
Data Visualization - A Brief Overview
Data Visualization - A Brief OverviewData Visualization - A Brief Overview
Data Visualization - A Brief Overview
 

Similar to Data visualization tools & techniques - 1

PowerPoint Presentation On giving effective PowerPoint Presentations
PowerPoint Presentation On giving effective PowerPoint PresentationsPowerPoint Presentation On giving effective PowerPoint Presentations
PowerPoint Presentation On giving effective PowerPoint Presentations
Amanda Gilmore
 
Data Visualisation: A Game of Decisions
Data Visualisation: A Game of DecisionsData Visualisation: A Game of Decisions
Data Visualisation: A Game of Decisions
Andy Kirk
 
Design for Delight
Design for DelightDesign for Delight
Design for Delight
Amanda Makulec
 
Designing Data Visualizations to Strengthen Health Systems
Designing Data Visualizations to Strengthen Health SystemsDesigning Data Visualizations to Strengthen Health Systems
Designing Data Visualizations to Strengthen Health Systems
Amanda Makulec
 
Data Visualization Design Best Practices Workshop
Data Visualization Design Best Practices WorkshopData Visualization Design Best Practices Workshop
Data Visualization Design Best Practices Workshop
Amanda Makulec
 
Data Visualization Design Best Practices Workshop
Data Visualization Design Best Practices WorkshopData Visualization Design Best Practices Workshop
Data Visualization Design Best Practices Workshop
JSI
 
Data Visualization in Data Science
Data Visualization in Data ScienceData Visualization in Data Science
Data Visualization in Data Science
Maloy Manna, PMP®
 
Creating Functional Art in Excel
Creating Functional Art in ExcelCreating Functional Art in Excel
Creating Functional Art in Excel
Amanda Makulec
 
Design and Data Processes  Unified -  3rd Corner View
Design and Data Processes  Unified -  3rd Corner ViewDesign and Data Processes  Unified -  3rd Corner View
Design and Data Processes  Unified -  3rd Corner View
Julian Jordan
 
Data Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentationData Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentation
Data Con LA
 
Machine learning 101
Machine learning 101Machine learning 101
Machine learning 101
AmmarChalifah
 
visualize data d83ae67f-6375-401f-b95a-f29eae37dd32.pdf
visualize data d83ae67f-6375-401f-b95a-f29eae37dd32.pdfvisualize data d83ae67f-6375-401f-b95a-f29eae37dd32.pdf
visualize data d83ae67f-6375-401f-b95a-f29eae37dd32.pdf
dimce3
 
LR2. Summary Day 2
LR2. Summary Day 2LR2. Summary Day 2
LR2. Summary Day 2
Machine Learning Valencia
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2
Roger Barga
 
Effective Business Presentations with Storyboarding and Data Visualization
Effective Business Presentations with Storyboarding and Data VisualizationEffective Business Presentations with Storyboarding and Data Visualization
Effective Business Presentations with Storyboarding and Data Visualization
Carmen Proctor
 
Introduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regressionIntroduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regression
Girish Gore
 
Echelon Asia Summit 2017 Startup Academy Workshop
Echelon Asia Summit 2017 Startup Academy WorkshopEchelon Asia Summit 2017 Startup Academy Workshop
Echelon Asia Summit 2017 Startup Academy Workshop
Garrett Teoh Hor Keong
 
How to Improve Your Data Science CV
How to Improve Your Data Science CVHow to Improve Your Data Science CV
How to Improve Your Data Science CV
Kaitlin McAndrews
 
DATA VISUALIZATION
DATA VISUALIZATIONDATA VISUALIZATION
DATA VISUALIZATION
Aabhika Samantaray
 
Assessment In Spreadsheets
Assessment In SpreadsheetsAssessment In Spreadsheets
Assessment In Spreadsheetsguest46de76
 

Similar to Data visualization tools & techniques - 1 (20)

PowerPoint Presentation On giving effective PowerPoint Presentations
PowerPoint Presentation On giving effective PowerPoint PresentationsPowerPoint Presentation On giving effective PowerPoint Presentations
PowerPoint Presentation On giving effective PowerPoint Presentations
 
Data Visualisation: A Game of Decisions
Data Visualisation: A Game of DecisionsData Visualisation: A Game of Decisions
Data Visualisation: A Game of Decisions
 
Design for Delight
Design for DelightDesign for Delight
Design for Delight
 
Designing Data Visualizations to Strengthen Health Systems
Designing Data Visualizations to Strengthen Health SystemsDesigning Data Visualizations to Strengthen Health Systems
Designing Data Visualizations to Strengthen Health Systems
 
Data Visualization Design Best Practices Workshop
Data Visualization Design Best Practices WorkshopData Visualization Design Best Practices Workshop
Data Visualization Design Best Practices Workshop
 
Data Visualization Design Best Practices Workshop
Data Visualization Design Best Practices WorkshopData Visualization Design Best Practices Workshop
Data Visualization Design Best Practices Workshop
 
Data Visualization in Data Science
Data Visualization in Data ScienceData Visualization in Data Science
Data Visualization in Data Science
 
Creating Functional Art in Excel
Creating Functional Art in ExcelCreating Functional Art in Excel
Creating Functional Art in Excel
 
Design and Data Processes  Unified -  3rd Corner View
Design and Data Processes  Unified -  3rd Corner ViewDesign and Data Processes  Unified -  3rd Corner View
Design and Data Processes  Unified -  3rd Corner View
 
Data Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentationData Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentation
 
Machine learning 101
Machine learning 101Machine learning 101
Machine learning 101
 
visualize data d83ae67f-6375-401f-b95a-f29eae37dd32.pdf
visualize data d83ae67f-6375-401f-b95a-f29eae37dd32.pdfvisualize data d83ae67f-6375-401f-b95a-f29eae37dd32.pdf
visualize data d83ae67f-6375-401f-b95a-f29eae37dd32.pdf
 
LR2. Summary Day 2
LR2. Summary Day 2LR2. Summary Day 2
LR2. Summary Day 2
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2
 
Effective Business Presentations with Storyboarding and Data Visualization
Effective Business Presentations with Storyboarding and Data VisualizationEffective Business Presentations with Storyboarding and Data Visualization
Effective Business Presentations with Storyboarding and Data Visualization
 
Introduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regressionIntroduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regression
 
Echelon Asia Summit 2017 Startup Academy Workshop
Echelon Asia Summit 2017 Startup Academy WorkshopEchelon Asia Summit 2017 Startup Academy Workshop
Echelon Asia Summit 2017 Startup Academy Workshop
 
How to Improve Your Data Science CV
How to Improve Your Data Science CVHow to Improve Your Data Science CV
How to Improve Your Data Science CV
 
DATA VISUALIZATION
DATA VISUALIZATIONDATA VISUALIZATION
DATA VISUALIZATION
 
Assessment In Spreadsheets
Assessment In SpreadsheetsAssessment In Spreadsheets
Assessment In Spreadsheets
 

More from Korivi Sravan Kumar

Notes.pptx
Notes.pptxNotes.pptx
No bill is available.docx
No bill is available.docxNo bill is available.docx
No bill is available.docx
Korivi Sravan Kumar
 
Web trafic time series forecasting
Web trafic time series forecastingWeb trafic time series forecasting
Web trafic time series forecasting
Korivi Sravan Kumar
 
Study on Zara International Strategy
Study on Zara International StrategyStudy on Zara International Strategy
Study on Zara International Strategy
Korivi Sravan Kumar
 
RBL Bank Strategy analysis and formulation
RBL Bank Strategy analysis and formulationRBL Bank Strategy analysis and formulation
RBL Bank Strategy analysis and formulation
Korivi Sravan Kumar
 
P&G Strategic Restructuring of Global Business Service
P&G Strategic Restructuring of Global Business ServiceP&G Strategic Restructuring of Global Business Service
P&G Strategic Restructuring of Global Business Service
Korivi Sravan Kumar
 

More from Korivi Sravan Kumar (6)

Notes.pptx
Notes.pptxNotes.pptx
Notes.pptx
 
No bill is available.docx
No bill is available.docxNo bill is available.docx
No bill is available.docx
 
Web trafic time series forecasting
Web trafic time series forecastingWeb trafic time series forecasting
Web trafic time series forecasting
 
Study on Zara International Strategy
Study on Zara International StrategyStudy on Zara International Strategy
Study on Zara International Strategy
 
RBL Bank Strategy analysis and formulation
RBL Bank Strategy analysis and formulationRBL Bank Strategy analysis and formulation
RBL Bank Strategy analysis and formulation
 
P&G Strategic Restructuring of Global Business Service
P&G Strategic Restructuring of Global Business ServiceP&G Strategic Restructuring of Global Business Service
P&G Strategic Restructuring of Global Business Service
 

Recently uploaded

standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
Tiktokethiodaily
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
NABLAS株式会社
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 

Recently uploaded (20)

standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 

Data visualization tools & techniques - 1

  • 1. Data Visualization tools & techniques K Sravan Kumar
  • 2. Outline  Different visualizations  How to draw in R  How to draw in MS Excel
  • 3. 3 Stages of Understanding Perceiving Interpreting Comprehending What does it show ? Where is big, medium, small ? How do things compare? What relationships exist? What does it mean? What is good and bad? Is it meaningful or insignificant? Unusual or expected? What does it mean to me? What are the main messages? What have I learnt? Any actions to take?
  • 4. 3 Principles of Good Visualization design Principle 1 Good data visualization is TRUSTWORTHY Principle 2 Good data visualization is ACCESSIBLE Principle 3 Good data visualization is ELEGANT
  • 5. Visualization Workflow  Formulating brief  Working with data  Establishing editorial thinking  Developing design solution Hidden Thinking stages Production Cycle
  • 6. Formulating brief  Curiosity: Why are we doing it ?  Personal Intrigue : ‘I wonder what…..’  Stakeholder Intrigue : ‘He/She needs to know …..;  Audience Intrigue : ‘They need to know ……..’  Anticipated Intrigue : ‘They might be interested in knowing …’  Potential Intrigue : ‘They should be interested in knowing …’
  • 7. Purpose Map EXPLANATORY EXHIBITORY EXPLORATORY FEELINGREADING sequence | drama annotate | describe display manipulate | interrogate participate | contribute emotive|drama|big-pictureutilitarian|efficient|precision
  • 8. Working with data  Types of data  Textual(Qualitative)  Nominal (Qualitative)  Ordinal (Qualitative)  Interval (quantitative)  Ratio (quantitative)
  • 9. Working with data : steps  Acquire  Examine  Transform  Explore
  • 10. Exploratory data analysis  Addressing of unknowns and substantiating knowns. The things we are aware of knowing Beware complacency The things we are aware of not knowing Deductive reasoning The things we are unaware of knowing Acquire and review The things we are unaware of not knowing Inductive reasoning KNOWN UNKNOWN KNOWNUNKNOWN ACQUIRED AWARENESS
  • 11. Reasoning  Deductive reasoning Hypothesis framed by subject knowledge, interrogate the data to find evidence of relevance or interest in concluding the finding. (Sherlock Holmes)  Inductive reasoning Play around with data, based on sense or instinct and wait and see what emerges.
  • 12. Establishing editorial thinking  Angle  Relevant views to the potential interest of audience  Sufficient to cover all relevant views  Framing  Apply filters to determine inclusion and exclusion criteria.  Provide access to most salient content but also avoid any distortion of data  Focus  Features of display to draw particular attention  Organize visibility and hierarchy
  • 13. Developing design solution  Steps of production cycle:  Conceiving ideas across 5 layers of visual design  Wireframing & storyboarding designs Create low fidelity illustration and weave the illustrations to create sequenced view  Developing prototypes Develop first working version/ blueprints  Testing Test ,evaluate and collect feedback on trustworthiness, accessibility and elegancy.  Refining & completing Incorporate feedback, correct and double check.  Launching the solution
  • 14. 5 layers of visual design  Data representation  Interactivity  Annotation  Color  Composition
  • 15. Chart Types  Categorical Comparing categories and distributions of data  Hierarchical Charting part to whole relationships and hierarchies  Relational Graphing relationships to explore correlations and connections  Temporal Showing trends and activities over time  Spatial Mapping spatial patterns through overlays and distortions
  • 16. Bar Chart R Code:- library(MASS) school = painters$School school.freq = table(school) barplot(school.freq) title("School wise number of painters") Tips & Tricks • Quantitative axis should start always from 0 • Make the categorical sorting meaningful (X-axis). • If you have axis labels, don’t label each bar with values. • Used for comparing C H R T S
  • 17. Clustered Bar Chart R Code:- counts <- table(mtcars$cyl, mtcars$gear) barplot(counts, main="Car Distribution by Gears and Cylinders", xlab = "Number of Gears", col = c("grey","lightblue","orange") , legend = rownames(counts), beside=TRUE) C H R T S Tips & Tricks • Quantitative axis should start always from 0 • Make the categorical sorting meaningful (X-axis). • If you have axis labels, don’t label each bar with values. • Used for comparing within and across clusters
  • 18. Dot Plot R Code:- tt <- read.csv("test.csv") ggplot(data = tt, aes(x=Percentage, y=Country, color = Gender)) + geom_point(aes(size = Count)) + xlim(0,100) Tips & Tricks • Quantitative axis can start from 0. Otherwise label axis values clearly • Make the categorical sorting meaningful (Y-axis). • Position of the point indicates quantitative value of each category • Size of the data can also be used to indicate quantitative value. C H R T S
  • 19. Connected Dot Plot (barbell/dumb-bell chart) C H R T S R Code:- tt <- read.csv("test.csv") ggplot(data = tt, aes(x=Year2000, xend=Year2012, y=Country, group=Country)) + geom_dumbbell( color="orange", size=0.75, point.colour.l = "#0e668b“ ) + xlim(0,1000000) +labs(x=NULL, y=NULL, title ="OECD 2000 vs 2012") Tips & Tricks • Quantitative axis can start from 0. Otherwise label axis values clearly • Make the categorical sorting meaningful (Y-axis). • Position of the point indicates quantitative value of each category • Size of the data can also be used to indicate quantitative value.
  • 20. Pictogram R Code:- man<-readPNG("man.png") pictogram(icon=man, n=c(12,35,52), grouplabels=c("dudes","chaps","lads")) Tips & Tricks • Quantitative axis can start from 0. Otherwise label axis values clearly • Make the categorical sorting meaningful (Y-axis). • Position of the point indicates quantitative value of each category • Size of the data can also be used to indicate quantitative value.
  • 21. Bubble chart C H R T S R Code:- g <- ggplot(dt, aes(x= xlab, y = alphabet)) + labs(title ="State wise public spending") + geom_jitter (aes(col=alphabet, size=FY.11)) + geom_text (aes(label=State), size=3) + guides(colour=FALSE, size = FALSE, x = FALSE, y = FALSE) + theme(axis.title.x=element_blank(),axis.text.x=element _blank(),axis.ticks.x=element_blank(),axis.title.y=elem ent_blank(),axis.text.y=element_blank(),axis.ticks.y=el ement_blank()) + scale_size_continuous(range = c(0, 50)) Tips & Tricks • Interactive features can be added • Colors can be used to make quantitative sizes more distinguishable
  • 22. Polar Chart R Code:- plot <- ggplot(DF, aes(variable, value, fill = variable)) + geom_bar(width = 1, stat = "identity", color = "white") + scale_y_continuous(breaks = 0:10) + coord_polar() plot Tips & Tricks • Filled with colors with a degree of transparency to allow background to be partially visible • Grid lines are relevant if there are common scales across quantitative variables C H R T S