SlideShare a Scribd company logo
Data Visualization
How you see it…..
Baijayanti Chakraborty
About Me:-
I am Baijayanti Chakraborty, a Post Graduate student from Great Lakes Institute
of Management. I am doing PG in Business Analytics and Business Intelligence.
You can find me on:
1.LinkedIN: https://www.linkedin.com/in/baijayanti-chakraborty/
2.Twitter: twitter.com/baijayantic
3.Mail: baijayantichakraborty96@gmail.com
4.Github: https://github.com/baijayantichakraborty
5.Kaggel: https://www.kaggle.com/baijayanti94
Today’s Spot Of Interest
❖ What is visualization and why do we need it !!!!
❖ Basic Visualizations
❖ Advanced Visualizations
What is visualization and why do we need it !!!
Data visualization is an art of how to turn numbers into useful knowledge. We all know that when we see images its easy to
understand than when reading a lot of information.
Let’s consider the below example: Over here is a snip from the IRIS dataset which is already present in R. It’s quite difficult
to comprehend anything from the huge lot of data and hence to make it easy for understanding we will be using visualization
techniques.
Th
Some famous tools for data visualization ……….
Selecting the right kindof chart !!!
There are four basic presentation types:
1. Comparison
2. Composition
3. Distribution
4. Relationship
To determine which amongst these types is best suited for your
data at hand we should be able to answer the below questions :-
● How many variables do you want to show in a single
chart?
● How many data points will you display for each variable?
● Will you display values over a period of time, or among
items or groups?
Basic Visualization
(Without using ggplot2 library)
Histogram
Histogram is basically a plot that breaks the data into bins (or breaks) and shows frequency distribution of these bins.
Bar/Line Chart
● Bar charts are recommended when you want to plot a categorical variable or a combination of continuous and categorical
variable.
● Line Charts are commonly preferred when we are to analyse a trend spread over a time period.
Bar Chart/Line Chart …..
Boxplots
Box Plots are used to plot a combination of categorical and continuous variables. This plot is useful for visualizing the spread of the
data and detect outliers. It shows five statistically significant numbers- the minimum, the 25th percentile, the median, the 75th
percentile and the maximum.Example for boxplot creation using the below code :
The parts of Boxplot
Boxplot analysis for the Iris dataset
Scatter Plot
Scatter Plot is used to see the relationship between two continuous variables.
Scatter plot can help to visualise multiple variables and their relations as well.
Advanced Visualizations…...
Some advanced packages of visualisation in R are :-
● Lattice Graphs :- Lattice package is essentially an improvement upon the R Graphics package and is used to
visualize multivariate data. Some kinds of visualisations with lattice package are :-
1.Kernal Density Plots
2.Scatterplot matrices :
● ggplot2 :- this package is one of the most widely used visualisation packages in R. It enables the users to create
sophisticated visualisations with little code using the Grammar of Graphics.
● Plotly is an R package that creates interactive web-based graphs via the open source JavaScript graphing library
plotly.js. It can easily translate the ‘ggplot2’ graphs to web-based versions also.
Adavanced Scatter Plots
Besides the basic version of scatterplots we can also create them using the “ggplot2” library.
The below codes give a taste of the same.
Advanced scatter plots contd...
HeatMaps
Heat Map uses intensity (density) of colors to display relationship between two or three or many variables
in a two dimensional image. It allows us to explore two dimensions as the axis and the third dimension by
intensity of color.
The colour of the bars in the heat map is
dependent on the cyl parameter of the dataset.
The dataset used here is mtcars. It’s an inbuilt
dataset.
HeatMaps contd….
Using the library “plotly”, the heatmaps can be made interactive in nature. The below code gives
an insight as to how we can use plotly.
Correlogram
Correlogram is used to test the level of correlation among the variable available in the data
set. The cells of the matrix can be shaded or colored to show the correlation value.
Correlogram contd...
It is possible to use “ggplot2” aesthetics on the chart, for instance to color each category. We can use a new library “GGally”
and see how different variations are made to the simple correlogram.
Correlogram contd….
Change the type of plot used on each part of the correlogram. This is done with the upper and lower argument.
Area Chart
Area chart is used to show continuity across a variable or data set. It is very much same as line chart and is commonly used
for time series plots. Alternatively, it is also used to plot continuous variables and analyze the underlying trends.
3D Plots
● To create a 3D plot using R can be done
with the help of scatterplot3d package.
● scaterplot3d is very simple to use and it
can be easily extended by adding
supplementary points or regression
planes into an already generated
graphics.
Another variation of 3D plots
Quick Information
For quick references you can easily check the cheatsheet side of Rstudio:
https://rstudio.com/resources/cheatsheets/
References :-
1. https://rstudio.com/resources/cheatsheets/
2. https://www.slant.co/topics/2354/~best-data-visualization-tools-for-massive-datasets
3. https://policyviz.com/product/core-principles-of-data-visualization-cheatsheet/
4. https://eazybi.com/blog/data_visualization_and_chart_types/
5. https://www.r-graph-gallery.com/199-correlation-matrix-with-ggally.html
6. https://towardsdatascience.com/a-guide-to-data-visualisation-in-r-for-beginners-ef6d41a34174?#0689
Happy Learning!!!

More Related Content

What's hot

Design of suitable Magnitude Comparator Architecture for Big Data Analytics
Design of suitable Magnitude Comparator Architecture for Big Data AnalyticsDesign of suitable Magnitude Comparator Architecture for Big Data Analytics
Design of suitable Magnitude Comparator Architecture for Big Data Analytics
rahulmonikasharma
 
Application of Matrices in real life
Application of Matrices in real lifeApplication of Matrices in real life
Application of Matrices in real life
Shayshab Azad
 
Matrix and it's Application
Matrix and it's ApplicationMatrix and it's Application
Matrix and it's Application
Mahmudle Hassan
 
Statistics and Probability - all in one
Statistics and Probability - all in oneStatistics and Probability - all in one
Statistics and Probability - all in one
Marjuk Ahmed Siddiki
 
Closeness through-microaggregation-strict-privacy-with-enhanced-utility-prese...
Closeness through-microaggregation-strict-privacy-with-enhanced-utility-prese...Closeness through-microaggregation-strict-privacy-with-enhanced-utility-prese...
Closeness through-microaggregation-strict-privacy-with-enhanced-utility-prese...
Pvrtechnologies Nellore
 
Use of matrix in daily life
Use of matrix in daily lifeUse of matrix in daily life
Use of matrix in daily life
sadia Afrose
 
Applications of Matrices in Engineering
Applications of Matrices in EngineeringApplications of Matrices in Engineering
Applications of Matrices in Engineering
AliHasan358
 
Applications of matrices in Real\Daily life
Applications of matrices in Real\Daily lifeApplications of matrices in Real\Daily life
Applications of matrices in Real\Daily life
Sami Ullah
 
Application of Matrix
Application of MatrixApplication of Matrix
Application of Matrix
Rahman Hillol
 
Exploring optimizations for dynamic PageRank algorithm based on GPU : V4
Exploring optimizations for dynamic PageRank algorithm based on GPU : V4Exploring optimizations for dynamic PageRank algorithm based on GPU : V4
Exploring optimizations for dynamic PageRank algorithm based on GPU : V4
Subhajit Sahu
 
105575916 maths-edit-new
105575916 maths-edit-new105575916 maths-edit-new
105575916 maths-edit-new
homeworkping7
 
Graph Tea: Simulating Tool for Graph Theory & Algorithms
Graph Tea: Simulating Tool for Graph Theory & AlgorithmsGraph Tea: Simulating Tool for Graph Theory & Algorithms
Graph Tea: Simulating Tool for Graph Theory & Algorithms
IJMTST Journal
 
Utilizing Graph Theory to Model Forensic Examination
Utilizing Graph Theory to Model Forensic ExaminationUtilizing Graph Theory to Model Forensic Examination
Utilizing Graph Theory to Model Forensic Examination
AM Publications,India
 
Clustering - Machine Learning Techniques
Clustering - Machine Learning TechniquesClustering - Machine Learning Techniques
Clustering - Machine Learning Techniques
Kush Kulshrestha
 
Spline (Interpolation)
Spline (Interpolation)Spline (Interpolation)
Spline (Interpolation)
Pallab Jana
 
Data science lab project
Data science lab projectData science lab project
Data science lab project
LuciaRavazzi
 

What's hot (16)

Design of suitable Magnitude Comparator Architecture for Big Data Analytics
Design of suitable Magnitude Comparator Architecture for Big Data AnalyticsDesign of suitable Magnitude Comparator Architecture for Big Data Analytics
Design of suitable Magnitude Comparator Architecture for Big Data Analytics
 
Application of Matrices in real life
Application of Matrices in real lifeApplication of Matrices in real life
Application of Matrices in real life
 
Matrix and it's Application
Matrix and it's ApplicationMatrix and it's Application
Matrix and it's Application
 
Statistics and Probability - all in one
Statistics and Probability - all in oneStatistics and Probability - all in one
Statistics and Probability - all in one
 
Closeness through-microaggregation-strict-privacy-with-enhanced-utility-prese...
Closeness through-microaggregation-strict-privacy-with-enhanced-utility-prese...Closeness through-microaggregation-strict-privacy-with-enhanced-utility-prese...
Closeness through-microaggregation-strict-privacy-with-enhanced-utility-prese...
 
Use of matrix in daily life
Use of matrix in daily lifeUse of matrix in daily life
Use of matrix in daily life
 
Applications of Matrices in Engineering
Applications of Matrices in EngineeringApplications of Matrices in Engineering
Applications of Matrices in Engineering
 
Applications of matrices in Real\Daily life
Applications of matrices in Real\Daily lifeApplications of matrices in Real\Daily life
Applications of matrices in Real\Daily life
 
Application of Matrix
Application of MatrixApplication of Matrix
Application of Matrix
 
Exploring optimizations for dynamic PageRank algorithm based on GPU : V4
Exploring optimizations for dynamic PageRank algorithm based on GPU : V4Exploring optimizations for dynamic PageRank algorithm based on GPU : V4
Exploring optimizations for dynamic PageRank algorithm based on GPU : V4
 
105575916 maths-edit-new
105575916 maths-edit-new105575916 maths-edit-new
105575916 maths-edit-new
 
Graph Tea: Simulating Tool for Graph Theory & Algorithms
Graph Tea: Simulating Tool for Graph Theory & AlgorithmsGraph Tea: Simulating Tool for Graph Theory & Algorithms
Graph Tea: Simulating Tool for Graph Theory & Algorithms
 
Utilizing Graph Theory to Model Forensic Examination
Utilizing Graph Theory to Model Forensic ExaminationUtilizing Graph Theory to Model Forensic Examination
Utilizing Graph Theory to Model Forensic Examination
 
Clustering - Machine Learning Techniques
Clustering - Machine Learning TechniquesClustering - Machine Learning Techniques
Clustering - Machine Learning Techniques
 
Spline (Interpolation)
Spline (Interpolation)Spline (Interpolation)
Spline (Interpolation)
 
Data science lab project
Data science lab projectData science lab project
Data science lab project
 

Similar to Data visualization

QQ Plot.pptx
QQ Plot.pptxQQ Plot.pptx
QQ Plot.pptx
Rahul Borate
 
Introduction to Matplotlib Library in Python.pptx
Introduction to Matplotlib Library in Python.pptxIntroduction to Matplotlib Library in Python.pptx
Introduction to Matplotlib Library in Python.pptx
bajajrishabh96tech
 
Data Visualization With R
Data Visualization With RData Visualization With R
Data Visualization With R
Rsquared Academy
 
Mat189: Cluster Analysis with NBA Sports Data
Mat189: Cluster Analysis with NBA Sports DataMat189: Cluster Analysis with NBA Sports Data
Mat189: Cluster Analysis with NBA Sports Data
KathleneNgo
 
Dimensionality reduction
Dimensionality reductionDimensionality reduction
Dimensionality reduction
Shatakirti Er
 
Data Visualization using different python libraries.pptx
Data Visualization using different python libraries.pptxData Visualization using different python libraries.pptx
Data Visualization using different python libraries.pptx
HamzaAli998966
 
You Don't Have to Be a Data Scientist to Do Data Science
You Don't Have to Be a Data Scientist to Do Data ScienceYou Don't Have to Be a Data Scientist to Do Data Science
You Don't Have to Be a Data Scientist to Do Data Science
Carmen Mardiros
 
Graphs and chars
Graphs and charsGraphs and chars
Graphs and chars
salemhusin
 
Spark ml streaming
Spark ml streamingSpark ml streaming
Spark ml streaming
Adam Doyle
 
Data Science as a Career and Intro to R
Data Science as a Career and Intro to RData Science as a Career and Intro to R
Data Science as a Career and Intro to R
Anshik Bansal
 
XL-MINER: Data Exploration
XL-MINER: Data ExplorationXL-MINER: Data Exploration
XL-MINER: Data Exploration
DataminingTools Inc
 
XL-MINER:Data Exploration
XL-MINER:Data ExplorationXL-MINER:Data Exploration
XL-MINER:Data Exploration
xlminer content
 
Graph Algorithms, Sparse Algebra, and the GraphBLAS with Janice McMahon
Graph Algorithms, Sparse Algebra, and the GraphBLAS with Janice McMahonGraph Algorithms, Sparse Algebra, and the GraphBLAS with Janice McMahon
Graph Algorithms, Sparse Algebra, and the GraphBLAS with Janice McMahon
Christopher Conlan
 
Dip day1&2
Dip day1&2Dip day1&2
Dip day1&2
nakarthik91
 
Data Analytics Using R - Report
Data Analytics Using R - ReportData Analytics Using R - Report
Data Analytics Using R - Report
Akanksha Gohil
 
Data Visualization (1).pptx
Data Visualization (1).pptxData Visualization (1).pptx
Data Visualization (1).pptx
cfiskillzz159
 
Introduction to Datamining Concept and Techniques
Introduction to Datamining Concept and TechniquesIntroduction to Datamining Concept and Techniques
Introduction to Datamining Concept and Techniques
Sơn Còm Nhom
 
Application of discrete mathematics in IT
Application of discrete mathematics in ITApplication of discrete mathematics in IT
Application of discrete mathematics in IT
ShahidAbbas52
 
Python Machine Learning - Getting Started
Python Machine Learning - Getting StartedPython Machine Learning - Getting Started
Python Machine Learning - Getting Started
Rafey Iqbal Rahman
 
Chapter 1 Introduction to Data Structures and Algorithms.pdf
Chapter 1 Introduction to Data Structures and Algorithms.pdfChapter 1 Introduction to Data Structures and Algorithms.pdf
Chapter 1 Introduction to Data Structures and Algorithms.pdf
Axmedcarb
 

Similar to Data visualization (20)

QQ Plot.pptx
QQ Plot.pptxQQ Plot.pptx
QQ Plot.pptx
 
Introduction to Matplotlib Library in Python.pptx
Introduction to Matplotlib Library in Python.pptxIntroduction to Matplotlib Library in Python.pptx
Introduction to Matplotlib Library in Python.pptx
 
Data Visualization With R
Data Visualization With RData Visualization With R
Data Visualization With R
 
Mat189: Cluster Analysis with NBA Sports Data
Mat189: Cluster Analysis with NBA Sports DataMat189: Cluster Analysis with NBA Sports Data
Mat189: Cluster Analysis with NBA Sports Data
 
Dimensionality reduction
Dimensionality reductionDimensionality reduction
Dimensionality reduction
 
Data Visualization using different python libraries.pptx
Data Visualization using different python libraries.pptxData Visualization using different python libraries.pptx
Data Visualization using different python libraries.pptx
 
You Don't Have to Be a Data Scientist to Do Data Science
You Don't Have to Be a Data Scientist to Do Data ScienceYou Don't Have to Be a Data Scientist to Do Data Science
You Don't Have to Be a Data Scientist to Do Data Science
 
Graphs and chars
Graphs and charsGraphs and chars
Graphs and chars
 
Spark ml streaming
Spark ml streamingSpark ml streaming
Spark ml streaming
 
Data Science as a Career and Intro to R
Data Science as a Career and Intro to RData Science as a Career and Intro to R
Data Science as a Career and Intro to R
 
XL-MINER: Data Exploration
XL-MINER: Data ExplorationXL-MINER: Data Exploration
XL-MINER: Data Exploration
 
XL-MINER:Data Exploration
XL-MINER:Data ExplorationXL-MINER:Data Exploration
XL-MINER:Data Exploration
 
Graph Algorithms, Sparse Algebra, and the GraphBLAS with Janice McMahon
Graph Algorithms, Sparse Algebra, and the GraphBLAS with Janice McMahonGraph Algorithms, Sparse Algebra, and the GraphBLAS with Janice McMahon
Graph Algorithms, Sparse Algebra, and the GraphBLAS with Janice McMahon
 
Dip day1&2
Dip day1&2Dip day1&2
Dip day1&2
 
Data Analytics Using R - Report
Data Analytics Using R - ReportData Analytics Using R - Report
Data Analytics Using R - Report
 
Data Visualization (1).pptx
Data Visualization (1).pptxData Visualization (1).pptx
Data Visualization (1).pptx
 
Introduction to Datamining Concept and Techniques
Introduction to Datamining Concept and TechniquesIntroduction to Datamining Concept and Techniques
Introduction to Datamining Concept and Techniques
 
Application of discrete mathematics in IT
Application of discrete mathematics in ITApplication of discrete mathematics in IT
Application of discrete mathematics in IT
 
Python Machine Learning - Getting Started
Python Machine Learning - Getting StartedPython Machine Learning - Getting Started
Python Machine Learning - Getting Started
 
Chapter 1 Introduction to Data Structures and Algorithms.pdf
Chapter 1 Introduction to Data Structures and Algorithms.pdfChapter 1 Introduction to Data Structures and Algorithms.pdf
Chapter 1 Introduction to Data Structures and Algorithms.pdf
 

Recently uploaded

一比一原版南昆士兰大学毕业证如何办理
一比一原版南昆士兰大学毕业证如何办理一比一原版南昆士兰大学毕业证如何办理
一比一原版南昆士兰大学毕业证如何办理
ugydym
 
Module 1 ppt BIG DATA ANALYTICS_NOTES FOR MCA
Module 1 ppt BIG DATA ANALYTICS_NOTES FOR MCAModule 1 ppt BIG DATA ANALYTICS_NOTES FOR MCA
Module 1 ppt BIG DATA ANALYTICS_NOTES FOR MCA
yuvarajkumar334
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
Vietnam Cotton & Spinning Association
 
A gentle exploration of Retrieval Augmented Generation
A gentle exploration of Retrieval Augmented GenerationA gentle exploration of Retrieval Augmented Generation
A gentle exploration of Retrieval Augmented Generation
dataschool1
 
一比一原版(UofT毕业证)多伦多大学毕业证如何办理
一比一原版(UofT毕业证)多伦多大学毕业证如何办理一比一原版(UofT毕业证)多伦多大学毕业证如何办理
一比一原版(UofT毕业证)多伦多大学毕业证如何办理
exukyp
 
一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理
一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理
一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理
actyx
 
Sample Devops SRE Product Companies .pdf
Sample Devops SRE  Product Companies .pdfSample Devops SRE  Product Companies .pdf
Sample Devops SRE Product Companies .pdf
Vineet
 
一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理
一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理
一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理
aguty
 
Data Scientist Machine Learning Profiles .pdf
Data Scientist Machine Learning  Profiles .pdfData Scientist Machine Learning  Profiles .pdf
Data Scientist Machine Learning Profiles .pdf
Vineet
 
Overview IFM June 2024 Consumer Confidence INDEX Report.pdf
Overview IFM June 2024 Consumer Confidence INDEX Report.pdfOverview IFM June 2024 Consumer Confidence INDEX Report.pdf
Overview IFM June 2024 Consumer Confidence INDEX Report.pdf
nhutnguyen355078
 
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
mkkikqvo
 
一比一原版莱斯大学毕业证(rice毕业证)如何办理
一比一原版莱斯大学毕业证(rice毕业证)如何办理一比一原版莱斯大学毕业证(rice毕业证)如何办理
一比一原版莱斯大学毕业证(rice毕业证)如何办理
zsafxbf
 
Econ3060_Screen Time and Success_ final_GroupProject.pdf
Econ3060_Screen Time and Success_ final_GroupProject.pdfEcon3060_Screen Time and Success_ final_GroupProject.pdf
Econ3060_Screen Time and Success_ final_GroupProject.pdf
blueshagoo1
 
Build applications with generative AI on Google Cloud
Build applications with generative AI on Google CloudBuild applications with generative AI on Google Cloud
Build applications with generative AI on Google Cloud
Márton Kodok
 
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
hyfjgavov
 
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
1tyxnjpia
 
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
ywqeos
 
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
hqfek
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
Vietnam Cotton & Spinning Association
 

Recently uploaded (20)

一比一原版南昆士兰大学毕业证如何办理
一比一原版南昆士兰大学毕业证如何办理一比一原版南昆士兰大学毕业证如何办理
一比一原版南昆士兰大学毕业证如何办理
 
Module 1 ppt BIG DATA ANALYTICS_NOTES FOR MCA
Module 1 ppt BIG DATA ANALYTICS_NOTES FOR MCAModule 1 ppt BIG DATA ANALYTICS_NOTES FOR MCA
Module 1 ppt BIG DATA ANALYTICS_NOTES FOR MCA
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
 
A gentle exploration of Retrieval Augmented Generation
A gentle exploration of Retrieval Augmented GenerationA gentle exploration of Retrieval Augmented Generation
A gentle exploration of Retrieval Augmented Generation
 
一比一原版(UofT毕业证)多伦多大学毕业证如何办理
一比一原版(UofT毕业证)多伦多大学毕业证如何办理一比一原版(UofT毕业证)多伦多大学毕业证如何办理
一比一原版(UofT毕业证)多伦多大学毕业证如何办理
 
一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理
一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理
一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理
 
Sample Devops SRE Product Companies .pdf
Sample Devops SRE  Product Companies .pdfSample Devops SRE  Product Companies .pdf
Sample Devops SRE Product Companies .pdf
 
一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理
一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理
一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理
 
Data Scientist Machine Learning Profiles .pdf
Data Scientist Machine Learning  Profiles .pdfData Scientist Machine Learning  Profiles .pdf
Data Scientist Machine Learning Profiles .pdf
 
Overview IFM June 2024 Consumer Confidence INDEX Report.pdf
Overview IFM June 2024 Consumer Confidence INDEX Report.pdfOverview IFM June 2024 Consumer Confidence INDEX Report.pdf
Overview IFM June 2024 Consumer Confidence INDEX Report.pdf
 
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
 
一比一原版莱斯大学毕业证(rice毕业证)如何办理
一比一原版莱斯大学毕业证(rice毕业证)如何办理一比一原版莱斯大学毕业证(rice毕业证)如何办理
一比一原版莱斯大学毕业证(rice毕业证)如何办理
 
Econ3060_Screen Time and Success_ final_GroupProject.pdf
Econ3060_Screen Time and Success_ final_GroupProject.pdfEcon3060_Screen Time and Success_ final_GroupProject.pdf
Econ3060_Screen Time and Success_ final_GroupProject.pdf
 
Build applications with generative AI on Google Cloud
Build applications with generative AI on Google CloudBuild applications with generative AI on Google Cloud
Build applications with generative AI on Google Cloud
 
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
 
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
 
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
一比一原版(lbs毕业证书)伦敦商学院毕业证如何办理
 
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
 

Data visualization

  • 1. Data Visualization How you see it….. Baijayanti Chakraborty
  • 2. About Me:- I am Baijayanti Chakraborty, a Post Graduate student from Great Lakes Institute of Management. I am doing PG in Business Analytics and Business Intelligence. You can find me on: 1.LinkedIN: https://www.linkedin.com/in/baijayanti-chakraborty/ 2.Twitter: twitter.com/baijayantic 3.Mail: baijayantichakraborty96@gmail.com 4.Github: https://github.com/baijayantichakraborty 5.Kaggel: https://www.kaggle.com/baijayanti94
  • 3. Today’s Spot Of Interest ❖ What is visualization and why do we need it !!!! ❖ Basic Visualizations ❖ Advanced Visualizations
  • 4. What is visualization and why do we need it !!! Data visualization is an art of how to turn numbers into useful knowledge. We all know that when we see images its easy to understand than when reading a lot of information. Let’s consider the below example: Over here is a snip from the IRIS dataset which is already present in R. It’s quite difficult to comprehend anything from the huge lot of data and hence to make it easy for understanding we will be using visualization techniques. Th
  • 5. Some famous tools for data visualization ……….
  • 6.
  • 7.
  • 8. Selecting the right kindof chart !!! There are four basic presentation types: 1. Comparison 2. Composition 3. Distribution 4. Relationship To determine which amongst these types is best suited for your data at hand we should be able to answer the below questions :- ● How many variables do you want to show in a single chart? ● How many data points will you display for each variable? ● Will you display values over a period of time, or among items or groups?
  • 9.
  • 11. Histogram Histogram is basically a plot that breaks the data into bins (or breaks) and shows frequency distribution of these bins.
  • 12. Bar/Line Chart ● Bar charts are recommended when you want to plot a categorical variable or a combination of continuous and categorical variable. ● Line Charts are commonly preferred when we are to analyse a trend spread over a time period.
  • 14. Boxplots Box Plots are used to plot a combination of categorical and continuous variables. This plot is useful for visualizing the spread of the data and detect outliers. It shows five statistically significant numbers- the minimum, the 25th percentile, the median, the 75th percentile and the maximum.Example for boxplot creation using the below code :
  • 15. The parts of Boxplot
  • 16. Boxplot analysis for the Iris dataset
  • 17. Scatter Plot Scatter Plot is used to see the relationship between two continuous variables.
  • 18. Scatter plot can help to visualise multiple variables and their relations as well.
  • 20. Some advanced packages of visualisation in R are :- ● Lattice Graphs :- Lattice package is essentially an improvement upon the R Graphics package and is used to visualize multivariate data. Some kinds of visualisations with lattice package are :- 1.Kernal Density Plots
  • 22. ● ggplot2 :- this package is one of the most widely used visualisation packages in R. It enables the users to create sophisticated visualisations with little code using the Grammar of Graphics. ● Plotly is an R package that creates interactive web-based graphs via the open source JavaScript graphing library plotly.js. It can easily translate the ‘ggplot2’ graphs to web-based versions also.
  • 23. Adavanced Scatter Plots Besides the basic version of scatterplots we can also create them using the “ggplot2” library. The below codes give a taste of the same.
  • 25. HeatMaps Heat Map uses intensity (density) of colors to display relationship between two or three or many variables in a two dimensional image. It allows us to explore two dimensions as the axis and the third dimension by intensity of color. The colour of the bars in the heat map is dependent on the cyl parameter of the dataset. The dataset used here is mtcars. It’s an inbuilt dataset.
  • 26. HeatMaps contd…. Using the library “plotly”, the heatmaps can be made interactive in nature. The below code gives an insight as to how we can use plotly.
  • 27. Correlogram Correlogram is used to test the level of correlation among the variable available in the data set. The cells of the matrix can be shaded or colored to show the correlation value.
  • 28. Correlogram contd... It is possible to use “ggplot2” aesthetics on the chart, for instance to color each category. We can use a new library “GGally” and see how different variations are made to the simple correlogram.
  • 29. Correlogram contd…. Change the type of plot used on each part of the correlogram. This is done with the upper and lower argument.
  • 30. Area Chart Area chart is used to show continuity across a variable or data set. It is very much same as line chart and is commonly used for time series plots. Alternatively, it is also used to plot continuous variables and analyze the underlying trends.
  • 31. 3D Plots ● To create a 3D plot using R can be done with the help of scatterplot3d package. ● scaterplot3d is very simple to use and it can be easily extended by adding supplementary points or regression planes into an already generated graphics.
  • 33. Quick Information For quick references you can easily check the cheatsheet side of Rstudio: https://rstudio.com/resources/cheatsheets/ References :- 1. https://rstudio.com/resources/cheatsheets/ 2. https://www.slant.co/topics/2354/~best-data-visualization-tools-for-massive-datasets 3. https://policyviz.com/product/core-principles-of-data-visualization-cheatsheet/ 4. https://eazybi.com/blog/data_visualization_and_chart_types/ 5. https://www.r-graph-gallery.com/199-correlation-matrix-with-ggally.html 6. https://towardsdatascience.com/a-guide-to-data-visualisation-in-r-for-beginners-ef6d41a34174?#0689

Editor's Notes

  1. This is like a million dollar question because before we start with any kind of analysis with data we need to know about the insights from the data.These relations among the various variables in the data needs to be understood and what better could it be than by understanding them with visual effects.An outlier is an observation that lies an abnormal distance from other values in a random sample from a population.
  2. For a very proper understanding of datasets we need to know which type of chart should be used when….
  3. 1. Used for continuous variables 2.It breaks the data into bins and shows frequency distribution of these bins 3.We can always change the bin size and see the effect it has on visualization. brewer.pal makes the color palettes from ColorBrewer available as R palettes.
  4. Boxplots are also used to detect the outliers present in the dataset. Outlier detection and removal is an essential step of successful data exploration.
  5. We can find the median , and also treat the outliers.
  6. By using the ~ sign, we can visualize how the spread (of Sepal Length) is across various categories ( of Species). In the last two graphs we have seen the example of color palettes. A color palette is a group of colors that is used to make the graph more appealing and helping create visual distinctions in the data.
  7. Lattice enables the use of trellis graphs. Trellis graphs exhibit the relationship between variables which are dependent on one or more variables.
  8. The Grammar of Graphics is a general scheme for data visualization which breaks up graphs into semantic components such as scales and layers.The popularity of ggplot2 has increased tremendously in recent years since it makes it possible to create graphs that contain both univariate and multivariate data in a very simple manner.
  9. Advanced visualisations include graphs like heatcharts,geographical maps,3D charts etc.which can be easily made by using visualisation tools like tableau etc.
  10. Darker the color, higher the correlation between variables. Positive correlations are displayed in blue and negative correlations in red color. Color intensity is proportional to the correlation value.
  11. GGally extends ggplot2 by adding several functions to reduce the complexity of combining geoms with transformed data. Some of these functions include a pairwise plot matrix, a scatterplot plot matrix, a parallel coordinates plot, a survival plot, and several functions to plot networks.