SlideShare a Scribd company logo
1 of 34
Data Visualization
How you see it…..
Baijayanti Chakraborty
About Me:-
I am Baijayanti Chakraborty, a Post Graduate student from Great Lakes Institute
of Management. I am doing PG in Business Analytics and Business Intelligence.
You can find me on:
1.LinkedIN: https://www.linkedin.com/in/baijayanti-chakraborty/
2.Twitter: twitter.com/baijayantic
3.Mail: baijayantichakraborty96@gmail.com
4.Github: https://github.com/baijayantichakraborty
5.Kaggel: https://www.kaggle.com/baijayanti94
Today’s Spot Of Interest
❖ What is visualization and why do we need it !!!!
❖ Basic Visualizations
❖ Advanced Visualizations
What is visualization and why do we need it !!!
Data visualization is an art of how to turn numbers into useful knowledge. We all know that when we see images its easy to
understand than when reading a lot of information.
Let’s consider the below example: Over here is a snip from the IRIS dataset which is already present in R. It’s quite difficult
to comprehend anything from the huge lot of data and hence to make it easy for understanding we will be using visualization
techniques.
Th
Some famous tools for data visualization ……….
Selecting the right kindof chart !!!
There are four basic presentation types:
1. Comparison
2. Composition
3. Distribution
4. Relationship
To determine which amongst these types is best suited for your
data at hand we should be able to answer the below questions :-
● How many variables do you want to show in a single
chart?
● How many data points will you display for each variable?
● Will you display values over a period of time, or among
items or groups?
Basic Visualization
(Without using ggplot2 library)
Histogram
Histogram is basically a plot that breaks the data into bins (or breaks) and shows frequency distribution of these bins.
Bar/Line Chart
● Bar charts are recommended when you want to plot a categorical variable or a combination of continuous and categorical
variable.
● Line Charts are commonly preferred when we are to analyse a trend spread over a time period.
Bar Chart/Line Chart …..
Boxplots
Box Plots are used to plot a combination of categorical and continuous variables. This plot is useful for visualizing the spread of the
data and detect outliers. It shows five statistically significant numbers- the minimum, the 25th percentile, the median, the 75th
percentile and the maximum.Example for boxplot creation using the below code :
The parts of Boxplot
Boxplot analysis for the Iris dataset
Scatter Plot
Scatter Plot is used to see the relationship between two continuous variables.
Scatter plot can help to visualise multiple variables and their relations as well.
Advanced Visualizations…...
Some advanced packages of visualisation in R are :-
● Lattice Graphs :- Lattice package is essentially an improvement upon the R Graphics package and is used to
visualize multivariate data. Some kinds of visualisations with lattice package are :-
1.Kernal Density Plots
2.Scatterplot matrices :
● ggplot2 :- this package is one of the most widely used visualisation packages in R. It enables the users to create
sophisticated visualisations with little code using the Grammar of Graphics.
● Plotly is an R package that creates interactive web-based graphs via the open source JavaScript graphing library
plotly.js. It can easily translate the ‘ggplot2’ graphs to web-based versions also.
Adavanced Scatter Plots
Besides the basic version of scatterplots we can also create them using the “ggplot2” library.
The below codes give a taste of the same.
Advanced scatter plots contd...
HeatMaps
Heat Map uses intensity (density) of colors to display relationship between two or three or many variables
in a two dimensional image. It allows us to explore two dimensions as the axis and the third dimension by
intensity of color.
The colour of the bars in the heat map is
dependent on the cyl parameter of the dataset.
The dataset used here is mtcars. It’s an inbuilt
dataset.
HeatMaps contd….
Using the library “plotly”, the heatmaps can be made interactive in nature. The below code gives
an insight as to how we can use plotly.
Correlogram
Correlogram is used to test the level of correlation among the variable available in the data
set. The cells of the matrix can be shaded or colored to show the correlation value.
Correlogram contd...
It is possible to use “ggplot2” aesthetics on the chart, for instance to color each category. We can use a new library “GGally”
and see how different variations are made to the simple correlogram.
Correlogram contd….
Change the type of plot used on each part of the correlogram. This is done with the upper and lower argument.
Area Chart
Area chart is used to show continuity across a variable or data set. It is very much same as line chart and is commonly used
for time series plots. Alternatively, it is also used to plot continuous variables and analyze the underlying trends.
3D Plots
● To create a 3D plot using R can be done
with the help of scatterplot3d package.
● scaterplot3d is very simple to use and it
can be easily extended by adding
supplementary points or regression
planes into an already generated
graphics.
Another variation of 3D plots
Quick Information
For quick references you can easily check the cheatsheet side of Rstudio:
https://rstudio.com/resources/cheatsheets/
References :-
1. https://rstudio.com/resources/cheatsheets/
2. https://www.slant.co/topics/2354/~best-data-visualization-tools-for-massive-datasets
3. https://policyviz.com/product/core-principles-of-data-visualization-cheatsheet/
4. https://eazybi.com/blog/data_visualization_and_chart_types/
5. https://www.r-graph-gallery.com/199-correlation-matrix-with-ggally.html
6. https://towardsdatascience.com/a-guide-to-data-visualisation-in-r-for-beginners-ef6d41a34174?#0689
Happy Learning!!!

More Related Content

What's hot

Design of suitable Magnitude Comparator Architecture for Big Data Analytics
Design of suitable Magnitude Comparator Architecture for Big Data AnalyticsDesign of suitable Magnitude Comparator Architecture for Big Data Analytics
Design of suitable Magnitude Comparator Architecture for Big Data Analyticsrahulmonikasharma
 
Application of Matrices in real life
Application of Matrices in real lifeApplication of Matrices in real life
Application of Matrices in real lifeShayshab Azad
 
Matrix and it's Application
Matrix and it's ApplicationMatrix and it's Application
Matrix and it's ApplicationMahmudle Hassan
 
Statistics and Probability - all in one
Statistics and Probability - all in oneStatistics and Probability - all in one
Statistics and Probability - all in oneMarjuk Ahmed Siddiki
 
Closeness through-microaggregation-strict-privacy-with-enhanced-utility-prese...
Closeness through-microaggregation-strict-privacy-with-enhanced-utility-prese...Closeness through-microaggregation-strict-privacy-with-enhanced-utility-prese...
Closeness through-microaggregation-strict-privacy-with-enhanced-utility-prese...Pvrtechnologies Nellore
 
Use of matrix in daily life
Use of matrix in daily lifeUse of matrix in daily life
Use of matrix in daily lifesadia Afrose
 
Applications of Matrices in Engineering
Applications of Matrices in EngineeringApplications of Matrices in Engineering
Applications of Matrices in EngineeringAliHasan358
 
Applications of matrices in Real\Daily life
Applications of matrices in Real\Daily lifeApplications of matrices in Real\Daily life
Applications of matrices in Real\Daily lifeSami Ullah
 
Application of Matrix
Application of MatrixApplication of Matrix
Application of MatrixRahman Hillol
 
Exploring optimizations for dynamic PageRank algorithm based on GPU : V4
Exploring optimizations for dynamic PageRank algorithm based on GPU : V4Exploring optimizations for dynamic PageRank algorithm based on GPU : V4
Exploring optimizations for dynamic PageRank algorithm based on GPU : V4Subhajit Sahu
 
105575916 maths-edit-new
105575916 maths-edit-new105575916 maths-edit-new
105575916 maths-edit-newhomeworkping7
 
Graph Tea: Simulating Tool for Graph Theory & Algorithms
Graph Tea: Simulating Tool for Graph Theory & AlgorithmsGraph Tea: Simulating Tool for Graph Theory & Algorithms
Graph Tea: Simulating Tool for Graph Theory & AlgorithmsIJMTST Journal
 
Utilizing Graph Theory to Model Forensic Examination
Utilizing Graph Theory to Model Forensic ExaminationUtilizing Graph Theory to Model Forensic Examination
Utilizing Graph Theory to Model Forensic ExaminationAM Publications,India
 
Clustering - Machine Learning Techniques
Clustering - Machine Learning TechniquesClustering - Machine Learning Techniques
Clustering - Machine Learning TechniquesKush Kulshrestha
 
Spline (Interpolation)
Spline (Interpolation)Spline (Interpolation)
Spline (Interpolation)Pallab Jana
 
Data science lab project
Data science lab projectData science lab project
Data science lab projectLuciaRavazzi
 

What's hot (16)

Design of suitable Magnitude Comparator Architecture for Big Data Analytics
Design of suitable Magnitude Comparator Architecture for Big Data AnalyticsDesign of suitable Magnitude Comparator Architecture for Big Data Analytics
Design of suitable Magnitude Comparator Architecture for Big Data Analytics
 
Application of Matrices in real life
Application of Matrices in real lifeApplication of Matrices in real life
Application of Matrices in real life
 
Matrix and it's Application
Matrix and it's ApplicationMatrix and it's Application
Matrix and it's Application
 
Statistics and Probability - all in one
Statistics and Probability - all in oneStatistics and Probability - all in one
Statistics and Probability - all in one
 
Closeness through-microaggregation-strict-privacy-with-enhanced-utility-prese...
Closeness through-microaggregation-strict-privacy-with-enhanced-utility-prese...Closeness through-microaggregation-strict-privacy-with-enhanced-utility-prese...
Closeness through-microaggregation-strict-privacy-with-enhanced-utility-prese...
 
Use of matrix in daily life
Use of matrix in daily lifeUse of matrix in daily life
Use of matrix in daily life
 
Applications of Matrices in Engineering
Applications of Matrices in EngineeringApplications of Matrices in Engineering
Applications of Matrices in Engineering
 
Applications of matrices in Real\Daily life
Applications of matrices in Real\Daily lifeApplications of matrices in Real\Daily life
Applications of matrices in Real\Daily life
 
Application of Matrix
Application of MatrixApplication of Matrix
Application of Matrix
 
Exploring optimizations for dynamic PageRank algorithm based on GPU : V4
Exploring optimizations for dynamic PageRank algorithm based on GPU : V4Exploring optimizations for dynamic PageRank algorithm based on GPU : V4
Exploring optimizations for dynamic PageRank algorithm based on GPU : V4
 
105575916 maths-edit-new
105575916 maths-edit-new105575916 maths-edit-new
105575916 maths-edit-new
 
Graph Tea: Simulating Tool for Graph Theory & Algorithms
Graph Tea: Simulating Tool for Graph Theory & AlgorithmsGraph Tea: Simulating Tool for Graph Theory & Algorithms
Graph Tea: Simulating Tool for Graph Theory & Algorithms
 
Utilizing Graph Theory to Model Forensic Examination
Utilizing Graph Theory to Model Forensic ExaminationUtilizing Graph Theory to Model Forensic Examination
Utilizing Graph Theory to Model Forensic Examination
 
Clustering - Machine Learning Techniques
Clustering - Machine Learning TechniquesClustering - Machine Learning Techniques
Clustering - Machine Learning Techniques
 
Spline (Interpolation)
Spline (Interpolation)Spline (Interpolation)
Spline (Interpolation)
 
Data science lab project
Data science lab projectData science lab project
Data science lab project
 

Similar to Data visualization

Mat189: Cluster Analysis with NBA Sports Data
Mat189: Cluster Analysis with NBA Sports DataMat189: Cluster Analysis with NBA Sports Data
Mat189: Cluster Analysis with NBA Sports DataKathleneNgo
 
Dimensionality reduction
Dimensionality reductionDimensionality reduction
Dimensionality reductionShatakirti Er
 
Data Visualization using different python libraries.pptx
Data Visualization using different python libraries.pptxData Visualization using different python libraries.pptx
Data Visualization using different python libraries.pptxHamzaAli998966
 
You Don't Have to Be a Data Scientist to Do Data Science
You Don't Have to Be a Data Scientist to Do Data ScienceYou Don't Have to Be a Data Scientist to Do Data Science
You Don't Have to Be a Data Scientist to Do Data ScienceCarmen Mardiros
 
Graphs and chars
Graphs and charsGraphs and chars
Graphs and charssalemhusin
 
Spark ml streaming
Spark ml streamingSpark ml streaming
Spark ml streamingAdam Doyle
 
Data Science as a Career and Intro to R
Data Science as a Career and Intro to RData Science as a Career and Intro to R
Data Science as a Career and Intro to RAnshik Bansal
 
XL-MINER:Data Exploration
XL-MINER:Data ExplorationXL-MINER:Data Exploration
XL-MINER:Data Explorationxlminer content
 
Graph Algorithms, Sparse Algebra, and the GraphBLAS with Janice McMahon
Graph Algorithms, Sparse Algebra, and the GraphBLAS with Janice McMahonGraph Algorithms, Sparse Algebra, and the GraphBLAS with Janice McMahon
Graph Algorithms, Sparse Algebra, and the GraphBLAS with Janice McMahonChristopher Conlan
 
Data Analytics Using R - Report
Data Analytics Using R - ReportData Analytics Using R - Report
Data Analytics Using R - ReportAkanksha Gohil
 
Data Visualization (1).pptx
Data Visualization (1).pptxData Visualization (1).pptx
Data Visualization (1).pptxcfiskillzz159
 
Introduction to Datamining Concept and Techniques
Introduction to Datamining Concept and TechniquesIntroduction to Datamining Concept and Techniques
Introduction to Datamining Concept and TechniquesSơn Còm Nhom
 
Application of discrete mathematics in IT
Application of discrete mathematics in ITApplication of discrete mathematics in IT
Application of discrete mathematics in ITShahidAbbas52
 
Python Machine Learning - Getting Started
Python Machine Learning - Getting StartedPython Machine Learning - Getting Started
Python Machine Learning - Getting StartedRafey Iqbal Rahman
 
Chapter 1 Introduction to Data Structures and Algorithms.pdf
Chapter 1 Introduction to Data Structures and Algorithms.pdfChapter 1 Introduction to Data Structures and Algorithms.pdf
Chapter 1 Introduction to Data Structures and Algorithms.pdfAxmedcarb
 
Profile Analysis of Users in Data Analytics Domain
Profile Analysis of   Users in Data Analytics DomainProfile Analysis of   Users in Data Analytics Domain
Profile Analysis of Users in Data Analytics DomainDrjabez
 

Similar to Data visualization (20)

QQ Plot.pptx
QQ Plot.pptxQQ Plot.pptx
QQ Plot.pptx
 
Data Visualization With R
Data Visualization With RData Visualization With R
Data Visualization With R
 
Mat189: Cluster Analysis with NBA Sports Data
Mat189: Cluster Analysis with NBA Sports DataMat189: Cluster Analysis with NBA Sports Data
Mat189: Cluster Analysis with NBA Sports Data
 
Dimensionality reduction
Dimensionality reductionDimensionality reduction
Dimensionality reduction
 
Data Visualization using different python libraries.pptx
Data Visualization using different python libraries.pptxData Visualization using different python libraries.pptx
Data Visualization using different python libraries.pptx
 
You Don't Have to Be a Data Scientist to Do Data Science
You Don't Have to Be a Data Scientist to Do Data ScienceYou Don't Have to Be a Data Scientist to Do Data Science
You Don't Have to Be a Data Scientist to Do Data Science
 
Graphs and chars
Graphs and charsGraphs and chars
Graphs and chars
 
Spark ml streaming
Spark ml streamingSpark ml streaming
Spark ml streaming
 
Data Science as a Career and Intro to R
Data Science as a Career and Intro to RData Science as a Career and Intro to R
Data Science as a Career and Intro to R
 
XL-MINER: Data Exploration
XL-MINER: Data ExplorationXL-MINER: Data Exploration
XL-MINER: Data Exploration
 
XL-MINER:Data Exploration
XL-MINER:Data ExplorationXL-MINER:Data Exploration
XL-MINER:Data Exploration
 
Graph Algorithms, Sparse Algebra, and the GraphBLAS with Janice McMahon
Graph Algorithms, Sparse Algebra, and the GraphBLAS with Janice McMahonGraph Algorithms, Sparse Algebra, and the GraphBLAS with Janice McMahon
Graph Algorithms, Sparse Algebra, and the GraphBLAS with Janice McMahon
 
Dip day1&2
Dip day1&2Dip day1&2
Dip day1&2
 
Data Analytics Using R - Report
Data Analytics Using R - ReportData Analytics Using R - Report
Data Analytics Using R - Report
 
Data Visualization (1).pptx
Data Visualization (1).pptxData Visualization (1).pptx
Data Visualization (1).pptx
 
Introduction to Datamining Concept and Techniques
Introduction to Datamining Concept and TechniquesIntroduction to Datamining Concept and Techniques
Introduction to Datamining Concept and Techniques
 
Application of discrete mathematics in IT
Application of discrete mathematics in ITApplication of discrete mathematics in IT
Application of discrete mathematics in IT
 
Python Machine Learning - Getting Started
Python Machine Learning - Getting StartedPython Machine Learning - Getting Started
Python Machine Learning - Getting Started
 
Chapter 1 Introduction to Data Structures and Algorithms.pdf
Chapter 1 Introduction to Data Structures and Algorithms.pdfChapter 1 Introduction to Data Structures and Algorithms.pdf
Chapter 1 Introduction to Data Structures and Algorithms.pdf
 
Profile Analysis of Users in Data Analytics Domain
Profile Analysis of   Users in Data Analytics DomainProfile Analysis of   Users in Data Analytics Domain
Profile Analysis of Users in Data Analytics Domain
 

Recently uploaded

Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubaikojalkojal131
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...nirzagarg
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样wsppdmt
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...HyderabadDolls
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNKTimothy Spann
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...Health
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Klinik kandungan
 
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...gragchanchal546
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...HyderabadDolls
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareGraham Ware
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...nirzagarg
 
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxronsairoathenadugay
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRajesh Mondal
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...Elaine Werffeli
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxchadhar227
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraGovindSinghDasila
 
Kings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about themKings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about themeitharjee
 
Computer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfComputer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfSayantanBiswas37
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangeThinkInnovation
 

Recently uploaded (20)

Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham Ware
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
Kings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about themKings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about them
 
Computer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfComputer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdf
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 

Data visualization

  • 1. Data Visualization How you see it….. Baijayanti Chakraborty
  • 2. About Me:- I am Baijayanti Chakraborty, a Post Graduate student from Great Lakes Institute of Management. I am doing PG in Business Analytics and Business Intelligence. You can find me on: 1.LinkedIN: https://www.linkedin.com/in/baijayanti-chakraborty/ 2.Twitter: twitter.com/baijayantic 3.Mail: baijayantichakraborty96@gmail.com 4.Github: https://github.com/baijayantichakraborty 5.Kaggel: https://www.kaggle.com/baijayanti94
  • 3. Today’s Spot Of Interest ❖ What is visualization and why do we need it !!!! ❖ Basic Visualizations ❖ Advanced Visualizations
  • 4. What is visualization and why do we need it !!! Data visualization is an art of how to turn numbers into useful knowledge. We all know that when we see images its easy to understand than when reading a lot of information. Let’s consider the below example: Over here is a snip from the IRIS dataset which is already present in R. It’s quite difficult to comprehend anything from the huge lot of data and hence to make it easy for understanding we will be using visualization techniques. Th
  • 5. Some famous tools for data visualization ……….
  • 6.
  • 7.
  • 8. Selecting the right kindof chart !!! There are four basic presentation types: 1. Comparison 2. Composition 3. Distribution 4. Relationship To determine which amongst these types is best suited for your data at hand we should be able to answer the below questions :- ● How many variables do you want to show in a single chart? ● How many data points will you display for each variable? ● Will you display values over a period of time, or among items or groups?
  • 9.
  • 11. Histogram Histogram is basically a plot that breaks the data into bins (or breaks) and shows frequency distribution of these bins.
  • 12. Bar/Line Chart ● Bar charts are recommended when you want to plot a categorical variable or a combination of continuous and categorical variable. ● Line Charts are commonly preferred when we are to analyse a trend spread over a time period.
  • 14. Boxplots Box Plots are used to plot a combination of categorical and continuous variables. This plot is useful for visualizing the spread of the data and detect outliers. It shows five statistically significant numbers- the minimum, the 25th percentile, the median, the 75th percentile and the maximum.Example for boxplot creation using the below code :
  • 15. The parts of Boxplot
  • 16. Boxplot analysis for the Iris dataset
  • 17. Scatter Plot Scatter Plot is used to see the relationship between two continuous variables.
  • 18. Scatter plot can help to visualise multiple variables and their relations as well.
  • 20. Some advanced packages of visualisation in R are :- ● Lattice Graphs :- Lattice package is essentially an improvement upon the R Graphics package and is used to visualize multivariate data. Some kinds of visualisations with lattice package are :- 1.Kernal Density Plots
  • 22. ● ggplot2 :- this package is one of the most widely used visualisation packages in R. It enables the users to create sophisticated visualisations with little code using the Grammar of Graphics. ● Plotly is an R package that creates interactive web-based graphs via the open source JavaScript graphing library plotly.js. It can easily translate the ‘ggplot2’ graphs to web-based versions also.
  • 23. Adavanced Scatter Plots Besides the basic version of scatterplots we can also create them using the “ggplot2” library. The below codes give a taste of the same.
  • 25. HeatMaps Heat Map uses intensity (density) of colors to display relationship between two or three or many variables in a two dimensional image. It allows us to explore two dimensions as the axis and the third dimension by intensity of color. The colour of the bars in the heat map is dependent on the cyl parameter of the dataset. The dataset used here is mtcars. It’s an inbuilt dataset.
  • 26. HeatMaps contd…. Using the library “plotly”, the heatmaps can be made interactive in nature. The below code gives an insight as to how we can use plotly.
  • 27. Correlogram Correlogram is used to test the level of correlation among the variable available in the data set. The cells of the matrix can be shaded or colored to show the correlation value.
  • 28. Correlogram contd... It is possible to use “ggplot2” aesthetics on the chart, for instance to color each category. We can use a new library “GGally” and see how different variations are made to the simple correlogram.
  • 29. Correlogram contd…. Change the type of plot used on each part of the correlogram. This is done with the upper and lower argument.
  • 30. Area Chart Area chart is used to show continuity across a variable or data set. It is very much same as line chart and is commonly used for time series plots. Alternatively, it is also used to plot continuous variables and analyze the underlying trends.
  • 31. 3D Plots ● To create a 3D plot using R can be done with the help of scatterplot3d package. ● scaterplot3d is very simple to use and it can be easily extended by adding supplementary points or regression planes into an already generated graphics.
  • 33. Quick Information For quick references you can easily check the cheatsheet side of Rstudio: https://rstudio.com/resources/cheatsheets/ References :- 1. https://rstudio.com/resources/cheatsheets/ 2. https://www.slant.co/topics/2354/~best-data-visualization-tools-for-massive-datasets 3. https://policyviz.com/product/core-principles-of-data-visualization-cheatsheet/ 4. https://eazybi.com/blog/data_visualization_and_chart_types/ 5. https://www.r-graph-gallery.com/199-correlation-matrix-with-ggally.html 6. https://towardsdatascience.com/a-guide-to-data-visualisation-in-r-for-beginners-ef6d41a34174?#0689

Editor's Notes

  1. This is like a million dollar question because before we start with any kind of analysis with data we need to know about the insights from the data.These relations among the various variables in the data needs to be understood and what better could it be than by understanding them with visual effects.An outlier is an observation that lies an abnormal distance from other values in a random sample from a population.
  2. For a very proper understanding of datasets we need to know which type of chart should be used when….
  3. 1. Used for continuous variables 2.It breaks the data into bins and shows frequency distribution of these bins 3.We can always change the bin size and see the effect it has on visualization. brewer.pal makes the color palettes from ColorBrewer available as R palettes.
  4. Boxplots are also used to detect the outliers present in the dataset. Outlier detection and removal is an essential step of successful data exploration.
  5. We can find the median , and also treat the outliers.
  6. By using the ~ sign, we can visualize how the spread (of Sepal Length) is across various categories ( of Species). In the last two graphs we have seen the example of color palettes. A color palette is a group of colors that is used to make the graph more appealing and helping create visual distinctions in the data.
  7. Lattice enables the use of trellis graphs. Trellis graphs exhibit the relationship between variables which are dependent on one or more variables.
  8. The Grammar of Graphics is a general scheme for data visualization which breaks up graphs into semantic components such as scales and layers.The popularity of ggplot2 has increased tremendously in recent years since it makes it possible to create graphs that contain both univariate and multivariate data in a very simple manner.
  9. Advanced visualisations include graphs like heatcharts,geographical maps,3D charts etc.which can be easily made by using visualisation tools like tableau etc.
  10. Darker the color, higher the correlation between variables. Positive correlations are displayed in blue and negative correlations in red color. Color intensity is proportional to the correlation value.
  11. GGally extends ggplot2 by adding several functions to reduce the complexity of combining geoms with transformed data. Some of these functions include a pairwise plot matrix, a scatterplot plot matrix, a parallel coordinates plot, a survival plot, and several functions to plot networks.