Data Exploration and Visualization
Data visualization and its importance
• It is almost impossible for anyone to make sense of terabytes or petabytes of raw data without using visualization techniques.
• Pictures and visuals convey patterns far better than raw numbers or text.
• Data visualization techniques are also helpful for the feature engineering part of machine learning.
Basic visualization techniques
Matplotlib / Seaborn
• Bar chart
• Histogram
• Density plot or distribution plot
• Box plot
• Scatter plot
• Pair plot
• Correlation and heatmap
Bar Chart
• The bar chart is a frequency chart for a qualitative (categorical) variable.
• A bar chart can be used to assess the most frequent and least frequent categories within a dataset.
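The idea can be sketched in Python with pandas and Matplotlib. The fuel-type data below is made up for illustration and is not from the original slides.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical qualitative variable: fuel type of 10 cars.
fuel = pd.Series(["petrol", "diesel", "petrol", "cng", "petrol",
                  "diesel", "petrol", "diesel", "petrol", "cng"])

# Frequency of each category.
counts = fuel.value_counts()

# One bar per category; bar height is the category frequency.
fig, ax = plt.subplots()
ax.bar(counts.index, counts.values)
ax.set_xlabel("Fuel type")
ax.set_ylabel("Frequency")
ax.set_title("Cars per fuel type")

# The tallest and shortest bars correspond to these categories.
most_common = counts.idxmax()
least_common = counts.idxmin()
```

Here most_common and least_common name the categories behind the tallest and shortest bars.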
Histogram
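A histogram is a plot that shows the frequency distribution of a continuous variable: the range of values is divided into bins, and the height of each bar is the number of observations in that bin.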
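A minimal Matplotlib sketch of a histogram follows. The height values are synthetic and the choice of 20 bins is an assumption made only for illustration.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt
import numpy as np

# Synthetic continuous variable: 500 heights in cm (illustrative only).
rng = np.random.default_rng(seed=0)
heights = rng.normal(loc=170, scale=10, size=500)

# ax.hist returns the count per bin and the bin edges it used.
fig, ax = plt.subplots()
bin_counts, bin_edges, _ = ax.hist(heights, bins=20, edgecolor="black")
ax.set_xlabel("Height (cm)")
ax.set_ylabel("Frequency")
ax.set_title("Histogram of heights")
```

The number of bins changes how coarse or fine the picture looks, so it is worth trying a few values.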
Distribution or Density plot
A distribution or density plot depicts the distribution of data over a continuous
interval. It is like a smoothed histogram: instead of counting observations in
discrete bins, it draws a continuous curve that estimates the density of the data.
Box Plot
• A box plot is a graphical representation of numerical data that can be used to understand the variability of the data
and the existence of outliers. It is built from the following descriptive statistics:
1. Lower quartile, median and upper quartile
2. Lowest and highest values
3. Interquartile range (IQR)
• The box plot is constructed using the IQR and the minimum and maximum values. The IQR is the distance between the 3rd quartile
and the 1st quartile, and the length of the box is equal to the IQR.
• A box plot can be drawn using either Matplotlib or Seaborn.
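The statistics behind a box plot can be computed directly and the plot drawn with both libraries, as in the sketch below. The sample values are made up, and the 1.5 × IQR rule for flagging outliers is an assumption: it is the default whisker setting in both Matplotlib and Seaborn, but the slides do not state it.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt
import numpy as np
import seaborn as sns

# Made-up sample; 45 is a deliberate outlier.
values = np.array([12, 15, 14, 10, 18, 20, 16, 13, 17, 45])

# Quartiles and interquartile range (the length of the box).
q1, median, q3 = np.percentile(values, [25, 50, 75])
iqr = q3 - q1

# Assumed outlier rule: points beyond 1.5 * IQR from the box.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = values[(values < lower_fence) | (values > upper_fence)]

# The same data drawn with each library, side by side.
fig, (ax_mpl, ax_sns) = plt.subplots(1, 2, figsize=(8, 4))
ax_mpl.boxplot(values)
ax_mpl.set_title("Matplotlib")
sns.boxplot(y=values, ax=ax_sns)
ax_sns.set_title("Seaborn")
```

For this sample the quartiles are 13.25, 15.5 and 17.75, so the IQR is 4.5 and the only value outside the fences is 45.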
Scatter Plot
• In a scatter plot, the values of two variables are plotted along two axes, and the resulting pattern can
reveal any correlation between the variables.
• A scatter plot is also useful for assessing the strength of the relationship and for finding
outliers in the data.
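A small Matplotlib sketch of a scatter plot follows. The two variables are synthetic, generated with a linear relationship plus noise purely for illustration, and the correlation coefficient is computed alongside so the visual pattern can be compared with a number.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt
import numpy as np

# Synthetic data: y depends linearly on x, plus random noise.
rng = np.random.default_rng(seed=1)
x = rng.uniform(0, 10, size=100)
y = 2 * x + rng.normal(0, 2, size=100)

# One point per observation; an upward-sloping cloud suggests positive correlation.
fig, ax = plt.subplots()
ax.scatter(x, y)
ax.set_xlabel("x")
ax.set_ylabel("y")

# Pearson correlation coefficient between x and y.
r = np.corrcoef(x, y)[0, 1]
```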
Pair Plot
• When there are many variables, it is not convenient to draw a separate scatter plot for each pair of
variables to understand their relationships.
• A pair plot solves this by showing the relationships between all pairs of variables in a single diagram.
Correlation and HeatMap
Correlation measures the strength and direction of the linear relationship between
two continuous random variables x and y.
• A positive correlation means the variables increase or decrease together.
• A negative correlation means that when one variable increases, the other decreases.
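A correlation matrix and its heatmap might be produced as in the sketch below, using pandas for the correlations and Seaborn for the heatmap. The three columns are synthetic: y_pos is built to move with x and y_neg to move against it.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import seaborn as sns

# Synthetic variables with known relationships to x.
rng = np.random.default_rng(seed=3)
x = rng.normal(size=200)
df = pd.DataFrame({
    "x": x,
    "y_pos": x + 0.5 * rng.normal(size=200),   # increases with x
    "y_neg": -x + 0.5 * rng.normal(size=200),  # decreases as x increases
})

# Pairwise Pearson correlation coefficients, each between -1 and 1.
corr = df.corr()

# Heatmap: colour encodes the sign and strength of each correlation.
fig, ax = plt.subplots()
sns.heatmap(corr, annot=True, cmap="coolwarm", vmin=-1, vmax=1, ax=ax)
```

Fixing the colour scale to the range -1 to 1 keeps the colours comparable between datasets.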
Conclusion
• The objective of descriptive analytics is simple comprehension of data using
summarization, basic statistical measures, and visualization.
• Matplotlib and Seaborn are two of the most widely used Python libraries for creating
visualizations.
• Plots such as histograms, distribution plots, box plots, scatter plots, pair plots,
and heatmaps can be created to find insights during exploratory analysis.

