SlideShare a Scribd company logo
1 of 11
Data
Exploration
And
Visiulization
Data visualization and its importance
• It is almost impossible for anyone to extract any information from terabytes or petabytes of data without
using visualization techniques.
• Pictures and visuals way better than any numbers or text
• Data visualization techniques are also helpful for the feature engineering part in machine learning
Basic visualization techniques
Matplotlib / Seaborn
•Bar chart Histogram
•Density plot or
Distribution
•Box Plot •Scatter Plot •Pair Plot
•Correlation and
HeatMap
Bar Chart
• The bar chart is a frequency chart for a qualitative variable.
• A bar chart can be used to access the most-occurring and least-occurring categories within a
dataset
Histogram
A histogram is a plot that shows the frequency distribution of a set of continuous variables.
Distribution or Density plot
A distribution or density plot depicts the distribution of data over a continuous
interval. A density plot is like a smoothed histogram and visualizes the
distribution of data over a continuous interval.
Box Plot
• Box plot is a graphical representation of numerical data that can be used to understand the variability of the data
and the existence of outliers. Box plot is designed by identifying the following descriptive statistics.
1. Lower quartile, median and upper quartile
2. Lowest and highest values
3. Interquartile range(IQR)
• Box plot is constructed using IQR, minimum and maximum values. IQR is the distance between the 3rd quartile
and 1st quartile. The length of the box is equivalent to IQR.
 Using “Matplotlib”
 Using “Seaborn”
Scatter Plot
• In Scatter Plot ,the values of two variables are plotted along two axes and the resulting pattern can
reveal correlation present between the variables if any
• A scatter plot is also useful for assessing the strength of the relationship and to find if there are any
outliers in the data
Pair Plot
• When we have many variables it is not convenient to draw scatter plots for each pair of
variables to understand the relationship
• So that we have to use a pair plot to depict the relationship in a single diagram
Correlation and HeatMap
Correlation is used for measuring the strength and direction of the linear relationship between
two continuous random variables x and y.
• A positive correlation means the variables increase or decrease together.
• A negative correlation means if one variable increases then the other decrease.
Conclusion
 The objective of descriptive analytics is simple comprehension of data using
summarization basic statistical measures and visualization.
 Matplotlib and seaborn are the two most widely used libraries for creating a
visualization.
 Plots like histograms, distribution plots, box plots, scatter plots, pair plots,
heatmap, can be created to find insights during exploratory analysis.

More Related Content

Similar to Data Visualization (1).pptx

collectionandrepresentationofdata1-200904192336.pptx
collectionandrepresentationofdata1-200904192336.pptxcollectionandrepresentationofdata1-200904192336.pptx
collectionandrepresentationofdata1-200904192336.pptxaibakimito
 
diagrammatic and graphical representation of data
 diagrammatic and graphical representation of data diagrammatic and graphical representation of data
diagrammatic and graphical representation of dataVarun Prem Varu
 
Graphs(Biostatistics and Research Methodology) B.pharmacy(8th sem.)
Graphs(Biostatistics and Research Methodology) B.pharmacy(8th sem.)Graphs(Biostatistics and Research Methodology) B.pharmacy(8th sem.)
Graphs(Biostatistics and Research Methodology) B.pharmacy(8th sem.)Pranjal Saxena
 
2. AAdata presentation edited edited tutor srudents(1).pptx
2. AAdata presentation edited edited tutor srudents(1).pptx2. AAdata presentation edited edited tutor srudents(1).pptx
2. AAdata presentation edited edited tutor srudents(1).pptxssuser504dda
 
Exploratory Data Analysis (EDA) .pptx
Exploratory Data Analysis (EDA) .pptxExploratory Data Analysis (EDA) .pptx
Exploratory Data Analysis (EDA) .pptxZahidRiazHaans
 
Plotting histogram in bigdata analytics
Plotting histogram in bigdata analyticsPlotting histogram in bigdata analytics
Plotting histogram in bigdata analyticsRajalakshmiK19
 
Data Visualisation.pdf
Data Visualisation.pdfData Visualisation.pdf
Data Visualisation.pdfThiyagu K
 
cs 601 - lecture 1.pptx
cs 601 - lecture 1.pptxcs 601 - lecture 1.pptx
cs 601 - lecture 1.pptxGopalPatidar13
 
Basic understanding of Plots and diagrams used in data interpretation
 Basic understanding of Plots and diagrams used in data interpretation   Basic understanding of Plots and diagrams used in data interpretation
Basic understanding of Plots and diagrams used in data interpretation Subedi Suraj
 
chapter2 research paper in mathematics.ppt
chapter2 research paper in mathematics.pptchapter2 research paper in mathematics.ppt
chapter2 research paper in mathematics.pptKyMarieCabilesSedico
 
Data presentation
Data presentationData presentation
Data presentationWeam Banjar
 
Lect4 principal component analysis-I
Lect4 principal component analysis-ILect4 principal component analysis-I
Lect4 principal component analysis-Ihktripathy
 
2.4 Scatterplots, correlation, and regression
2.4 Scatterplots, correlation, and regression2.4 Scatterplots, correlation, and regression
2.4 Scatterplots, correlation, and regressionLong Beach City College
 
graphic representations in statistics
 graphic representations in statistics graphic representations in statistics
graphic representations in statisticsUnsa Shakir
 
SEMINAR Presentation ppt.pptx
SEMINAR Presentation ppt.pptxSEMINAR Presentation ppt.pptx
SEMINAR Presentation ppt.pptxWageYado
 
SEMINAR Presentation ppt.pptx
SEMINAR Presentation ppt.pptxSEMINAR Presentation ppt.pptx
SEMINAR Presentation ppt.pptxWageYado
 

Similar to Data Visualization (1).pptx (20)

collectionandrepresentationofdata1-200904192336.pptx
collectionandrepresentationofdata1-200904192336.pptxcollectionandrepresentationofdata1-200904192336.pptx
collectionandrepresentationofdata1-200904192336.pptx
 
Tabular and Graphical Representation of Data
Tabular and Graphical Representation of Data Tabular and Graphical Representation of Data
Tabular and Graphical Representation of Data
 
diagrammatic and graphical representation of data
 diagrammatic and graphical representation of data diagrammatic and graphical representation of data
diagrammatic and graphical representation of data
 
Introduction to Descriptive Statistics
Introduction to Descriptive StatisticsIntroduction to Descriptive Statistics
Introduction to Descriptive Statistics
 
Graphs(Biostatistics and Research Methodology) B.pharmacy(8th sem.)
Graphs(Biostatistics and Research Methodology) B.pharmacy(8th sem.)Graphs(Biostatistics and Research Methodology) B.pharmacy(8th sem.)
Graphs(Biostatistics and Research Methodology) B.pharmacy(8th sem.)
 
2. AAdata presentation edited edited tutor srudents(1).pptx
2. AAdata presentation edited edited tutor srudents(1).pptx2. AAdata presentation edited edited tutor srudents(1).pptx
2. AAdata presentation edited edited tutor srudents(1).pptx
 
Exploratory Data Analysis (EDA) .pptx
Exploratory Data Analysis (EDA) .pptxExploratory Data Analysis (EDA) .pptx
Exploratory Data Analysis (EDA) .pptx
 
Plotting histogram in bigdata analytics
Plotting histogram in bigdata analyticsPlotting histogram in bigdata analytics
Plotting histogram in bigdata analytics
 
Data Visualisation.pdf
Data Visualisation.pdfData Visualisation.pdf
Data Visualisation.pdf
 
cs 601 - lecture 1.pptx
cs 601 - lecture 1.pptxcs 601 - lecture 1.pptx
cs 601 - lecture 1.pptx
 
Basic understanding of Plots and diagrams used in data interpretation
 Basic understanding of Plots and diagrams used in data interpretation   Basic understanding of Plots and diagrams used in data interpretation
Basic understanding of Plots and diagrams used in data interpretation
 
chapter2.ppt
chapter2.pptchapter2.ppt
chapter2.ppt
 
chapter2 research paper in mathematics.ppt
chapter2 research paper in mathematics.pptchapter2 research paper in mathematics.ppt
chapter2 research paper in mathematics.ppt
 
Data presentation
Data presentationData presentation
Data presentation
 
Lect4 principal component analysis-I
Lect4 principal component analysis-ILect4 principal component analysis-I
Lect4 principal component analysis-I
 
2.4 Scatterplots, correlation, and regression
2.4 Scatterplots, correlation, and regression2.4 Scatterplots, correlation, and regression
2.4 Scatterplots, correlation, and regression
 
graphic representations in statistics
 graphic representations in statistics graphic representations in statistics
graphic representations in statistics
 
Basic statisctis -Anandh Shankar
Basic statisctis -Anandh ShankarBasic statisctis -Anandh Shankar
Basic statisctis -Anandh Shankar
 
SEMINAR Presentation ppt.pptx
SEMINAR Presentation ppt.pptxSEMINAR Presentation ppt.pptx
SEMINAR Presentation ppt.pptx
 
SEMINAR Presentation ppt.pptx
SEMINAR Presentation ppt.pptxSEMINAR Presentation ppt.pptx
SEMINAR Presentation ppt.pptx
 

Recently uploaded

Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...Bertram Ludäscher
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareGraham Ware
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...gajnagarg
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...SOFTTECHHUB
 
Kings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about themKings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about themeitharjee
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...HyderabadDolls
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...gajnagarg
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Klinik kandungan
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...HyderabadDolls
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraGovindSinghDasila
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...nirzagarg
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubaikojalkojal131
 
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxronsairoathenadugay
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1ranjankumarbehera14
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...HyderabadDolls
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangeThinkInnovation
 

Recently uploaded (20)

Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham Ware
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
 
Kings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about themKings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about them
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 

Data Visualization (1).pptx

  • 2. Data visualization and its importance • It is almost impossible for anyone to extract any information from terabytes or petabytes of data without using visualization techniques. • Pictures and visuals way better than any numbers or text • Data visualization techniques are also helpful for the feature engineering part in machine learning
  • 3. Basic visualization techniques Matplotlib / Seaborn •Bar chart Histogram •Density plot or Distribution •Box Plot •Scatter Plot •Pair Plot •Correlation and HeatMap
  • 4. Bar Chart • The bar chart is a frequency chart for a qualitative variable. • A bar chart can be used to access the most-occurring and least-occurring categories within a dataset
  • 5. Histogram A histogram is a plot that shows the frequency distribution of a set of continuous variables.
  • 6. Distribution or Density plot A distribution or density plot depicts the distribution of data over a continuous interval. A density plot is like a smoothed histogram and visualizes the distribution of data over a continuous interval.
  • 7. Box Plot • Box plot is a graphical representation of numerical data that can be used to understand the variability of the data and the existence of outliers. Box plot is designed by identifying the following descriptive statistics. 1. Lower quartile, median and upper quartile 2. Lowest and highest values 3. Interquartile range(IQR) • Box plot is constructed using IQR, minimum and maximum values. IQR is the distance between the 3rd quartile and 1st quartile. The length of the box is equivalent to IQR.  Using “Matplotlib”  Using “Seaborn”
  • 8. Scatter Plot • In Scatter Plot ,the values of two variables are plotted along two axes and the resulting pattern can reveal correlation present between the variables if any • A scatter plot is also useful for assessing the strength of the relationship and to find if there are any outliers in the data
  • 9. Pair Plot • When we have many variables it is not convenient to draw scatter plots for each pair of variables to understand the relationship • So that we have to use a pair plot to depict the relationship in a single diagram
  • 10. Correlation and HeatMap Correlation is used for measuring the strength and direction of the linear relationship between two continuous random variables x and y. • A positive correlation means the variables increase or decrease together. • A negative correlation means if one variable increases then the other decrease.
  • 11. Conclusion  The objective of descriptive analytics is simple comprehension of data using summarization basic statistical measures and visualization.  Matplotlib and seaborn are the two most widely used libraries for creating a visualization.  Plots like histograms, distribution plots, box plots, scatter plots, pair plots, heatmap, can be created to find insights during exploratory analysis.