2. VISUAL AIDS FOR EDA
TWO IMPORTANT GOALS
1. TO EXTRACT KNOWLEDGE FROM THE DATA
2. TO PRESENT THE DATA TO STAKEHOLDERS
• Line chart
• Bar chart
• Scatter plot
• Area plot and stacked plot
• Pie chart
• Table chart
• Polar chart
• Histogram
• Lollipop chart
LINE CHART
• a continuous variable
• a discrete variable
• a line chart is used to illustrate the relationship between two or more continuous
variables
• A line chart, also referred to as a line graph or a line plot, connects a series of
data points using a line
• This chart type presents sequential values to help you identify trends
LINE CHART
• the x-axis (horizontal axis) represents a sequential progression of values( a dimension of
time)
• The y-axis (vertical axis) then tells you the values for a selected metric across that
progression(Ex. money, number of people, percent change, etc.)
• The graph should have each axis, or horizontal and vertical lines framing the chart, labeled
• Line charts can display both single or many lines on a chart
• In real-life scenarios, the data is mostly available in CSV, JSON, Excel, or XML formats
BAR CHARTS
• the most common types of visualization
• Bars can be drawn horizontally or vertically to represent categorical
variables.
• bar charts are very convenient when the changes are large
SCATTER PLOT
• Scatter plots are also called scatter graphs, scatter charts, scattergrams, and scatter diagrams
• They use a Cartesian coordinates system to display values of typically two variables for a set of data.
• When should we use a scatter plot
1. When one continuous variable is dependent on another variable, which is under the control of the
observer
2. When both continuous variables are independent
• scatter plots are used when we need to show the relationship between two variables
• referred to as correlation plots
Some examples in which scatter plots are suitable are as follows:
• Research studies have successfully established that the number of hours of
sleep required by a person depends on the age of the person.
• The average income for adults is based on the number of years of education
2 EDAV Exploratory Data Analysis1234.pptx
2 EDAV Exploratory Data Analysis1234.pptx
2 EDAV Exploratory Data Analysis1234.pptx
2 EDAV Exploratory Data Analysis1234.pptx

2 EDAV Exploratory Data Analysis1234.pptx

  • 1.
  • 2.
    TWO IMPORTANT GOALS 1.TO EXTRACT KNOWLEDGE FROM THE DATA 2. TO PRESENT THE DATA TO STAKEHOLDERS • Line chart • Bar chart • Scatter plot • Area plot and stacked plot • Pie chart • Table chart • Polar chart • Histogram • Lollipop chart
  • 3.
    LINE CHART • acontinuous variable • a discrete variable • a line chart is used to illustrate the relationship between two or more continuous variables • A line chart, also referred to as a line graph or a line plot, connects a series of data points using a line • This chart type presents sequential values to help you identify trends
  • 4.
    LINE CHART • thex-axis (horizontal axis) represents a sequential progression of values( a dimension of time) • The y-axis (vertical axis) then tells you the values for a selected metric across that progression(Ex. money, number of people, percent change, etc.) • The graph should have each axis, or horizontal and vertical lines framing the chart, labeled • Line charts can display both single or many lines on a chart • In real-life scenarios, the data is mostly available in CSV, JSON, Excel, or XML formats
  • 7.
    BAR CHARTS • themost common types of visualization • Bars can be drawn horizontally or vertically to represent categorical variables. • bar charts are very convenient when the changes are large
  • 10.
    SCATTER PLOT • Scatterplots are also called scatter graphs, scatter charts, scattergrams, and scatter diagrams • They use a Cartesian coordinates system to display values of typically two variables for a set of data. • When should we use a scatter plot 1. When one continuous variable is dependent on another variable, which is under the control of the observer 2. When both continuous variables are independent • scatter plots are used when we need to show the relationship between two variables • referred to as correlation plots
  • 11.
    Some examples inwhich scatter plots are suitable are as follows: • Research studies have successfully established that the number of hours of sleep required by a person depends on the age of the person. • The average income for adults is based on the number of years of education