Lecture on Introduction to Descriptive Statistics - Part 1 and Part 2. These slides were presented during a lecture at the Colombo Institute of Research and Psychology.
What is a Box Plot ?
In descriptive statistics, a boxplot is a method for graphically depicting groups of numerical data through their quartiles. Box plots may also have lines extending from the boxes (whiskers) indicating variability outside the upper and lower quartiles, hence the terms box-and-whisker plot and box-and-whisker diagram
Box plots are non-parametric: they display variation in samples of a statistical population without making any assumptions of the underlying statistical distribution
The spacings between the different parts of the box indicate the degree of dispersion (spread) and skewness in the data, and show outliers.
Box Plot Requirement
Minimum : the lowest data point excluding any outliers.
Maximum : the largest data point excluding any outliers.
Median (Q2 / 50th percentile) : the middle value of the dataset.
First quartile (Q1 / 25th percentile) : also known as the lower quartile qn(0.25), is the median of the lower half of the dataset.
Third quartile (Q3 / 75th percentile) : also known as the upper quartile qn(0.75), is the median of the upper half of the dataset.
Interquartile range (IQR) : is the distance between the upper and lower quartiles
IQR =Q3-Q1= qn(0.75) –qn(0.25)
A boxplot is constructed of two parts, a box and a set of whiskers . The lowest point is the minimum of the data set and the highest point is the maximum of the data set. The box is drawn from Q1 to Q3 with a horizontal line drawn in the middle to denote the median.
Why Box Plot is useful ?
Box plots divide the data into sections that each contain approximately 25% of the data in that set.
When the median is in the middle of the box, and the whiskers are about the same on both sides of the box, then the distribution is symmetric
When the median is closer to the bottom of the box, and if the whisker is shorter on the lower end of the box, then the distribution is positively skewed (skewed right)
When the median is closer to the top of the box, and if the whisker is shorter on the upper end of the box, then the distribution is negatively skewed (skewed left)
How to compare box plot
How to draw box plot on Excel
Step 2: Calculate quartile differences
Next, calculate the differences between each phase. In effect, you have to calculate the differentials between the following:
First quartile and minimum value
Median and first quartile
Third quartile and median
Maximum value and third quartile
Step 3: Create a stacked column chart
The data in the third table is well suited for a box plot, and we'll start by creating a stacked column chart which we'll then modify.
Select all the data from the third table, and click Insert > Insert Column Chart > Stacked Column.
To reverse the chart axes, right-click on the chart, and click Select Data.
Click Switch Row/Column.
Click OK.
The next step is to replace the topmost and second-from-bottom (the deep blue and orange areas in the image) data series with lines, or whiskers.
Select the to
Contains different types of Data Visualizations, best practices to follow for each case and what type of visualization should be made for different kinds of datasets.
Lecture on Introduction to Descriptive Statistics - Part 1 and Part 2. These slides were presented during a lecture at the Colombo Institute of Research and Psychology.
What is a Box Plot ?
In descriptive statistics, a boxplot is a method for graphically depicting groups of numerical data through their quartiles. Box plots may also have lines extending from the boxes (whiskers) indicating variability outside the upper and lower quartiles, hence the terms box-and-whisker plot and box-and-whisker diagram
Box plots are non-parametric: they display variation in samples of a statistical population without making any assumptions of the underlying statistical distribution
The spacings between the different parts of the box indicate the degree of dispersion (spread) and skewness in the data, and show outliers.
Box Plot Requirement
Minimum : the lowest data point excluding any outliers.
Maximum : the largest data point excluding any outliers.
Median (Q2 / 50th percentile) : the middle value of the dataset.
First quartile (Q1 / 25th percentile) : also known as the lower quartile qn(0.25), is the median of the lower half of the dataset.
Third quartile (Q3 / 75th percentile) : also known as the upper quartile qn(0.75), is the median of the upper half of the dataset.
Interquartile range (IQR) : is the distance between the upper and lower quartiles
IQR =Q3-Q1= qn(0.75) –qn(0.25)
A boxplot is constructed of two parts, a box and a set of whiskers . The lowest point is the minimum of the data set and the highest point is the maximum of the data set. The box is drawn from Q1 to Q3 with a horizontal line drawn in the middle to denote the median.
Why Box Plot is useful ?
Box plots divide the data into sections that each contain approximately 25% of the data in that set.
When the median is in the middle of the box, and the whiskers are about the same on both sides of the box, then the distribution is symmetric
When the median is closer to the bottom of the box, and if the whisker is shorter on the lower end of the box, then the distribution is positively skewed (skewed right)
When the median is closer to the top of the box, and if the whisker is shorter on the upper end of the box, then the distribution is negatively skewed (skewed left)
How to compare box plot
How to draw box plot on Excel
Step 2: Calculate quartile differences
Next, calculate the differences between each phase. In effect, you have to calculate the differentials between the following:
First quartile and minimum value
Median and first quartile
Third quartile and median
Maximum value and third quartile
Step 3: Create a stacked column chart
The data in the third table is well suited for a box plot, and we'll start by creating a stacked column chart which we'll then modify.
Select all the data from the third table, and click Insert > Insert Column Chart > Stacked Column.
To reverse the chart axes, right-click on the chart, and click Select Data.
Click Switch Row/Column.
Click OK.
The next step is to replace the topmost and second-from-bottom (the deep blue and orange areas in the image) data series with lines, or whiskers.
Select the to
Contains different types of Data Visualizations, best practices to follow for each case and what type of visualization should be made for different kinds of datasets.
In this ppt the viewer will able to know about Graphs. Graph is defined as to create a diagram that shows a relationship between two or more things. A diagram showing the relationship of quantities, especially such a diagram in which lines, bars, or proportional areas represent how one quantity depends on or changes with another. Histogram is one type of graphical presentation of data obtained from any source. This is easy method to represent the data and quick understanding way. Histogram should be designed in various other way to reveal more complicated data in single sheet. These histogram having great importance in industrial and educational point of view. Different statistical software playing major role to show the results & reports in histograms in different organizations
Portion explained:
1. Introduction to Graphs
2. Types of Graphs
3. Histogram
4. Types of Histogram
5. Uniform Histogram
6. Bimodal Histogram
7. Symmetric Histogram
8. Probability Histogram
9. Histogram Example
In this ppt the viewer will able to know about Graphs. Graph is defined as to create a diagram that shows a relationship between two or more things. A diagram showing the relationship of quantities, especially such a diagram in which lines, bars, or proportional areas represent how one quantity depends on or changes with another. Histogram is one type of graphical presentation of data obtained from any source. This is easy method to represent the data and quick understanding way. Histogram should be designed in various other way to reveal more complicated data in single sheet. These histogram having great importance in industrial and educational point of view. Different statistical software playing major role to show the results & reports in histograms in different organizations
Portion explained:
1. Introduction to Graphs
2. Types of Graphs
3. Histogram
4. Types of Histogram
5. Uniform Histogram
6. Bimodal Histogram
7. Symmetric Histogram
8. Probability Histogram
9. Histogram Example
Instructions for Submissions thorugh G- Classroom.pptxJheel Barad
This presentation provides a briefing on how to upload submissions and documents in Google Classroom. It was prepared as part of an orientation for new Sainik School in-service teacher trainees. As a training officer, my goal is to ensure that you are comfortable and proficient with this essential tool for managing assignments and fostering student engagement.
Power-sharing Class 10 is a vital aspect of democratic governance. It refers to the distribution of power among different organs of government, levels of government, and social groups. This ensures that no single entity can control all aspects of governance, promoting stability and unity in a diverse society.
For more information, visit-www.vavaclasses.com
This is a presentation by Dada Robert in a Your Skill Boost masterclass organised by the Excellence Foundation for South Sudan (EFSS) on Saturday, the 25th and Sunday, the 26th of May 2024.
He discussed the concept of quality improvement, emphasizing its applicability to various aspects of life, including personal, project, and program improvements. He defined quality as doing the right thing at the right time in the right way to achieve the best possible results and discussed the concept of the "gap" between what we know and what we do, and how this gap represents the areas we need to improve. He explained the scientific approach to quality improvement, which involves systematic performance analysis, testing and learning, and implementing change ideas. He also highlighted the importance of client focus and a team approach to quality improvement.
Palestine last event orientationfvgnh .pptxRaedMohamed3
An EFL lesson about the current events in Palestine. It is intended to be for intermediate students who wish to increase their listening skills through a short lesson in power point.
We all have good and bad thoughts from time to time and situation to situation. We are bombarded daily with spiraling thoughts(both negative and positive) creating all-consuming feel , making us difficult to manage with associated suffering. Good thoughts are like our Mob Signal (Positive thought) amidst noise(negative thought) in the atmosphere. Negative thoughts like noise outweigh positive thoughts. These thoughts often create unwanted confusion, trouble, stress and frustration in our mind as well as chaos in our physical world. Negative thoughts are also known as “distorted thinking”.
2. Box plots (Box-and-Whisker Plots):
• A box plot, also known as a box-and-whisker plot, is a graphical
representation of the distribution of a dataset based on five summary
statistics: minimum, first quartile (Q1), median (Q2), third quartile (Q3),
and maximum.
• It provides a visual summary of the central tendency, spread, and
skewness of the data, as well as identifying outliers.
• The boxplot() function in Matplotlib is used to create boxplot.
3. Box Plot (Box-and-Whisker Plots):
• Minimum and Maximum: The smallest and largest exam scores in the
dataset.
• Quartiles (Q1, Q2, Q3): These divide the dataset into four equal parts.
Q1 represents the 25th percentile, Q2 is the median (50th percentile), and
Q3 is the 75th percentile.
• Interquartile Range (IQR): The range between the first and third
quartiles (Q3 - Q1).
• Whiskers: Lines extending from the box to the minimum and maximum
values within 1.5 times the IQR from the first and third quartiles.
• Outliers: Data points outside the whiskers, indicating potential anomalies
or extreme values.
4.
5. Interpreting the Box Plot:
• If the Median is at the center of the Box and the whiskers are almost the
same on both the ends, then the data is Normally Distributed.
• If the Median lies closer to the First Quartile and if the whisker at the lower
end is shorter then it has a Positive Skew (Right Skew).
• If the Median lies closer to the Third Quartile and if the whisker at the
upper end is shorter than it has a Negative Skew (Left Skew).
• If there are values that fall above or below the end of the whiskers, they are
plotted as dots. These points are often called outliers.
• In the example of exam scores, the box plot helps visualize the
performance of students and understand the spread of scores across the
class.
7. Scatter plots
• Scatter plots visualize the relationship between two numerical variables
by plotting each observation as a point on a two-dimensional graph. They
help in identifying patterns, trends, and correlations between variables.
• The scatter() function in Matplotlib is used to create scatter plots.
Check description box for Introduction to Machine Learning and Machine learning algorithms video link.
8.
9. Interpreting the Scatter plots
• If the data points tend to move upwards from left to right, it indicates a
positive relationship. Conversely, if the data points tend to move
downwards from left to right, it indicates a negative relationship.
• If the data points form a roughly straight line, it suggests a linear
relationship. If the data points follow a curved pattern or do not conform
to a straight line, it suggests a nonlinear relationship.
• We can Calculate the correlation coefficient (such as Pearson's correlation
coefficient) to quantify the strength and direction of the relationship
between the variables. A correlation coefficient close to +1 indicates a
strong positive relationship, close to -1 indicates a strong negative
relationship, and close to 0 indicates no linear relationship.
10. Bar Plots
• Bar plots represent the frequency or count of categorical variables by
displaying bars of varying heights. They are effective for comparing the
distribution of categories or groups.
• In a bar chart, the x-axis represents the categories or groups, and the y-
axis represents the frequency or count of observations in each category.
The bars are separated from each other to represent distinct categories.
• The bar() function in Matplotlib is used to create bar plot.
• Bar charts are versatile and can be used to compare categorical data
across different groups or time periods, such as sales by product category,
votes by political party, or average temperatures by month.
11.
12. Line plots
• Line plots show the trend or pattern of a numerical variable over time or
another continuous variable. They are commonly used for time series data
analysis.
• Line charts are used to visualize changes in one continuous numerical
variable over time or another ordered categorical variable.
• Example: Stock prices over time: We can plot stock prices (continuous
numerical variable) against time (ordered categorical variable) to see how
they change over different time periods.
Please check the description box for the link to Machine Learning videos.