Statistics is the study of the collection, analysis, interpretation, presentation, and organization of data.
The word STATISTICS is seems to be derived from the Latin word ‘status’ or the Italian word ‘Statista’ or German word ‘Statistik’. All of them means the same thing i.e. a political state.
Facts expressed numerically are called statistics such as data related to income, height of a class, weight of a class, etc.
However mere facts or aggregate of facts cannot be called statistics.
For example 151, 182, 169, 158, 162, 148 etc. are not statistics.
But if I say the above digits are the height of students of a particular class then that’s statistics.
#2 Classification and tabulation of dataKawita Bhatt
The placement of data in different homogenous groups formed on the basis of some characteristics or criteria is called classification. The Table is a systematic arrangement of data in rows and/or column. Here, few basic concepts of classification and tabulation such as class interval, variable, frequency, frequency distribution and cumulative frequency distribution have been explained in a nutshell. This presentation also deals with the basic guidelines for preparing a table. Any suggestion and query are welcomed please drop them in the comments.
Taking of a measurement and the process of counting yield numbers that contain information. The objective of a person applying the tools of statistics to these numbers is to determine the nature of this information.
This task is made much easier if the numbers are organized and summarized.
Even quite small data sets are difficult to understand without some summarization. Statistical quantities such as the mean and variance can be extremely helpful in summarizing data but first we discuss tabular and graphical summaries.
There are several ways to present a statistical data like;
Frequency table
Simple bar diagrams
Multiple Bar Diagrams
Histogram
Frequency Polygon etc.
Steam and Leaf plots
Pie Charts
A frequency distribution is a tabular arrangement of data in which various items are arranged into classes or groups and the number of items falling in each class is stated.
The number of observations falling in a particular class is referred to as class frequency and is denoted by "f".
In frequency distribution all the values falling in a class are assumed to be equal to the midpoint of that class.
Data presented in the form of a frequency distribution is also called grouped data. A frequency distribution table contains a condensed summary of the original data.
There are two types of frequency distribution i) Simple Frequency distribution ) ii) Grouped Frequency distribution.
Statistics is the study of the collection, analysis, interpretation, presentation, and organization of data.
The word STATISTICS is seems to be derived from the Latin word ‘status’ or the Italian word ‘Statista’ or German word ‘Statistik’. All of them means the same thing i.e. a political state.
Facts expressed numerically are called statistics such as data related to income, height of a class, weight of a class, etc.
However mere facts or aggregate of facts cannot be called statistics.
For example 151, 182, 169, 158, 162, 148 etc. are not statistics.
But if I say the above digits are the height of students of a particular class then that’s statistics.
#2 Classification and tabulation of dataKawita Bhatt
The placement of data in different homogenous groups formed on the basis of some characteristics or criteria is called classification. The Table is a systematic arrangement of data in rows and/or column. Here, few basic concepts of classification and tabulation such as class interval, variable, frequency, frequency distribution and cumulative frequency distribution have been explained in a nutshell. This presentation also deals with the basic guidelines for preparing a table. Any suggestion and query are welcomed please drop them in the comments.
Taking of a measurement and the process of counting yield numbers that contain information. The objective of a person applying the tools of statistics to these numbers is to determine the nature of this information.
This task is made much easier if the numbers are organized and summarized.
Even quite small data sets are difficult to understand without some summarization. Statistical quantities such as the mean and variance can be extremely helpful in summarizing data but first we discuss tabular and graphical summaries.
There are several ways to present a statistical data like;
Frequency table
Simple bar diagrams
Multiple Bar Diagrams
Histogram
Frequency Polygon etc.
Steam and Leaf plots
Pie Charts
A frequency distribution is a tabular arrangement of data in which various items are arranged into classes or groups and the number of items falling in each class is stated.
The number of observations falling in a particular class is referred to as class frequency and is denoted by "f".
In frequency distribution all the values falling in a class are assumed to be equal to the midpoint of that class.
Data presented in the form of a frequency distribution is also called grouped data. A frequency distribution table contains a condensed summary of the original data.
There are two types of frequency distribution i) Simple Frequency distribution ) ii) Grouped Frequency distribution.
Levels of Measurement: Nominal = Data one collects when doing a wide-open descriptive or exploratory study, however, it is not limited to these kinds of studies. We can count this data, but we can’t order it. We need to be able to put this data into categories that are mutually exclusive, i.e. it can’t be in more than one category at a time. An example would be looking at age, race, sex, or some other type of data that you either are or aren’t. The categories need to be exhaustive – there need to be enough categories to cover the data you collect. Ordinal = this category has mutually exclusive categories, but with ordinal data you can order the data within each category. The ratings of poor, fair, good are an example of ordinal information. Note that you can order the ratings, but you can’t really tell how far apart each of these descriptors are from each other. You could also look at who finishes a task first, second, third, and so on. Again, you can rank this data, but you don’t know how much faster the first person was in relation to the second person or subsequent people. Interval-ratio data = this type of data allows you to measure the difference between each of your rankings. Data is ordered (as with ordinal data) and you can tell how much difference there is between each observation because there is a scale that is divided into equal units. You can measure a race with a stopwatch in terms of seconds or tenths of seconds. A thermometer gives you data with measurements in degrees. Ratio data is like interval data (and is often lumped together with it because they are usually handled the same way statistically). Its primary difference is that there is a zero point on the scale so that you can do multiplication and division. Money is an example of a ratio scale – two dollars are exactly twice one dollar. Volume, area, and distance measures are also ratio scales (2 times 1 liter equals 2 liters). This is different from a strict interval scale like a thermometer – we can’t say that 10 degrees Fahrenheit is twice as warm as 5 degrees Fahrenheit. Statistical Distributions: According to Shi, “a distribution organizes the values of a variable into categories. Frequency Distribution (aka Marginal Distribution): Displays the number of cases that falls into each category. Percentage Distribution: Found by dividing the number of frequency of cases in the category by the total N. Measures of Central Tendency: Mean: The most common measure of central tendency. It simply the sum of the numbers divided by the number of numbers. Median: It is defined as the middle position or midpoint of a distribution. Mode: Is defined as the most frequently occurring value. What is variability? Amount of spread or dispersion within a distribution of scores within a set of data. Measures of Variability: Range: The difference between the highest and lowest values in a distribution. Interquartile Range: Known as the ‘midspread’ or ‘middle fifty.” It contains the middle 50% of
Levels of Measurement: Nominal = Data one collects when doing a wide-open descriptive or exploratory study, however, it is not limited to these kinds of studies. We can count this data, but we can’t order it. We need to be able to put this data into categories that are mutually exclusive, i.e. it can’t be in more than one category at a time. An example would be looking at age, race, sex, or some other type of data that you either are or aren’t. The categories need to be exhaustive – there need to be enough categories to cover the data you collect. Ordinal = this category has mutually exclusive categories, but with ordinal data you can order the data within each category. The ratings of poor, fair, good are an example of ordinal information. Note that you can order the ratings, but you can’t really tell how far apart each of these descriptors are from each other. You could also look at who finishes a task first, second, third, and so on. Again, you can rank this data, but you don’t know how much faster the first person was in relation to the second person or subsequent people. Interval-ratio data = this type of data allows you to measure the difference between each of your rankings. Data is ordered (as with ordinal data) and you can tell how much difference there is between each observation because there is a scale that is divided into equal units. You can measure a race with a stopwatch in terms of seconds or tenths of seconds. A thermometer gives you data with measurements in degrees. Ratio data is like interval data (and is often lumped together with it because they are usually handled the same way statistically). Its primary difference is that there is a zero point on the scale so that you can do multiplication and division. Money is an example of a ratio scale – two dollars are exactly twice one dollar. Volume, area, and distance measures are also ratio scales (2 times 1 liter equals 2 liters). This is different from a strict interval scale like a thermometer – we can’t say that 10 degrees Fahrenheit is twice as warm as 5 degrees Fahrenheit. Statistical Distributions: According to Shi, “a distribution organizes the values of a variable into categories. Frequency Distribution (aka Marginal Distribution): Displays the number of cases that falls into each category. Percentage Distribution: Found by dividing the number of frequency of cases in the category by the total N. Measures of Central Tendency: Mean: The most common measure of central tendency. It simply the sum of the numbers divided by the number of numbers. Median: It is defined as the middle position or midpoint of a distribution. Mode: Is defined as the most frequently occurring value. What is variability? Amount of spread or dispersion within a distribution of scores within a set of data. Measures of Variability: Range: The difference between the highest and lowest values in a distribution. Interquartile Range: Known as the ‘midspread’ or ‘middle fifty.” It contains the middle 50% of
Don't get confused with Summary Statistics. Learn in-depth types of summary statistics from measures of central tendency, measures of dispersion and much more.
Let me know if anything is required. ping me at google #bobrupakroy
Classify data into Qualitative and Quantitative data.
Scales of Measurement in Statistics.
Nominal, Ordinal, Ratio and Interval
Prepare table or continuous frequency distribution.
A review of the growth of the Israel Genealogy Research Association Database Collection for the last 12 months. Our collection is now passed the 3 million mark and still growing. See which archives have contributed the most. See the different types of records we have, and which years have had records added. You can also see what we have for the future.
Levels of Measurement: Nominal = Data one collects when doing a wide-open descriptive or exploratory study, however, it is not limited to these kinds of studies. We can count this data, but we can’t order it. We need to be able to put this data into categories that are mutually exclusive, i.e. it can’t be in more than one category at a time. An example would be looking at age, race, sex, or some other type of data that you either are or aren’t. The categories need to be exhaustive – there need to be enough categories to cover the data you collect. Ordinal = this category has mutually exclusive categories, but with ordinal data you can order the data within each category. The ratings of poor, fair, good are an example of ordinal information. Note that you can order the ratings, but you can’t really tell how far apart each of these descriptors are from each other. You could also look at who finishes a task first, second, third, and so on. Again, you can rank this data, but you don’t know how much faster the first person was in relation to the second person or subsequent people. Interval-ratio data = this type of data allows you to measure the difference between each of your rankings. Data is ordered (as with ordinal data) and you can tell how much difference there is between each observation because there is a scale that is divided into equal units. You can measure a race with a stopwatch in terms of seconds or tenths of seconds. A thermometer gives you data with measurements in degrees. Ratio data is like interval data (and is often lumped together with it because they are usually handled the same way statistically). Its primary difference is that there is a zero point on the scale so that you can do multiplication and division. Money is an example of a ratio scale – two dollars are exactly twice one dollar. Volume, area, and distance measures are also ratio scales (2 times 1 liter equals 2 liters). This is different from a strict interval scale like a thermometer – we can’t say that 10 degrees Fahrenheit is twice as warm as 5 degrees Fahrenheit. Statistical Distributions: According to Shi, “a distribution organizes the values of a variable into categories. Frequency Distribution (aka Marginal Distribution): Displays the number of cases that falls into each category. Percentage Distribution: Found by dividing the number of frequency of cases in the category by the total N. Measures of Central Tendency: Mean: The most common measure of central tendency. It simply the sum of the numbers divided by the number of numbers. Median: It is defined as the middle position or midpoint of a distribution. Mode: Is defined as the most frequently occurring value. What is variability? Amount of spread or dispersion within a distribution of scores within a set of data. Measures of Variability: Range: The difference between the highest and lowest values in a distribution. Interquartile Range: Known as the ‘midspread’ or ‘middle fifty.” It contains the middle 50% of
Levels of Measurement: Nominal = Data one collects when doing a wide-open descriptive or exploratory study, however, it is not limited to these kinds of studies. We can count this data, but we can’t order it. We need to be able to put this data into categories that are mutually exclusive, i.e. it can’t be in more than one category at a time. An example would be looking at age, race, sex, or some other type of data that you either are or aren’t. The categories need to be exhaustive – there need to be enough categories to cover the data you collect. Ordinal = this category has mutually exclusive categories, but with ordinal data you can order the data within each category. The ratings of poor, fair, good are an example of ordinal information. Note that you can order the ratings, but you can’t really tell how far apart each of these descriptors are from each other. You could also look at who finishes a task first, second, third, and so on. Again, you can rank this data, but you don’t know how much faster the first person was in relation to the second person or subsequent people. Interval-ratio data = this type of data allows you to measure the difference between each of your rankings. Data is ordered (as with ordinal data) and you can tell how much difference there is between each observation because there is a scale that is divided into equal units. You can measure a race with a stopwatch in terms of seconds or tenths of seconds. A thermometer gives you data with measurements in degrees. Ratio data is like interval data (and is often lumped together with it because they are usually handled the same way statistically). Its primary difference is that there is a zero point on the scale so that you can do multiplication and division. Money is an example of a ratio scale – two dollars are exactly twice one dollar. Volume, area, and distance measures are also ratio scales (2 times 1 liter equals 2 liters). This is different from a strict interval scale like a thermometer – we can’t say that 10 degrees Fahrenheit is twice as warm as 5 degrees Fahrenheit. Statistical Distributions: According to Shi, “a distribution organizes the values of a variable into categories. Frequency Distribution (aka Marginal Distribution): Displays the number of cases that falls into each category. Percentage Distribution: Found by dividing the number of frequency of cases in the category by the total N. Measures of Central Tendency: Mean: The most common measure of central tendency. It simply the sum of the numbers divided by the number of numbers. Median: It is defined as the middle position or midpoint of a distribution. Mode: Is defined as the most frequently occurring value. What is variability? Amount of spread or dispersion within a distribution of scores within a set of data. Measures of Variability: Range: The difference between the highest and lowest values in a distribution. Interquartile Range: Known as the ‘midspread’ or ‘middle fifty.” It contains the middle 50% of
Don't get confused with Summary Statistics. Learn in-depth types of summary statistics from measures of central tendency, measures of dispersion and much more.
Let me know if anything is required. ping me at google #bobrupakroy
Classify data into Qualitative and Quantitative data.
Scales of Measurement in Statistics.
Nominal, Ordinal, Ratio and Interval
Prepare table or continuous frequency distribution.
A review of the growth of the Israel Genealogy Research Association Database Collection for the last 12 months. Our collection is now passed the 3 million mark and still growing. See which archives have contributed the most. See the different types of records we have, and which years have had records added. You can also see what we have for the future.
This slide is special for master students (MIBS & MIFB) in UUM. Also useful for readers who are interested in the topic of contemporary Islamic banking.
Synthetic Fiber Construction in lab .pptxPavel ( NSTU)
Synthetic fiber production is a fascinating and complex field that blends chemistry, engineering, and environmental science. By understanding these aspects, students can gain a comprehensive view of synthetic fiber production, its impact on society and the environment, and the potential for future innovations. Synthetic fibers play a crucial role in modern society, impacting various aspects of daily life, industry, and the environment. ynthetic fibers are integral to modern life, offering a range of benefits from cost-effectiveness and versatility to innovative applications and performance characteristics. While they pose environmental challenges, ongoing research and development aim to create more sustainable and eco-friendly alternatives. Understanding the importance of synthetic fibers helps in appreciating their role in the economy, industry, and daily life, while also emphasizing the need for sustainable practices and innovation.
Introduction to AI for Nonprofits with Tapp NetworkTechSoup
Dive into the world of AI! Experts Jon Hill and Tareq Monaur will guide you through AI's role in enhancing nonprofit websites and basic marketing strategies, making it easy to understand and apply.
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Dr. Vinod Kumar Kanvaria
Exploiting Artificial Intelligence for Empowering Researchers and Faculty,
International FDP on Fundamentals of Research in Social Sciences
at Integral University, Lucknow, 06.06.2024
By Dr. Vinod Kumar Kanvaria
Unit 8 - Information and Communication Technology (Paper I).pdfThiyagu K
This slides describes the basic concepts of ICT, basics of Email, Emerging Technology and Digital Initiatives in Education. This presentations aligns with the UGC Paper I syllabus.
2024.06.01 Introducing a competency framework for languag learning materials ...Sandy Millin
http://sandymillin.wordpress.com/iateflwebinar2024
Published classroom materials form the basis of syllabuses, drive teacher professional development, and have a potentially huge influence on learners, teachers and education systems. All teachers also create their own materials, whether a few sentences on a blackboard, a highly-structured fully-realised online course, or anything in between. Despite this, the knowledge and skills needed to create effective language learning materials are rarely part of teacher training, and are mostly learnt by trial and error.
Knowledge and skills frameworks, generally called competency frameworks, for ELT teachers, trainers and managers have existed for a few years now. However, until I created one for my MA dissertation, there wasn’t one drawing together what we need to know and do to be able to effectively produce language learning materials.
This webinar will introduce you to my framework, highlighting the key competencies I identified from my research. It will also show how anybody involved in language teaching (any language, not just English!), teacher training, managing schools or developing language learning materials can benefit from using the framework.
Read| The latest issue of The Challenger is here! We are thrilled to announce that our school paper has qualified for the NATIONAL SCHOOLS PRESS CONFERENCE (NSPC) 2024. Thank you for your unwavering support and trust. Dive into the stories that made us stand out!
Acetabularia Information For Class 9 .docxvaibhavrinwa19
Acetabularia acetabulum is a single-celled green alga that in its vegetative state is morphologically differentiated into a basal rhizoid and an axially elongated stalk, which bears whorls of branching hairs. The single diploid nucleus resides in the rhizoid.
Model Attribute Check Company Auto PropertyCeline George
In Odoo, the multi-company feature allows you to manage multiple companies within a single Odoo database instance. Each company can have its own configurations while still sharing common resources such as products, customers, and suppliers.
A Strategic Approach: GenAI in EducationPeter Windle
Artificial Intelligence (AI) technologies such as Generative AI, Image Generators and Large Language Models have had a dramatic impact on teaching, learning and assessment over the past 18 months. The most immediate threat AI posed was to Academic Integrity with Higher Education Institutes (HEIs) focusing their efforts on combating the use of GenAI in assessment. Guidelines were developed for staff and students, policies put in place too. Innovative educators have forged paths in the use of Generative AI for teaching, learning and assessments leading to pockets of transformation springing up across HEIs, often with little or no top-down guidance, support or direction.
This Gasta posits a strategic approach to integrating AI into HEIs to prepare staff, students and the curriculum for an evolving world and workplace. We will highlight the advantages of working with these technologies beyond the realm of teaching, learning and assessment by considering prompt engineering skills, industry impact, curriculum changes, and the need for staff upskilling. In contrast, not engaging strategically with Generative AI poses risks, including falling behind peers, missed opportunities and failing to ensure our graduates remain employable. The rapid evolution of AI technologies necessitates a proactive and strategic approach if we are to remain relevant.
2. In statistics, frequency refers to the number of times a particular observation or value
occurs in a dataset. For example, if we have a dataset of the ages of a group of people,
the frequency of a particular age would be the number of people in the group who
have that age.
Frequency can be expressed as an absolute frequency or a relative frequency.
Absolute frequency is simply the number of times a value occurs in a dataset, while
relative frequency is the proportion of times a value occurs relative to the total
number of observations in the dataset.
Frequency distribution is a tabular representation of the frequencies of different
values in a dataset. It shows the number of observations that fall into each category or
value range, along with their corresponding frequencies. Frequency distributions are
commonly used in descriptive statistics to summarize and display the distribution of
data in a meaningful way.
The concept of frequency is fundamental to many statistical methods, such as
hypothesis testing, probability theory, and sampling theory. It provides a way to
quantify the distribution of data and to draw conclusions about the underlying
population from a sample of observations.
2 April 2023 Sanat Kumar Purkait; Raidighi College 2
3. In statistics, frequency can be classified into the following types:
1. Absolute frequency: This refers to the number of times a particular value or observation
occurs in a dataset.
2. Relative frequency: This is the proportion of times a particular value or observation occurs in a
dataset relative to the total number of observations in the dataset.
3. Cumulative frequency: This is the sum of the frequencies of all the values up to and including a
particular value or observation in a dataset.
4. Cumulative relative frequency: This is the sum of the relative frequencies of all the values up to
and including a particular value or observation in a dataset.
5. Grouped frequency: This is the frequency of a range or interval of values in a dataset. Grouped
frequency is used when dealing with continuous data that is grouped into intervals or
categories.
6. Conditional frequency: This is the frequency of a particular value or observation in a dataset,
given that another variable or condition is true.
These different types of frequency can be used to analyze and summarize different aspects of a
dataset, depending on the research question and the type of data being analyzed.
2 April 2023 SANAT K PURKAIT; Raidighi College 3
4. Here are examples of each classification of frequency:
1. Absolute frequency: Consider the following dataset of test scores: 80, 75, 90, 80, 85.
The absolute frequency of the score 80 is 2, because it appears twice in the dataset.
2. Relative frequency: Using the same dataset as above, the relative frequency of the
score 80 is 2/5 or 0.4, because it appears twice out of a total of five observations.
3. Cumulative frequency: Continuing with the same dataset, the cumulative frequency of
the score 80 is 2, because there are two scores of 80 or lower in the dataset (i.e., 75
and 80).
4. Cumulative relative frequency: With the same dataset, the cumulative relative
frequency of the score 80 is 0.6, because the sum of the relative frequencies of the
scores 75, 80, and 85 is 0.6 (i.e., 0.2+0.4+0.0).
5. Grouped frequency: Suppose we have a dataset of heights (in inches) of 50 students.
Rather than analyzing each individual height, we group them into intervals of 5
inches: 60-64, 65-69, 70-74, etc. The grouped frequency would then show the number
of students that fall within each interval.
6. Conditional frequency: Consider a dataset of students' grades in two subjects, Math
and Science. We can calculate the frequency of students who received an A in Math,
given that they also received an A in Science. This would be an example of conditional
frequency.
2 April 2023 SANAT K PURKAIT; Raidighi College 4
5. 2 April 2023 SANAT K PURKAIT; Raidighi College 5
1. Class interval, or class.
2. Class frequency and total frequency.
3. Class limits- lower class limit and upper class limit.
4. Class boundaries- lower class boundary and upper class boundary.
5. Class mark or mid value, or mid point of class interval;
6. Width or size of class interval;
7. Frequency density.
6. 2 April 2023 SANAT K PURKAIT; Raidighi College 6
Age ( years ) Frequency ( No of Persons)
15-19 37
20-24 81
25-29 43
30-34 24
35-39 9
39-44 6
Total 200
GROUP FREQUENCY DISTRIBUTION
Now we will explain those
terms with reference to the above frequency table
7. 2 April 2023 SANAT K PURKAIT; Raidighi College 7
Class
interval
Frequency
(f)
Class
boundary
Mid value
(x)
Class width
(w)
Frequency
density
(f/w)
cumulative frequency
Less than More than
15-19 37 14.5-19.5 17 5 7.4 37 200
20-24 81 19.5-24.5 22 5 16.2 118 163
25-29 43 24.5-29.5 27 5 8.6 161 82
30-34 24 29.5-34.5 32 5 4.8 185 39
35-44 9 34.5-44.5 39.5 10 0.9 194 15
45-59 6 44.5-59.5 52 15 0.4 200 6
TOTAL 200
FREQUENCY TABLE
8. 2 April 2023 SANAT K PURKAIT; Raidighi College 8
When a large number of observations varying in a wide range are
available, these are usually classified in several groups according to the
size of values. Each of these groups , defined by an interval, is called class
interval, or simply class. In the previous table , column no 1 the class
interval of ages ( years) are 15-19, 20-24 etc. There are six classes in the
frequency distribution, the last class being 45-49.
When one end of a classes is not specified, the class is called an open
end classes. A frequency distribution may either one or open end classes
arises when there are relatively few observations which are apart from the
rest. In such a case, it is not considered worth-while to show several
classes with zero frequencies ( called empty class ) before reaching a class
with a very small frequency.
9. 2 April 2023 SANAT K PURKAIT; Raidighi College 9
The number of observations following within a class is
called its class frequency or, simply frequency. The sum of all the
class frequencies is called total frequency. In the given table, the
total frequency is 200. total frequency shows the total number of
observations considered in the frequency distribution.
10. 2 April 2023 SANAT K PURKAIT; Raidighi College 10
All recorded data or observations are discrete in character. For a discrete
variable, the values themselves are isolated, e.g. Records regarding the no of
workers employed in a factory will such data such as 200, 226, 512, 66 etc. ( no
fractional numbers are possible ). .Although a continuous variable can theoretically
take value ,all observations are rounded off a certain unit for convenience. For
example, if it is decided to record the height of persons to the nearest inch, we get
observations like 58, 67, 63 etc. If the decision is to record them to the nearest
tenth of an inch we may get 37.8, 67.5, 63 etc. In any case, all observations will be
rounded, either all whole numbers, or all showing upto one place of decimal, and
so on.
11. 2 April 2023 SANAT K PURKAIT; Raidighi College 11
When a grouped frequency distribution is constructed from these collected
data, the value of the variable are shown in several classes. For determining the
class frequencies it is necessary that these classes are mutually exclusive, i.e. Be
such that any observation is contained in only one class; for example 54-56, 57-59,
60-62 etc. ( not like 54-56, 56-58, 58-60, etc.) or 54.1-56.0, 56.1-58.0, 58.1-60.0, etc (
54.0-56.0, 56.0-58.0, 58.0-60.0 etc.). Note that classes will be written to the same
degree of precision as the original data.
In the construction of a grouped frequency distribution, the class intervals
must therefore be defined by pairs of numbers such that the upper end of one
class does not coincide with the lower end of the immediately- following class.
The two numbers used to specify the limits of a class interval for the purpose of
tallying the original observations into the various classes, are called ‘ class limits’.
The smaller of the pair is known as lower class limit and the larger as upper class
limit with reference to the particular class.
12. 2 April 2023 SANAT K PURKAIT; Raidighi College 12
When measurements are taken on a continuous variable, all data are
recorded nearest to a certain unit. Thus, if ages are recorded to the nearest
whole number of years, any age between 14.5 years and 15.5 years is recorded
as 15 years. Similarly, ’19 years’ denotes an age between 18.5 years and 19.5
years. Hence the class interval 15-19 actually includes all ages between 14.5 and
19.5 years. These most extreme values which would ever be included in a class
interval are called ‘class boundaries’. Class boundaries , are in fact, the real
limits of a class interval. The lower extreme point is called lower class
boundary, and the upper extreme point is called upper class boundary with
reference to any particular class.
13. 2 April 2023 SANAT K PURKAIT; Raidighi College 13
Class boundaries may be calculated from class limits by applying the following rule.
•Lower Class Boundary = Lower Class Limit-1/2d
•Upper Class Boundary = Upper Class Limit+ 1/2d
Where d is the common difference between the upper
class limit of any class interval and lower class limit of the
next class interval. In fact, if observations are recorded to the
nearest unit, d=1; if to the nearest tenth of a unit , d=0.1, etc.
14. 2 April 2023 SANAT K PURKAIT; Raidighi College 14
Note that the upper boundary of any class coincides with the lower
boundary of the next class; but the upper limit of any class is different from
the lower limit of the next class. This fact may be utilized in deciding
whether the pairs of numbers defining the class intervals in a given
frequency distribution are really class limits or class boundaries. Class
limits are used only for the construction of a grouped frequency
distribution, but in all statistical calculations and diagrams involving end
points of a classes ( e.g. Median, mode, quartiles and histogram, ogive
etc.), class boundaries are used.
15. 2 April 2023 SANAT K PURKAIT; Raidighi College 15
The value exactly at the middle of a class interval is called class mark or, mid
value. If lies half- way between the class limits or between the class
boundaries.
Class mark=( lower class limit + upper class limit )/2
Or,
Class mark =( lower class boundary + upper class boundary)/2
Class mark is used as a representative value of the class interval, for the
calculation of mean, standard deviation, mean deviation, etc.
16. 2 April 2023 SANAT K PURKAIT; Raidighi College 16
Width of a class is the difference between the lower and upper
class boundaries ( not class limits ).
Width of class = (upper class boundary – lower class boundary )
17. 2 April 2023 SANAT K PURKAIT; Raidighi College 17
0
1
2
3
4
5
6
7
x axis
Series 1
Series 2
Series 3
Series 4
Series 5
Series 6
Series 7
18. 2 April 2023 SANAT K PURKAIT; Raidighi College 18