This document discusses measures of central tendency, including the mean, median, and mode. It defines each measure and describes their characteristics and how to calculate them. The mean is the average value and is affected by outliers. The median is the middle value and is not affected by outliers. The mode is the most frequently occurring value. The document provides examples of calculating each measure from data sets and discusses their advantages and disadvantages.
The standard deviation is a measure of the spread of scores within a set of data. Usually, we are interested in the standard deviation of a population.
students will be able to understand various measures of central tendency and also will be able to calculate mean median and mode for individual discrete and continuous series.
01 parametric and non parametric statisticsVasant Kothari
Definition of Parametric and Non-parametric Statistics
Assumptions of Parametric and Non-parametric Statistics
Assumptions of Parametric Statistics
Assumptions of Non-parametric Statistics
Advantages of Non-parametric Statistics
Disadvantages of Non-parametric Statistical Tests
Parametric Statistical Tests for Different Samples
Parametric Statistical Measures for Calculating the Difference Between Means
Significance of Difference Between the Means of Two Independent Large and
Small Samples
Significance of the Difference Between the Means of Two Dependent Samples
Significance of the Difference Between the Means of Three or More Samples
Parametric Statistics Measures Related to Pearson’s ‘r’
Non-parametric Tests Used for Inference
The standard deviation is a measure of the spread of scores within a set of data. Usually, we are interested in the standard deviation of a population.
students will be able to understand various measures of central tendency and also will be able to calculate mean median and mode for individual discrete and continuous series.
01 parametric and non parametric statisticsVasant Kothari
Definition of Parametric and Non-parametric Statistics
Assumptions of Parametric and Non-parametric Statistics
Assumptions of Parametric Statistics
Assumptions of Non-parametric Statistics
Advantages of Non-parametric Statistics
Disadvantages of Non-parametric Statistical Tests
Parametric Statistical Tests for Different Samples
Parametric Statistical Measures for Calculating the Difference Between Means
Significance of Difference Between the Means of Two Independent Large and
Small Samples
Significance of the Difference Between the Means of Two Dependent Samples
Significance of the Difference Between the Means of Three or More Samples
Parametric Statistics Measures Related to Pearson’s ‘r’
Non-parametric Tests Used for Inference
Topic: Frequency Distribution
Student Name: Abdul Hafeez
Class: B.Ed. (Hons) Elementary
Project Name: “Young Teachers' Professional Development (TPD)"
"Project Founder: Prof. Dr. Amjad Ali Arain
Faculty of Education, University of Sindh, Pakistan
Measure of dispersion has two types Absolute measure and Graphical measure. There are other different types in there.
In this slide the discussed points are:
1. Dispersion & it's types
2. Definition
3. Use
4. Merits
5. Demerits
6. Formula & math
7. Graph and pictures
8. Real life application.
A brief description of F Test and ANOVA for Msc Life Science students. I have taken the example slides from youtube where an excellent explanation is available.
Here is the link : https://www.youtube.com/watch?v=-yQb_ZJnFXw
Characteristics of a Good Sample
Representativeness
Absence of sampling error
Economically viable
Generalized and applicable
Goal oriented
Proportional
Randomly Selected
Actual information provider
Practical
UNIVARIATE & BIVARIATE ANALYSIS
UNIVARIATE BIVARIATE & MULTIVARIATE
UNIVARIATE ANALYSIS
-One variable analysed at a time
BIVARIATE ANALYSIS
-Two variable analysed at a time
MULTIVARIATE ANALYSIS
-More than two variables analysed at a time
TYPES OF ANALYSIS
DESCRIPTIVE ANALYSIS
INFERENTIAL ANALYSIS
DESCRIPTIVE ANALYSIS
Transformation of raw data
Facilitate easy understanding and interpretation
Deals with summary measures relating to sample data
Eg-what is the average age of the sample?
INFERENTIAL ANALYSIS
Carried out after descriptive analysis
Inferences drawn on population parameters based on sample results
Generalizes results to the population based on sample results
Eg-is the average age of population different from 35?
DESCRIPTIVE ANALYSIS OF UNIVARIATE DATA
1. Prepare frequency distribution of each variable
Missing Data
Situation where certain questions are left unanswered
Analysis of multiple responses
Measures of central tendency
3 measures of central tendency
1.Mean
2.Median
3.Mode
MEAN
Arithmetic average of a variable
Appropriate for interval and ratio scale data
x
MEDIAN
Calculates the middle value of the data
Computed for ratio, interval or ordinal scale.
Data needs to be arranged in ascending or descending order
MODE
Point of maximum frequency
Should not be computed for ordinal or interval data unless grouped.
Widely used in business
MEASURE OF DISPERSION
Measures of central tendency do not explain distribution of variables
4 measures of dispersion
1.Range
2.Variance and standard deviation
3.Coefficient of variation
4.Relative and absolute frequencies
DESCRIPTIVE ANALYSIS OF BIVARIATE DATA
There are three types of measure used.
1.Cross tabulation
2.Spearmans rank correlation coefficient
3.Pearsons linear correlation coefficient
Cross Tabulation
Responses of two questions are combined
Spearman’s rank order correlation coefficient.
Used in case of ordinal data
Measures of Central tendency-bio-statistics
Biostatistics and research methodology
Mean
Median
Mode
Mean- Arithmetic mean
weighted mean
harmonic mean
geometric mean
individual series
discrete series
continuous series
Relation between mean, median and mode
Statistical average
mathematical average
positional average
Merits and demerits of mean, median and mode
statistics
Bachelor of Pharmacy
8th Semester
Biostatistics
Topic: Frequency Distribution
Student Name: Abdul Hafeez
Class: B.Ed. (Hons) Elementary
Project Name: “Young Teachers' Professional Development (TPD)"
"Project Founder: Prof. Dr. Amjad Ali Arain
Faculty of Education, University of Sindh, Pakistan
Measure of dispersion has two types Absolute measure and Graphical measure. There are other different types in there.
In this slide the discussed points are:
1. Dispersion & it's types
2. Definition
3. Use
4. Merits
5. Demerits
6. Formula & math
7. Graph and pictures
8. Real life application.
A brief description of F Test and ANOVA for Msc Life Science students. I have taken the example slides from youtube where an excellent explanation is available.
Here is the link : https://www.youtube.com/watch?v=-yQb_ZJnFXw
Characteristics of a Good Sample
Representativeness
Absence of sampling error
Economically viable
Generalized and applicable
Goal oriented
Proportional
Randomly Selected
Actual information provider
Practical
UNIVARIATE & BIVARIATE ANALYSIS
UNIVARIATE BIVARIATE & MULTIVARIATE
UNIVARIATE ANALYSIS
-One variable analysed at a time
BIVARIATE ANALYSIS
-Two variable analysed at a time
MULTIVARIATE ANALYSIS
-More than two variables analysed at a time
TYPES OF ANALYSIS
DESCRIPTIVE ANALYSIS
INFERENTIAL ANALYSIS
DESCRIPTIVE ANALYSIS
Transformation of raw data
Facilitate easy understanding and interpretation
Deals with summary measures relating to sample data
Eg-what is the average age of the sample?
INFERENTIAL ANALYSIS
Carried out after descriptive analysis
Inferences drawn on population parameters based on sample results
Generalizes results to the population based on sample results
Eg-is the average age of population different from 35?
DESCRIPTIVE ANALYSIS OF UNIVARIATE DATA
1. Prepare frequency distribution of each variable
Missing Data
Situation where certain questions are left unanswered
Analysis of multiple responses
Measures of central tendency
3 measures of central tendency
1.Mean
2.Median
3.Mode
MEAN
Arithmetic average of a variable
Appropriate for interval and ratio scale data
x
MEDIAN
Calculates the middle value of the data
Computed for ratio, interval or ordinal scale.
Data needs to be arranged in ascending or descending order
MODE
Point of maximum frequency
Should not be computed for ordinal or interval data unless grouped.
Widely used in business
MEASURE OF DISPERSION
Measures of central tendency do not explain distribution of variables
4 measures of dispersion
1.Range
2.Variance and standard deviation
3.Coefficient of variation
4.Relative and absolute frequencies
DESCRIPTIVE ANALYSIS OF BIVARIATE DATA
There are three types of measure used.
1.Cross tabulation
2.Spearmans rank correlation coefficient
3.Pearsons linear correlation coefficient
Cross Tabulation
Responses of two questions are combined
Spearman’s rank order correlation coefficient.
Used in case of ordinal data
Measures of Central tendency-bio-statistics
Biostatistics and research methodology
Mean
Median
Mode
Mean- Arithmetic mean
weighted mean
harmonic mean
geometric mean
individual series
discrete series
continuous series
Relation between mean, median and mode
Statistical average
mathematical average
positional average
Merits and demerits of mean, median and mode
statistics
Bachelor of Pharmacy
8th Semester
Biostatistics
in biostatistics, a measure of central tendency is a single value that describes a set of data by of typical value. it is also called as average. Arithmetic mean” or “mean” is the term used for average. The arithmetic mean or simply mean is the sum of the separate scores or measures divided by their number.
The Roman Empire A Historical Colossus.pdfkaushalkr1407
The Roman Empire, a vast and enduring power, stands as one of history's most remarkable civilizations, leaving an indelible imprint on the world. It emerged from the Roman Republic, transitioning into an imperial powerhouse under the leadership of Augustus Caesar in 27 BCE. This transformation marked the beginning of an era defined by unprecedented territorial expansion, architectural marvels, and profound cultural influence.
The empire's roots lie in the city of Rome, founded, according to legend, by Romulus in 753 BCE. Over centuries, Rome evolved from a small settlement to a formidable republic, characterized by a complex political system with elected officials and checks on power. However, internal strife, class conflicts, and military ambitions paved the way for the end of the Republic. Julius Caesar’s dictatorship and subsequent assassination in 44 BCE created a power vacuum, leading to a civil war. Octavian, later Augustus, emerged victorious, heralding the Roman Empire’s birth.
Under Augustus, the empire experienced the Pax Romana, a 200-year period of relative peace and stability. Augustus reformed the military, established efficient administrative systems, and initiated grand construction projects. The empire's borders expanded, encompassing territories from Britain to Egypt and from Spain to the Euphrates. Roman legions, renowned for their discipline and engineering prowess, secured and maintained these vast territories, building roads, fortifications, and cities that facilitated control and integration.
The Roman Empire’s society was hierarchical, with a rigid class system. At the top were the patricians, wealthy elites who held significant political power. Below them were the plebeians, free citizens with limited political influence, and the vast numbers of slaves who formed the backbone of the economy. The family unit was central, governed by the paterfamilias, the male head who held absolute authority.
Culturally, the Romans were eclectic, absorbing and adapting elements from the civilizations they encountered, particularly the Greeks. Roman art, literature, and philosophy reflected this synthesis, creating a rich cultural tapestry. Latin, the Roman language, became the lingua franca of the Western world, influencing numerous modern languages.
Roman architecture and engineering achievements were monumental. They perfected the arch, vault, and dome, constructing enduring structures like the Colosseum, Pantheon, and aqueducts. These engineering marvels not only showcased Roman ingenuity but also served practical purposes, from public entertainment to water supply.
Acetabularia Information For Class 9 .docxvaibhavrinwa19
Acetabularia acetabulum is a single-celled green alga that in its vegetative state is morphologically differentiated into a basal rhizoid and an axially elongated stalk, which bears whorls of branching hairs. The single diploid nucleus resides in the rhizoid.
Operation “Blue Star” is the only event in the history of Independent India where the state went into war with its own people. Even after about 40 years it is not clear if it was culmination of states anger over people of the region, a political game of power or start of dictatorial chapter in the democratic setup.
The people of Punjab felt alienated from main stream due to denial of their just demands during a long democratic struggle since independence. As it happen all over the word, it led to militant struggle with great loss of lives of military, police and civilian personnel. Killing of Indira Gandhi and massacre of innocent Sikhs in Delhi and other India cities was also associated with this movement.
Honest Reviews of Tim Han LMA Course Program.pptxtimhan337
Personal development courses are widely available today, with each one promising life-changing outcomes. Tim Han’s Life Mastery Achievers (LMA) Course has drawn a lot of interest. In addition to offering my frank assessment of Success Insider’s LMA Course, this piece examines the course’s effects via a variety of Tim Han LMA course reviews and Success Insider comments.
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...Levi Shapiro
Letter from the Congress of the United States regarding Anti-Semitism sent June 3rd to MIT President Sally Kornbluth, MIT Corp Chair, Mark Gorenberg
Dear Dr. Kornbluth and Mr. Gorenberg,
The US House of Representatives is deeply concerned by ongoing and pervasive acts of antisemitic
harassment and intimidation at the Massachusetts Institute of Technology (MIT). Failing to act decisively to ensure a safe learning environment for all students would be a grave dereliction of your responsibilities as President of MIT and Chair of the MIT Corporation.
This Congress will not stand idly by and allow an environment hostile to Jewish students to persist. The House believes that your institution is in violation of Title VI of the Civil Rights Act, and the inability or
unwillingness to rectify this violation through action requires accountability.
Postsecondary education is a unique opportunity for students to learn and have their ideas and beliefs challenged. However, universities receiving hundreds of millions of federal funds annually have denied
students that opportunity and have been hijacked to become venues for the promotion of terrorism, antisemitic harassment and intimidation, unlawful encampments, and in some cases, assaults and riots.
The House of Representatives will not countenance the use of federal funds to indoctrinate students into hateful, antisemitic, anti-American supporters of terrorism. Investigations into campus antisemitism by the Committee on Education and the Workforce and the Committee on Ways and Means have been expanded into a Congress-wide probe across all relevant jurisdictions to address this national crisis. The undersigned Committees will conduct oversight into the use of federal funds at MIT and its learning environment under authorities granted to each Committee.
• The Committee on Education and the Workforce has been investigating your institution since December 7, 2023. The Committee has broad jurisdiction over postsecondary education, including its compliance with Title VI of the Civil Rights Act, campus safety concerns over disruptions to the learning environment, and the awarding of federal student aid under the Higher Education Act.
• The Committee on Oversight and Accountability is investigating the sources of funding and other support flowing to groups espousing pro-Hamas propaganda and engaged in antisemitic harassment and intimidation of students. The Committee on Oversight and Accountability is the principal oversight committee of the US House of Representatives and has broad authority to investigate “any matter” at “any time” under House Rule X.
• The Committee on Ways and Means has been investigating several universities since November 15, 2023, when the Committee held a hearing entitled From Ivory Towers to Dark Corners: Investigating the Nexus Between Antisemitism, Tax-Exempt Universities, and Terror Financing. The Committee followed the hearing with letters to those institutions on January 10, 202
Biological screening of herbal drugs: Introduction and Need for
Phyto-Pharmacological Screening, New Strategies for evaluating
Natural Products, In vitro evaluation techniques for Antioxidants, Antimicrobial and Anticancer drugs. In vivo evaluation techniques
for Anti-inflammatory, Antiulcer, Anticancer, Wound healing, Antidiabetic, Hepatoprotective, Cardio protective, Diuretics and
Antifertility, Toxicity studies as per OECD guidelines
2024.06.01 Introducing a competency framework for languag learning materials ...Sandy Millin
http://sandymillin.wordpress.com/iateflwebinar2024
Published classroom materials form the basis of syllabuses, drive teacher professional development, and have a potentially huge influence on learners, teachers and education systems. All teachers also create their own materials, whether a few sentences on a blackboard, a highly-structured fully-realised online course, or anything in between. Despite this, the knowledge and skills needed to create effective language learning materials are rarely part of teacher training, and are mostly learnt by trial and error.
Knowledge and skills frameworks, generally called competency frameworks, for ELT teachers, trainers and managers have existed for a few years now. However, until I created one for my MA dissertation, there wasn’t one drawing together what we need to know and do to be able to effectively produce language learning materials.
This webinar will introduce you to my framework, highlighting the key competencies I identified from my research. It will also show how anybody involved in language teaching (any language, not just English!), teacher training, managing schools or developing language learning materials can benefit from using the framework.
Synthetic Fiber Construction in lab .pptxPavel ( NSTU)
Synthetic fiber production is a fascinating and complex field that blends chemistry, engineering, and environmental science. By understanding these aspects, students can gain a comprehensive view of synthetic fiber production, its impact on society and the environment, and the potential for future innovations. Synthetic fibers play a crucial role in modern society, impacting various aspects of daily life, industry, and the environment. ynthetic fibers are integral to modern life, offering a range of benefits from cost-effectiveness and versatility to innovative applications and performance characteristics. While they pose environmental challenges, ongoing research and development aim to create more sustainable and eco-friendly alternatives. Understanding the importance of synthetic fibers helps in appreciating their role in the economy, industry, and daily life, while also emphasizing the need for sustainable practices and innovation.
Unit 8 - Information and Communication Technology (Paper I).pdfThiyagu K
This slides describes the basic concepts of ICT, basics of Email, Emerging Technology and Digital Initiatives in Education. This presentations aligns with the UGC Paper I syllabus.
Palestine last event orientationfvgnh .pptxRaedMohamed3
An EFL lesson about the current events in Palestine. It is intended to be for intermediate students who wish to increase their listening skills through a short lesson in power point.
1. Lecture Series on
Biostatistics
No. Biostat - 5
Date: 18.01.2009
MEASURES OF CENTRAL
TENDENCY
Dr. Bijaya Bhusan Nanda,
M. Sc (Gold Medalist) Ph. D. (Stat.)
Topper Orissa Statistics & Economics Services, 1988
bijayabnanda@yahoo.com
2. CONTENTS
Descriptive Measures
Measure of Central Tendency (CT)
Concept and Definition
Mean
Median
Mode
Uses of Different Measures of CT
Advantages and Disadvantages of Different
Measures of CT
3. Enabling Objectives
To equip the trainee with skills to
manipulate the data in the form of
numbers that they encounter as a health
sciences professional.
The better able they are to manipulate
such data, better understanding they will
have of the environment and forces that
generate these data.
4. Descriptive Statistics
Classification and Tabulation of Data
Presentation of Data
Measure of Central Tendency (CT)
Measure of Shape
Measure of Dispersion
5. Measures of Central Tendency
Concept
Data, in nature, has a tendency
to cluster around a central Value.
That central value condenses the
large mass of data into a single
representative figure.
The Central Value can be obtained
from sample values (Called
Statistics) and Population
observations (Called Parameters)
6. Measures of Central Tendency
Definition
A measure of central tendency is a
typical value around which other
figures congregate.
Simpson and Kofka
(Statistician)
Average is an attempt to find an
single figure to describe a group of
figures.
Clark and Schakade
(Statistician)
7. Different Measure of Central Tendency (MCT)
1. Mathematical Average
Arithmetic Mean Simply Mean
Geometric Mean
Harmonic Mean
1. Positional Average
Median
Mode
1. Mean, Median and Mode are the
Most Commonly used MCT in
Health Science
8. Characteristics of an Ideal MCT
1. It should be rigidly defined so that different persons may
not interpret it differently.
2. It should be easy to understand and easy to calculate.
3. It should be based on all the observations of the data.
4. It should be easily subjected to further mathematical
treatment.
5. It should be least affected by the sampling fluctuation .
6. It should not be unduly affected by the extreme values.
9. Characteristics of Mean
Most representative figure for the entire mass of data
Tells the point about which items have a tendency to
cluster
Unduly affected by extreme items (very sensitive to
extreme values)
The positive deviations must balance the negative
deviations (The sum of the deviations of individual
values of observation from the mean will always add
up to zero)
The sum of squares of the deviations about the
mean is minimum
10. Arithmetic Mean (AM) or Mean
AM is obtained by summing up all the observations and
11. Individual data: X1, X2, X3, X4, …………Xn
(n = Total No. of observation)
X1 + X2 + X3 +…..
Mean =
n
Example:
Find the mean size of Tuberculin test of 10
boys measured in millimeters.
3,5,7,7,8,8,9,10,11,12.
3 + 5 + 7 +7 + 8 + 8 + 9 + 10 + 11 + 12
Mean =
10
= 80 / 10 = 8 mm
12. Discrete frequency distribution
Let x1,x2,x3…… xn be the variate values and f1,f2,f3….fn be their
corresponding frequencies, then their mean is given by
x1.f1 + x2.f2 +x3.f3+……+xn.fn
Mean =
f1+f2+f3+….+fn
Example:
Find mean days of Confinement after delivery in the following series.
Days of No. of Total days of confinement of
Confinement ( X) patients (f) each group. Xf Ans. Mean
days of
6 5 30
confinement
7 4 28 = 137 / 18
8 4 32 = 7.61
9 3 27
10 2 20
Total 18 137
13. Grouped frequency distribution
Let m1, m2,…….mn be the mid-point of the class and
f1, f2,……….fn be the corresponding frequency
Direct method: Σ fi mi
Mean: X =
Σ fi
, Where mi refers to the mid point of the ith class fi
frequency of the ith class and Σfi = N
Shortcut method:
Σfidi
Mean = A +
N
Where, A is the assumed mean and di = mi-A.
14. Example:
Calculate overall fatality rate in smallpox
from the age wise fatality rate given below.
Age Gr in 0-1 2-4 5-9 Above 9
yr
No. of Cases 150 304 421 170
Fatality rate 35.33 21.38 16.86 14.17
15. Solution :
Mean fatality rate= ∑fx = 21305.98 =20.39
∑f 1045
Age Gr. in No. of Cases Fatality Rate fx
year (f) (x)
0-1 150 35.33 5299.5
2-4 304 21.38 6499.52
5-9 421 16.86 7098.06
Above 9 170 14.17 2408.9
Total 1045 87.74 21305.98
16. Problem:
Find the mean weight of 470 infants born in a
hospital in one year from following table.
Weight 2.0-2.4 2.5-2.9 3.0-3.4 3.5-3.9 4.0-4.4 4.5+
in Kg
No. of 17 97 187 135 28 6
infant
17. Solution :
Weight Continuo Mid di = mi- f fd Sum of
in Kg. us CI in Value
A fd
X kg.
2.0-2.4 1.95-2.45 2.2 -1.0 17 -17 -65.5
2.5-2.9 2.45-2.95 2.7 -0.5 97 -48.5
3.0-3.4 2.95-3.45 3.2 0 187 0
3.5-3.9 3.45-3.85 3.7 0.5 135 67.5 +104.5
4.0-4.4 3.95-4.45 4.2 1 28 28
4.5+ 4.45+ 4.7 1.5 6 9
Total 470 +39
Mean Weight = w + (Σ fx / Σ f )
Mean Weight = 3.2 +( 39 / 470 ) = 3.28 Kg
18. Step deviation method
Ex:- Calculation of the mean from a frequency distribution of
weights of 265 male students at the university of Washington
CI-Wt) Cont. CI f d fd X-A
d=
90 - 99 89.5 – 99.5 1 -5 -5 h
100 – 109 99.5 – 109.5 1 -4 -4
Σ fd
110 – 119 109.5 – 119.5 9 -3 -27 X= A + Xh
120 – 129 119.5 – 129.5 30 -2 -60 N
130 – 139 129.5 – 139.5 42 -1 -42 99
X= 145 + x 10
140 – 149 139.5 – 149.5 66 0 0 265
150 – 159 149.5 – 159.5 47 1 47 = 145 + (0.3736)( 10 )
160 – 169 159.5 – 169.5 39 2 78
170 – 179 169.5 – 179.5 15 3 45 = 145 +0 3.74
180 – 189 179.5 – 189.5 11 4 44
= 148.74
190 -199 189.5 -199.5 1 5 5
200 – 209 199.5 – 209.5 3 6 18
Total 265 99
N=265
19. Merits, Demerits and Uses of Mean
Merits of Mean:
1. It can be easily calculated.
2. Its calculation is based on all the observations.
3. It is easy to understand.
4. It is rigidly defined by the mathematical
formula.
5. It is least affected by sampling fluctuations.
6. It is the best measure to compare two or more
series of data.
7. It does not depend upon any position.
20. Demerits of Mean :
1. It may not be represented in actual data so it
is theoretical.
2. It is affected by extreme values.
3. It can not be calculated if all the
observations are not known.
4. It can not be used for qualitative data i.e.
love, beauty , honesty, etc.
5. It may lead to fallacious conditions in the
absence of original observations.
Uses of Mean :
1. It is extremely used in medical statistics.
2. Estimates are always obtained by mean.
21. Median
Definition:
Median is defined as the middle most or
the central value of the variable in a set of
observations, when the observations are
arranged either in ascending or in
descending order of their magnitudes.
It divides the series into two equal parts.
It is a positional average, whereas the mean
is the calculated average.
22. Characteristic of Median
A positional value of the variable which
divides the distribution into two equal
parts, i.e., the median is a value that
divides the set of observations into two
halves so that one half of observations are
less than or equal to it and the other half
are greater than or equal to it.
Extreme items do not affect median and is
specially useful in open ended frequencies.
For discrete data, mean and median do not
change if all the measurements are
multiplied by the same positive number
and the result divided later by the same
23. Computation of Median
For Individual data series:
Arrange the values of X in ascending or
descending order.
When the number of observations N, is odd,
the middle most value –i.e. the (N+1)/2th
value in the arrangement will be the median.
When N is even, the A.M of N/2th and
(N+1)/2th values of the variable will give the
median.
24. Example:
ESRs of 7 subjects are 3,4,5,6,4,7,5. Find the
median.
Ans: Let us arrange the values in ascending order.
3,4,4,5,5,6,7. The 4th observation i.e. 5 is the
median in this series.
Example:
ESRs of 8 subjects are 3,4,5,6,4,7,6,7. Find the
median.
Ans: Let us arrange the values in ascending order.
3,4,4,5,6,6,7,7. In this series the median is the
a.m of 4th and 5th Observations
25. Discrete Data Series
Arrange the value of the variable in
ascending or descending order of
magnitude. Find out the cumulative
frequencies (c.f.). Since median is the size of
(N+1)/2th. Item, look at the cumulative
frequency column find that c.f. which is
either equal to (N+1)/2 or next higher to
that and value of the variable corresponding
to it is the median.
26. Grouped frequency distribution
N/2 is used as the rank of the median instead
of (N+1)/2. The formula for calculating median
is given as:
Median = L+ [(N/2-c.f)/f] × I
Where L = Lower limit of the median class i.e.
the class in which the middle item of the
distribution lies.
c.f = Cumulative frequency of the class
preceding the media n class
f = Frequency of the median class. I= class
interval of the median class.
27. Median for grouped or interval data
Example : Calculation of the median (x). data N/2-Cf
represent weights of 265 male students at the X= L + ×I
university of Washington f
Class – interval f Cumulative frequency 132.5-83
X= 140 + x 10
( Weight) f “less than” 66
90 - 99 1 1 49.5
= 140 + x 10
100 – 109 1 2 66
110 – 119 9 11
= 140 + (0.750) ( 10 )
120 – 129 30 41
130 – 139 42 83 = 140 + 7.50
140 – 149 66 149
= 147.5
150 – 159 47 196
160 – 169 39 235
170 – 179 15 250
180 – 189 11 261
190 -199 1 262
200 - 209 3 265
N = 265 N/2=132.5
28. Mode
Definition:
Mode is defined as the most frequently
occurring measurement in a set of
observations, or a measurement of relatively
great concentration, for some frequency
distributions may have more than one such
point of concentration, even though these
concentration might not contain precisely
the same frequencies.
29. Characteristics of Mode
Mode of a categorical or a discrete numerical variable is that value of
the variable which occurs maximum number of times and for a
continuous variable it is the value around which the series has
maximum concentration.
The mode does not necessarily describe the ‘most’( for example,
more than 50 %) of the cases
Like median, mode is also a positional average. Hence mode is useful
in eliminating the effect of extreme variations and to study popular
(highest occurring) case (used in qualitative data)
The mode is usually a good indicator of the centre of the data only if
there is one dominating frequency.
Mode not amenable for algebraic treatment (like median or mean)
Median lies between mean & mode.
For normal distribution, mean, median and mode are equal (one and
the same)
30. Discrete Data Series
In case of discrete series, quite often the
mode can be determined by closely looking
at the data.
Example. Find the mode -
Quantity of glucose (mg%) in First we arrange this data
blood of 25 students set in the ascending order
70 88 95 101 106 and find the frequency.
79 93 96 101 107
83 93 97 103 108
86 95 97 103 112
87 95 98 106 115
31. Quantity of Frequency Quantity of Frequency
glucose (mg%) glucose (mg%) in
in blood blood
70 1 97 2
79 1 98 1
83 1 101 2
86 1 103 2
87 1 106 2
88 1 107 1
93 3 108 1
95 2 112 1
96 1 115 1
This data set contains 25 observations. We see that, the value of 93 is
repeated most often. Therefore, the mode of the data set is 93.
32. Grouped frequency distribution
Calculation of Mode:
{(f1 – f0)
Mode= L + x I
[(f1 – f0) + (f1 – f2 )]
L = lower limit of the modal class i.e.- the class
containing mode
f1 = Frequency of modal class
f0 = Frequency of the pre-modal class
f2 =Frequency of the post modal class
I = Class length of the modal class
33. Example: Find the value of the mode from the data given
below
Weight in Kg Exclusive No of Weight in No of
CI students Kg students
93-97 92.5-97.5 2 113-117 14
98-102 97.5-102.5 5 118-122 6
103-107 102.5-107.5 12 123-127 3
108-112 107.5-112.5 17 128-131 1
Solution: By inspection mode lies in the class 108-112. But the real
limit of this class is 107.5-112.5
{(f1 – f0)
Mode= L + x I = 110.63
[(f1 – f0) + (f1 – fx )]
Where, L = 107.5,f1 = 17, f0 = 12, f2 = 14, I = 5
34. Merits, demerits and use of mode
Merits:
Mode is readily comprehensible and easy to
calculate.
Mode is not at all affected by extreme values,
Mode can be conveniently located even if the
frequency distribution has class-intervals of
unequal magnitude provided the modal class and
the classes preceding and succeeding it are of the
same magnitude,
35. Demerits:
Mode is ill-defined. Not always possible to find a
clearly defined mode.
It is not based upon all the observations,
It is not amenable to further mathematical
treatment
As compared with mean, mode is affected to a
great extent by fluctuations of sampling,
When data sets contain two, three, or many modes,
they are difficult to interpret and compare.
Uses
Mode is useful for qualitative data.
36. Comparison of mean, median and mode
The mean is rigidly defined. So is the median, and so is the
mode except when there are more than one value with the
highest frequency density.
The computation of the three measures of central tendency
involves equal amount of labour. But in case of continuous
distribution, the calculation of exact mode is impossible when
few observations are given. Even though a large number of
observations grouped into a frequency distribution are
available, the mode is difficult to determine. The present
formula discussed here for computation of mode suffers from
the drawback that it assumes uniform distribution of frequency
in the modal class. As regards median, when the number of
observations are even it is determined approximately as the
midpoint of two middle items. The value of the mode as well
as the median can be determined graphically where as the A.M
cannot be determined graphically.
The A.M is based upon all the observations. But the median
and, so also, mode are not based on all the observations
37. Contd……
When the A.Ms for two or more series of the variables and the
number of observations in them are available, the A.M of
combined series can be computed directly from the A.M of the
individual series. But the median or mode of the combined series
cannot be computed from the median or mode of the individual
series.
The A.M median and mode are easily comprehensible.
When the data contains few extreme values i.e. very high or small
values, the A.M is not representative of the series. In such case the
median is more appropriate.
Of the three measures, the mean is generally the one that is least
affected by sampling fluctuations, although in some particular
situations the median or the mode may be superior in this respect.
In case of open end distribution there is no difficulty in calculating
the median or mode. But, mean cannot be computed in this case.
Thus it is evident from the above comparison, by and large the
A.M is the best measure of central tendency.
38. REFERENCE
Applied Biostatistical Analysis, W.W.
Daniel
Biostatistical Analysis, J.S. Zar
Mathematical Statistics and data
analysis, John. A. Rice
Fundamentals of Applied Statistics,
S.C Gupta and V. K Kapoor.
Fundamental Mathematical Statistics,
S.C. Gupta, V.K. Kapoor
40. Outlier
An observation (or measurement) that is
unusually large or small relative to the
other values in a data set is called an
outlier. Outliers typically are attributable
to one of the following causes:
The measurement is observed, recorded,
or entered into the computer incorrectly.
The measurements come from a different
population.
The measurement is correct, but
represents a rare event.