This document provides an overview of descriptive statistics. It defines descriptive statistics as procedures used to summarize, organize, and simplify data. Key aspects covered include frequency distributions, graphical representations such as bar graphs and scatter plots, and measures of central tendency (mean, median, mode) and dispersion. The mean is defined as the sum of all values divided by the total number of values. The median is the middle value when values are ordered from lowest to highest. The mode is the most frequently occurring value. Examples are provided to demonstrate calculating each measure.
Lecture on Introduction to Descriptive Statistics - Part 1 and Part 2. These slides were presented during a lecture at the Colombo Institute of Research and Psychology.
Lecture on Introduction to Descriptive Statistics - Part 1 and Part 2. These slides were presented during a lecture at the Colombo Institute of Research and Psychology.
The material is consolidated from different sources on the basic concepts of Statistics which could be used for the Visualization an Prediction requirements of Analytics.
I deeply acknowledge the sources which helped me consolidate the material for my students.
Statistics for machine learning shifa noorulainShifaNoorUlAin1
Introduction to Statistics
Descriptive Statistics
Inferential Statistics
Categories in Statistics
Descriptive Vs Inferential Statistics
Descritive statistics Topics
-Measures of Central Tendency
-Measures of the Spread
-Measures of Asymmetry(Skewness)
The material is consolidated from different sources on the basic concepts of Statistics which could be used for the Visualization an Prediction requirements of Analytics.
I deeply acknowledge the sources which helped me consolidate the material for my students.
Statistics for machine learning shifa noorulainShifaNoorUlAin1
Introduction to Statistics
Descriptive Statistics
Inferential Statistics
Categories in Statistics
Descriptive Vs Inferential Statistics
Descritive statistics Topics
-Measures of Central Tendency
-Measures of the Spread
-Measures of Asymmetry(Skewness)
Title: Sense of Smell
Presenter: Dr. Faiza, Assistant Professor of Physiology
Qualifications:
MBBS (Best Graduate, AIMC Lahore)
FCPS Physiology
ICMT, CHPE, DHPE (STMU)
MPH (GC University, Faisalabad)
MBA (Virtual University of Pakistan)
Learning Objectives:
Describe the primary categories of smells and the concept of odor blindness.
Explain the structure and location of the olfactory membrane and mucosa, including the types and roles of cells involved in olfaction.
Describe the pathway and mechanisms of olfactory signal transmission from the olfactory receptors to the brain.
Illustrate the biochemical cascade triggered by odorant binding to olfactory receptors, including the role of G-proteins and second messengers in generating an action potential.
Identify different types of olfactory disorders such as anosmia, hyposmia, hyperosmia, and dysosmia, including their potential causes.
Key Topics:
Olfactory Genes:
3% of the human genome accounts for olfactory genes.
400 genes for odorant receptors.
Olfactory Membrane:
Located in the superior part of the nasal cavity.
Medially: Folds downward along the superior septum.
Laterally: Folds over the superior turbinate and upper surface of the middle turbinate.
Total surface area: 5-10 square centimeters.
Olfactory Mucosa:
Olfactory Cells: Bipolar nerve cells derived from the CNS (100 million), with 4-25 olfactory cilia per cell.
Sustentacular Cells: Produce mucus and maintain ionic and molecular environment.
Basal Cells: Replace worn-out olfactory cells with an average lifespan of 1-2 months.
Bowman’s Gland: Secretes mucus.
Stimulation of Olfactory Cells:
Odorant dissolves in mucus and attaches to receptors on olfactory cilia.
Involves a cascade effect through G-proteins and second messengers, leading to depolarization and action potential generation in the olfactory nerve.
Quality of a Good Odorant:
Small (3-20 Carbon atoms), volatile, water-soluble, and lipid-soluble.
Facilitated by odorant-binding proteins in mucus.
Membrane Potential and Action Potential:
Resting membrane potential: -55mV.
Action potential frequency in the olfactory nerve increases with odorant strength.
Adaptation Towards the Sense of Smell:
Rapid adaptation within the first second, with further slow adaptation.
Psychological adaptation greater than receptor adaptation, involving feedback inhibition from the central nervous system.
Primary Sensations of Smell:
Camphoraceous, Musky, Floral, Pepperminty, Ethereal, Pungent, Putrid.
Odor Detection Threshold:
Examples: Hydrogen sulfide (0.0005 ppm), Methyl-mercaptan (0.002 ppm).
Some toxic substances are odorless at lethal concentrations.
Characteristics of Smell:
Odor blindness for single substances due to lack of appropriate receptor protein.
Behavioral and emotional influences of smell.
Transmission of Olfactory Signals:
From olfactory cells to glomeruli in the olfactory bulb, involving lateral inhibition.
Primitive, less old, and new olfactory systems with different path
Recomendações da OMS sobre cuidados maternos e neonatais para uma experiência pós-natal positiva.
Em consonância com os ODS – Objetivos do Desenvolvimento Sustentável e a Estratégia Global para a Saúde das Mulheres, Crianças e Adolescentes, e aplicando uma abordagem baseada nos direitos humanos, os esforços de cuidados pós-natais devem expandir-se para além da cobertura e da simples sobrevivência, de modo a incluir cuidados de qualidade.
Estas diretrizes visam melhorar a qualidade dos cuidados pós-natais essenciais e de rotina prestados às mulheres e aos recém-nascidos, com o objetivo final de melhorar a saúde e o bem-estar materno e neonatal.
Uma “experiência pós-natal positiva” é um resultado importante para todas as mulheres que dão à luz e para os seus recém-nascidos, estabelecendo as bases para a melhoria da saúde e do bem-estar a curto e longo prazo. Uma experiência pós-natal positiva é definida como aquela em que as mulheres, pessoas que gestam, os recém-nascidos, os casais, os pais, os cuidadores e as famílias recebem informação consistente, garantia e apoio de profissionais de saúde motivados; e onde um sistema de saúde flexível e com recursos reconheça as necessidades das mulheres e dos bebês e respeite o seu contexto cultural.
Estas diretrizes consolidadas apresentam algumas recomendações novas e já bem fundamentadas sobre cuidados pós-natais de rotina para mulheres e neonatos que recebem cuidados no pós-parto em unidades de saúde ou na comunidade, independentemente dos recursos disponíveis.
É fornecido um conjunto abrangente de recomendações para cuidados durante o período puerperal, com ênfase nos cuidados essenciais que todas as mulheres e recém-nascidos devem receber, e com a devida atenção à qualidade dos cuidados; isto é, a entrega e a experiência do cuidado recebido. Estas diretrizes atualizam e ampliam as recomendações da OMS de 2014 sobre cuidados pós-natais da mãe e do recém-nascido e complementam as atuais diretrizes da OMS sobre a gestão de complicações pós-natais.
O estabelecimento da amamentação e o manejo das principais intercorrências é contemplada.
Recomendamos muito.
Vamos discutir essas recomendações no nosso curso de pós-graduação em Aleitamento no Instituto Ciclos.
Esta publicação só está disponível em inglês até o momento.
Prof. Marcus Renato de Carvalho
www.agostodourado.com
ARTIFICIAL INTELLIGENCE IN HEALTHCARE.pdfAnujkumaranit
Artificial intelligence (AI) refers to the simulation of human intelligence processes by machines, especially computer systems. It encompasses tasks such as learning, reasoning, problem-solving, perception, and language understanding. AI technologies are revolutionizing various fields, from healthcare to finance, by enabling machines to perform tasks that typically require human intelligence.
The prostate is an exocrine gland of the male mammalian reproductive system
It is a walnut-sized gland that forms part of the male reproductive system and is located in front of the rectum and just below the urinary bladder
Function is to store and secrete a clear, slightly alkaline fluid that constitutes 10-30% of the volume of the seminal fluid that along with the spermatozoa, constitutes semen
A healthy human prostate measures (4cm-vertical, by 3cm-horizontal, 2cm ant-post ).
It surrounds the urethra just below the urinary bladder. It has anterior, median, posterior and two lateral lobes
It’s work is regulated by androgens which are responsible for male sex characteristics
Generalised disease of the prostate due to hormonal derangement which leads to non malignant enlargement of the gland (increase in the number of epithelial cells and stromal tissue)to cause compression of the urethra leading to symptoms (LUTS
TEST BANK for Operations Management, 14th Edition by William J. Stevenson, Ve...kevinkariuki227
TEST BANK for Operations Management, 14th Edition by William J. Stevenson, Verified Chapters 1 - 19, Complete Newest Version.pdf
TEST BANK for Operations Management, 14th Edition by William J. Stevenson, Verified Chapters 1 - 19, Complete Newest Version.pdf
micro teaching on communication m.sc nursing.pdfAnurag Sharma
Microteaching is a unique model of practice teaching. It is a viable instrument for the. desired change in the teaching behavior or the behavior potential which, in specified types of real. classroom situations, tends to facilitate the achievement of specified types of objectives.
Flu Vaccine Alert in Bangalore Karnatakaaddon Scans
As flu season approaches, health officials in Bangalore, Karnataka, are urging residents to get their flu vaccinations. The seasonal flu, while common, can lead to severe health complications, particularly for vulnerable populations such as young children, the elderly, and those with underlying health conditions.
Dr. Vidisha Kumari, a leading epidemiologist in Bangalore, emphasizes the importance of getting vaccinated. "The flu vaccine is our best defense against the influenza virus. It not only protects individuals but also helps prevent the spread of the virus in our communities," he says.
This year, the flu season is expected to coincide with a potential increase in other respiratory illnesses. The Karnataka Health Department has launched an awareness campaign highlighting the significance of flu vaccinations. They have set up multiple vaccination centers across Bangalore, making it convenient for residents to receive their shots.
To encourage widespread vaccination, the government is also collaborating with local schools, workplaces, and community centers to facilitate vaccination drives. Special attention is being given to ensuring that the vaccine is accessible to all, including marginalized communities who may have limited access to healthcare.
Residents are reminded that the flu vaccine is safe and effective. Common side effects are mild and may include soreness at the injection site, mild fever, or muscle aches. These side effects are generally short-lived and far less severe than the flu itself.
Healthcare providers are also stressing the importance of continuing COVID-19 precautions. Wearing masks, practicing good hand hygiene, and maintaining social distancing are still crucial, especially in crowded places.
Protect yourself and your loved ones by getting vaccinated. Together, we can help keep Bangalore healthy and safe this flu season. For more information on vaccination centers and schedules, residents can visit the Karnataka Health Department’s official website or follow their social media pages.
Stay informed, stay safe, and get your flu shot today!
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...i3 Health
i3 Health is pleased to make the speaker slides from this activity available for use as a non-accredited self-study or teaching resource.
This slide deck presented by Dr. Kami Maddocks, Professor-Clinical in the Division of Hematology and
Associate Division Director for Ambulatory Operations
The Ohio State University Comprehensive Cancer Center, will provide insight into new directions in targeted therapeutic approaches for older adults with mantle cell lymphoma.
STATEMENT OF NEED
Mantle cell lymphoma (MCL) is a rare, aggressive B-cell non-Hodgkin lymphoma (NHL) accounting for 5% to 7% of all lymphomas. Its prognosis ranges from indolent disease that does not require treatment for years to very aggressive disease, which is associated with poor survival (Silkenstedt et al, 2021). Typically, MCL is diagnosed at advanced stage and in older patients who cannot tolerate intensive therapy (NCCN, 2022). Although recent advances have slightly increased remission rates, recurrence and relapse remain very common, leading to a median overall survival between 3 and 6 years (LLS, 2021). Though there are several effective options, progress is still needed towards establishing an accepted frontline approach for MCL (Castellino et al, 2022). Treatment selection and management of MCL are complicated by the heterogeneity of prognosis, advanced age and comorbidities of patients, and lack of an established standard approach for treatment, making it vital that clinicians be familiar with the latest research and advances in this area. In this activity chaired by Michael Wang, MD, Professor in the Department of Lymphoma & Myeloma at MD Anderson Cancer Center, expert faculty will discuss prognostic factors informing treatment, the promising results of recent trials in new therapeutic approaches, and the implications of treatment resistance in therapeutic selection for MCL.
Target Audience
Hematology/oncology fellows, attending faculty, and other health care professionals involved in the treatment of patients with mantle cell lymphoma (MCL).
Learning Objectives
1.) Identify clinical and biological prognostic factors that can guide treatment decision making for older adults with MCL
2.) Evaluate emerging data on targeted therapeutic approaches for treatment-naive and relapsed/refractory MCL and their applicability to older adults
3.) Assess mechanisms of resistance to targeted therapies for MCL and their implications for treatment selection
Explore natural remedies for syphilis treatment in Singapore. Discover alternative therapies, herbal remedies, and lifestyle changes that may complement conventional treatments. Learn about holistic approaches to managing syphilis symptoms and supporting overall health.
Acute scrotum is a general term referring to an emergency condition affecting the contents or the wall of the scrotum.
There are a number of conditions that present acutely, predominantly with pain and/or swelling
A careful and detailed history and examination, and in some cases, investigations allow differentiation between these diagnoses. A prompt diagnosis is essential as the patient may require urgent surgical intervention
Testicular torsion refers to twisting of the spermatic cord, causing ischaemia of the testicle.
Testicular torsion results from inadequate fixation of the testis to the tunica vaginalis producing ischemia from reduced arterial inflow and venous outflow obstruction.
The prevalence of testicular torsion in adult patients hospitalized with acute scrotal pain is approximately 25 to 50 percent
2. The objective of the chapter
• At the end the session, students will able to;
• Describe descriptive statistics
• Compute and interpret measure of central tendency
• Compute and interpret measure of central tendency
• Compute and interpret measure of dispersion
3. Aspects of statistics
There two major categories of statistics
1. Descriptive statistics
2. Inferential statistics
4. Descriptive statistics
• Statistical procedures used to summarise, organise, and
simplify data.
• Encompasses tools devised to organize and display data in an
accessible fashion
• It involves the quantification of recurring phenomena.
5. Descriptive statistics,…
• This process should be carried out in such a way that reflects
overall findings
– Raw data is made more manageable
– Raw data is presented in a logical form
– Patterns can be seen from organised data
6. Descriptive statistics,…
Summarizing and organizing data can be achieved through:
1. Frequency Distributions
2. Graphical Representations
3. Measures of Central Tendency
4. Measures of variability
7. Simple Frequency Distributions
• It is useful for categorical variable
• the following information can be obtained
– it allows you to pick up at a glance some valuable
information, such as highest, lowest value.
– ascertain the general shape or form of the distribution
– make an informed guess about central tendency values
10. Frequency distribution of group data
Weight Group Frequency Cum. frequency Rel. frequency Cum. Rel. freq
1500-2000 0 0 0 0
2000-2500 13 13 13.54 13.54
2500-3000 21 34 21.88 35.42
3000-3500 38 72 39.58 75
3500-4000 18 90 18.75 93.75
4000-4500 6 96 6.25 100
Total 96 100
11. Frequency distribution for numeric data,…
• If there is no good knowledge on the data one can use the
following rule as a guide to construct class intervals;
• Sturge’s Rule
– K=1 + 3.322log 𝑛, k is number of class intervals, n is
sample size
K
– W=Range
, W is width of the class interval
– Range, the difference of maximum and minimum value
14. Simple Bar Chart
• Displays the frequency distribution of a qualitative variable
• Take into account one or more classification factors
• a single bar for each category is drawn where the height of the
bar represents the frequency of the category
15. Simple bar chart,…
30
25
20
15
10
5
0
hemorrhage Embolism
Percent
hypertensive sepsis abortion
disorders
Causes of maternal mortality
Lale Say, et al. Global causes of maternal death: a W H O systematic analysis. T h e Lancet Global Health,
2014: 2(6): e323-e333
21. Stem & leaf plots
of each
• Draw a vertical line and place the first digits
class‐called the “stem” on the left side of the line.
• The numbers on the right side of the vertical line present the
second digit of each observation; they are the “leaves”
23. Box and Whisker plot
• Used to display information when the objective is to illustrate
certain locations in the distribution.
• Abox is drawn with the top of the box at the third quartile and
the bottom at the first quartile.
• The location of the mid‐point of the distribution is indicated
with a horizontal line in the box.
• straight lines, or whiskers, are drawn from the center of the top
of the box to the largest observation and from the center of the
bottom of the box to the smallest observation.
24. Box and Whisker plot,…
• Points required to construct box and whisker plot
– Lowest value
– First quartile(Q1)
– Second quartile/median(Q2)
– Third quartile(Q3)
– Larges value
25. Box and Whisker plot,…
40
60
100
120
mweigh
t
80
Minimum
value
Q2
Q1
Q2
Minimum
value
26. Box and Whisker plot,…
40
60
100
120 female male
mweight
80
Graphs by sex
27. Scatter plot
• display the relationship between two characteristics
• Used when both the variables are quatitative
• When one of the characteristics is qualitative and the other is
quantitative, the data can be displayed in box and whisker
plots.
34. Line graph, … .
Figure. Trends in early childhood mortality rates
35. Numerical summary measures
• summarization of data by means of descriptive measures
• Statistic a descriptive measure computed from the values of a
sample
• Parameter a descriptive measure computed from the values of
a population
36. parameter
• a measure, quantity, or numerical value obtained from the
population values X1, X2, . . ., XN
• As the value of a parameter is generally unknown but constant,
• we are interested in estimating a parameter to understand the
characteristic of a population using the sample data
37. statistics
• Ameasure, quantity, or numerical value obtained from the
sample values x1, x2, . . ., xn.
• The values of statistics can be computed from the sample as
functions of observations.
• Used to approximate (estimate) parameters.
38. Statistics
• The data can be organized and presented using figures and
tables in order to display the important features and
characteristics contained in data.
• However, tables and figures may not reveal the underlying
characteristics very comprehensively.
• For instance, if we want to characterize the data with a single
value, it would be very useful not only for communicating to
everyone but also for making comparisons.
39. • Let us consider that we are interested in number of C-section
births among the women during their reproductive period in
different regions of a country.
• To collect information on this variable, we need to consider
women who have completed their reproductive period.
40. • With tables and figures, we can display their frequency
distributions.
• However, to compare tables and figures for different regions, it
would be a very difficult task.
• In such situations, we need a single statistic which contains the
salient feature of the data for a region.
• This single value can be a statistic such as arithmetic mean in
this case
41. • In case of a qualitative variable, it could be a proportion.
• With this single value, we can represent the whole data set in a
meaningful way to represent the corresponding population
characteristic and we may compare this value with similar
other values at different times or for different groups or
regions conveniently.
• Asingle value makes it very convenient to represent a whole
data set of small, medium, or large size.
42. • Four types of descriptive measures are commonly used for
characterizing underlying features of data:
1. Measures of central tendency,
2. Measures of dispersion,
3. Measures of skewness,
4. Measures of kurtosis.
43. Measures of Central Tendency
• The values of a variable often tend to be concentrated around
the center of the data.
• The center of the data can be determined by the measures of
central tendency.
• Ameasure of central tendency is considered to be a typical (or
a representative) value of the set of data as a whole.
44. • The measures of central tendency are sometimes referred as
the measures of location too.
• The most commonly used measures of central tendency are;
• mean
• median
• mode
45. Mean
Population Mean (µ)
• If X1, X2, . . ., XN are the population values, then the
population mean is
µ=
𝑥
𝑖
∑𝑁
= 𝑖=1
X1+𝑋2+⋯𝑋𝑁
𝑁 𝑁
• The population mean µ is a parameter.
• It is usually unknown, and we are interested to estimate its
value.
46. Sample mean
• If x1, x2, . . ., xn are the sample values, then the sample mean is
𝑋=
𝑥1+𝑥2,…+ 𝑥𝑛 𝑥
𝑖
∑𝑛
= 𝑖=1
𝑛
𝑛
• It is noteworthy that the sample mean x is a statistic. It is
known and we can calculate it from the sample.
• The sample mean x is used to approximate (estimate) the
population mean, µ.
47. • Amore precise way to compute the mean from a frequency
distribution is
𝑓 𝑥 +𝑓 𝑥 ,…+𝑓 𝑥 𝑓𝑖𝑥
𝑖
∑𝑛
𝑋= 1 1 2 2 𝑘 𝑘
= 𝑖=1
𝑓𝑖+𝑓2+⋯+𝑓𝑘 𝑛
where fi denotes the frequency of distinct sample observation
xi, i =1, . . ., k.
48. • Example 2.1
• Let us consider the data on number of children ever born
reported by 20 women in a sample: 4, 2, 1, 0, 2, 3, 2, 1, 1, 2, 3,
4, 3, 2, 1, 2, 2, 1, 3, 2.
• n = 20.
• We want to find the average number of children ever born
from this sample.
• The sample mean is
49. •
10/16/2019 49
𝑥=4+2+1+0+2+3+2+1+1+2+3+4+3+2+1+2+2+1+3+2
20
• 𝑥=41 =2.05
20
• Example 2.2
– Employing the same data shown above, we can compute
the arithmetic mean in a more precise way by using
frequencies. We can construct a frequency distribution table
from this data to summarize the data first
52. Features of mean
10/16/2019 52
• Simplicity: The mean is easily understood and easy to
compute.
• Uniqueness: There is one and only one mean for a given set of
data.
• The mean takes into account all values of the data.
53. • Disadvantage
– The arithmetic mean is influenced by extreme values
• Therefore, the mean may be distorted by extreme values
– cannot be used for qualitative data.
– can only be found for quantitative variables.
10/16/2019 53
54. median
10/16/2019 54
• The value which divides the ordered array into two equal parts.
• The numbers in the first part are less than or equal to the
median, and the numbers in the second part are greater than or
equal to the median.
• Notice that 50% (or less) of the data are ≤Median and 50% (or
less) of the data are≥ Median.
55. 𝑟𝑎𝑛𝑘 =
• Calculating the Median;
– Let x1, x2, . . ., xn be the sample values.
– The sample size (n) can be odd or even.
– The steps for computing the median are discussed below:
• First, we order the sample to obtain the ordered array.
• Suppose that the ordered array is;
y1, y2, . . ., yn
• We compute the rank of the middle value(s):
𝑛 + 1
2
10/16/2019 55
56. • If the sample size (n) is an odd number, there is only
one value in the middle, and the rank will be an integer:
𝑟𝑎𝑛𝑘 = 𝑛+1
=m, (m is integer)
2
• The median is the middle value of the ordered
observations, which is
Median =ym
10/16/2019 56
57. • If the sample size (n) is an even number, there are two values
in the middle, and the rank will be an integer plus 0.5.
𝑟𝑎𝑛𝑘 = 𝑛+1
=m+0.5
10/16/2019 57
2
– Therefore, the ranks of the middle values are (m) and (m +
1).
– The median is the mean (average) of the two middle values
of the ordered observations:
59. • Solution
• n = 5 (odd number)
• There is only one value in the middle.
• The rank of the middle value is
median= 𝑛+1
=5+1
=3, (m=3)
2 2
The median = 38 (unit).
10/16/2019 59
60. • Example 2.5 (Even number) Find the median for the sample
values: 10, 35, 41, 16, 20, 32.
Solution
– n = 6 (even number)
– There are two values in the middle. The rank is
median= 𝑛+1
= 6+1
=3.5, (m=3)
2 2
10/16/2019 60
62. Median
10/16/2019 62
Advantage
– Simplicity: The median is easily understood and easy to
compute.
– Uniqueness: There is only one median for a given set of
data.
– Not as drastically affected by extreme values as is the
mean.
63. • Disadvantages:
– Does not take into account all values of the sample after
ordering the data in ascending order.
– Can only be found for quantitative variables.
10/16/2019 63
64. Mode
10/16/2019 64
• Aset of values is that value which occurs most frequently (i.e.,
with the highest frequency).
• If all values are different or have the same frequencies, there
will be no mode.
• Aset of data may have more than one mode
66. Percentiles and Quartiles
10/16/2019 66
• Represent a measure of position
• The measures of position indicate the location of a data set
• These descriptive measures are called position parameters
because they can be used to designate certain positions on the
horizontal axis when the distribution of a variable is graphed
67. Percentile,…
10/16/2019 67
– Given a set of observations x1, x2; . . ., xn, the Pth percentile,
Pp, p = 0,1, 2,…, 100, is the value of X such that p percent
or less of the observations are less than Pp and (100 − p)
percent or more of the observations are greater than Pp.
68. Percentile,…
– The 10th percentile is denoted by P10 where P10
100
= 10 𝑛+1 th in the ordered value
– The 50th percentile is P50;
100
= 50 𝑛+1 th in the ordered value = median
10/16/2019 68
– The median is a special case of percentile and it represents
the 50th percentile
69. Quartile
10/16/2019 69
– Given a set of observations x1, x2, . . .,xn, the rth quartile, Qr
, r = 1,2,3, is the value of X such that (25x r) percent or
less of the observations are less than Qr and [100 − (25 r)]
percent or less of the observations are greater than Qr.
71. Measures of Dispersion
10/16/2019 71
• The dispersion of a set of observations refers to the variation
contained in the data.
• Ameasure of dispersion conveys information regarding the
amount of variability present in a set of data.
• The dispersion is small when the values are close together
• Measures of dispersion includes range, variance, standard
deviation, and coefficient of variation
74. Measures of Dispersion,…
• Example; Let us consider the hypothetical data sets of size 5
with same measure of central tendency
If we represent the samples by only measure of central tendency,
or by 𝒙 in this case, then all the samples have the same value
10/16/2019 74
75. Measures of Dispersion,…
10/16/2019 75
• In other words, although we observe that the measure of
central tendency remains same, the variation contained in each
data set, or the variation in values of a data set from its central
value are not same at all
• This indicates that the MCT alone fails to characterize
different types of properties of data sets
76. Measures of Dispersion,…
10/16/2019 76
• We want to represent the data by a small number of measures,
if possible by a single value, but sometimes a single value
cannot represent the data adequately or sufficiently
• Therefore, the measure of variation is needed to characterize
the data in addition to a measure of central tendency
77. Range
• The range is the difference between the largest value (Max)
and the smallest value (Min):
Range (R) = Max – Min
• Example
10/16/2019 77
78. Range,…
10/16/2019 78
• The unit of the range is the same as the unit of the data.
• The usefulness of the range is limited.
• A poor measure of the dispersion because it only takes into
account two of the values, maximum and minimum; however,
it plays a significant role in many applications to have a quick
view of the dispersion in a data set
79. Interquartile Range
10/16/2019 79
• Adisadvantage of the range is that it is computed using the
two extreme values of the sample data
• If the data contain extreme outlier values, then the range is
affected by those outliers.
• Interquartile range can be used to overcome this limitation of
the range
• Computed by taking the difference between the third quartile
and the first quartile
80. Interquartile Range,…
10/16/2019 80
• Provides a more meaningful measure that represents the range
of the middle 50% of the data.
• The interquartile range is defined as
IQR=Q3 - Q1
• As the extreme values are not considered, the IQR is a more
robust measure of variation in the data and is not affected by
outliers
81. Box-and-Whisker Plot
10/16/2019 81
• Measure of location in which value around which other values
are clustered.
• Shows the variation or spread in the data from the central
value
• Also shows whether the data are symmetric or not
• The box contains features included in the middle 50% of the
observations, and whiskers show the minimum and maximum
values
82. Box-and-Whisker Plot
• Represents the main features of the data very efficiently and
precisely
• Example: Consider the following data set:
14.6, 24.3, 24.9, 27.0, 27.2, 27.4, 28.2, 28.8, 29.9, 30.7, 31.5,
31.6, 32.3, 32.8, 33.3, 33.6, 34.3, 36.9, 38.3, 44.0.
n=20
𝑡ℎ
Q1= (𝑛+1)
=5.25th observation
10/16/2019 82
4
=5th +0.25(6th -5th)=> 27.2+0.25(27.4-27.2)
=27.2+0.05=27.25
84. The Variance
10/16/2019 84
• Uses the mean as a point of reference
• an average measure of squared difference between each
observation and arithmetic mean
• Small when the observations are close to the mean, is large
when the observations are spread out from the mean, and is
zero (no variation) when all observations have the same value
(concentrated at the mean)
• Shows, on an average, how close the values of a variable are to
the arithmetic mean
85. The Variance,…
• Deviations of sample values from the sample mean
– Let x1, x2, . . ., xn be the sample values, and x be the sample mean.
– The deviation of the value xi from the sample mean x is
𝑥𝑖 − 𝑥
– The squared deviation is 𝑥𝑖 − 𝑥 𝟐
𝟐
10/16/2019 85
– The sum of squared deviations is ∑𝑛 𝑥𝑖 − 𝑥
𝑖
=1
86. The Population Variance, σ2
• Let X1, X2, . . ., XN be the population values. The population
variance, δ2, is defined by
∑𝑁
σ2 = 𝑖=1
2
(𝑥𝑖−μ)
𝑁
𝑋 −𝜇 2
+(𝑋 −𝜇)2+⋯+ 𝑋 −𝜇
10/16/2019 86
2
= 1 2 𝑁
(unit)2
𝑁
• It may be noted here that σ2 is a parameter because it is
obtained from the population values, it is generally unknown.
It is also always true that σ2 ≥0.
87. The Sample Variance, S2
• Let x1, x2, . . ., xn be the sample values. The sample variance s2
is defined by
(𝑥𝑖−𝑥)2
∑𝑛
= 𝑖=1
𝑛−1
∑𝑛
= 𝑖=1(𝑥1−𝑥)2 + 𝑥𝑖−𝑥 2
…,+(𝑥𝑖−𝑥)2
𝑛
−1
10/16/2019 87
88. S2 ,….
• Example
– We want to compute the sample variance of the following
sample values: 10, 21, 33, 53, 54.
10/16/2019 88
89. Standard Deviation
• The variance represents squared units, and therefore is not
appropriate measure of dispersion when we wish to express
the concept of dispersion in terms of the original unit.
• The standard deviation is the square root of the variance
– Population standard deviation is σ= 𝜎2
– Sample standard deviation is S= 𝑆2
10/16/2019 89
91. Coefficient of Variation (C.V.)
10/16/2019 91
• Used to compare the variation of two variables
• we cannot use the variance or the standard deviation directly to
compare variation of two variable, because
– the variables might have different units
– the variables might have different means
• For comparison, we need a measure of the relative variation
that does not depend on either the units or on the values of
means.
92. C.V., …
• The coefficient of variation is defined by
𝐶𝑉 = 𝑆
x100
𝑋
• C.V. is free of unit and the standard deviation is divided by the
corresponding arithmetic mean to make it comparable for
different means.
• To compare the variability of two sets of data
10/16/2019 92
93. C.V., …
10/16/2019 93
• The data set with the larger value of C.V. has larger variation
which is expressed in percentage.
• The relative variability of the data set 1 is larger than the
relative variability of the data set 2 if C.V.1 > C.V.2 and vice
versa
94. C.V., …
10/16/2019 94
• Example
Variable mean Standard deviation Coefficient of variation
Sample 1 66kg 4.5kg 6.8%
Sample 2 36kg 4.5kg 12:5%
Since C.V.2> C.V.1, the relative variability of the data set 2 is
larger than the relative variability of the data set 1.
95. Skewness
10/16/2019 95
• the degree of asymmetry or departure of symmetry of a
distribution.
• In a skew distribution, the mean tends to lie on the same side
of the longer tail.
• In a symmetric distribution, the mean, the mode, and the
median coincide.
97. Skewness, …
• Pearson’s Measure
– Skewness =
𝐦𝐞𝐚𝐧−𝐦𝐨𝐝𝐞
𝐬𝐭𝐚𝐧𝐝𝐚𝐫𝐝 𝐝𝐞𝐯𝐢𝐚𝐭𝐢𝐨𝐧
=0 for symmetry,
>0 for positively skewed distribution,
<0 for negatively skewed distribution.
10/16/2019 97
98. Skewness, …
• Alternatively
Skewness =
3𝑥(mean−m𝑒𝑑𝑖𝑎𝑛)
standard deviation
+0 for symmetry,
>0 for positively skewed distribution,
<0, for negatively skewed distribution.
10/16/2019 98
99. Skewness, …
• Bowley’s Measure
Skewness =
𝑄3−𝑄2 −(𝑄2−𝑄1)
𝑄3−𝑄1
Example: Consider the following data set:
14.6, 24.3, 24.9, 27.0, 27.2, 27.4, 28.2, 28.8, 29.9, 30.7,
31.5, 31.6, 32.3, 32.8, 33.3, 33.6, 34.3, 36.9, 38.3, 44.0
• Here, n = 20. 𝑥 = 30.58: There is no mode.
• The sample variance, s2=36.3417 and the sample standard
deviation, S= 6:0284
10/16/2019 99
100. Skewness, …
• Q1=27.25, Q2=31.1, Q3=33.525
Skewness = 3𝑥(30.58−31.1)
= -0.2588
6.0284
• This shows that there is negative skewness in the distribution
The Bowley’s measure of skewness is
Skewness= 33.525−31.1 − 31.1−27.25
= -0.2271
33.525−27.25
• This measure also confirms that there is negative skewness in
the distribution
10/16/2019 100
101. Kurtosis
10/16/2019 101
• The peakedness of a distribution, usually taken relative to a
normal distribution.
• Leptokurtic: a distribution with a relatively high peak
• Platykurtic: a curve which is relatively flat topped
• Mesokurtic: a curve which is neither too peaked nor flat
topped (normal distribution curve)
103. • The measure of kurtosis is
A mesokurtic distribution has a kurtosis measure of 3 based, but
computer program reduce measure by 3, then mesokurtic distribution
will be equal to 0, leptokurtic distribution will have a kurtosis
measure > 0, and a platykurtic distribution will have a kurtosis
measure < 0.
10/16/2019 103