SlideShare a Scribd company logo
1 of 20
Biostatistics is the application of statistical principles to questions and
problems in medicine, public health or biology. One can imagine that it might
be of interest to characterize a given population (e.g., adults in Boston or all
children in the United States) with respect to the proportion of subjects who
are overweight or the proportion who have asthma, and it would also be
important to estimate the magnitude of these problems over time or perhaps
in different locations. In other circumstances in would be important to make
comparisons among groups of subjects in order to determine whether
certain behaviors (e.g., smoking, exercise, etc.) are associated with a
greater risk of certain health outcomes. It would, of course, be impossible to
answer all such questions by collecting information (data) from all subjects
in the populations of interest. A more realistic approach is to study samples
or subsets of a population. The discipline of biostatistics provides tools and
techniques for collecting data and then summarizing, analyzing, and
interpreting it. If the samples one takes are representative of the population
of interest, they will provide good estimates regarding the population overall.
Consequently, in biostatistics one analyzes samples in order to make
inferences about the population.
A fundamental task of biostatistics is to analyze samples in order to make inferences
about the population from which the samples were drawn. To illustrate this, consider
the population of Delhi in 2010, which consisted of 6,547,629 persons. One characteristic
(or variable) of potential interest might be the diastolic blood pressure of the population. It is
obviously not feasible to measure and record blood pressures for of all the residents, but
one could take samples of the population in order estimate the population's mean diastolic
blood pressure.
Despite the simplicity of this
example, it raises a series of
concepts and terms that need to be
defined. The
terms population, subjects, sample,
variable, and data elements
It is possible to select many samples from a given population. The simple
example above shows three small samples that were drawn to estimate the
mean diastolic blood pressure of Delhi residents, although it doesn't specify
how the samples were drawn. Note also that each of the samples provided a
different estimate of the mean value for the population, and none of the
estimates was the same as the actual mean for the overall population (78
mm Hg in this hypothetical example). In reality, one generally doesn't know
the true mean values of the characteristics of the population, which is of
course why we are trying to estimate them from samples. Consequently, it is
important to define and distinguish between:
• population size versus sample size
• parameter versus sample statistic.
Measures of Central Tendency
A measure of central tendency is a measure which indicates where the
middle of the data is. The three most commonly used measures of central
tendency are: The Mean, the Median, and the Mode.
Participant ID Systolic Blood
Pressure
Diastolic Blood
Pressure
Total Serum
Cholesterol
Weight Height Body Mass Index
1 141 76 199 138 63.00 24.4
2 119 64 150 183 69.75 26.4
3 122 62 227 153 65.75 24.9
4 127 81 227 178 70.00 25.5
5 125 70 163 161 70.50 22.8
6 123 72 210 206 70.00 29.6
7 105 81 205 235 72.00 31.9
8 113 63 275 151 60.75 28.8
9 106 67 208 213 69.00 31.5
10 131 77 159 142 61.00 26.8
Data Values for a Small Sample
In order to illustrate the computation of sample statistics,
we selected a small subset (n=10) of participants in the
Center of Heart Study. The data values for these ten
individuals are shown in the table below.
The first summary statistic that is important to report is the sample size. In this example the
sample size is n=10. Because this sample is small (n=10), it is easy to summarize the
sample by inspecting the observed values, for example, by listing the diastolic blood
pressures in ascending order:
62 63 64 67 70 72 76 77 81 81
Simple inspection of this small sample gives us a sense of the center of the observed
diastolic pressures and also gives us a sense of how much variability there is. However, for
a large sample, inspection of the individual data values does not provide a meaningful
summary, and summary statistics are necessary. The two key components of a useful
summary for a continuous variable are:
• a description of the center or 'average' of the data (i.e., what is a typical value?) and
• an indication of the variability in the data.
There are several statistics that describe the center of the data, but for now we will focus on the sample mean, which is computed by summing all of
the values for a particular variable in the sample and dividing by the sample size. For the sample of diastolic blood pressures in the table above, the
sample mean is computed as follows:
To simplify the formulas for sample statistics (and for population parameters), we usually denote the variable of interest as "X". X is simply a
placeholder for the variable being analyzed. Here X=diastolic blood pressure.
The general formula for the sample mean is:
The X with the bar over it represents the sample mean, and it is read as "X bar". The Σ indicates summation (i.e., sum of the X's or sum of the
diastolic blood pressures in this example).
When reporting summary statistics for a continuous variable, the convention is to report one more decimal place than the number of decimal places
measured. Systolic and diastolic blood pressures, total serum cholesterol and weight were measured to the nearest integer, therefore the summary
statistics are reported to the nearest tenth place. Height was measured to the nearest quarter inch (hundredths place), therefore the summary
statistics are reported to the nearest thousandths place. Body mass index was computed to the nearest tenths place, summary statistics are reported
to the nearest hundredths place.
Sample Mean
Measures of Dispersion
A measure of dispersion conveys information regarding
the amount of variability present in a set of data.
• Note:
1. If all the values are the same → There is no
dispersion .
2. If all the values are different → There is a dispersion:
a) If the values close to each other →The amount of
Dispersion small.
b) If the values are widely scattered → The Dispersion is
greater. Measures of Dispersion are :
1.Range (R).
2. Variance.
3. Standard deviation.
4.Coefficient of variation (C.V).
Standard Deviation
If there are no extreme or outlying values of
the variable, the mean is the most appropriate
summary of a typical value, and to summarize
variability in the data we specifically estimate
the variability in the sample around the
sample mean. If all of the observed values
in a sample are close to the sample mean,
the standard deviation will be small (i.e.,
close to zero), and if the observed values
vary widely around the sample mean, the
standard deviation will be large. If all of the
values in the sample are identical, the sample
standard deviation will be zero.
X=Diastolic Blood Pressure Deviation from the Mean
76 4.7
64 -7.3
62 -9.3
81 9.7
70 -1.3
72 0.7
81 9.7
63 -8.3
67 -4.3
77 5.7
Table - Diastolic Blood Pressures and Deviations from the Sample Mean
When discussing the sample mean, we found that the sample mean for
diastolic blood pressure = 71.3. The table below shows each of the
observed values along with its respective deviation from the sample
mean.
• The deviations from the mean reflect how far each individual's diastolic blood pressure is
from the mean diastolic blood pressure. The first participant's diastolic blood pressure is 4.7 units
above the mean while the second participant's diastolic blood pressure is 7.3 units below the
mean. What we need is a summary of these deviations from the mean, in particular a measure of how
far, on average, each participant is from the mean diastolic blood pressure. If we compute the mean
of the deviations by summing the deviations and dividing by the sample size we run into a
problem. The sum of the deviations from the mean is zero. This will always be the case as it is a
property of the sample mean, i.e., the sum of the deviations below the mean will always equal the sum
of the deviations above the mean. However, the goal is to capture the magnitude of these deviations
in a summary measure. To address this problem of the deviations summing to zero, we could take
absolute values or square each deviation from the mean. Both methods would address the
problem. The more popular method to summarize the deviations from the mean involves
squaring the deviations (absolute values are difficult in mathematical proofs). The table below
displays each of the observed values, the respective deviations from the sample mean and the
squared deviations from the mean.
X=Diastolic Blood Pressure Deviation from the Mean Squared Deviation from the Mean
76 4.7 22.09
64 -7.3 53.29
62 -9.3 86.49
81 9.7 94.09
70 -1.3 1.69
72 0.7 0.49
81 9.7 94.09
63 -8.3 68.89
67 -4.3 18.49
77 5.7 32.49
The squared deviations are interpreted as follows. The first participant's squared deviation is 22.09
meaning that his/her diastolic blood pressure is 22.09 units squared from the mean diastolic blood
pressure, and the second participant's diastolic blood pressure is 53.29 units squared from the
mean diastolic blood pressure. A quantity that is often used to measure variability in a sample is
called the sample variance, and it is essentially the mean of the squared deviations. The sample
variance is denoted s² and is computed as follows:
Why do we divided by (n-1) instead of n?
The sample variance is not actually the mean of the
squared deviations, because we divide by (n-1) instead of
n. In statistical inference we make generalizations or
estimates of population parameters based on sample
statistics. If we were to compute the sample variance by
taking the mean of the squared deviations and dividing by
n we would consistently underestimate the true population
variance. Dividing by (n-1) produces a better estimate of
the population variance. The sample variance is
nonetheless usually interpreted as the average squared
deviation from the mean.
In this sample of n=10 diastolic blood pressures, the sample variance is s2 = 472.10/9 =52.46.
Thus, on average diastolic blood pressures are 52.46 units squared from the mean diastolic blood
pressure. Because of the squaring, the variance is not particularly interpretable. The more common
measure of variability in a sample is the sample standard deviation, defined as the square root of
the sample variance:
Try to find:
- sample size
- mode
- median
- range
- sample mean
- sample variance
- sample standard deviation
• We usually don't have information about all of the subjects in a population of interest, so
we take samples from the population in order to make inferences about unknown
population parameters.
• An obvious concern would be how good a given sample's statistics are in estimating the
characteristics of the population from which it was drawn. There are many factors that
influence diastolic blood pressure levels, such as age, body weight, fitness, and heredity.
• We would ideally like the sample to be representative of the population. Intuitively, it
would seem preferable to have a random sample, meaning that all subjects in the
population have an equal chance of being selected into the sample; this would minimize
systematic errors caused by biased sampling.
• In addition, it is also intuitive that small samples might not be representative of the
population just by chance, and large samples are less likely to be affected by "the luck of
the draw"; this would reduce so-called random error. Since we often rely on a single
sample to estimate population parameters, we never actually know how good our
estimates are. However, one can use sampling methods that reduce bias, and the degree
of random error in a given sample can be estimated in order to get a sense of the
precision of our estimates.
A sample of 10 women seeking prenatal care at Almaty
Medical center agree to participate in a study to assess
the quality of prenatal care. At the time of study
enrollment, you the study coordinator, collected
background characteristics on each of the moms including
their age (in years).The data are shown below:
24 18 28 32 26 21 22 43 27 29
Try to find:
- sample size
- mode
- median
- range
- sample mean
- sample variance
- sample standard deviation

More Related Content

Similar to biostat 2.pptx h h jbjbivigyfyfyfyfyftftc

EXERCISE 27I WILL SEND THE DATA TO WHOM EVER WILL DO THE ASSIGNMEN.docx
EXERCISE 27I WILL SEND THE DATA TO WHOM EVER WILL DO THE ASSIGNMEN.docxEXERCISE 27I WILL SEND THE DATA TO WHOM EVER WILL DO THE ASSIGNMEN.docx
EXERCISE 27I WILL SEND THE DATA TO WHOM EVER WILL DO THE ASSIGNMEN.docxAlleneMcclendon878
 
Epidemiology Lectures for UG
Epidemiology Lectures for UGEpidemiology Lectures for UG
Epidemiology Lectures for UGamitakashyap1
 
Lecture 5 Sampling distribution of sample mean.pptx
Lecture 5 Sampling distribution of sample mean.pptxLecture 5 Sampling distribution of sample mean.pptx
Lecture 5 Sampling distribution of sample mean.pptxshakirRahman10
 
Advice On Statistical Analysis For Circulation Research
Advice On Statistical Analysis For Circulation ResearchAdvice On Statistical Analysis For Circulation Research
Advice On Statistical Analysis For Circulation ResearchNancy Ideker
 
Running Head SCENARIO NCLEX MEMORIAL HOSPITAL .docx
Running Head SCENARIO NCLEX MEMORIAL HOSPITAL                    .docxRunning Head SCENARIO NCLEX MEMORIAL HOSPITAL                    .docx
Running Head SCENARIO NCLEX MEMORIAL HOSPITAL .docxtoltonkendal
 
Sampling distribution
Sampling distributionSampling distribution
Sampling distributionswarna dey
 
Statistics for Medical students
Statistics for Medical studentsStatistics for Medical students
Statistics for Medical studentsANUSWARUM
 
Tools and Techniques - Statistics: descriptive statistics
Tools and Techniques - Statistics: descriptive statistics Tools and Techniques - Statistics: descriptive statistics
Tools and Techniques - Statistics: descriptive statistics Ramachandra Barik
 
Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"Dalia El-Shafei
 
Statistics ppt.ppt
Statistics ppt.pptStatistics ppt.ppt
Statistics ppt.pptHeni524956
 
Statistics for Social Workers J. Timothy Stocks tatr.docx
Statistics for Social Workers J. Timothy Stocks tatr.docxStatistics for Social Workers J. Timothy Stocks tatr.docx
Statistics for Social Workers J. Timothy Stocks tatr.docxdarwinming1
 
Statistics for Social Workers J. Timothy Stocks tatr.docx
Statistics for Social Workers J. Timothy Stocks tatr.docxStatistics for Social Workers J. Timothy Stocks tatr.docx
Statistics for Social Workers J. Timothy Stocks tatr.docxrafaelaj1
 
Statistics for Social Workers J. Timothy Stocks tatr.docx
Statistics for Social Workers J. Timothy Stocks tatr.docxStatistics for Social Workers J. Timothy Stocks tatr.docx
Statistics for Social Workers J. Timothy Stocks tatr.docxsusanschei
 
inferentialstatistics-210411214248.pdf
inferentialstatistics-210411214248.pdfinferentialstatistics-210411214248.pdf
inferentialstatistics-210411214248.pdfChenPalaruan
 

Similar to biostat 2.pptx h h jbjbivigyfyfyfyfyftftc (20)

EXERCISE 27I WILL SEND THE DATA TO WHOM EVER WILL DO THE ASSIGNMEN.docx
EXERCISE 27I WILL SEND THE DATA TO WHOM EVER WILL DO THE ASSIGNMEN.docxEXERCISE 27I WILL SEND THE DATA TO WHOM EVER WILL DO THE ASSIGNMEN.docx
EXERCISE 27I WILL SEND THE DATA TO WHOM EVER WILL DO THE ASSIGNMEN.docx
 
Epidemiology Lectures for UG
Epidemiology Lectures for UGEpidemiology Lectures for UG
Epidemiology Lectures for UG
 
Lecture 5 Sampling distribution of sample mean.pptx
Lecture 5 Sampling distribution of sample mean.pptxLecture 5 Sampling distribution of sample mean.pptx
Lecture 5 Sampling distribution of sample mean.pptx
 
Advice On Statistical Analysis For Circulation Research
Advice On Statistical Analysis For Circulation ResearchAdvice On Statistical Analysis For Circulation Research
Advice On Statistical Analysis For Circulation Research
 
Running Head SCENARIO NCLEX MEMORIAL HOSPITAL .docx
Running Head SCENARIO NCLEX MEMORIAL HOSPITAL                    .docxRunning Head SCENARIO NCLEX MEMORIAL HOSPITAL                    .docx
Running Head SCENARIO NCLEX MEMORIAL HOSPITAL .docx
 
Sampling distribution
Sampling distributionSampling distribution
Sampling distribution
 
Statistics for Medical students
Statistics for Medical studentsStatistics for Medical students
Statistics for Medical students
 
B02308013
B02308013B02308013
B02308013
 
B02308013
B02308013B02308013
B02308013
 
Tools and Techniques - Statistics: descriptive statistics
Tools and Techniques - Statistics: descriptive statistics Tools and Techniques - Statistics: descriptive statistics
Tools and Techniques - Statistics: descriptive statistics
 
Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"
 
Statistics ppt.ppt
Statistics ppt.pptStatistics ppt.ppt
Statistics ppt.ppt
 
Parametric tests
Parametric testsParametric tests
Parametric tests
 
Statistics for Social Workers J. Timothy Stocks tatr.docx
Statistics for Social Workers J. Timothy Stocks tatr.docxStatistics for Social Workers J. Timothy Stocks tatr.docx
Statistics for Social Workers J. Timothy Stocks tatr.docx
 
Statistics for Social Workers J. Timothy Stocks tatr.docx
Statistics for Social Workers J. Timothy Stocks tatr.docxStatistics for Social Workers J. Timothy Stocks tatr.docx
Statistics for Social Workers J. Timothy Stocks tatr.docx
 
Statistics for Social Workers J. Timothy Stocks tatr.docx
Statistics for Social Workers J. Timothy Stocks tatr.docxStatistics for Social Workers J. Timothy Stocks tatr.docx
Statistics for Social Workers J. Timothy Stocks tatr.docx
 
inferentialstatistics-210411214248.pdf
inferentialstatistics-210411214248.pdfinferentialstatistics-210411214248.pdf
inferentialstatistics-210411214248.pdf
 
Inferential statistics
Inferential statisticsInferential statistics
Inferential statistics
 
Intro to Biostat. ppt
Intro to Biostat. pptIntro to Biostat. ppt
Intro to Biostat. ppt
 
Statistics excellent
Statistics excellentStatistics excellent
Statistics excellent
 

More from MrMedicine

AG0cnae449UQep77565.pptxgghjjjjjfcchyfxcff
AG0cnae449UQep77565.pptxgghjjjjjfcchyfxcffAG0cnae449UQep77565.pptxgghjjjjjfcchyfxcff
AG0cnae449UQep77565.pptxgghjjjjjfcchyfxcffMrMedicine
 
tirthpharma313.pptxh hvibobobohog9h8g8gig8g
tirthpharma313.pptxh hvibobobohog9h8g8gig8gtirthpharma313.pptxh hvibobobohog9h8g8gig8g
tirthpharma313.pptxh hvibobobohog9h8g8gig8gMrMedicine
 
Peptic.pptxhhjsushxgxvxjxuyxhddidudyddgdydydu
Peptic.pptxhhjsushxgxvxjxuyxhddidudyddgdydyduPeptic.pptxhhjsushxgxvxjxuyxhddidudyddgdydydu
Peptic.pptxhhjsushxgxvxjxuyxhddidudyddgdydyduMrMedicine
 
Ram Chandra - 318 (2) (2).pptxcyvygugihihigug
Ram Chandra - 318 (2) (2).pptxcyvygugihihigugRam Chandra - 318 (2) (2).pptxcyvygugihihigug
Ram Chandra - 318 (2) (2).pptxcyvygugihihigugMrMedicine
 
SadC7RHR9NkPCE4D719.pptxvvvvvvĝggffttytg
SadC7RHR9NkPCE4D719.pptxvvvvvvĝggffttytgSadC7RHR9NkPCE4D719.pptxvvvvvvĝggffttytg
SadC7RHR9NkPCE4D719.pptxvvvvvvĝggffttytgMrMedicine
 
Nephrotic Syndrome Management Case Report by Slidesgo (1).pptx
Nephrotic Syndrome Management Case Report by Slidesgo (1).pptxNephrotic Syndrome Management Case Report by Slidesgo (1).pptx
Nephrotic Syndrome Management Case Report by Slidesgo (1).pptxMrMedicine
 
Presentation 2.pptxxxxxxxxxxxxrezghyyhggg
Presentation 2.pptxxxxxxxxxxxxrezghyyhgggPresentation 2.pptxxxxxxxxxxxxrezghyyhggg
Presentation 2.pptxxxxxxxxxxxxrezghyyhgggMrMedicine
 
poliomyelitis microbiology.pptx
poliomyelitis microbiology.pptxpoliomyelitis microbiology.pptx
poliomyelitis microbiology.pptxMrMedicine
 
2 урок анг.pptx
2 урок анг.pptx2 урок анг.pptx
2 урок анг.pptxMrMedicine
 
introduction-to-epidemiology (1).pptx
introduction-to-epidemiology (1).pptxintroduction-to-epidemiology (1).pptx
introduction-to-epidemiology (1).pptxMrMedicine
 
Aman%20Ppt.pptx
Aman%20Ppt.pptxAman%20Ppt.pptx
Aman%20Ppt.pptxMrMedicine
 
Basic_First_Aid.pptx
Basic_First_Aid.pptxBasic_First_Aid.pptx
Basic_First_Aid.pptxMrMedicine
 
Ram Chandra Sharma.pptx
Ram Chandra Sharma.pptxRam Chandra Sharma.pptx
Ram Chandra Sharma.pptxMrMedicine
 
Group-218(1st).pptx
Group-218(1st).pptxGroup-218(1st).pptx
Group-218(1st).pptxMrMedicine
 
Vijay Chaudhary.pptx
Vijay Chaudhary.pptxVijay Chaudhary.pptx
Vijay Chaudhary.pptxMrMedicine
 
leonardo-da-vinci-ppt-1204527819891627-4.pdf
leonardo-da-vinci-ppt-1204527819891627-4.pdfleonardo-da-vinci-ppt-1204527819891627-4.pdf
leonardo-da-vinci-ppt-1204527819891627-4.pdfMrMedicine
 
Ram Chandra 2.pptx
Ram Chandra 2.pptxRam Chandra 2.pptx
Ram Chandra 2.pptxMrMedicine
 
Vipin Kumar.pptx
Vipin Kumar.pptxVipin Kumar.pptx
Vipin Kumar.pptxMrMedicine
 

More from MrMedicine (20)

AG0cnae449UQep77565.pptxgghjjjjjfcchyfxcff
AG0cnae449UQep77565.pptxgghjjjjjfcchyfxcffAG0cnae449UQep77565.pptxgghjjjjjfcchyfxcff
AG0cnae449UQep77565.pptxgghjjjjjfcchyfxcff
 
tirthpharma313.pptxh hvibobobohog9h8g8gig8g
tirthpharma313.pptxh hvibobobohog9h8g8gig8gtirthpharma313.pptxh hvibobobohog9h8g8gig8g
tirthpharma313.pptxh hvibobobohog9h8g8gig8g
 
Peptic.pptxhhjsushxgxvxjxuyxhddidudyddgdydydu
Peptic.pptxhhjsushxgxvxjxuyxhddidudyddgdydyduPeptic.pptxhhjsushxgxvxjxuyxhddidudyddgdydydu
Peptic.pptxhhjsushxgxvxjxuyxhddidudyddgdydydu
 
Ram Chandra - 318 (2) (2).pptxcyvygugihihigug
Ram Chandra - 318 (2) (2).pptxcyvygugihihigugRam Chandra - 318 (2) (2).pptxcyvygugihihigug
Ram Chandra - 318 (2) (2).pptxcyvygugihihigug
 
SadC7RHR9NkPCE4D719.pptxvvvvvvĝggffttytg
SadC7RHR9NkPCE4D719.pptxvvvvvvĝggffttytgSadC7RHR9NkPCE4D719.pptxvvvvvvĝggffttytg
SadC7RHR9NkPCE4D719.pptxvvvvvvĝggffttytg
 
Nephrotic Syndrome Management Case Report by Slidesgo (1).pptx
Nephrotic Syndrome Management Case Report by Slidesgo (1).pptxNephrotic Syndrome Management Case Report by Slidesgo (1).pptx
Nephrotic Syndrome Management Case Report by Slidesgo (1).pptx
 
Presentation 2.pptxxxxxxxxxxxxrezghyyhggg
Presentation 2.pptxxxxxxxxxxxxrezghyyhgggPresentation 2.pptxxxxxxxxxxxxrezghyyhggg
Presentation 2.pptxxxxxxxxxxxxrezghyyhggg
 
poliomyelitis microbiology.pptx
poliomyelitis microbiology.pptxpoliomyelitis microbiology.pptx
poliomyelitis microbiology.pptx
 
2 урок анг.pptx
2 урок анг.pptx2 урок анг.pptx
2 урок анг.pptx
 
introduction-to-epidemiology (1).pptx
introduction-to-epidemiology (1).pptxintroduction-to-epidemiology (1).pptx
introduction-to-epidemiology (1).pptx
 
2 анг.pptx
2 анг.pptx2 анг.pptx
2 анг.pptx
 
Aman%20Ppt.pptx
Aman%20Ppt.pptxAman%20Ppt.pptx
Aman%20Ppt.pptx
 
Basic_First_Aid.pptx
Basic_First_Aid.pptxBasic_First_Aid.pptx
Basic_First_Aid.pptx
 
Ram Chandra Sharma.pptx
Ram Chandra Sharma.pptxRam Chandra Sharma.pptx
Ram Chandra Sharma.pptx
 
Group-218(1st).pptx
Group-218(1st).pptxGroup-218(1st).pptx
Group-218(1st).pptx
 
Group-1.pptx
Group-1.pptxGroup-1.pptx
Group-1.pptx
 
Vijay Chaudhary.pptx
Vijay Chaudhary.pptxVijay Chaudhary.pptx
Vijay Chaudhary.pptx
 
leonardo-da-vinci-ppt-1204527819891627-4.pdf
leonardo-da-vinci-ppt-1204527819891627-4.pdfleonardo-da-vinci-ppt-1204527819891627-4.pdf
leonardo-da-vinci-ppt-1204527819891627-4.pdf
 
Ram Chandra 2.pptx
Ram Chandra 2.pptxRam Chandra 2.pptx
Ram Chandra 2.pptx
 
Vipin Kumar.pptx
Vipin Kumar.pptxVipin Kumar.pptx
Vipin Kumar.pptx
 

Recently uploaded

原版1:1复刻温哥华岛大学毕业证Vancouver毕业证留信学历认证
原版1:1复刻温哥华岛大学毕业证Vancouver毕业证留信学历认证原版1:1复刻温哥华岛大学毕业证Vancouver毕业证留信学历认证
原版1:1复刻温哥华岛大学毕业证Vancouver毕业证留信学历认证rjrjkk
 
Q3 2024 Earnings Conference Call and Webcast Slides
Q3 2024 Earnings Conference Call and Webcast SlidesQ3 2024 Earnings Conference Call and Webcast Slides
Q3 2024 Earnings Conference Call and Webcast SlidesMarketing847413
 
Interimreport1 January–31 March2024 Elo Mutual Pension Insurance Company
Interimreport1 January–31 March2024 Elo Mutual Pension Insurance CompanyInterimreport1 January–31 March2024 Elo Mutual Pension Insurance Company
Interimreport1 January–31 March2024 Elo Mutual Pension Insurance CompanyTyöeläkeyhtiö Elo
 
Bladex 1Q24 Earning Results Presentation
Bladex 1Q24 Earning Results PresentationBladex 1Q24 Earning Results Presentation
Bladex 1Q24 Earning Results PresentationBladex
 
Call Girls Service Nagpur Maya Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Maya Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Maya Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Maya Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Vip B Aizawl Call Girls #9907093804 Contact Number Escorts Service Aizawl
Vip B Aizawl Call Girls #9907093804 Contact Number Escorts Service AizawlVip B Aizawl Call Girls #9907093804 Contact Number Escorts Service Aizawl
Vip B Aizawl Call Girls #9907093804 Contact Number Escorts Service Aizawlmakika9823
 
VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ₹5k To 25k With Room...
VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ₹5k To 25k With Room...VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ₹5k To 25k With Room...
VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ₹5k To 25k With Room...Suhani Kapoor
 
Vp Girls near me Delhi Call Now or WhatsApp
Vp Girls near me Delhi Call Now or WhatsAppVp Girls near me Delhi Call Now or WhatsApp
Vp Girls near me Delhi Call Now or WhatsAppmiss dipika
 
AfRESFullPaper22018EmpiricalPerformanceofRealEstateInvestmentTrustsandShareho...
AfRESFullPaper22018EmpiricalPerformanceofRealEstateInvestmentTrustsandShareho...AfRESFullPaper22018EmpiricalPerformanceofRealEstateInvestmentTrustsandShareho...
AfRESFullPaper22018EmpiricalPerformanceofRealEstateInvestmentTrustsandShareho...yordanosyohannes2
 
Unveiling the Top Chartered Accountants in India and Their Staggering Net Worth
Unveiling the Top Chartered Accountants in India and Their Staggering Net WorthUnveiling the Top Chartered Accountants in India and Their Staggering Net Worth
Unveiling the Top Chartered Accountants in India and Their Staggering Net WorthShaheen Kumar
 
Monthly Market Risk Update: April 2024 [SlideShare]
Monthly Market Risk Update: April 2024 [SlideShare]Monthly Market Risk Update: April 2024 [SlideShare]
Monthly Market Risk Update: April 2024 [SlideShare]Commonwealth
 
Lundin Gold April 2024 Corporate Presentation v4.pdf
Lundin Gold April 2024 Corporate Presentation v4.pdfLundin Gold April 2024 Corporate Presentation v4.pdf
Lundin Gold April 2024 Corporate Presentation v4.pdfAdnet Communications
 
Stock Market Brief Deck for "this does not happen often".pdf
Stock Market Brief Deck for "this does not happen often".pdfStock Market Brief Deck for "this does not happen often".pdf
Stock Market Brief Deck for "this does not happen often".pdfMichael Silva
 
government_intervention_in_business_ownership[1].pdf
government_intervention_in_business_ownership[1].pdfgovernment_intervention_in_business_ownership[1].pdf
government_intervention_in_business_ownership[1].pdfshaunmashale756
 
Instant Issue Debit Cards - High School Spirit
Instant Issue Debit Cards - High School SpiritInstant Issue Debit Cards - High School Spirit
Instant Issue Debit Cards - High School Spiritegoetzinger
 
Financial Leverage Definition, Advantages, and Disadvantages
Financial Leverage Definition, Advantages, and DisadvantagesFinancial Leverage Definition, Advantages, and Disadvantages
Financial Leverage Definition, Advantages, and Disadvantagesjayjaymabutot13
 
The Triple Threat | Article on Global Resession | Harsh Kumar
The Triple Threat | Article on Global Resession | Harsh KumarThe Triple Threat | Article on Global Resession | Harsh Kumar
The Triple Threat | Article on Global Resession | Harsh KumarHarsh Kumar
 

Recently uploaded (20)

原版1:1复刻温哥华岛大学毕业证Vancouver毕业证留信学历认证
原版1:1复刻温哥华岛大学毕业证Vancouver毕业证留信学历认证原版1:1复刻温哥华岛大学毕业证Vancouver毕业证留信学历认证
原版1:1复刻温哥华岛大学毕业证Vancouver毕业证留信学历认证
 
Monthly Economic Monitoring of Ukraine No 231, April 2024
Monthly Economic Monitoring of Ukraine No 231, April 2024Monthly Economic Monitoring of Ukraine No 231, April 2024
Monthly Economic Monitoring of Ukraine No 231, April 2024
 
Q3 2024 Earnings Conference Call and Webcast Slides
Q3 2024 Earnings Conference Call and Webcast SlidesQ3 2024 Earnings Conference Call and Webcast Slides
Q3 2024 Earnings Conference Call and Webcast Slides
 
Interimreport1 January–31 March2024 Elo Mutual Pension Insurance Company
Interimreport1 January–31 March2024 Elo Mutual Pension Insurance CompanyInterimreport1 January–31 March2024 Elo Mutual Pension Insurance Company
Interimreport1 January–31 March2024 Elo Mutual Pension Insurance Company
 
Bladex 1Q24 Earning Results Presentation
Bladex 1Q24 Earning Results PresentationBladex 1Q24 Earning Results Presentation
Bladex 1Q24 Earning Results Presentation
 
Call Girls Service Nagpur Maya Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Maya Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Maya Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Maya Call 7001035870 Meet With Nagpur Escorts
 
Vip B Aizawl Call Girls #9907093804 Contact Number Escorts Service Aizawl
Vip B Aizawl Call Girls #9907093804 Contact Number Escorts Service AizawlVip B Aizawl Call Girls #9907093804 Contact Number Escorts Service Aizawl
Vip B Aizawl Call Girls #9907093804 Contact Number Escorts Service Aizawl
 
VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ₹5k To 25k With Room...
VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ₹5k To 25k With Room...VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ₹5k To 25k With Room...
VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ₹5k To 25k With Room...
 
Vp Girls near me Delhi Call Now or WhatsApp
Vp Girls near me Delhi Call Now or WhatsAppVp Girls near me Delhi Call Now or WhatsApp
Vp Girls near me Delhi Call Now or WhatsApp
 
AfRESFullPaper22018EmpiricalPerformanceofRealEstateInvestmentTrustsandShareho...
AfRESFullPaper22018EmpiricalPerformanceofRealEstateInvestmentTrustsandShareho...AfRESFullPaper22018EmpiricalPerformanceofRealEstateInvestmentTrustsandShareho...
AfRESFullPaper22018EmpiricalPerformanceofRealEstateInvestmentTrustsandShareho...
 
Unveiling the Top Chartered Accountants in India and Their Staggering Net Worth
Unveiling the Top Chartered Accountants in India and Their Staggering Net WorthUnveiling the Top Chartered Accountants in India and Their Staggering Net Worth
Unveiling the Top Chartered Accountants in India and Their Staggering Net Worth
 
Monthly Market Risk Update: April 2024 [SlideShare]
Monthly Market Risk Update: April 2024 [SlideShare]Monthly Market Risk Update: April 2024 [SlideShare]
Monthly Market Risk Update: April 2024 [SlideShare]
 
Lundin Gold April 2024 Corporate Presentation v4.pdf
Lundin Gold April 2024 Corporate Presentation v4.pdfLundin Gold April 2024 Corporate Presentation v4.pdf
Lundin Gold April 2024 Corporate Presentation v4.pdf
 
Stock Market Brief Deck for "this does not happen often".pdf
Stock Market Brief Deck for "this does not happen often".pdfStock Market Brief Deck for "this does not happen often".pdf
Stock Market Brief Deck for "this does not happen often".pdf
 
Commercial Bank Economic Capsule - April 2024
Commercial Bank Economic Capsule - April 2024Commercial Bank Economic Capsule - April 2024
Commercial Bank Economic Capsule - April 2024
 
government_intervention_in_business_ownership[1].pdf
government_intervention_in_business_ownership[1].pdfgovernment_intervention_in_business_ownership[1].pdf
government_intervention_in_business_ownership[1].pdf
 
Instant Issue Debit Cards - High School Spirit
Instant Issue Debit Cards - High School SpiritInstant Issue Debit Cards - High School Spirit
Instant Issue Debit Cards - High School Spirit
 
Financial Leverage Definition, Advantages, and Disadvantages
Financial Leverage Definition, Advantages, and DisadvantagesFinancial Leverage Definition, Advantages, and Disadvantages
Financial Leverage Definition, Advantages, and Disadvantages
 
The Triple Threat | Article on Global Resession | Harsh Kumar
The Triple Threat | Article on Global Resession | Harsh KumarThe Triple Threat | Article on Global Resession | Harsh Kumar
The Triple Threat | Article on Global Resession | Harsh Kumar
 
🔝+919953056974 🔝young Delhi Escort service Pusa Road
🔝+919953056974 🔝young Delhi Escort service Pusa Road🔝+919953056974 🔝young Delhi Escort service Pusa Road
🔝+919953056974 🔝young Delhi Escort service Pusa Road
 

biostat 2.pptx h h jbjbivigyfyfyfyfyftftc

  • 1.
  • 2. Biostatistics is the application of statistical principles to questions and problems in medicine, public health or biology. One can imagine that it might be of interest to characterize a given population (e.g., adults in Boston or all children in the United States) with respect to the proportion of subjects who are overweight or the proportion who have asthma, and it would also be important to estimate the magnitude of these problems over time or perhaps in different locations. In other circumstances in would be important to make comparisons among groups of subjects in order to determine whether certain behaviors (e.g., smoking, exercise, etc.) are associated with a greater risk of certain health outcomes. It would, of course, be impossible to answer all such questions by collecting information (data) from all subjects in the populations of interest. A more realistic approach is to study samples or subsets of a population. The discipline of biostatistics provides tools and techniques for collecting data and then summarizing, analyzing, and interpreting it. If the samples one takes are representative of the population of interest, they will provide good estimates regarding the population overall. Consequently, in biostatistics one analyzes samples in order to make inferences about the population.
  • 3. A fundamental task of biostatistics is to analyze samples in order to make inferences about the population from which the samples were drawn. To illustrate this, consider the population of Delhi in 2010, which consisted of 6,547,629 persons. One characteristic (or variable) of potential interest might be the diastolic blood pressure of the population. It is obviously not feasible to measure and record blood pressures for of all the residents, but one could take samples of the population in order estimate the population's mean diastolic blood pressure. Despite the simplicity of this example, it raises a series of concepts and terms that need to be defined. The terms population, subjects, sample, variable, and data elements
  • 4. It is possible to select many samples from a given population. The simple example above shows three small samples that were drawn to estimate the mean diastolic blood pressure of Delhi residents, although it doesn't specify how the samples were drawn. Note also that each of the samples provided a different estimate of the mean value for the population, and none of the estimates was the same as the actual mean for the overall population (78 mm Hg in this hypothetical example). In reality, one generally doesn't know the true mean values of the characteristics of the population, which is of course why we are trying to estimate them from samples. Consequently, it is important to define and distinguish between: • population size versus sample size • parameter versus sample statistic.
  • 5. Measures of Central Tendency A measure of central tendency is a measure which indicates where the middle of the data is. The three most commonly used measures of central tendency are: The Mean, the Median, and the Mode.
  • 6. Participant ID Systolic Blood Pressure Diastolic Blood Pressure Total Serum Cholesterol Weight Height Body Mass Index 1 141 76 199 138 63.00 24.4 2 119 64 150 183 69.75 26.4 3 122 62 227 153 65.75 24.9 4 127 81 227 178 70.00 25.5 5 125 70 163 161 70.50 22.8 6 123 72 210 206 70.00 29.6 7 105 81 205 235 72.00 31.9 8 113 63 275 151 60.75 28.8 9 106 67 208 213 69.00 31.5 10 131 77 159 142 61.00 26.8 Data Values for a Small Sample In order to illustrate the computation of sample statistics, we selected a small subset (n=10) of participants in the Center of Heart Study. The data values for these ten individuals are shown in the table below.
  • 7. The first summary statistic that is important to report is the sample size. In this example the sample size is n=10. Because this sample is small (n=10), it is easy to summarize the sample by inspecting the observed values, for example, by listing the diastolic blood pressures in ascending order: 62 63 64 67 70 72 76 77 81 81 Simple inspection of this small sample gives us a sense of the center of the observed diastolic pressures and also gives us a sense of how much variability there is. However, for a large sample, inspection of the individual data values does not provide a meaningful summary, and summary statistics are necessary. The two key components of a useful summary for a continuous variable are: • a description of the center or 'average' of the data (i.e., what is a typical value?) and • an indication of the variability in the data.
  • 8. There are several statistics that describe the center of the data, but for now we will focus on the sample mean, which is computed by summing all of the values for a particular variable in the sample and dividing by the sample size. For the sample of diastolic blood pressures in the table above, the sample mean is computed as follows: To simplify the formulas for sample statistics (and for population parameters), we usually denote the variable of interest as "X". X is simply a placeholder for the variable being analyzed. Here X=diastolic blood pressure. The general formula for the sample mean is: The X with the bar over it represents the sample mean, and it is read as "X bar". The Σ indicates summation (i.e., sum of the X's or sum of the diastolic blood pressures in this example). When reporting summary statistics for a continuous variable, the convention is to report one more decimal place than the number of decimal places measured. Systolic and diastolic blood pressures, total serum cholesterol and weight were measured to the nearest integer, therefore the summary statistics are reported to the nearest tenth place. Height was measured to the nearest quarter inch (hundredths place), therefore the summary statistics are reported to the nearest thousandths place. Body mass index was computed to the nearest tenths place, summary statistics are reported to the nearest hundredths place. Sample Mean
  • 9.
  • 10.
  • 11. Measures of Dispersion A measure of dispersion conveys information regarding the amount of variability present in a set of data. • Note: 1. If all the values are the same → There is no dispersion . 2. If all the values are different → There is a dispersion: a) If the values close to each other →The amount of Dispersion small. b) If the values are widely scattered → The Dispersion is greater. Measures of Dispersion are : 1.Range (R). 2. Variance. 3. Standard deviation. 4.Coefficient of variation (C.V).
  • 12. Standard Deviation If there are no extreme or outlying values of the variable, the mean is the most appropriate summary of a typical value, and to summarize variability in the data we specifically estimate the variability in the sample around the sample mean. If all of the observed values in a sample are close to the sample mean, the standard deviation will be small (i.e., close to zero), and if the observed values vary widely around the sample mean, the standard deviation will be large. If all of the values in the sample are identical, the sample standard deviation will be zero. X=Diastolic Blood Pressure Deviation from the Mean 76 4.7 64 -7.3 62 -9.3 81 9.7 70 -1.3 72 0.7 81 9.7 63 -8.3 67 -4.3 77 5.7 Table - Diastolic Blood Pressures and Deviations from the Sample Mean When discussing the sample mean, we found that the sample mean for diastolic blood pressure = 71.3. The table below shows each of the observed values along with its respective deviation from the sample mean.
  • 13. • The deviations from the mean reflect how far each individual's diastolic blood pressure is from the mean diastolic blood pressure. The first participant's diastolic blood pressure is 4.7 units above the mean while the second participant's diastolic blood pressure is 7.3 units below the mean. What we need is a summary of these deviations from the mean, in particular a measure of how far, on average, each participant is from the mean diastolic blood pressure. If we compute the mean of the deviations by summing the deviations and dividing by the sample size we run into a problem. The sum of the deviations from the mean is zero. This will always be the case as it is a property of the sample mean, i.e., the sum of the deviations below the mean will always equal the sum of the deviations above the mean. However, the goal is to capture the magnitude of these deviations in a summary measure. To address this problem of the deviations summing to zero, we could take absolute values or square each deviation from the mean. Both methods would address the problem. The more popular method to summarize the deviations from the mean involves squaring the deviations (absolute values are difficult in mathematical proofs). The table below displays each of the observed values, the respective deviations from the sample mean and the squared deviations from the mean.
  • 14. X=Diastolic Blood Pressure Deviation from the Mean Squared Deviation from the Mean 76 4.7 22.09 64 -7.3 53.29 62 -9.3 86.49 81 9.7 94.09 70 -1.3 1.69 72 0.7 0.49 81 9.7 94.09 63 -8.3 68.89 67 -4.3 18.49 77 5.7 32.49
  • 15. The squared deviations are interpreted as follows. The first participant's squared deviation is 22.09 meaning that his/her diastolic blood pressure is 22.09 units squared from the mean diastolic blood pressure, and the second participant's diastolic blood pressure is 53.29 units squared from the mean diastolic blood pressure. A quantity that is often used to measure variability in a sample is called the sample variance, and it is essentially the mean of the squared deviations. The sample variance is denoted s² and is computed as follows: Why do we divided by (n-1) instead of n? The sample variance is not actually the mean of the squared deviations, because we divide by (n-1) instead of n. In statistical inference we make generalizations or estimates of population parameters based on sample statistics. If we were to compute the sample variance by taking the mean of the squared deviations and dividing by n we would consistently underestimate the true population variance. Dividing by (n-1) produces a better estimate of the population variance. The sample variance is nonetheless usually interpreted as the average squared deviation from the mean.
  • 16. In this sample of n=10 diastolic blood pressures, the sample variance is s2 = 472.10/9 =52.46. Thus, on average diastolic blood pressures are 52.46 units squared from the mean diastolic blood pressure. Because of the squaring, the variance is not particularly interpretable. The more common measure of variability in a sample is the sample standard deviation, defined as the square root of the sample variance:
  • 17.
  • 18. Try to find: - sample size - mode - median - range - sample mean - sample variance - sample standard deviation
  • 19. • We usually don't have information about all of the subjects in a population of interest, so we take samples from the population in order to make inferences about unknown population parameters. • An obvious concern would be how good a given sample's statistics are in estimating the characteristics of the population from which it was drawn. There are many factors that influence diastolic blood pressure levels, such as age, body weight, fitness, and heredity. • We would ideally like the sample to be representative of the population. Intuitively, it would seem preferable to have a random sample, meaning that all subjects in the population have an equal chance of being selected into the sample; this would minimize systematic errors caused by biased sampling. • In addition, it is also intuitive that small samples might not be representative of the population just by chance, and large samples are less likely to be affected by "the luck of the draw"; this would reduce so-called random error. Since we often rely on a single sample to estimate population parameters, we never actually know how good our estimates are. However, one can use sampling methods that reduce bias, and the degree of random error in a given sample can be estimated in order to get a sense of the precision of our estimates.
  • 20. A sample of 10 women seeking prenatal care at Almaty Medical center agree to participate in a study to assess the quality of prenatal care. At the time of study enrollment, you the study coordinator, collected background characteristics on each of the moms including their age (in years).The data are shown below: 24 18 28 32 26 21 22 43 27 29 Try to find: - sample size - mode - median - range - sample mean - sample variance - sample standard deviation