SlideShare a Scribd company logo
1 of 68
Lecture No. 1
Statistics and Probability
By: Muhammad Tufail
Objective
ā€¢ To inculcate in you an attitude of
Statistical and Probabilistic thinking.
ā€¢ To give you some very basic techniques in
order to apply Statistical analysis to real-
world situations/problems.
That science which enables us to draw conclusions about
various phenomena on the basis of real data collected on
sample-basis
ļ‚°A tool for data-based research
ļ‚°Also known as Quantitative Analysis
ļ‚°Any scientific enquiry in which you would like to base your
conclusions and decisions on real-life data, you need to
employ statistical techniques!
ļ‚°Now a days, in the developed countries of the world, there is
an active movement for of Statistical Literacy.
WHAT IS STATISTICS?
Application Areas
A lot of application in a wide variety of
disciplines ā€¦
Agriculture, Anthropology, Astronomy,
Biology, Economics, Engineering,
Environment, Geology, Genetics, Medicine,
Physics, Psychology, Sociology, Zoology ā€¦.
Virtually every single subject from
Anthropology to Zoology ā€¦. A to Z!
DESCRIPTIVE STATISTICS
STATISTICS
INFERENTIAL STATISTICS
THE NATURE OF DISCIPLINE
The primary text-book for the course is Introduction to Statistical
Theory (Sixth Edition) by Sher Muhammad Chaudhry and Shahid Kamal
published by Ilmi Kitab Khana, Lahore. Reference books for the course
are:
1. ā€œ ā€œ by Afzal Beg & Miraj Din Mirza.
2. ā€œ ā€œ by Mohammad Rauf Chaudhry (Polymer Publications, Urdu
Bazar, Lahore).
3. ā€œStatisticsā€ by James T. McClave & Frank H. Dietrich, II (Dellen
Publishing Company, California, U.S.A).
4. ā€œIntroducing Statisticsā€ by K.A. Yeomans (Penguin Books Ltd.,
England).
5. ā€œApplied Statisticsā€ by K.A. Yeomans (Penguin Books Ltd., England).
6. ā€œBusiness Statistics for Management & Economicsā€ by Wayne W.
Daniel and James C. Terrell (Houghton Mifflin Company, U.S.A.).
7. ā€œBasic Business Statisticsā€ by Berenson & Levine ( )
Text and Reference Material
IN ACCORDANCE WITH THE ABOVE-MENTIONED STRUCTURE,
THE ORGANIZATION OF THIS COURSE IS AS FOLLOWS:
WEEKS
LEC-
TURES
AREA
TO BE
COVERED
HOME-
WORK
ASSIGN-
MENTS
EXAMS
1 TO 5 1 TO 15
DESCRIPTIVE
STATISTICS
1 TO 5
MID-TERM-
I
6 TO 10 16 TO 30 PROBABILITY 6 TO 10
MID-TERM-
II
11 TO 15 31 TO 45
INFERENTIAL
STATISTICS
11 TO 15
FINAL
EXAM
ORGANIZATION OF THIS
COURSE
ā€¢Appreciate the nature of statistical data.
ā€¢Understand various methods of collecting
statistical data.
ā€¢Appreciate the importance of a proper sampling
procedure.
ā€¢Utilize various methods of summarizing and
describing collected data.
ā€¢Employ statistical techniques to understand the
nature of relationship between two quantitative
variables.
Upon completion of the first
segment, you will be able to:
ā€¢Understand the basic concepts of probability theory (which is
the foundation of statistical inference). Understand the
concept of discrete probability distributions and their
mathematical properties.
ā€¢Understand the concept of continuous probability distributions
and their mathematical properties.
ā€¢Get acquainted with some of the most commonly
encountered and important discrete and continuous probability
distributions such as the binomial and the normal distribution.
Upon completion of the second
segment, you will be able to:
Understand and employ various techniques of
estimation and hypothesis-testing in order to draw
reliable conclusions necessary for decision-making
in various fields of human activity.
Through this segment, you will be able to
appreciate the purpose and the goal of the subject
of Statistics.
Upon completion of the third
segment, you will be able to:
There will be two term exams and one final
exam. In addition, there will be 15 homework
assignments. The final examination will be
comprehensive in nature. (Approximately 25-30% of the
final exam paper will be on the course covered upto the
Mid-Term-II Exam.)
These will contribute the following percentages to the
final grade:
Mid-Term-I: 20%
Mid-Term-II: 20%
Final Exam: 30%
Homework Assignments: 30%
GRADING
Meaning of Statistics
Statistics
Meanings
STATUS
Political
State
Information useful for the State
The word ā€œdataā€ appears in many contexts
and frequently is used in ordinary conversation.
Although the word carries something of an aura of
scientific mystique, its meaning is quite simple and
mundane.
It is Latin for ā€œthose that are givenā€ (the
singular form is ā€œdatumā€). Data may therefore be
thought of as the results of observation.
The meaning of Data
Data are collected in many aspects of everyday life.
ā€¢ Statements given to a police officer or physician or
psychologist during an interview are data.
ā€¢ The correct and incorrect answers given by a student on
a final examination.
ā€¢ Almost any athletic event produces data.
ā€¢ The time required by a runner to complete a marathon,
ā€¢ The number of errors committed by a baseball team in
nine innings of play.
EXAMPLES OF DATA
EXAMPLES OF DATA
ā€¢ And, of course, data are obtained in the course of
scientific inquiry:
ā€¢ The positions of artifacts and fossils in an archaeological
site,
ā€¢ The number of interactions between two members of an
animal colony during a period of observation,
ā€¢ The spectral composition of light emitted by a star.
Types of Data
Data
Quantitative
(Numeric)
Qualitative
(Non - Numeric)
Variable
A quantity that, varies from an individual to
individual.
Variable
Quantitative
(Numeric)
Qualitative
(Non - Numeric)
In statistics, an observation often means any sort
of numerical recording of information, whether it is a
physical measurement such as height or weight; a
classification such as heads or tails, or an answer to a
question such as yes or no.
Variable:
A characteristic that varies with an individual or an
object, is called a variable.
For example, age is a variable as it varies from person to
person. A variable can assume a number of values. The
given set of all possible values from which the variable
takes on a value is called its Domain. If for a given
problem, the domain of a variable contains only one
value, then the variable is referred to as a constant.
OBSERVATIONS AND VARIABLES
Variables may be classified into quantitative and
qualitative according to the form of the characteristic of
interest.
A variable is called a quantitative variable when a
characteristic can be expressed numerically such as age,
weight, income or number of children.
On the other hand, if the characteristic is non-
numerical such as education, sex, eye-colour, quality,
intelligence, poverty, satisfaction, etc. the variable is referred
to as a qualitative variable. A qualitative characteristic is also
called an attribute.
An individual or an object with such a characteristic
can be counted or enumerated after having been assigned to
one of the several mutually exclusive classes or categories.
QUANTITATIVE & QUALITATIVE VARIABLES
Variable
Variable
Quantitative
(Numeric)
Qualitative
(Non - Numeric)
Continuous Discrete
Continuous Variable
Continuous Variable
Measurement
Height, Weight etc
Discrete Variable
Discrete Variable
Counting
e.g. No. of sisters
Gaps, Jumps
A quantitative variable may be classified as discrete or
continuous. A discrete variable is one that can take only a discrete
set of integers or whole numbers, that is, the values are taken by
jumps or breaks. A discrete variable represents count data such as
the number of persons in a family, the number of rooms in a house,
the number of deaths in an accident, the income of an individual, etc.
A variable is called a continuous variable if it can take on any
value-fractional or integralā€“ā€“within a given interval, i.e. its domain is
an interval with all possible values without gaps. A continuous
variable represents measurement data such as the age of a person,
the height of a plant, the weight of a commodity, the temperature at a
place, etc.
A variable whether countable or measurable, is generally
denoted by some symbol such as X or Y and Xi or Xj represents the
ith or jth value of the variable. The subscript i or j is replaced by a
number such as 1,2,3, ā€¦ when referred to a particular value.
DISCRETE AND CONTINUOUS VARIABLES:
Measurement Scales
Measurement Scales
Nominal Scale
Ordinal Scale
Interval Scale Ratio Scale
By measurement, we usually mean the assigning of number to
observations or objects and scaling is a process of measuring. The four
scales of measurements are briefly mentioned below:
NOMINAL SCALE
The classification or grouping of the observations into mutually
exclusive qualitative categories or classes is said to constitute a nominal
scale. For example, students are classified as male and female. Number 1
and 2 may also be used to identify these two categories. Similarly, rainfall
may be classified as heavy moderate and light. We may use number 1, 2
and 3 to denote the three classes of rainfall. The numbers when they are
used only to identify the categories of the given scale, carry no numerical
significance and there is no particular order for the grouping.
MEASUREMENT SCALES
MEASUREMENT SCALES (Cont.)
ORDINAL OR RANKING SCALE
It includes the characteristic of a nominal scale
and in addition has the property of ordering or
ranking of measurements. For example, the
performance of students (or players) is rated as
excellent, good fair or poor, etc. Number 1, 2, 3,
4 etc. are also used to indicate ranks. The only
relation that holds between any pair of
categories is that of ā€œgreater thanā€ (or more
preferred).
INTERVAL SCALE
A measurement scale possessing a constant interval size
(distance) but not a true zero point, is called an interval scale.
Temperature measured on either the Celcius or the Fahrenheit
scale is an outstanding example of interval scale because the
same difference exists between 20o C (68o F) and 30o C (86o
F) as between 5o C (41o F) and 15o C (59o F). It cannot be said
that a temperature of 40 degrees is twice as hot as a
temperature of 20 degree, i.e. the ratio 40/20 has no meaning.
The arithmetic operation of addition, subtraction, etc. are
meaningful.
RATIO SCALE
It is a special kind of an interval scale where the sale of
measurement has a true zero point as its origin. The ratio scale
is used to measure weight, volume, distance, money, etc. The,
key to differentiating interval and ratio scale is that the zero point
is meaningful for ratio scale.
MEASUREMENT SCALES (Cont.)
Example
Chemical and manufacturing plants
sometimes discharge toxic-waste materials
such as DDT into nearby rivers and streams
These toxins can adversely affect the plants
and animals inhabiting the river and the river
bank.
A study of fish was conducted in the
Tennessee River in Alabama and its three
tributary creeks: Flint creek, Limestone creek
and Spring creek.
A total of 144 fish were captured, and the
following variable measured for each one:
1. River/Creek from where fish was captured
2. Species of fish (Channel fish, Largemouth
bass or smallmouth buffalo fish)
3. Length of fish (Centimeters)
4. Weight of fish (grams)
5. DDT concentration in the bodily system of the
fish (parts per million)
Classify each of the five variables measured
as quantitative or qualitative.
Also, identify the types of measurement
scales for each of the five variables.
Solution
The variables Length, weight and DDT
concentration are quantitative variables
because each is measured on a nominal
scale (Length is centimeters, Weight is
grams and DDT in parts per million).
All three of these variables are being
measured on the Ratio Scale.
Rationale
Whenever we speak about the weight of an
object, obviously, if our measuring instrument
reads ā€˜zeroā€™, this means that the object being
measured has zero weight --- and, in this sense,
the ā€˜zeroā€™ would be a true zero.
An exactly similar argument holds for the length of
an object.
As far as DDT concentration in the bodily
system of the fish is concerned, obviously, if
there is absolutely no DDT in the fish, then
the DDT concentration reads zero --- and,
this particular ā€˜zeroā€™ reading will be true
zero.
As, explained above, the three variables
length of fish, weight of fish and DDT
concentration in the bodily system of the
fish are quantitative variables measures
on the ratio scale.
In contrast:
Data on River/Creek from which the fish
were captured, and the species of fish are
qualitative data.
Both of these variables are measured on
Nominal Scale.
Rationale
The river/creek from which the fish
were captured, and the species of fish are
qualitative data because these can not be
measured quantitatively, they can only be
classified into categories.
(i.e. Channel fish, Largemouth bass or
smallmouth buffalo fish for the species and Tennessee
River, Flint creek, Limestone creek and Spring
creek)
The Statistical methods for describing,
reporting and analyzing data depend on
the type of data measured (i.e. whether
data are quantitative or qualitative).
Experience has shown that a continuous variable can never be
measured with perfect fineness because of certain habits and practices,
methods of measurements, instruments used, etc. the measurements are
thus always recorded correct to the nearest units and hence are of limited
accuracy. The actual or true values are, however, assumed to exist. For
example, if a studentā€™s weight is recorded as 60 kg (correct to the nearest
kilogram), his true weight in fact lies between 59.5 kg and 60.5 kg, whereas
a weight recorded as 60.00 kg means the true weight is known to lie
between 59.995 and 60.005 kg. Thus there is a difference, however small it
may be between the measured value and the true value. This sort of
departure from the true value is technically known as the error of
measurement. In other words, if the observed value and the true value of a
variable are denoted by x and x + Īµ respectively, then the difference (x + Īµ) ā€“
x, i.e. Īµ is the error. This error involves the unit of measurement of x and is
therefore called an absolute error. An absolute error divided by the true
value is called the relative error. Thus the relative error, which when
multiplied by 100, is percentage error. These errors are independent of the
units of measurement of x. It ought to be noted that an error has both
magnitude and direction and that the word error in statistics does not mean
mistake which is a chance inaccuracy.
ERRORS OF MEASUREMENT
Errors of Measurements
Errors of Measurements
Biased Errors
Cumulative Errors
Systematic Errors
Random Errors
Compensating Errors
Accidental Errors
An error is said to be biased when the observed value is
consistently and constantly higher or lower than the true value.
Biased errors arise from the personal limitations of the observer,
the imperfection in the instruments used or some other conditions
which control the measurements. These errors are not revealed by
repeating the measurements. They are cumulative in nature, that
is, the greater the number of measurements, the greater would be
the magnitude of error. They are thus more troublesome. These
errors are also called cumulative or systematic errors.
An error, on the other hand, is said to be unbiased when the
deviations, i.e. the excesses and defects, from the true value tend
to occur equally often. Unbiased errors and revealed when
measurements are repeated and they tend to cancel out in the long
run. These errors are therefore compensating and are also known
as random errors or accidental errors.
BIASED AND RANDOM ERRORS
Statistical Inference
A Statistical Inference in an estimate or
prediction or some other generalization
about a population based on information
contained in sample.
That is, we use information contained in
sample to learn about the larger population.
Population and Sample
Population:
The collection of all individuals, items or
data under consideration in a statistical
study.
Sample:
That part of the population from which
information is collected.
Population and Sample
Population
Sample
Five Elements of an Inferencial
Statistical Problem:
ā€¢ A population
ā€¢ One or more variables of interest
ā€¢ A sample
ā€¢ An Inference
ā€¢ A measure of Reliability
In order of understand the concept of
Reliability, a very important point to be
understood is that making an inference
about population from the sample is only
part of the story.
We also need to know its reliability --- that
is,
how good our inference is.
Measure of Reliability
A measure of reliability is a statement
(usually quantified) about the degree of
uncertainty associated with a statistical
inference.
The point to be noted is that the only way
we can be certain that an inference about
population is correct is to include the
entire population in our sample.
However, because of resource constraints,
(i.e. Insufficient time and/ or money). We
usually can not work with whole
population, so we base our inference on
just a portion of population (i.e. Sample)
Consequently, whenever possible, it is
important to determine and report the
reliability of each inference made.
As such, reliability is the fifth element of
statistical inferencial problems.
Example
A large paint retailer has had numerous
complaints from customers about under-
filled paint cans.
As, a result retailer has begun inspecting
incoming shipments of paint from
suppliers.
Shipments with under-filled problems will be
sent back to supplier.
A recent shipment contained 2,440 gallon-
size cans.
The retailer sampled 50 cans and weighted
each on a scale capable of measuring
weight to four decimal places.
Properly filled cans weigh 10 pounds.
a) Describe a population
b) Describe a variable of interest
c) Describe a sample
d) Describe the Inference
e) Describe a measure of uncertainty of our
inference.
Solution
a) The population is the set of units of
interests to the retailer, which is the
shipment of 2,440 cans of paint.
b) The weight of paint cans is the variable,
the retailer wishes to evaluate.
c) The sample is the subset of population.
In this case, it is the 50 cans of paint
selected by the retailer.
d) The inference of interest involves the
generalization of the information contained in
the sample of paint cans to the population of
paint cans.
In particular, Retailer wants to learn about
the content of under-filled problem (if any)
In the population.
This might be accomplished by finding the
average weight of the cans in the sample,
and using it to estimate the average weight
of the cans of population.
e) As far as the measure of reliability of our
inference is concerned, the point to be
noted is that, using statistical methods,
we can determine a bound on the
estimation error.
Bound on the Estimation Error
This bound is simply a number that our
estimation error (i.e. the difference between
the average weight of sample and average
weight of population of cans) is not likely to
exceed.
This bound is a measure of the uncertainty
of our inference, or, in other words, the
reliability of statistical inference.
The crux of the matter is that an inference is
incomplete without a measure of its reliability
When the weights of 50 paint cans are used
to estimate the average weight of all the
cans, the estimate will not exactly mirror the
entire population.
For Example:
If the sample of 50 cans yields a mean
weight of 9 pounds, it does not follow (nor is
it likely) that the mean weight of population
of can is also exactly 9 pounds.
Nevertheless, we can use sound statistical
reasoning to ensure that our sampling
procedure will generate estimate that is
almost certainly within a specified limit of the
true mean weight of all the cans.
For example such reasoning might assure us that
the estimate of the population from the sample is
almost certainly within 1 pound of the actual
population mean.
The implication is that the actual mean weight of
the entire population of the cans is between
9 ā€“ 1=8 pounds and 9 +1=10 pounds --- that is,
(9 Ā± 1) pounds.
This interval represents the a measure of reliability
for the inference.
IN TODAYā€™S LECTURE,
YOU LEARNT:
ā€¢ The nature of the science of Statistics
ā€¢ The importance of Statistics in various
fields
ā€¢ Some technical concepts such as
ā€“ The meaning of ā€œdataā€
ā€“ Various types of variables
ā€“ Various types of measurement scales
ā€“ The concept of errors of measurement
IN THE NEXT LECTURE,
YOU WILL LEARN:
ā€¢ Concept of sampling
ā€“ Random verses non-random sampling
ā€“ Simple random sampling
ā€“ A brief introduction to other types of random sampling
ā€¢ Methods of data collection
In other words, you will begin your journey in a
subject with reference to which it has been said
that ā€œstatistical thinking will one day be as
necessary for efficient citizenship as the ability to
read and writeā€.

More Related Content

What's hot

Introduction to Statistics
Introduction to StatisticsIntroduction to Statistics
Introduction to StatisticsAnjan Mahanta
Ā 
random variable and distribution
random variable and distributionrandom variable and distribution
random variable and distributionlovemucheca
Ā 
Chapter 4 part2- Random Variables
Chapter 4 part2- Random VariablesChapter 4 part2- Random Variables
Chapter 4 part2- Random Variablesnszakir
Ā 
Probability
ProbabilityProbability
ProbabilityTodd Davies
Ā 
Set theory
Set theorySet theory
Set theoryGaditek
Ā 
Chapter 2 understanding the normal curve distribution
Chapter 2   understanding the normal curve distributionChapter 2   understanding the normal curve distribution
Chapter 2 understanding the normal curve distributionAntonio F. Balatar Jr.
Ā 
Statistics & probability
Statistics & probabilityStatistics & probability
Statistics & probabilitySuzaan van Heerden
Ā 
Geometric probability distribution
Geometric probability distributionGeometric probability distribution
Geometric probability distributionNadeem Uddin
Ā 
Sets, functions and groups
Sets, functions and groupsSets, functions and groups
Sets, functions and groupsMuhammad Adnan Ejaz
Ā 
Math 8 - Linear Functions
Math 8 - Linear FunctionsMath 8 - Linear Functions
Math 8 - Linear FunctionsCarlo Luna
Ā 
Logistic regression with SPSS
Logistic regression with SPSSLogistic regression with SPSS
Logistic regression with SPSSLNIPE
Ā 
Chapter 1 random variables and probability distributions
Chapter 1   random variables and probability distributionsChapter 1   random variables and probability distributions
Chapter 1 random variables and probability distributionsAntonio F. Balatar Jr.
Ā 
Introduction to Probability and Probability Distributions
Introduction to Probability and Probability DistributionsIntroduction to Probability and Probability Distributions
Introduction to Probability and Probability DistributionsJezhabeth Villegas
Ā 
Basic probability concept
Basic probability conceptBasic probability concept
Basic probability conceptMmedsc Hahm
Ā 
5.4 mathematical induction
5.4 mathematical induction5.4 mathematical induction
5.4 mathematical inductionmath260
Ā 
Mcqs (probability)
Mcqs (probability)Mcqs (probability)
Mcqs (probability)Nadeem Uddin
Ā 
Relations and Functions
Relations and FunctionsRelations and Functions
Relations and Functionstoni dimella
Ā 
Probability Powerpoint
Probability PowerpointProbability Powerpoint
Probability Powerpointspike2904
Ā 

What's hot (20)

Introduction to Statistics
Introduction to StatisticsIntroduction to Statistics
Introduction to Statistics
Ā 
Probability distributions & expected values
Probability distributions & expected valuesProbability distributions & expected values
Probability distributions & expected values
Ā 
random variable and distribution
random variable and distributionrandom variable and distribution
random variable and distribution
Ā 
Chapter 4 part2- Random Variables
Chapter 4 part2- Random VariablesChapter 4 part2- Random Variables
Chapter 4 part2- Random Variables
Ā 
Probability
ProbabilityProbability
Probability
Ā 
Set theory
Set theorySet theory
Set theory
Ā 
Chapter 2 understanding the normal curve distribution
Chapter 2   understanding the normal curve distributionChapter 2   understanding the normal curve distribution
Chapter 2 understanding the normal curve distribution
Ā 
Statistics & probability
Statistics & probabilityStatistics & probability
Statistics & probability
Ā 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
Ā 
Geometric probability distribution
Geometric probability distributionGeometric probability distribution
Geometric probability distribution
Ā 
Sets, functions and groups
Sets, functions and groupsSets, functions and groups
Sets, functions and groups
Ā 
Math 8 - Linear Functions
Math 8 - Linear FunctionsMath 8 - Linear Functions
Math 8 - Linear Functions
Ā 
Logistic regression with SPSS
Logistic regression with SPSSLogistic regression with SPSS
Logistic regression with SPSS
Ā 
Chapter 1 random variables and probability distributions
Chapter 1   random variables and probability distributionsChapter 1   random variables and probability distributions
Chapter 1 random variables and probability distributions
Ā 
Introduction to Probability and Probability Distributions
Introduction to Probability and Probability DistributionsIntroduction to Probability and Probability Distributions
Introduction to Probability and Probability Distributions
Ā 
Basic probability concept
Basic probability conceptBasic probability concept
Basic probability concept
Ā 
5.4 mathematical induction
5.4 mathematical induction5.4 mathematical induction
5.4 mathematical induction
Ā 
Mcqs (probability)
Mcqs (probability)Mcqs (probability)
Mcqs (probability)
Ā 
Relations and Functions
Relations and FunctionsRelations and Functions
Relations and Functions
Ā 
Probability Powerpoint
Probability PowerpointProbability Powerpoint
Probability Powerpoint
Ā 

Similar to Statistics and probability lecture 01

Statistics and probability
Statistics and probability   Statistics and probability
Statistics and probability Muhammad Mayo
Ā 
Sta301 lec01
Sta301 lec01Sta301 lec01
Sta301 lec01Rizwan Alvi
Ā 
Ebd1 lecture 3 2010
Ebd1 lecture 3  2010Ebd1 lecture 3  2010
Ebd1 lecture 3 2010Reko Kemo
Ā 
Ebd1 lecture 3 2010
Ebd1 lecture 3  2010Ebd1 lecture 3  2010
Ebd1 lecture 3 2010Reko Kemo
Ā 
Ebd1 lecture 3 2010
Ebd1 lecture 3  2010Ebd1 lecture 3  2010
Ebd1 lecture 3 2010Reko Kemo
Ā 
Probability and statistics(assign 7 and 8)
Probability and statistics(assign 7 and 8)Probability and statistics(assign 7 and 8)
Probability and statistics(assign 7 and 8)Fatima Bianca Gueco
Ā 
Probability and statistics
Probability and statisticsProbability and statistics
Probability and statisticsFatima Bianca Gueco
Ā 
Probability and statistics(exercise answers)
Probability and statistics(exercise answers)Probability and statistics(exercise answers)
Probability and statistics(exercise answers)Fatima Bianca Gueco
Ā 
Probability and statistics
Probability and statisticsProbability and statistics
Probability and statisticsFatima Bianca Gueco
Ā 
Probability and statistics
Probability and statisticsProbability and statistics
Probability and statisticstrixiacruz
Ā 
Probability in statistics
Probability in statisticsProbability in statistics
Probability in statisticsSukirti Garg
Ā 
1. week 1
1. week 11. week 1
1. week 1renz50
Ā 
QUANTITATIVE RESEARCH DESIGN AND METHODS.ppt
QUANTITATIVE RESEARCH DESIGN AND METHODS.pptQUANTITATIVE RESEARCH DESIGN AND METHODS.ppt
QUANTITATIVE RESEARCH DESIGN AND METHODS.pptBhawna173140
Ā 
Mr 4. quantitative research design and methods
Mr 4. quantitative research design and methodsMr 4. quantitative research design and methods
Mr 4. quantitative research design and methodsS'Roni Roni
Ā 
Module 8-S M & T C I, Regular.pptx
Module 8-S M & T C I, Regular.pptxModule 8-S M & T C I, Regular.pptx
Module 8-S M & T C I, Regular.pptxRajashekhar Shirvalkar
Ā 

Similar to Statistics and probability lecture 01 (20)

Statistics and probability
Statistics and probability   Statistics and probability
Statistics and probability
Ā 
Sta301 lec01
Sta301 lec01Sta301 lec01
Sta301 lec01
Ā 
Ebd1 lecture 3 2010
Ebd1 lecture 3  2010Ebd1 lecture 3  2010
Ebd1 lecture 3 2010
Ā 
Ebd1 lecture 3 2010
Ebd1 lecture 3  2010Ebd1 lecture 3  2010
Ebd1 lecture 3 2010
Ā 
Ebd1 lecture 3 2010
Ebd1 lecture 3  2010Ebd1 lecture 3  2010
Ebd1 lecture 3 2010
Ā 
Probability and statistics(assign 7 and 8)
Probability and statistics(assign 7 and 8)Probability and statistics(assign 7 and 8)
Probability and statistics(assign 7 and 8)
Ā 
New statistics
New statisticsNew statistics
New statistics
Ā 
Probability and statistics
Probability and statisticsProbability and statistics
Probability and statistics
Ā 
Probability and statistics(exercise answers)
Probability and statistics(exercise answers)Probability and statistics(exercise answers)
Probability and statistics(exercise answers)
Ā 
Probability and statistics
Probability and statisticsProbability and statistics
Probability and statistics
Ā 
Statistics1(finals)
Statistics1(finals)Statistics1(finals)
Statistics1(finals)
Ā 
Probability and statistics
Probability and statisticsProbability and statistics
Probability and statistics
Ā 
Finals Stat 1
Finals Stat 1Finals Stat 1
Finals Stat 1
Ā 
Biostatistics
BiostatisticsBiostatistics
Biostatistics
Ā 
Probability in statistics
Probability in statisticsProbability in statistics
Probability in statistics
Ā 
Introduction.pdf
Introduction.pdfIntroduction.pdf
Introduction.pdf
Ā 
1. week 1
1. week 11. week 1
1. week 1
Ā 
QUANTITATIVE RESEARCH DESIGN AND METHODS.ppt
QUANTITATIVE RESEARCH DESIGN AND METHODS.pptQUANTITATIVE RESEARCH DESIGN AND METHODS.ppt
QUANTITATIVE RESEARCH DESIGN AND METHODS.ppt
Ā 
Mr 4. quantitative research design and methods
Mr 4. quantitative research design and methodsMr 4. quantitative research design and methods
Mr 4. quantitative research design and methods
Ā 
Module 8-S M & T C I, Regular.pptx
Module 8-S M & T C I, Regular.pptxModule 8-S M & T C I, Regular.pptx
Module 8-S M & T C I, Regular.pptx
Ā 

Recently uploaded

05_Annelore Lenoir_Docbyte_MeetupDora&Cybersecurity.pptx
05_Annelore Lenoir_Docbyte_MeetupDora&Cybersecurity.pptx05_Annelore Lenoir_Docbyte_MeetupDora&Cybersecurity.pptx
05_Annelore Lenoir_Docbyte_MeetupDora&Cybersecurity.pptxFinTech Belgium
Ā 
Booking open Available Pune Call Girls Shivane 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Shivane  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Shivane  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Shivane 6297143586 Call Hot Indian Gi...Call Girls in Nagpur High Profile
Ā 
How Automation is Driving Efficiency Through the Last Mile of Reporting
How Automation is Driving Efficiency Through the Last Mile of ReportingHow Automation is Driving Efficiency Through the Last Mile of Reporting
How Automation is Driving Efficiency Through the Last Mile of ReportingAggregage
Ā 
00_Main ppt_MeetupDORA&CyberSecurity.pptx
00_Main ppt_MeetupDORA&CyberSecurity.pptx00_Main ppt_MeetupDORA&CyberSecurity.pptx
00_Main ppt_MeetupDORA&CyberSecurity.pptxFinTech Belgium
Ā 
Best VIP Call Girls Noida Sector 18 Call Me: 8448380779
Best VIP Call Girls Noida Sector 18 Call Me: 8448380779Best VIP Call Girls Noida Sector 18 Call Me: 8448380779
Best VIP Call Girls Noida Sector 18 Call Me: 8448380779Delhi Call girls
Ā 
Russian Call Girls In Gtb Nagar (Delhi) 9711199012 šŸ’‹āœ”šŸ’•šŸ˜˜ Naughty Call Girls Se...
Russian Call Girls In Gtb Nagar (Delhi) 9711199012 šŸ’‹āœ”šŸ’•šŸ˜˜ Naughty Call Girls Se...Russian Call Girls In Gtb Nagar (Delhi) 9711199012 šŸ’‹āœ”šŸ’•šŸ˜˜ Naughty Call Girls Se...
Russian Call Girls In Gtb Nagar (Delhi) 9711199012 šŸ’‹āœ”šŸ’•šŸ˜˜ Naughty Call Girls Se...shivangimorya083
Ā 
Instant Issue Debit Cards - High School Spirit
Instant Issue Debit Cards - High School SpiritInstant Issue Debit Cards - High School Spirit
Instant Issue Debit Cards - High School Spiritegoetzinger
Ā 
Independent Call Girl Number in Kurla MumbaišŸ“² Pooja Nehwal 9892124323 šŸ’ž Full ...
Independent Call Girl Number in Kurla MumbaišŸ“² Pooja Nehwal 9892124323 šŸ’ž Full ...Independent Call Girl Number in Kurla MumbaišŸ“² Pooja Nehwal 9892124323 šŸ’ž Full ...
Independent Call Girl Number in Kurla MumbaišŸ“² Pooja Nehwal 9892124323 šŸ’ž Full ...Pooja Nehwal
Ā 
OAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptx
OAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptxOAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptx
OAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptxhiddenlevers
Ā 
Q3 2024 Earnings Conference Call and Webcast Slides
Q3 2024 Earnings Conference Call and Webcast SlidesQ3 2024 Earnings Conference Call and Webcast Slides
Q3 2024 Earnings Conference Call and Webcast SlidesMarketing847413
Ā 
VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ā‚¹5k To 25k With Room...
VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ā‚¹5k To 25k With Room...VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ā‚¹5k To 25k With Room...
VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ā‚¹5k To 25k With Room...Suhani Kapoor
Ā 
High Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
High Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsHigh Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
High Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur Escortsranjana rawat
Ā 
Dividend Policy and Dividend Decision Theories.pptx
Dividend Policy and Dividend Decision Theories.pptxDividend Policy and Dividend Decision Theories.pptx
Dividend Policy and Dividend Decision Theories.pptxanshikagoel52
Ā 
02_Fabio Colombo_Accenture_MeetupDora&Cybersecurity.pptx
02_Fabio Colombo_Accenture_MeetupDora&Cybersecurity.pptx02_Fabio Colombo_Accenture_MeetupDora&Cybersecurity.pptx
02_Fabio Colombo_Accenture_MeetupDora&Cybersecurity.pptxFinTech Belgium
Ā 
(DIYA) Bhumkar Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(DIYA) Bhumkar Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(DIYA) Bhumkar Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(DIYA) Bhumkar Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
Ā 
call girls in Nand Nagri (DELHI) šŸ” >ą¼’9953330565šŸ” genuine Escort Service šŸ”āœ”ļøāœ”ļø
call girls in  Nand Nagri (DELHI) šŸ” >ą¼’9953330565šŸ” genuine Escort Service šŸ”āœ”ļøāœ”ļøcall girls in  Nand Nagri (DELHI) šŸ” >ą¼’9953330565šŸ” genuine Escort Service šŸ”āœ”ļøāœ”ļø
call girls in Nand Nagri (DELHI) šŸ” >ą¼’9953330565šŸ” genuine Escort Service šŸ”āœ”ļøāœ”ļø9953056974 Low Rate Call Girls In Saket, Delhi NCR
Ā 
VIP Call Girls Service Dilsukhnagar Hyderabad Call +91-8250192130
VIP Call Girls Service Dilsukhnagar Hyderabad Call +91-8250192130VIP Call Girls Service Dilsukhnagar Hyderabad Call +91-8250192130
VIP Call Girls Service Dilsukhnagar Hyderabad Call +91-8250192130Suhani Kapoor
Ā 
Stock Market Brief Deck for 4/24/24 .pdf
Stock Market Brief Deck for 4/24/24 .pdfStock Market Brief Deck for 4/24/24 .pdf
Stock Market Brief Deck for 4/24/24 .pdfMichael Silva
Ā 
The Economic History of the U.S. Lecture 17.pdf
The Economic History of the U.S. Lecture 17.pdfThe Economic History of the U.S. Lecture 17.pdf
The Economic History of the U.S. Lecture 17.pdfGale Pooley
Ā 

Recently uploaded (20)

Commercial Bank Economic Capsule - April 2024
Commercial Bank Economic Capsule - April 2024Commercial Bank Economic Capsule - April 2024
Commercial Bank Economic Capsule - April 2024
Ā 
05_Annelore Lenoir_Docbyte_MeetupDora&Cybersecurity.pptx
05_Annelore Lenoir_Docbyte_MeetupDora&Cybersecurity.pptx05_Annelore Lenoir_Docbyte_MeetupDora&Cybersecurity.pptx
05_Annelore Lenoir_Docbyte_MeetupDora&Cybersecurity.pptx
Ā 
Booking open Available Pune Call Girls Shivane 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Shivane  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Shivane  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Shivane 6297143586 Call Hot Indian Gi...
Ā 
How Automation is Driving Efficiency Through the Last Mile of Reporting
How Automation is Driving Efficiency Through the Last Mile of ReportingHow Automation is Driving Efficiency Through the Last Mile of Reporting
How Automation is Driving Efficiency Through the Last Mile of Reporting
Ā 
00_Main ppt_MeetupDORA&CyberSecurity.pptx
00_Main ppt_MeetupDORA&CyberSecurity.pptx00_Main ppt_MeetupDORA&CyberSecurity.pptx
00_Main ppt_MeetupDORA&CyberSecurity.pptx
Ā 
Best VIP Call Girls Noida Sector 18 Call Me: 8448380779
Best VIP Call Girls Noida Sector 18 Call Me: 8448380779Best VIP Call Girls Noida Sector 18 Call Me: 8448380779
Best VIP Call Girls Noida Sector 18 Call Me: 8448380779
Ā 
Russian Call Girls In Gtb Nagar (Delhi) 9711199012 šŸ’‹āœ”šŸ’•šŸ˜˜ Naughty Call Girls Se...
Russian Call Girls In Gtb Nagar (Delhi) 9711199012 šŸ’‹āœ”šŸ’•šŸ˜˜ Naughty Call Girls Se...Russian Call Girls In Gtb Nagar (Delhi) 9711199012 šŸ’‹āœ”šŸ’•šŸ˜˜ Naughty Call Girls Se...
Russian Call Girls In Gtb Nagar (Delhi) 9711199012 šŸ’‹āœ”šŸ’•šŸ˜˜ Naughty Call Girls Se...
Ā 
Instant Issue Debit Cards - High School Spirit
Instant Issue Debit Cards - High School SpiritInstant Issue Debit Cards - High School Spirit
Instant Issue Debit Cards - High School Spirit
Ā 
Independent Call Girl Number in Kurla MumbaišŸ“² Pooja Nehwal 9892124323 šŸ’ž Full ...
Independent Call Girl Number in Kurla MumbaišŸ“² Pooja Nehwal 9892124323 šŸ’ž Full ...Independent Call Girl Number in Kurla MumbaišŸ“² Pooja Nehwal 9892124323 šŸ’ž Full ...
Independent Call Girl Number in Kurla MumbaišŸ“² Pooja Nehwal 9892124323 šŸ’ž Full ...
Ā 
OAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptx
OAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptxOAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptx
OAT_RI_Ep19 WeighingTheRisks_Apr24_TheYellowMetal.pptx
Ā 
Q3 2024 Earnings Conference Call and Webcast Slides
Q3 2024 Earnings Conference Call and Webcast SlidesQ3 2024 Earnings Conference Call and Webcast Slides
Q3 2024 Earnings Conference Call and Webcast Slides
Ā 
VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ā‚¹5k To 25k With Room...
VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ā‚¹5k To 25k With Room...VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ā‚¹5k To 25k With Room...
VIP Call Girls LB Nagar ( Hyderabad ) Phone 8250192130 | ā‚¹5k To 25k With Room...
Ā 
High Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
High Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsHigh Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
High Class Call Girls Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Ā 
Dividend Policy and Dividend Decision Theories.pptx
Dividend Policy and Dividend Decision Theories.pptxDividend Policy and Dividend Decision Theories.pptx
Dividend Policy and Dividend Decision Theories.pptx
Ā 
02_Fabio Colombo_Accenture_MeetupDora&Cybersecurity.pptx
02_Fabio Colombo_Accenture_MeetupDora&Cybersecurity.pptx02_Fabio Colombo_Accenture_MeetupDora&Cybersecurity.pptx
02_Fabio Colombo_Accenture_MeetupDora&Cybersecurity.pptx
Ā 
(DIYA) Bhumkar Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(DIYA) Bhumkar Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(DIYA) Bhumkar Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(DIYA) Bhumkar Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
Ā 
call girls in Nand Nagri (DELHI) šŸ” >ą¼’9953330565šŸ” genuine Escort Service šŸ”āœ”ļøāœ”ļø
call girls in  Nand Nagri (DELHI) šŸ” >ą¼’9953330565šŸ” genuine Escort Service šŸ”āœ”ļøāœ”ļøcall girls in  Nand Nagri (DELHI) šŸ” >ą¼’9953330565šŸ” genuine Escort Service šŸ”āœ”ļøāœ”ļø
call girls in Nand Nagri (DELHI) šŸ” >ą¼’9953330565šŸ” genuine Escort Service šŸ”āœ”ļøāœ”ļø
Ā 
VIP Call Girls Service Dilsukhnagar Hyderabad Call +91-8250192130
VIP Call Girls Service Dilsukhnagar Hyderabad Call +91-8250192130VIP Call Girls Service Dilsukhnagar Hyderabad Call +91-8250192130
VIP Call Girls Service Dilsukhnagar Hyderabad Call +91-8250192130
Ā 
Stock Market Brief Deck for 4/24/24 .pdf
Stock Market Brief Deck for 4/24/24 .pdfStock Market Brief Deck for 4/24/24 .pdf
Stock Market Brief Deck for 4/24/24 .pdf
Ā 
The Economic History of the U.S. Lecture 17.pdf
The Economic History of the U.S. Lecture 17.pdfThe Economic History of the U.S. Lecture 17.pdf
The Economic History of the U.S. Lecture 17.pdf
Ā 

Statistics and probability lecture 01

  • 1. Lecture No. 1 Statistics and Probability By: Muhammad Tufail
  • 2. Objective ā€¢ To inculcate in you an attitude of Statistical and Probabilistic thinking. ā€¢ To give you some very basic techniques in order to apply Statistical analysis to real- world situations/problems.
  • 3. That science which enables us to draw conclusions about various phenomena on the basis of real data collected on sample-basis ļ‚°A tool for data-based research ļ‚°Also known as Quantitative Analysis ļ‚°Any scientific enquiry in which you would like to base your conclusions and decisions on real-life data, you need to employ statistical techniques! ļ‚°Now a days, in the developed countries of the world, there is an active movement for of Statistical Literacy. WHAT IS STATISTICS?
  • 4. Application Areas A lot of application in a wide variety of disciplines ā€¦ Agriculture, Anthropology, Astronomy, Biology, Economics, Engineering, Environment, Geology, Genetics, Medicine, Physics, Psychology, Sociology, Zoology ā€¦. Virtually every single subject from Anthropology to Zoology ā€¦. A to Z!
  • 6. The primary text-book for the course is Introduction to Statistical Theory (Sixth Edition) by Sher Muhammad Chaudhry and Shahid Kamal published by Ilmi Kitab Khana, Lahore. Reference books for the course are: 1. ā€œ ā€œ by Afzal Beg & Miraj Din Mirza. 2. ā€œ ā€œ by Mohammad Rauf Chaudhry (Polymer Publications, Urdu Bazar, Lahore). 3. ā€œStatisticsā€ by James T. McClave & Frank H. Dietrich, II (Dellen Publishing Company, California, U.S.A). 4. ā€œIntroducing Statisticsā€ by K.A. Yeomans (Penguin Books Ltd., England). 5. ā€œApplied Statisticsā€ by K.A. Yeomans (Penguin Books Ltd., England). 6. ā€œBusiness Statistics for Management & Economicsā€ by Wayne W. Daniel and James C. Terrell (Houghton Mifflin Company, U.S.A.). 7. ā€œBasic Business Statisticsā€ by Berenson & Levine ( ) Text and Reference Material
  • 7. IN ACCORDANCE WITH THE ABOVE-MENTIONED STRUCTURE, THE ORGANIZATION OF THIS COURSE IS AS FOLLOWS: WEEKS LEC- TURES AREA TO BE COVERED HOME- WORK ASSIGN- MENTS EXAMS 1 TO 5 1 TO 15 DESCRIPTIVE STATISTICS 1 TO 5 MID-TERM- I 6 TO 10 16 TO 30 PROBABILITY 6 TO 10 MID-TERM- II 11 TO 15 31 TO 45 INFERENTIAL STATISTICS 11 TO 15 FINAL EXAM ORGANIZATION OF THIS COURSE
  • 8. ā€¢Appreciate the nature of statistical data. ā€¢Understand various methods of collecting statistical data. ā€¢Appreciate the importance of a proper sampling procedure. ā€¢Utilize various methods of summarizing and describing collected data. ā€¢Employ statistical techniques to understand the nature of relationship between two quantitative variables. Upon completion of the first segment, you will be able to:
  • 9. ā€¢Understand the basic concepts of probability theory (which is the foundation of statistical inference). Understand the concept of discrete probability distributions and their mathematical properties. ā€¢Understand the concept of continuous probability distributions and their mathematical properties. ā€¢Get acquainted with some of the most commonly encountered and important discrete and continuous probability distributions such as the binomial and the normal distribution. Upon completion of the second segment, you will be able to:
  • 10. Understand and employ various techniques of estimation and hypothesis-testing in order to draw reliable conclusions necessary for decision-making in various fields of human activity. Through this segment, you will be able to appreciate the purpose and the goal of the subject of Statistics. Upon completion of the third segment, you will be able to:
  • 11. There will be two term exams and one final exam. In addition, there will be 15 homework assignments. The final examination will be comprehensive in nature. (Approximately 25-30% of the final exam paper will be on the course covered upto the Mid-Term-II Exam.) These will contribute the following percentages to the final grade: Mid-Term-I: 20% Mid-Term-II: 20% Final Exam: 30% Homework Assignments: 30% GRADING
  • 13. The word ā€œdataā€ appears in many contexts and frequently is used in ordinary conversation. Although the word carries something of an aura of scientific mystique, its meaning is quite simple and mundane. It is Latin for ā€œthose that are givenā€ (the singular form is ā€œdatumā€). Data may therefore be thought of as the results of observation. The meaning of Data
  • 14. Data are collected in many aspects of everyday life. ā€¢ Statements given to a police officer or physician or psychologist during an interview are data. ā€¢ The correct and incorrect answers given by a student on a final examination. ā€¢ Almost any athletic event produces data. ā€¢ The time required by a runner to complete a marathon, ā€¢ The number of errors committed by a baseball team in nine innings of play. EXAMPLES OF DATA
  • 15.
  • 16.
  • 17.
  • 18. EXAMPLES OF DATA ā€¢ And, of course, data are obtained in the course of scientific inquiry: ā€¢ The positions of artifacts and fossils in an archaeological site, ā€¢ The number of interactions between two members of an animal colony during a period of observation, ā€¢ The spectral composition of light emitted by a star.
  • 20. Variable A quantity that, varies from an individual to individual. Variable Quantitative (Numeric) Qualitative (Non - Numeric)
  • 21. In statistics, an observation often means any sort of numerical recording of information, whether it is a physical measurement such as height or weight; a classification such as heads or tails, or an answer to a question such as yes or no. Variable: A characteristic that varies with an individual or an object, is called a variable. For example, age is a variable as it varies from person to person. A variable can assume a number of values. The given set of all possible values from which the variable takes on a value is called its Domain. If for a given problem, the domain of a variable contains only one value, then the variable is referred to as a constant. OBSERVATIONS AND VARIABLES
  • 22. Variables may be classified into quantitative and qualitative according to the form of the characteristic of interest. A variable is called a quantitative variable when a characteristic can be expressed numerically such as age, weight, income or number of children. On the other hand, if the characteristic is non- numerical such as education, sex, eye-colour, quality, intelligence, poverty, satisfaction, etc. the variable is referred to as a qualitative variable. A qualitative characteristic is also called an attribute. An individual or an object with such a characteristic can be counted or enumerated after having been assigned to one of the several mutually exclusive classes or categories. QUANTITATIVE & QUALITATIVE VARIABLES
  • 26. A quantitative variable may be classified as discrete or continuous. A discrete variable is one that can take only a discrete set of integers or whole numbers, that is, the values are taken by jumps or breaks. A discrete variable represents count data such as the number of persons in a family, the number of rooms in a house, the number of deaths in an accident, the income of an individual, etc. A variable is called a continuous variable if it can take on any value-fractional or integralā€“ā€“within a given interval, i.e. its domain is an interval with all possible values without gaps. A continuous variable represents measurement data such as the age of a person, the height of a plant, the weight of a commodity, the temperature at a place, etc. A variable whether countable or measurable, is generally denoted by some symbol such as X or Y and Xi or Xj represents the ith or jth value of the variable. The subscript i or j is replaced by a number such as 1,2,3, ā€¦ when referred to a particular value. DISCRETE AND CONTINUOUS VARIABLES:
  • 27. Measurement Scales Measurement Scales Nominal Scale Ordinal Scale Interval Scale Ratio Scale
  • 28. By measurement, we usually mean the assigning of number to observations or objects and scaling is a process of measuring. The four scales of measurements are briefly mentioned below: NOMINAL SCALE The classification or grouping of the observations into mutually exclusive qualitative categories or classes is said to constitute a nominal scale. For example, students are classified as male and female. Number 1 and 2 may also be used to identify these two categories. Similarly, rainfall may be classified as heavy moderate and light. We may use number 1, 2 and 3 to denote the three classes of rainfall. The numbers when they are used only to identify the categories of the given scale, carry no numerical significance and there is no particular order for the grouping. MEASUREMENT SCALES
  • 29. MEASUREMENT SCALES (Cont.) ORDINAL OR RANKING SCALE It includes the characteristic of a nominal scale and in addition has the property of ordering or ranking of measurements. For example, the performance of students (or players) is rated as excellent, good fair or poor, etc. Number 1, 2, 3, 4 etc. are also used to indicate ranks. The only relation that holds between any pair of categories is that of ā€œgreater thanā€ (or more preferred).
  • 30. INTERVAL SCALE A measurement scale possessing a constant interval size (distance) but not a true zero point, is called an interval scale. Temperature measured on either the Celcius or the Fahrenheit scale is an outstanding example of interval scale because the same difference exists between 20o C (68o F) and 30o C (86o F) as between 5o C (41o F) and 15o C (59o F). It cannot be said that a temperature of 40 degrees is twice as hot as a temperature of 20 degree, i.e. the ratio 40/20 has no meaning. The arithmetic operation of addition, subtraction, etc. are meaningful. RATIO SCALE It is a special kind of an interval scale where the sale of measurement has a true zero point as its origin. The ratio scale is used to measure weight, volume, distance, money, etc. The, key to differentiating interval and ratio scale is that the zero point is meaningful for ratio scale. MEASUREMENT SCALES (Cont.)
  • 31. Example Chemical and manufacturing plants sometimes discharge toxic-waste materials such as DDT into nearby rivers and streams These toxins can adversely affect the plants and animals inhabiting the river and the river bank.
  • 32. A study of fish was conducted in the Tennessee River in Alabama and its three tributary creeks: Flint creek, Limestone creek and Spring creek. A total of 144 fish were captured, and the following variable measured for each one:
  • 33. 1. River/Creek from where fish was captured 2. Species of fish (Channel fish, Largemouth bass or smallmouth buffalo fish) 3. Length of fish (Centimeters) 4. Weight of fish (grams) 5. DDT concentration in the bodily system of the fish (parts per million)
  • 34. Classify each of the five variables measured as quantitative or qualitative. Also, identify the types of measurement scales for each of the five variables.
  • 35. Solution The variables Length, weight and DDT concentration are quantitative variables because each is measured on a nominal scale (Length is centimeters, Weight is grams and DDT in parts per million). All three of these variables are being measured on the Ratio Scale.
  • 36. Rationale Whenever we speak about the weight of an object, obviously, if our measuring instrument reads ā€˜zeroā€™, this means that the object being measured has zero weight --- and, in this sense, the ā€˜zeroā€™ would be a true zero. An exactly similar argument holds for the length of an object.
  • 37. As far as DDT concentration in the bodily system of the fish is concerned, obviously, if there is absolutely no DDT in the fish, then the DDT concentration reads zero --- and, this particular ā€˜zeroā€™ reading will be true zero.
  • 38. As, explained above, the three variables length of fish, weight of fish and DDT concentration in the bodily system of the fish are quantitative variables measures on the ratio scale. In contrast:
  • 39. Data on River/Creek from which the fish were captured, and the species of fish are qualitative data. Both of these variables are measured on Nominal Scale.
  • 40. Rationale The river/creek from which the fish were captured, and the species of fish are qualitative data because these can not be measured quantitatively, they can only be classified into categories. (i.e. Channel fish, Largemouth bass or smallmouth buffalo fish for the species and Tennessee River, Flint creek, Limestone creek and Spring creek)
  • 41. The Statistical methods for describing, reporting and analyzing data depend on the type of data measured (i.e. whether data are quantitative or qualitative).
  • 42. Experience has shown that a continuous variable can never be measured with perfect fineness because of certain habits and practices, methods of measurements, instruments used, etc. the measurements are thus always recorded correct to the nearest units and hence are of limited accuracy. The actual or true values are, however, assumed to exist. For example, if a studentā€™s weight is recorded as 60 kg (correct to the nearest kilogram), his true weight in fact lies between 59.5 kg and 60.5 kg, whereas a weight recorded as 60.00 kg means the true weight is known to lie between 59.995 and 60.005 kg. Thus there is a difference, however small it may be between the measured value and the true value. This sort of departure from the true value is technically known as the error of measurement. In other words, if the observed value and the true value of a variable are denoted by x and x + Īµ respectively, then the difference (x + Īµ) ā€“ x, i.e. Īµ is the error. This error involves the unit of measurement of x and is therefore called an absolute error. An absolute error divided by the true value is called the relative error. Thus the relative error, which when multiplied by 100, is percentage error. These errors are independent of the units of measurement of x. It ought to be noted that an error has both magnitude and direction and that the word error in statistics does not mean mistake which is a chance inaccuracy. ERRORS OF MEASUREMENT
  • 43. Errors of Measurements Errors of Measurements Biased Errors Cumulative Errors Systematic Errors Random Errors Compensating Errors Accidental Errors
  • 44. An error is said to be biased when the observed value is consistently and constantly higher or lower than the true value. Biased errors arise from the personal limitations of the observer, the imperfection in the instruments used or some other conditions which control the measurements. These errors are not revealed by repeating the measurements. They are cumulative in nature, that is, the greater the number of measurements, the greater would be the magnitude of error. They are thus more troublesome. These errors are also called cumulative or systematic errors. An error, on the other hand, is said to be unbiased when the deviations, i.e. the excesses and defects, from the true value tend to occur equally often. Unbiased errors and revealed when measurements are repeated and they tend to cancel out in the long run. These errors are therefore compensating and are also known as random errors or accidental errors. BIASED AND RANDOM ERRORS
  • 45. Statistical Inference A Statistical Inference in an estimate or prediction or some other generalization about a population based on information contained in sample. That is, we use information contained in sample to learn about the larger population.
  • 46. Population and Sample Population: The collection of all individuals, items or data under consideration in a statistical study. Sample: That part of the population from which information is collected.
  • 48. Five Elements of an Inferencial Statistical Problem: ā€¢ A population ā€¢ One or more variables of interest ā€¢ A sample ā€¢ An Inference ā€¢ A measure of Reliability
  • 49. In order of understand the concept of Reliability, a very important point to be understood is that making an inference about population from the sample is only part of the story. We also need to know its reliability --- that is, how good our inference is.
  • 50. Measure of Reliability A measure of reliability is a statement (usually quantified) about the degree of uncertainty associated with a statistical inference.
  • 51. The point to be noted is that the only way we can be certain that an inference about population is correct is to include the entire population in our sample. However, because of resource constraints, (i.e. Insufficient time and/ or money). We usually can not work with whole population, so we base our inference on just a portion of population (i.e. Sample)
  • 52. Consequently, whenever possible, it is important to determine and report the reliability of each inference made. As such, reliability is the fifth element of statistical inferencial problems.
  • 53. Example A large paint retailer has had numerous complaints from customers about under- filled paint cans. As, a result retailer has begun inspecting incoming shipments of paint from suppliers. Shipments with under-filled problems will be sent back to supplier.
  • 54. A recent shipment contained 2,440 gallon- size cans. The retailer sampled 50 cans and weighted each on a scale capable of measuring weight to four decimal places. Properly filled cans weigh 10 pounds.
  • 55. a) Describe a population b) Describe a variable of interest c) Describe a sample d) Describe the Inference e) Describe a measure of uncertainty of our inference.
  • 56. Solution a) The population is the set of units of interests to the retailer, which is the shipment of 2,440 cans of paint. b) The weight of paint cans is the variable, the retailer wishes to evaluate.
  • 57. c) The sample is the subset of population. In this case, it is the 50 cans of paint selected by the retailer.
  • 58. d) The inference of interest involves the generalization of the information contained in the sample of paint cans to the population of paint cans.
  • 59. In particular, Retailer wants to learn about the content of under-filled problem (if any) In the population. This might be accomplished by finding the average weight of the cans in the sample, and using it to estimate the average weight of the cans of population.
  • 60. e) As far as the measure of reliability of our inference is concerned, the point to be noted is that, using statistical methods, we can determine a bound on the estimation error.
  • 61. Bound on the Estimation Error This bound is simply a number that our estimation error (i.e. the difference between the average weight of sample and average weight of population of cans) is not likely to exceed.
  • 62. This bound is a measure of the uncertainty of our inference, or, in other words, the reliability of statistical inference. The crux of the matter is that an inference is incomplete without a measure of its reliability
  • 63. When the weights of 50 paint cans are used to estimate the average weight of all the cans, the estimate will not exactly mirror the entire population. For Example:
  • 64. If the sample of 50 cans yields a mean weight of 9 pounds, it does not follow (nor is it likely) that the mean weight of population of can is also exactly 9 pounds.
  • 65. Nevertheless, we can use sound statistical reasoning to ensure that our sampling procedure will generate estimate that is almost certainly within a specified limit of the true mean weight of all the cans.
  • 66. For example such reasoning might assure us that the estimate of the population from the sample is almost certainly within 1 pound of the actual population mean. The implication is that the actual mean weight of the entire population of the cans is between 9 ā€“ 1=8 pounds and 9 +1=10 pounds --- that is, (9 Ā± 1) pounds. This interval represents the a measure of reliability for the inference.
  • 67. IN TODAYā€™S LECTURE, YOU LEARNT: ā€¢ The nature of the science of Statistics ā€¢ The importance of Statistics in various fields ā€¢ Some technical concepts such as ā€“ The meaning of ā€œdataā€ ā€“ Various types of variables ā€“ Various types of measurement scales ā€“ The concept of errors of measurement
  • 68. IN THE NEXT LECTURE, YOU WILL LEARN: ā€¢ Concept of sampling ā€“ Random verses non-random sampling ā€“ Simple random sampling ā€“ A brief introduction to other types of random sampling ā€¢ Methods of data collection In other words, you will begin your journey in a subject with reference to which it has been said that ā€œstatistical thinking will one day be as necessary for efficient citizenship as the ability to read and writeā€.