An introduction to statistical inference
Dr. Abhay Pratap Pandey
University of Delhi
What is inference?
Inference defined:
• An everyday meaning…
We infer a conclusion based on evidence and reasoning
• A statistical meaning…
We infer a property of a population from a sample
Why inference?
The aim of inference is to determine the characteristics of a population
from a sample.
[Diagram: a sample drawn from a population]
Population and sample
In statistical analysis, a population is a collection of all the
people, items, or events about which one wants to make
inferences. OR
Any well-defined group of subjects, which could be
individuals, firms, cities, or many other possibilities
(For example university students in India.)
In statistical analysis, a sample is a subset of the population
(i.e. the people, items, or events) that one collects and
analyzes to make inferences. (For example 200 randomly
chosen university students.)
Statistical sample - Subset of the population chosen to represent the
population in a statistical analysis; denoted as (X1,X2, ... Xn).
Random sample - a sample of individuals chosen at random from the
population.
In the case of random sampling, the following techniques can be used:
Independent sampling (draw with replacement) - after each draw the
unit returns to the population.
Dependent sampling (draw without replacement) - after each draw the
unit does not return to the population (it no longer participates in the
drawing).
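The two drawing schemes can be sketched in a few lines of Python. This is only an illustration: the population of 20 numbered units, the sample size of 10, and the function names are assumptions, not part of the slides.

```python
import random

def draw_with_replacement(population, n, rng):
    """Independent sampling: each drawn unit returns to the population."""
    return [rng.choice(population) for _ in range(n)]

def draw_without_replacement(population, n, rng):
    """Dependent sampling: a drawn unit cannot be drawn again."""
    return rng.sample(population, n)

rng = random.Random(0)            # fixed seed so the sketch is reproducible
population = list(range(1, 21))   # hypothetical population of 20 unit IDs

with_repl = draw_with_replacement(population, 10, rng)
without_repl = draw_without_replacement(population, 10, rng)
```

Drawing with replacement may return the same unit more than once; drawing without replacement never does.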
In statistical analysis, an observation is an element of the sample. (For
example Helena, a student at Central University.)
[Diagram: statistical inference, comprising sampling, estimation, and testing of hypothesis]
Aim of statistical inference
The aim of statistical inference is to learn about the population using the observed
data
This involves:
• computing something with the data
  • a statistic: a function of the data
• interpreting the result
  • in probabilistic terms: the sampling distribution of the statistic
Estimation
• Determination of the population parameter by the calculation of a
sample statistic…
Group        Characteristic   Symbol
Population   Parameter        μ
Sample       Statistic        x̄
A sampling distribution is a probability distribution of a statistic obtained
through a large number of samples drawn from a specific population.
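A small simulation can make the definition concrete. The sketch below assumes a hypothetical population of 10,000 normally distributed values (mean 50, standard deviation 10) and draws 2,000 samples of size 25; none of these numbers come from the slides.

```python
import random
import statistics

rng = random.Random(1)
# Hypothetical population: 10,000 values from a normal distribution (mean 50, sd 10)
population = [rng.gauss(50, 10) for _ in range(10_000)]
mu = statistics.fmean(population)  # the population parameter

def sample_means(population, n, repeats, rng):
    """Draw `repeats` samples of size n and return the mean of each one."""
    return [statistics.fmean(rng.sample(population, n)) for _ in range(repeats)]

means = sample_means(population, n=25, repeats=2000, rng=rng)
centre = statistics.fmean(means)   # centre of the sampling distribution
spread = statistics.stdev(means)   # its spread (the standard error)
```

Under these assumptions the sample means cluster around the population mean, with a spread close to 10/√25 = 2.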
[Figure: repeated samples from a population with parameter μ give different statistics x̄1, x̄2, x̄3; their distribution is the sampling distribution. Estimates are not perfect: they carry uncertainty.]
Types of estimators in statistics
Estimator
An estimator is a statistic (a function of the data) that produces a guess of the
value of a population parameter.
By the “best” estimator we usually mean one whose sampling distribution is more
concentrated about the population parameter value than that of other
estimators.
The two main types of estimators in statistics are
• Point estimators
• Interval estimators
Point estimation: Point estimators are functions that are used to find an
approximate value of a population parameter from random samples of the
population. They use the sample data of a population to calculate a point
estimate or a statistic that serves as the best estimate of an
unknown parameter of a population. We want to estimate a population
parameter using the observed data.
Ex. some measure of variation, an average, min, max, quantile, etc.
• Interval estimation
Interval estimation uses sample data to calculate an interval
of plausible values for an unknown parameter of a
population. The interval is constructed so that, over repeated
sampling, it contains the parameter with a stated probability
(commonly 95% or higher); such an interval is known as a
confidence interval. The confidence interval is
used to indicate how reliable an estimate is, and it is
calculated from the observed data. The endpoints of the
interval are referred to as the upper and lower confidence
limits.
Properties of Point Estimators
• Unbiasedness
• Consistency
• Sufficiency
• Efficiency
Unbiasedness
An estimator of a given parameter is said to be unbiased if its expected
value is equal to the true value of the parameter.
The bias of a point estimator is defined as the difference between
the expected value of the estimator and the value of the parameter being
estimated: Bias = E[estimator] - parameter. When the bias is zero, the
estimator is unbiased.
Also, the closer the expected value of an estimator is to the value of the
parameter being estimated, the smaller the bias.
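As an illustration of bias, the sketch below compares two estimators of a population variance: one divides the sum of squared deviations by n, the other by n - 1. The normal population with standard deviation 2, the sample size of 5, and the number of repetitions are assumptions chosen for the example.

```python
import random
import statistics

rng = random.Random(2)
TRUE_VARIANCE = 4.0        # hypothetical population: normal with sd 2
N, REPEATS = 5, 10_000     # small samples make the bias easy to see

divide_by_n, divide_by_n_minus_1 = [], []
for _ in range(REPEATS):
    sample = [rng.gauss(0.0, 2.0) for _ in range(N)]
    divide_by_n.append(statistics.pvariance(sample))         # divides by n
    divide_by_n_minus_1.append(statistics.variance(sample))  # divides by n - 1

# Estimated bias = average of the estimates minus the true parameter value
bias_n = statistics.fmean(divide_by_n) - TRUE_VARIANCE
bias_n_minus_1 = statistics.fmean(divide_by_n_minus_1) - TRUE_VARIANCE
```

Under these assumptions the divide-by-n estimator underestimates the variance by about σ²/n = 0.8 on average, while the divide-by-(n - 1) estimator shows a bias close to zero.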
Consistency
Consistency tells us how close the point estimator stays to the value of
the parameter as the sample increases in size. A consistent point
estimator becomes more accurate as the sample size grows. You can also
check if a point estimator is consistent by looking at its
expected value and variance. It is sufficient that, as the sample size
increases, the expected value moves toward the true value of the
parameter and the variance shrinks toward zero.
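A rough way to see consistency is to watch the spread of an estimator shrink as the sample grows. The sketch below does this for the sample mean, assuming a hypothetical normal population with mean 5 and standard deviation 2.

```python
import random
import statistics

rng = random.Random(3)
TRUE_MEAN = 5.0  # hypothetical parameter value

def spread_of_mean(n, repeats=500):
    """Standard deviation of the sample mean across repeated samples of size n."""
    means = [
        statistics.fmean([rng.gauss(TRUE_MEAN, 2.0) for _ in range(n)])
        for _ in range(repeats)
    ]
    return statistics.stdev(means)

# The spread should fall roughly like 2 / sqrt(n) as n increases
spreads = {n: spread_of_mean(n) for n in (10, 100, 1000)}
```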
Maximum likelihood estimator
The maximum likelihood method of point estimation
finds the parameter values that maximize the likelihood
function. It takes an assumed model and selects the parameter values
under which the observed data are most probable.
For example, a researcher may be interested in knowing the average
weight of babies born prematurely. Since it would be impossible to
measure all babies born prematurely in the population, the researcher
can take a sample from one location. If the weight of pre-term
babies can be assumed to follow a normal distribution, the researcher can use the
maximum likelihood estimator to estimate the average weight of the entire
population of pre-term babies based on the sample data.
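For a normal model the maximum likelihood estimates have a closed form: the sample mean, and the standard deviation computed with a divisor of n. The sketch below checks this numerically; the eight weights are made-up illustrative numbers, not real data.

```python
import math
import statistics

# Hypothetical birth weights in grams (illustrative values only)
weights = [1850, 2100, 1980, 2250, 1720, 2040, 1900, 2160]

def normal_log_likelihood(data, mu, sigma):
    """Log-likelihood of the data under a normal(mu, sigma) model."""
    n = len(data)
    return (-n * math.log(sigma * math.sqrt(2 * math.pi))
            - sum((x - mu) ** 2 for x in data) / (2 * sigma ** 2))

mu_hat = statistics.fmean(weights)       # MLE of the mean
sigma_hat = statistics.pstdev(weights)   # MLE of the sd (divides by n)
best = normal_log_likelihood(weights, mu_hat, sigma_hat)
```

Moving either parameter away from these values, for example `normal_log_likelihood(weights, mu_hat + 10, sigma_hat)`, gives a lower log-likelihood.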
Method of moments
The method of moments of estimating parameters was introduced in
1887 by Russian mathematician Pafnuty Chebyshev. It starts by taking
known facts about a population and then applying the facts to a sample
of the population. The first step is to derive equations that relate the
population moments to the unknown parameters.
The next step is to draw a sample of the population to be used to
estimate the population moments. The equations derived in step one
are then solved using the sample moments in place of the population moments.
This produces an estimate of the unknown population
parameters.
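The two steps can be sketched for a gamma distribution, which is an assumption made here for illustration (the slides do not name a distribution). Its mean is shape × scale and its variance is shape × scale², so solving these two equations gives shape = mean²/variance and scale = variance/mean.

```python
import random

rng = random.Random(5)
TRUE_SHAPE, TRUE_SCALE = 3.0, 2.0   # hypothetical parameters to recover
data = [rng.gammavariate(TRUE_SHAPE, TRUE_SCALE) for _ in range(20_000)]

# Step 2: sample moments stand in for the population moments
n = len(data)
m1 = sum(data) / n                                  # first moment (mean)
m2_central = sum((x - m1) ** 2 for x in data) / n   # second central moment (variance)

# Step 1 solved for the parameters: mean = shape*scale, variance = shape*scale**2
shape_hat = m1 ** 2 / m2_central
scale_hat = m2_central / m1
```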
What is Confidence Interval?
A confidence interval is an interval estimate in statistics that may
contain a population parameter. The unknown population parameter is
estimated through a sample statistic calculated from the sampled data.
For example, the population mean μ is estimated using the sample mean x̄.
The interval is generally defined by its lower and upper bounds. The
confidence level is expressed as a percentage (the most frequently
quoted percentages are 90%, 95%, and 99%), and the interval is named
after it, for example a 95% confidence interval.
The concept of the confidence interval is very important in statistics
(hypothesis testing) since it is used as a measure of uncertainty. The
concept was introduced by Polish mathematician and statistician, Jerzy
Neyman in 1937.
Confidence Interval
We can also quantify the uncertainty (sampling distribution) of our
point estimate.
One way of doing this is by constructing an interval that is likely to
contain the population parameter.
One such interval, which is computed on the basis of the data, is
called a confidence interval.
The sampling probability that the confidence interval will indeed
contain the parameter value is called the confidence level.
We construct confidence intervals for a given confidence level.
Interpretation of Confidence Interval
The proper interpretation of a confidence interval is probably the most
challenging aspect of this statistical concept. The interpretation that
follows from its definition is the following:
If the sampling were repeated many times, about 95% of the intervals
constructed in this way would contain the true value of the
population parameter (e.g., mean).
In addition, we may interpret the confidence interval using the statement
below:
We are 95% confident that the interval between X [lower bound] and Y
[upper bound] contains the true value of the population parameter.
However, it would be inappropriate to state the following:
There is a 95% probability that the interval between X [lower bound] and
Y [upper bound] contains the true value of the population parameter.
Once an interval has been computed, the fixed parameter value either lies
inside it or does not; the 95% refers to the procedure, not to one interval.
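The repeated-sampling reading of the confidence level can be checked by simulation. The sketch below assumes a hypothetical normal population (mean 100, known standard deviation 15), samples of size 30, and 4,000 repetitions; it counts how often a 95% interval for the mean contains the true mean.

```python
import math
import random
from statistics import NormalDist, fmean

rng = random.Random(7)
MU, SIGMA, N = 100.0, 15.0, 30    # hypothetical population mean, sd, and sample size
z = NormalDist().inv_cdf(0.975)   # about 1.96 for a 95% confidence level

def interval_covers_mu():
    """Draw one sample, build the 95% interval, report whether it contains MU."""
    sample = [rng.gauss(MU, SIGMA) for _ in range(N)]
    centre = fmean(sample)
    margin = z * SIGMA / math.sqrt(N)
    return centre - margin <= MU <= centre + margin

repeats = 4000
coverage = sum(interval_covers_mu() for _ in range(repeats)) / repeats
```

Under these assumptions `coverage` comes out close to 0.95: roughly 95% of the intervals contain the true mean, while any single interval either does or does not.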
How to Calculate the Confidence Interval?
The interval is calculated using the following steps:
• Gather the sample data.
• Calculate the sample mean x̄.
• Determine whether the population's standard deviation is known or
unknown.
• If the population's standard deviation is known, we can use a z-score for
the corresponding confidence level.
• If the population's standard deviation is unknown, we can use a
t-statistic for the corresponding confidence level.
• Find the lower and upper bounds of the confidence interval using the
following formulas:
a. Known population standard deviation σ: x̄ ± z(α/2) × σ/√n
b. Unknown population standard deviation (sample standard deviation s): x̄ ± t(α/2, n-1) × s/√n
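The steps above can be sketched in Python as follows. The ten sample values, the assumed known σ of 0.2, and the function names are hypothetical. The Python standard library has no t distribution, so the t critical value is passed in by hand (2.262 is the tabulated value for 9 degrees of freedom at the 95% level).

```python
import math
from statistics import NormalDist, fmean, stdev

def ci_known_sigma(sample, sigma, level=0.95):
    """Case a: x_bar +/- z(alpha/2) * sigma / sqrt(n)."""
    z = NormalDist().inv_cdf(1 - (1 - level) / 2)
    centre = fmean(sample)
    margin = z * sigma / math.sqrt(len(sample))
    return centre - margin, centre + margin

def ci_unknown_sigma(sample, t_crit):
    """Case b: x_bar +/- t * s / sqrt(n); t_crit is looked up for n - 1 df."""
    centre = fmean(sample)
    margin = t_crit * stdev(sample) / math.sqrt(len(sample))
    return centre - margin, centre + margin

sample = [4.8, 5.1, 5.0, 4.7, 5.3, 5.2, 4.9, 5.0, 5.1, 4.9]  # hypothetical data
low_z, high_z = ci_known_sigma(sample, sigma=0.2)
low_t, high_t = ci_unknown_sigma(sample, t_crit=2.262)  # t(0.025, 9 df) from a t-table
```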
Examples
• Suppose we conduct a poll to try and get a sense of the outcome of an
upcoming election with two candidates. We poll 1000 people, and 550 of
them respond that they will vote for candidate A.
Within what range can we be confident that the true share of voters
supporting candidate A lies?
Sol.
1. Select our desired levels of confidence. We're going to use the 90%,
95%, and 99% levels.
2. Calculate α and α/2. Our α values are 0.1, 0.05, and 0.01 respectively.
Our α/2 values are 0.05, 0.025, and 0.005.
3. Look up the corresponding z-scores. Our Z(α/2) values are 1.645, 1.96,
and 2.58.
4. Multiply the z-score by the standard error to find the margin of error.
First we need to calculate the standard error.
5. Find the interval by adding and subtracting this product from the mean.
In this case, we are working with a distribution we have not previously
discussed: a binomial distribution (each voter chooses candidate A or B),
which for a sample this large can be approximated by a normal distribution.
We have a probability estimate from our sample: the proportion of
individuals in our sample voting for candidate A was 550/1000,
or 0.55.
We can use this information in a formula to estimate the standard error for
such a distribution.
Step 4 (continued): multiply the z-score by the standard error
• For the normal approximation to the binomial distribution, the standard
error can be estimated using:
S.E. = √(p(1 - p)/n) = √(0.55 × 0.45 / 1000) = 0.0157
• We can now multiply this value by the z-scores to calculate the
margins of error for each confidence level.
Multiply the z-score by the standard error (continued)
• We calculate the margin of error and add and subtract that value
from the mean (0.55 in this case) to find the bounds of our confidence
intervals at each level of confidence:
Confidence level   Z(α/2)   Margin of error   Lower bound   Upper bound
90%                1.645    0.026             0.524         0.576
95%                1.96     0.031             0.519         0.581
99%                2.58     0.041             0.509         0.591
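The table can be reproduced with a short script. This is a sketch using the poll numbers from the example; the z-scores are computed with the standard library's `NormalDist` instead of being read from a table, so they differ slightly from the rounded table values (for example 2.576 instead of 2.58) without changing the results to three decimals.

```python
import math
from statistics import NormalDist

n, votes_a = 1000, 550
p_hat = votes_a / n                        # sample proportion, 0.55
se = math.sqrt(p_hat * (1 - p_hat) / n)    # standard error, about 0.0157

intervals = {}
for level in (0.90, 0.95, 0.99):
    z = NormalDist().inv_cdf(1 - (1 - level) / 2)   # z-score for alpha/2
    margin = z * se
    # (margin of error, lower bound, upper bound), rounded as in the table
    intervals[level] = (round(margin, 3),
                        round(p_hat - margin, 3),
                        round(p_hat + margin, 3))
```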
What is Hypothesis Testing?
Hypothesis Testing is a method of statistical inference. It is used to test
whether a statement regarding a population parameter is supported by
the sample data. Hypothesis testing is a powerful tool for checking
predictions against evidence.
For example: A business owner might want to make a prediction of the
mean value a customer would pay for his firm's product. He can then
formulate a hypothesis, for example, “The average value that
customers will pay for my product is larger than $5”. To statistically test
this statement, the business owner could use hypothesis testing.
Hypothesis testing is formulated in terms of two hypotheses:
• H0: the null hypothesis;
• H1: the alternative hypothesis.
What we want to test is whether H1 is “likely” true.
So, there are two possible outcomes:
• Reject H0 and accept H1 because of sufficient evidence in the sample
in favor of H1;
• Do not reject H0 because of insufficient evidence to support H1.
Null Hypothesis and Alternative Hypothesis
• Null Hypothesis
• Alternative Hypothesis
The Null Hypothesis is usually set as what we don't want to be true. It is
the hypothesis to be tested. Therefore, the Null Hypothesis is considered
to be true, until we have sufficient evidence to reject it. If we reject the
null hypothesis, we are led to the alternative hypothesis.
Consider the example of the business owner who is looking for some customer insight.
His null hypothesis would be:
H0: The average value customers are willing to pay for my product is
smaller than or equal to $5, or H0: µ ≤ 5 (µ = the population mean)
The alternative hypothesis would then be what we are evaluating, so, in
this case, it would be:
Ha: The average value customers are willing to pay for the product is
greater than $5, or Ha: µ > 5
Type I and Type II Errors
A Type I Error arises when a true Null Hypothesis is rejected. The
probability of making a Type I Error is also known as the level of
significance of the test, which is commonly referred to as alpha (α). So,
for example, if a test has its alpha set at 0.01, there is a 1%
probability of rejecting a true null hypothesis, that is, a 1% probability of
making a Type I Error.
A Type II Error arises when you fail to reject a false Null Hypothesis.
The probability of making a Type II Error is commonly denoted by the
Greek letter beta (β). β is used to define the Power of a Test, which is
the probability of correctly rejecting a false null hypothesis.
The Power of a Test is defined as 1-β. A test with more Power is more
desirable, as there is a lower probability of making a Type II Error.
However, there is a tradeoff between the probability of making a Type I
Error and the probability of making a Type II Error: for a fixed sample
size, lowering one tends to raise the other.
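Both error rates can be estimated by simulation. The sketch below assumes a right-tail z-test of H0: μ ≤ 0 with known σ = 1, samples of size 25, and α = 0.05; the true mean of 0.5 used for the power calculation is also an assumption.

```python
import math
import random
from statistics import NormalDist, fmean

rng = random.Random(11)
ALPHA, SIGMA, N, MU0 = 0.05, 1.0, 25, 0.0   # hypothetical test settings
z_crit = NormalDist().inv_cdf(1 - ALPHA)    # right-tail critical value, about 1.645

def rejection_rate(true_mu, repeats=4000):
    """Share of simulated samples in which H0: mu <= MU0 is rejected."""
    rejections = 0
    for _ in range(repeats):
        sample = [rng.gauss(true_mu, SIGMA) for _ in range(N)]
        z = (fmean(sample) - MU0) / (SIGMA / math.sqrt(N))
        rejections += z > z_crit
    return rejections / repeats

type_i_rate = rejection_rate(true_mu=0.0)   # H0 true: rate should be near ALPHA
power = rejection_rate(true_mu=0.5)         # H0 false: rate estimates 1 - beta
```

Under these assumptions the Type I error rate comes out near 0.05 and the power near 0.80, so β is roughly 0.20.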
Properties of hypothesis testing
• Significance level - the maximum probability of committing a Type I
error. This probability is symbolized by α.
P(Type I error | H0 is true) = α.
• Critical or Rejection Region - the range of values for the test value
that indicate a significant difference and that the null hypothesis
should be rejected.
• Non-critical or Non-rejection Region - the range of values for the test
value that indicates that the difference was probably due to chance
and that the null hypothesis should not be rejected.
[Figures: rejection regions for a one-tail test (right tail), a left-tail test, and a two-tail test]
Steps in hypothesis testing
Testing a hypothesis about the mean of a population
We have the following steps:
1. Data: determine variable, sample size (n), sample mean (x̄),
population standard deviation (σ), or sample standard deviation (s) if σ is
unknown
2. Assumptions : We have two cases:
Case 1: Population is normally or approximately normally distributed
with known or unknown variance (sample size n may be small or large),
Case 2: Population is not normal with known or unknown variance (n is
large, i.e. n ≥ 30).
3. Hypothesis: we have three cases
Case I : H0: μ = μ0 vs HA: μ ≠ μ0
e.g. we want to test that the population mean is different from 50
Case II : H0: μ = μ0 vs HA: μ > μ0
e.g. we want to test that the population mean is greater than 50
Case III : H0: μ = μ0 vs HA: μ < μ0
e.g. we want to test that the population mean is less than 50
Example
• Researchers are interested in the mean age of a certain population.
• A random sample of 10 individuals drawn from the population of
interest has a mean of 27.
• Assuming that the population is approximately normally distributed
with variance 20, can we conclude that the mean is different from 30
years? (α = 0.05)
• If the p-value is 0.0340, how can we use it in making a decision?
Solution
1. Data: variable is age, n = 10, x̄ = 27, σ² = 20, α = 0.05
2. Assumptions: the population is approximately normally distributed with
variance 20
3. Hypotheses:
• H0: μ = 30
• HA: μ ≠ 30
4. Test Statistic:
• Z = (x̄ - μ0) / (σ/√n) = (27 - 30) / √(20/10) = -2.12
5. Decision Rule
The alternative hypothesis is HA: μ ≠ 30
Hence we reject H0 if Z > Z(1-0.025) = Z(0.975)
• or Z < -Z(1-0.025) = -Z(0.975)
• Z(0.975) = 1.96 (from table D)
6. Decision:
• We reject H0, since -2.12 is in the rejection region.
• We can conclude that μ is not equal to 30.
• Using the p-value, we note that p-value = 0.0340 < 0.05, therefore we
reject H0.
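The worked example can be checked with a few lines of Python. This is a sketch using the standard library's `NormalDist` in place of the z table; the variable names are illustrative.

```python
import math
from statistics import NormalDist

n, x_bar, mu0, variance, alpha = 10, 27, 30, 20, 0.05

z = (x_bar - mu0) / math.sqrt(variance / n)    # test statistic, about -2.12
z_crit = NormalDist().inv_cdf(1 - alpha / 2)   # two-tail critical value, about 1.96
p_value = 2 * NormalDist().cdf(-abs(z))        # two-sided p-value, about 0.034

reject_h0 = abs(z) > z_crit                    # same decision as p_value < alpha
```

Both routes agree: the statistic falls in the rejection region and the p-value is below α, so H0 is rejected.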
Thank you

Isios-2024-Professional-Independent-Trustee-Survey.pdf
Ā 
What price will pi network be listed on exchanges
What price will pi network be listed on exchangesWhat price will pi network be listed on exchanges
What price will pi network be listed on exchanges
Ā 
Turin Startup Ecosystem 2024 - Ricerca sulle Startup e il Sistema dell'Innov...
Turin Startup Ecosystem 2024  - Ricerca sulle Startup e il Sistema dell'Innov...Turin Startup Ecosystem 2024  - Ricerca sulle Startup e il Sistema dell'Innov...
Turin Startup Ecosystem 2024 - Ricerca sulle Startup e il Sistema dell'Innov...
Ā 
Chʰʔng 6. Ancol - phenol - ether (1).pdf
Chʰʔng 6. Ancol - phenol - ether (1).pdfChʰʔng 6. Ancol - phenol - ether (1).pdf
Chʰʔng 6. Ancol - phenol - ether (1).pdf
Ā 

Inferential statistics

  • 1. An introduction to statistical inference Dr. Abhay Pratap Pandey University of Delhi
  • 2. What is inference? Inference defined: • An everyday meaning… We infer a conclusion based on evidence and reasoning. • A statistical meaning… We infer a property of a population from a sample.
  • 3. Why inference? The aim of inference is to determine the characteristics of a population from a sample. Population Sample
  • 5. Population and sample In statistical analysis, a population is the collection of all the people, items, or events about which one wants to make inferences; equivalently, any well-defined group of subjects, which could be individuals, firms, cities, or many other possibilities (for example, university students in India). A sample is a subset of the population (i.e. of the people, items, or events) that one collects and analyzes to make inferences (for example, 200 randomly chosen university students).
  • 6. Statistical sample: a subset of the population chosen to represent the population in a statistical analysis; denoted (X1, X2, ..., Xn). Random sample: a sample of individuals chosen at random from the population. In random sampling, the following techniques can be used: Independent sampling (drawing with replacement): after each draw the unit returns to the population. Dependent sampling (drawing without replacement): after each draw the unit does not return to the population (it no longer participates in the drawing). In statistical analysis, an observation is an element of the sample (for example, Helena, a student at Central University).
  • 8. Aim of statistical inference The aim of statistical inference is to learn about the population using the observed data. This involves: • computing something with the data (a statistic: a function of the data) • interpreting the result in probabilistic terms (the sampling distribution of the statistic)
  • 9. Estimation • Determination of the population parameter by the calculation of a sample statistic… Characteristic Population Parameter μ Sample Statistic x̄
  • 11. A sampling distribution is the probability distribution of a statistic obtained from a large number of samples drawn from a specific population. Population parameter μ Sample Statistic x̄1 Sample Statistic x̄2 Sample Statistic x̄3 Uncertainty: estimates are not perfect. Sampling distribution
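The idea of a sampling distribution can be illustrated with a small simulation. This is only a sketch under assumed values: the population (normal with mean 50 and standard deviation 10), the sample size of 25, and the 2,000 repetitions are hypothetical choices, not taken from the slides.

```python
import random
from statistics import mean, pstdev

rng = random.Random(0)  # fixed seed so the run is repeatable

# Hypothetical population: 10,000 values with mean near 50 and spread near 10.
population = [rng.gauss(50, 10) for _ in range(10_000)]

# Draw many samples (without replacement) and record each sample mean.
sample_size = 25
sample_means = [mean(rng.sample(population, sample_size)) for _ in range(2_000)]

# The sample means cluster around the population mean,
# with far less spread than the individual observations.
print("population mean:      ", round(mean(population), 2))
print("mean of sample means: ", round(mean(sample_means), 2))
print("population std dev:   ", round(pstdev(population), 2))
print("std dev of the means: ", round(pstdev(sample_means), 2))
```

Each individual sample mean differs from μ, which is the uncertainty the slide refers to; the spread of `sample_means` is what the standard error measures.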
  • 13. Types of estimators in statistics Estimator An estimator is a statistic (a function of the data) that produces a guess at the value of a population parameter. By the “best” estimator we usually mean one whose sampling distribution is more concentrated about the population parameter value than that of other estimators. The two main types of estimators in statistics are • Point estimators • Interval estimators Point estimation: Point estimators are functions used to find an approximate value of a population parameter from random samples of the population. They use the sample data to calculate a point estimate, a statistic that serves as the best estimate of an unknown population parameter. Examples: some measure of variation, an average, a minimum, a maximum, a quantile, etc.
  • 14. • Interval estimation Interval estimation uses sample data to calculate an interval of plausible values for an unknown population parameter. The interval is constructed so that the procedure captures the parameter with a chosen probability, typically 95% or higher; such an interval is known as a confidence interval. The confidence interval indicates how reliable an estimate is, and it is calculated from the observed data. The endpoints of the interval are referred to as the lower and upper confidence limits.
  • 15. Properties of Point Estimators • Unbiasedness • Consistency • Sufficiency • Efficiency Unbiasedness An estimator of a given parameter is said to be unbiased if its expected value is equal to the true value of the parameter. The bias of a point estimator is defined as the difference between the expected value of the estimator and the value of the parameter being estimated. When the bias is zero, the estimator is unbiased. The closer the expected value of the estimator is to the value of the parameter being estimated, the smaller the bias.
  • 17. Consistency Consistency tells us how close the point estimator stays to the value of the parameter as the sample size increases. A point estimator needs a large sample size to be more consistent and accurate. You can also check whether a point estimator is consistent by looking at its expected value and variance: for the estimator to be consistent, the expected value should move toward the true value of the parameter and the variance should shrink toward zero as the sample size grows.
  • 22. Maximum likelihood estimator The maximum likelihood method of point estimation finds the values of the unknown parameters that maximize the likelihood function. It takes a known model and selects the parameter values under which the observed data are most probable. For example, a researcher may be interested in the average weight of babies born prematurely. Since it would be impossible to measure all babies born prematurely in the population, the researcher can take a sample from one location. If the weight of pre-term babies is assumed to follow a normal distribution, the researcher can use the maximum likelihood estimator to estimate the average weight of the entire population of pre-term babies from the sample data.
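As a rough sketch of the pre-term baby example, the code below computes the maximum likelihood estimates for a normal model. The weights are invented for illustration, and the function names are hypothetical. For the normal distribution the estimates have a closed form: the sample mean, and the average squared deviation (dividing by n, not n - 1).

```python
import math

def normal_log_likelihood(data, mu, sigma2):
    """Log-likelihood of i.i.d. normal data with mean mu and variance sigma2."""
    n = len(data)
    squared_error = sum((x - mu) ** 2 for x in data)
    return -0.5 * n * math.log(2 * math.pi * sigma2) - squared_error / (2 * sigma2)

def normal_mle(data):
    """Closed-form maximum likelihood estimates (mean, variance) for a normal model."""
    n = len(data)
    mu_hat = sum(data) / n
    sigma2_hat = sum((x - mu_hat) ** 2 for x in data) / n  # divides by n, not n - 1
    return mu_hat, sigma2_hat

# Hypothetical birth weights in kg (made-up values, for illustration only).
weights_kg = [1.9, 2.3, 2.1, 2.6, 1.8, 2.4, 2.2, 2.0]
mu_hat, sigma2_hat = normal_mle(weights_kg)

# Any other parameter values give a lower log-likelihood than the estimates.
best = normal_log_likelihood(weights_kg, mu_hat, sigma2_hat)
shifted = normal_log_likelihood(weights_kg, mu_hat + 0.1, sigma2_hat)
print(mu_hat, sigma2_hat, best > shifted)
```

Comparing `best` with `shifted` is a quick sanity check that the closed-form estimates do sit at the maximum of the likelihood.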
  • 26. Method of moments The method of moments for estimating parameters was introduced in 1887 by the Russian mathematician Pafnuty Chebyshev. It starts from known facts about a population and applies them to a sample of that population. The first step is to derive equations that relate the population moments to the unknown parameters. The next step is to draw a sample from the population and use it to estimate the population moments. The equations derived in step one are then solved with the sample moments substituted for the population moments. This produces estimates of the unknown population parameters.
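The steps above can be sketched for one concrete model. The gamma distribution is an assumption made here for illustration (the slides do not name a model): with shape k and scale θ, the population moments satisfy mean = kθ and variance = kθ². Replacing them with sample moments and solving gives θ = variance / mean and k = mean² / variance. The data values and function name are hypothetical.

```python
def gamma_method_of_moments(data):
    """Method-of-moments estimates (shape k, scale theta) for a gamma model."""
    n = len(data)
    first_moment = sum(data) / n                    # sample mean
    second_moment = sum(x ** 2 for x in data) / n   # sample mean of squares
    variance = second_moment - first_moment ** 2    # second central sample moment

    # Solve mean = k * theta and variance = k * theta**2 for k and theta.
    theta_hat = variance / first_moment
    k_hat = first_moment ** 2 / variance
    return k_hat, theta_hat

# Hypothetical waiting times in minutes (made-up values, for illustration only).
waiting_times = [1.2, 0.8, 2.5, 3.1, 0.4, 1.9, 2.2, 1.5]
k_hat, theta_hat = gamma_method_of_moments(waiting_times)
print(k_hat, theta_hat)
```

By construction, `k_hat * theta_hat` reproduces the sample mean, which is the defining property of a method-of-moments estimate.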
  • 38. What is Confidence Interval? A confidence interval is an interval estimate in statistics that is likely to contain a population parameter. The unknown population parameter is estimated through a sample statistic calculated from the sampled data. For example, the population mean μ is estimated using the sample mean x̄. The interval is generally defined by its lower and upper bounds. A confidence interval is reported with a percentage (the most frequently quoted are 90%, 95%, and 99%), which is the confidence level. The concept of the confidence interval is very important in statistics (for example in hypothesis testing) since it is used as a measure of uncertainty. The concept was introduced by the Polish mathematician and statistician Jerzy Neyman in 1937.
  • 39. Confidence Interval We can also quantify the uncertainty (sampling distribution) of our point estimate. One way of doing this is by constructing an interval that is likely to contain the population parameter. Such an interval, computed on the basis of the data, is called a confidence interval. The sampling probability that the confidence interval will indeed contain the parameter value is called the confidence level. We construct confidence intervals for a given confidence level.
  • 40. Interpretation of Confidence Interval The proper interpretation of a confidence interval is probably the most challenging aspect of this statistical concept. The confidence level describes the procedure: if the sampling were repeated many times, about 95% of the intervals constructed this way would contain the true value of the population parameter (e.g., the mean). Accordingly, we may interpret a computed confidence interval using the statement below: We are 95% confident that the interval between X [lower bound] and Y [upper bound] contains the true value of the population parameter. However, it would be inappropriate to state the following: There is a 95% probability that the interval between X [lower bound] and Y [upper bound] contains the true value of the population parameter. Once the interval has been computed, the parameter either lies in it or does not.
  • 41. How to Calculate the Confidence Interval? The interval is calculated using the following steps: • Gather the sample data. • Calculate the sample mean x̄. • Determine whether the population’s standard deviation is known or unknown. • If the population’s standard deviation is known, use a z-score for the corresponding confidence level. • If the population’s standard deviation is unknown, use a t-statistic for the corresponding confidence level.
  • 42. • Find the lower and upper bounds of the confidence interval using the following formulas: a. Known population standard deviation: x̄ ± z(α/2) · σ/√n
  • 43. b. Unknown population standard deviation: x̄ ± t(α/2, n-1) · s/√n
  • 44. Examples • Suppose we conduct a poll to get a sense of the outcome of an upcoming election with two candidates. We poll 1000 people, and 550 of them respond that they will vote for candidate A. How confident can we be about the proportion of voters who will vote for candidate A? Sol. 1. Select our desired levels of confidence: we’re going to use the 90%, 95%, and 99% levels. 2. Calculate α and α/2: our α values are 0.1, 0.05, and 0.01 respectively; our α/2 values are 0.05, 0.025, and 0.005. 3. Look up the corresponding z-scores: our Z(α/2) values are 1.645, 1.96, and 2.58. 4. Multiply the z-score by the standard error to find the margin of error. First we need to calculate the standard error.
  • 45. 5. Find the interval by adding and subtracting this product from the mean. In this case, we are working with a distribution we have not previously discussed, the normal approximation to a binomial distribution (each voter chooses Candidate A or B, a binomial outcome). Our sample gives an estimate of the probability that an individual votes for candidate A: p̂ = 550/1000 = 0.55. We can use this in a formula to estimate the standard error for such a distribution. Multiply the z-score by the standard error, cont. • For the normal approximation to the binomial, the standard error can be estimated using: S.E. = √(p̂(1 - p̂)/n) = √(0.55 × 0.45/1000) ≈ 0.0157
  • 46. • We can now multiply this value by the z-scores to calculate the margin of error for each confidence level. • We add and subtract the margin of error from the mean (0.55 in this case) to find the bounds of our confidence intervals at each level of confidence:
      CI    Z(α/2)   Margin of error   Lower bound   Upper bound
      90%   1.645    0.026             0.524         0.576
      95%   1.96     0.031             0.519         0.581
      99%   2.58     0.041             0.509         0.591
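The polling example can be reproduced with a short sketch. The function name is hypothetical. The z-scores come from Python's `statistics.NormalDist` rather than a printed table, so the 99% value is about 2.576 instead of the rounded 2.58 used on the slides; the bounds agree to three decimals.

```python
import math
from statistics import NormalDist

def proportion_confidence_interval(successes, n, confidence):
    """Normal-approximation confidence interval for a population proportion."""
    p_hat = successes / n
    standard_error = math.sqrt(p_hat * (1 - p_hat) / n)
    alpha = 1 - confidence
    z = NormalDist().inv_cdf(1 - alpha / 2)  # two-sided critical value
    margin = z * standard_error
    return p_hat - margin, p_hat + margin, margin

# 550 of 1000 respondents say they will vote for candidate A.
for level in (0.90, 0.95, 0.99):
    lower, upper, margin = proportion_confidence_interval(550, 1000, level)
    print(f"{level:.0%}: margin {margin:.3f}, interval ({lower:.3f}, {upper:.3f})")
```

Even at the 99% level the lower bound stays above 0.5, so the poll points to candidate A leading, subject to the usual assumption of a random sample.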
  • 47. What is Hypothesis Testing? Hypothesis testing is a method of statistical inference. It is used to test whether a statement about a population parameter is supported by the sample data. Hypothesis testing is a powerful tool for checking predictions. For example: a statistician might want to make a prediction about the mean value a customer would pay for his firm’s product. He can then formulate a hypothesis, for example, “The average value that customers will pay for my product is larger than $5”. To test this statement statistically, the firm owner could use hypothesis testing.
  • 48. Hypothesis testing is formulated in terms of two hypotheses: • H0: the null hypothesis; • H1: the alternative hypothesis. What we want to test is whether H1 is “likely” true. So, there are two possible outcomes: • Reject H0 and accept H1 because of sufficient evidence in the sample in favor of H1; • Do not reject H0 because of insufficient evidence to support H1.
  • 49. Null Hypothesis and Alternative Hypothesis • Null Hypothesis • Alternative Hypothesis The null hypothesis is usually set as what we don’t want to be true. It is the hypothesis to be tested. Therefore, the null hypothesis is considered to be true until we have sufficient evidence to reject it. If we reject the null hypothesis, we are led to the alternative hypothesis. Consider the example of the business owner who is looking for some customer insight. His null hypothesis would be: H0: The average value customers are willing to pay for my product is smaller than or equal to $5, or H0: μ ≤ 5 (μ = the population mean). The alternative hypothesis would then be what we are evaluating, so, in this case, it would be: Ha: The average value customers are willing to pay for the product is greater than $5, or Ha: μ > 5
  • 51. Type I and Type II Errors A Type I error arises when a true null hypothesis is rejected. The probability of making a Type I error is also known as the level of significance of the test, commonly referred to as alpha (α). So, for example, if a test has its alpha set at 0.01, there is a 1% probability of rejecting a true null hypothesis, that is, a 1% probability of making a Type I error. A Type II error arises when you fail to reject a false null hypothesis. The probability of making a Type II error is commonly denoted by the Greek letter beta (β). β is used to define the power of a test, which is the probability of correctly rejecting a false null hypothesis.
  • 52. The power of a test is defined as 1-β. A test with more power is more desirable, as there is a lower probability of making a Type II error. However, there is a tradeoff between the probability of making a Type I error and the probability of making a Type II error: for a fixed sample size, lowering one raises the other.
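The tradeoff between α and β can be made concrete with a sketch for a one-sided z-test of H0: μ ≤ μ0 against Ha: μ > μ0 with known σ. Every number below (true mean 5.5, σ = 2, n = 50) is a hypothetical assumption chosen to echo the $5 pricing example; none of them come from the slides. Under these assumptions, power = 1 - Φ(z(1-α) - (μ_true - μ0)/(σ/√n)).

```python
from math import sqrt
from statistics import NormalDist

def power_one_sided_z(mu0, mu_true, sigma, n, alpha):
    """Power of the upper-tailed z-test of H0: mu <= mu0 when the true mean is mu_true."""
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha)           # rejection threshold for Z
    shift = (mu_true - mu0) / (sigma / sqrt(n))      # how far the truth moves Z
    return 1 - std_normal.cdf(z_crit - shift)

# Hypothetical scenario: customers truly pay $5.50 on average, sigma = $2, n = 50.
for alpha in (0.10, 0.05, 0.01):
    power = power_one_sided_z(mu0=5.0, mu_true=5.5, sigma=2.0, n=50, alpha=alpha)
    beta = 1 - power                                 # Type II error probability
    print(f"alpha={alpha:.2f}  power={power:.3f}  beta={beta:.3f}")
```

With the sample size held fixed, a smaller α pushes the rejection threshold outward and so raises β; increasing n is the way to reduce both error probabilities at once.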
  • 54. • Significance level – the maximum probability of committing a Type I error. This probability is symbolized by α: P(Type I error | H0 is true) = α. • Critical or rejection region – the range of values of the test statistic that indicate a significant difference, so that the null hypothesis should be rejected. • Non-critical or non-rejection region – the range of values of the test statistic that indicate the difference was probably due to chance, so that the null hypothesis should not be rejected.
  • 66. Testing a hypothesis about the mean of a population We have the following steps: 1. Data: determine the variable, sample size (n), sample mean (x̄), and population standard deviation (σ), or sample standard deviation (s) if σ is unknown. 2. Assumptions: we have two cases: Case 1: the population is normally or approximately normally distributed with known or unknown variance (sample size n may be small or large). Case 2: the population is not normal with known or unknown variance (n is large, i.e. n ≥ 30).
  • 67. 3. Hypotheses: we have three cases. Case I: H0: μ = μ0 vs HA: μ ≠ μ0, e.g. we want to test that the population mean is different from 50. Case II: H0: μ = μ0 vs HA: μ > μ0, e.g. we want to test that the population mean is greater than 50. Case III: H0: μ = μ0 vs HA: μ < μ0, e.g. we want to test that the population mean is less than 50.
  • 74. Example • Researchers are interested in the mean age of a certain population. • A random sample of 10 individuals drawn from the population of interest has a mean age of 27. • Assuming that the population is approximately normally distributed with variance 20, can we conclude that the mean is different from 30 years (α = 0.05)? • If the p-value is 0.0340, how can we use it in making a decision?
  • 75. Solution 1. Data: the variable is age, n = 10, x̄ = 27, σ² = 20, α = 0.05. 2. Assumptions: the population is approximately normally distributed with variance 20. 3. Hypotheses: • H0: μ = 30 • HA: μ ≠ 30 4. Test statistic: • Z = (x̄ - μ0)/(σ/√n) = (27 - 30)/√(20/10) = -2.12 5. Decision rule: the alternative hypothesis is HA: μ ≠ 30, hence we reject H0 if Z > Z(1-0.025) = Z(0.975) • or Z < -Z(1-0.025) = -Z(0.975) • Z(0.975) = 1.96 (from table D)
  • 76. 6. Decision: • We reject H0, since -2.12 is in the rejection region. • We can conclude that μ is not equal to 30. • Using the p-value, we note that p-value = 0.0340 < 0.05; therefore we reject H0.
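The worked example can be checked with a minimal sketch of a two-sided z-test. The function name is hypothetical. The critical value and p-value are computed with `statistics.NormalDist` instead of being read from a table, so the p-value comes out near 0.0339 rather than the table-based 0.0340; the decision is the same.

```python
from math import sqrt
from statistics import NormalDist

def two_sided_z_test(sample_mean, mu0, variance, n, alpha):
    """Two-sided z-test for a population mean when the variance is known."""
    z = (sample_mean - mu0) / sqrt(variance / n)
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha / 2)       # e.g. about 1.96 for alpha = 0.05
    p_value = 2 * (1 - std_normal.cdf(abs(z)))       # both tails
    reject_h0 = abs(z) > z_crit
    return z, z_crit, p_value, reject_h0

# Data from the example: n = 10, sample mean 27, variance 20, H0: mu = 30.
z, z_crit, p_value, reject_h0 = two_sided_z_test(27, 30, 20, 10, 0.05)
print(f"Z = {z:.2f}, critical value = {z_crit:.2f}, p-value = {p_value:.4f}")
print("Reject H0" if reject_h0 else "Do not reject H0")
```

The rejection-region rule (|Z| > 1.96) and the p-value rule (p < α) always lead to the same decision, which is why the slide can present either one.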