Dr. Abhay Pratap Pandey introduces statistical inference and its key concepts. Statistical inference allows making conclusions about a population based on a sample. It involves estimation and hypothesis testing. Estimation determines population parameters using sample statistics. Hypothesis testing determines if sample data provides sufficient evidence to reject claims about population parameters. The document defines key terms like population, sample, parameter, statistic, and discusses properties of estimators like unbiasedness and consistency. It also explains hypothesis testing concepts like null and alternative hypotheses, types of errors, and steps to conduct hypothesis tests on a population mean. An example demonstrates hypothesis testing for a population mean using a z-test.
Introduction to Statistics - Basic concepts
- How to be a good doctor - A step in Health promotion
- By Ibrahim A. Abdelhaleem - Zagazig Medical Research Society (ZMRS)
Introduction to Statistics - Basic concepts
- How to be a good doctor - A step in Health promotion
- By Ibrahim A. Abdelhaleem - Zagazig Medical Research Society (ZMRS)
A sample design is a definite plan for obtaining a sample from a given population. Researcher must select/prepare a sample design which should be reliable and appropriate for his research study.
Hypothesis Testing is important part of research, based on hypothesis testing we can check the truth of presumes hypothesis (Research Statement or Research Methodology )
UNIVARIATE & BIVARIATE ANALYSIS
UNIVARIATE BIVARIATE & MULTIVARIATE
UNIVARIATE ANALYSIS
-One variable analysed at a time
BIVARIATE ANALYSIS
-Two variable analysed at a time
MULTIVARIATE ANALYSIS
-More than two variables analysed at a time
TYPES OF ANALYSIS
DESCRIPTIVE ANALYSIS
INFERENTIAL ANALYSIS
DESCRIPTIVE ANALYSIS
Transformation of raw data
Facilitate easy understanding and interpretation
Deals with summary measures relating to sample data
Eg-what is the average age of the sample?
INFERENTIAL ANALYSIS
Carried out after descriptive analysis
Inferences drawn on population parameters based on sample results
Generalizes results to the population based on sample results
Eg-is the average age of population different from 35?
DESCRIPTIVE ANALYSIS OF UNIVARIATE DATA
1. Prepare frequency distribution of each variable
Missing Data
Situation where certain questions are left unanswered
Analysis of multiple responses
Measures of central tendency
3 measures of central tendency
1.Mean
2.Median
3.Mode
MEAN
Arithmetic average of a variable
Appropriate for interval and ratio scale data
x
MEDIAN
Calculates the middle value of the data
Computed for ratio, interval or ordinal scale.
Data needs to be arranged in ascending or descending order
MODE
Point of maximum frequency
Should not be computed for ordinal or interval data unless grouped.
Widely used in business
MEASURE OF DISPERSION
Measures of central tendency do not explain distribution of variables
4 measures of dispersion
1.Range
2.Variance and standard deviation
3.Coefficient of variation
4.Relative and absolute frequencies
DESCRIPTIVE ANALYSIS OF BIVARIATE DATA
There are three types of measure used.
1.Cross tabulation
2.Spearmans rank correlation coefficient
3.Pearsons linear correlation coefficient
Cross Tabulation
Responses of two questions are combined
Spearmanās rank order correlation coefficient.
Used in case of ordinal data
A sample design is a definite plan for obtaining a sample from a given population. Researcher must select/prepare a sample design which should be reliable and appropriate for his research study.
Hypothesis Testing is important part of research, based on hypothesis testing we can check the truth of presumes hypothesis (Research Statement or Research Methodology )
UNIVARIATE & BIVARIATE ANALYSIS
UNIVARIATE BIVARIATE & MULTIVARIATE
UNIVARIATE ANALYSIS
-One variable analysed at a time
BIVARIATE ANALYSIS
-Two variable analysed at a time
MULTIVARIATE ANALYSIS
-More than two variables analysed at a time
TYPES OF ANALYSIS
DESCRIPTIVE ANALYSIS
INFERENTIAL ANALYSIS
DESCRIPTIVE ANALYSIS
Transformation of raw data
Facilitate easy understanding and interpretation
Deals with summary measures relating to sample data
Eg-what is the average age of the sample?
INFERENTIAL ANALYSIS
Carried out after descriptive analysis
Inferences drawn on population parameters based on sample results
Generalizes results to the population based on sample results
Eg-is the average age of population different from 35?
DESCRIPTIVE ANALYSIS OF UNIVARIATE DATA
1. Prepare frequency distribution of each variable
Missing Data
Situation where certain questions are left unanswered
Analysis of multiple responses
Measures of central tendency
3 measures of central tendency
1.Mean
2.Median
3.Mode
MEAN
Arithmetic average of a variable
Appropriate for interval and ratio scale data
x
MEDIAN
Calculates the middle value of the data
Computed for ratio, interval or ordinal scale.
Data needs to be arranged in ascending or descending order
MODE
Point of maximum frequency
Should not be computed for ordinal or interval data unless grouped.
Widely used in business
MEASURE OF DISPERSION
Measures of central tendency do not explain distribution of variables
4 measures of dispersion
1.Range
2.Variance and standard deviation
3.Coefficient of variation
4.Relative and absolute frequencies
DESCRIPTIVE ANALYSIS OF BIVARIATE DATA
There are three types of measure used.
1.Cross tabulation
2.Spearmans rank correlation coefficient
3.Pearsons linear correlation coefficient
Cross Tabulation
Responses of two questions are combined
Spearmanās rank order correlation coefficient.
Used in case of ordinal data
šŗPlease Subscribe to this Channel for more solutions and lectures
http://www.youtube.com/onlineteaching
Chapter 7: Estimating Parameters and Determining Sample Sizes
7.1: Estimating a Population Proportion
ļŗPlease Subscribe to this Channel for more solutions and lectures
http://www.youtube.com/onlineteaching
Chapter 7: Estimating Parameters and Determining Sample Sizes
7.1: Estimating a Population Proportion
This ppt includes basic concepts about data types, levels of measurements. It also explains which descriptive measure, graph and tests should be used for different types of data. A brief of Pivot tables and charts is also included.
how to sell pi coins effectively (from 50 - 100k pi)DOT TECH
Ā
Anywhere in the world, including Africa, America, and Europe, you can sell Pi Network Coins online and receive cash through online payment options.
Pi has not yet been launched on any exchange because we are currently using the confined Mainnet. The planned launch date for Pi is June 28, 2026.
Reselling to investors who want to hold until the mainnet launch in 2026 is currently the sole way to sell.
Consequently, right now. All you need to do is select the right pi network provider.
Who is a pi merchant?
An individual who buys coins from miners on the pi network and resells them to investors hoping to hang onto them until the mainnet is launched is known as a pi merchant.
debuts.
I'll provide you the Telegram username
@Pi_vendor_247
Currently pi network is not tradable on binance or any other exchange because we are still in the enclosed mainnet.
Right now the only way to sell pi coins is by trading with a verified merchant.
What is a pi merchant?
A pi merchant is someone verified by pi network team and allowed to barter pi coins for goods and services.
Since pi network is not doing any pre-sale The only way exchanges like binance/huobi or crypto whales can get pi is by buying from miners. And a merchant stands in between the exchanges and the miners.
I will leave the telegram contact of my personal pi merchant. I and my friends has traded more than 6000pi coins successfully
Tele-gram
@Pi_vendor_247
Poonawalla Fincorp and IndusInd Bank Introduce New Co-Branded Credit Cardnickysharmasucks
Ā
The unveiling of the IndusInd Bank Poonawalla Fincorp eLITE RuPay Platinum Credit Card marks a notable milestone in the Indian financial landscape, showcasing a successful partnership between two leading institutions, Poonawalla Fincorp and IndusInd Bank. This co-branded credit card not only offers users a plethora of benefits but also reflects a commitment to innovation and adaptation. With a focus on providing value-driven and customer-centric solutions, this launch represents more than just a new productāit signifies a step towards redefining the banking experience for millions. Promising convenience, rewards, and a touch of luxury in everyday financial transactions, this collaboration aims to cater to the evolving needs of customers and set new standards in the industry.
Falcon stands out as a top-tier P2P Invoice Discounting platform in India, bridging esteemed blue-chip companies and eager investors. Our goal is to transform the investment landscape in India by establishing a comprehensive destination for borrowers and investors with diverse profiles and needs, all while minimizing risk. What sets Falcon apart is the elimination of intermediaries such as commercial banks and depository institutions, allowing investors to enjoy higher yields.
when will pi network coin be available on crypto exchange.DOT TECH
Ā
There is no set date for when Pi coins will enter the market.
However, the developers are working hard to get them released as soon as possible.
Once they are available, users will be able to exchange other cryptocurrencies for Pi coins on designated exchanges.
But for now the only way to sell your pi coins is through verified pi vendor.
Here is the telegram contact of my personal pi vendor
@Pi_vendor_247
US Economic Outlook - Being Decided - M Capital Group August 2021.pdfpchutichetpong
Ā
The U.S. economy is continuing its impressive recovery from the COVID-19 pandemic and not slowing down despite re-occurring bumps. The U.S. savings rate reached its highest ever recorded level at 34% in April 2020 and Americans seem ready to spend. The sectors that had been hurt the most by the pandemic specifically reduced consumer spending, like retail, leisure, hospitality, and travel, are now experiencing massive growth in revenue and job openings.
Could this growth lead to a āRoaring Twentiesā? As quickly as the U.S. economy contracted, experiencing a 9.1% drop in economic output relative to the business cycle in Q2 2020, the largest in recorded history, it has rebounded beyond expectations. This surprising growth seems to be fueled by the U.S. governmentās aggressive fiscal and monetary policies, and an increase in consumer spending as mobility restrictions are lifted. Unemployment rates between June 2020 and June 2021 decreased by 5.2%, while the demand for labor is increasing, coupled with increasing wages to incentivize Americans to rejoin the labor force. Schools and businesses are expected to fully reopen soon. In parallel, vaccination rates across the country and the world continue to rise, with full vaccination rates of 50% and 14.8% respectively.
However, it is not completely smooth sailing from here. According to M Capital Group, the main risks that threaten the continued growth of the U.S. economy are inflation, unsettled trade relations, and another wave of Covid-19 mutations that could shut down the world again. Have we learned from the past year of COVID-19 and adapted our economy accordingly?
āIn order for the U.S. economy to continue growing, whether there is another wave or not, the U.S. needs to focus on diversifying supply chains, supporting business investment, and maintaining consumer spending,ā says Grace Feeley, a research analyst at M Capital Group.
While the economic indicators are positive, the risks are coming closer to manifesting and threatening such growth. The new variants spreading throughout the world, Delta, Lambda, and Gamma, are vaccine-resistant and muddy the predictions made about the economy and health of the country. These variants bring back the feeling of uncertainty that has wreaked havoc not only on the stock market but the mindset of people around the world. MCG provides unique insight on how to mitigate these risks to possibly ensure a bright economic future.
how to sell pi coins in South Korea profitably.DOT TECH
Ā
Yes. You can sell your pi network coins in South Korea or any other country, by finding a verified pi merchant
What is a verified pi merchant?
Since pi network is not launched yet on any exchange, the only way you can sell pi coins is by selling to a verified pi merchant, and this is because pi network is not launched yet on any exchange and no pre-sale or ico offerings Is done on pi.
Since there is no pre-sale, the only way exchanges can get pi is by buying from miners. So a pi merchant facilitates these transactions by acting as a bridge for both transactions.
How can i find a pi vendor/merchant?
Well for those who haven't traded with a pi merchant or who don't already have one. I will leave the telegram id of my personal pi merchant who i trade pi with.
Tele gram: @Pi_vendor_247
#pi #sell #nigeria #pinetwork #picoins #sellpi #Nigerian #tradepi #pinetworkcoins #sellmypi
The secret way to sell pi coins effortlessly.DOT TECH
Ā
Well as we all know pi isn't launched yet. But you can still sell your pi coins effortlessly because some whales in China are interested in holding massive pi coins. And they are willing to pay good money for it. If you are interested in selling I will leave a contact for you. Just telegram this number below. I sold about 3000 pi coins to him and he paid me immediately.
Telegram: @Pi_vendor_247
USDA Loans in California: A Comprehensive Overview.pptxmarketing367770
Ā
USDA Loans in California: A Comprehensive Overview
If you're dreaming of owning a home in California's rural or suburban areas, a USDA loan might be the perfect solution. The U.S. Department of Agriculture (USDA) offers these loans to help low-to-moderate-income individuals and families achieve homeownership.
Key Features of USDA Loans:
Zero Down Payment: USDA loans require no down payment, making homeownership more accessible.
Competitive Interest Rates: These loans often come with lower interest rates compared to conventional loans.
Flexible Credit Requirements: USDA loans have more lenient credit score requirements, helping those with less-than-perfect credit.
Guaranteed Loan Program: The USDA guarantees a portion of the loan, reducing risk for lenders and expanding borrowing options.
Eligibility Criteria:
Location: The property must be located in a USDA-designated rural or suburban area. Many areas in California qualify.
Income Limits: Applicants must meet income guidelines, which vary by region and household size.
Primary Residence: The home must be used as the borrower's primary residence.
Application Process:
Find a USDA-Approved Lender: Not all lenders offer USDA loans, so it's essential to choose one approved by the USDA.
Pre-Qualification: Determine your eligibility and the amount you can borrow.
Property Search: Look for properties in eligible rural or suburban areas.
Loan Application: Submit your application, including financial and personal information.
Processing and Approval: The lender and USDA will review your application. If approved, you can proceed to closing.
USDA loans are an excellent option for those looking to buy a home in California's rural and suburban areas. With no down payment and flexible requirements, these loans make homeownership more attainable for many families. Explore your eligibility today and take the first step toward owning your dream home.
What price will pi network be listed on exchangesDOT TECH
Ā
The rate at which pi will be listed is practically unknown. But due to speculations surrounding it the predicted rate is tends to be from 30$ ā 50$.
So if you are interested in selling your pi network coins at a high rate tho. Or you can't wait till the mainnet launch in 2026. You can easily trade your pi coins with a merchant.
A merchant is someone who buys pi coins from miners and resell them to Investors looking forward to hold massive quantities till mainnet launch.
I will leave the telegram contact of my personal pi vendor to trade with.
@Pi_vendor_247
Turin Startup Ecosystem 2024 - Ricerca sulle Startup e il Sistema dell'Innov...Quotidiano Piemontese
Ā
Turin Startup Ecosystem 2024
Una ricerca de il Club degli Investitori, in collaborazione con ToTeM Torino Tech Map e con il supporto della ESCP Business School e di Growth Capital
1. An introduction to statistical inference
Dr. Abhay Pratap Pandey
University of Delhi
2. What is inference?
Inference defined:
ā¢ An everyday meaningā¦
We infer a conclusion based on evidence and reasoning
ā¢ A statistical meaningā¦
We infer a property of a population from a sample
3. Why inference?
The aim of inference is to determine the characteristics of a population
from a sample.
Population
Sample
4.
5. Population and sample
In statistical analysis, a population is a collection of all the
people, items, or events about which one wants to make
inferences. OR
Any well-defined group of subjects, which could be
individuals, firms, cities, or many other possibilities
(For example university students in India.)
In statistical analysis, a sample, is a subset of the population
(i.e. the people, items, or events) that one collects and
analyzes to make inferences. (For example 200 randomly
chosen university students.)
6. Statistical sample - Subset of the population chosen to represent the
population in a statistical analysis; denoted as (X1,X2, ... Xn).
Random sample- randomly chosen from the population sample of
individuals.
In the case of random sampling, the following techniques can be used:
Independent sampling (draw with replacement) - after each draw the
unit returns to the population.
Dependent sampling (draw without replacement) - after each draw the
unit does not return to the population (no longer participate in the
drawing).
In statistical analysis, an observation is an elements of the sample. (For
example Helena, a student at Central University.)
8. Aim of statistical inference
The aim of statistical inference is to learn about the population using the observed
data
This involves:
ā¢ computing something with the data
ā¢ a statistic: function of data
ā¢ interpret the result
ā¢ in probabilistic terms: sampling distribution of statistic
9. Estimation
ā¢ Determination of the population parameter by the calculation of a
sample statisticā¦
Characteristic
Population
Parameter
Ī¼
Sample
Statistic
š„
10.
11. A sampling distribution is a probability distribution of a statistic obtained
through a large number of samples drawn from a specific population.
Population
parameter
Ī¼
Sample
Statistic š„1
Sample
Statistic
š„2
Sample
Statistic š„3
Uncertainty
Estimates are not perfect
Sampling
distribution
12.
13. Types of estimators in statistics
Estimator
An estimator is a statistic (function of data) that produces such a guess.
We usually mean by ābestā an estimator whose sampling distribution is more
concentrated about the population parameter value compared to other
estimators.
The two main types of estimators in statistics are
ā¢ Point estimators
ā¢ Interval estimators
Point estimation: Point estimators are functions that are used to find an
approximate value of a population parameter from random samples of the
population. They use the sample data of a population to calculate a point
estimate or a statistic that serves as the best estimate of an
unknown parameter of a population. We want to estimate a population
parameter using the observed data.
Ex. some measure of variation, an average, min, max, quantile, etc.
14. ā¢ Interval estimation
Interval estimation uses sample data to calculate the interval
of the possible values of an unknown parameter of a
population. The interval of the parameter is selected in a
way that it falls within a 95% or higher probability, also
known as the confidence interval. The confidence interval is
used to indicate how reliable an estimate is, and it is
calculated from the observed data. The endpoints of the
intervals are referred to as the upper and lower confidence
limits.
15. Properties of Point Estimators
ā¢ Unbiasedness
ā¢ Consistency
ā¢ Sufficiency
ā¢ Efficiency
Unbiasedness
An estimator of a given parameter is said to be unbiased if its expected
value is equal to the true value of the parameter.
The bias of a point estimator is defined as the difference between
the expected value of the estimator and the value of the parameter being
estimated. When
Also, the closer the expected value of a parameter is to the value of the
parameter being measured, the lesser the bias is.
16.
17. Consistency
Consistency tells us how close the point estimator stays to the value of
the parameter as it increases in size. The point estimator requires a
large sample size for it to be more consistent and accurate. You can also
check if a point estimator is consistent by looking at its corresponding
expected value and variance. For the point estimator to be consistent,
the expected value should move toward the true value of the
parameter.
18.
19.
20.
21.
22. Maximum likelihood estimator
The maximum likelihood estimator method of point estimation
attempts to find the unknown parameters that maximize the likelihood
function. It takes a known model and uses the values to compare data
sets and find the most suitable match for the data.
For example, a researcher may be interested in knowing the average
weight of babies born prematurely. Since it would be impossible to
measure all babies born prematurely in the population, the researcher
can take a sample from one location. Since the weight of pre-term
babies follows a normal distribution, the researcher can use the
maximum likelihood estimator to find the average weight of the entire
population of pre-term babies based on the sample data.
23.
24.
25.
26. Method of moments
The method of moments of estimating parameters was introduced in
1887 by Russian mathematician Pafnuty Chebyshev. It starts by taking
known facts about a population and then applying the facts to a sample
of the population. The first step is to derive equations that relate the
population moments to the unknown parameters.
The next step is to draw a sample of the population to be used to
estimate the population moments. The equations derived in step one
are then solved using the sample mean of the population moments.
This produces the best estimate of the unknown population
parameters.
27.
28.
29.
30.
31.
32.
33.
34.
35.
36.
37.
38. What is Confidence Interval?
A confidence interval is an estimate of an interval in statistics that may
contain a population parameter. The unknown population parameter is
found through a sample parameter calculated from the sampled data.
For example, the population mean Ī¼ is found using the sample mean xĢ .
The interval is generally defined by its lower and upper bounds. The
confidence interval is expressed as a percentage (the most frequently
quoted percentages are 90%, 95%, and 99%). The percentage reflects
the confidence level.
The concept of the confidence interval is very important in statistics
(hypothesis testing) since it is used as a measure of uncertainty. The
concept was introduced by Polish mathematician and statistician, Jerzy
Neyman in 1937.
39. Confidence Interval
We can also quantify the uncertainty (sampling distribution) of our
point estimate.
One way of doing this is by constructing an interval that is likely to
contain the population parameter.
One such an interval, which is computed on the basis of the data, is
called a confidence interval.
The sampling probability that the confidence interval will indeed
contain the parameter value is called the confidence level.
We construct confidence intervals for a given confidence level.
40. Interpretation of Confidence Interval
The proper interpretation of a confidence interval is probably the most
challenging aspect of this statistical concept. One example of the most
common interpretation of the concept is the following:
There is a 95% probability that, in the future, the true value of the
population parameter (e.g., mean) will fall within X [lower bound] and Y
[upper bound] interval.
In addition, we may interpret the confidence interval using the statement
below:
We are 95% confident that the interval between X [lower bound] and Y
[upper bound] contains the true value of the population parameter.
However, it would be inappropriate to state the following:
There is a 95% probability that the interval between X [lower bound] and
Y [upper bound] contains the true value of the population parameter.
41. How to Calculate the Confidence Interval?
The interval is calculated using the following steps:
ā¢ Gather the sample data.
ā¢ Calculate the sample mean xĢ .
ā¢ Determine whether a populationās standard deviation is known or
unknown.
ā¢ If a populationās standard deviation is known, we can use a z-score for
the corresponding confidence level.
ā¢ If a populationās standard deviation is unknown, we can use a t-
statistic for the corresponding confidence level.
42. ā¢ Find the lower and upper bounds of the confidence interval using the
following formulas:
a. Known population standard deviation
44. Examples
ā¢ Suppose we conduct a poll to try and get a sense of the outcome of an
upcoming election with two candidates. We poll 1000 people, and 550 of
them respond that they will vote for candidate A .
How confident can we be that a given person will cast their vote for
candidate A?
Sol.
1. Select our desired levels of confidence Weāre going to use the 90%,
95%, and 99% levels
2. Calculate Ī± and Ī±/2 Our Ī± values are 0.1, 0.05, and 0.01 respectively
Our Ī±/2 values are 0.05, 0.025, and 0.005
3. Look up the corresponding z-scores Our ZĪ± /2 values are 1.645, 1.96,
and 2.58
4. Multiply the z-score by the standard error to find the margin of error
First we need to calculate the standard error
45. 5. Find the interval by adding and subtracting this product from the mean.
In this case, we are working with a distribution we have not previously
discussed, a normal binomial distribution (i.e. a vote can choose Candidate
A or B, a binomial function).
We have a probability estimator from our sample, where the probability of
an individual in our sample voting for candidate A was found to be 550/1000
or 0.55.
We can use this information in a formula to estimate the standard error for
such a distribution:
5. Multiply the z-score by the standard error cont.
ā¢ For a normal binominal distribution, the standard error can be estimated
using:
S.E= 0.0157
46. ā¢ We can now multiply this value by the z-scores to calculate the
margins of error for each conf. level
Multiply the z-score by the standard error cont.
ā¢ We calculate the margin of error and add and subtract that value
from the mean (0.55 in this case) to find the bounds of our confidence
intervals at each level of confidence:
CI ZĪ±/2 Margin of error Lower Bounds Upper Bounds
90% 1.645 0.026 0.524 0.576
95% 1.96 0.031 0.519 0.581
99% 2.58 0.041 0.509 0.591
47. What is Hypothesis Testing?
Hypothesis Testing is a method of statistical inference. It is used to test
if a statement regarding a population parameter is statistically
significant. Hypothesis testing is a powerful tool for testing the power
of predictions.
For example: A Statistician might want to make a prediction of the
mean value a customer would pay for his firmās product. He can then
formulate a hypothesis, for example, āThe average value that
customers will pay for my product is larger than $5ā. To statistically test
this question, the firm owner could use hypothesis testing.
48. Hypothesis testing is formulated in terms of two hypothesis:
ā¢ H0: the null hypothesis;
ā¢ H1: the alternate hypothesis.
The hypothesis we want to test is if H1 is ālikely" true.
So, there are two possible outcomes:
ā¢ Reject H0 and accept H1 because of sufficient evidence in the sample
in favor or H1;
ā¢ Do not reject H0 because of insufficient evidence to support H1.
49. Null Hypothesis and Alternative Hypothesis
ā¢ Null Hypothesis
ā¢ Alternative Hypothesis
The Null Hypothesis is usually set as what we donāt want to be true. It is
the hypothesis to be tested. Therefore, the Null Hypothesis is considered
to be true, until we have sufficient evidence to reject it. If we reject the
null hypothesis, we are led to the alternative hypothesis.
Example of the business owner who is looking for some customer insight.
His null hypothesis would be:
H0 : The average value customers are willing to pay for my product is
smaller than or equal to $5 or H0 : Āµ ā¤ 5(Āµ = the population mean)
The alternative hypothesis would then be what we are evaluating, so, in
this case, it would be:
Ha : The average value customers are willing to pay for the product is
greater than $5 or Ha : Āµ > 5
50.
51. Type I and Type II Errors
A Type I Error arises when a true Null Hypothesis is rejected. The
probability of making a Type I Error is also known as the level of
significance of the test, which is commonly referred to as alpha (Ī±). So,
for example, if a test that has its alpha set as 0.01, there is a 1%
probability of rejecting a true null hypothesis or a 1% probability of
making a Type I Error.
A Type II Error arises when you fail to reject a False Null Hypothesis.
The probability of making a Type II Error is commonly denoted by the
Greek letter beta (Ī²). Ī² is used to define the Power of a Test, which is
the probability of correctly rejecting a false null hypothesis.
52. The Power of a Test is defined as 1-Ī². A test with more Power is more
desirable, as there is a lower probability of making a Type II Error.
However, there is a tradeoff between the probability of making a Type I
Error and the probability of making a Type II Error.
54. ā¢ Significance level - is the maximum probability of committing a Type I
error. This probability is symbolized by Ī±.
P(Type I error|H0 is true)=Ī±.
ā¢ Critical or Rejection Region ā the range of values for the test value
that indicate a significant difference and that the null hypothesis
should be rejected.
ā¢ Non-critical or Non-rejection Region ā the range of values for the test
value that indicates that the difference was probably due to chance
and that the null hypothesis should not be rejected.
66. Testing a hypothesis about the mean of a population
We have the following steps:
1.Data: determine variable, sample size (n), sample mean( ) ,
population standard deviation or sample standard deviation (s) if is
unknown
2. Assumptions : We have two cases:
Case1: Population is normally or approximately normally distributed
with known or unknown variance (sample size n may be small or large),
Case 2: Population is not normal with known or unknown variance (n is
large i.e. nā„30).
67. 3.Hypothesis: we have three cases
Case I : H0: Ī¼=Ī¼0 Vs HA: Ī¼ Ī¼0
e.g. we want to test that the population mean is different than 50
Case II : H0: Ī¼ = Ī¼0 Vs HA: Ī¼ > Ī¼0
e.g. we want to test that the population mean is greater than 50
Case III : H0: Ī¼ = Ī¼0 Vs HA: Ī¼< Ī¼0
e.g. we want to test that the population mean is less than 50
68.
69.
70.
71.
72.
73.
74. Example
ā¢ Researchers are interested in the mean age of a certain population.
ā¢ A random sample of 10 individuals drawn from the population of
interest has a mean of 27.
ā¢ Assuming that the population is approximately normally distributed
with variance 20,can we conclude that the mean is different from 30
years ? (Ī±=0.05) .
ā¢ If the p - value is 0.0340 how can we use it in making a decision?
75. Solution
1-Data: variable is age, n=10, =27 ,Ļ2=20,Ī±=0.05
2-Assumptions: the population is approximately normally distributed with
variance 20
3-Hypotheses:
ā¢ H0 : Ī¼=30
ā¢ HA: Ī¼ 30
4-Test Statistic:
ā¢ Z = -2.12
5.Decision Rule
The alternative hypothesis is HA: Ī¼ ā 30
Hence we reject H0 if Z > Z(1-0.025)= Z(0.975)
ā¢ or Z< - Z(1-0.025 )= - Z(0.975)
ā¢ Z(0.975)=1.96(from table D)
76. 6.Decision:
ā¢ We reject H0 ,since -2.12 is in the rejection region .
ā¢ We can conclude that Ī¼ is not equal to 30
ā¢ Using the p value ,we note that p-value =0.0340< 0.05,therefore we
reject H0