This document provides definitions and concepts related to probability, random variables, and probability distributions. It covers the following key points in 3 sentences:
1) It defines basic statistical concepts like probability, random variables, sample space, and probability distributions including binomial, Poisson, uniform, and normal distributions.
2) It provides examples and rules regarding probability, conditional probability, mutually exclusive events, stratified sampling, simple random sampling, statistical inference, point and interval estimation.
3) It asks multiple choice and numerical problems related to these statistical concepts to test understanding of probability distributions, sampling methods, confidence intervals, and hypothesis testing.
Characteristics of a Good Hypothesis
Should be testable
Should be logical
Should be specific & Clear
Should be simple & understandable
Expressed in present tense
Directly related to the research problem
State relationship between the variables
Stated in declarative statement manner
Should be proved or disproved by the researcher
This document provides an overview of how to conduct research. It defines research as systematically collecting and analyzing data to increase understanding. It explains that as students, future practitioners, and educated citizens, understanding research is important. The document outlines the basic steps of a research project including finding a topic, formulating questions, defining the population, selecting a design and measurements, gathering evidence, interpreting evidence, and reporting findings. It also discusses key research concepts like variables, hypotheses, sampling, and quantitative and qualitative methodologies.
This document discusses different evaluation design approaches including quantitative, qualitative, and mixed methods. It provides details on key aspects of each approach such as data collection instruments, strengths, and when each is most applicable. For quantitative methods, it describes experimental, quasi-experimental, time series, and cross-sectional designs. For qualitative methods, it discusses observation, interviews, focus groups, document studies, and key informants. It notes that mixed methods combine quantitative and qualitative approaches to provide multiple perspectives on outcomes and implementation.
Sampling is concerned with the selection of a subset of individuals from within a statistical population to estimate characteristics of the whole population
Class lecture notes #1 (statistics for research)Harve Abella
This document defines key concepts in statistics and research methods. It discusses the difference between descriptive and inferential statistics, and explains variables, parameters, and statistics. It also outlines three research methods: correlational, experimental, and quasi-experimental. The correlational method looks for relationships between variables but does not prove causation. The experimental method establishes cause-and-effect by manipulating an independent variable. The quasi-experimental method uses a non-manipulated variable like gender or time to define comparison groups.
This document discusses the importance of clearly defined research objectives. It states that objectives provide direction and focus for a study. Objectives should be specific, measurable, attainable, realistic and time-bound. Without clear objectives, a study risks becoming aimless and producing no meaningful results. The document also categorizes objectives as general or specific, provides examples of each, and offers guidelines for writing objectives using action verbs that can be evaluated.
Characteristics of a Good Hypothesis
Should be testable
Should be logical
Should be specific & Clear
Should be simple & understandable
Expressed in present tense
Directly related to the research problem
State relationship between the variables
Stated in declarative statement manner
Should be proved or disproved by the researcher
This document provides an overview of how to conduct research. It defines research as systematically collecting and analyzing data to increase understanding. It explains that as students, future practitioners, and educated citizens, understanding research is important. The document outlines the basic steps of a research project including finding a topic, formulating questions, defining the population, selecting a design and measurements, gathering evidence, interpreting evidence, and reporting findings. It also discusses key research concepts like variables, hypotheses, sampling, and quantitative and qualitative methodologies.
This document discusses different evaluation design approaches including quantitative, qualitative, and mixed methods. It provides details on key aspects of each approach such as data collection instruments, strengths, and when each is most applicable. For quantitative methods, it describes experimental, quasi-experimental, time series, and cross-sectional designs. For qualitative methods, it discusses observation, interviews, focus groups, document studies, and key informants. It notes that mixed methods combine quantitative and qualitative approaches to provide multiple perspectives on outcomes and implementation.
Sampling is concerned with the selection of a subset of individuals from within a statistical population to estimate characteristics of the whole population
Class lecture notes #1 (statistics for research)Harve Abella
This document defines key concepts in statistics and research methods. It discusses the difference between descriptive and inferential statistics, and explains variables, parameters, and statistics. It also outlines three research methods: correlational, experimental, and quasi-experimental. The correlational method looks for relationships between variables but does not prove causation. The experimental method establishes cause-and-effect by manipulating an independent variable. The quasi-experimental method uses a non-manipulated variable like gender or time to define comparison groups.
This document discusses the importance of clearly defined research objectives. It states that objectives provide direction and focus for a study. Objectives should be specific, measurable, attainable, realistic and time-bound. Without clear objectives, a study risks becoming aimless and producing no meaningful results. The document also categorizes objectives as general or specific, provides examples of each, and offers guidelines for writing objectives using action verbs that can be evaluated.
This document provides an introduction to probability theory and different probability distributions. It begins with defining probability as a quantitative measure of the likelihood of events occurring. It then covers fundamental probability concepts like mutually exclusive events, additive and multiplicative laws of probability, and independent events. The document also introduces random variables and common probability distributions like the binomial, Poisson, and normal distributions. It provides examples of how each distribution is used and concludes with characteristics of the normal distribution.
1. The document discusses basic concepts in probability and statistics, including sample spaces, events, probability distributions, and random variables.
2. Key concepts are explained such as independent and conditional probability, Bayes' theorem, and common probability distributions like the uniform and normal distributions.
3. Statistical analysis methods are introduced including how to estimate the mean and variance from samples from a distribution.
HW1_STAT206.pdfStatistical Inference II J. Lee Assignment.docxwilcockiris
HW1_STAT206.pdf
Statistical Inference II: J. Lee Assignment 1
Problem 1. Suppose the day after the Drexel-Northeastern basketball game, a poll of 1000 Drexel students
was conducted and it was determined that 850 out of the 1000 watched the game (live or on television).
Assume that this was a simple random sample and that the Drexel undergraduate population is 20000.
(a) Generate an unbiased estimate of the true proportion of Drexel undergraduate students who watched
the game.
(b) What is your estimated standard error for the proportion estimate in (a)?
(c) Give a 95% confidence interval for the true proportion of Drexel undergraduate students who watched
the game.
Problem 2. (Exercise 18 in Chapter 7 of Rice) From independent surveys of two populations, 90% con-
fidence intervals for the population means are conducted. What is the probability that neither interval
contains the respective population mean? That both do?
Problem 3. (Exercise 23 in Chapter 7 of Rice)
(a) Show that the standard error of an estimated proportion is largest when p = 1/2.
(b) Use this result and Corollary B of Section 7.3.2 (also, on Page 17 of the lecture notes) to conclude that
the quantity
1
2
√
N − n
N(n − 1)
is a conservative estimate of the standard error of p̂ no matter what the value of p may be.
(c) Use the central limit theorem to conclude that the interval
p̂ ±
√
N − n
N(n − 1)
contains p with probability at least .95.
HW2_STAT206.pdf
Statistical Inference II: J. Lee Assignment 2
Problem 1. The following data set represents the number of NBA games in January 2016, watched by 10
randomly selected student in STAT 206.
7, 0, 4, 2, 2, 1, 0, 1, 2, 3
(a) What is the sample mean?
(b) Calculate sample variance.
(c) Estimate the mean number of NBA games watched by a student in January 2016.
(d) Estimate the standard error of the estimated mean.
Problem 2. True or false? Tell me why for the false statements.
(a) The center of a 95% confidence interval for the population mean is a random variable.
(b) A 95% confidence interval for µ contains the sample mean with probability .95.
(c) A 95% confidence interval contains 95% of the population.
(d) Out of one hundred 95% confidence intervals for µ, 95 will contain µ.
Problem 3. An investigator quantifies her uncertainty about the estimate of a population mean by reporting
X ± sX . What size confidence interval is?
Problem 4. For a random sample of size n from a population of size N, consider the following as an
estimate of µ:
Xc =
n∑
i=1
ciXi,
where the ci are fixed numbers and X1, . . . ,Xn are the sample. Find a condition on the ci such that the
estimate is unbiased.
Problem 5. A sample of size 100 has the sample mean X = 10. Suppose the we know that the population
standard deviation σ = 5. Find a 95% confidence interval for the population mean µ.
Problem 6. Suppose the we know that the population standard deviation σ = 5. Then how large should a
sample be to estimate the popula.
This document provides an introduction to probability and important concepts in probability theory. It defines probability as a measure of the likelihood of an event occurring based on chance. Probability can be estimated empirically by calculating the relative frequency of outcomes in a series of trials, or estimated subjectively based on experience. Classical probability uses an a priori approach to assign probabilities to outcomes that are considered equally likely, such as outcomes of rolling dice or drawing cards. The document provides examples and definitions of key probability terms and concepts such as sample space, events, axioms of probability, and approaches to calculating probability.
The document summarizes the binomial and Poisson probability distributions. The binomial distribution describes the number of successes in a fixed number of independent yes/no trials, where the probability of success is constant across trials. The Poisson distribution approximates the binomial when the number of trials is large and the probability of success is small. Examples are provided to demonstrate calculating probabilities using the binomial probability formula, binomial tables, and the Poisson probability formula with Poisson tables.
This document provides an overview of Bernoulli statistics and distributions, including key concepts like Bernoulli trials, the Bernoulli distribution, and the mean and variance of Bernoulli random variables. It also discusses the binomial distribution and how it arises from a series of independent Bernoulli trials. Finally, it covers the Poisson distribution and how it can model the number of events occurring in a fixed interval of time or space given a constant average rate of occurrence.
This document presents an analysis of the exponential distribution under an adaptive type-I progressive hybrid censoring scheme for competing risks data. Maximum likelihood and Bayesian estimation methods are used to estimate the distribution parameter. Specifically, maximum likelihood estimators are derived for the exponential distribution parameter. Bayesian estimators are also obtained for the parameter based on squared error and LINEX loss functions using gamma priors. Asymptotic confidence intervals and Bayesian credible intervals are proposed. A simulation study is conducted to evaluate the performance of the estimators.
This document discusses various discrete probability distributions including the binomial and Poisson distributions. It provides examples and formulas for calculating probabilities using these distributions. The key points covered are:
- The binomial distribution describes the probability of successes in a fixed number of yes/no trials where the probability of success is constant. It requires the trials to be independent and have two possible outcomes.
- The Poisson distribution describes the probability of events occurring randomly in time, space or volume. It applies when the number of events is large but the probability of any one event is small.
- Formulas are provided for calculating probabilities using the binomial and Poisson distributions, including the binomial probability formula and Poisson probability formula. Tables can also be used to find binomial and
Computational Pool-Testing with Retesting StrategyWaqas Tariq
Pool testing is a cost effective procedure for identifying defective items in a large population. It also improves the efficiency of the testing procedure when imperfect tests are employed. This study develops computational pool-testing strategy based on a proposed pool testing with re-testing strategy. Statistical moments based on this applied design have been generated. With advent of computers in 1980‘s, pool-testing with re-testing strategy under discussion is handled in the context of computational statistics. From this study, it has been established that re-testing reduces misclassifications significantly as compared to Dorfman procedure although re-testing comes with a cost i.e. increase in the number of tests. Re-testing considered improves the sensitivity and specificity of the testing scheme.
This thesis examines methods for estimating extinction times of species on the Mascarene Islands using occupancy data. Maximum likelihood estimation is used to fit population decline models, including exponential, linear, and changepoint models. Issues with sparse data and parameter redundancy make extinction time estimation difficult without external population information. Simulations show linear decline models can estimate extinction times well when the true population decline is linear, but tend to underestimate extinction times.
InstructionDue Date 6 pm on October 28 (Wed)Part IProbability a.docxdirkrplav
This document discusses implementing a social, environmental, and economic impact measurement system within a company. It explains that measuring sustainability performance is critical for evaluating projects, the company, and its members. A proper measurement system allows companies to develop a sustainability strategy, allocate resources to support it, and evaluate trade-offs between sustainability projects. The document provides examples from Nike and P&G of measuring impacts to demonstrate the business case for sustainability. It stresses that measurement is important for linking performance to sustainability principles and facilitating continuous improvement.
The document defines key concepts in probability and hypothesis testing. It discusses probability as a numerical quantity between 0 and 1 that expresses the likelihood of an event. Different probability distributions are covered, including binomial, normal, and Poisson distributions. Hypothesis testing is defined as a methodology to either accept or reject a null hypothesis based on sample data. Types of hypotheses, terms used in testing like test statistics and p-values, and types of errors are also summarized.
The document provides an introduction to probability concepts including:
1) Definitions of probability, sample space, outcomes, events, and interpretations of probability including classical, empirical, and subjective.
2) Examples of sample spaces for experiments like coin tosses and dice rolls.
3) Explanations of tree diagrams, Venn diagrams, and the addition rules for determining probabilities of mutually exclusive and non-mutually exclusive events.
4) Descriptions of joint, marginal, and conditional probabilities and examples of calculating each from contingency tables.
This document summarizes key probability distributions: binomial, Poisson, and normal. The binomial distribution describes the number of successes in fixed number of trials where the probability of success is constant. The Poisson distribution approximates the binomial when the number of trials is large and the probability of success is small. The normal distribution describes many continuous random variables and is symmetric with two parameters: mean and standard deviation. The document also discusses when binomial and Poisson distributions can be approximated as normal distributions.
- Hypothesis testing involves evaluating claims about population parameters by comparing a null hypothesis to an alternative hypothesis.
- The null hypothesis states that there is no difference or effect, while the alternative hypothesis states that a difference or effect exists.
- There are three main methods for hypothesis testing: the critical value method which separates a critical region from a noncritical region, the p-value method which calculates the probability of obtaining a test statistic at least as extreme as the sample test statistic assuming the null is true, and the confidence interval method which rejects claims not included in the confidence interval.
- The steps of hypothesis testing are to state the hypotheses, calculate the test statistic, find the critical value, make a decision to reject
4Probability and probability distributions.pdfAmanuelDina
Here are the key steps to solve this problem:
1) Draw the standard normal curve
2) The probability is the area between -2.55 and 2.55
3) From the standard normal table:
P(Z ≤ 2.55) = 0.9938
P(Z ≤ -2.55) = 0.0049
4) Use the area property:
P(-2.55 ≤ Z ≤ 2.55) = P(Z ≤ 2.55) - P(Z ≤ -2.55)
= 0.9938 - 0.0049
= 0.9889
Therefore, the probability that a z value will be between -2.55 and 2
Two events are independent if the occurrence of one event does not affect the probability of the other event occurring. The multiplication rule can be used to calculate the probability of independent events occurring. Two events are dependent if the occurrence of one event does affect the probability of the other event occurring. The multiplication rule must be modified to calculate probabilities of dependent events. The addition rule can be used to calculate the probability of one or more events occurring, whether the events are mutually exclusive or not mutually exclusive.
The document provides an overview of quantitative data analysis and various statistical concepts including the normal distribution, z-tests, confidence intervals, and t-tests. It discusses how the normal distribution was developed by de Moivre and Gauss. It then explains the key properties of the normal distribution and how it can be used to describe many natural phenomena. Examples are provided to illustrate how to calculate and interpret confidence intervals and choose the appropriate statistical test.
This document summarizes a lecture on probability, probability distributions, and sampling distributions. It includes examples of calculating probabilities of events using concepts like the binomial distribution, normal distribution, and sampling distributions. It also defines key terms like population, sample, parameter, and statistic. Sample problems are worked through, such as finding the probability that the mean length of 5 randomly selected trout is between 8 to 12 inches.
Understanding User Needs and Satisfying ThemAggregage
https://www.productmanagementtoday.com/frs/26903918/understanding-user-needs-and-satisfying-them
We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.
In this webinar, we won't focus on the research methods for discovering user-needs. We will focus on synthesis of the needs we discover, communication and alignment tools, and how we operationalize addressing those needs.
Industry expert Scott Sehlhorst will:
• Introduce a taxonomy for user goals with real world examples
• Present the Onion Diagram, a tool for contextualizing task-level goals
• Illustrate how customer journey maps capture activity-level and task-level goals
• Demonstrate the best approach to selection and prioritization of user-goals to address
• Highlight the crucial benchmarks, observable changes, in ensuring fulfillment of customer needs
This document provides an introduction to probability theory and different probability distributions. It begins with defining probability as a quantitative measure of the likelihood of events occurring. It then covers fundamental probability concepts like mutually exclusive events, additive and multiplicative laws of probability, and independent events. The document also introduces random variables and common probability distributions like the binomial, Poisson, and normal distributions. It provides examples of how each distribution is used and concludes with characteristics of the normal distribution.
1. The document discusses basic concepts in probability and statistics, including sample spaces, events, probability distributions, and random variables.
2. Key concepts are explained such as independent and conditional probability, Bayes' theorem, and common probability distributions like the uniform and normal distributions.
3. Statistical analysis methods are introduced including how to estimate the mean and variance from samples from a distribution.
HW1_STAT206.pdfStatistical Inference II J. Lee Assignment.docxwilcockiris
HW1_STAT206.pdf
Statistical Inference II: J. Lee Assignment 1
Problem 1. Suppose the day after the Drexel-Northeastern basketball game, a poll of 1000 Drexel students
was conducted and it was determined that 850 out of the 1000 watched the game (live or on television).
Assume that this was a simple random sample and that the Drexel undergraduate population is 20000.
(a) Generate an unbiased estimate of the true proportion of Drexel undergraduate students who watched
the game.
(b) What is your estimated standard error for the proportion estimate in (a)?
(c) Give a 95% confidence interval for the true proportion of Drexel undergraduate students who watched
the game.
Problem 2. (Exercise 18 in Chapter 7 of Rice) From independent surveys of two populations, 90% con-
fidence intervals for the population means are conducted. What is the probability that neither interval
contains the respective population mean? That both do?
Problem 3. (Exercise 23 in Chapter 7 of Rice)
(a) Show that the standard error of an estimated proportion is largest when p = 1/2.
(b) Use this result and Corollary B of Section 7.3.2 (also, on Page 17 of the lecture notes) to conclude that
the quantity
1
2
√
N − n
N(n − 1)
is a conservative estimate of the standard error of p̂ no matter what the value of p may be.
(c) Use the central limit theorem to conclude that the interval
p̂ ±
√
N − n
N(n − 1)
contains p with probability at least .95.
HW2_STAT206.pdf
Statistical Inference II: J. Lee Assignment 2
Problem 1. The following data set represents the number of NBA games in January 2016, watched by 10
randomly selected student in STAT 206.
7, 0, 4, 2, 2, 1, 0, 1, 2, 3
(a) What is the sample mean?
(b) Calculate sample variance.
(c) Estimate the mean number of NBA games watched by a student in January 2016.
(d) Estimate the standard error of the estimated mean.
Problem 2. True or false? Tell me why for the false statements.
(a) The center of a 95% confidence interval for the population mean is a random variable.
(b) A 95% confidence interval for µ contains the sample mean with probability .95.
(c) A 95% confidence interval contains 95% of the population.
(d) Out of one hundred 95% confidence intervals for µ, 95 will contain µ.
Problem 3. An investigator quantifies her uncertainty about the estimate of a population mean by reporting
X ± sX . What size confidence interval is?
Problem 4. For a random sample of size n from a population of size N, consider the following as an
estimate of µ:
Xc =
n∑
i=1
ciXi,
where the ci are fixed numbers and X1, . . . ,Xn are the sample. Find a condition on the ci such that the
estimate is unbiased.
Problem 5. A sample of size 100 has the sample mean X = 10. Suppose the we know that the population
standard deviation σ = 5. Find a 95% confidence interval for the population mean µ.
Problem 6. Suppose the we know that the population standard deviation σ = 5. Then how large should a
sample be to estimate the popula.
This document provides an introduction to probability and important concepts in probability theory. It defines probability as a measure of the likelihood of an event occurring based on chance. Probability can be estimated empirically by calculating the relative frequency of outcomes in a series of trials, or estimated subjectively based on experience. Classical probability uses an a priori approach to assign probabilities to outcomes that are considered equally likely, such as outcomes of rolling dice or drawing cards. The document provides examples and definitions of key probability terms and concepts such as sample space, events, axioms of probability, and approaches to calculating probability.
The document summarizes the binomial and Poisson probability distributions. The binomial distribution describes the number of successes in a fixed number of independent yes/no trials, where the probability of success is constant across trials. The Poisson distribution approximates the binomial when the number of trials is large and the probability of success is small. Examples are provided to demonstrate calculating probabilities using the binomial probability formula, binomial tables, and the Poisson probability formula with Poisson tables.
This document provides an overview of Bernoulli statistics and distributions, including key concepts like Bernoulli trials, the Bernoulli distribution, and the mean and variance of Bernoulli random variables. It also discusses the binomial distribution and how it arises from a series of independent Bernoulli trials. Finally, it covers the Poisson distribution and how it can model the number of events occurring in a fixed interval of time or space given a constant average rate of occurrence.
This document presents an analysis of the exponential distribution under an adaptive type-I progressive hybrid censoring scheme for competing risks data. Maximum likelihood and Bayesian estimation methods are used to estimate the distribution parameter. Specifically, maximum likelihood estimators are derived for the exponential distribution parameter. Bayesian estimators are also obtained for the parameter based on squared error and LINEX loss functions using gamma priors. Asymptotic confidence intervals and Bayesian credible intervals are proposed. A simulation study is conducted to evaluate the performance of the estimators.
This document discusses various discrete probability distributions including the binomial and Poisson distributions. It provides examples and formulas for calculating probabilities using these distributions. The key points covered are:
- The binomial distribution describes the probability of successes in a fixed number of yes/no trials where the probability of success is constant. It requires the trials to be independent and have two possible outcomes.
- The Poisson distribution describes the probability of events occurring randomly in time, space or volume. It applies when the number of events is large but the probability of any one event is small.
- Formulas are provided for calculating probabilities using the binomial and Poisson distributions, including the binomial probability formula and Poisson probability formula. Tables can also be used to find binomial and
Computational Pool-Testing with Retesting StrategyWaqas Tariq
Pool testing is a cost effective procedure for identifying defective items in a large population. It also improves the efficiency of the testing procedure when imperfect tests are employed. This study develops computational pool-testing strategy based on a proposed pool testing with re-testing strategy. Statistical moments based on this applied design have been generated. With advent of computers in 1980‘s, pool-testing with re-testing strategy under discussion is handled in the context of computational statistics. From this study, it has been established that re-testing reduces misclassifications significantly as compared to Dorfman procedure although re-testing comes with a cost i.e. increase in the number of tests. Re-testing considered improves the sensitivity and specificity of the testing scheme.
This thesis examines methods for estimating extinction times of species on the Mascarene Islands using occupancy data. Maximum likelihood estimation is used to fit population decline models, including exponential, linear, and changepoint models. Issues with sparse data and parameter redundancy make extinction time estimation difficult without external population information. Simulations show linear decline models can estimate extinction times well when the true population decline is linear, but tend to underestimate extinction times.
InstructionDue Date 6 pm on October 28 (Wed)Part IProbability a.docxdirkrplav
This document discusses implementing a social, environmental, and economic impact measurement system within a company. It explains that measuring sustainability performance is critical for evaluating projects, the company, and its members. A proper measurement system allows companies to develop a sustainability strategy, allocate resources to support it, and evaluate trade-offs between sustainability projects. The document provides examples from Nike and P&G of measuring impacts to demonstrate the business case for sustainability. It stresses that measurement is important for linking performance to sustainability principles and facilitating continuous improvement.
The document defines key concepts in probability and hypothesis testing. It discusses probability as a numerical quantity between 0 and 1 that expresses the likelihood of an event. Different probability distributions are covered, including binomial, normal, and Poisson distributions. Hypothesis testing is defined as a methodology to either accept or reject a null hypothesis based on sample data. Types of hypotheses, terms used in testing like test statistics and p-values, and types of errors are also summarized.
The document provides an introduction to probability concepts including:
1) Definitions of probability, sample space, outcomes, events, and interpretations of probability including classical, empirical, and subjective.
2) Examples of sample spaces for experiments like coin tosses and dice rolls.
3) Explanations of tree diagrams, Venn diagrams, and the addition rules for determining probabilities of mutually exclusive and non-mutually exclusive events.
4) Descriptions of joint, marginal, and conditional probabilities and examples of calculating each from contingency tables.
This document summarizes key probability distributions: binomial, Poisson, and normal. The binomial distribution describes the number of successes in fixed number of trials where the probability of success is constant. The Poisson distribution approximates the binomial when the number of trials is large and the probability of success is small. The normal distribution describes many continuous random variables and is symmetric with two parameters: mean and standard deviation. The document also discusses when binomial and Poisson distributions can be approximated as normal distributions.
- Hypothesis testing involves evaluating claims about population parameters by comparing a null hypothesis to an alternative hypothesis.
- The null hypothesis states that there is no difference or effect, while the alternative hypothesis states that a difference or effect exists.
- There are three main methods for hypothesis testing: the critical value method which separates a critical region from a noncritical region, the p-value method which calculates the probability of obtaining a test statistic at least as extreme as the sample test statistic assuming the null is true, and the confidence interval method which rejects claims not included in the confidence interval.
- The steps of hypothesis testing are to state the hypotheses, calculate the test statistic, find the critical value, make a decision to reject
4Probability and probability distributions.pdfAmanuelDina
Here are the key steps to solve this problem:
1) Draw the standard normal curve
2) The probability is the area between -2.55 and 2.55
3) From the standard normal table:
P(Z ≤ 2.55) = 0.9938
P(Z ≤ -2.55) = 0.0049
4) Use the area property:
P(-2.55 ≤ Z ≤ 2.55) = P(Z ≤ 2.55) - P(Z ≤ -2.55)
= 0.9938 - 0.0049
= 0.9889
Therefore, the probability that a z value will be between -2.55 and 2
Two events are independent if the occurrence of one event does not affect the probability of the other event occurring. The multiplication rule can be used to calculate the probability of independent events occurring. Two events are dependent if the occurrence of one event does affect the probability of the other event occurring. The multiplication rule must be modified to calculate probabilities of dependent events. The addition rule can be used to calculate the probability of one or more events occurring, whether the events are mutually exclusive or not mutually exclusive.
The document provides an overview of quantitative data analysis and various statistical concepts including the normal distribution, z-tests, confidence intervals, and t-tests. It discusses how the normal distribution was developed by de Moivre and Gauss. It then explains the key properties of the normal distribution and how it can be used to describe many natural phenomena. Examples are provided to illustrate how to calculate and interpret confidence intervals and choose the appropriate statistical test.
This document summarizes a lecture on probability, probability distributions, and sampling distributions. It includes examples of calculating probabilities of events using concepts like the binomial distribution, normal distribution, and sampling distributions. It also defines key terms like population, sample, parameter, and statistic. Sample problems are worked through, such as finding the probability that the mean length of 5 randomly selected trout is between 8 to 12 inches.
Understanding User Needs and Satisfying ThemAggregage
https://www.productmanagementtoday.com/frs/26903918/understanding-user-needs-and-satisfying-them
We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.
In this webinar, we won't focus on the research methods for discovering user-needs. We will focus on synthesis of the needs we discover, communication and alignment tools, and how we operationalize addressing those needs.
Industry expert Scott Sehlhorst will:
• Introduce a taxonomy for user goals with real world examples
• Present the Onion Diagram, a tool for contextualizing task-level goals
• Illustrate how customer journey maps capture activity-level and task-level goals
• Demonstrate the best approach to selection and prioritization of user-goals to address
• Highlight the crucial benchmarks, observable changes, in ensuring fulfillment of customer needs
Top mailing list providers in the USA.pptxJeremyPeirce1
Discover the top mailing list providers in the USA, offering targeted lists, segmentation, and analytics to optimize your marketing campaigns and drive engagement.
Part 2 Deep Dive: Navigating the 2024 Slowdownjeffkluth1
Introduction
The global retail industry has weathered numerous storms, with the financial crisis of 2008 serving as a poignant reminder of the sector's resilience and adaptability. However, as we navigate the complex landscape of 2024, retailers face a unique set of challenges that demand innovative strategies and a fundamental shift in mindset. This white paper contrasts the impact of the 2008 recession on the retail sector with the current headwinds retailers are grappling with, while offering a comprehensive roadmap for success in this new paradigm.
HOW TO START UP A COMPANY A STEP-BY-STEP GUIDE.pdf46adnanshahzad
How to Start Up a Company: A Step-by-Step Guide Starting a company is an exciting adventure that combines creativity, strategy, and hard work. It can seem overwhelming at first, but with the right guidance, anyone can transform a great idea into a successful business. Let's dive into how to start up a company, from the initial spark of an idea to securing funding and launching your startup.
Introduction
Have you ever dreamed of turning your innovative idea into a thriving business? Starting a company involves numerous steps and decisions, but don't worry—we're here to help. Whether you're exploring how to start a startup company or wondering how to start up a small business, this guide will walk you through the process, step by step.
Navigating the world of forex trading can be challenging, especially for beginners. To help you make an informed decision, we have comprehensively compared the best forex brokers in India for 2024. This article, reviewed by Top Forex Brokers Review, will cover featured award winners, the best forex brokers, featured offers, the best copy trading platforms, the best forex brokers for beginners, the best MetaTrader brokers, and recently updated reviews. We will focus on FP Markets, Black Bull, EightCap, IC Markets, and Octa.
The APCO Geopolitical Radar - Q3 2024 The Global Operating Environment for Bu...APCO
The Radar reflects input from APCO’s teams located around the world. It distils a host of interconnected events and trends into insights to inform operational and strategic decisions. Issues covered in this edition include:
How MJ Global Leads the Packaging Industry.pdfMJ Global
MJ Global's success in staying ahead of the curve in the packaging industry is a testament to its dedication to innovation, sustainability, and customer-centricity. By embracing technological advancements, leading in eco-friendly solutions, collaborating with industry leaders, and adapting to evolving consumer preferences, MJ Global continues to set new standards in the packaging sector.
At Techbox Square, in Singapore, we're not just creative web designers and developers, we're the driving force behind your brand identity. Contact us today.
Event Report - SAP Sapphire 2024 Orlando - lots of innovation and old challengesHolger Mueller
Holger Mueller of Constellation Research shares his key takeaways from SAP's Sapphire confernece, held in Orlando, June 3rd till 5th 2024, in the Orange Convention Center.
❼❷⓿❺❻❷❽❷❼❽ Dpboss Matka Result Satta Matka Guessing Satta Fix jodi Kalyan Final ank Satta Matka Dpbos Final ank Satta Matta Matka 143 Kalyan Matka Guessing Final Matka Final ank Today Matka 420 Satta Batta Satta 143 Kalyan Chart Main Bazar Chart vip Matka Guessing Dpboss 143 Guessing Kalyan night
Taurus Zodiac Sign: Unveiling the Traits, Dates, and Horoscope Insights of th...my Pandit
Dive into the steadfast world of the Taurus Zodiac Sign. Discover the grounded, stable, and logical nature of Taurus individuals, and explore their key personality traits, important dates, and horoscope insights. Learn how the determination and patience of the Taurus sign make them the rock-steady achievers and anchors of the zodiac.
Building Your Employer Brand with Social MediaLuanWise
Presented at The Global HR Summit, 6th June 2024
In this keynote, Luan Wise will provide invaluable insights to elevate your employer brand on social media platforms including LinkedIn, Facebook, Instagram, X (formerly Twitter) and TikTok. You'll learn how compelling content can authentically showcase your company culture, values, and employee experiences to support your talent acquisition and retention objectives. Additionally, you'll understand the power of employee advocacy to amplify reach and engagement – helping to position your organization as an employer of choice in today's competitive talent landscape.
[To download this presentation, visit:
https://www.oeconsulting.com.sg/training-presentations]
This presentation is a curated compilation of PowerPoint diagrams and templates designed to illustrate 20 different digital transformation frameworks and models. These frameworks are based on recent industry trends and best practices, ensuring that the content remains relevant and up-to-date.
Key highlights include Microsoft's Digital Transformation Framework, which focuses on driving innovation and efficiency, and McKinsey's Ten Guiding Principles, which provide strategic insights for successful digital transformation. Additionally, Forrester's framework emphasizes enhancing customer experiences and modernizing IT infrastructure, while IDC's MaturityScape helps assess and develop organizational digital maturity. MIT's framework explores cutting-edge strategies for achieving digital success.
These materials are perfect for enhancing your business or classroom presentations, offering visual aids to supplement your insights. Please note that while comprehensive, these slides are intended as supplementary resources and may not be complete for standalone instructional purposes.
Frameworks/Models included:
Microsoft’s Digital Transformation Framework
McKinsey’s Ten Guiding Principles of Digital Transformation
Forrester’s Digital Transformation Framework
IDC’s Digital Transformation MaturityScape
MIT’s Digital Transformation Framework
Gartner’s Digital Transformation Framework
Accenture’s Digital Strategy & Enterprise Frameworks
Deloitte’s Digital Industrial Transformation Framework
Capgemini’s Digital Transformation Framework
PwC’s Digital Transformation Framework
Cisco’s Digital Transformation Framework
Cognizant’s Digital Transformation Framework
DXC Technology’s Digital Transformation Framework
The BCG Strategy Palette
McKinsey’s Digital Transformation Framework
Digital Transformation Compass
Four Levels of Digital Maturity
Design Thinking Framework
Business Model Canvas
Customer Journey Map
The Genesis of BriansClub.cm Famous Dark WEb PlatformSabaaSudozai
BriansClub.cm, a famous platform on the dark web, has become one of the most infamous carding marketplaces, specializing in the sale of stolen credit card data.
How to Implement a Real Estate CRM SoftwareSalesTown
To implement a CRM for real estate, set clear goals, choose a CRM with key real estate features, and customize it to your needs. Migrate your data, train your team, and use automation to save time. Monitor performance, ensure data security, and use the CRM to enhance marketing. Regularly check its effectiveness to improve your business.
3 Simple Steps To Buy Verified Payoneer Account In 2024SEOSMMEARTH
Buy Verified Payoneer Account: Quick and Secure Way to Receive Payments
Buy Verified Payoneer Account With 100% secure documents, [ USA, UK, CA ]. Are you looking for a reliable and safe way to receive payments online? Then you need buy verified Payoneer account ! Payoneer is a global payment platform that allows businesses and individuals to send and receive money in over 200 countries.
If You Want To More Information just Contact Now:
Skype: SEOSMMEARTH
Telegram: @seosmmearth
Gmail: seosmmearth@gmail.com
Zodiac Signs and Food Preferences_ What Your Sign Says About Your Tastemy Pandit
Know what your zodiac sign says about your taste in food! Explore how the 12 zodiac signs influence your culinary preferences with insights from MyPandit. Dive into astrology and flavors!
Zodiac Signs and Food Preferences_ What Your Sign Says About Your Taste
BA4101 Question Bank.pdf
1. BA4101
STATISTICS FOR MANAGEMENT
QUESTION BANK
UNIT I INTRODUCTION
Basic definitions and rules for probability, conditional probability independence of events,
Bayes’ theorem, and random variables, Probability distributions: Binomial, Poisson, Uniform
and Normal distributions.
PART A
1. Define probability. (CO1, K1)
Probability is the branch of mathematics that studies the likelihood of occurrence of
random events in order to predict the behaviour of a defined system.
The probability of an event A,
P(A) =
Number of case𝑠𝑓𝑎𝑣𝑜𝑢𝑟𝑎𝑏𝑙𝑒 𝑡𝑜 𝑡ℎ𝑒 𝑜𝑐𝑐𝑢𝑟𝑎𝑛𝑐𝑒 𝑜𝑓 𝑡ℎ𝑒 𝑒𝑣𝑒𝑛𝑡
𝑇𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑚𝑢𝑡𝑢𝑎𝑙𝑙𝑦 𝑒𝑥𝑐𝑙𝑢𝑠𝑖𝑣𝑒 𝑎𝑛𝑑 𝑒𝑥ℎ𝑎𝑢𝑠𝑡𝑒𝑑 𝑒𝑣𝑒𝑛𝑡𝑠
2. What is sample space? (CO1, K1)
A sample space is a collection of all possible outcomes of a random experiment.
Mathematically, the sample space is denoted by the symbol S.
e.g.) the sample space of tossing two coins is the set of all possible outcomes: HH,
HT, TH, and TT, where T for Tail and H for Head.
3. Define random variable. (CO1, K1)
When the numerical value of a variable is determined by a chance event i.e., by
conducting random experiment, that variable is called a random variable. The value of
a random variable will vary from trail to trail as the experiment is repeated.
4. Bring out the rules for probability. (CO1, K1)
1) The probability of occurrence of an event A should be: 0 ≤ P(A) ≤ 1 in a
sample space.
2) The related probability of sample space S is 1, i.e., P(S) = 1.
3) The related probability of occurring event A or event B or both events is equal
to the sum of the probabilities of the events individually, i.e., P(A or B) = P(A)
+ P(B).
4) If P(A) is the probability of occurring an event A then the probability of the
event which does not occur is: P(Ã) = 1 – P(A), where à is the non-
occurrence of P(A).
2. 5. Define mutually exclusive events. (CO1, K1)
Two or more events are said to be mutually exclusive events if it is not possible for
them to occur together at the same time.
e.g.) experiment of throwing a six-sided dice, A be the event of getting odd number A
= {1, 3, 5} whereas B be the event of getting even numbers B = {2, 4, 6}. Here, A and
B cannot occur together. So, it is said to be mutually exclusive events.
6. Define conditional probability. (CO1, K1)
Conditional probability involves estimating the probability of occurrence of a
particular event (A) as the conditional upon the occurrence of a particular event (B). It
is written as P(A/B)
P(A/B) =
P (A ∩ B)
𝑃(𝐴)
7. What do you mean by probability distribution of a discrete random variable?
(CO1, K1)
In a probability distribution, a random variable X can only take the value of discrete
integers i.e., countable values like 0, 1, 2, 3, 4, … And the probability distribution is
called discrete probability distribution.
8. State the Bayes’ theorem. (CO1, K1)
Bayes’ theorem states that the conditional probability of an event, based on the
occurrence of another event, is equal to likelihood of the second event given the first
event multiplied by the probability of the first event.
9. Suppose that X has a Poisson distribution with a parameter m = 2. Compute P[X≥1]
whereas e-2
= 0.1353. (CO1, K3)
Using the Poisson Distribution f(x) =
𝑒−𝑚𝑚𝑥
𝑥!
where x = 0, 1, 2, …
10. Write the Mean, Variance and Standard Deviation of uniform distribution. (CO1, K1)
Mean, µ =
𝑎+𝑏
2
, Variance, σ2
=
(𝑏−𝑎)2
12
and Standard Deviation, σ =
𝑏−𝑎
√12
PART B
1. Suppose that ¾ % of a population have a terminal disease and that the test to detect
this disease is 99% accurate in identifying those with the disease and 95% accurate in
3. identifying those without the disease. Compute the probability that one has the disease
given that the test so indicates. (CO1, K3)
2. The average number of defective chips manufactured daily at the plant is 5. Assume
the number of defects is a Poisson random variable X. compute mean and variance of
X if P[x = 0] = 0.0497. (CO1, K3)
3. A bag contains 5 balls and it is not known how many of them are white. Two balls are
drawn at random from the bag and they are noted to be white. What is the probability
that all the balls in the bag are white? (CO1, K3)
4. If the actual amount of instant coffee which filling puts into 6-ounce jar is a random
variable having a normal distribution with SD = 0.05 ounce and if only 3% of the jar
are to contain less than 6 ounces of coffee, what must be the mean fill of these jars?
(CO1, K3)
5. The incident of occupational disease in an industry is such that workers have a 20%
chance of suffering from it. What is the probability that out of six workers 3 or more
will contract the disease? (CO1, K3)
6. A Market analyst believes that the stock market has a 0.70 probability of going up in
the next year if the economy should do well, and a 0.20 probability of going up if the
economy should not do well during the year. The analyst believes that there is a 0.80
probability that the economy will do well in the coming year. What is the probability
that stock market will go up next year? (CO1, K3)
PART C
1. Sun Network took a survey of 500 Sun Tv viewers to determine people favourite film
to telecast on Diwali matni time show,
Male Female
Ayan 80 100
Sarkar 100 25
Other 50 125
a) Draw the probability distribution for the above given data.
b) What is the probability of favourite film being Ayan?
c) What is the probability of female?
d) What is the probability of a Sun Tv viewer being male and preferring Sarkar?
e) What is the probability of a Sun Tv viewer being male or preferring Sarkar?
f) Amala is a Sun Tv viewer, what is the chance that her favourite film will be
Ayan?
g) Find out whether gender influences the choices of films for the above given data.
4. (CO1, K3)
2. An aptitude test was conducted on 600 employees of the Provincial Life Care Limited
in which the mean score was found to be 60 with standard deviation of 25.
Find:
i. What is the number of employees whose mean score was less than 30?
ii. What was the number of employees whose mean score exceed 80?
iii. What was the number of employees whose mean score is between 30 and 90?
(CO1, K3)
5. UNIT II SAMPLING DISTRIBUTION AND ESTIMATION
Basic definitions and rules for probability, conditional probability independence of events,
Bayes’ theorem, and random variables, Probability distributions: Binomial, Poisson, Uniform
and Normal distributions.
PART A
1. What is the central limit theorem? (CO2, K1)
Central limit theorem states that when an infinite number of successive random
samples are taken from a population, the sampling distribution of the means of those
samples will become approximately normally distributed with mean μ and standard
deviation
𝜎
√𝑁
, irrespective of the form of the population distribution.
2. What is sampling distribution? (CO2, K1)
The probability distribution of statistic (mean, proportion, standard deviation) of
samples of size ‘n’ from a given population is called sampling distribution.
3. What is sampling method and its types? (CO2, K1)
A sampling method is a way of selecting a group of individuals from a population to
conduct research on. There are two types of sampling methods: probability sampling
and non-probability sampling.
4. What is the systematic sampling? (CO2, K1)
Systematic sampling is a type of probability sampling method in which sample
members from a larger population are selected according to a random starting point
but with a fixed, periodic interval. This interval, called the sampling interval, is
calculated by dividing the population size by the desired sample size.
5. What is the stratified sampling? (CO2, K1)
Stratified random sampling is a form of probability sampling that provides a
methodology for dividing a population into smaller subgroups as a means of ensuring
greater accuracy of your high-level survey results. The smaller subgroups are called
strata. Stratified random sampling is also called proportional or quota random
sampling.
6. Define the term simple random sampling. (CO2, K1)
Simple random sampling is a type of probability sampling in which the researcher
randomly selects a subset of participants from a population. Each member of the
population has an equal chance of being selected.
7. What is statistical inference and its applications? (CO2, K1)
6. Statistical inference is the process of analysing the result and making conclusions
from data subject to random variation. It is also called inferential statistics.
Hypothesis testing and confidence intervals are the applications of the statistical
inference.
8. What is statistical estimation? (CO2, K1)
Statistical estimation refers to the process by which one makes inferences about a
population, based on information obtained from a sample.
9. What is point estimation? (CO2, K1)
A point estimate of a population parameter is a single value of a statistic. For
example, the sample mean 𝑋
̅ is a point estimate of the population mean μ. Similarly,
the sample proportion p is a point estimate of the population proportion P.
10. What is interval estimation? (CO2, K1)
An interval estimate is defined by two numbers, between which a population
parameter is said to lie. For example, a < x < b is an interval estimate of the
population mean μ. It indicates that the population mean is greater than ‘a’ but less
than ‘b’
PART B
1. Illustrate sampling distribution. Explain its properties. (CO2, K2)
2. A random sample of 144 observations yields sample mean 𝑋
̅ = 160 and sample
variance s2
= 100. Compute a 95% confidence interval for the population mean.
(CO2, K3)
3. Explain the relationship between sample size and error. (CO2, K3)
4. Briefly explain the confidence interval for proportion and mean. (CO2, K2)
5. Explain statistical estimations with suitable examples. (CO2, K2)
6. How do you find confidential interval of population mean in both large sample and
small sample case? (CO2, K2)
PART C
1. The BOB (better of bests) departmental store wants to have an idea of the level of
satisfaction of its customers. The owner wants the exercise to be done on a Sunday
when about 2000 customers visits the store. The store is spread over two floors and
has ten cash counters. Discuss as to how a survey can be conducted for accessing the
overall level of satisfaction on a scale of 0 – 10. The management of the store feels
that a sample of 100 customers could be sufficient for the purpose. (CO2, K4)
7. 2. Discuss various techniques you would apply for sampling from a population.
(CO2, K2)
8. UNIT 3 TESTING OF HYPOTHESIS - PARAMETIRC TESTS
Hypothesis testing: one sample and two sample tests for means and proportions of large samples (z-
test), one sample and two sample tests for means of small samples (t-test), F-test for two sample
standard deviations. ANOVA one and two way
PART A
1. What is hypothesis testing? (CO3, K1)
Hypothesis testing is a statistical method used to determine if there is enough
evidence in a sample data to draw conclusions about a population
2. What is Type I and Type II error? (CO3, K1)
A type I error (false-positive) occurs if an investigator rejects a null hypothesis that is
actually true in the population; a type II error (false-negative) occurs if the
investigator fails to reject a null hypothesis that is actually false in the population.
H0
True False
Reject H0 Type I error -
Accept H0 - Type II error
3. What is null hypothesis? (CO3, K1)
A null hypothesis is a type of statistical hypothesis that proposes that no statistical
significance exists in a set of given observations. It is represented as H0.
4. What are the various elements in hypothesis testing? (CO3, K1)
The hypothesis test consists of several elements namely two statements (the null
hypothesis and the alternative hypothesis), the test statistic which in turn give us the
P-value and the critical value which in turn give us the rejection region.
5. Describe various stages involved in hypothesis testing. (CO3, K1)
The four steps of hypothesis testing include stating the hypotheses, formulating an
analysis plan, analysing the sample data, and analysing the result.
6. What do you mean by analysis of variance? (CO3, K1)
Analysis of Variance (ANOVA) is a statistical formula used to compare variances
across the means (or average) of different groups. It is used to determine if there is
any difference between the means of different groups.
7. What are the assumptions of ANOVA? (CO3, K1)
The three primary assumptions in ANOVA are
i. Population is normally distributed
ii. These distributions have the same variance
iii. The data are independent.
9. 8. When does the Z – Test apply? (CO3, K1)
Z test is used if there is a significant variation in the sample and population means. It
is used to test how an observed value of coefficient correlation r varies significantly
from the hypothetical value and to test whether two sample values of coefficient
correlation r vary significantly.
9. State Critical Value. (CO3, K1)
A critical value is the value of the test statistic which defines the upper and lower
bounds of a confidence interval, or which defines the threshold of statistical
significance in a statistical test.
10. Discuss the difference between parametric and non-parametric test. (CO3, K2)
In Statistics, a parametric test is a kind of hypothesis test which gives generalizations
for generating records regarding the mean of the primary/original population. The
non-parametric test does not require any population distribution, which is meant by
distinct parameters. It is also a kind of hypothesis test, which is not based on the
underlying hypothesis. In the case of the non-parametric test, the test is based on the
differences in the median. So, this kind of test is also called a distribution-free test.
PART B
1. Given a sample mean of 83, a sample standard deviation of 12.5 and the sample size
of 22, test the hypothesis that the value of the proportion mean is 70 against the
alternative that it is more than 70. Use the 0.025 significance level. (CO3, K3)
2. Two samples are drawn from two normal population. From the following data, test
whether the two samples have the same variance at 5% level of significance:
(CO3, K3)
Sample 1 60 65 71 74 76 82 85 87
Sample 2 61 66 67 85 78 63 85 86 88 91
3. Two independent samples of eight and seven items respectively following values of
the variable:
Sample 1 9 11 13 11 15 9 12 14
Sample 2 10 12 10 14 9 8 10
Do the two estimates of the population variance differ significantly at 5% level of
significant? (CO3, K3)
4. Four doctors each test four treatments for a certain disease and observe the number of
days each patient takes to recover. The results are as follows: (recovery time in days)
(CO3, K3)
10. Doctors
Treatment
1 2 3 4
A 10 14 19 20
B 11 15 17 21
C 9 12 16 19
D 8 13 17 20
Discuss the difference between (i) Doctors and (ii) treatments
5. A social experiment shows that in a group 20% people are ready to sell their votes for
money when they are offered a small amount. In another group, 40% people are ready
to sell their votes when they are offered huge sum money. In both the cases, 1000
members each were participated. Test at 5% level of significance (Two – sided) that
there is a difference two proportions. (CO3, K3)
6. A manufacturing company has purchased three new machines of different makes and
wishes to determine whether one of them is faster than the others in producing a
certain output. Five hourly production figures are observed at random from each
machine and the results are given below: (CO3, K3)
A1 25 30 36 38 31
A2 31 39 38 42 35
A3 24 30 28 25 28
Use analysis of variance and determine whether the machines are significantly different in
their mean speed of 5% level of significance.
PART C
1. In a feeding experiment of swine, three ratios R1, R2, R3 were tried. The animals were put
into three classes of three each according to litter and initial body weight. The following table
gives the gain in the body weight in Kg in a certain period. Analyse the data and state your
conclusion. (CO3, K3)
Class I Class II Class III
R1
R2
R3
4
14
3
16
18
14
10
19
7
2. At one of the management institutes, it is found that people come from diverse educational
background and from different cities across India, which could be Metro, Large, Medium, or
Small. To see if these two factors are dependent on each other, data about student having
different background such as B. Tech, B. Com, B.A, C. A and other with their corresponding
size of the cities is recorded. This data is shown below in table. At the 0.05 significant level,
does educational background differ according to the size of the cities to which these students
belong? (CO3, K3)
Educational
Background
Metro Large
Medium /
Small
Total
B. Tech 15 25 15 55
11. B. Com 35 20 15 70
B. Sc 10 10 5 25
B. A 15 10 20 45
C. A 10 5 4 20
Other 15 10 10 35
12. UNIT IV NON-PARAMETRIC TESTS
Chi-square test for single sample standard deviation. Chi-square tests for independence of attributes
and goodness of fit. Sign test for paired data. Rank sum test. Kolmogorov-Smirnov – test for goodness
of fit, comparing two populations. Mann – Whitney U test and Kruskal Wallis test. One sample run
test.
1. What is the chi-square test for a single variance? (CO4, K1)
The Chi-Square Test for One Variance is a statistical test used to compare the
variance of a sample to a known population variance. It is used to test a hypothesis
about the population variance.
2. Define rank sum test. (CO4, K1)
Rank sum test also known as Wilcoxon signed rank test is a non-parametric test used
for testing the difference of median in a paired data.
3. What is meant by paired data? (CO4, K1)
Paired data means that the two group’s value being compared are linked naturally and
rise from the individuals that are being measured more than one time usually.
4. State the working rule framed in Mann-Whitney test. (CO4, K1)
a) Set null hypothesis.
b) Combine all samples in array and arrange.
c) Find the ranks.
d) Calculation of U test.
5. What is Kruskal Wallis test and when it is used? (CO4, K1)
Kruskal Wallis test is a nonparametric (distribution free) test which assesses for
significant differences on a continuous dependent variable by a categorical
independent variable (with two or more groups). It is used when the assumptions of
one-way ANOVA are not met.
6. What are the two types of Pearson’s Chi-Square tests? (CO4, K1)
a) The Chi-Square goodness of fit test
b) The Chi-Square test of independence
7. State the uses of Chi-Square test. (CO4, K1)
The chi-square goodness of fit test is used to test whether the frequency distribution of
a categorical variable is different from the expectations. The chi-square test of
independence is used to test whether two categorical variables are related to each
other.
8. Write the formula for Kruskal Wallis test. (CO4, K1)
13. H =
12
𝑛(𝑛+1)
∑
𝑅𝑖
2
𝑛𝑖
− 3(𝑛 + 1)
𝑘
𝑖=1
Where, H = Kruskal Wallis test; n = Total number of observations in all
sample; k = Number of independent samples; ni = Number of cases in the ith
sample;
Ri = Rank of the sample
9. Write down the formula for Chi-Square test of standard deviation. (CO4, K1)
X2
=
(𝑛−1)𝑠2
𝜎2
for n – 1 degree of freedom
Where, n = Sample size; s = Standard Deviation; s2
= Variance sample; σ =
Expected standard deviation; σ2
= Expected variance
10. What is run test? (CO4, K1)
The runs test is a statistical test to determine whether random selection has been made
in the process of sample selection from an ordered population.
PART B
1. Discuss on One-Sample and Two Sample Sign Test. (CO4, K2)
2. The following is an arrangement of 25 men M and 15 women W lined up to purchase
tickets for a premier picture show: M WW MMM W MM W M W M WWW MMM
W MM WWW MMMMMM WWW MMMMMM Test for randomness at 5% level of
significance. (CO4, K3)
3. Verify whether Poisson distribution can be assumed from the data given below:
(CO4, K3)
No. of Defects 0 1 2 3 4 5
Observed Frequency 6 13 13 8 4 3
4. Melisa’s Boutique has three mall locations. Melisa keeps a daily record for each
location of the number of customers who actually make a purchase. A sample of those
data follows. Using the Kruskal Wallis test, can you say at the 0.05 level of
significance that her stores have the same number of customers who buy? (CO4, K3)
X mall 99 64 101 85 79 88 97 95 90 100
Y mall 83 102 125 61 91 96 94 89 93 75
Z mall 89 98 56 105 87 90 87 101 76 89
5. Fit a binomial distribution for the following data and also test the goodness of fit.
(CO4, K3)
x 0 1 2 3 4 5 6 Total
f 5 18 28 12 7 6 4 80
6. Use Mann Whitney U test to determine whether there is a difference at 5% level of
significance between cables made of Alloy I and Alloy II. (CO4, K3)
14. Alloy I
18.3 16.4 22.7 17.8
18.9 25.3 16.1 24.2
Alloy II
12.6 14.1 20.5 10.7 15.9
19.6 12.9 15.2 11.8 14.7
PART C
1. Use the run test to test the randomness of return on RIL, Infosys, NIFTY and Dollar, based on
the following data.
(CO4, K3)
Date Infosys Reliance Nifty Dollar Value
01-12-2018 2174.9 961.6 28559.6 62.05
02-12-2018 2126.6 962.7 28444 61.9
03-12-2018 2123.5 968.4 28442.7 61.87
04-12-2018 2101.8 958.6 28562.8 61.93
05-12-2018 2070.3 957.4 28458.1 61.83
08-12-2018 1970.2 944.6 28119.4 61.97
09-12-2018 1964.8 939.9 27791.1 61.88
10-12-2018 1963.8 932.6 27831.1 61.8
11-12-2018 1921.1 906.2 27602 62.04
12-12-2018 1938.7 882.4 27350.7 62.25
15-12-2023 1924.8 878.8 27319.6 62.2
2. Discuss on principles and techniques of various non-parametric tests. (CO4, K2)
15. UNIT V CORRELATION AND REGRESSION
Correlation – Coefficient of Determination – Rank Correlation – Regression – Estimation of
Regression line – Method of Least Squares – Standard Error of estimate.
PART A
1. What is Correlation? (CO5, K1)
Correlation is a statistical term describing the degree to which two variables move in
coordination with one another. If the two variables move in the same direction, then
those variables are said to have a positive correlation. If they move in opposite
directions, then they have a negative correlation.
2. What is the use of correlation and regression? (CO5, K2)
Correlation and Regression are the most commonly used techniques for investigating
the relationship between two quantitative variables. Correlation quantifies the strength
of the linear relationship between a pair of variables, whereas regression expresses the
relationship in the form of an equation.
3. What can be the values for correlation coefficient? (CO5, K1)
The value of a correlation coefficient an vary from -1 to 1. -1 indicates a perfect
negative correlation and +1 indicates a perfect positive correlation. A correlation
coefficient of zero means there is no relationship between the two variables.
4. What is the interpretation of the correlation coefficient values? (CO5, K2)
When there is a negative correlation between two variables as the value of one
variable increases, the value of the other variable decreases and vice versa. In other
words, for a negative correlation the variables work apposite each other. When there
is a positive correlation between two variables as the value of one variable increases
the value of the other variable also increases. The variables move together.
5. What is simple regression? (CO5, K1)
Simple linear regression is a regression model that estimates the relationship between
one independent variable and one dependent variable using a straight line. Both
variables should be quantitative.
6. What is the use of regression? (CO5, K2)
The regression statistics can be used to predict the dependent variable when the
independent variable is known. Regression goes beyond correlation by adding
prediction capabilities.
7. Define rank correlation. (CO5, K1)
16. It is a measure of correlation which is used when quantitative measures for certain
factors. It can be arranging in serial order.
8. Define least square method. (CO5, K1)
The least square method is the process of finding the best-fitting curve or line of best
fit for a set of data points by reducing the sum of the squares of the offsets (residual
part) of the points from the curve.
9. List advantages of least square. (CO5, K1)
a) Objective method.
b) Easy calculation.
c) Determines trend values.
d) Flexible method.
10. What is coefficient of determination? (CO5, K1)
The coefficient of determination (R²) measures how well a statistical model predicts
an outcome. The outcome is represented by the model's dependent variable. The
lowest possible value of R² is 0 and the highest possible value is 1.
PART B
1. Explain the Karl Pearson’s coefficient of correlation and Spearman’s rank correlation.
(CO5, K2)
2. Explain regression line and why are there two regression lines? When do we use one
in the preference to the other? (CO5, K2)
3. Study the correlation from the given data of industrial city. (CO5, K3)
Sales (‘000 in Rs.) 125 170 175 180 190 210 250 300 320 400
Profit (‘000 in Rs.) 20 29 32 35 34 41 55 60 64 70
4. The following table gives the age of bike and annual maintenance cost. Obtain the
regression equation. Also find the maintenance cost of the bike whose age is 12 years
old. (CO5, K3)
Age of Bike (years) 2 5 7 11 15
Maintenance Cost (in Thousands) 1 3 5 8 10
5. Cost accounts often estimate overhead based on the level of production. At the
Standard Knitting Co., they have collected information on the overhead expenses and
units produced at different plants and wants to estimate a regression equation to
predict future overheads. (CO5, K3)
Overheads 191 170 272 155 280 173 234 116 153 178
Units 40 42 53 35 56 39 48 30 37 40
i. Develop the regression equation for the cost accountants
17. ii. Predict overheads when 50 units are produced
6. Find the coefficient of correlation between X and Y using the following data:
(CO5, K3)
X 65 67 66 71 67 70 68 69
Y 67 68 68 70 64 67 72 70
PART C
1. Obtain the equation of regression lines from the following data using method of least
square. Hence find the co-efficient of correlation between x and y. also estimate the
value of: (CO5, K3)
i. Y when x = 38
ii. X when y = 18
X 20 26 29 30 31 31 34 35
Y 20 20 21 29 27 24 27 31
2. Find the standard error of estimate of y on x and x on y from the following data:
(CO5, K3)
x 1 2 3 4 5
y 2 5 9 13 14