The document discusses various measures of central tendency including the mean, median, and mode. It provides definitions and formulas for calculating the arithmetic mean, weighted arithmetic mean, mean of composite groups, and harmonic mean. The arithmetic mean is calculated by summing all values and dividing by the total number of values. It is impacted by outliers. The harmonic mean gives more weight to smaller values and is used to average rates or speeds. Examples are provided to demonstrate calculating the different types of means.
This document discusses key concepts in statistical estimation including:
- Estimation involves using sample data to infer properties of the population by calculating point estimates and interval estimates.
- A point estimate is a single value that estimates an unknown population parameter, while an interval estimate provides a range of plausible values for the parameter.
- A confidence interval gives the probability that the interval calculated from the sample data contains the true population parameter. Common confidence intervals are 95% confidence intervals.
- Formulas for confidence intervals depend on whether the population standard deviation is known or unknown, and the sample size.
Confidence Intervals: Basic concepts and overviewRizwan S A
This document provides an overview of confidence intervals. It defines confidence intervals and describes their use in statistical inference to estimate population parameters. It explains that a confidence interval provides a range of plausible values for an unknown population parameter based on a sample statistic. The document outlines the key steps in calculating a confidence interval, including determining the point estimate, standard error, and critical value corresponding to the desired confidence level. It discusses how the width of the confidence interval indicates the precision of the estimate and is affected by factors like the sample size and population variability.
This document discusses sampling distributions and their relationship to statistical inference. It defines key terms like population, parameter, sample, and statistic. A sampling distribution describes the possible values of a statistic calculated from random samples of the same size from a population. It explains that there are population distributions, sample data distributions, and sampling distributions. The mean and spread of a sampling distribution determine if a statistic is an unbiased estimator and how variable it is. Larger sample sizes result in smaller variability in the sampling distribution.
Regression analysis is a statistical technique used to estimate the relationships between variables. It allows one to predict the value of a dependent variable based on the value of one or more independent variables. The document discusses simple linear regression, where there is one independent variable, as well as multiple linear regression which involves two or more independent variables. Examples of linear relationships that can be modeled using regression analysis include price vs. quantity, sales vs. advertising, and crop yield vs. fertilizer usage. The key methods for performing regression analysis covered in the document are least squares regression and regressions based on deviations from the mean.
This document provides an overview of non-parametric statistics. It defines non-parametric tests as those that make fewer assumptions than parametric tests, such as not assuming a normal distribution. The document compares and contrasts parametric and non-parametric tests. It then explains several common non-parametric tests - the Mann-Whitney U test, Wilcoxon signed-rank test, sign test, and Kruskal-Wallis test - and provides examples of how to perform and interpret each test.
This document discusses statistical concepts such as parameters, statistics, descriptive statistics, estimation, and hypothesis testing. It provides examples of:
- Point estimates and interval estimates used to estimate population parameters from sample statistics. Point estimates provide a single value while interval estimates provide a range of values.
- Confidence intervals which specify a range of values that is expected to contain the population parameter a certain percentage of times, known as the confidence level. Common confidence levels are 90%, 95%, and 99%.
- Formulas for constructing confidence intervals for the population mean, proportion, and variance based on the sample statistic, sample size, confidence level, and whether the population standard deviation is known.
1) The document provides information about statistics homework help and tutoring services offered by Homework Guru. It discusses various types of statistics help available, including online tutoring, homework help, and exam preparation.
2) Key aspects of their tutoring services are highlighted, including the qualifications of tutors, availability, and interactive online classrooms. Confidence intervals and how to calculate them are also explained in detail.
3) Examples are given to demonstrate how to calculate 95% and 99% confidence intervals for a population mean when the population standard deviation is known or unknown. Interval estimation procedures and when to use z-tests or t-tests are summarized.
This document discusses sampling and sampling distributions. It begins by explaining why sampling is preferable to a census in terms of time, cost and practicality. It then defines the sampling frame as the listing of items that make up the population. Different types of samples are described, including probability and non-probability samples. Probability samples include simple random, systematic, stratified, and cluster samples. Key aspects of each type are defined. The document also discusses sampling distributions and how the distribution of sample statistics such as means and proportions can be approximated as normal even if the population is not normal, due to the central limit theorem. It provides examples of how to calculate probabilities and intervals for sampling distributions.
This document discusses key concepts in statistical estimation including:
- Estimation involves using sample data to infer properties of the population by calculating point estimates and interval estimates.
- A point estimate is a single value that estimates an unknown population parameter, while an interval estimate provides a range of plausible values for the parameter.
- A confidence interval gives the probability that the interval calculated from the sample data contains the true population parameter. Common confidence intervals are 95% confidence intervals.
- Formulas for confidence intervals depend on whether the population standard deviation is known or unknown, and the sample size.
Confidence Intervals: Basic concepts and overviewRizwan S A
This document provides an overview of confidence intervals. It defines confidence intervals and describes their use in statistical inference to estimate population parameters. It explains that a confidence interval provides a range of plausible values for an unknown population parameter based on a sample statistic. The document outlines the key steps in calculating a confidence interval, including determining the point estimate, standard error, and critical value corresponding to the desired confidence level. It discusses how the width of the confidence interval indicates the precision of the estimate and is affected by factors like the sample size and population variability.
This document discusses sampling distributions and their relationship to statistical inference. It defines key terms like population, parameter, sample, and statistic. A sampling distribution describes the possible values of a statistic calculated from random samples of the same size from a population. It explains that there are population distributions, sample data distributions, and sampling distributions. The mean and spread of a sampling distribution determine if a statistic is an unbiased estimator and how variable it is. Larger sample sizes result in smaller variability in the sampling distribution.
Regression analysis is a statistical technique used to estimate the relationships between variables. It allows one to predict the value of a dependent variable based on the value of one or more independent variables. The document discusses simple linear regression, where there is one independent variable, as well as multiple linear regression which involves two or more independent variables. Examples of linear relationships that can be modeled using regression analysis include price vs. quantity, sales vs. advertising, and crop yield vs. fertilizer usage. The key methods for performing regression analysis covered in the document are least squares regression and regressions based on deviations from the mean.
This document provides an overview of non-parametric statistics. It defines non-parametric tests as those that make fewer assumptions than parametric tests, such as not assuming a normal distribution. The document compares and contrasts parametric and non-parametric tests. It then explains several common non-parametric tests - the Mann-Whitney U test, Wilcoxon signed-rank test, sign test, and Kruskal-Wallis test - and provides examples of how to perform and interpret each test.
This document discusses statistical concepts such as parameters, statistics, descriptive statistics, estimation, and hypothesis testing. It provides examples of:
- Point estimates and interval estimates used to estimate population parameters from sample statistics. Point estimates provide a single value while interval estimates provide a range of values.
- Confidence intervals which specify a range of values that is expected to contain the population parameter a certain percentage of times, known as the confidence level. Common confidence levels are 90%, 95%, and 99%.
- Formulas for constructing confidence intervals for the population mean, proportion, and variance based on the sample statistic, sample size, confidence level, and whether the population standard deviation is known.
1) The document provides information about statistics homework help and tutoring services offered by Homework Guru. It discusses various types of statistics help available, including online tutoring, homework help, and exam preparation.
2) Key aspects of their tutoring services are highlighted, including the qualifications of tutors, availability, and interactive online classrooms. Confidence intervals and how to calculate them are also explained in detail.
3) Examples are given to demonstrate how to calculate 95% and 99% confidence intervals for a population mean when the population standard deviation is known or unknown. Interval estimation procedures and when to use z-tests or t-tests are summarized.
This document discusses sampling and sampling distributions. It begins by explaining why sampling is preferable to a census in terms of time, cost and practicality. It then defines the sampling frame as the listing of items that make up the population. Different types of samples are described, including probability and non-probability samples. Probability samples include simple random, systematic, stratified, and cluster samples. Key aspects of each type are defined. The document also discusses sampling distributions and how the distribution of sample statistics such as means and proportions can be approximated as normal even if the population is not normal, due to the central limit theorem. It provides examples of how to calculate probabilities and intervals for sampling distributions.
This document provides an introduction and overview of biostatistics. It defines key biostatistics terms like population, sample, parameter, statistic, quantitative vs. qualitative data, levels of measurement, descriptive vs. inferential biostatistics, and common statistical notations. It also discusses sources of health information and how computerized health management information systems are used to collect, analyze and report data.
v When to Choose a Statistical Tests OR When NOT to Choose? v Parametric vs. Non-Parametric Tests (Comparison)
v Parameters to check when Choosing a Statistical Test:
- Distribution of Data
- Type of data/Variable
- Types of Analysis (What’s the hypothesis)
- No of groups or data-sets
- Data Group Design
v Snapshot of all statistical test and “How” to Choose using above parameters v Explanation using Examples:
- Mann Whitney U Test
- Wilcoxon Sign Rank Test
- Spearman’s co-relation
- Chi-Square Test
v Conclusion
The two major areas of statistics are: descriptive statistics and inferential statistics. In this presentation, the difference between the two are shown including examples.
The document provides an overview of inferential statistics. It defines inferential statistics as making generalizations about a larger population based on a sample. Key topics covered include hypothesis testing, types of hypotheses, significance tests, critical values, p-values, confidence intervals, z-tests, t-tests, ANOVA, chi-square tests, correlation, and linear regression. The document aims to explain these statistical concepts and techniques at a high level.
this ppt gives you adequate information about Karl Pearsonscoefficient correlation and its calculation. its the widely used to calculate a relationship between two variables. The correlation shows a specific value of the degree of a linear relationship between the X and Y variables. it is also called as The Karl Pearson‘s product-moment correlation coefficient. the value of r is alwys lies between -1 to +1. + 0.1 shows Lower degree of +ve correlation, +0.8 shows Higher degree of +ve correlation.-0.1 shows Lower degree of -ve correlation. -0.8 shows Higher degree of -ve correlation.
Basic statistics is the science of collecting, organizing, summarizing, and interpreting data. It allows researchers to gain insights from data through graphical or numerical summaries, regardless of the amount of data. Descriptive statistics can be used to describe single variables through frequencies, percentages, means, and standard deviations. Inferential statistics make inferences about phenomena through hypothesis testing, correlations, and predicting relationships between variables.
The document discusses the approval of the drug AZT to treat AIDS in 1987. It describes how early clinical trials showed AZT significantly reduced deaths among AIDS patients compared to a control group. However, statistical analysis was needed to determine if the results were due to the drug or chance. Statistical tests found the probability the results were due to chance was less than 1 in 1000. Armed with this evidence, the FDA approved AZT after only 21 months of testing.
The document discusses sampling distributions and summarizes key points about the sampling distribution of the mean for both known and unknown population variance. It states that the sampling distribution of the mean has a normal distribution with mean equal to the population mean and variance equal to the population variance divided by the sample size when the population variance is known. When the population variance is unknown, the sampling distribution follows a t-distribution if the population is normally distributed.
This document discusses correlation and the Pearson's coefficient of correlation. It defines correlation as the relationship between two variables, which can be positive, negative, or zero. The Pearson's coefficient of correlation, represented by r, measures the strength and direction of this relationship. The document provides examples of calculating r using the product-moment method for different sets of data. It interprets the resulting r values and discusses advantages and limitations of the product-moment correlation method.
1) The document discusses commonly used statistical tests in research such as descriptive statistics, inferential statistics, hypothesis testing, and tests like t-tests, ANOVA, chi-square tests, and normal distributions.
2) It provides examples of how to determine sample sizes needed for adequate power in hypothesis testing and how to perform t-tests to analyze sample means.
3) Key statistical concepts covered include parameters, statistics, measurement scales, type I and II errors, and interpreting results of hypothesis tests.
The document discusses Spearman's rank correlation coefficient, a nonparametric measure of statistical dependence between two variables. It assumes values between -1 and 1, with -1 indicating a perfect negative correlation and 1 a perfect positive correlation. The steps involve converting values to ranks, calculating the differences between ranks, and determining if there is a statistically significant correlation based on the test statistic and critical values. An example calculates Spearman's rho using rankings of cricket teams in test and one day international matches.
This document discusses sampling distribution about sample mean. It defines key terms like population, sample, sampling units, stratified random sampling, systematic sampling, cluster sampling, probability sampling, non-probability sampling, estimation, estimator, estimate, and sampling distribution. It also discusses the sampling distribution of the sample mean and provides an example to calculate and compare the mean and variance of sample means for sampling with and without replacement.
This document discusses correlation coefficient and different types of correlation. It defines correlation coefficient as the measure of the degree of relationship between two variables. It explains different types of correlation such as perfect positive correlation, perfect negative correlation, moderately positive correlation, moderately negative correlation, and no correlation. It also discusses different methods to study correlation including scatter diagram method, graphic method, Karl Pearson's coefficient of correlation method, and Spearman's rank correlation method. It provides examples and steps to calculate correlation coefficient using these different methods.
This document provides an introduction to key concepts in statistics. It discusses various statistical measures such as measures of central tendency (mean, median, mode), measures of dispersion (range, standard deviation), correlation, and different types of correlation (simple, partial, multiple). It also outlines common statistical methods like scatter diagrams, Karl Pearson's method, and rank correlation method. The role of computer technology in statistics is mentioned.
The document discusses various measures of central tendency used in statistics. The three most common measures are the mean, median, and mode. The mean is the sum of all values divided by the number of values and is affected by outliers. The median is the middle value when data is arranged from lowest to highest. The mode is the most frequently occurring value in a data set. Each measure has advantages and disadvantages depending on the type of data distribution. The mean is the most reliable while the mode can be undefined. In symmetrical distributions, the mean, median and mode are equal, but the mean is higher than the median for positively skewed data and lower for negatively skewed data.
Introduction to Statistics - Basic concepts
- How to be a good doctor - A step in Health promotion
- By Ibrahim A. Abdelhaleem - Zagazig Medical Research Society (ZMRS)
MEASURES OF CENTRAL TENDENCY AND VARIABILITYMariele Brutas
The document introduces focus questions about norms, standards, and how data is analyzed using descriptive statistics. It then provides an overview of different measures used to analyze ungrouped and grouped data, including measures of central tendency (mean, median, mode) and measures of variability (range, standard deviation). Finally, it includes sample pre-assessment questions about these concepts.
Here are the modes for the three examples:
1. The mode is 3. This value occurs most frequently among the number of errors committed by the typists.
2. The mode is 82. This value occurs most frequently among the number of fruits yielded by the mango trees.
3. The mode is 12 and 15. These values occur most frequently among the students' quiz scores.
This document provides an introduction and overview of biostatistics. It defines key biostatistics terms like population, sample, parameter, statistic, quantitative vs. qualitative data, levels of measurement, descriptive vs. inferential biostatistics, and common statistical notations. It also discusses sources of health information and how computerized health management information systems are used to collect, analyze and report data.
v When to Choose a Statistical Tests OR When NOT to Choose? v Parametric vs. Non-Parametric Tests (Comparison)
v Parameters to check when Choosing a Statistical Test:
- Distribution of Data
- Type of data/Variable
- Types of Analysis (What’s the hypothesis)
- No of groups or data-sets
- Data Group Design
v Snapshot of all statistical test and “How” to Choose using above parameters v Explanation using Examples:
- Mann Whitney U Test
- Wilcoxon Sign Rank Test
- Spearman’s co-relation
- Chi-Square Test
v Conclusion
The two major areas of statistics are: descriptive statistics and inferential statistics. In this presentation, the difference between the two are shown including examples.
The document provides an overview of inferential statistics. It defines inferential statistics as making generalizations about a larger population based on a sample. Key topics covered include hypothesis testing, types of hypotheses, significance tests, critical values, p-values, confidence intervals, z-tests, t-tests, ANOVA, chi-square tests, correlation, and linear regression. The document aims to explain these statistical concepts and techniques at a high level.
this ppt gives you adequate information about Karl Pearsonscoefficient correlation and its calculation. its the widely used to calculate a relationship between two variables. The correlation shows a specific value of the degree of a linear relationship between the X and Y variables. it is also called as The Karl Pearson‘s product-moment correlation coefficient. the value of r is alwys lies between -1 to +1. + 0.1 shows Lower degree of +ve correlation, +0.8 shows Higher degree of +ve correlation.-0.1 shows Lower degree of -ve correlation. -0.8 shows Higher degree of -ve correlation.
Basic statistics is the science of collecting, organizing, summarizing, and interpreting data. It allows researchers to gain insights from data through graphical or numerical summaries, regardless of the amount of data. Descriptive statistics can be used to describe single variables through frequencies, percentages, means, and standard deviations. Inferential statistics make inferences about phenomena through hypothesis testing, correlations, and predicting relationships between variables.
The document discusses the approval of the drug AZT to treat AIDS in 1987. It describes how early clinical trials showed AZT significantly reduced deaths among AIDS patients compared to a control group. However, statistical analysis was needed to determine if the results were due to the drug or chance. Statistical tests found the probability the results were due to chance was less than 1 in 1000. Armed with this evidence, the FDA approved AZT after only 21 months of testing.
The document discusses sampling distributions and summarizes key points about the sampling distribution of the mean for both known and unknown population variance. It states that the sampling distribution of the mean has a normal distribution with mean equal to the population mean and variance equal to the population variance divided by the sample size when the population variance is known. When the population variance is unknown, the sampling distribution follows a t-distribution if the population is normally distributed.
This document discusses correlation and the Pearson's coefficient of correlation. It defines correlation as the relationship between two variables, which can be positive, negative, or zero. The Pearson's coefficient of correlation, represented by r, measures the strength and direction of this relationship. The document provides examples of calculating r using the product-moment method for different sets of data. It interprets the resulting r values and discusses advantages and limitations of the product-moment correlation method.
1) The document discusses commonly used statistical tests in research such as descriptive statistics, inferential statistics, hypothesis testing, and tests like t-tests, ANOVA, chi-square tests, and normal distributions.
2) It provides examples of how to determine sample sizes needed for adequate power in hypothesis testing and how to perform t-tests to analyze sample means.
3) Key statistical concepts covered include parameters, statistics, measurement scales, type I and II errors, and interpreting results of hypothesis tests.
The document discusses Spearman's rank correlation coefficient, a nonparametric measure of statistical dependence between two variables. It assumes values between -1 and 1, with -1 indicating a perfect negative correlation and 1 a perfect positive correlation. The steps involve converting values to ranks, calculating the differences between ranks, and determining if there is a statistically significant correlation based on the test statistic and critical values. An example calculates Spearman's rho using rankings of cricket teams in test and one day international matches.
This document discusses sampling distribution about sample mean. It defines key terms like population, sample, sampling units, stratified random sampling, systematic sampling, cluster sampling, probability sampling, non-probability sampling, estimation, estimator, estimate, and sampling distribution. It also discusses the sampling distribution of the sample mean and provides an example to calculate and compare the mean and variance of sample means for sampling with and without replacement.
This document discusses correlation coefficient and different types of correlation. It defines correlation coefficient as the measure of the degree of relationship between two variables. It explains different types of correlation such as perfect positive correlation, perfect negative correlation, moderately positive correlation, moderately negative correlation, and no correlation. It also discusses different methods to study correlation including scatter diagram method, graphic method, Karl Pearson's coefficient of correlation method, and Spearman's rank correlation method. It provides examples and steps to calculate correlation coefficient using these different methods.
This document provides an introduction to key concepts in statistics. It discusses various statistical measures such as measures of central tendency (mean, median, mode), measures of dispersion (range, standard deviation), correlation, and different types of correlation (simple, partial, multiple). It also outlines common statistical methods like scatter diagrams, Karl Pearson's method, and rank correlation method. The role of computer technology in statistics is mentioned.
The document discusses various measures of central tendency used in statistics. The three most common measures are the mean, median, and mode. The mean is the sum of all values divided by the number of values and is affected by outliers. The median is the middle value when data is arranged from lowest to highest. The mode is the most frequently occurring value in a data set. Each measure has advantages and disadvantages depending on the type of data distribution. The mean is the most reliable while the mode can be undefined. In symmetrical distributions, the mean, median and mode are equal, but the mean is higher than the median for positively skewed data and lower for negatively skewed data.
Introduction to Statistics - Basic concepts
- How to be a good doctor - A step in Health promotion
- By Ibrahim A. Abdelhaleem - Zagazig Medical Research Society (ZMRS)
MEASURES OF CENTRAL TENDENCY AND VARIABILITYMariele Brutas
The document introduces focus questions about norms, standards, and how data is analyzed using descriptive statistics. It then provides an overview of different measures used to analyze ungrouped and grouped data, including measures of central tendency (mean, median, mode) and measures of variability (range, standard deviation). Finally, it includes sample pre-assessment questions about these concepts.
Here are the modes for the three examples:
1. The mode is 3. This value occurs most frequently among the number of errors committed by the typists.
2. The mode is 82. This value occurs most frequently among the number of fruits yielded by the mango trees.
3. The mode is 12 and 15. These values occur most frequently among the students' quiz scores.
This document contains 16 multiple choice questions testing statistical concepts. The questions cover topics such as statistical inference, populations and samples, quantitative and qualitative data, types of errors in data collection, graphical representations of data including histograms, scatter plots, and frequency distributions, measures of central tendency and variability including mean, variance and standard deviation, probability, and conditional probability. The correct answers to each question are also provided.
This document discusses measures of central tendency including the mean, median, and mode. It provides examples of calculating the mean from raw data sets and frequency distributions. The median and mode are defined as the middle value and most frequent value, respectively. Methods for calculating each from both types of data are shown. Other measures covered include the midrange and the effects of outliers. Shapes of distributions are discussed including positively and negatively skewed and symmetric. Practice problems are provided to reinforce the concepts.
This document defines and compares various measures of central tendency, including the mean, median, and mode. It provides formulas and examples for calculating the arithmetic mean, geometric mean, harmonic mean, weighted mean, median, and mode. The document also discusses the relationships between the arithmetic mean, geometric mean, and harmonic mean, showing that the harmonic mean will always be less than or equal to the geometric mean, which will always be less than or equal to the arithmetic mean.
Measures of central tendency describe the middle or center of a data set and include the mean, median, and mode. The mean is the average value calculated by adding all values and dividing by the number of values. The median is the middle number in a data set arranged in order. The mode is the value that occurs most frequently. These measures are used to understand the typical or common values in a data set.
The document discusses measures of central tendency such as mean, median, and mode. It provides examples and questions to test understanding of these concepts. Some key points covered include:
- Measures of central tendency summarize data and tend to lie in the center. They include mean, median, and mode.
- The mean is affected by outliers while the median is not. Mean can be determined through sums of deviations while median cannot be determined graphically.
- Questions test understanding of properties of mean, median, and mode as well as how to calculate and apply them in different data scenarios.
This document discusses different measures of central tendency including the mode, median, and mean. It provides definitions and explanations of how to calculate each one. The mode is the most frequently occurring value, the median is the middle value of an ordered data set, and the mean is the average found by summing all values and dividing by the number of values. The document contrasts these measures and discusses which to use based on the measurement level of the data and shape of the distribution. It also provides examples of how to interpret the mean and median when comparing multiple data sets. Homework problems are assigned from the text.
The document discusses various measures of central tendency and dispersion. It defines the arithmetic mean, weighted mean, geometric mean, median, and mode as measures of central tendency. It also discusses calculating these measures from grouped data. For measures of dispersion, it covers range, interquartile range, variance, and standard deviation. It provides formulas to calculate these statistics for both population and sample data.
This document provides an overview of key concepts in descriptive statistics including:
- Parameters describe populations while statistics describe samples
- Measures of central tendency include the mean, median, and mode
- Measures of variation/dispersion include range, variance, standard deviation, and coefficient of variation
- The empirical rule and Chebyshev's theorem describe how data is distributed around the mean
- Z-scores and percentiles relate individual values to the overall distribution
The lesson plan discusses measures of central tendency for ungrouped data. It defines the three measures - mean, median, and mode. The lesson explains how to calculate each measure through examples and formulas. Students will practice finding the mean, median, and mode of various data sets.
Measures of central tendency by maria diza c. febriomariadiza
This document provides instructions for a lesson on measures of central tendency. It begins by explaining that measures of central tendency include the mean, median, and mode, which are ways to describe the central or typical value in a data set. The objectives are given as helping students develop an understanding of these concepts. Examples and calculations are provided to illustrate finding the mean, median, and mode. Exercises in the form of multiple choice questions test the understanding of calculating and identifying the measures of central tendency.
This document discusses different types of graphs used to represent frequency distributions: histograms, frequency polygons, and ogives. It provides examples and instructions for constructing each graph type. Histograms use vertical bars to represent frequencies, frequency polygons connect points plotted for class midpoints, and ogives show cumulative frequencies. The document also discusses relative frequency graphs and common distribution shapes like bell-shaped, uniform, and skewed. It assigns practice constructing different graph types from example data.
This document discusses different types of graphs used to represent frequency distributions: bar graphs, histograms, frequency polygons, pie charts, and OGIVE charts. It provides instructions on how to construct each graph type, including labeling axes, ensuring proportionality, and adding titles and legends. Examples of each graph type are shown using sample data on family sizes. The document concludes that bar graphs, histograms, frequency polygons and pie charts are common ways to show frequency distributions, while OGIVE charts illustrate less than and greater than cumulative frequencies.
This document discusses measures of central tendency including the mean, median, and mode. It defines each measure and provides examples of how to calculate them using data sets. The mean is the average value obtained by dividing the sum of all values by the number of values. The median is the middle value when values are arranged in order. The mode is the most frequent value in the data set. The document outlines advantages and disadvantages of each measure and concludes that measures of central tendency describe the typical or central value in a data set.
The geometric mean is a type of average that indicates the central tendency of a set of numbers using their product, as opposed to the arithmetic mean which uses their sum, and it is calculated by taking the nth root of the product of the numbers. The geometric mean is more appropriate than the arithmetic mean for describing proportional growth and ratios, and it has various applications in fields like optics, signal processing, geometry, and finance.
The document discusses calculating and interpreting measures of central tendency (mean, median, mode) from automobile sales data. It analyzes the data to determine which measure would be most useful for a customer with a Rs. 8 lakh budget and a manufacturer selecting a model. The mode (Rs. 8 lakhs) is identified as the most influential factor for both, as it indicates the most commonly purchased price point and rules out expensive outliers. In conclusion, the mode provides the best indication of which model to feature for target audiences.
Choosing the best measure of central tendencybujols
This document discusses different measures of central tendency (mean, median, mode) and what each indicates. It analyzes what each measure can and cannot determine from a data set. The mean is the average and can be affected by outliers. The median is the middle number and is least affected by outliers. The mode shows the most frequent value. Choosing the best measure depends on whether you want to know the center, consider outliers, or find the most popular value. A worksheet provides examples of choosing the appropriate measure for different data sets.
This document provides an introduction to arithmetic, geometric, and harmonic progressions. It defines each type of progression and provides examples and formulas for calculating terms, sums, and other properties. Key points covered include the characteristics of an arithmetic progression, formulas for finding sums and identifying terms, geometric progressions and their essential components including starting value, ratio, and number of terms, and the definition and reverse relationship between harmonic and arithmetic progressions.
The document discusses various measures of central tendency including the mean, median, and mode. It provides definitions and formulas for calculating the arithmetic mean, weighted arithmetic mean, harmonic mean, and geometric mean. Examples are given to demonstrate calculating each type of mean from both ungrouped and grouped data. The properties, merits, and limitations of each mean are also outlined. Relationship among the different means are explained.
The document discusses various measures of central tendency including the mean, median, and mode. It provides definitions and formulas for calculating each measure, as well as examples of how to calculate the mean for grouped and ungrouped data, the median for grouped and ungrouped data, percentiles, quartiles, and the relationship between the mean, median, and mode. The key information is that measures of central tendency describe the central or typical value in a data set and include the mean, median, and mode.
This document discusses various measures of central tendency used in statistics including the mean, median, and mode. It provides definitions and formulas for calculating each measure. The mean is the average value found by summing all values and dividing by the total count. The median is the middle value when values are arranged from lowest to highest. The mode is the most frequently occurring value. The document also discusses weighted mean, geometric mean, harmonic mean, and compares the properties of each central tendency measure.
Mb0040 statistics for management spring2015_assignment- SMU_MBA-Solved-Assign...pkharb
This document contains a statistics assignment for an MBA program. It includes 6 questions covering topics like distinguishing between classification and tabulation, explaining measures of central tendency, probability, and the chi-square test. For question 1, the student is asked to define classification and tabulation, and explain the structure and components of a table with an example. For question 2, the student must explain arithmetic mean and use it to calculate the mean and standard deviation for two groups of workers combined. The remaining questions involve calculating probabilities, expectations, and determining if a population has decreased based on a chi-square test.
This document provides an overview of key numerical measures used to describe data, including measures of central tendency (mean, median, mode) and measures of dispersion (range, variance, standard deviation). It defines each measure, provides examples of calculating them, and discusses their characteristics, uses, and advantages/disadvantages. The document also covers weighted means, geometric means, Chebyshev's theorem, and calculating measures for grouped data.
The document provides information about statistics for entrepreneurs, including links to download materials on topics such as business statistics, statistical analysis, research methods, forecasting methods, and data smoothing techniques. It also contains examples and solutions to exercises on concepts like moving averages, seasonality, mean, median, mode, range, and standard deviation. The document is intended as a resource for participants in a postgraduate program in social entrepreneurship.
The document provides information and examples to study for the week's quiz on normal distributions and the central limit theorem. Key points to remember include: the total area under the standard normal curve is 1; the mean and variance of any normal distribution do not depend on sample size; and the central limit theorem states that as sample size increases, the sampling distribution of the mean approaches a normal distribution, regardless of the original data's distribution. Examples are provided to practice calculating probabilities and confidence intervals using the normal distribution.
The document discusses measures of central tendency including the mean, median, and mode. It provides definitions and formulas for calculating each measure. For the mean, formulas are given for both raw data and frequency data. The relationships between the mean, median, and mode are explored, including an empirical relationship that can be used to find one value if the other two are known for symmetrical data. The importance of each measure is discussed for different business applications depending on the characteristics of the data.
A General Manger of Harley-Davidson has to decide on the size of a.docxevonnehoggarth79783
A General Manger of Harley-Davidson has to decide on the size of a new facility. The GM has narrowed the choices to two: large facility or small facility. The company has collected information on the payoffs. It now has to decide which option is the best using probability analysis, the decision tree model, and expected monetary value.
Options:
Facility
Demand Options
Probability
Actions
Expected Payoffs
Large
Low Demand
0.4
Do Nothing
($10)
Low Demand
0.4
Reduce Prices
$50
High Demand
0.6
$70
Small
Low Demand
0.4
$40
High Demand
0.6
Do Nothing
$40
High Demand
0.6
Overtime
$50
High Demand
0.6
Expand
$55
Determination of chance probability and respective payoffs:
Build Small:
Low Demand
0.4($40)=$16
High Demand
0.6($55)=$33
Build Large:
Low Demand
0.4($50)=$20
High Demand
0.6($70)=$42
Determination of Expected Value of each alternative
Build Small: $16+$33=$49
Build Large: $20+$42=$62
Click here for the Statistical Terms review sheet.
Submit your conclusion in a Word document to the M4: Assignment 2 Dropbox byWednesday, November 18, 2015.
A General Manger of Harley
-
Davidson has to decide on the size of a new facility. The GM has narrowed
the choices to two: large facility or small facility. The company has collected information on the payoffs. It
now has to decide which option is the best u
sing probability analysis, the decision tree model, and
expected monetary value.
Options:
Facility
Demand
Options
Probability
Actions
Expected
Payoffs
Large
Low Demand
0.4
Do Nothing
($10)
Low Demand
0.4
Reduce Prices
$50
High Demand
0.6
$70
Small
Low Demand
0.4
$40
High Demand
0.6
Do Nothing
$40
High Demand
0.6
Overtime
$50
High Demand
0.6
Expand
$55
A General Manger of Harley-Davidson has to decide on the size of a new facility. The GM has narrowed
the choices to two: large facility or small facility. The company has collected information on the payoffs. It
now has to decide which option is the best using probability analysis, the decision tree model, and
expected monetary value.
Options:
Facility
Demand
Options
Probability Actions
Expected
Payoffs
Large Low Demand 0.4 Do Nothing ($10)
Low Demand 0.4 Reduce Prices $50
High Demand 0.6
$70
Small Low Demand 0.4
$40
High Demand 0.6 Do Nothing $40
High Demand 0.6 Overtime $50
High Demand 0.6 Expand $55
SAMPLING MEAN:
DEFINITION:
The term sampling mean is a statistical term used to describe the properties of statistical distributions. In statistical terms, the sample meanfrom a group of observations is an estimate of the population mean. Given a sample of size n, consider n independent random variables X1, X2... Xn, each corresponding to one randomly selected observation. Each of these variables has the distribution of the population, with mean and standard deviation. The sample mean is defined to be
WHAT IT IS USED FOR:
It is also used to measure central tendency of the numbers in a .
This document discusses measures of central tendency and different methods for calculating averages. It begins by defining central tendency as a single value that represents the characteristics of an entire data set. Three common measures of central tendency are introduced: the mean, median, and mode. The document then focuses on explaining how to calculate the arithmetic mean, or average, including the direct method, shortcut method, and how it applies to discrete and continuous data series. Weighted averages are also covered. In summary, the document provides an overview of key concepts in measures of central tendency and how to calculate various types of averages.
This document discusses summary statistics and measures of central tendency and dispersion. It provides examples of how to calculate the mean, mode, median, and geometric mean of data sets. It also discusses how to calculate the standard deviation and variance of data as measures of dispersion. Regression and correlation analysis are introduced as methods to study the relationship between variables and enable prediction. The least squares approach to determining the linear regression line that best fits a data set is demonstrated through an example.
SAMPLING MEAN DEFINITION The term sampling mean .docxanhlodge
SAMPLING MEAN:
DEFINITION:
The term sampling mean is a statistical term used to describe the properties of statistical
distributions. In statistical terms, the sample mean from a group of observations is an
estimate of the population mean . Given a sample of size n, consider n independent random
variables X1, X2... Xn, each corresponding to one randomly selected observation. Each of these
variables has the distribution of the population, with mean and standard deviation . The
sample mean is defined to be
WHAT IT IS USED FOR:
It is also used to measure central tendency of the numbers in a database. It can also be said that
it is nothing more than a balance point between the number and the low numbers.
HOW TO CALCULATE IT:
To calculate this, just add up all the numbers, then divide by how many numbers there are.
Example: what is the mean of 2, 7, and 9?
Add the numbers: 2 + 7 + 9 = 18
Divide by how many numbers (i.e., we added 3 numbers): 18 ÷ 3 = 6
So the Mean is 6
SAMPLE VARIANCE:
DEFINITION:
The sample variance, s2, is used to calculate how varied a sample is. A sample is a select number
of items taken from a population. For example, if you are measuring American people’s weights,
it wouldn’t be feasible (from either a time or a monetary standpoint) for you to measure the
weights of every person in the population. The solution is to take a sample of the population, say
1000 people, and use that sample size to estimate the actual weights of the whole population.
WHAT IT IS USED FOR:
The sample variance helps you to figure out the spread out in the data you have collected or are
going to analyze. In statistical terminology, it can be defined as the average of the squared
differences from the mean.
HOW TO CALCULATE IT:
Given below are steps of how a sample variance is calculated:
• Determine the mean
• Then for each number: subtract the Mean and square the result
• Then work out the mean of those squared differences.
To work out the mean, add up all the values then divide by the number of data points.
First add up all the values from the previous step.
But how do we say "add them all up" in mathematics? We use the Roman letter Sigma: Σ
The handy Sigma Notation says to sum up as many terms as we want.
• Next we need to divide by the number of data points, which is simply done by
multiplying by "1/N":
Statistically it can be stated by the following:
•
http://www.statisticshowto.com/find-sample-size-statistics/
http://www.mathsisfun.com/algebra/sigma-notation.html
• This value is the variance
EXAMPLE:
Sam has 20 Rose Bushes.
The number of flowers on each bush is
9, 2, 5, 4, 12, 7, 8, 11, 9, 3, 7, 4, 12, 5, 4, 10, 9, 6, 9, 4
Work out the sample variance
Step 1. Work out the mean
In the formula above, μ (the Greek letter "mu") is the mean of all our values.
For this example, the data points are: 9, 2, 5, 4, 12, 7, 8,.
This document provides information about statistics and probability. It defines statistics as the collection, analysis, and interpretation of data. There are two main categories of statistics: descriptive statistics, which summarizes and describes data, and inferential statistics, which is used to estimate, predict, and generalize results. The document also discusses population and sample, measures of central tendency (mean, median, mode), measures of dispersion (range, variance, standard deviation), qualitative vs. quantitative data, ways of representing quantitative data (numerically and graphically), and examples of organizing data using a stem-and-leaf plot.
This document provides an overview of probability, statistics, and their applications in engineering. It defines key probability and statistics concepts like trials, outcomes, random experiments, and frequency distributions. It explains how engineers use statistics and probability to analyze data from tests and experiments to better understand product quality and failure rates. Examples are given of measures of central tendency like mean and median, measures of variation like standard deviation and variance, and the normal distribution curve. Engineering applications include using these analytical techniques to assess results from a class and compare two data histograms.
This document discusses descriptive statistics and their uses in research. Descriptive statistics describe basic features of data through simple summaries, and allow researchers to describe many data points with a few key values. These include distributions to show frequencies of values, measures of central tendency like mean, median and mode, and measures of dispersion like range and standard deviation. The mean is the average value and is calculated by summing all values and dividing by the number of values. The median is the middle value when values are arranged in order. The mode is the most frequent value. Standard deviation measures how spread out values are from the mean.
Mba i qt unit-2_measures of central tendencyRai University
This document discusses various measures of central tendency, including the arithmetic mean. It defines the arithmetic mean as the total of the values divided by the number of values. It describes direct and short-cut methods for calculating the arithmetic mean for both ungrouped and grouped data. Some key properties of the arithmetic mean are that the sum of deviations from the mean is 0, and the sum of squared deviations is minimized at the mean. Weighted arithmetic mean is also discussed, which assigns weights to values before calculating the average.
The modal rating is the rating value that occurs most frequently in the dataset. To find the mode, we would need to analyze the rating frequencies and identify which rating has the highest count. Without access to the actual dataset values and frequencies, I cannot determine the modal rating directly. The mode is a measure of central tendency that is best for identifying the most common or typical value in a dataset.
This document provides an introduction to statistics, including definitions, key concepts, and measures of central tendency. It defines statistics as the collection, presentation, analysis, and interpretation of numerical data. Some important statistical concepts discussed include population, sample, variate, attribute, parameter, and statistic. Measures of central tendency covered are the mean (including weighted mean), median, mode, and measures related to percentiles. Formulas are provided for calculating various averages from both discrete and continuous data distributions.
PG STAT 531 Lecture 2 Descriptive statisticsAashish Patel
This document provides an overview of descriptive statistics. It discusses that descriptive statistics are used to describe basic features of data through simple summaries, without drawing inferences. The document outlines various measures of central tendency like mean, median and mode. It also discusses measures of dispersion such as range, variance and standard deviation that describe how spread out the data is. The key purpose of descriptive statistics is to present quantitative data in a simplified and manageable form.
The document discusses various definitions and concepts related to marketing. It defines marketing as the process of planning and executing the conception, pricing, promotion, and distribution of ideas, goods, and services to create exchanges that satisfy individual and organizational goals. It also discusses the evolution of marketing from a production orientation to a selling orientation to today's customer-centric marketing orientation. The core concepts of marketing discussed include needs, products, exchange, customers, and value.
The document discusses various topics related to products and services, including:
- Definitions of products, goods, services and experiences. Products satisfy needs and wants.
- Classification of consumer and industrial products. Consumer products are for personal use and classified by purchase patterns. Industrial products are for business use.
- Levels of products including core benefits, actual products, and augmented products.
- Product mix dimensions of width, length, depth and consistency.
- Types of product decisions around product lines, branding, packaging and labeling. Strong brands create loyalty and value. Packaging and labels identify and promote products.
- Nature of services being intangible, inseparable from the provider, and
The document outlines 22 immutable laws of branding, including:
1) Brands become stronger when they narrow their focus on a single word or concept.
2) Publicity, not just advertising, is important for launching a new brand.
3) Once established, brands need consistent advertising to stay strong in consumers' minds.
4) Brands should strive to own a word or concept in consumers' minds to become synonymous with that idea.
5) Brands are built over decades with consistency, not just years, to truly connect with consumers.
The document discusses different strategies for leveraging existing brand equity through extensions, including line extensions, brand extensions, sub-branding, and co-branding.
It provides examples of each type of extension and discusses factors to consider when deciding on an extension strategy, such as whether to use an existing brand or introduce a new brand. Risks of line extensions are also outlined, such as line confusion, weakened brand loyalty, and strained relations with trade partners.
The document discusses brand equity and customer-based brand equity (CBBE). It defines brand equity as the measurable financial value and brand assets/liabilities that accrue to a product/service. CBBE is the differential effect brand knowledge has on consumer response to marketing. The key elements of CBBE are the differential effect, brand knowledge, and consumer response. The document outlines the CBBE pyramid and sources/sub-dimensions of brand equity like brand awareness, image, associations, and relationships. It also discusses tools for building brand equity over time and across market segments.
Brands face new challenges as consumers demand simplification and risk reduction. Effective brand management is difficult due to savvy consumers, increased competition, and changing marketing tools. Marketers must have a deep understanding of customers and brands, and how the relationship between the two. The top 11 brand challenges include having a reactive approach, lack of accountability, inability to conclude initiatives, and unrealistic expectations. Strong brands provide clarity, consistency, and constancy - always communicating their unique value to customers.
2. Elemental Economics - Mineral demand.pdfNeal Brewster
After this second you should be able to: Explain the main determinants of demand for any mineral product, and their relative importance; recognise and explain how demand for any product is likely to change with economic activity; recognise and explain the roles of technology and relative prices in influencing demand; be able to explain the differences between the rates of growth of demand for different products.
Vicinity Jobs’ data includes more than three million 2023 OJPs and thousands of skills. Most skills appear in less than 0.02% of job postings, so most postings rely on a small subset of commonly used terms, like teamwork.
Laura Adkins-Hackett, Economist, LMIC, and Sukriti Trehan, Data Scientist, LMIC, presented their research exploring trends in the skills listed in OJPs to develop a deeper understanding of in-demand skills. This research project uses pointwise mutual information and other methods to extract more information about common skills from the relationships between skills, occupations and regions.
BONKMILLON Unleashes Its Bonkers Potential on Solana.pdfcoingabbar
Introducing BONKMILLON - The Most Bonkers Meme Coin Yet
Let's be real for a second – the world of meme coins can feel like a bit of a circus at times. Every other day, there's a new token promising to take you "to the moon" or offering some groundbreaking utility that'll change the game forever. But how many of them actually deliver on that hype?
Seminar: Gender Board Diversity through Ownership NetworksGRAPE
Seminar on gender diversity spillovers through ownership networks at FAME|GRAPE. Presenting novel research. Studies in economics and management using econometrics methods.
Understanding how timely GST payments influence a lender's decision to approve loans, this topic explores the correlation between GST compliance and creditworthiness. It highlights how consistent GST payments can enhance a business's financial credibility, potentially leading to higher chances of loan approval.
Falcon stands out as a top-tier P2P Invoice Discounting platform in India, bridging esteemed blue-chip companies and eager investors. Our goal is to transform the investment landscape in India by establishing a comprehensive destination for borrowers and investors with diverse profiles and needs, all while minimizing risk. What sets Falcon apart is the elimination of intermediaries such as commercial banks and depository institutions, allowing investors to enjoy higher yields.
Economic Risk Factor Update: June 2024 [SlideShare]Commonwealth
May’s reports showed signs of continued economic growth, said Sam Millette, director, fixed income, in his latest Economic Risk Factor Update.
For more market updates, subscribe to The Independent Market Observer at https://blog.commonwealth.com/independent-market-observer.
2. Concept of Central TendencyConcept of Central Tendency
A measure of central tendency is a typicalA measure of central tendency is a typical
value around which other figures congregatevalue around which other figures congregate
- Simpson & Kalfa- Simpson & Kalfa
OROR
An average is a single value which is used toAn average is a single value which is used to
represent all of the values in the series.represent all of the values in the series.
3. Measures of Central Tendency
Mean (mathematical
average)
Median (positional
average)
Mode (positional
average)
Arithmetic Mean Geometric mean Harmonic mean
Simple Arithmetic Mean
Weighted Arithmetic Mean
Mean of Composite Group
4. BasicsBasics
• MeanMean AverageAverage
• MedianMedian Mid positional valueMid positional value
• ModeMode Most frequentlyMost frequently
occurring valueoccurring value
44
7. ExampleExample
For the above data, the A.M. is
2717 + 2796 +…… 4645+….. + 5424 + ….+ 6874
= --------------------------------------------------------------------------
20
= Rs. 4565.4 Millions
x
8. Arithmetic MeanArithmetic Mean
Grouped DataGrouped Data
N=N= = Total frequency= Total frequency
Here, xHere, xii is the mid value of the class interval.is the mid value of the class interval.
∑
∑=
i
i
f
f ix
x
∑=
n
i
if
1
9. exampleexample
Calculate arithmeticCalculate arithmetic
mean from themean from the
following frequencyfollowing frequency
distribution of marks atdistribution of marks at
a test in statistics.a test in statistics.
MarksMarks No.ofNo.of
studentsstudents
2525 22
3030 33
3535 44
4040 88
4545 99
5050 44
5555 33
6060 22
10. The details of the monthly salary of 100The details of the monthly salary of 100
employees of a firm are given below:employees of a firm are given below:
Monthly salary (in Rs.) No. of employees
1000 18
1500 26
2000 31
2500 16
3000 5
5000 4
11. In grouped data, the middle value of each group is theIn grouped data, the middle value of each group is the
representative of the group bz when the data arerepresentative of the group bz when the data are
grouped, the exact frequency with which each valuegrouped, the exact frequency with which each value
of the variable occurs in the distribution is unknown.of the variable occurs in the distribution is unknown.
We only know the limits within which a certainWe only know the limits within which a certain
number of frequencies occur.number of frequencies occur.
So, we make an assumption that the frequenciesSo, we make an assumption that the frequencies
within each class are distributed uniformly over thewithin each class are distributed uniformly over the
range of class interval.range of class interval.
12. ExampleExample
A company manufactures polythene bags. TheA company manufactures polythene bags. The
bags are evaluated on the basis of theirbags are evaluated on the basis of their
strength by buyers. The strength depends onstrength by buyers. The strength depends on
their bursting pressures. The following datatheir bursting pressures. The following data
relates to the bursting pressure recorded in arelates to the bursting pressure recorded in a
sample of 90 bags. Find the average burstingsample of 90 bags. Find the average bursting
pressure.pressure.
13. exampleexample
Bursting
pressure
( 1 )
No. of
bags
( fi
) ( 2 )
Mid Value of
Class Interval
( xi
) ( 3 )
Fi
xi
( 4 )
Col.(4) = Col.(2) x Col.
(3)
5-10 10 7.5 75
10-15 15 12.5 187.5
15-20 20 17.5 350
20-25 25 22.5 562.5
25-30 20 27.5 550
Sum Σ fi
=90 Σ fi
xi =1725
14. values of Σ fi
and Σ fi
xi
, in formula
= 1725/90
= 19.17
∑
∑=
i
i
f
f ix
x
15. EXAMPLE (short cut method)EXAMPLE (short cut method)
Calculate the mean ofCalculate the mean of
the followingthe following
distribution ofdistribution of
monthly wages ofmonthly wages of
workers in a factory :workers in a factory :
MonthlyMonthly
wages(inwages(in
Rs.)Rs.)
No. ofNo. of
workersworkers
100-120100-120 1010
120-140120-140 2020
140-160140-160 3030
160-180160-180 1515
180-200180-200 55
16. The followingThe following
frequency distributionfrequency distribution
represents the timerepresents the time
taken in seconds totaken in seconds to
serve customers at a fastserve customers at a fast
food take away.food take away.
Calculate the mean timeCalculate the mean time
taken by to servetaken by to serve
customerscustomers
Time taken (in
seconds)
frequencies
40-60 6
60-80 12
80-100 15
100-120 12
120-140 10
140-160 5
17. Weighted Arithmetic Mean
It takes into account the importance of eachIt takes into account the importance of each
value to the overall data with the help of thevalue to the overall data with the help of the
weights.weights.
Frequency i.e. the no. of occurrence indicatesFrequency i.e. the no. of occurrence indicates
the relative importance of a particular data in athe relative importance of a particular data in a
group of observations.group of observations.
Used in case the relative importance of eachUsed in case the relative importance of each
observation differs or when rates, percentagesobservation differs or when rates, percentages
or ratios are being averaged.or ratios are being averaged.
18. The weighted AM of the n observations:The weighted AM of the n observations:
AM is considered to be the best measure of centralAM is considered to be the best measure of central
tendency as its computation is based on each andtendency as its computation is based on each and
every observation.every observation.
∑
∑=
wi
wi ix
x
19. ExampleExample
5 students of a B.Sc. (Hons)5 students of a B.Sc. (Hons)
course are marked by using thecourse are marked by using the
following weighing scheme :following weighing scheme :
Mid-term = 20%Mid-term = 20%
Project = 10%Project = 10%
Attendance = 10%Attendance = 10%
Final Exam = 60%Final Exam = 60%
Calculate the average marks in theCalculate the average marks in the
examination.examination.
Marks of the students in variousMarks of the students in various
components are:components are:
StudStud
entent
midtmidt
ermerm
ProjeProje
ctct
AttenAtten
dncednce
FinFin
alal
11 6565 7070 8080 8080
22 4848 5858 5454 6060
33 5858 6363 6565 5050
44 5858 7070 5454 6060
55 6060 6565 7070 7070
20. A professor is interested inA professor is interested in
ranking the following fiveranking the following five
students in the order of merit onstudents in the order of merit on
the basis of data given below:the basis of data given below:
Attendance average will count forAttendance average will count for
20% of a student’s grade; the20% of a student’s grade; the
homework 25%; assignment 35%;homework 25%; assignment 35%;
midterm examination 10% andmidterm examination 10% and
final examination 10%. Whatfinal examination 10%. What
would be the students ranking.would be the students ranking.
Stud
ent
Atte
nda
nce
Hom
ewo
rk
Assi
gnm
ent
Midt
erm
final
A 85 89 94 87 90
B 78 84 88 91 92
C 94 88 93 86 89
D 82 79 88 84 93
E 95 90 92 82 88
21. Mean of composite groupMean of composite group
If two groups contain respectively, nIf two groups contain respectively, n11 and nand n22
observations with mean Xobservations with mean X11 and Xand X22, then the, then the
combined mean (X) of the combined group of ncombined mean (X) of the combined group of n11+n+n22
observations is given by :observations is given by :
21
2211
12
nn
XnXn
X
+
+
=
22. ExampleExample
There are two branches of a companyThere are two branches of a company
employing 100 and 80 employeesemploying 100 and 80 employees
respectively. If arithmetic means of therespectively. If arithmetic means of the
monthly salaries paid by two branches aremonthly salaries paid by two branches are
Rs. 4570 and Rs. 6750 respectively, find theRs. 4570 and Rs. 6750 respectively, find the
A.M. of the salaries of the employees of theA.M. of the salaries of the employees of the
company as a whole.company as a whole.
23. A factory has 3 shifts :- Morning, evening andA factory has 3 shifts :- Morning, evening and
night shift. The morning shift has 200 workers,night shift. The morning shift has 200 workers,
the evening shift has 150 workers and nightthe evening shift has 150 workers and night
shift has 100 workers. The mean wage of theshift has 100 workers. The mean wage of the
morning shift workers is Rs. 200, the eveningmorning shift workers is Rs. 200, the evening
shift workers is Rs. 180 and the overall meanshift workers is Rs. 180 and the overall mean
of the workers is Rs. 160. Find the mean wageof the workers is Rs. 160. Find the mean wage
of the night shift workers.of the night shift workers.
24. Properties of A.M.Properties of A.M.
If a constant amount is added or subtracted from each value inIf a constant amount is added or subtracted from each value in
the series, mean is also added or subtracted by the samethe series, mean is also added or subtracted by the same
constant amount. E.g. Consider the values 3,5,9,15,16constant amount. E.g. Consider the values 3,5,9,15,16
A.M. = 9.6A.M. = 9.6
If 2 is added to each value, thenIf 2 is added to each value, then A.M. = 11.6 = 9.6 + 2.A.M. = 11.6 = 9.6 + 2.
Thus, mean is also added by 2.Thus, mean is also added by 2.
Sum of the deviations of a set of observations say x1, x2, , xnSum of the deviations of a set of observations say x1, x2, , xn
from their mean is equal to zero.from their mean is equal to zero.
A.M. is dependent on both change in origin and scale.A.M. is dependent on both change in origin and scale.
The sum of the squares of the deviations of a set ofThe sum of the squares of the deviations of a set of
observation from any number say A is least when A is X.observation from any number say A is least when A is X.
25. Merits and demerits ofMerits and demerits of
Arithmetic Mean
Advantages Disadvantages
(i) Easy to understand and
calculate
(ii) Makes use of full data
(iii) Based upon all the
observations.
(i ) Unduly influenced by extreme
values
(ii) Cannot be
calculated from the data with
open-end class.e.g. below 10
or above 90
(iii) It cannot be obtained if a single
observation is missing.
(iv) It cannot be used if we are
dealing with qualitative
characteristics which cannot be
measured quantitatively;
intelligence, honesty, beauty
26. Harmonic MeanHarmonic Mean
The harmonic mean (H.M.) is defined as the reciprocal
of the arithmetic mean of the reciprocals of the
observations.
For example, if x1
and x2
are two observations, then the
arithmetic means of their reciprocals viz 1/x1
and 1/ x2
is
{(1 / x1
) + (1 / x2
)} / 2
= (x2
+ x1
) / 2 x1
x2
The reciprocal of this arithmetic mean is 2 x1
x2
/ (x2
+
x1
). This is called the harmonic mean.
Thus the harmonic mean of two observations x1
and x2
is 2 x1
x2
-----------------
27. In general, for the set of n observations XIn general, for the set of n observations X11,X,X22……..X……..Xnn,,
HM is given by :HM is given by :
And for the same set of observations with frequenciesAnd for the same set of observations with frequencies
ff11,f,f22……..f……..fnn, HM is given by:, HM is given by:
∑
=
ix
n
HM
1
∑
=
i
i
x
f
n
HM
28. HM gives the largest weight to the smallestHM gives the largest weight to the smallest
item and the smallest weight of the largestitem and the smallest weight of the largest
itemitem
If each observation is divided by a constant, KIf each observation is divided by a constant, K
then HM is also divided by the same constant.then HM is also divided by the same constant.
If each observation is multiplied by a constant,If each observation is multiplied by a constant,
K then HM is also multiplied by the sameK then HM is also multiplied by the same
constant.constant.
It is used in averaging speed, price of articles.It is used in averaging speed, price of articles.
29. If time varies w.r.t. a fixed distance then HM determines theIf time varies w.r.t. a fixed distance then HM determines the
average speed.average speed.
If distance varies w.r.t. a fixed time then AM determines theIf distance varies w.r.t. a fixed time then AM determines the
average speed.average speed.
EXAMPLE : If a man moves along the sides of a square withEXAMPLE : If a man moves along the sides of a square with
speed v1, v2, v3, v4 km/hr, the average speed for the wholespeed v1, v2, v3, v4 km/hr, the average speed for the whole
journey =journey = 44
(1/v1)+(1/v2)+(1/v3)+(1/v4)(1/v1)+(1/v2)+(1/v3)+(1/v4)
30. EXAMPLEEXAMPLE
In a certain factory a unit of work isIn a certain factory a unit of work is
completed by A in 4 min, by B in 5 min, bycompleted by A in 4 min, by B in 5 min, by
C in 6 min, by D in 10 min, and by E in 12C in 6 min, by D in 10 min, and by E in 12
minutes.minutes.
What is the average no. of units of workWhat is the average no. of units of work
completed per minute?completed per minute?
31. ExampleExample
The profit earned by 19The profit earned by 19
companies is givencompanies is given
below:below:
calculate the HM ofcalculate the HM of
profit earned.profit earned.
ProfitProfit
(lakhs)(lakhs)
No. ofNo. of
companiescompanies
20-2520-25 44
25-3025-30 77
30-3530-35 44
35-4035-40 44
32. Geometric MeanGeometric Mean
Neither mean, median or mode is the appropriate average in
calculating the average % rate of change over time. For this G.M. is
used.
The Geometric Mean ( G. M.) of a series of observations with x1
, x2
,
x3
, ……..,xn
is defined as the nth
root of the product of these
values . Mathematically
G.M. = { ( x1
)( x2
)( x3
)…………….(xn
) } (1/ n )
It may be noted that the G.M. cannot be defined if any value of x is
zero as the whole product of various values becomes zero.
33. When the no. of observation is three or more then to simplifyWhen the no. of observation is three or more then to simplify
the calculations logarithms are used.the calculations logarithms are used.
log G.M. = log X1 + log X2 + ……+ log Xnlog G.M. = log X1 + log X2 + ……+ log Xn
NN
G.M. = antilog (log X1 + log X2 + ……+ log Xn)G.M. = antilog (log X1 + log X2 + ……+ log Xn)
NN
For grouped data,For grouped data,
G.M. = antilog (f1log X1 + f2log X2 + ……+ fnlog Xn)G.M. = antilog (f1log X1 + f2log X2 + ……+ fnlog Xn)
NN
34. Geometric meanGeometric mean
GM is often used to calculate the rate ofGM is often used to calculate the rate of
change of population growth.change of population growth.
GM is also useful in averaging ratios, rates andGM is also useful in averaging ratios, rates and
percentages.percentages.
35. EXAMPLEEXAMPLE
A machinery is assumed to depreciate 44% in value in firstA machinery is assumed to depreciate 44% in value in first
year, 15% in second year and 10% in next three years, eachyear, 15% in second year and 10% in next three years, each
percentage being calculated on diminishing value. What ispercentage being calculated on diminishing value. What is
the average % of depreciation for the entire period?the average % of depreciation for the entire period?
Compared to the previous year the overhead expenses wentCompared to the previous year the overhead expenses went
up by 32% in 2002; they increased by 40% in the next yearup by 32% in 2002; they increased by 40% in the next year
and by 50% in the following year. Calculate the average rateand by 50% in the following year. Calculate the average rate
of increase in the overhead expenses over the three years.of increase in the overhead expenses over the three years.
36. ExampleExample
The annual rate of growth for a factory for 5 years isThe annual rate of growth for a factory for 5 years is
7%,8%,4%,6%,10%respectively.What is the average7%,8%,4%,6%,10%respectively.What is the average
rate of growth per annum for this period.rate of growth per annum for this period.
The price of the commodity increased by 8% from 1993The price of the commodity increased by 8% from 1993
to 1994,12%from 1994 to 1995 and 76% from 1995 toto 1994,12%from 1994 to 1995 and 76% from 1995 to
1996.the average price increase from 1993 to 1996 is1996.the average price increase from 1993 to 1996 is
quoted as 28.64% and not 32%.Explain and verify thequoted as 28.64% and not 32%.Explain and verify the
result.result.
3636
37. Combined G.M. of Two Sets ofCombined G.M. of Two Sets of
DataData
If G1
& G2
are the Geometric means of two sets
of observations of sizes n1 and n2, then the
combined Geometric mean, say G, of the
combined series is given by :
n1
log G1
+ n2
log G2
log G = -------------------------------
n1
+ n2
38. ExampleExample
The GM of two series of sizes 10 and 12 areThe GM of two series of sizes 10 and 12 are
12.5 and 10 respectively. Find the combined12.5 and 10 respectively. Find the combined
GM of the 22 observations.GM of the 22 observations.
39. Combined G.M. of Two Sets ofCombined G.M. of Two Sets of
DataData
10 log 12.5 + 12log 10
log G = -------------------------------
10 + 12
22.9691
= ------------ = 1.04405
22
Therefore,
G = antilog 1.04405 = x
Thus the combined average rate of growth for the period of 22
years is x%.
40. Relationship Among A.M. G.M. andRelationship Among A.M. G.M. and
H.M.H.M.
The relationships among the magnitudes of the three
types of Means calculated from the same data are as
follows:
(i) H.M. ≤ G.M. ≤ A.M.
i.e. the arithmetic mean is greater than or equal
to the geometric which is greater than or equal to the
harmonic mean.
( ii ) G.M. =
i.e. geometric mean is the square root of the product of
arithmetic mean and harmonic mean.
( iii) H.M. = ( G.M.) 2
/ A .M.
... MHMA *
41. medianmedian
It is aIt is a positional average.positional average.
Middle most value of the distribution, which divides theMiddle most value of the distribution, which divides the
distribution into two equal parts.distribution into two equal parts.
whenever there are some extreme values in the data,whenever there are some extreme values in the data,
calculation of A.M. is not desirable.calculation of A.M. is not desirable.
Further, whenever, exact values of some observationsFurther, whenever, exact values of some observations
are not available, A.M. cannot be calculated.are not available, A.M. cannot be calculated.
In both the situations, another measure of location calledIn both the situations, another measure of location called
Median is used.Median is used.
42. The median divides the data into two parts such that theThe median divides the data into two parts such that the
number of observations less than the median are equal to thenumber of observations less than the median are equal to the
number of observations more than it.number of observations more than it.
This property makes median very useful measure when theThis property makes median very useful measure when the
data is skewed like income distribution amongdata is skewed like income distribution among
persons/households, marks obtained in competitivepersons/households, marks obtained in competitive
examinations like that for admission to Engineering / Medicalexaminations like that for admission to Engineering / Medical
Colleges, etc.Colleges, etc.
Before median calculation, the arrangement of the data inBefore median calculation, the arrangement of the data in
either ascending or descending order is a must.either ascending or descending order is a must.
43. Graphical Method of Finding theGraphical Method of Finding the
MedianMedian
If we draw both the ogives viz. “Less Than “ and “If we draw both the ogives viz. “Less Than “ and “
More Than”, for a data, then the point of intersectionMore Than”, for a data, then the point of intersection
of the two ogives is the Median.of the two ogives is the Median.
0
5
10
15
20
25
Median
Less Than Ogive
More Than Ogive
44. Computation of MedianComputation of Median
For a simple series :For a simple series :
step 1 : arrange obs. in ascending or descending orderstep 1 : arrange obs. in ascending or descending order
step 2 : let n = no. of obs.step 2 : let n = no. of obs.
(i) n is odd(i) n is odd
median =median = observationobservation
(ii) n is even(ii) n is even
median = mean ofmedian = mean of andand obs.obs.thn
)
2
( thn
)1
2
( +
thn
)
2
1
(
+
45. For ungrouped frequency distributionFor ungrouped frequency distribution
Step 1 : calculate the cumulative frequenciesStep 1 : calculate the cumulative frequencies
Step 2 : N=Step 2 : N=
(i) N is odd(i) N is odd
median = size ofmedian = size of observationobservation
(ii) N is even(ii) N is even
median = mean of the sizes ofmedian = mean of the sizes of andand
obs.obs.
∑ if
thN
)
2
1
(
+
thN
)
2
(
thN
)1
2
( +
46. Median - Ungrouped DataMedian - Ungrouped Data
First the data is arranged in ascending/descending order.
In the earlier example relating to equity holdings data of 20 billionaires given in
Table 4.1, the data is arranged as per ascending order as follows
2717 2796 3098 3144 3527 3534
3862 4187 4310 4506 4745 4784 4923
5034 5071 5424 5561 6505 6707 6874
Here, the number of observations is 20, and therefore there is no middle
observation. However, the two middle most observations are 10th
and 11th
. The
values are 4506 and 4745. Therefore, the median is their average.
4506 + 4745 9251
Median = ----------------- = -----------
2 2
= 4625.5
Thus, the median equity holdings of the 20 billionaires is Rs.4625.5 Millions.
47. ExampleExample
Obtain the median forObtain the median for
the following frequencythe following frequency
distribution:distribution:
Obs.Obs. ff
11 88
22 1010
33 1111
44 1616
55 2020
66 2525
77 1515
88 99
99 66
48. Median – Grouped frequencyMedian – Grouped frequency
distributiondistribution
The median for the grouped frequency distribution is also defined as
the class corresponding to the c.f. just greater than N/2, and is
calculated from the following formula:
( (N/2) –fc
)
Median = Lm
+ ----------------- × hm
fm
where,
•Lm
is the lower limit of 'the median class internal
•fm
is the frequency of the median class interval
•fc
is the cumulative frequency of the class preceding the median class
•hm
is the width of the median class-interval
•N is the number of total frequency
50. Here, N=43, N/2 = 21.5
Cumulative frequency just >21.5 is 28
So, median class is 4000-5000
Further,
Lm
= 4000
fm
= 20
fc
= 8
wm
= 1000
Therefore,
(21.5 –8) x 1000
Median = 4000 + -------------------------
20
= 4000 + 675
= 4675
Thus,median wages is Rs. 4675
51. EXAMPLEEXAMPLE
Calculate the medianCalculate the median
from the followingfrom the following
data pertaining to thedata pertaining to the
profits (in crore Rs.)profits (in crore Rs.)
of 125 companies:of 125 companies:
ProfitsProfits No. of companiesNo. of companies
Less than 10Less than 10 44
Less than 20Less than 20 1616
Less than 30Less than 30 4040
Less than 40Less than 40 7676
Less than 50Less than 50 9696
Less than 60Less than 60 112112
Less than 70Less than 70 120120
Less than 80Less than 80 125125
52. Median
Advantages Disadvantages
(i) Simple to understand
(ii) Extreme values do not have
any impact
(iii) Can be calculated even if
values of all observations are not
known or data has open-end
class intervals
(iv) Used for measuring qualities
and factors which are not
quantifiable like to find the
average intelligence among a
group of people.
(v) Can be approximately
determined with the help of a
graph (ogives)
(i) In case of even number of obs.
Median cannot be
determined exactly.
(ii) Not amenable for
mathematical calculations
(iii) No based on all the
observations.
53. ModeMode
Mode is the value which occurs most frequently in aMode is the value which occurs most frequently in a
set of obs. And around which the other items of the setset of obs. And around which the other items of the set
cluster densely.cluster densely.
In other words, it is the value which occurs withIn other words, it is the value which occurs with
maximum frequency.maximum frequency.
distributiondistribution
Uni-modalUni-modal BimodalBimodal MultimodalMultimodal
(single mode)(single mode) (two modal value)(two modal value) (more than(more than
two mode)two mode)
54. Mode is more favourable than Mean/MedianMode is more favourable than Mean/Median
whenever we need to determine the mostwhenever we need to determine the most
typical size or value.typical size or value.
Eg: Most common size of shoes, mostEg: Most common size of shoes, most
commonly purchased vehiclecommonly purchased vehicle
55. Determination of ModeDetermination of Mode
By InspectionBy Inspection
By method of GroupingBy method of Grouping
By Interpolation FormulaBy Interpolation Formula
By Inspection : When the FD is regular, we can determineBy Inspection : When the FD is regular, we can determine
mode just by inspection.mode just by inspection.
Ex:Ex:
– Since the FD in this example is fairly regular, therefore,Since the FD in this example is fairly regular, therefore,
– mode of this distribution is = 5.mode of this distribution is = 5.
X 1 2 3 4 5 6 7 8 9 10
F 4 6 8 12 17 11 7 6 5 3
56. Given the following series, determine theGiven the following series, determine the
modal value:modal value:
4141 4242 4545 4444 4545 4848 5050 4545 4747
5151 5656
57. By method of GroupingBy method of Grouping
This method is used when the FD is not regular.This method is used when the FD is not regular.
We can understand this through the followingWe can understand this through the following
example,example,
In this variate we notice following things. 1.In this variate we notice following things. 1.
Irregular FD (Sudden rise from 20 to 100)Irregular FD (Sudden rise from 20 to 100)
Mode in this case, therefore, cannot beMode in this case, therefore, cannot be
obtained by Inspection.obtained by Inspection.
X 10 11 12 13 14 15 16 17 18 19
F 8 15 20 100 98 95 90 75 50 30
58. To obtain mode in this example, we haveTo obtain mode in this example, we have
to write the following Grouping table,to write the following Grouping table,
X F(1) F(2) F(3) F(4) F(5) F(6)
10 8
23 4311 15
35
135
12 20
120
218
13 100
198
293
14 98
193
28315 95
185
260
16 90
165
215
17 75
125
155
18 50
8019 30
59. The highest frequency total in each of the 6The highest frequency total in each of the 6
columns of the previous table is identified andcolumns of the previous table is identified and
analyzed in the following table,analyzed in the following table,
Since the values 14 and 15 occur max. no. ofSince the values 14 and 15 occur max. no. of
times, therefore the mode is ill defined.times, therefore the mode is ill defined.
Columns 10 11 12 13 14 15 16 17 18 19
1 1
2 1 1
3 1 1
4 1 1 1
5 1 1 1
6 1 1 1
Total 0 0 0 3 4 4 2 1 0 0
60. The following table gives the measurements ofThe following table gives the measurements of
collar sizes of 230 students in a university.collar sizes of 230 students in a university.
Determine the modal size of the collarDetermine the modal size of the collar
Collar
sizes(c
m)
32 33 34 35 36 37 38 39 40 41
No. of
student
s
7 14 30 28 35 34 16 14 36 16
61. ModeMode
6161
Mode by Interpolation
fm
- f0
Mode = Lm
+ ----------------- × i
2 fm
- f0
- f2
where ,
Lm
is the lower point of the modal class interval
fm
is the frequency of the modal class interval
f0
is the frequency of the interval just before the modal interval
f2
is the frequency of the interval just after the modal interval
i is the width of the modal class interval
62. Equity Holding DataEquity Holding Data
6262
Class Interval Frequency Cumulative
frequency
2000-3000 2 2
3000-4000 5 7
4000-5000 6 13
5000-6000 4 17
6000-70000 3 20
63. Equity Holding DataEquity Holding Data
the modal interval i.e., the class interval with the the modal interval i.e., the class interval with the
maximum frequency (6) is 4000 to 5000. Further,maximum frequency (6) is 4000 to 5000. Further,
LLmm = = 40004000
h h = = 10001000
ffmm = = 66
ff00 = = 55
ff22 == 44
ThereforeTherefore
6363
64. Equity Holding DataEquity Holding Data
( 6 – 5( 6 – 5 ))
Mode = Mode = 4000 + -------------------- 4000 + -------------------- ×× 1000 1000
2 2 ×× 6 – 5 – 4 6 – 5 – 4
= = 4000 + (1/3)4000 + (1/3)×× 1000 1000
= = 4000 + 333.34000 + 333.3
= = 4333.34333.3
Thus the modal equity holdings of the billionaires is Rs. Thus the modal equity holdings of the billionaires is Rs.
4333.3 Millions.4333.3 Millions.
6464
65. Example : The FD of marks obtained byExample : The FD of marks obtained by
60 students of a class is given below :60 students of a class is given below :
Calculate mode :Calculate mode :
Marks 30-34 35-39 40-44 45-49 50-54 55-59 60-64
Frequency 3 5 12 18 14 6 2
67. Empirical Relationship amongEmpirical Relationship among
Mean, Median and ModeMean, Median and Mode
In a moderately skewed distributions, it is found that the following In a moderately skewed distributions, it is found that the following
relationship, generally, holds good :relationship, generally, holds good :
Mean – Mode Mean – Mode = 3 (Mean – Median)= 3 (Mean – Median)
From the above relationship between, Mean, Median and Mode, if the From the above relationship between, Mean, Median and Mode, if the
values of two of these are given, the value of third measure can be values of two of these are given, the value of third measure can be
found out found out
For a symmetrical distribution,For a symmetrical distribution,
Mean = Median = ModeMean = Median = Mode
68. • Example : In a moderately symmetrical distribution –Example : In a moderately symmetrical distribution –
The mode and median are 75 and 60 respectively.The mode and median are 75 and 60 respectively.
Find Mean.Find Mean.
• Solution : Using the empirical relation betweenSolution : Using the empirical relation between
Mean/Median/Mode, we can writeMean/Median/Mode, we can write
Mean – Mode = 3 (Mean – Median)Mean – Mode = 3 (Mean – Median)
0R0R
2 Mean = 3 Median – Mode2 Mean = 3 Median – Mode
Hence,Hence,
Mean = [3 Median – Mode] / 2 = [3(60) – 75] / 2 = 52.5Mean = [3 Median – Mode] / 2 = [3(60) – 75] / 2 = 52.5
71. Mode
7171
Advantages Disadvantages
(i) Simple to understand
(ii) Extreme values do not have
any impact
(iii) Can be calculated when we
want to compare the consumer
prefernces for products.
(iv) It can be detected at just a
mere look at the graph.
(i) It is not based on all the
observations.
(ii) Incase of two,three or many
modal values data becomes
difficult to compare and
interpret.
(iii) It is not suitable for algebraic
treatment.
73. QuartilesQuartiles
data Q1
and Q3
are defined as values corresponding to
an observation given below :
Ungrouped Data Grouped Data
(arranged in ascending
or descending order)
Lower Quartile Q1
{( n + 1 ) / 4 }th
( n / 4 )th
Median Q2
{ ( n + 1 ) / 2 }th
( n / 2 )th
Upper Quartile Q3
{3 ( n + 1 ) / 4 } th
(3 n / 4 )th
77. Calculating exactly:Q1Calculating exactly:Q1
Using the formula:
16
X f CF
0 < 20 15 15
20 < 40 60 75
40 <100 25 100
N/4 = 25th
item
This is in the group 20 < 40
Lower limit (l) is 20
Width of group (i) is 20
Frequency of group (f) is 60
CF of previous group (F) is 15
Formula is:
−
+=
f
FN
ilQ q
4
11
First Quartile
−
+=
60
1525
20201Q
60
10
2020 ×+= 333.320 +=
= 23.3333
This means that 25% of the data is below 23.333
Width of group (i) is 20
CF of previous group (F) is 15