The document discusses correlation and the coefficient of correlation. It defines correlation as a statistical tool used to study the relationship between two or more variables. The coefficient of correlation (r) is a measure of the strength and direction of the linear relationship between variables. r can range from -1 to 1, where -1 is perfect negative correlation, 0 is no correlation, and 1 is perfect positive correlation. A scatter diagram can be used to visually depict the relationship between variables and provide an initial assessment of correlation.
This document discusses correlation analysis and different types of correlation. It defines correlation as the degree of inter-relatedness between two or more variables. Correlation can be positive, negative, simple, partial or multiple depending on the direction and number of variables analyzed. Linear and non-linear correlation are determined based on whether the relationship between variables is constant or not. The document provides examples of calculating correlation coefficients in Excel and SPSS and computing partial correlation coefficients controlling for other variables.
Time series econometrics deals with time series data that poses challenges due to non-stationarity. There are three types of stochastic processes - stationary, purely random, and non-stationary. Random walk models including random walk with and without drift are examples of non-stationary processes. A unit root stochastic process refers to non-stationary time series. Time series can be either trend stationary or difference stationary. Failing to account for non-stationarity can result in spurious regressions with high R-squared but no meaningful relationship between variables.
Measure of dispersion has two types Absolute measure and Graphical measure. There are other different types in there.
In this slide the discussed points are:
1. Dispersion & it's types
2. Definition
3. Use
4. Merits
5. Demerits
6. Formula & math
7. Graph and pictures
8. Real life application.
Heteroscedasticity occurs when the variance of the error terms in a regression model are not constant, but instead vary depending on the values of the independent variables. While ordinary least squares estimators remain unbiased, their standard errors may be incorrect under heteroscedasticity. This means that confidence intervals and hypothesis tests based on the usual standard errors are unreliable and can lead to misleading conclusions.
This document discusses matrices and determinants, and their applications in business mathematics. It covers solving systems of linear equations using matrix methods, including the matrix equation form AX=B. It also covers input-output analysis, including Leontief's input-output model, closed vs. open models, and how to set up the input-output matrix to capture inter-industry relationships and demands. Examples are provided throughout to illustrate solving systems of equations with matrices and performing input-output analysis.
This document discusses the use of dummy variables in econometric modeling. It begins by explaining that some variables cannot be quantified numerically and provides examples where dummy variables would be used. It then discusses how dummy variables are incorporated into regression models, including intercept dummy variables, slope dummy variables, and dummy variables for multiple categories. The document also covers seasonal dummy variables and concludes by explaining the Chow test and dummy variable test for testing structural stability using dummy variables.
This document discusses various measures of dispersion in statistics including range, mean deviation, variance, and standard deviation. It provides definitions and formulas for calculating each measure along with examples using both ungrouped and grouped frequency distribution data. Box-and-whisker plots are also introduced as a graphical method to display the five number summary of a data set including minimum, quartiles, and maximum values.
This document discusses correlation analysis and different types of correlation. It defines correlation as the degree of inter-relatedness between two or more variables. Correlation can be positive, negative, simple, partial or multiple depending on the direction and number of variables analyzed. Linear and non-linear correlation are determined based on whether the relationship between variables is constant or not. The document provides examples of calculating correlation coefficients in Excel and SPSS and computing partial correlation coefficients controlling for other variables.
Time series econometrics deals with time series data that poses challenges due to non-stationarity. There are three types of stochastic processes - stationary, purely random, and non-stationary. Random walk models including random walk with and without drift are examples of non-stationary processes. A unit root stochastic process refers to non-stationary time series. Time series can be either trend stationary or difference stationary. Failing to account for non-stationarity can result in spurious regressions with high R-squared but no meaningful relationship between variables.
Measure of dispersion has two types Absolute measure and Graphical measure. There are other different types in there.
In this slide the discussed points are:
1. Dispersion & it's types
2. Definition
3. Use
4. Merits
5. Demerits
6. Formula & math
7. Graph and pictures
8. Real life application.
Heteroscedasticity occurs when the variance of the error terms in a regression model are not constant, but instead vary depending on the values of the independent variables. While ordinary least squares estimators remain unbiased, their standard errors may be incorrect under heteroscedasticity. This means that confidence intervals and hypothesis tests based on the usual standard errors are unreliable and can lead to misleading conclusions.
This document discusses matrices and determinants, and their applications in business mathematics. It covers solving systems of linear equations using matrix methods, including the matrix equation form AX=B. It also covers input-output analysis, including Leontief's input-output model, closed vs. open models, and how to set up the input-output matrix to capture inter-industry relationships and demands. Examples are provided throughout to illustrate solving systems of equations with matrices and performing input-output analysis.
This document discusses the use of dummy variables in econometric modeling. It begins by explaining that some variables cannot be quantified numerically and provides examples where dummy variables would be used. It then discusses how dummy variables are incorporated into regression models, including intercept dummy variables, slope dummy variables, and dummy variables for multiple categories. The document also covers seasonal dummy variables and concludes by explaining the Chow test and dummy variable test for testing structural stability using dummy variables.
This document discusses various measures of dispersion in statistics including range, mean deviation, variance, and standard deviation. It provides definitions and formulas for calculating each measure along with examples using both ungrouped and grouped frequency distribution data. Box-and-whisker plots are also introduced as a graphical method to display the five number summary of a data set including minimum, quartiles, and maximum values.
Correlation analysis is used to find the degree of relationship between two or more variables by applying statistical tools. It produces a correlation coefficient that describes the strength and direction of the relationship. There are different types of correlation, including positive correlation where variables move in the same direction, negative correlation where they move in opposite directions, simple correlation between two variables, partial correlation controlling for other variables, and multiple correlation between three or more variables. Correlation analysis is important for measuring the degree of relationship between variables, estimating their values, and understanding economic behavior.
The document discusses economic planning and development in India between 1950-1990. It describes the objectives of economic planning as promoting economic growth, modernization, self-reliance, and equity. Key policies and initiatives during this period included land reforms and ceilings, promoting the green revolution in agriculture, establishing public enterprises and import substitution to lead industrialization, and adopting an inward-looking trade strategy to substitute imports. The public sector was given a leading role to create a strong industrial base and infrastructure, mobilize savings, and prevent concentration of economic power.
General Equilibrium IS-LM Framework for Macroeconomic AnalysisKhemraj Subedi
The document summarizes the IS-LM model of macroeconomics. It explains that the IS curve represents equilibrium in the goods market where investment equals savings at different interest rate and income level combinations. It slopes downward to show that lower interest rates lead to higher investment and income. The LM curve represents equilibrium in the money market where money demand equals supply at different interest rate and income level combinations. It slopes upward as higher income increases money demand, requiring higher interest rates to equilibrate the money market. The IS and LM curves intersect to determine the general equilibrium interest rate and income level in the short run.
It deals with various functional forms in regression along with the derivation and interpretation of the slope and elasticity values of each of the models. The frequently used models of log-lin, lin-log and log-log models are also adequately elaborated. The link of the MS powerpoint used in this video is also given separately as a pinned comment.
The document discusses India's New Industrial Policy of 1991 which introduced the principles of liberalization, privatization, and globalization (LPG). It aimed to address issues like the government's excessive spending, inefficiency, overprotection of industries, and other economic distortions. Liberalization relaxed restrictions on trade and investment. Privatization transferred ownership of public sector enterprises to private companies. Globalization opened the Indian economy to increased international trade and foreign investment. The policy changes aimed to make the Indian economy more competitive and integrate it into the global market.
This presentation provides an overview of the goods market equilibrium and money market equilibrium using the IS-LM model. It defines the equilibrium conditions for the goods market as savings equaling investment, and for the money market as money supply equaling money demand. It derives the downward sloping IS curve and upward sloping LM curve, and explains how their intersection shows the overall equilibrium in the goods and money markets. The document then discusses how fiscal and monetary policies can shift the IS and LM curves and discusses the 2001 US recession within this framework.
This document discusses autocorrelation, which occurs when there is a correlation between members of a series of observed data ordered over time or space. This violates an assumption of classical linear regression that error terms are uncorrelated. Causes of autocorrelation include inertia in macroeconomic data, specification bias from excluded or incorrectly specified variables, lags, data manipulation, and non-stationarity of time series data. Autocorrelation can be detected graphically or using the Durbin-Watson and Breusch-Godfrey tests. Remedial measures include first-difference transformation, generalized transformation, and using Newey-West standard errors.
It is essential for all regression models that the relationship between the independent and dependent variables are represented correctly. Functional form tries to do exactly this. A functional form will give an equation for the dependent and independent variables so that the hypothesis tests can be carried out properly. Copy the link given below and paste it in new browser window to get more information on Functional Forms of Regression Analysis:- http://www.transtutors.com/homework-help/economics/functional-forms-of-regression-models.aspx
This document summarizes the key assumptions and properties of Ordinary Least Squares (OLS) regression. OLS aims to minimize the sum of squared residuals by estimating the beta coefficients. It provides the best linear unbiased estimates if its assumptions are met. The key assumptions are: 1) the regression is linear in parameters; 2) the error term has a mean of zero; 3) the error term is uncorrelated with the independent variables; 4) there is no serial correlation or autocorrelation in the error term; 5) the error term has constant variance (homoskedasticity); and 6) there is no perfect multicollinearity among independent variables. When all assumptions are met, OLS estimates
The document discusses the concepts of interpolation and extrapolation. Interpolation is finding a missing value within a data set, while extrapolation is finding a value outside the existing data set. Two common methods are described: the binomial expansion method and Newton's method of advancing differences. Examples are provided to illustrate how to use each method to interpolate or extrapolate values.
This document discusses regression, comparing it to causation and correlation. Regression analysis estimates and predicts the average value of one variable based on the values of other variables. For example, predicting a son's average height from his father's height. Causation indicates a relationship where changing one variable affects another. Correlation measures the association between variables, while regression numerically relates an independent variable to a dependent variable to estimate or predict values.
This document discusses heteroscedasticity, which occurs when the error variance is not constant. It provides examples of when the variance of errors may change, such as with income level or outliers. Graphical methods are presented for detecting heteroscedasticity by examining patterns in residual plots. Formal tests are also described, including the Park test which regresses the log of the squared residuals on explanatory variables, and the Glejser test which regresses the absolute value of residuals on variables related to the error variance. Detection of heteroscedasticity is important as it violates assumptions of the classical linear regression model.
Correlation analysis examines the relationship between two or more variables. Positive correlation means the variables increase together, while negative correlation means they change in opposite directions. The Pearson correlation coefficient, r, quantifies the strength of linear correlation between -1 and 1. Multiple correlation analysis extends this to measure the correlation between one dependent variable and multiple independent variables. It is useful but assumes linear relationships and can be complex to calculate.
Advanced Econometrics by Sajid Ali Khan Rawalakot: 0334-5439066Sajid Ali Khan
This document appears to be the introduction or table of contents to a textbook on advanced econometrics. It includes 10 chapters that cover topics such as simple linear regression, multiple linear regression, dummy variables, autocorrelation, and simultaneous equation systems. The introduction defines econometrics and discusses its goals of policy making, forecasting, and analyzing economic theories using quantitative methods. It also outlines the methodology of econometrics, which involves stating an economic theory, specifying mathematical and statistical models, collecting data, estimating parameters, testing hypotheses, forecasting, and using models for control or policy purposes.
The document discusses the accelerator theory of investment. It explains that the accelerator principle states that an increase in a firm's output will require a proportional increase in its capital stock. The accelerator coefficient (v) represents the ratio of induced investment to an initial change in consumption. The naive accelerator model holds that net investment (Int) is equal to v multiplied by the change in output (ΔYt). Refinements to the simple accelerator model include allowing for asymmetrical reactions to increases and decreases in output, and assuming variable rather than fixed technical coefficients of production.
The document discusses standard deviation and its properties. Standard deviation is a measure of how spread out numbers are from the average (mean) value. It is always non-negative and can be impacted by outliers. A low standard deviation means values are close to the mean, while a high standard deviation means values are more spread out. Standard deviation can be used to calculate what percentage of data falls within certain intervals from the mean when data is normally distributed.
The document discusses time series analysis and its key components. It defines a time series as a set of data points indexed (or listed or graphed) in time order. A time series collects readings of a variable at evenly-spaced periods of time. It notes that time is the independent variable while the data is the dependent variable. The document outlines the main components of time series as trends, seasonal variations, cyclical variations, and irregular variations. It provides examples and discusses methods for measuring each component, including free hand curve, semi-average, moving average, and least squares. The purposes and importance of time series analysis are also highlighted.
The cob web model analyzes price and output dynamics in markets where supply responds to price with a time lag. It assumes that producers base current supply on previous period's price. If demand changes but supply cannot instantly adjust, prices and quantities will oscillate over time as they converge towards equilibrium. The model can produce convergent cycles that stabilize at equilibrium or divergent cycles where prices and outputs fluctuate further from equilibrium with each cycle. It is used to study agricultural commodity markets where production adjustments face time lags.
This document discusses analyzing relationships between variables using graphing calculators. It explains that correlation measures the strength of relationships between -1 and 1, with higher positive or negative values indicating stronger linear relationships. Examples are given of strong, medium, and weak correlations. The line of best fit mathematically describes the linear relationship between variables using the slope and y-intercept. Several examples are provided of exploring potential relationships between variables and determining the line of best fit equation and correlation.
This document provides an overview of correlation coefficients and how to interpret them. It discusses the difference between correlation strength and significance. The key points covered are:
ONE: Correlation coefficients measure the strength and direction of association between two variables but do not imply causation. Strength is evaluated on a scale from -1 to 1 while significance is determined by comparing the p-value to the significance level alpha.
TWO: There are two parts to interpreting a correlation - the coefficient indicates strength (weak, moderate, strong) while the p-value determines if the correlation is statistically significant or could be due to chance.
THREE: Examples are provided to demonstrate how to interpret correlation output and determine the most strongly correlated variables
Correlation analysis is used to find the degree of relationship between two or more variables by applying statistical tools. It produces a correlation coefficient that describes the strength and direction of the relationship. There are different types of correlation, including positive correlation where variables move in the same direction, negative correlation where they move in opposite directions, simple correlation between two variables, partial correlation controlling for other variables, and multiple correlation between three or more variables. Correlation analysis is important for measuring the degree of relationship between variables, estimating their values, and understanding economic behavior.
The document discusses economic planning and development in India between 1950-1990. It describes the objectives of economic planning as promoting economic growth, modernization, self-reliance, and equity. Key policies and initiatives during this period included land reforms and ceilings, promoting the green revolution in agriculture, establishing public enterprises and import substitution to lead industrialization, and adopting an inward-looking trade strategy to substitute imports. The public sector was given a leading role to create a strong industrial base and infrastructure, mobilize savings, and prevent concentration of economic power.
General Equilibrium IS-LM Framework for Macroeconomic AnalysisKhemraj Subedi
The document summarizes the IS-LM model of macroeconomics. It explains that the IS curve represents equilibrium in the goods market where investment equals savings at different interest rate and income level combinations. It slopes downward to show that lower interest rates lead to higher investment and income. The LM curve represents equilibrium in the money market where money demand equals supply at different interest rate and income level combinations. It slopes upward as higher income increases money demand, requiring higher interest rates to equilibrate the money market. The IS and LM curves intersect to determine the general equilibrium interest rate and income level in the short run.
It deals with various functional forms in regression along with the derivation and interpretation of the slope and elasticity values of each of the models. The frequently used models of log-lin, lin-log and log-log models are also adequately elaborated. The link of the MS powerpoint used in this video is also given separately as a pinned comment.
The document discusses India's New Industrial Policy of 1991 which introduced the principles of liberalization, privatization, and globalization (LPG). It aimed to address issues like the government's excessive spending, inefficiency, overprotection of industries, and other economic distortions. Liberalization relaxed restrictions on trade and investment. Privatization transferred ownership of public sector enterprises to private companies. Globalization opened the Indian economy to increased international trade and foreign investment. The policy changes aimed to make the Indian economy more competitive and integrate it into the global market.
This presentation provides an overview of the goods market equilibrium and money market equilibrium using the IS-LM model. It defines the equilibrium conditions for the goods market as savings equaling investment, and for the money market as money supply equaling money demand. It derives the downward sloping IS curve and upward sloping LM curve, and explains how their intersection shows the overall equilibrium in the goods and money markets. The document then discusses how fiscal and monetary policies can shift the IS and LM curves and discusses the 2001 US recession within this framework.
This document discusses autocorrelation, which occurs when there is a correlation between members of a series of observed data ordered over time or space. This violates an assumption of classical linear regression that error terms are uncorrelated. Causes of autocorrelation include inertia in macroeconomic data, specification bias from excluded or incorrectly specified variables, lags, data manipulation, and non-stationarity of time series data. Autocorrelation can be detected graphically or using the Durbin-Watson and Breusch-Godfrey tests. Remedial measures include first-difference transformation, generalized transformation, and using Newey-West standard errors.
It is essential for all regression models that the relationship between the independent and dependent variables are represented correctly. Functional form tries to do exactly this. A functional form will give an equation for the dependent and independent variables so that the hypothesis tests can be carried out properly. Copy the link given below and paste it in new browser window to get more information on Functional Forms of Regression Analysis:- http://www.transtutors.com/homework-help/economics/functional-forms-of-regression-models.aspx
This document summarizes the key assumptions and properties of Ordinary Least Squares (OLS) regression. OLS aims to minimize the sum of squared residuals by estimating the beta coefficients. It provides the best linear unbiased estimates if its assumptions are met. The key assumptions are: 1) the regression is linear in parameters; 2) the error term has a mean of zero; 3) the error term is uncorrelated with the independent variables; 4) there is no serial correlation or autocorrelation in the error term; 5) the error term has constant variance (homoskedasticity); and 6) there is no perfect multicollinearity among independent variables. When all assumptions are met, OLS estimates
The document discusses the concepts of interpolation and extrapolation. Interpolation is finding a missing value within a data set, while extrapolation is finding a value outside the existing data set. Two common methods are described: the binomial expansion method and Newton's method of advancing differences. Examples are provided to illustrate how to use each method to interpolate or extrapolate values.
This document discusses regression, comparing it to causation and correlation. Regression analysis estimates and predicts the average value of one variable based on the values of other variables. For example, predicting a son's average height from his father's height. Causation indicates a relationship where changing one variable affects another. Correlation measures the association between variables, while regression numerically relates an independent variable to a dependent variable to estimate or predict values.
This document discusses heteroscedasticity, which occurs when the error variance is not constant. It provides examples of when the variance of errors may change, such as with income level or outliers. Graphical methods are presented for detecting heteroscedasticity by examining patterns in residual plots. Formal tests are also described, including the Park test which regresses the log of the squared residuals on explanatory variables, and the Glejser test which regresses the absolute value of residuals on variables related to the error variance. Detection of heteroscedasticity is important as it violates assumptions of the classical linear regression model.
Correlation analysis examines the relationship between two or more variables. Positive correlation means the variables increase together, while negative correlation means they change in opposite directions. The Pearson correlation coefficient, r, quantifies the strength of linear correlation between -1 and 1. Multiple correlation analysis extends this to measure the correlation between one dependent variable and multiple independent variables. It is useful but assumes linear relationships and can be complex to calculate.
Advanced Econometrics by Sajid Ali Khan Rawalakot: 0334-5439066Sajid Ali Khan
This document appears to be the introduction or table of contents to a textbook on advanced econometrics. It includes 10 chapters that cover topics such as simple linear regression, multiple linear regression, dummy variables, autocorrelation, and simultaneous equation systems. The introduction defines econometrics and discusses its goals of policy making, forecasting, and analyzing economic theories using quantitative methods. It also outlines the methodology of econometrics, which involves stating an economic theory, specifying mathematical and statistical models, collecting data, estimating parameters, testing hypotheses, forecasting, and using models for control or policy purposes.
The document discusses the accelerator theory of investment. It explains that the accelerator principle states that an increase in a firm's output will require a proportional increase in its capital stock. The accelerator coefficient (v) represents the ratio of induced investment to an initial change in consumption. The naive accelerator model holds that net investment (Int) is equal to v multiplied by the change in output (ΔYt). Refinements to the simple accelerator model include allowing for asymmetrical reactions to increases and decreases in output, and assuming variable rather than fixed technical coefficients of production.
The document discusses standard deviation and its properties. Standard deviation is a measure of how spread out numbers are from the average (mean) value. It is always non-negative and can be impacted by outliers. A low standard deviation means values are close to the mean, while a high standard deviation means values are more spread out. Standard deviation can be used to calculate what percentage of data falls within certain intervals from the mean when data is normally distributed.
The document discusses time series analysis and its key components. It defines a time series as a set of data points indexed (or listed or graphed) in time order. A time series collects readings of a variable at evenly-spaced periods of time. It notes that time is the independent variable while the data is the dependent variable. The document outlines the main components of time series as trends, seasonal variations, cyclical variations, and irregular variations. It provides examples and discusses methods for measuring each component, including free hand curve, semi-average, moving average, and least squares. The purposes and importance of time series analysis are also highlighted.
The cob web model analyzes price and output dynamics in markets where supply responds to price with a time lag. It assumes that producers base current supply on previous period's price. If demand changes but supply cannot instantly adjust, prices and quantities will oscillate over time as they converge towards equilibrium. The model can produce convergent cycles that stabilize at equilibrium or divergent cycles where prices and outputs fluctuate further from equilibrium with each cycle. It is used to study agricultural commodity markets where production adjustments face time lags.
This document discusses analyzing relationships between variables using graphing calculators. It explains that correlation measures the strength of relationships between -1 and 1, with higher positive or negative values indicating stronger linear relationships. Examples are given of strong, medium, and weak correlations. The line of best fit mathematically describes the linear relationship between variables using the slope and y-intercept. Several examples are provided of exploring potential relationships between variables and determining the line of best fit equation and correlation.
This document provides an overview of correlation coefficients and how to interpret them. It discusses the difference between correlation strength and significance. The key points covered are:
ONE: Correlation coefficients measure the strength and direction of association between two variables but do not imply causation. Strength is evaluated on a scale from -1 to 1 while significance is determined by comparing the p-value to the significance level alpha.
TWO: There are two parts to interpreting a correlation - the coefficient indicates strength (weak, moderate, strong) while the p-value determines if the correlation is statistically significant or could be due to chance.
THREE: Examples are provided to demonstrate how to interpret correlation output and determine the most strongly correlated variables
Scatter diagrams, strong and weak correlation, positive and negative correlation, lines of best fit, extrapolation and interpolation. Aimed at UK level 2 students on Access and GCSE Maths courses.
Partial correlation estimates the relationship between two variables while removing the influence of a third variable. It is a way to determine the correlation between two variables when controlling for a third. For example, a researcher may want to know the correlation between height and weight but also wants to control for gender, which can influence bone and muscle structure. Using the data sample provided, the correlation between height and weight was 0.825 but decreased to 0.770 when controlling for gender, showing gender partially explains the relationship between height and weight.
The document describes how to report a partial correlation in APA format. It provides a template for reporting that when controlling for a covariate, the partial correlation between two variables is r = ___, p = ___. As an example, it states that when controlling for age, the partial correlation between intense fanaticism for a professional sports team and proximity to the city the team resides is r = .82, p = .000.
The document defines and describes the coefficient of correlation, which is a statistic that measures the strength and direction of the linear relationship between two variables on a scale of -1 to +1. It explains that a correlation of -1 is a perfect negative correlation, 0 is no correlation, and +1 is a perfect positive correlation. The document also differentiates between types of correlation coefficients and methods for computing correlation coefficients, including Karl Pearson's and Spearman's rank correlation coefficients.
Correlation and regression are statistical techniques used to analyze relationships between variables. Correlation determines the strength and direction of a relationship, while regression describes the linear relationship to predict changes in one variable based on changes in another. There are different types of correlation including simple, multiple, and partial correlation. Regression analysis determines the regression line that best fits the data to estimate values of one variable based on the other. The correlation coefficient measures the strength of linear correlation from -1 to 1, while regression coefficients are used to predict changes in the variables.
Correlation and regression analysis are statistical methods used to determine if a relationship exists between variables and describe the nature of that relationship. A scatter plot graphs the independent and dependent variables and allows visualization of any trends in the data. The correlation coefficient measures the strength and direction of the linear relationship between variables, ranging from -1 to 1. Regression finds the linear "best fit" line that minimizes the residuals and can be used to predict dependent variable values.
Correlation and regression analysis are statistical methods used to determine if a relationship exists between variables and describe the nature of that relationship. A scatter plot graphs the independent and dependent variables and allows visualization of any trends in the data. The correlation coefficient measures the strength and direction of the linear relationship between variables, ranging from -1 to 1. Regression finds the linear "best fit" line that minimizes the residuals, or differences between observed and predicted dependent variable values. The coefficient of determination measures how much variation in the dependent variable is explained by the regression model.
This document discusses correlation and provides examples to illustrate key concepts:
1. Correlation quantifies the linear relationship between two variables and ranges from -1 to 1. Values closer to 1 or -1 indicate a stronger linear relationship.
2. Scatterplots visually depict the relationship and can show if variables are positively or negatively correlated.
3. The Pearson correlation coefficient (r) is a common measure of linear correlation calculated using variables' means, sums, and standard deviations.
4. Correlation only captures linear relationships and does not prove causation between variables. Additional analysis is needed to interpret correlated variables.
Unit 1 bp801 t g correlation analysis24022022ashish7sattee
Correlation is a statistical measure that indicates the extent to which two or more variables fluctuate together. A positive correlation indicates that the variables increase or decrease in parallel, while a negative correlation indicates that one variable increases as the other decreases. A correlation of -0.97 represents a strong negative correlation, while 0.10 represents a weak positive correlation. Correlations above 0.4 are generally considered relatively strong. Even if there is no linear relationship between two variables, it does not necessarily mean that no relationship exists at all.
This document provides an overview of correlation and linear regression analysis. It defines correlation as a statistical measure of the relationship between two variables. Pearson's correlation coefficient (r) ranges from -1 to 1, with values farther from 0 indicating a stronger linear relationship. Positive values indicate an increasing relationship, while negative values indicate a decreasing relationship. The coefficient of determination (r2) represents the proportion of shared variance between variables. While correlation indicates linear association, it does not imply causation. Multiple regression allows predicting a continuous dependent variable from two or more independent variables.
This document discusses correlation and defines it as the statistical relationship between two variables, where a change in one variable results in a corresponding change in the other. It describes different types of correlation including positive, negative, simple, partial and multiple. Methods for studying correlation are also outlined, including scatter diagrams and Karl Pearson's coefficient of correlation (represented by r), which quantifies the strength and direction of the linear relationship between two variables from -1 to 1. The coefficient of determination (r2) is also introduced, which expresses the proportion of variance in one variable that is predictable from the other.
The document discusses covariance and correlation, which describe the relationship between two variables. Covariance indicates whether variables are positively or inversely related, while correlation also measures the degree of their relationship. A positive covariance/correlation means variables move in the same direction, while a negative covariance/correlation means they move in opposite directions. Correlation coefficients range from 1 to -1, with 1 indicating a perfect positive correlation and -1 a perfect inverse correlation. The document provides formulas for calculating covariance and correlation and examples to demonstrate their use.
This document discusses correlation and regression analysis. It defines correlation as a statistical measure of how two variables are related. A correlation coefficient between -1 and 1 indicates the strength and direction of the linear relationship between variables. A scatterplot can show this graphically. Regression analysis involves using one variable to predict scores on another variable. Simple linear regression uses one independent variable to predict a dependent variable, while multiple regression uses two or more independent variables. The goal is to identify the regression line that best fits the data with the least error. The coefficient of determination, R2, indicates how much variance in the dependent variable is explained by the independent variables.
Data analysis test for association BY Prof Sachin Udepurkarsachinudepurkar
1) The document discusses analyzing relationships between variables through bivariate analysis. Bivariate analysis examines the relationship between two variables and can determine direction, strength, and statistical significance.
2) It provides examples of using scatter plots and calculating covariance to visually represent and quantify relationships between variables. Covariance measures how much two variables change together.
3) Calculating the correlation coefficient further standardizes and quantifies relationships, resulting in a number between -1 and 1 that indicates the strength and direction of a relationship. Strong positive or negative correlations near 1 or -1 show clear relationships between variables.
Correlation analysis measures the strength and direction of association between two or more variables. It is represented by the coefficient of correlation (r), which ranges from -1 to 1. A value of 0 indicates no association, 1 indicates perfect positive association, and -1 indicates perfect negative association. The scatter diagram is a graphical method to visualize the association between variables by plotting their values. Karl Pearson's coefficient is a commonly used algebraic method to calculate the coefficient of correlation from sample data.
Correlation and Regression analysis is one of the important concepts of statistics which could be used to understand the relationship between the variables.
This document discusses different dependence techniques, including correlation and regression. It provides details on simple regression, multiple regression, and standard multiple regression. Correlation measures the relationship between two variables from -1 to 1. Regression analysis is used to predict a dependent variable from independent variables. Standard multiple regression evaluates the relationship between a set of independent variables and a dependent variable by entering all independent variables simultaneously.
Correlational research describes the linear relationship between two or more variables without attributing cause and effect. The correlation coefficient is used to measure the strength of this relationship on a scale from -1 to 1. Positive correlations indicate variables increase or decrease together, while negative correlations mean they change in opposite directions. Scatterplots visually depict the correlation by showing how paired values of different variables relate on a graph. The Pearson's r formula is commonly used to calculate correlation coefficients from sample data.
This document discusses correlation analysis and different types of correlation. It defines correlation as a statistical analysis of the relationship between two or more variables. There are three main types of correlation discussed:
1. Positive correlation means that as one variable increases, the other also tends to increase. Negative correlation means that as one variable increases, the other tends to decrease.
2. Simple correlation analyzes the relationship between two variables, while multiple correlation analyzes three or more variables simultaneously. Partial correlation holds the effect of other variables constant.
3. Methods for measuring correlation include scatter diagrams, which graphically show the relationship, and algebraic formulas that calculate a correlation coefficient to quantify the strength and direction of the relationship.
Study of Correlation Theory with Different Views and Methodsamong Variables i...inventionjournals
Correlation among two numbers is an important concept and this relationship among two variables may be direct or indirect/inverse. Generally, correlation of two numbers is studyin statistics .The different types of correlation among numbers may be positively correlated, negatively correlated and perfectly correlated in statistics. Generally theserelationship is direct or indirect/inverse.So,in this paper it is tried to explore correlation among numbers/variables inunitary methods, ratio and proportion, variation methods.
The document discusses correlation and regression analysis. It defines correlation as the statistical relationship between two variables, where a change in one variable corresponds to a change in the other. The key types of correlation are positive, negative, simple, partial and multiple, and linear and non-linear. Regression analysis establishes the average relationship between an independent and dependent variable in order to predict or estimate values of the dependent variable based on the independent variable. Methods for studying correlation include scatter diagrams and Karl Pearson's coefficient of correlation, while regression analysis uses equations to model the linear relationship between variables.
Rai University provides high quality education for MSc, Law, Mechanical Engineering, BBA, MSc, Computer Science, Microbiology, Hospital Management, Health Management and IT Engineering.
The document discusses various types of retailers including specialty stores, department stores, supermarkets, convenience stores, and discount stores. It then covers marketing decisions for retailers related to target markets, product assortment, store services, pricing, promotion, and store location. The document also discusses wholesaling, including the functions of wholesalers, types of wholesalers, and marketing decisions faced by wholesalers.
This document discusses marketing channels and channel management. It defines marketing channels as sets of interdependent organizations that make a product available for use. Channels perform important functions like information gathering, stimulating purchases, negotiating prices, ordering, financing inventory, storage, and payment. Channel design considers customer expectations, objectives, constraints, alternatives that are evaluated. Channel management includes selecting, training, motivating, and evaluating channel members. Channels are dynamic and can involve vertical, horizontal, and multi-channel systems. Conflicts between channels must be managed to balance cooperation and competition.
The document discusses integrated marketing communication and its various elements. It defines integrated marketing communication as combining different communication modes like advertising, sales promotion, public relations, personal selling, and direct marketing to provide a complete communication portfolio to audiences. It also discusses the communication process and how each element of the marketing mix communicates to customers. The document provides details on the key components of an integrated marketing communication mix and how it can be used to build brand equity.
Pricing is a key element in determining the profitability and success of a business. The price must be set correctly - if too high, demand may decrease and the product may be priced out of the market, but if too low, revenue may not cover costs. Pricing strategies should consider the product lifecycle stage, costs, competitors, and demand factors. Common pricing methods include penetration pricing for new products, market skimming for premium products, value pricing based on perceived worth, and cost-plus pricing which adds a markup to costs. Price affects demand through price elasticity, with elastic demand more sensitive to price changes.
The document discusses various aspects of branding such as definitions of a brand, brand positioning, brand name selection, brand sponsorship, brand development strategies like line extensions and brand extensions, challenges in branding, importance of packaging, labeling, and universal product codes. It provides examples of well-known brands and analyzes their branding strategies. The key points covered are creating emotional value for customers, building relationships and loyalty, using brands to project aspirational lifestyles and values to command premium prices.
This document outlines the key stages in the new product development (NPD) process. It begins with generating ideas for new products, which can come from internal or external sources. Ideas are then screened using criteria like market size and development costs. Successful concepts are developed and test marketed to customers. If testing goes well, the product proceeds to commercialization with a full market launch. The NPD process helps companies focus their resources on projects most likely to be rewarding and brings new products to market more quickly. It describes common challenges in NPD like defining specifications and managing resources and timelines, and how to overcome them through planning and cross-functional involvement.
A product is an item offered for sale that can be physical or virtual. It has a life cycle and may need to be adapted over time to remain relevant. A product needs to serve a purpose, function well, and be effectively communicated to users. It also requires a name to help it stand out.
A product hierarchy has multiple levels from core needs down to specific items. These include the need, product family, class, line, type, and item or stock keeping unit.
Products go through a life cycle with stages of development, introduction, growth, maturity, and decline. Marketing strategies must adapt to each stage such as heavy promotion and price changes in introduction and maturity.
This document discusses barriers between marketing researchers and managerial decision makers. It identifies three types of barriers: behavioral, process, and organizational. Specific behavioral barriers discussed include confirmatory bias, the difficulty balancing creativity and data, and the newcomer syndrome. Process barriers include unsuccessful problem definition and research rigidity. Organizational barriers include misuse of information asymmetries. The document also discusses ethical issues in marketing research such as deceptive practices, invasion of privacy, and breaches of confidentiality.
The document discusses best practices for organizing, writing, and presenting a marketing research report. It provides guidance on structuring the report with appropriate headings, formatting the introduction and conclusion/recommendation sections, effectively utilizing visuals like tables and graphs, and tips for an ethical and impactful oral presentation of the findings. The goal is to clearly communicate the research results and insights to the client to inform their decision-making.
This document discusses marketing research and its key steps and methods. Marketing research involves collecting, analyzing and communicating information to make informed marketing decisions. There are 5 key steps in marketing research: 1) define the problem, 2) collect data, 3) analyze and interpret data, 4) reach a conclusion, 5) implement the research. Common data collection methods include interviews, surveys, observations, and experiments. The data is then analyzed using statistical techniques like frequency, percentages, and means to interpret the findings and their implications for marketing decisions.
Bdft ii, tmt, unit-iii, dyeing & types of dyeing,Rai University
Dyeing is a method of imparting color to textiles by applying dyes. There are two major types of dyes - natural dyes extracted from plants/animals/minerals and synthetic dyes made in a laboratory. Dyes can be applied at different stages of textile production from fibers to yarns to fabrics to finished garments. Common dyeing methods include stock dyeing, yarn dyeing, piece dyeing, and garment dyeing. Proper dye and method selection are needed for good colorfastness.
Bsc agri 2 pae u-4.4 publicrevenue-presentation-130208082149-phpapp02Rai University
The government requires public revenue to fund its political, social, and economic activities. There are three main sources of public revenue: tax revenue, non-tax revenue, and capital receipts. Tax revenue is collected through direct taxes like income tax, which are paid directly to the government, and indirect taxes like sales tax, where the burden can be shifted to other parties. Non-tax revenue sources include profits from public enterprises, railways, postal services, and the Reserve Bank of India. While taxes provide wide coverage and influence production, they can also reduce incentives to work and increase inequality.
Public expenditure has increasingly grown over time to fulfill three main roles: protecting society, protecting individuals, and funding public works. The growth can be attributed to several causes like increased income, welfare state ideology, effects of war, increased resources and ability to finance expenditures, inflation, and effects of democracy, socialism, and development. There are also canons that govern public spending like benefits, economy, and approval by authorities. The effects of public expenditure include impacts on consumption, production through efficiency, incentives and allocation, and distribution of resources.
Public finance involves the taxing and spending activities of government. It focuses on the microeconomic functions of government and examines taxes and spending. Government ideology can view the community or individual as most important. In the US, the federal government has more spending flexibility than states. Government spending has increased significantly as a percentage of GDP from 1929 to 2001. Major items of federal spending have shifted from defense to entitlements like Social Security and Medicare. Revenues mainly come from individual income taxes, payroll taxes, and corporate taxes at the federal level and property, sales, and income taxes at the state and local levels.
This document provides an overview of public finance. It defines public finance as the study of how governments raise money through taxes and spending, and how these activities affect the economy. It discusses why public finance is needed to provide public goods and services, redistribute wealth, and correct issues like pollution. The key aspects of public finance covered are government spending, revenue sources like income taxes, and how fiscal policy around spending and taxation can influence economic performance.
The document discusses the classical theory of inflation and how it relates to money supply. It states that inflation is defined as a rise in the overall price level in an economy. The quantity theory of money explains that inflation is primarily caused by increases in the money supply as controlled by the central bank. When the money supply grows faster than the amount of goods and services, it leads to too much money chasing too few goods and a rise in prices, or inflation. The document also notes that hyperinflation, which is a very high rate of inflation, can occur when governments print too much money to fund spending.
Bsc agri 2 pae u-3.2 introduction to macro economicsRai University
This document provides an introduction to macroeconomics. It defines macroeconomics as the study of national economies and the policies that governments use to affect economic performance. It discusses key issues macroeconomists address such as economic growth, business cycles, unemployment, inflation, international trade, and macroeconomic policies. It also outlines different macroeconomic theories including classical, Keynesian, and unified approaches.
Market structure identifies how a market is composed in terms of the number of firms, nature of products, degree of monopoly power, and barriers to entry. Markets range from perfect competition to pure monopoly based on imperfections. The level of competition affects consumer benefits and firm behavior. While models simplify reality, they provide benchmarks to analyze real world situations, where regulation may influence firm actions.
This document discusses the concept of perfect competition in economics. It defines perfect competition as a market with many small firms, identical products, free entry and exit of firms, and complete information. The document outlines the key features of perfect competition including: a large number of buyers and sellers, homogeneous products, no barriers to entry or exit, and profit maximization by firms. It also discusses the short run and long run equilibrium of a perfectly competitive firm, including cases where firms experience super normal profits, normal profits, or losses.
Main Java[All of the Base Concepts}.docxadhitya5119
This is part 1 of my Java Learning Journey. This Contains Custom methods, classes, constructors, packages, multithreading , try- catch block, finally block and more.
How to Setup Warehouse & Location in Odoo 17 InventoryCeline George
In this slide, we'll explore how to set up warehouses and locations in Odoo 17 Inventory. This will help us manage our stock effectively, track inventory levels, and streamline warehouse operations.
A review of the growth of the Israel Genealogy Research Association Database Collection for the last 12 months. Our collection is now passed the 3 million mark and still growing. See which archives have contributed the most. See the different types of records we have, and which years have had records added. You can also see what we have for the future.
it describes the bony anatomy including the femoral head , acetabulum, labrum . also discusses the capsule , ligaments . muscle that act on the hip joint and the range of motion are outlined. factors affecting hip joint stability and weight transmission through the joint are summarized.
How to Fix the Import Error in the Odoo 17Celine George
An import error occurs when a program fails to import a module or library, disrupting its execution. In languages like Python, this issue arises when the specified module cannot be found or accessed, hindering the program's functionality. Resolving import errors is crucial for maintaining smooth software operation and uninterrupted development processes.
Executive Directors Chat Leveraging AI for Diversity, Equity, and InclusionTechSoup
Let’s explore the intersection of technology and equity in the final session of our DEI series. Discover how AI tools, like ChatGPT, can be used to support and enhance your nonprofit's DEI initiatives. Participants will gain insights into practical AI applications and get tips for leveraging technology to advance their DEI goals.
2. Ch 7_2
What is meant by correlation?
It is viewed as a statistical tool with the help of which
the relationship between two or more than two variables
is studied. Correlation analysis refers to a technique
used in measuring the closeness of the relationship
between the variables.
If two quantities vary in such a way that movements in
one are accompanied by movements in the other, these
quantities are said to be correlated.
Continued…..
3. Ch 7_3
What is meant by Correlation?
Examples:
Relationship between family income and expenditure
on luxury items
Price of a commodity and amount demanded
Increase in rainfall up to a point and production of rice
Increase in the number of a television licenses and
number of cinema admissions.
Continued…..
4. Ch 7_4
What are the major issues of analyzing
the relation between different series ?
Determining whether a relation exists and, if it does,
one has to measure it ;
Testing whether it is significant;
Establishing the cause –and- effect relations
5. Ch 7_5
What is the significance of the study of
correlation?
The study of correlation is of immense use in
practical life because of the following reasons:
Most of the variables show some kind of
relationship. With the help of correlation analysis one
can measure in one figure the degree of relationship
existing between the variables.
Once two variables are closely related, one can
estimate the value of one variable given the value of
another.
Continued……..
6. Ch 7_6
What is the significance of the study of
correlation?
Correlation analysis contributes to the economic
behavior, aids in locating the critically important
variables on which others depend. This may reveal the
connection by which disturbances spread and suggest
the paths through which stabilizing forces become
effective.
Progressive development in the methods of science
and philosophy has been characterized by increase in
the knowledge of relationship or correlations.
It should be noted that coefficient of correlation is one
of the most widely used tool and also one of the most
widely abused statistical measures.
7. Ch 7_7
Continued…….
Example:
Advertisement expenditure
(Tk lakhs)
25
35
45
55
65
Sales
(Tk. Crores)
120
140
160
180
200
The above data show a perfect positive relationship
between advertisement expenditure and sales. But such
a situation is rare in practice.
8. Ch 7_8
Does correlation always signify a cause, and
effect relationship between variables? If not,
Why?
Continued…….
Both the correlated variables may be influenced by one
or more other variables: A high degree of correlation
between the yield per acre of the rice and tea may be due
to the amount of rainfall. But none of the two variables
is the cause of the other.
Both the variables may be mutually influencing each
other so that neither can be designated as cause and the
other the effect.
Variables like demand and supply, price and production,
etc. mutually interact.
9. Ch 7_9
Continued…….
Example: As the price of a commodity increases, its
demand goes down and so price is the cause and
demand the effect. But it is also possible that
increased demand of a commodity due to the growth
of population or other reasons may force its price up.
Now the cause is the increased demand, the effect the
price. Thus at times it may become difficult to explain
from the two correlated variables which is the cause
and which is the effect because both may be reacting
on each other.
The above points clearly show that correlation does
not manifest causation or functional relationship. By
itself, it establishes only covariation.
10. Ch 7_10
What are various types of correlation?
Correlation is classified in several different ways. The
most important types of correlation are:
Positive and negative correlation
Simple, partial and multiple correlation
Linear and non–linear correlation
Continued………
11. Ch 7_11
What are various types of correlation?
Positive correlation: If both the variables vary in the
same direction i.e. if one variable increases, the other
on average also increases, or if one variable
decreases, the other on average also decreases,
correlation is said to be positive.
Continued………
13. Ch 7_13
Continued………
Negative correlation: If the variables vary in
opposite directions i.e. if one variable increases the
other decreases or vice versa, correlation is said to
be negative.
What are the various types of correlation?
15. Ch 7_15
What are the various types of correlation?
Simple correlation: When only two variables are
studied, it is a problem of simple correlation.
Continued………
16. Ch 7_16
What are the various types of correlation?
Multiple correlation: When three or more variables are
studied simultaneously, it is a problem of multiple
correlation.
Example 1: When we study the relationship between the
yield of rice per acre and both the amount of rainfall and
the amount of fertilizers used, it is problem of multiple
correlation.
Example 2: The relationship of plastic hardness,
temperature and pressure .
Continued………
17. Ch 7_17
What are the various types of correlation.
Partial correlation: In partial correlation, we recognise
more than two variables. But, when only two variables
are considered to be influencing each other and the
effect of other influencing variable is kept constant, it
is a problem of partial correlation.
Example: If we limit our correlation analysis of yield of
rice per acre and rainfall to periods when a certain
average daily temperature existed, it becomes a
problem of partial correlation.
Continued………
18. Ch 7_18
What are various types of correlation?
Linear (curvilinear) correlation: If the amount of
change in one variable tends to bear constant ratio to
the amount of change in other variable, then the
correlation is said to be linear.
Continued………
19. Ch 7_19
Continued………
Example: X 10 20 30 40 50
Y 70 140 210 280 350
It is clear that the ratio of change between two
variables is the same.
If these variables are plotted on a graph paper, all
the plotted points would fall on a straight line.
21. Ch 7_21
What are the various types of correlation.
Non–linear (curvilinear) correlation: If the amount of
change in one variable does not bear a constant ratio
to the amount of change in the other variable, then the
correlation is said to be non–linear (curvilinear).
Example: If the amount of rainfall is doubled, the
production of rice or wheat, etc. would not necessarily
be doubled. In most practical cases, we find a non-
linear relationship between the variables.
But, since techniques of analysis for measuring non-
linear correlation are very complicated, the
relationship between the variables is assumed to be of
the linear type.
Continued………
23. Ch 7_23
What are the methods of studying
correlations?
Continued………
Scatter Diagram Method
Karl Pearson’s coefficient of correlation
Spearman’s Rank correlation coefficient
24. Ch 7_24
What is Scatter Diagram ?
Continued………
A scatter diagram refers to a diagram in which the
values of the variables are plotted on a graph paper
in the form of dots i.e. for each pair of X and Y
values. If we put dots and thus obtain as many points
as the number of observations, the diagram of dots,
so obtained is known as scatter diagram.
25. Ch 7_25
How can scatter diagram method (Dot chart
method) be used to study correlation?
Continued………
In this method, the given data are plotted on a graph
paper in the form of dots. From scatter diagram i.e. by
looking to the scatter of the various points, we can
form a fairly good, though vague, idea whether the
variables are correlated or not, e.g., if the points are
dense, i.e. very close to each other, we should expect
a fairly good amount of correlation between the
variables and if the points are widely scattered, a poor
correlation is expected.
26. Ch 7_26
When is correlation said to be perfectly
positive or and perfectly negative?
Continued………
If all the points lie on a straight line rising from the
lower left hand corner to the upper right hand corner,
correlation is said to be perfectly positive ( i.e. r = +1)
If all the points lie on a straight line falling from the
upper left hand corner to the lower right hand corner
of the diagram, correlation is said to be perfectly
negative (i.e. r = - 1).
29. Ch 7_29
Continued………
If the plotted points fall in a narrow band, there would
be a high degree of correlation between the variables.
Correlation shall be positive if the points show a
rising tendency from upper left-hand corner to the
right hand corner of the diagram, and negative if the
points show a declining tendency from upper left hand
corner to the lower right hand corner of the diagram.
31. Ch 7_31
High degree of Negative Correlation
0 1 2 3 4 5 6 7 8 9 10
10
9
8
7
6
5
4
3
2
1
0
X
Y
32. Ch 7_32
When will there be low degree of
correlation between two variables ?
If the points are widely scattered over the diagrams, it
indicates very low degree of relationship between the
variables.
This correlation shall be positive if the points rise from
the lower left-hand corner to the upper right-hand
corner, and negative if the points run from the upper
left–hand side to the lower right hand side to the
diagram.
35. Ch 7_35
When will there be no correlation
between two variables?
If the plotted points lie on a straight line parallel to the X-
axis, or in a haphazard manner, it shows the absence of
any relationship between the variables (i.e. r = 0)
37. Ch 7_37
Capital employed
(Tk.crore)
1 2 3 4 5 7 8 9 11 12
Profits (Tk.lakhs) 3 5 4 7 9 8 10 11 12 14
Example:
The following pairs of values are given:
1. Make a scatter diagram
2. Do you think that there is any correlation
between profits and capital employed?
39. Ch 7_39
It appears from the above diagram that the variables –
profits and capital employed are correlated.
Correlation is positive because the trend to the points
is upward rising from the lower left hand corner to the
upper right–hand corner.
The degree of relationship is high because the plotted
points are in a narrow band which shows that it is a
case of high degree of positive correlation.
Do you think that there is any correlation
between profits and capital employed?
40. Ch 7_40
What are the merits of scatter diagram
method studying of correlation?
Merits:
It is a simple and non–mathematical method of
studying correlation between the variables. Hence, it
can be easily understood and rough idea can quickly
be formed as to whether or not the variables are
related.
It is not influenced by the size of extreme values
whereas, most of the mathematical methods of
finding correlation are influenced by extreme values.
Making a scatter diagram usually is the first step in
investigating the relationship between the variable.
41. Ch 7_41
What are the limitations of Scatter diagram
method of studying correlation?
Limitations:
It is not possible to establish the exact degree of
correlation between the variables as is possible by
applying the mathematical method.
42. Ch 7_42
The coefficient of correlation (r) is a measure of the
strength of the linear relationship between two or more
variables. This summarizes in one figure the direction
and degree of correlation.
Designated r, it is often referred to as Pearson’s ‘r’
It can assume any value from –1.00 to +1.00
inclusive. A correlation co-efficient of –1.00 or +1.00
indicates perfect correlation.
If there is absolutely no relationship between the
two sets of variables, Pearson’s r is zero.
It requires interval or ratio-scaled data (variables).
What is meant by coefficient of correlation?
Continued…….
43. Ch 7_43
Negative values indicate an inverse
relationship and positive values indicate a
direct relationship.
If there is absolutely no relationship between
the two sets of variables, Pearson’s r is zero. A
coefficient of correlation r close to o (say, 0.08).
shows that the linear relationship is very weak.
The same conclusion is drawn if r = - 0.08 .
What is meant by Coefficient of Correlation?
Continued…….
44. Ch 7_44
Coefficients of –0.91 and + 0.91 have equal strength,
both indicate very strong correlation between the two
variables. Thus, the strength of correlation does not
depend on the direction (either – or +).
If the correlation is weak, there is considerable
scatter about a line drawn through the center of the
data.
For the scatter diagram representing a strong
relationship, there is very little scatter about the line.
The following drawing shows the strength and direction
of the coefficient of correlation:
What is meant by Coefficient of Correlation?
Continued…….
46. Ch 7_46
The coefficient of correlation describes not only the
magnitude of correlation but also its direction. Thus, +
0.8 would mean that correlation is positive and the
magnitude of correlation is 0.8.
47. Ch 7_47
The following are the important properties of the co –
efficient of correlation:
The co– efficient of correlation lies between - 1 and +
1. Symbolically, - 1 ≤ r< +1 or │r ≤ 1
The co–efficient of correlation is independent of
change of origin and scale.
The co–efficient of correlation is the geometric mean
of two regression co-efficient
If X and Y are independent variables then co –
efficient of correlation is zero. However, the converse
is not true.
What are the properties of the co– efficient
of correlation?
Continued…….
48. Ch 7_48
Prove that the co –efficient of correlation lies
between - 1 and +1. Symbolically,
11 r Or r ≤
Solution:
22
YYXX
YYXX
r
Continued…..
51. Ch 7_51
What is the formula suggested by Karl
Pearson for measuring the degree of
relationship between two variables?
If the two variables under study are X and Y, the
following formula suggested by Karl Pearson can be
used for measuring the degree of relationship.
22
YYXX
YYXX
r
.
,
variablesYandXofmeans
respectivetheareYandXand
ncorrelatioofefficientcor
Where
Continued……
52. Ch 7_52
The above formula can be written us:
This formula is to be used only where the deviations are
taken from actual means and not from assumed means.
22
. yx
xy
r
YYy
andXXx
Where
,
53. Ch 7_53
Karl Pearson’s co-efficient of correlations
The co-efficient of correlation can also be calculated
from the original set of observations (i.e., without
taking deviations from the mean) by applying the
following formula:
2222
2
2
2
2
YYNXXN
yXXYN
N
Y
Y
N
X
X
N
YX
XY
r
54. Ch 7_54
Karl Pearson’s co-efficient of correlations
The co-efficient of correlation can also be calculated
from the original set of observations (i.e., without
taking deviations from the mean) by applying the
formula:
2222
2
2
2
2
YYNXXN
yXXYN
N
Y
Y
N
X
X
N
YX
XY
r
55. Ch 7_55
Find the correlation co-efficient between the sales and
expenses from the data given below:
Firm 1 2 3 4 5 6 7 8 9 10
Sales (Tk. Lakhs) 50 50 55 60 65 65 65 60 60 50
Expenses (Tk.
Lakhs)
11 13 14 16 16 15 15 14 13 13
Example:
57. Ch 7_57
14
10
140
58
10
580
N
Y
Y
N
X
XHere
Hence, there is a high degree of positive correlation
between the two variables i.e. as the value of sales
goes up, the expenses also go up.
7870
99488
70
7920
70
22360
70
,
22
yx
xy
rncorrelatioofefficientCo
58. Ch 7_58
Example:
Find the correlation by Karl Pearson’s method between
the two kinds of assessment of postgraduate students’
performance (marks out of 100)
Roll No. of
students
1 2 3 4 5 6 7 8 9 10
Internal
Assessment
45 62 67 32 12 38 47 67 42 85
External
Assessment
39 48 65 32 20 35 45 77 30 62
60. Ch 7_60
3.45
10
453
749
10
497
,
N
Y
Y
N
X
XHere
Here there is a high degree of positive correlation between
internal assessment and external assessment i.e. as the marks
of internal assessment go up, the marks of eternal assessment
also go up.
880
153373
92969
2111378139
92969
1287613956
92969
,
22
yx
xy
rncorrelatioofefficientCo
61. Ch 7_61
What are the merits ?
It summarizes in one figure not only the degree of
correlation but also the direction i.e. whether
correlation is positive or negative
It helps one to go for further analysis.
62. Ch 7_62
What are its limitations ?
The chief limitations of Karl Pearson's method are as
follows:
The correlation coefficient always assumes linear
relationship regardless of the fact whether that
assumption is true or not.
Great care must be exercised in interpreting the
value of this co-efficient as very often the coefficient
is misinterpreted.
The value of the co-efficient is unduly affected by the
extreme values.
As compared to other methods of finding correlation,
this method is more time-consuming.
63. Ch 7_63
What is rank correlation co-efficient?
Let us suppose that a group of ‘n’ individuals is
arranged in order of merits or proficiency in
possession of two characteristics A and B. These
ranks in the two characteristics will, in general , be
different.
Example: If we consider the relation between
intelligence and beauty, it is not necessary that a
beautiful individual is intelligent also.
Let (Xi, Yi); i=1, 2, 3………. n be ranks of ith individual in
two characteristics A and B respectively.
64. Ch 7_64
What is rank correlation co-efficient?
Pearson’s co-efficient of correlation refers to the
strength of relationship measured on the rank values
of two series of data..
65. Ch 7_65
Define Spearman’s rank correlation co-efficient
Spearman’s rank correlation coefficient is defined as :
,
6
1
1
6
1 3
2
2
2
NN
D
Or
NN
D
R
where R denotes rank co-efficient of correlation and D
refers to the difference of ranks between paired items in
two series.
The value of this co-efficient also lies between +1 and–1.
When R = +1, there is complete agreement in the order of
ranks and the ranks are in the same direction.
When R= –1, there is complete agreement in the order of
ranks and they are in opposite directions:
66. Ch 7_66
What are the steps involved in computing
rank correlation co-efficient when actual
ranks are not given?
Where actual ranks are given, the steps required for
computing rank correlation are :
Take the difference of the two ranks, ie.e., (R1 - R2 ) and
denote these differences by D.
Square these differences and obtain the total D2.
Apply the formula:
NN
D
R
3
2
6
1
67. Ch 7_67
Example:
Two housewires, Mrs. A and Mrs. B, were asked to express
their preference for different kinds of detergents, gave the
following replies.
Detergent Mrs. A Mrs. B
A 4 4
B 2 1
C 1 2
D 3 3
E 7 8
F 8 7
G 6 5
H 5 6
I 9 9
J 10 10
68. Ch 7_68
To what extent the preferences of these two
ladies go together?
Continued……
69. Ch 7_69
Calculation of Rank correlation co-efficient
Detergent Mrs.A
R1
Mrs. B
R2
(R1-R2 )2
=D1
A 4 4 0
B 2 1 1
C 1 2 1
D 3 3 0
E 7 8 1
F 8 7 1
G 6 5 1
H 5 6 1
I 9 9 0
J 10 10 0
N =10 D2 =6
Solution
70. Ch 7_70
In order to find out how far preferences for different
kind of detergents go together, we will calculate
rank correlation co – efficient.
Continued…..
.964003601
990
36
1
101000
36
1
10
66
1
6
1,
10
3
3
2
NN
D
RefficientConCorrelatioRank
71. Ch 7_71
This shows that the preferences of these two ladies
agree very closely as far as their opinion on detergents
is concerned.
72. Ch 7_72
When Ranks are not given, it will be necessary to assign
the ranks. Ranks can be assigned by taking either the
highest value as 1 or the lowest value as 1.
Example:
The marks obtained by students in two tests are given
below:
Preliminary Test 92 89 87 86 83 77 71 63 53 50
Final Test 86 83 91 77 68 85 52 82 37 57
Continued……..
Calculate the rank correlation coefficient and comment
on this.
75. Ch 7_75
Example:
Seven methods of imparting business education were
ranked by the MBA students of two universities as
follows:
Methods of teaching i ii iii iv v vi vii
Rank by students of
University A
2 1 5 3 4 7 6
Rank by students of
University B
1 3 2 4 7 5 6
Calculate the rank correlation co–efficient and
comment on this
Continued…….
76. Ch 7_76
Solution:
Methods of
teaching
Rank by students
of University A
R1
Rank by students
of University B
R2
(R1 – R2)2
D2
i 2 1 1
ii 1 3 4
iii 5 2 9
iv 3 4 1
v 4 7 9
vi 7 5 4
vii 6 6 0
N = 7 D2= 28
77. Ch 7_77
It shows that there is a moderate degree of positive
correlation between ranks by students of two
universities.
50
501
336
168
1
7343
168
1
286
1
6
1,
77
3
3
2
NN
D
RefficientConCorrelatioRank
78. Ch 7_78
What are the steps involved in computing rank
correlation co-efficient when equal ranks or tie
in ranks occur?
In some cases it may be found necessary to assign
equal rank to two or more individuals or entries. In such
a case, it is customary to give each individual or entry
an average rank Thus if two individuals are ranked equal
at fifth place, they are each given the , that is 5.5
while if three are ranked equal at fifth place, they are
given the rank = 6. In other words, where
two or more individuals are to be ranked equal, the rank
assigned for purposes of calculating coefficient of
correlation is the average of the ranks which these
individuals would have got, had they differed slightly
from each other.
2
65
3
765
79. Ch 7_79
What are the steps involved in computing rank
correlation co-efficient when equal ranks or tie
in ranks occur?
Where equal ranks are assigned to some entries, an
adjustment in the above formula for calculating the rank
coefficient of correlation is made.
The adjustment consists of adding to the value
of D2, where m stands for the number of items whose
ranks are common. If there are more than one such
group of items with common rank, this value is added
as many times as the number of such groups. The
formula can thus be written as:
mm 3
12
1
NN
mmmmD
R
3
2
3
21
3
1
2
.........
12
1
12
1
6
1
80. Ch 7_80
Example:
An examination of eight applicants for a clerical post
was taken by a firm. From the marks obtained by the
applicants in the Accountancy and Statistics papers,
commute rank co-efficient of correlation.
Applicant A B C D E F G H
Marks in
Accountancy
15 20 28 12 40 60 20 80
Marks in Statistics 40 30 50 30 20 10 30 60
81. Ch 7_81
Applicants Marks in
Accountancy
X
Rank
Assigned
R1
Marks in
Statistics
Y
Rank
Assigned
R2
(R1 –R2)2
D2
A 15 2 40 6 16.00
B 20 3.5 30 4 0.25
C 28 5 50 7 4.00
D 12 1 30 4 9.00
E 40 6 20 2 16.00
F 60 7 10 1 36.00
G 20 3.5 30 4 0.25
H 80 8 60 8 0.00
Calculation of Rank correlation Co-efficient
D2 =81.5
82. Ch 7_82
NN
mmmmD
R
3
2
3
21
3
1
2
12
1
12
1
6
1
0
504
846
1
504
2505816
1
3
12
1
2
12
1
5816
1
88
32
3
33
R
The item 20 is repeated 2 times in series X and hence m1 = 2.
In series Y, the item 30 occurs 3 times and hence M2= 3.
Substituting these values in the above formula:
There is no correlation between the marks obtained in the
two subjects.
83. Ch 7_83
What are the merits of the rank method?
Merits:
This method is simpler to understand and easier to
apply compared to the Karl Pearson’s method. The
answers obtained by this method and the Karl
Pearson’s method will be the same provided no
value is repeated, i.e., all the items are different.
Where the data are of a qualitative nature like
honesty, efficiency, intelligence, etc. this method can
be used with great advantage.
Example: The workers of two factories can be ranked
in order of efficiency and the degree of correlation
can be established by applying the method.
84. Ch 7_84
What are the limitations of the rank method?
Limitations:
This method cannot be used for finding out
correlation in a grouped frequency distribution.
Where the number of observations exceed 30, the
calculations become quite tedious and require a lot
of time. Therefore, this method should not be
applied where N is exceeding 30 unless we are
given the ranks and not the actual values of the
variable.
85. Ch 7_85
What are lag and lead in correlation?
The study of lag and lead is of special significance
while studying economic and business series. In the
correlation of time series the investigator may find that
there is a time gap before a cause–and-effect
relationship is established.
Example: The supply of a commodity may increase
today, but it may not have an immediate effect on
prices – it may take a few days or even months for
prices to adjust to the increased supply. The difference
in the period before a cause– and– effect relationship
is established is called ‘ Lag’, While computing
correlation this time gap must be considered;
otherwise, fallacious conclusions may be drawn. The
pairing of items is adjusted according to the time lag.
86. Ch 7_86
Example:
The following are the monthly figures of advertising
expenditure and sales of a firm. It is generally found that
advertising expenditure has its impact on sales
generally after two months. Allowing for this time lag,
calculate co-efficient of correlation between expenditure
on advertisement and sales.
87. Ch 7_87
Month Advertising expenditure Sales(Tk.)
Jan. 50 1.200
Feb. 60 1,500
March 70 1,600
April 90 2,000
May 120 2,200
June 150 2,500
July 140 2,400
Aug. 160 2,600
Sept. 170 2,800
Oct. 190 2,900
Nov. 200 3,100
Dec. 250 3,900
Allow for a time lag of 2 months, i.e., link advertising expenditure of
January with sales for march , and so on.
90. Ch 7_90
References
Quantitative Techniques, by CR Kothari, Vikas publication
Fundamentals of Statistics by SC Guta Publisher Sultan
Chand
Quantitative Techniques in management by N.D. Vohra
Publisher: Tata Mcgraw hill