Stats 3000 Week 2 - Winter 2011

Uploaded on


  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
    Be the first to like this
No Downloads


Total Views
On Slideshare
From Embeds
Number of Embeds



Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

    No notes for slide


  • 1.
  • 2. Descriptive Methods in
    Data comes in pairs of quantitative variables. Given such paired data (bivariate data), we want to determine whether there is a relationship between the two quantitative variables and, if so, identify what the relationship is.
    Regression analysis allows us to identify an equation that best fits the data, and to predict values of one variable based on another variable.
  • 3.
  • 4. What is Linear Regression?
    • The straight-line linear regression model is a means of relating one quantitative variable to another quantitative variable
    A way of predicting the value of one variable from another.
    It is a hypothetical model of the relationship between two variables.
    The model used is a linear one.
    Therefore, we describe the relationship using the equation of a straight line.
  • 5. The goal is to be able to predict new values of Y based on values of X
  • 6.
  • 7.
  • 8.
  • 9. Regression line: Line whose equation is used for prediction
    Line that best describes the relationship between y, the dependent variable and x, the independent variable.
    Describing a Straight Line
    Linear equation: When the relationship between X and Y is linear
    Linear equation: Y = bX + a
  • 10. Linear regression builds on the equation for a straight line because the relationship between the two variables is assumed to be linear
    A straight line should yield the best “fit” of the data points in a scatterplot (a linear model)
  • 11.
  • 12.
  • 13.
  • 14.
  • 15. Intercepts and Slopes
  • 16.
  • 17. Residuals - the difference between a score and its predicted value
  • 18.
  • 19.
  • 20.
  • 21.
  • 22.
  • 23.
  • 24. A researcher suspects that there is a relationship between the number of promises
    a political candidate makes and the number of promises that are fulfilled once the candidate is elected. He examines the track record of 10 politicians. Use spss to construct a regression equation that predicts the number of promises made and promises kept by politicians.
  • 25.
  • 26.
  • 27.
  • 28.
  • 29.
  • 30. slope
    The information in the column “unstandardized coefficients” column B embodies the regression equation: (constant) is the intercept
  • 31.
    • Standard error of estimate: a measure of the error in prediction used as the basis for a measure of the accuracy of prediction
  • 32.
  • 33.
  • 34.
  • 35. Scatterplot
    To see if scores may be related construct a graph of the scores, called a scatterplot
    The variable labeled X is plotted on the horizontal axis (the abscissa)
    The Y variable is plotted on the vertical axis (the ordinate)
    The score of a subject on each of the two measures is indicated by one point on the scatterplot
  • 36.
    • Conclusions drawn from scatterplots are subjective. A more precise and objective method for detecting straight-line patterns is the linear correlation coefficient.
    • 37. The linear correlation coefficient r(often simply called the correlation coefficient) measures the strength of the linear relationship between the paired x and y values in a sample.
  • Descriptive Methods in
  • 38.
  • 39.
    • The value of r is not affected by the choice of x or y. Interchange all x and y values and the value of r will not change.
    • 40. r measures the strength of a linear relationship. It is not designed to measure the strength of a relationship that is not linear.
  • 41.
  • 42.
  • 43.
  • 44.
  • 45.
  • 46.
  • 47.
  • 48.
  • 49. Direction of relationship
    A correlation coefficient indicates the direction of the relationship by the positive or negative sign of the coefficient
    A positive r indicates
    A positive (direct)relationship between variables X and Y
    As the scores on variable X increase, the scores on variable Y tend to increase
    A negative r indicates
    A negative (inverse)relationship between variables X and Y
    As the scores on variable X increase, the scores on variable Y tend to decrease
  • 50.
  • 51.
  • 52.
  • 53.
  • 54.
  • 55.
  • 56.
  • 57.
  • 58.
  • 59.
  • 60.
  • 61.
  • 62.
  • 63. SPSS assignment # 2 due next week