How to Build
                   Regression Models in
                   Excel

                                  Ashutosh Nandeshwar
                                          @n_ashutosh
   Advancement Services track sponsored by
                  Blackbaud
Follow the Better Together Conference on Twitter #CASEV+VI
Why?




@n_ashutosh
What is it?




  @n_ashutosh
http://en.wikipedia.org/wiki/File:Anscombe%27s_quartet_3.svg


@n_ashutosh
When?




@n_ashutosh
When not to?




   @n_ashutosh
@n_ashutosh
@n_ashutosh
How?




@n_ashutosh
Challenges




  @n_ashutosh
http://en.wikipedia.org/wiki/File:Anscombe%27s_quartet_3.svg

@n_ashutosh
http://xkcd.com/539/




@n_ashutosh
What else?




  @n_ashutosh
Today’s session
                                                  evaluations
                                           will be e-mailed to you.



                         Session handouts are available at
    www.casefive.org/conference/presentations/
                       Username: CASEV-VI | Password: Chicago 2012



                                           @n_ashutosh
Follow the Better Together Conference on Twitter by using the #CASEV+VI hashtag when you tweet!

CASE V & VI Linear regression In Excel

Editor's Notes

  • #3 . For segmentation, quick scoring, modeling, giving, easier options, forecasting, predicting, any analyst worth his money should ask why and so what?
  • #4 We won’t go in all details here, but we can talk about a few things.
  • #5 At its essence, linear regression is about minimizing the distance between the predicted value of an observation and the actual value of an observation, the technique is called least-squares
  • #6 If you want to use Excel, then all the variables should be numeric or coded as numeric. You have enough data, but not a whole lot of data. You are just getting started in modeling.
  • #7 If your data is quite skewed or the parameters have non-linearity , or you suspect that variables are quite similar.
  • #10 This is the exciting or not-so exciting part, where you actually learn how to do this.
  • #11 Missing data, lots of variables, low accuracy, R2 < 0.5, unable to take advantage of various newer methods, variable selection is manual.
  • #14 Weka, decision tree, attribute selection, R