SlideShare a Scribd company logo
1 of 23
Download to read offline
Heisman Trophy Voting: A Bayesian Analysis

                Daniel Heard
Heisman Trophy Data



  The data set consisted of season statistics from o๏ฌ€ensive ๏ฌnalists
  for the Heisman trophy, including passing yards, passing
  touchdowns, rushing yards, rushing touchdowns, receiving yards,
  receiving touchdowns, and the number of wins the candidates team
  had in a given year. The data also included indicators as to
  whether the candidateโ€™s team was undefeated, what conference the
  candidatesโ€™ teams were members of and in which geographical
  region the candidatesโ€™ schools were located. Finally, the data
  included the number of ballots received and total points the
  candidates earned from voters from the 2000-2009 seasons.
Voting Process



   In a given year, there are 870 media ballots, one fan ballot (based
   on fans voting online for candidates) and each living recipient of
   the Heisman trophy is able to cast a ballot.
   Points are calculated as follows: on each ballot, the voter indicates
   their 1st, 2nd, and 3rd place candidates. Each ๏ฌrst place vote is
   worth 3 points, second place votes are worth 2 points, and third
   place votes are worth 1 point.
   In the event that every ballot is submitted, each candidate can
   receive a maximum of 2778 points. However, it is unusual for more
   than 97% of the ballots to be submitted.
Items of Interest




   Items of interest regarding Heisman trophy voting include:
       Examining voting trends by position, conference, region and
       class year of candidates
       Predicting voting for future candidates
Priors


   Let ฮธ denote the expected number of points a particular Heisman
   candidate will receive in a given year. This is a strictly positive
   quantity. Without having seen the data, one could make the
   assumption that all candidates receive equal total points, with that
   amount being approximately 555 (one-tenth of the maximum
   number of total points from all ballots). So, by this reasoning, I
   gave ฮธ the prior distribution:
   p(ฮธ) โˆผ Gamma(555, 1)
   If we let y denote the number of votes earned by a candidate
   (more precisely, candidate within a particular group) in a given
   year, we can model this as p(y |ฮธ) โˆผ Poisson(ฮธ). Hence, our
   posterior distribution will be p(ฮธ|y) โˆผ Gamma(555 + yi , 1 + n).
Analysis by Conference
   Players were separated into 7 categories by their conferences (the 6
   automatic qualifying BCS conferences and a separate group for
   players for non-AQ conferences).
   The objective was to observe any voting trends by conference.
   In the data set, there were 16 players from the Pac-10, 18 from the
   Big 12, 12 from the Big 10, 6 from the ACC, 10 from the SEC, 10
   from the Big East and 16 from non-automatic qualifying BCS
   conferences.
   We obtain the following distributions:
       Non-BCS: p(ฮธ|Y ) โˆผ Gamma(4364, 17) expected=273.1
       Pac-10: p(ฮธ|Y ) โˆผ Gamma(9576, 17) expected=506.7
       Big 12: p(ฮธ|Y ) โˆผ Gamma(13617, 19) expected=708.3
       Big 10: p(ฮธ|Y ) โˆผ Gamma(6638, 13) expected=514.2
       ACC: p(ฮธ|Y ) โˆผ Gamma(2713, 7) expected=408.4
       SEC: p(ฮธ|Y ) โˆผ Gamma(9962, 11) expected=875.5
       Big East: p(ฮธ|Y ) โˆผ Gamma(4389, 11) expected=411.8
Total Heisman Votes by Conference




               2000
Total Points

               1000
               0
                      0         1            2           3           4            5        6

                                                     Conference



                                    Total Heisman Votes by Conference from MCMC
               1000
Total Points

               600
               200




                      1         2            3           4           5            6        7

                                                     Conference



                          Total Heisman Votes by Conference from MCMC (outliers removed)
               1000
Total Points

               600
               200




                      1         2            3           4           5            6        7

                                                     Conference
After performing analysis with the two outliers removed, the
average points for Pac-10 by over 100 and lowered the average
points for the Big 10 by nearly 150, resulting in the following
averages:
    Non-BCS:273.1
    Pac-10:404.8
    Big 12:708.3
    Big 10:369.9
    ACC:408.4
    SEC:876.7
    Big East:411.8
Analysis by Region
   Heisman trophy voting is broken up into 6 geographic regions: Far
   West, Midwest, Southwest, South, Mid-Atlantic and Northeast.
   Players were separated into the geographic region in which their
   school is located.
   Of interest was to see if there is a di๏ฌ€erence in the average number
   of votes a player in a particular region will receive.
   In the data set, there were 24 players from the Far West, 19 from
   the Midwest, 20 from the Southwest, 14 from the South, 12 from
   the Mid-Atlantic and 1 from the Northeast.
   So, we obtain the following distributions:
       Far West:p(ฮธ|Y ) โˆผ Gamma(11498, 25), expected=463.5
       Midwest: p(ฮธ|Y ) โˆผ Gamma(7648, 20), expected=390.7
       Southwest:p(ฮธ|Y ) โˆผ Gamma(16242, 21), exp[ected=763.6
       South: p(ฮธ|Y ) โˆผ Gamma(10912, 15), expected=716.6
       Mid-Atlantic: p(ฮธ|Y ) โˆผ Gamma(3786, 13), expected=309.9
       Northeast: p(ฮธ|Y ) โˆผ Gamma(618, 2), expected=391.0
Total Heisman Votes by Region




               2000
Total Points

               1000
               0
                      1          2               3             4               5       6

                                                      Region



                                     Total Heisman Votes by Region from MCMC
               700
Total Points

               500
               300




                      1          2               3             4               5       6

                                                      Region



                          Total Heisman Votes by Region from MCMC (outliers removed)
               700
Total Points

               500
               300




                      1          2               3             4               5       6

                                                      Region
Removing outliers lowered the average points for Far West region
by just under 100 and lowered the average points for the Midwest
region by just under 100, resulting in the following averages:
    Far West:385.4
    Midwest:294.7
    Southwest:763.3
    South: 716.8
    Mid-Atlantic:310.2
    Northeast: 390.6
The Mid-Atlantic region is actually the lowest region by points.
Among BCS conferences, it contains most of the schools in the
ACC and the Big East, the lowest point totaling among the 6
automatic-qualifying conferences.
Analysis by Class


   Player were grouped by class (Freshman, Sophomore, Junior,
   Senior)
   Of interest was to see if there is a di๏ฌ€erence in the average number
   of votes players of a particular class will receive.
   In the data set, there was 1 freshman, 16 sophomores, 25 juniors
   and 48 seniors
   So, we obtain the following distributions:
       Freshman:p(ฮธ|Y ) โˆผ Gamma(1552, 2), expected=701.9
       Sophomore: p(ฮธ|Y ) โˆผ Gamma(10608, 17), expected=620.4
       Junior: p(ฮธ|Y ) โˆผ Gamma(14779, 26), expected=568.1
       Senior: p(ฮธ|Y ) โˆผ Gamma(22655, 49), expected=463.8
Total Heisman Votes by Class




              2000
Total Votes

              1000
              0
                     1                    2                    3                     4

                                                   Class



                                 Total Heisman Votes by Class from MCMC
              800
Total Votes

              600
              400




                     1                    2                    3                     4

                                                   Class



                         Total Heisman Votes by Class from MCMC (outliers removed)
              800
Total Votes

              600
              400




                     1                    2                    3                     4

                                                   Class
Analysis by Position


   Player were grouped by o๏ฌ€ensive position (Quarterback, Running
   Back, Wide Receiver).
   Of interest was to see if there is a di๏ฌ€erence in the average number
   of votes di๏ฌ€erent positions receive.
   In the data set, there were 52 quarterbacks, 29 running backs and
   9 wide receivers.
   We get the following distrubitions:
       Quarterback: p(ฮธ|Y ) โˆผ Gamma(32911, 53), expected=619.5
       Running Back: p(ฮธ|Y ) โˆผ Gamma(13775, 30), expected=462.1
       Wide Reciever:p(ฮธ|Y ) โˆผ Gamma(2353, 10), expected=264.1
Total Heisman Votes by Position




               2000
Total Points

               1000
               0
                       1                            2                           3

                                                 Position



                              Total Heisman Votes by Position from MCMC
               600
Total Points

               400
               200




                       1                            2                           3

                                                 Position



                      Total Heisman Votes by Position from MCMC (outliers removed)
               600
Total Points

               400
               200




                       1                            2                           3

                                                 Position
Removing outliers lowered the average points for quarterbacks by
over 30 points and lowered the average points for running backs by
over 60, resulting in the following averages:
    Quarterback: 582.6
    Running Back: 396.8
    Wide Receiver: 264.4
So, with outliers removed, the voting trend by position remains the
same.
Heisman Voting Linear Model

   The 2010 Heisman trophy ๏ฌnalists were recently announced to be:
       LaMichael James, RB Oregon
       Andrew Luck, QB Stanford
       Kellen Moore, QB Boise State
       Cameron Newton, QB Auburn
   I constructed a linear model to predict the number of points earned
   by a Heisman ๏ฌnalist based on certain statistics. While the
   numerical predictions of the model are not precise, the predictions
   of relative number of votes is accurate. The formula for the model
   is as follows:
   h.lmยก-lm(log(Votes) factor(Position) + factor(Conf) +
   sqrt(Wins) + PassTDs * factor(Position) + PassYds + Scrim *
   factor(Position) + Score * factor(Position) + factor(U), data = H)
Linear Model Summary

                                           Residuals vs Fitted                                                           Normal Q-Q
                                               84    2                                                                                               2
                                                                                                                                                84




                                                                                                     2
                                 2




                                                                            Standardized residuals
                                 1




                                                                                                     1
       Residuals

                                 0




                                                                                                     0
                                 -1




                                                                                                     -1
                                 -2




                                                                                                     -2
                                               18
                                                                                                           18
                                 -3




                                       3   4             5    6     7   8                                       -2      -1     0       1         2

                                                    Fitted values                                                     Theoretical Quantiles




                                               Scale-Location                                                    Residuals vs Leverage
                                 1.5




                                                18 2
                                               84                                                                               2
                                                                                                                                                         0.5
                                                                                                     2
        Standardized residuals




                                                                                                                                           56
                                                                            Standardized residuals

                                                                                                     1                                     87
                                 1.0




                                                                                                     0
                                 0.5




                                                                                                     -1
                                                                                                     -2




                                                                                                                                                         0.5
                                                                                                                     Cook's distance
                                 0.0




                                                                                                                                                         1

                                       3   4             5    6     7   8                                 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7
Placing a g-prior on the coe๏ฌƒcients, I performed Monte Carlo
sampling with the prior sample size ฮฝ0 = 1, the prior variance
 2
ฯƒ0 = 2.3171 found by ordinary least squares and the prior values of
the coe๏ฌƒcients obtained from the model matrix of the linear
model. While the model was not precise in the predictions of the
number of votes, it was accurate in the relative vote totals.
Based on the linear model, the predictions for this yearโ€™s Heisman
trophy voting are:
    1st place: Cameron Newton: 3338.2 (above possible point
    total)
    2nd place: LaMichael James: 313.6
    3rd place: Andrew Luck: 101.3
    4th place: Kellen Moore: 77.5
Limitations


   There were a number of limitations to the data and the study. The
   primary limitation was the amount of dependence in the data.
   Because the number of votes and points a particular candidate
   receives in a given year is not independent of the number of votes
   and points other candidates receive in a given year, I was limited in
   the number of quantities I was able to compare.
   Another limitation was in the data available. One frequently
   referenced quantity relating to Heisman trophy voting is the
   percentage of their teamโ€™s total o๏ฌ€ense a particular candidate
   accounted for. This is potentially an important ๏ฌgure, however it
   was not available.
Findings


   From the analysis I performed, I found a great deal of discrepancy
   of points earned based on the categories of region, class, position,
   and conference. There may be some truth to the โ€West Coast
   Biasโ€ theory that West Coast teams receive fewer points from east
   coast voters based on the regional analysis.
   The trend in voting based on class is likely due to the fact that the
   last three Heisman trophy winners have been sophomores.
   It appears that conference and region led to the largest disparity in
   the average number of total points a candidate receives. The
   di๏ฌ€erence in votes by conference is most likely the result of votersโ€™
   perceptions of certain conferences being โ€strongerโ€ than others,
   and thus favoring players from these stronger conferences.

More Related Content

Recently uploaded

CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Saharaganj Lucknow best Female service ๐Ÿฆบ
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Saharaganj Lucknow best Female service  ๐ŸฆบCALL ON โžฅ8923113531 ๐Ÿ”Call Girls Saharaganj Lucknow best Female service  ๐Ÿฆบ
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Saharaganj Lucknow best Female service ๐Ÿฆบ
anilsa9823
ย 
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Telibagh Lucknow best Night Fun service ๐Ÿงฃ
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Telibagh Lucknow best Night Fun service  ๐ŸงฃCALL ON โžฅ8923113531 ๐Ÿ”Call Girls Telibagh Lucknow best Night Fun service  ๐Ÿงฃ
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Telibagh Lucknow best Night Fun service ๐Ÿงฃ
anilsa9823
ย 
๐Ÿ”|97111เผ’99012๐Ÿ” Call Girls In {Delhi} Cr Park โ‚น5.5k Cash Payment With Room De...
๐Ÿ”|97111เผ’99012๐Ÿ” Call Girls In  {Delhi} Cr Park โ‚น5.5k Cash Payment With Room De...๐Ÿ”|97111เผ’99012๐Ÿ” Call Girls In  {Delhi} Cr Park โ‚น5.5k Cash Payment With Room De...
๐Ÿ”|97111เผ’99012๐Ÿ” Call Girls In {Delhi} Cr Park โ‚น5.5k Cash Payment With Room De...
Diya Sharma
ย 
Top Call Girls In Jankipuram ( Lucknow ) ๐Ÿ” 8923113531 ๐Ÿ” Cash Payment
Top Call Girls In Jankipuram ( Lucknow  ) ๐Ÿ” 8923113531 ๐Ÿ”  Cash PaymentTop Call Girls In Jankipuram ( Lucknow  ) ๐Ÿ” 8923113531 ๐Ÿ”  Cash Payment
Top Call Girls In Jankipuram ( Lucknow ) ๐Ÿ” 8923113531 ๐Ÿ” Cash Payment
anilsa9823
ย 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
Health
ย 

Recently uploaded (20)

CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Saharaganj Lucknow best Female service ๐Ÿฆบ
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Saharaganj Lucknow best Female service  ๐ŸฆบCALL ON โžฅ8923113531 ๐Ÿ”Call Girls Saharaganj Lucknow best Female service  ๐Ÿฆบ
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Saharaganj Lucknow best Female service ๐Ÿฆบ
ย 
Slovenia Vs Serbia UEFA Euro 2024 Fixture Guide Every Fixture Detailed.docx
Slovenia Vs Serbia UEFA Euro 2024 Fixture Guide Every Fixture Detailed.docxSlovenia Vs Serbia UEFA Euro 2024 Fixture Guide Every Fixture Detailed.docx
Slovenia Vs Serbia UEFA Euro 2024 Fixture Guide Every Fixture Detailed.docx
ย 
UEFA Euro 2024 Squad Check-in Who is Most Favorite.docx
UEFA Euro 2024 Squad Check-in Who is Most Favorite.docxUEFA Euro 2024 Squad Check-in Who is Most Favorite.docx
UEFA Euro 2024 Squad Check-in Who is Most Favorite.docx
ย 
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Telibagh Lucknow best Night Fun service ๐Ÿงฃ
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Telibagh Lucknow best Night Fun service  ๐ŸงฃCALL ON โžฅ8923113531 ๐Ÿ”Call Girls Telibagh Lucknow best Night Fun service  ๐Ÿงฃ
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Telibagh Lucknow best Night Fun service ๐Ÿงฃ
ย 
Who Is Emmanuel Katto Uganda? His Career, personal life etc.
Who Is Emmanuel Katto Uganda? His Career, personal life etc.Who Is Emmanuel Katto Uganda? His Career, personal life etc.
Who Is Emmanuel Katto Uganda? His Career, personal life etc.
ย 
Atlanta Dream Exec Dan Gadd on Driving Fan Engagement and Growth, Serving the...
Atlanta Dream Exec Dan Gadd on Driving Fan Engagement and Growth, Serving the...Atlanta Dream Exec Dan Gadd on Driving Fan Engagement and Growth, Serving the...
Atlanta Dream Exec Dan Gadd on Driving Fan Engagement and Growth, Serving the...
ย 
๐Ÿ”|97111เผ’99012๐Ÿ” Call Girls In {Delhi} Cr Park โ‚น5.5k Cash Payment With Room De...
๐Ÿ”|97111เผ’99012๐Ÿ” Call Girls In  {Delhi} Cr Park โ‚น5.5k Cash Payment With Room De...๐Ÿ”|97111เผ’99012๐Ÿ” Call Girls In  {Delhi} Cr Park โ‚น5.5k Cash Payment With Room De...
๐Ÿ”|97111เผ’99012๐Ÿ” Call Girls In {Delhi} Cr Park โ‚น5.5k Cash Payment With Room De...
ย 
08448380779 Call Girls In IIT Women Seeking Men
08448380779 Call Girls In IIT Women Seeking Men08448380779 Call Girls In IIT Women Seeking Men
08448380779 Call Girls In IIT Women Seeking Men
ย 
08448380779 Call Girls In Lajpat Nagar Women Seeking Men
08448380779 Call Girls In Lajpat Nagar Women Seeking Men08448380779 Call Girls In Lajpat Nagar Women Seeking Men
08448380779 Call Girls In Lajpat Nagar Women Seeking Men
ย 
ALL NFL NETWORK CONTACTS- April 29, 2024
ALL NFL NETWORK CONTACTS- April 29, 2024ALL NFL NETWORK CONTACTS- April 29, 2024
ALL NFL NETWORK CONTACTS- April 29, 2024
ย 
Hire ๐Ÿ’• 8617697112 Kasauli Call Girls Service Call Girls Agency
Hire ๐Ÿ’• 8617697112 Kasauli Call Girls Service Call Girls AgencyHire ๐Ÿ’• 8617697112 Kasauli Call Girls Service Call Girls Agency
Hire ๐Ÿ’• 8617697112 Kasauli Call Girls Service Call Girls Agency
ย 
TAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdf
TAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdfTAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdf
TAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdf
ย 
Call Girls Service Noida Extension @9999965857 Delhi ๐Ÿซฆ No Advance VVIP ๐ŸŽ SER...
Call Girls Service Noida Extension @9999965857 Delhi ๐Ÿซฆ No Advance  VVIP ๐ŸŽ SER...Call Girls Service Noida Extension @9999965857 Delhi ๐Ÿซฆ No Advance  VVIP ๐ŸŽ SER...
Call Girls Service Noida Extension @9999965857 Delhi ๐Ÿซฆ No Advance VVIP ๐ŸŽ SER...
ย 
Top Call Girls In Jankipuram ( Lucknow ) ๐Ÿ” 8923113531 ๐Ÿ” Cash Payment
Top Call Girls In Jankipuram ( Lucknow  ) ๐Ÿ” 8923113531 ๐Ÿ”  Cash PaymentTop Call Girls In Jankipuram ( Lucknow  ) ๐Ÿ” 8923113531 ๐Ÿ”  Cash Payment
Top Call Girls In Jankipuram ( Lucknow ) ๐Ÿ” 8923113531 ๐Ÿ” Cash Payment
ย 
Croatia vs Italy Euro Cup 2024 Three pitfalls for Spallettiโ€™s Italy in Group ...
Croatia vs Italy Euro Cup 2024 Three pitfalls for Spallettiโ€™s Italy in Group ...Croatia vs Italy Euro Cup 2024 Three pitfalls for Spallettiโ€™s Italy in Group ...
Croatia vs Italy Euro Cup 2024 Three pitfalls for Spallettiโ€™s Italy in Group ...
ย 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
ย 
Technical Data | Sig Sauer Easy6 BDX 1-6x24 | Optics Trade
Technical Data | Sig Sauer Easy6 BDX 1-6x24 | Optics TradeTechnical Data | Sig Sauer Easy6 BDX 1-6x24 | Optics Trade
Technical Data | Sig Sauer Easy6 BDX 1-6x24 | Optics Trade
ย 
Spain Vs Italy 20 players confirmed for Spain's Euro 2024 squad, and three po...
Spain Vs Italy 20 players confirmed for Spain's Euro 2024 squad, and three po...Spain Vs Italy 20 players confirmed for Spain's Euro 2024 squad, and three po...
Spain Vs Italy 20 players confirmed for Spain's Euro 2024 squad, and three po...
ย 
08448380779 Call Girls In Karol Bagh Women Seeking Men
08448380779 Call Girls In Karol Bagh Women Seeking Men08448380779 Call Girls In Karol Bagh Women Seeking Men
08448380779 Call Girls In Karol Bagh Women Seeking Men
ย 
Spain Vs Albania- Spain at risk of being thrown out of Euro 2024 with Tournam...
Spain Vs Albania- Spain at risk of being thrown out of Euro 2024 with Tournam...Spain Vs Albania- Spain at risk of being thrown out of Euro 2024 with Tournam...
Spain Vs Albania- Spain at risk of being thrown out of Euro 2024 with Tournam...
ย 

Featured

Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
ย 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
Alireza Esmikhani
ย 
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them wellGood Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Saba Software
ย 

Featured (20)

Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
ย 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
ย 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
ย 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
ย 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
ย 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
ย 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
ย 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
ย 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
ย 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
ย 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
ย 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
ย 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
ย 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
ย 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
ย 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
ย 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
ย 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
ย 
Barbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy Presentation
ย 
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them wellGood Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
ย 

Heisman Trophy Analysis

  • 1. Heisman Trophy Voting: A Bayesian Analysis Daniel Heard
  • 2. Heisman Trophy Data The data set consisted of season statistics from o๏ฌ€ensive ๏ฌnalists for the Heisman trophy, including passing yards, passing touchdowns, rushing yards, rushing touchdowns, receiving yards, receiving touchdowns, and the number of wins the candidates team had in a given year. The data also included indicators as to whether the candidateโ€™s team was undefeated, what conference the candidatesโ€™ teams were members of and in which geographical region the candidatesโ€™ schools were located. Finally, the data included the number of ballots received and total points the candidates earned from voters from the 2000-2009 seasons.
  • 3. Voting Process In a given year, there are 870 media ballots, one fan ballot (based on fans voting online for candidates) and each living recipient of the Heisman trophy is able to cast a ballot. Points are calculated as follows: on each ballot, the voter indicates their 1st, 2nd, and 3rd place candidates. Each ๏ฌrst place vote is worth 3 points, second place votes are worth 2 points, and third place votes are worth 1 point. In the event that every ballot is submitted, each candidate can receive a maximum of 2778 points. However, it is unusual for more than 97% of the ballots to be submitted.
  • 4. Items of Interest Items of interest regarding Heisman trophy voting include: Examining voting trends by position, conference, region and class year of candidates Predicting voting for future candidates
  • 5. Priors Let ฮธ denote the expected number of points a particular Heisman candidate will receive in a given year. This is a strictly positive quantity. Without having seen the data, one could make the assumption that all candidates receive equal total points, with that amount being approximately 555 (one-tenth of the maximum number of total points from all ballots). So, by this reasoning, I gave ฮธ the prior distribution: p(ฮธ) โˆผ Gamma(555, 1) If we let y denote the number of votes earned by a candidate (more precisely, candidate within a particular group) in a given year, we can model this as p(y |ฮธ) โˆผ Poisson(ฮธ). Hence, our posterior distribution will be p(ฮธ|y) โˆผ Gamma(555 + yi , 1 + n).
  • 6. Analysis by Conference Players were separated into 7 categories by their conferences (the 6 automatic qualifying BCS conferences and a separate group for players for non-AQ conferences). The objective was to observe any voting trends by conference. In the data set, there were 16 players from the Pac-10, 18 from the Big 12, 12 from the Big 10, 6 from the ACC, 10 from the SEC, 10 from the Big East and 16 from non-automatic qualifying BCS conferences. We obtain the following distributions: Non-BCS: p(ฮธ|Y ) โˆผ Gamma(4364, 17) expected=273.1 Pac-10: p(ฮธ|Y ) โˆผ Gamma(9576, 17) expected=506.7 Big 12: p(ฮธ|Y ) โˆผ Gamma(13617, 19) expected=708.3 Big 10: p(ฮธ|Y ) โˆผ Gamma(6638, 13) expected=514.2 ACC: p(ฮธ|Y ) โˆผ Gamma(2713, 7) expected=408.4 SEC: p(ฮธ|Y ) โˆผ Gamma(9962, 11) expected=875.5 Big East: p(ฮธ|Y ) โˆผ Gamma(4389, 11) expected=411.8
  • 7. Total Heisman Votes by Conference 2000 Total Points 1000 0 0 1 2 3 4 5 6 Conference Total Heisman Votes by Conference from MCMC 1000 Total Points 600 200 1 2 3 4 5 6 7 Conference Total Heisman Votes by Conference from MCMC (outliers removed) 1000 Total Points 600 200 1 2 3 4 5 6 7 Conference
  • 8. After performing analysis with the two outliers removed, the average points for Pac-10 by over 100 and lowered the average points for the Big 10 by nearly 150, resulting in the following averages: Non-BCS:273.1 Pac-10:404.8 Big 12:708.3 Big 10:369.9 ACC:408.4 SEC:876.7 Big East:411.8
  • 9. Analysis by Region Heisman trophy voting is broken up into 6 geographic regions: Far West, Midwest, Southwest, South, Mid-Atlantic and Northeast. Players were separated into the geographic region in which their school is located. Of interest was to see if there is a di๏ฌ€erence in the average number of votes a player in a particular region will receive. In the data set, there were 24 players from the Far West, 19 from the Midwest, 20 from the Southwest, 14 from the South, 12 from the Mid-Atlantic and 1 from the Northeast. So, we obtain the following distributions: Far West:p(ฮธ|Y ) โˆผ Gamma(11498, 25), expected=463.5 Midwest: p(ฮธ|Y ) โˆผ Gamma(7648, 20), expected=390.7 Southwest:p(ฮธ|Y ) โˆผ Gamma(16242, 21), exp[ected=763.6 South: p(ฮธ|Y ) โˆผ Gamma(10912, 15), expected=716.6 Mid-Atlantic: p(ฮธ|Y ) โˆผ Gamma(3786, 13), expected=309.9 Northeast: p(ฮธ|Y ) โˆผ Gamma(618, 2), expected=391.0
  • 10. Total Heisman Votes by Region 2000 Total Points 1000 0 1 2 3 4 5 6 Region Total Heisman Votes by Region from MCMC 700 Total Points 500 300 1 2 3 4 5 6 Region Total Heisman Votes by Region from MCMC (outliers removed) 700 Total Points 500 300 1 2 3 4 5 6 Region
  • 11. Removing outliers lowered the average points for Far West region by just under 100 and lowered the average points for the Midwest region by just under 100, resulting in the following averages: Far West:385.4 Midwest:294.7 Southwest:763.3 South: 716.8 Mid-Atlantic:310.2 Northeast: 390.6
  • 12. The Mid-Atlantic region is actually the lowest region by points. Among BCS conferences, it contains most of the schools in the ACC and the Big East, the lowest point totaling among the 6 automatic-qualifying conferences.
  • 13. Analysis by Class Player were grouped by class (Freshman, Sophomore, Junior, Senior) Of interest was to see if there is a di๏ฌ€erence in the average number of votes players of a particular class will receive. In the data set, there was 1 freshman, 16 sophomores, 25 juniors and 48 seniors So, we obtain the following distributions: Freshman:p(ฮธ|Y ) โˆผ Gamma(1552, 2), expected=701.9 Sophomore: p(ฮธ|Y ) โˆผ Gamma(10608, 17), expected=620.4 Junior: p(ฮธ|Y ) โˆผ Gamma(14779, 26), expected=568.1 Senior: p(ฮธ|Y ) โˆผ Gamma(22655, 49), expected=463.8
  • 14. Total Heisman Votes by Class 2000 Total Votes 1000 0 1 2 3 4 Class Total Heisman Votes by Class from MCMC 800 Total Votes 600 400 1 2 3 4 Class Total Heisman Votes by Class from MCMC (outliers removed) 800 Total Votes 600 400 1 2 3 4 Class
  • 15. Analysis by Position Player were grouped by o๏ฌ€ensive position (Quarterback, Running Back, Wide Receiver). Of interest was to see if there is a di๏ฌ€erence in the average number of votes di๏ฌ€erent positions receive. In the data set, there were 52 quarterbacks, 29 running backs and 9 wide receivers. We get the following distrubitions: Quarterback: p(ฮธ|Y ) โˆผ Gamma(32911, 53), expected=619.5 Running Back: p(ฮธ|Y ) โˆผ Gamma(13775, 30), expected=462.1 Wide Reciever:p(ฮธ|Y ) โˆผ Gamma(2353, 10), expected=264.1
  • 16. Total Heisman Votes by Position 2000 Total Points 1000 0 1 2 3 Position Total Heisman Votes by Position from MCMC 600 Total Points 400 200 1 2 3 Position Total Heisman Votes by Position from MCMC (outliers removed) 600 Total Points 400 200 1 2 3 Position
  • 17. Removing outliers lowered the average points for quarterbacks by over 30 points and lowered the average points for running backs by over 60, resulting in the following averages: Quarterback: 582.6 Running Back: 396.8 Wide Receiver: 264.4 So, with outliers removed, the voting trend by position remains the same.
  • 18. Heisman Voting Linear Model The 2010 Heisman trophy ๏ฌnalists were recently announced to be: LaMichael James, RB Oregon Andrew Luck, QB Stanford Kellen Moore, QB Boise State Cameron Newton, QB Auburn I constructed a linear model to predict the number of points earned by a Heisman ๏ฌnalist based on certain statistics. While the numerical predictions of the model are not precise, the predictions of relative number of votes is accurate. The formula for the model is as follows: h.lmยก-lm(log(Votes) factor(Position) + factor(Conf) + sqrt(Wins) + PassTDs * factor(Position) + PassYds + Scrim * factor(Position) + Score * factor(Position) + factor(U), data = H)
  • 19. Linear Model Summary Residuals vs Fitted Normal Q-Q 84 2 2 84 2 2 Standardized residuals 1 1 Residuals 0 0 -1 -1 -2 -2 18 18 -3 3 4 5 6 7 8 -2 -1 0 1 2 Fitted values Theoretical Quantiles Scale-Location Residuals vs Leverage 1.5 18 2 84 2 0.5 2 Standardized residuals 56 Standardized residuals 1 87 1.0 0 0.5 -1 -2 0.5 Cook's distance 0.0 1 3 4 5 6 7 8 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7
  • 20. Placing a g-prior on the coe๏ฌƒcients, I performed Monte Carlo sampling with the prior sample size ฮฝ0 = 1, the prior variance 2 ฯƒ0 = 2.3171 found by ordinary least squares and the prior values of the coe๏ฌƒcients obtained from the model matrix of the linear model. While the model was not precise in the predictions of the number of votes, it was accurate in the relative vote totals.
  • 21. Based on the linear model, the predictions for this yearโ€™s Heisman trophy voting are: 1st place: Cameron Newton: 3338.2 (above possible point total) 2nd place: LaMichael James: 313.6 3rd place: Andrew Luck: 101.3 4th place: Kellen Moore: 77.5
  • 22. Limitations There were a number of limitations to the data and the study. The primary limitation was the amount of dependence in the data. Because the number of votes and points a particular candidate receives in a given year is not independent of the number of votes and points other candidates receive in a given year, I was limited in the number of quantities I was able to compare. Another limitation was in the data available. One frequently referenced quantity relating to Heisman trophy voting is the percentage of their teamโ€™s total o๏ฌ€ense a particular candidate accounted for. This is potentially an important ๏ฌgure, however it was not available.
  • 23. Findings From the analysis I performed, I found a great deal of discrepancy of points earned based on the categories of region, class, position, and conference. There may be some truth to the โ€West Coast Biasโ€ theory that West Coast teams receive fewer points from east coast voters based on the regional analysis. The trend in voting based on class is likely due to the fact that the last three Heisman trophy winners have been sophomores. It appears that conference and region led to the largest disparity in the average number of total points a candidate receives. The di๏ฌ€erence in votes by conference is most likely the result of votersโ€™ perceptions of certain conferences being โ€strongerโ€ than others, and thus favoring players from these stronger conferences.