This paper examines the relationship between MLB players' salaries and various performance statistics from the 2013 season. The authors regress salary data against age, games played, home runs, slugging percentage, hits, at bats, and on-base percentage for 447 players after removing pitchers. Their model explains 51.39% of salary variation, suggesting these statistics significantly influence pay. Home runs, hits, at bats, and on-base percentage positively impact salary, while slugging percentage has a negative effect. The paper concludes player salaries can be reasonably predicted using performance data.
“It’s tough to make predictions, especially about the future.” So said Yogi Berra in an era gone by. Yet, every year, during the week of the Epiphany, we make predictions about the year ahead, write them down, and lock them up in our safety deposit box to be read the following year. This year was no exception. Accordingly, last week we opened the lockbox and placed this year’s predictions in it and retrieved last year’s list.
1. After watching the attached video by Dan Pink on .docxjeremylockett77
1. After watching the attached video by Dan Pink on the inherent weaknesses of extrinsic motivators, present two salient applications to your role as a leader in athletics. Dan Pink: The puzzle of motivation Ted.com
2. One of the very real truisms about leadership is that it can be lonely at the top and quite stressful. Please describe two specific ways you as a leader manage stress in your life.
BIBLIOGRAPHY
Annala, C. N., & Winfree, J. (2011). Salary distribution and team performance in Major League Baseball. Sport Management Review, 14(2), 167-175.
Breunig, R., Garrett-Rumba, B., Jardin, M., & Rocaboy, Y. (2014). Wage dispersion and team performance: a theoretical model and evidence from baseball. Applied Economics, 46(3), 271-281.
Devi R. (2016). Data.world. Baseball Stats. Retrieved September 25, 2019 from https://data.world/deviramanan2016/baseball-stats
Lee, S., & Harris, J. (2012). Managing excellence in USA Major League Soccer: an analysis of the relationship between player performance and salary. Managing Leisure, 17(2-3), 106- 123.
Scully, G. W. (1974). Pay and performance in major league baseball. The American Economic Review, 64(6), 915-930.
Sommers, P. M., & Quinton, N. (1982). Pay and performance in major league baseball: The case of the first family of free agents. The Journal of Human Resources, 17(3), 426-436.
Tao, Y. L., Chuang, H. L., & Lin, E. S. (2016). Compensation and performance in Major League Baseball: Evidence from salary dispersion and team performance. International Review of Economics & Finance, 43, 151-159.
Wiseman, F., & Chatterjee, S. (2003). Team payroll and team performance in major league baseball: 1985–2002. Economics Bulletin, 1(2), 1-10.
Running Head: PAY AND PERFORMANCE IN MAJOR LEAGUE BASEBALL 1
PAY AND PERFORMANCE IN MAJOR LEAGUE BASEBALL 5
PAY AND PERFORMANCE IN MAJOR LEAGUE BASEBALL
RODERICK HOOKS
9-16-2019
Purpose statement and model
This study will try to examine whether there is a relationship between the payment and performance of a team. Performance is the dependent variable measured by wins of a team in the 2010 Major League Baseball (Tao Y. et al, 2016). This is the suitable dependent variable since the wins for a team can be influenced by many factors and the final results are the main target of every team (Scully G., 1974). The primary independent variable is payroll which the totals pay of the team (Wiseman F. & Chatterjee S., 2003). This is suitable in determining whether there is relationship between pay and performance due to the fact that a higher anticipates higher performance since many challenges for the team can be solved by financial stability (Sommers P. & Quinton N., 1982).
The general form of the model will be;
Wins = b0 + b1Payroll + b2Attendance + Error (
Definitions of variables
The variables used in this study are wins, payroll and attendance. Win is the dependent variable measuring the number of games the team wins. I ...
“It’s tough to make predictions, especially about the future.” So said Yogi Berra in an era gone by. Yet, every year, during the week of the Epiphany, we make predictions about the year ahead, write them down, and lock them up in our safety deposit box to be read the following year. This year was no exception. Accordingly, last week we opened the lockbox and placed this year’s predictions in it and retrieved last year’s list.
1. After watching the attached video by Dan Pink on .docxjeremylockett77
1. After watching the attached video by Dan Pink on the inherent weaknesses of extrinsic motivators, present two salient applications to your role as a leader in athletics. Dan Pink: The puzzle of motivation Ted.com
2. One of the very real truisms about leadership is that it can be lonely at the top and quite stressful. Please describe two specific ways you as a leader manage stress in your life.
BIBLIOGRAPHY
Annala, C. N., & Winfree, J. (2011). Salary distribution and team performance in Major League Baseball. Sport Management Review, 14(2), 167-175.
Breunig, R., Garrett-Rumba, B., Jardin, M., & Rocaboy, Y. (2014). Wage dispersion and team performance: a theoretical model and evidence from baseball. Applied Economics, 46(3), 271-281.
Devi R. (2016). Data.world. Baseball Stats. Retrieved September 25, 2019 from https://data.world/deviramanan2016/baseball-stats
Lee, S., & Harris, J. (2012). Managing excellence in USA Major League Soccer: an analysis of the relationship between player performance and salary. Managing Leisure, 17(2-3), 106- 123.
Scully, G. W. (1974). Pay and performance in major league baseball. The American Economic Review, 64(6), 915-930.
Sommers, P. M., & Quinton, N. (1982). Pay and performance in major league baseball: The case of the first family of free agents. The Journal of Human Resources, 17(3), 426-436.
Tao, Y. L., Chuang, H. L., & Lin, E. S. (2016). Compensation and performance in Major League Baseball: Evidence from salary dispersion and team performance. International Review of Economics & Finance, 43, 151-159.
Wiseman, F., & Chatterjee, S. (2003). Team payroll and team performance in major league baseball: 1985–2002. Economics Bulletin, 1(2), 1-10.
Running Head: PAY AND PERFORMANCE IN MAJOR LEAGUE BASEBALL 1
PAY AND PERFORMANCE IN MAJOR LEAGUE BASEBALL 5
PAY AND PERFORMANCE IN MAJOR LEAGUE BASEBALL
RODERICK HOOKS
9-16-2019
Purpose statement and model
This study will try to examine whether there is a relationship between the payment and performance of a team. Performance is the dependent variable measured by wins of a team in the 2010 Major League Baseball (Tao Y. et al, 2016). This is the suitable dependent variable since the wins for a team can be influenced by many factors and the final results are the main target of every team (Scully G., 1974). The primary independent variable is payroll which the totals pay of the team (Wiseman F. & Chatterjee S., 2003). This is suitable in determining whether there is relationship between pay and performance due to the fact that a higher anticipates higher performance since many challenges for the team can be solved by financial stability (Sommers P. & Quinton N., 1982).
The general form of the model will be;
Wins = b0 + b1Payroll + b2Attendance + Error (
Definitions of variables
The variables used in this study are wins, payroll and attendance. Win is the dependent variable measuring the number of games the team wins. I ...
Explore the effect of offensive and defensive team productivity in the NBA on wins, 10+ years of NBA regular season data (2002 – 2013).
Key words: data normalization; directional hypotheses; feauture engineering; ols regression; web scraping
Explore the effect of offensive and defensive team productivity in the NBA on wins, 10+ years of NBA regular season data (2002 – 2013).
Key words: data normalization; directional hypotheses; feauture engineering; ols regression; web scraping
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
As Europe's leading economic powerhouse and the fourth-largest hashtag#economy globally, Germany stands at the forefront of innovation and industrial might. Renowned for its precision engineering and high-tech sectors, Germany's economic structure is heavily supported by a robust service industry, accounting for approximately 68% of its GDP. This economic clout and strategic geopolitical stance position Germany as a focal point in the global cyber threat landscape.
In the face of escalating global tensions, particularly those emanating from geopolitical disputes with nations like hashtag#Russia and hashtag#China, hashtag#Germany has witnessed a significant uptick in targeted cyber operations. Our analysis indicates a marked increase in hashtag#cyberattack sophistication aimed at critical infrastructure and key industrial sectors. These attacks range from ransomware campaigns to hashtag#AdvancedPersistentThreats (hashtag#APTs), threatening national security and business integrity.
🔑 Key findings include:
🔍 Increased frequency and complexity of cyber threats.
🔍 Escalation of state-sponsored and criminally motivated cyber operations.
🔍 Active dark web exchanges of malicious tools and tactics.
Our comprehensive report delves into these challenges, using a blend of open-source and proprietary data collection techniques. By monitoring activity on critical networks and analyzing attack patterns, our team provides a detailed overview of the threats facing German entities.
This report aims to equip stakeholders across public and private sectors with the knowledge to enhance their defensive strategies, reduce exposure to cyber risks, and reinforce Germany's resilience against cyber threats.
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
1. DREXEL UNIVERSITY | ECON 350
Predicting Salary for MLB
Players
An Empirical project
Patel, Vraj & Greene, Robert
6/4/2015
2. ABSTRACT
The focal point of this paper is an attempt to examine the relationship between the percentage-
based contractual salaries (log(salary)) [hereto referred throughout as “LSalary”] of qualifying 2013 Major
League Baseball players and the following statistics: age, age squared, games, home-runs, slugs, hits, at bats,
and on base percentage. As there are many factors that are contributory towards LSalary, those players who
are designated as pitchers have been omitted from the data set as they have dissimilar qualities that obscure
the intended data set.
With this restricted data set, the sample consists of four hundred forty seven observations across
thirty teams within the 2013 season. Utilizing the aforementioned restricted data collection, we have
regressed LSalary, age, agesquared, games, home-runs, slugs, hits, at bats, on base percentage, strikeouts,
times caught stolen, stolen bases and runs. However, after controlling our variables, our conclusion is drawn
from the empirical results suggesting that age, age squared, games, home-runs, slugs, hits, at bats, and on
base percentageare arguably significant drivers in determining a player’s LSalary at an adjusted R2 value of
0.5139, or 51.39% explanatory level.
3. Introduction
Not prone to recent years, the seven and eight-figure salary that many professional baseball players
receive has been criticized as gross overpayment. Some critique that, because few working-class individuals
can imagine making a one-million dollar plus annual salary, the once “working-class baseball game” has lost
touch with its origin1. This begs the question as to whether or not the players of the Major League Baseball
profession are overpaid; but perhaps through their certain skillset, these players have earned these
outrageous salaries. As such, the purpose behind this research is to ascertain whether or not signed players
of Major League Baseball teams aregiven their contractual salary subsequent to their performance, or if
there is an outside factor that acts as the primary driver in this determination. However, in order to present
solutions in a meaningful manner, a more thorough understanding and discussion must be held in
determining what factors, or rather performance-based statistics, should be closely looked at.
The scope of this study resides in observing those players which have a full-detailed list of playing
statistics attributed to them. Previous conducted studies, which will be discussed further within the literature
review sub-section of this report, have attempted to illustrate meaningfulness via acknowledging every
available playing statistic variable; however, approaching the project as such will obstruct the ultimate
outcome of the data, as a percentage of players are expected to excel in (largely) batting/scoring, while
others are paid primarily for the ability to pitch. Be it that we arelooking for the main driving force behind
MLB player salaries, and that pitchers make a minute percentage of the entire league, we have omitted any
player designated as a pitcher from our sample. Additionally, although perhaps a factorial indicator in salary,
as we areunable to quantify such attributes as emotional drawl, optimism, or reactionary self-awareness,
these too will be omitted from our calculations.
1 See Phillip VanFossen (2009) for an insightful argument and justification of MLB player salary on howviewership
is largely responsiblefor salary contribution.
4. Literature Review
Unlike any comparative methodology, Major League Baseball is the only one the big four (4) North
American Sports teams that suppress any form of salary limitation. “Salary caps areemployed with
professional team sports leagues all over the world…. Conventional wisdom suggests that they are a collusive
effort of club owners to control labor costs,” (Dietl, 13). However, while there is no binding restriction on
commending a certain player with an egregious salary, there is an incorporated surcharge when the
aggregatedpayroll of a team exceeds that which is previously established by the league. This tax
restriction acts as an incentive for team management not to sign a largepercentage of the A-List players, ergo
rescinding the competitive balance needed to maintain viewership.
Upon reviewing previously published articles and scholarly journals pertaining to this subject matter,
many of the topics had predicted economical short-comings or comparable salaries of separate industry
entities as the focal point of their research. That is not to say that the research went without merit; of the
limited relevant documentation found, two had shared a similar goal in determining what factors were
detrimental in predicting a player’s salary: Meltzer and VanFossen.
Meltzer had investigated various means to measure player’s performance which lead toward
determining not only salary, but contract length as well. Explicitly, Meltzer had conducted his experimental
research utilizing data from a 2002 study, in a two-stage least squares examination. Utilizing this method
enabled Meltzer to estimate both salary and contract length as a function of the other. Meltzer’sconcluding
results illustrated that there are fundamentally two distinct areas of deviation for contract length and
averagesalary: “[The] first comes from young improving players who are likely to get long-term contracts at
low annual salaries. [The] second comes from players with chronic injuries, whose salary is not affected by
their injuries but who will tend to get shorter contracts that they otherwise would,” (Meltzer, 1).
The variables used within Meltzer’smodels are fairly consistent with that of with which we had
conducted. Meltzer had used various independent variables in the opportunity to predict salary via the
following variables: averagesalary, length of contract, OPSChange, Plate Appearance, All Star Selection, Gold
5. Glove, Age, Age-Sq, Catcher, Short Stop, Outfielder, Free Agent, Arbitration, Hi-Pay, Lux, Lo-Pay, and
Population (of the team’s metropolitan area)2. What is interesting about Meltzer’scollection of data is that
he introduces an identical hypothesis that we, as the authors of this analysis, had in conducting our own
research: Dropping any non-hitters, i.e., Pitchers. Both the research conducted by Meltzer and ourselves had
limited the data collection to hitters, as pitching statistics are “less universal than hitting statistics” (Meltzer,
13). Attempting to introduce this data would ultimately obscure the existing data pool while also being
incomplete in areas that have been determined as quantitative in terms of skill and performance, such as
variable “Hit” for instance.
VanFossen had introduced an unexpected perspective upon approaching justification of player salary
by means of economical inflation, risk assessment, strict marketing strategy, and non-quantitative
measurements of emotional appeal. Where there article lacked in strict quantitative and statistical analysis of
his premises and conclusion, his concept and theorized verbal analysis was thorough and argumentatively
sound. VanFossen’s elementary conclusion was that “[a]thletes are paid based upon their contribution to fan
satisfaction… [fans/viewership] contribute toward their salaries by attending games, watching them live on
the television, and supporting them through apparel purchases...” (VanFossen). Speaking-at-large, the
usefulness of this article is found within its ability to illuminate any inconsistencies with our predicted salary;
otherwise stated, further explaining the calculated R2 value, and consequently any error terms that might
suffice in our equation.
Data
The data used in our study originally consisted of all the player data that was available for all Major
League Baseball players during the 2013 season. This data set did not however separate players by position.
The total number observations in our full data set consisted of 837 players. Also it worth noting that the data
2 See Josh Meltzer (2005) for his regression in its entirety via step-by-step augmentation and analysis.
6. set did not exclude players because they did not have a minimum number of at bats during the season. Our
data was collected form a single source, the site Baseball-Reference.com, and any inaccuracies that are
recorded on the site will also be reflected in our regression model. The statistics that we collected included
salary, age, games played, number of home-runs, slugging percentage, number of hits, number of runs
scored, number of at bats, on base percentage, number of strikeouts, number of stolen bases, and the
number of times caught stolen. We selected these statistics because they were readily available on our data
source, and also because we believed that these statistics would be the best predictors of salary.
Methodology
Our initial thoughts on how to perform the regression was to regress salary against all the statistics
we had gathered, however this resulted very abnormal results such as the coefficient on hits and runs being
negative. Economic theory however would suggest that these statistics should have a positive effect on the
salty earned by the player. We later realized that this most likely caused by including Pitchers in the data set.
Pitcher in Major LeagueBaseball are evaluated on different metrics than batters and thus were throwing of
our results. We then striped all the Pitchers from our data set, and were left with 447 observations. When we
ran a regression on this data set, the results looked much better. We then changed our dependent variable
from salary to the log of salary because we believed that this give us a better indication of the movement in
salary. Our final step in the process was to drop all statistically insignificant variable that were not helping us
explain the variation in the log of salary such as stolen basses and stroke outs.
Empirical Results
Our final regression model, after all of the adjustment that we mentioned previously were made,
was:
𝐿𝑜𝑔𝑆𝑎𝑙𝑎𝑟𝑦 = β0 + β1Age + β2Age2 + β3Games + β4AtBats + β5Sluging + β6Hits + β7HR + β8OBP + μ
7. Where LogSalary is the log of a player’s salary, Age is the Player’s Age, Age2 is the Age squared, AtBatsis the
number of times a player batted, Slugging is the total number of bases a player had dived by the number of
at bats, Hits is the number of hits a player earned, HR is the number of home-runs a player hit, and OBP is the
percentage of times a player was on to base. As the tables at the end indicate, the model has an Adj R2 value
of .5139 which means that model is able to explain around 51% of the variation in LogSalary given the eight
aforementioned explanatory variables. This may seem to be low for a predictive model, but is actually in line
with similar studies conducted in the past.
𝐿𝑜𝑔𝑆𝑎𝑙𝑎𝑟𝑦̂ = -.213+ .869Age + -.012Age2 + -.018Games + .003AtBats + -2.01Sluging + .01Hits + .03HR +
1.58OBP
Given that all of our observations had a positive age and at least one gameplayed, we believe that
the constant term in our model is not significant to our findings. The beta coefficients on the explanatory
however are meaningful. For example one additional hit is expected to increase the salary by .01 percentage
points. Slugging percentageon the other hand is expected to have a negative effect on LogSalary, specifically
a one percent increase in slugging is expected to lower salary by 2.01 percentagepoints. Home-runs, hits, at
bats, and on base percent are all indicated to have a positive effect on salary, with interpretations of a .03
percentage point increase, .01 percentagepoint increase, .003 percentagepoint increase, and a 1.58
percentage point increase in salary respectively.
Conclusion
Given the results of model we believe that it is possible to predict the earnings of professional
baseball players in the MLB given their performance during the season. Even though our model only has an
Adj R2 of .51, we still believe that this model is usable given that it has an F stat of 59.94 and Probability F Crit
> F = 0.0000. This by no means that our model is a perfect predictor of salary, there arevarious
improvements that we can make to gain a better understating of the variance in player salary. Some possible
improvements include adding careerstatics to the model, and also classifying each player by position by
8. using dummy variables. Although we may be unable to quantify the x factor that a player may have for an
organization and its sponsors, we are however able to quantify how much a home-run, hit, or at bat is worth
to the organization, and by using this statistics we can predict how a player’s salary will change given his
performance during the season.
Tables
Summary Statistics
Regression
Correlation Matrix (Full Data Set)
10. Bibliography
Dietl, Helmut M, Markus Lang, and Alexander Rathke. "The Effect of Salary Caps in Professional Team
Sports on Social Welfare." The B.E. Journal of Economic Analysis & Policy 1, no. 72 (2009): 7-14.
Meltzer, Josh. “Average Salary and Contract Length in Major League Baseball: When Do They Diverge?”
2005, Department of Economics, Stanford University, CA. Accessed May 24, 2015
Rhonda Magel, Michael Hoffman, Predicting Salaries of Major League Baseball Players, International
Journal of Sports Science, Vol. 5 No. 2, 2015, pp. 51-58. doi: 10.5923/j.sports.20150502.02.
Sports Reference LLC. "(2013 Major League Baseball Standard Batting)." Baseball-Reference.com - Major
League Statistics and Information. http://www.baseball-reference.com/. Accessed May 22, 2015
VanFossen, Phillip. "The Economics of Professional Sports: Underpaid Millionaires?" The Economics of
Professional Sports: Underpaid Millionaires? August 5, 2009. Accessed May 27, 2015.