SlideShare a Scribd company logo
1 of 7
Download to read offline
Radu Stancut
Applied Data Science
Foundations Project Outline/Notes
Introduction
This Foundations paper will look at the performance of professional basketball teams in
the National Basketball Association (NBA) over the past ten years and investigate the
relationship that both offensive and defensive team statistics have on win totals, and by extension
playoff appearances. Specifically, I will be looking at the field goal percentages (FG%) on both
offense, the proficiency with which a team scores a basket, and defense, the effectiveness of
keeping the other team from scoring. It is expected that above average rates of performance in
either category would have a positive influence on winning games, but just how much remains to
be seen.
Statistical analysis and sports have become increasingly more intertwined over the last
decade in the public imagination. The reader may be aware of Moneyball, either the book or the
movie, if not both, describing the machinations of Major League Baseball’s (MLB) Oakland
Athletics’ General Manager, Billy Bean, who due to a limited payroll was inspired to leverage
statistical analysis in order to find underappreciated assets, in this case ballplayers, with which to
build several winning teams. This quantification of sports has in fact been going on for several
decades, from Bill James’ Baseball Prospectus in the 1980’s through to today’s Nate Silver,
ESPN-backed, website, fivethirtyeight.com, and onto the MIT Sloan Sports Analytics
Conference.
This paper is meant to provide a surface investigation into some such sports stats and
their influence on team outcomes. Below I describe the data collected, its analysis, my findings
and conclusions, followed by ideas on next steps
Data Description
NBA team statistics for regular seasons 2005 – 2014 were collected in the form of CSV
files from Basketball-Reference.com; two tables per season were collected, one for stats on
offense and the other on defense. The tables consisted of team names and a breakdown of team
performance by minutes, points, assists, rebounds, field goal percentages, and other categories (a
full list may be found in the appendix). In addition to the team season performance metrics I also
collected the win/loss standings and conference/division information for each season in an Excel
document that I later formatted for merging.
The offense and defense statistics were kept separate from one another but the ten
seasons had to be combined into one data frame for analysis. This combining of different seasons
was done via a ‘for loop’ on import within Python. The NBA team information of win/loss
records and conference/division data was also imported and subsequently merged, leading to two
data frames, one each for offense and defense, of 300 instances (30 teams, ten years) with
varying performance measurements (list in appendix).
Several calculations and identification steps were taken prior to running regression analysis:
• The winning percentage (Win%) of each team, by year, was calculated and placed into a
new column.
• A column was created to flag whether a team had a winning record (Win% > 0.5).
• Playoff teams were identified and flagged in a new binary column (Playoffs)
• FG% was normalized by grouping along Year, Year & Conference, and Year & Division,
and setting the average to 1.0 to allow for comparative analysis across seasons; a rating
above 1.0 in FG% on offense meant better than average, while the reverse was true on
defense FG%, a team had to be below 1.0 to be ‘above’ average in keeping opponents
from scoring (team examples in appendix).
Descriptive Statistics
To provide a general picture and context to the findings described in the next section, I have
outlined some exploratory numbers on the data here:
• The win distribution of NBA teams over this 10 year range tends toward a normal
distribution (number description in appendix):
• Winning teams had the following min, max, and mean indexed FG% over ten years:
o Offense – 0.945 (min/worst); 1.102 (max/best)1.015 (mean)
o Defense – 0.917 (min/best); 1.030 (max/worst); 0.982 (mean)
• Losing teams had the following min, max, and mean indexed FG% over ten years:
o Offense – 0.924 (min/worst); 1.044 (max/best); 0.984 (mean)
o Defense – 0.939 (min/best); 1.082 (max/worst); 1.018 (mean)
Methods & Analysis
As seen in the descriptive statistics above, this researcher’s expectations that above
average rates of performance in either category would have a positive influence on winning
games, has some superficial merit. In order to delve deeper I created scatter plots (see appendix),
by offense and defense, for FG% indexed by year (FG% IDX_Year), by year & conference
(FG% IDX_Conf), and year & division (FG% IDX_Div). NBA teams routinely play most of
their games within conference (52 of 82) and a sizable sample of those games are against
division rivals (16 of 52), the field goal grouping was meant to see if any of the various
performance indexes provided a better predictor of wins.
The slopes of the fitted lines for each indexes FG% group, league-wide were:
• Offense
o FG% IDX_Year: 3.011
o FG% IDX_Conf: 3.048
o FG% IDX_Div: 3.419
• Defense
o FG% IDX_Year: -3.500
o FG% IDX_Conf: -3.595
o FG% IDX_Div: -3.667
Two things jump out from the numbers above. First, the defensive FG% appeared to have
a bigger influence, as measured by the slope, than the offensive rate. Secondly, the more specific
indexes, with respect to conference and division, had a bigger influence by slope for both offense
and defense.
Following the scatter plots and fitted lines, a linear regression analysis was run; the
dependent variable in all instances was Win%, with the independent variable alternating between
offensive and defensive FG% IDX fields. In this instance there was little to no difference in
results between league, conference, and division comparisons so only the broadest measure,
league-wide, is outlined below and included in the appendix:
• Offense
o Coef: 0.5025; Std err: .008; R-squared: 0.924
• Defense
o Coef: 0.496; Std err: .0010; R-squared: 0.901
The results indicate that both offensive and defensive FG% performance numbers do a good job
of explaining winning in the NBA. Of course, much like the influence of education on wages
discussed in class, this could be due to other factors being bundled within these measurements.
Discussion & Next Steps
Taking up the thread of bundled influences above this research would need to be expanded to
other team statistical performances to properly weigh the impact that FG% alone has on winning.
Additionally, the analysis contained here dealt with only end of the year numbers and does not
purport to be predictive for in-season numbers. This is another area where further research would
need to be done, i.e., how does a team’s performance up to the All-Star break (about half way
into the season) factor into the remaining games and final win/loss record.
Appendix
List of Fields Used in Analysis, Including ‘Index’ Fields Created
'2P' 'Playoffs'
'2P%' 'STL'
'2PA' 'TOV'
'3P' 'TRB'
'3P%' 'Team'
'3PA' 'Team2'
'AST' 'Wins'
'BLK' 'Year'
'DEF Rk' 'Conference'
'DRB' 'Division'
'FG' 'Win%'
'FG%'
'Winning
Record'
'FGA' 'FG% Year'
'FT' '2P% Year'
'FT%' '3P% Year'
'FTA' 'FG% Conf'
'G' '2P% Conf'
'Losses' '3P% Conf'
'MP' 'FG% Div'
'ORB' '2P% Div'
'PF' '3P% Div'
'PTS'
'FG%
IDX_Year'
'PTS/G'
'FG%
IDX_Conf'
'FG% IDX_Div'
NBA Team Win Description (Histogram Companion)
count 300.000000
mean 40.196667
std 12.610899
min 7.000000
25% 31.750000
50% 41.000000
75% 50.000000
max 67.000000
Offense Linear Regression
OLS Regression Results
==============================================================================
Dep. Variable: Win% R-squared: 0.924
Model: OLS Adj. R-squared: 0.924
Method: Least Squares F-statistic: 3635.
Date: Wed, 10 Dec 2014 Prob (F-statistic): 2.31e-169
Time: 11:54:17 Log-Likelihood: 155.31
No. Observations: 300 AIC: -308.6
Df Residuals: 299 BIC: -304.9
Df Model: 1
================================================================================
coef std err t P>|t| [95.0% Conf. Int.]
--------------------------------------------------------------------------------
FG% IDX_Year 0.5025 0.008 60.292 0.000 0.486 0.519
==============================================================================
Omnibus: 20.895 Durbin-Watson: 2.137
Prob(Omnibus): 0.000 Jarque-Bera (JB): 8.860
Skew: -0.167 Prob(JB): 0.0119
Kurtosis: 2.227 Cond. No. 1.00
==============================================================================
Defense Linear Regression
OLS Regression Results
==============================================================================
Dep. Variable: Win% R-squared: 0.901
Model: OLS Adj. R-squared: 0.900
Method: Least Squares F-statistic: 2710.
Date: Wed, 10 Dec 2014 Prob (F-statistic): 6.03e-152
Time: 11:05:58 Log-Likelihood: 115.08
No. Observations: 300 AIC: -228.2
Df Residuals: 299 BIC: -224.5
Df Model: 1
================================================================================
coef std err t P>|t| [95.0% Conf. Int.]
--------------------------------------------------------------------------------
FG% IDX_Year 0.4961 0.010 52.055 0.000 0.477 0.515
==============================================================================
Omnibus: 20.496 Durbin-Watson: 2.091
Prob(Omnibus): 0.000 Jarque-Bera (JB): 8.474
Skew: -0.142 Prob(JB): 0.0145
Kurtosis: 2.227 Cond. No. 1.00
==============================================================================

More Related Content

Viewers also liked

文學作品
文學作品文學作品
文學作品T H
 
противодействие коррупции
противодействие коррупциипротиводействие коррупции
противодействие коррупцииЯна Милокум
 
PCNs 3 e 4 ciclos para blog
PCNs 3 e 4 ciclos para blogPCNs 3 e 4 ciclos para blog
PCNs 3 e 4 ciclos para blogThiagogui
 
Teste mat5º nº(s) naturais
Teste mat5º nº(s) naturaisTeste mat5º nº(s) naturais
Teste mat5º nº(s) naturaismarcommendes
 
Java__J2EE_Gaurav_Singh
Java__J2EE_Gaurav_SinghJava__J2EE_Gaurav_Singh
Java__J2EE_Gaurav_SinghGaurav Singh
 

Viewers also liked (6)

文學作品
文學作品文學作品
文學作品
 
противодействие коррупции
противодействие коррупциипротиводействие коррупции
противодействие коррупции
 
Juego teatral en la infancia
Juego teatral en la infanciaJuego teatral en la infancia
Juego teatral en la infancia
 
PCNs 3 e 4 ciclos para blog
PCNs 3 e 4 ciclos para blogPCNs 3 e 4 ciclos para blog
PCNs 3 e 4 ciclos para blog
 
Teste mat5º nº(s) naturais
Teste mat5º nº(s) naturaisTeste mat5º nº(s) naturais
Teste mat5º nº(s) naturais
 
Java__J2EE_Gaurav_Singh
Java__J2EE_Gaurav_SinghJava__J2EE_Gaurav_Singh
Java__J2EE_Gaurav_Singh
 

Similar to Perfunctory NBA Analysis

1. After watching the attached video by Dan Pink on .docx
1. After watching the attached video by Dan Pink on .docx1. After watching the attached video by Dan Pink on .docx
1. After watching the attached video by Dan Pink on .docxjeremylockett77
 
Yujie Zi Econ 123CW Research Paper - NBA Defensive Teams
Yujie Zi Econ 123CW Research Paper - NBA Defensive TeamsYujie Zi Econ 123CW Research Paper - NBA Defensive Teams
Yujie Zi Econ 123CW Research Paper - NBA Defensive TeamsYujie Zi
 
CLanctot_DSlavin_JMiron_Stats415_Project
CLanctot_DSlavin_JMiron_Stats415_ProjectCLanctot_DSlavin_JMiron_Stats415_Project
CLanctot_DSlavin_JMiron_Stats415_ProjectDimitry Slavin
 
Measuring Team Chemistry in MLB
Measuring Team Chemistry in MLBMeasuring Team Chemistry in MLB
Measuring Team Chemistry in MLBDavid Kelly
 
NBA Shorter Game and Competitive Balance
NBA Shorter Game and Competitive BalanceNBA Shorter Game and Competitive Balance
NBA Shorter Game and Competitive BalanceDavid Schneider
 
NBA Shorter Game and Competitive Balance
NBA Shorter Game and Competitive BalanceNBA Shorter Game and Competitive Balance
NBA Shorter Game and Competitive BalanceCaleb Engelbourg
 
Identifying Key Factors in Winning MLB Games Using a Data-Mining Approach
Identifying Key Factors in Winning MLB Games Using a Data-Mining ApproachIdentifying Key Factors in Winning MLB Games Using a Data-Mining Approach
Identifying Key Factors in Winning MLB Games Using a Data-Mining ApproachJoelDabady
 
Determinants of College Football Attendance
Determinants of College Football AttendanceDeterminants of College Football Attendance
Determinants of College Football AttendanceConnor Weaver
 
Relations to salary and team contribution in NBA ( 17-18 Season )
Relations to salary and team contribution in NBA ( 17-18 Season )Relations to salary and team contribution in NBA ( 17-18 Season )
Relations to salary and team contribution in NBA ( 17-18 Season )상연 최
 
Senior Project Research Paper
Senior Project Research PaperSenior Project Research Paper
Senior Project Research Papercrissy498
 
The Contract Year Effect in the NBA
The Contract Year Effect in the NBAThe Contract Year Effect in the NBA
The Contract Year Effect in the NBAJoshua Kaplan
 
InstructionsCongratulations. You are a finalist in for a data a.docx
InstructionsCongratulations. You are a finalist in for a data a.docxInstructionsCongratulations. You are a finalist in for a data a.docx
InstructionsCongratulations. You are a finalist in for a data a.docxnormanibarber20063
 
La liga 2013 2014 analysis
La liga 2013 2014 analysisLa liga 2013 2014 analysis
La liga 2013 2014 analysisRitu Sarkar
 
Performance Analysis NFL 2014
Performance Analysis NFL 2014Performance Analysis NFL 2014
Performance Analysis NFL 2014Roshik Ganesan
 

Similar to Perfunctory NBA Analysis (20)

1. After watching the attached video by Dan Pink on .docx
1. After watching the attached video by Dan Pink on .docx1. After watching the attached video by Dan Pink on .docx
1. After watching the attached video by Dan Pink on .docx
 
Yujie Zi Econ 123CW Research Paper - NBA Defensive Teams
Yujie Zi Econ 123CW Research Paper - NBA Defensive TeamsYujie Zi Econ 123CW Research Paper - NBA Defensive Teams
Yujie Zi Econ 123CW Research Paper - NBA Defensive Teams
 
Research Paper
Research PaperResearch Paper
Research Paper
 
CLanctot_DSlavin_JMiron_Stats415_Project
CLanctot_DSlavin_JMiron_Stats415_ProjectCLanctot_DSlavin_JMiron_Stats415_Project
CLanctot_DSlavin_JMiron_Stats415_Project
 
Measuring Team Chemistry in MLB
Measuring Team Chemistry in MLBMeasuring Team Chemistry in MLB
Measuring Team Chemistry in MLB
 
NBA Shorter Game and Competitive Balance
NBA Shorter Game and Competitive BalanceNBA Shorter Game and Competitive Balance
NBA Shorter Game and Competitive Balance
 
NBA Shorter Game and Competitive Balance
NBA Shorter Game and Competitive BalanceNBA Shorter Game and Competitive Balance
NBA Shorter Game and Competitive Balance
 
Identifying Key Factors in Winning MLB Games Using a Data-Mining Approach
Identifying Key Factors in Winning MLB Games Using a Data-Mining ApproachIdentifying Key Factors in Winning MLB Games Using a Data-Mining Approach
Identifying Key Factors in Winning MLB Games Using a Data-Mining Approach
 
LAX IMPACT! White Paper
LAX IMPACT! White PaperLAX IMPACT! White Paper
LAX IMPACT! White Paper
 
Kerber_NBA_Analysis
Kerber_NBA_AnalysisKerber_NBA_Analysis
Kerber_NBA_Analysis
 
Statistical Model Report
Statistical Model ReportStatistical Model Report
Statistical Model Report
 
Statistical Model Report
Statistical Model ReportStatistical Model Report
Statistical Model Report
 
Determinants of College Football Attendance
Determinants of College Football AttendanceDeterminants of College Football Attendance
Determinants of College Football Attendance
 
Relations to salary and team contribution in NBA ( 17-18 Season )
Relations to salary and team contribution in NBA ( 17-18 Season )Relations to salary and team contribution in NBA ( 17-18 Season )
Relations to salary and team contribution in NBA ( 17-18 Season )
 
Final Research Paper
Final Research PaperFinal Research Paper
Final Research Paper
 
Senior Project Research Paper
Senior Project Research PaperSenior Project Research Paper
Senior Project Research Paper
 
The Contract Year Effect in the NBA
The Contract Year Effect in the NBAThe Contract Year Effect in the NBA
The Contract Year Effect in the NBA
 
InstructionsCongratulations. You are a finalist in for a data a.docx
InstructionsCongratulations. You are a finalist in for a data a.docxInstructionsCongratulations. You are a finalist in for a data a.docx
InstructionsCongratulations. You are a finalist in for a data a.docx
 
La liga 2013 2014 analysis
La liga 2013 2014 analysisLa liga 2013 2014 analysis
La liga 2013 2014 analysis
 
Performance Analysis NFL 2014
Performance Analysis NFL 2014Performance Analysis NFL 2014
Performance Analysis NFL 2014
 

More from Radu Stancut

hail_market_analysis_2015
hail_market_analysis_2015hail_market_analysis_2015
hail_market_analysis_2015Radu Stancut
 
HHSAcceleratorProviderDataReport2015
HHSAcceleratorProviderDataReport2015HHSAcceleratorProviderDataReport2015
HHSAcceleratorProviderDataReport2015Radu Stancut
 
first_vehicle_safety_technology_report
first_vehicle_safety_technology_reportfirst_vehicle_safety_technology_report
first_vehicle_safety_technology_reportRadu Stancut
 
Defining Urban Waste(Memo)
Defining Urban Waste(Memo)Defining Urban Waste(Memo)
Defining Urban Waste(Memo)Radu Stancut
 
Visual Data Exploration of Parole Data with Tableau
Visual Data Exploration of Parole Data with TableauVisual Data Exploration of Parole Data with Tableau
Visual Data Exploration of Parole Data with TableauRadu Stancut
 
Data Science Methodology certificate
Data Science Methodology certificateData Science Methodology certificate
Data Science Methodology certificateRadu Stancut
 
NOLA BlightStatus Demolitions
NOLA BlightStatus DemolitionsNOLA BlightStatus Demolitions
NOLA BlightStatus DemolitionsRadu Stancut
 
Spatial Patterns of Urban Innovation and Productivity
Spatial Patterns of Urban Innovation and ProductivitySpatial Patterns of Urban Innovation and Productivity
Spatial Patterns of Urban Innovation and ProductivityRadu Stancut
 
Procuring for Innovation
Procuring for InnovationProcuring for Innovation
Procuring for InnovationRadu Stancut
 
Flooding, public parks, & marshes
Flooding, public parks, & marshesFlooding, public parks, & marshes
Flooding, public parks, & marshesRadu Stancut
 

More from Radu Stancut (11)

hail_market_analysis_2015
hail_market_analysis_2015hail_market_analysis_2015
hail_market_analysis_2015
 
HHSAcceleratorProviderDataReport2015
HHSAcceleratorProviderDataReport2015HHSAcceleratorProviderDataReport2015
HHSAcceleratorProviderDataReport2015
 
first_vehicle_safety_technology_report
first_vehicle_safety_technology_reportfirst_vehicle_safety_technology_report
first_vehicle_safety_technology_report
 
BusVizPoster-v2
BusVizPoster-v2BusVizPoster-v2
BusVizPoster-v2
 
Defining Urban Waste(Memo)
Defining Urban Waste(Memo)Defining Urban Waste(Memo)
Defining Urban Waste(Memo)
 
Visual Data Exploration of Parole Data with Tableau
Visual Data Exploration of Parole Data with TableauVisual Data Exploration of Parole Data with Tableau
Visual Data Exploration of Parole Data with Tableau
 
Data Science Methodology certificate
Data Science Methodology certificateData Science Methodology certificate
Data Science Methodology certificate
 
NOLA BlightStatus Demolitions
NOLA BlightStatus DemolitionsNOLA BlightStatus Demolitions
NOLA BlightStatus Demolitions
 
Spatial Patterns of Urban Innovation and Productivity
Spatial Patterns of Urban Innovation and ProductivitySpatial Patterns of Urban Innovation and Productivity
Spatial Patterns of Urban Innovation and Productivity
 
Procuring for Innovation
Procuring for InnovationProcuring for Innovation
Procuring for Innovation
 
Flooding, public parks, & marshes
Flooding, public parks, & marshesFlooding, public parks, & marshes
Flooding, public parks, & marshes
 

Recently uploaded

Call Girls in Dhaula Kuan 💯Call Us 🔝8264348440🔝
Call Girls in Dhaula Kuan 💯Call Us 🔝8264348440🔝Call Girls in Dhaula Kuan 💯Call Us 🔝8264348440🔝
Call Girls in Dhaula Kuan 💯Call Us 🔝8264348440🔝soniya singh
 
08448380779 Call Girls In Karol Bagh Women Seeking Men
08448380779 Call Girls In Karol Bagh Women Seeking Men08448380779 Call Girls In Karol Bagh Women Seeking Men
08448380779 Call Girls In Karol Bagh Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Lajpat Nagar Women Seeking Men
08448380779 Call Girls In Lajpat Nagar Women Seeking Men08448380779 Call Girls In Lajpat Nagar Women Seeking Men
08448380779 Call Girls In Lajpat Nagar Women Seeking MenDelhi Call girls
 
TAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdf
TAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdfTAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdf
TAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdfSocial Samosa
 
Dubai Call Girls Bikni O528786472 Call Girls Dubai Ebony
Dubai Call Girls Bikni O528786472 Call Girls Dubai EbonyDubai Call Girls Bikni O528786472 Call Girls Dubai Ebony
Dubai Call Girls Bikni O528786472 Call Girls Dubai Ebonyhf8803863
 
CALL ON ➥8923113531 🔝Call Girls Telibagh Lucknow best Night Fun service 🧣
CALL ON ➥8923113531 🔝Call Girls Telibagh Lucknow best Night Fun service  🧣CALL ON ➥8923113531 🔝Call Girls Telibagh Lucknow best Night Fun service  🧣
CALL ON ➥8923113531 🔝Call Girls Telibagh Lucknow best Night Fun service 🧣anilsa9823
 
JORNADA 4 LIGA MURO 2024TUXTEPEC1234.pdf
JORNADA 4 LIGA MURO 2024TUXTEPEC1234.pdfJORNADA 4 LIGA MURO 2024TUXTEPEC1234.pdf
JORNADA 4 LIGA MURO 2024TUXTEPEC1234.pdfArturo Pacheco Alvarez
 
CALL ON ➥8923113531 🔝Call Girls Saharaganj Lucknow best Female service 🦺
CALL ON ➥8923113531 🔝Call Girls Saharaganj Lucknow best Female service  🦺CALL ON ➥8923113531 🔝Call Girls Saharaganj Lucknow best Female service  🦺
CALL ON ➥8923113531 🔝Call Girls Saharaganj Lucknow best Female service 🦺anilsa9823
 
08448380779 Call Girls In IIT Women Seeking Men
08448380779 Call Girls In IIT Women Seeking Men08448380779 Call Girls In IIT Women Seeking Men
08448380779 Call Girls In IIT Women Seeking MenDelhi Call girls
 
Croatia vs Albania Clash of Euro Cup 2024 Squad Preparations and Euro Cup Dre...
Croatia vs Albania Clash of Euro Cup 2024 Squad Preparations and Euro Cup Dre...Croatia vs Albania Clash of Euro Cup 2024 Squad Preparations and Euro Cup Dre...
Croatia vs Albania Clash of Euro Cup 2024 Squad Preparations and Euro Cup Dre...Eticketing.co
 
Plan d'orientations stratégiques rugby féminin
Plan d'orientations stratégiques rugby fémininPlan d'orientations stratégiques rugby féminin
Plan d'orientations stratégiques rugby fémininThibaut TATRY
 
VIP Kolkata Call Girl Liluah 👉 8250192130 Available With Room
VIP Kolkata Call Girl Liluah 👉 8250192130  Available With RoomVIP Kolkata Call Girl Liluah 👉 8250192130  Available With Room
VIP Kolkata Call Girl Liluah 👉 8250192130 Available With Roomdivyansh0kumar0
 
ppt on Myself, Occupation and my Interest
ppt on Myself, Occupation and my Interestppt on Myself, Occupation and my Interest
ppt on Myself, Occupation and my InterestNagaissenValaydum
 
( Sports training) All topic (MCQs).pptx
( Sports training) All topic (MCQs).pptx( Sports training) All topic (MCQs).pptx
( Sports training) All topic (MCQs).pptxParshotamGupta1
 
Albania Vs Spain Albania is Loaded with Defensive Talent on their Roster.docx
Albania Vs Spain Albania is Loaded with Defensive Talent on their Roster.docxAlbania Vs Spain Albania is Loaded with Defensive Talent on their Roster.docx
Albania Vs Spain Albania is Loaded with Defensive Talent on their Roster.docxWorld Wide Tickets And Hospitality
 
Resultados del Campeonato mundial de Marcha por equipos Antalya 2024
Resultados del Campeonato mundial de Marcha por equipos Antalya 2024Resultados del Campeonato mundial de Marcha por equipos Antalya 2024
Resultados del Campeonato mundial de Marcha por equipos Antalya 2024Judith Chuquipul
 
Stunning ➥8448380779▻ Call Girls In Delhi Cantt Delhi NCR
Stunning ➥8448380779▻ Call Girls In Delhi Cantt Delhi NCRStunning ➥8448380779▻ Call Girls In Delhi Cantt Delhi NCR
Stunning ➥8448380779▻ Call Girls In Delhi Cantt Delhi NCRDelhi Call girls
 
Indian Premiere League 2024 by livecricline
Indian Premiere League 2024 by livecriclineIndian Premiere League 2024 by livecricline
Indian Premiere League 2024 by livecriclineLive Cric Line
 

Recently uploaded (20)

Call Girls in Dhaula Kuan 💯Call Us 🔝8264348440🔝
Call Girls in Dhaula Kuan 💯Call Us 🔝8264348440🔝Call Girls in Dhaula Kuan 💯Call Us 🔝8264348440🔝
Call Girls in Dhaula Kuan 💯Call Us 🔝8264348440🔝
 
08448380779 Call Girls In Karol Bagh Women Seeking Men
08448380779 Call Girls In Karol Bagh Women Seeking Men08448380779 Call Girls In Karol Bagh Women Seeking Men
08448380779 Call Girls In Karol Bagh Women Seeking Men
 
08448380779 Call Girls In Lajpat Nagar Women Seeking Men
08448380779 Call Girls In Lajpat Nagar Women Seeking Men08448380779 Call Girls In Lajpat Nagar Women Seeking Men
08448380779 Call Girls In Lajpat Nagar Women Seeking Men
 
TAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdf
TAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdfTAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdf
TAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdf
 
Dubai Call Girls Bikni O528786472 Call Girls Dubai Ebony
Dubai Call Girls Bikni O528786472 Call Girls Dubai EbonyDubai Call Girls Bikni O528786472 Call Girls Dubai Ebony
Dubai Call Girls Bikni O528786472 Call Girls Dubai Ebony
 
CALL ON ➥8923113531 🔝Call Girls Telibagh Lucknow best Night Fun service 🧣
CALL ON ➥8923113531 🔝Call Girls Telibagh Lucknow best Night Fun service  🧣CALL ON ➥8923113531 🔝Call Girls Telibagh Lucknow best Night Fun service  🧣
CALL ON ➥8923113531 🔝Call Girls Telibagh Lucknow best Night Fun service 🧣
 
JORNADA 4 LIGA MURO 2024TUXTEPEC1234.pdf
JORNADA 4 LIGA MURO 2024TUXTEPEC1234.pdfJORNADA 4 LIGA MURO 2024TUXTEPEC1234.pdf
JORNADA 4 LIGA MURO 2024TUXTEPEC1234.pdf
 
CALL ON ➥8923113531 🔝Call Girls Saharaganj Lucknow best Female service 🦺
CALL ON ➥8923113531 🔝Call Girls Saharaganj Lucknow best Female service  🦺CALL ON ➥8923113531 🔝Call Girls Saharaganj Lucknow best Female service  🦺
CALL ON ➥8923113531 🔝Call Girls Saharaganj Lucknow best Female service 🦺
 
08448380779 Call Girls In IIT Women Seeking Men
08448380779 Call Girls In IIT Women Seeking Men08448380779 Call Girls In IIT Women Seeking Men
08448380779 Call Girls In IIT Women Seeking Men
 
Croatia vs Albania Clash of Euro Cup 2024 Squad Preparations and Euro Cup Dre...
Croatia vs Albania Clash of Euro Cup 2024 Squad Preparations and Euro Cup Dre...Croatia vs Albania Clash of Euro Cup 2024 Squad Preparations and Euro Cup Dre...
Croatia vs Albania Clash of Euro Cup 2024 Squad Preparations and Euro Cup Dre...
 
Plan d'orientations stratégiques rugby féminin
Plan d'orientations stratégiques rugby fémininPlan d'orientations stratégiques rugby féminin
Plan d'orientations stratégiques rugby féminin
 
VIP Kolkata Call Girl Liluah 👉 8250192130 Available With Room
VIP Kolkata Call Girl Liluah 👉 8250192130  Available With RoomVIP Kolkata Call Girl Liluah 👉 8250192130  Available With Room
VIP Kolkata Call Girl Liluah 👉 8250192130 Available With Room
 
Call Girls Service Noida Extension @9999965857 Delhi 🫦 No Advance VVIP 🍎 SER...
Call Girls Service Noida Extension @9999965857 Delhi 🫦 No Advance  VVIP 🍎 SER...Call Girls Service Noida Extension @9999965857 Delhi 🫦 No Advance  VVIP 🍎 SER...
Call Girls Service Noida Extension @9999965857 Delhi 🫦 No Advance VVIP 🍎 SER...
 
ppt on Myself, Occupation and my Interest
ppt on Myself, Occupation and my Interestppt on Myself, Occupation and my Interest
ppt on Myself, Occupation and my Interest
 
( Sports training) All topic (MCQs).pptx
( Sports training) All topic (MCQs).pptx( Sports training) All topic (MCQs).pptx
( Sports training) All topic (MCQs).pptx
 
Albania Vs Spain Albania is Loaded with Defensive Talent on their Roster.docx
Albania Vs Spain Albania is Loaded with Defensive Talent on their Roster.docxAlbania Vs Spain Albania is Loaded with Defensive Talent on their Roster.docx
Albania Vs Spain Albania is Loaded with Defensive Talent on their Roster.docx
 
Resultados del Campeonato mundial de Marcha por equipos Antalya 2024
Resultados del Campeonato mundial de Marcha por equipos Antalya 2024Resultados del Campeonato mundial de Marcha por equipos Antalya 2024
Resultados del Campeonato mundial de Marcha por equipos Antalya 2024
 
Call Girls In Vasundhara 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
Call Girls In Vasundhara 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICECall Girls In Vasundhara 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
Call Girls In Vasundhara 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
 
Stunning ➥8448380779▻ Call Girls In Delhi Cantt Delhi NCR
Stunning ➥8448380779▻ Call Girls In Delhi Cantt Delhi NCRStunning ➥8448380779▻ Call Girls In Delhi Cantt Delhi NCR
Stunning ➥8448380779▻ Call Girls In Delhi Cantt Delhi NCR
 
Indian Premiere League 2024 by livecricline
Indian Premiere League 2024 by livecriclineIndian Premiere League 2024 by livecricline
Indian Premiere League 2024 by livecricline
 

Perfunctory NBA Analysis

  • 1. Radu Stancut Applied Data Science Foundations Project Outline/Notes Introduction This Foundations paper will look at the performance of professional basketball teams in the National Basketball Association (NBA) over the past ten years and investigate the relationship that both offensive and defensive team statistics have on win totals, and by extension playoff appearances. Specifically, I will be looking at the field goal percentages (FG%) on both offense, the proficiency with which a team scores a basket, and defense, the effectiveness of keeping the other team from scoring. It is expected that above average rates of performance in either category would have a positive influence on winning games, but just how much remains to be seen. Statistical analysis and sports have become increasingly more intertwined over the last decade in the public imagination. The reader may be aware of Moneyball, either the book or the movie, if not both, describing the machinations of Major League Baseball’s (MLB) Oakland Athletics’ General Manager, Billy Bean, who due to a limited payroll was inspired to leverage statistical analysis in order to find underappreciated assets, in this case ballplayers, with which to build several winning teams. This quantification of sports has in fact been going on for several decades, from Bill James’ Baseball Prospectus in the 1980’s through to today’s Nate Silver, ESPN-backed, website, fivethirtyeight.com, and onto the MIT Sloan Sports Analytics Conference.
  • 2. This paper is meant to provide a surface investigation into some such sports stats and their influence on team outcomes. Below I describe the data collected, its analysis, my findings and conclusions, followed by ideas on next steps Data Description NBA team statistics for regular seasons 2005 – 2014 were collected in the form of CSV files from Basketball-Reference.com; two tables per season were collected, one for stats on offense and the other on defense. The tables consisted of team names and a breakdown of team performance by minutes, points, assists, rebounds, field goal percentages, and other categories (a full list may be found in the appendix). In addition to the team season performance metrics I also collected the win/loss standings and conference/division information for each season in an Excel document that I later formatted for merging. The offense and defense statistics were kept separate from one another but the ten seasons had to be combined into one data frame for analysis. This combining of different seasons was done via a ‘for loop’ on import within Python. The NBA team information of win/loss records and conference/division data was also imported and subsequently merged, leading to two data frames, one each for offense and defense, of 300 instances (30 teams, ten years) with varying performance measurements (list in appendix). Several calculations and identification steps were taken prior to running regression analysis: • The winning percentage (Win%) of each team, by year, was calculated and placed into a new column. • A column was created to flag whether a team had a winning record (Win% > 0.5). • Playoff teams were identified and flagged in a new binary column (Playoffs)
  • 3. • FG% was normalized by grouping along Year, Year & Conference, and Year & Division, and setting the average to 1.0 to allow for comparative analysis across seasons; a rating above 1.0 in FG% on offense meant better than average, while the reverse was true on defense FG%, a team had to be below 1.0 to be ‘above’ average in keeping opponents from scoring (team examples in appendix). Descriptive Statistics To provide a general picture and context to the findings described in the next section, I have outlined some exploratory numbers on the data here: • The win distribution of NBA teams over this 10 year range tends toward a normal distribution (number description in appendix): • Winning teams had the following min, max, and mean indexed FG% over ten years: o Offense – 0.945 (min/worst); 1.102 (max/best)1.015 (mean) o Defense – 0.917 (min/best); 1.030 (max/worst); 0.982 (mean) • Losing teams had the following min, max, and mean indexed FG% over ten years: o Offense – 0.924 (min/worst); 1.044 (max/best); 0.984 (mean) o Defense – 0.939 (min/best); 1.082 (max/worst); 1.018 (mean)
  • 4. Methods & Analysis As seen in the descriptive statistics above, this researcher’s expectations that above average rates of performance in either category would have a positive influence on winning games, has some superficial merit. In order to delve deeper I created scatter plots (see appendix), by offense and defense, for FG% indexed by year (FG% IDX_Year), by year & conference (FG% IDX_Conf), and year & division (FG% IDX_Div). NBA teams routinely play most of their games within conference (52 of 82) and a sizable sample of those games are against division rivals (16 of 52), the field goal grouping was meant to see if any of the various performance indexes provided a better predictor of wins. The slopes of the fitted lines for each indexes FG% group, league-wide were: • Offense o FG% IDX_Year: 3.011 o FG% IDX_Conf: 3.048 o FG% IDX_Div: 3.419 • Defense o FG% IDX_Year: -3.500 o FG% IDX_Conf: -3.595 o FG% IDX_Div: -3.667 Two things jump out from the numbers above. First, the defensive FG% appeared to have a bigger influence, as measured by the slope, than the offensive rate. Secondly, the more specific indexes, with respect to conference and division, had a bigger influence by slope for both offense and defense. Following the scatter plots and fitted lines, a linear regression analysis was run; the dependent variable in all instances was Win%, with the independent variable alternating between offensive and defensive FG% IDX fields. In this instance there was little to no difference in
  • 5. results between league, conference, and division comparisons so only the broadest measure, league-wide, is outlined below and included in the appendix: • Offense o Coef: 0.5025; Std err: .008; R-squared: 0.924 • Defense o Coef: 0.496; Std err: .0010; R-squared: 0.901 The results indicate that both offensive and defensive FG% performance numbers do a good job of explaining winning in the NBA. Of course, much like the influence of education on wages discussed in class, this could be due to other factors being bundled within these measurements. Discussion & Next Steps Taking up the thread of bundled influences above this research would need to be expanded to other team statistical performances to properly weigh the impact that FG% alone has on winning. Additionally, the analysis contained here dealt with only end of the year numbers and does not purport to be predictive for in-season numbers. This is another area where further research would need to be done, i.e., how does a team’s performance up to the All-Star break (about half way into the season) factor into the remaining games and final win/loss record.
  • 6. Appendix List of Fields Used in Analysis, Including ‘Index’ Fields Created '2P' 'Playoffs' '2P%' 'STL' '2PA' 'TOV' '3P' 'TRB' '3P%' 'Team' '3PA' 'Team2' 'AST' 'Wins' 'BLK' 'Year' 'DEF Rk' 'Conference' 'DRB' 'Division' 'FG' 'Win%' 'FG%' 'Winning Record' 'FGA' 'FG% Year' 'FT' '2P% Year' 'FT%' '3P% Year' 'FTA' 'FG% Conf' 'G' '2P% Conf' 'Losses' '3P% Conf' 'MP' 'FG% Div' 'ORB' '2P% Div' 'PF' '3P% Div' 'PTS' 'FG% IDX_Year' 'PTS/G' 'FG% IDX_Conf' 'FG% IDX_Div' NBA Team Win Description (Histogram Companion) count 300.000000 mean 40.196667 std 12.610899 min 7.000000 25% 31.750000 50% 41.000000 75% 50.000000 max 67.000000
  • 7. Offense Linear Regression OLS Regression Results ============================================================================== Dep. Variable: Win% R-squared: 0.924 Model: OLS Adj. R-squared: 0.924 Method: Least Squares F-statistic: 3635. Date: Wed, 10 Dec 2014 Prob (F-statistic): 2.31e-169 Time: 11:54:17 Log-Likelihood: 155.31 No. Observations: 300 AIC: -308.6 Df Residuals: 299 BIC: -304.9 Df Model: 1 ================================================================================ coef std err t P>|t| [95.0% Conf. Int.] -------------------------------------------------------------------------------- FG% IDX_Year 0.5025 0.008 60.292 0.000 0.486 0.519 ============================================================================== Omnibus: 20.895 Durbin-Watson: 2.137 Prob(Omnibus): 0.000 Jarque-Bera (JB): 8.860 Skew: -0.167 Prob(JB): 0.0119 Kurtosis: 2.227 Cond. No. 1.00 ============================================================================== Defense Linear Regression OLS Regression Results ============================================================================== Dep. Variable: Win% R-squared: 0.901 Model: OLS Adj. R-squared: 0.900 Method: Least Squares F-statistic: 2710. Date: Wed, 10 Dec 2014 Prob (F-statistic): 6.03e-152 Time: 11:05:58 Log-Likelihood: 115.08 No. Observations: 300 AIC: -228.2 Df Residuals: 299 BIC: -224.5 Df Model: 1 ================================================================================ coef std err t P>|t| [95.0% Conf. Int.] -------------------------------------------------------------------------------- FG% IDX_Year 0.4961 0.010 52.055 0.000 0.477 0.515 ============================================================================== Omnibus: 20.496 Durbin-Watson: 2.091 Prob(Omnibus): 0.000 Jarque-Bera (JB): 8.474 Skew: -0.142 Prob(JB): 0.0145 Kurtosis: 2.227 Cond. No. 1.00 ==============================================================================