Predicting Team Success Through Statistical Analysis
This project analyzes 40 years of NBA data (1979–2019) to identify which statistical factors most influence team success and creates a predictive model that can estimate a team's wins based on their performance metrics.
Developed a linear regression model that accurately predicts NBA team wins based on key performance statistics, with shooting efficiency, offensive rebounds, and turnovers emerging as the most influential factors.
Comprehensive historical analysis
Statistical factors in final model
Post three-point line introduction
The analysis revealed several key statistics that have the strongest impact on an NBA team's win total:
Relative importance of statistical factors in predicting wins
While basketball has evolved, fundamentals like rebounds, steals, and assists have remained relatively stable. Three-point attempts and makes have dramatically increased (especially 2011–2019) while two-point attempts declined.
Evolution of three-point shooting in the NBA (1979–2019)
This project followed a structured analytical approach to develop a reliable predictive model:
Normal probability plot of residuals indicating model validity
We removed collinear variables (e.g., raw points) and reduced mean VIF to 1.98. The final model passed heteroskedasticity tests (p = 0.5167).
Variance Inflation Factor (VIF) analysis showing low multicollinearity
ŵ = -169.712 + 260.7082·2P% + 48.4078·3P% + 34.8791·FT% + 2.0234·ORB + 0.6295·DRB + 0.3581·AST + 2.0269·STL + 2.7048·BLK - 2.6861·TOV
Evaluate roster changes and prioritize the stats that most move wins.
Leverage early-season stats to form more accurate performance priors.
Emphasize shooting efficiency, offensive rebounding, and ball security.
Prioritize efficiency over volume, hunt second-chance points, and reduce turnovers.
The model highlights the statistical levers that most reliably translate to wins, offering a data-driven lens for decisions.