Summary
How the model works and what powers the predictions.
We analyze how teams have been playing recently and estimate who is more likely to win. The model examines recent game performance, opponent strength, player availability, and matchup dynamics. All predictions are data-based estimates, not guarantees.
What this model is
- Probability-based classifier for home team win outcomes.
- Built on pre-game statistical trends, not final box scores.
- Purely based on basketball-related activity (team performance, player availability, schedule factors).
- Validated with season holdout and walk-forward testing.
- Updated daily with live NBA game data from Ball Dont Lie API.
What this model is not
- Not a guarantee or recommendation.
- Does not factor in betting lines or market movements.
- Not a live injury oracle when lineups change late.
- Not tuned for totals or props (win-only focus).
- Not intended for commercial use or decision-making.
Training Snapshot
Model trained on 26 seasons (1999-2024) with 34,767 games using hundreds of engineered features. Season 2025 held out for testing (333 games). Blind baseline is a DummyClassifier; tuned model is XGBoost selected after benchmarking multiple algorithms (CV AUC: 0.705).
- ROC-AUC 0.500
- Accuracy 0.559
- Precision 0.559
- Recall 1.000
- F1 0.717
Blind baseline predicts the most common class. Sets the floor for model comparison.
Note: This is a blind baseline comparison, not Model V1