Model Transparency
Our prediction model is fully open about its methodology and accuracy. No black boxes.
How It Works
Every UFC fight since 1994, scraped from UFCStats.com. Fighter stats computed as point-in-time snapshots to prevent data leakage.
LightGBM + CatBoost ensemble (65/35 blend), tuned with Optuna (50 trials each). Trained on pre-2022 data, validated on 2022, tested on 2023.
24 features including Elo ratings, rolling fight stats, defensive metrics, style matchups, and market odds when available.
Confidence Scaling
The model's confidence labels reflect real accuracy. When we say “Strong”, we mean it.
Calibration
When the model says 70%, does that fighter actually win ~70% of the time? Closer to the diagonal = better calibrated.
| Predicted Range | Mean Predicted | Actual Win Rate | Fights | Difference |
|---|---|---|---|---|
| 10-20% | 17.2% | 18.2% | 11 | +1.0pp |
| 20-30% | 25.6% | 21.6% | 51 | -4.0pp |
| 30-40% | 35.7% | 40.7% | 54 | +5.0pp |
| 40-50% | 45.5% | 51.3% | 78 | +5.8pp |
| 50-60% | 54.9% | 60.5% | 81 | +5.6pp |
| 60-70% | 64.9% | 58.3% | 60 | -6.6pp |
| 70-80% | 74.5% | 66.7% | 42 | -7.8pp |
| 80-90% | 85.0% | 95.7% | 23 | +10.7pp |
Performance by Weight Class
What Drives Predictions
Top features ranked by SHAP importance (mean absolute impact on predictions).
Honest Assessment: Model vs Market
On fights with betting odds (294 of 402 test fights), the closing line achieves 68.0% accuracy vs our model's 64.6%. The market is hard to beat — it aggregates thousands of sharp bettors.
Where our model adds value:
- Fights without odds data — we still predict at 60.2% accuracy
- Structural analysis: Elo, style matchups, and stat differentials give context the line doesn't
- Early line detection: spots value before lines sharpen
- High-confidence picks (>75% model probability) hit at 82.7%