Home > , > Project TD Tuning β€” Version 7

Project TD Tuning β€” Version 7

# V7 Context V6 rebuilt how the model measures team strength. V7 actually calibrated it. The difference sounds technical, but the results make it concrete. V5 and V6 both opened at 53% in Week 1. V7 opened at 87%. That gap comes from the weights. Earlier versions used weights that were set before several of the model’s inputs were working properly. Better inputs arrived over time β€” but the model was still interpreting them through an outdated lens. The weights and the inputs weren’t matched. For V7, we fit all of the model’s factors together against the full historical record β€” over 7,000 games going back to 1999. The result is a set of weights that reflects what actually predicts NFL outcomes, not what we assumed would. Some factors turned out to matter far more than we’d given them credit for. Others that seemed important faded into the background once everything was calibrated together. That’s the point β€” the data gets to decide. V7 ran twelve weeks. The model has never gone that far. It stopped on a rolling average after back-to-back quieter weeks, not on a single bad week. That’s the version of failure we want β€” one where the model runs long enough to show you what it actually knows.
Weeks Run
12
Accuracy
65%
Avg Error
10.1 pts
SeasonWeekW/LAccuracyCumulative
2000113/1587%86.67%
200029/1560%73.33%
200039/1464%70.45%
200048/1457%67.24%
2000510/1471%68.06%
200068/1457%66.28%
2000712/1486%69.00%
2000811/1479%70.18%
200097/1450%67.97%
20001010/1567%67.83%
2000118/1553%66.46%
2000128/1553%65.32%

For Science and the love of the game!!! What you do with this data is up to you!!!