Pro Golf

A compact golf forecaster that beats GPT-5 on held-out golf questions

+17% Brier Skill Score on held-out golf questions
41% lower calibration error than GPT-5
5% lower Brier score than GPT-5
Source: Model card

We built a public golf forecasting dataset and trained Golf-Forecaster on resolved tournament questions. The model card reports better Brier score, Brier Skill Score, and calibration than GPT-5 on temporally held-out golf questions.
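For readers unfamiliar with the three metrics named above, the following is a minimal sketch of how they are commonly computed for binary forecasts. It is not taken from the model card: the function names, the choice of reference forecast for the skill score, and the equal-width 10-bin calibration error are all assumptions made for illustration, and the model card may define these quantities differently.

```python
from typing import Sequence


def brier_score(probs: Sequence[float], outcomes: Sequence[int]) -> float:
    """Mean squared error between forecast probabilities and 0/1 outcomes."""
    if len(probs) != len(outcomes) or len(probs) == 0:
        raise ValueError("probs and outcomes must be non-empty and equal length")
    return sum((p - o) ** 2 for p, o in zip(probs, outcomes)) / len(probs)


def brier_skill_score(
    probs: Sequence[float],
    outcomes: Sequence[int],
    reference_probs: Sequence[float],
) -> float:
    """1 - BS_model / BS_reference; positive values beat the reference.

    The reference forecast is an assumption here (for example a constant
    base rate); the source does not say which reference it uses.
    """
    reference = brier_score(reference_probs, outcomes)
    if reference == 0:
        raise ValueError("reference Brier score is zero; skill is undefined")
    return 1.0 - brier_score(probs, outcomes) / reference


def expected_calibration_error(
    probs: Sequence[float], outcomes: Sequence[int], n_bins: int = 10
) -> float:
    """Weighted mean gap between average forecast and observed frequency
    across equal-width probability bins (an assumed, common definition)."""
    if len(probs) != len(outcomes) or len(probs) == 0:
        raise ValueError("probs and outcomes must be non-empty and equal length")
    bins = [[] for _ in range(n_bins)]
    for p, o in zip(probs, outcomes):
        index = min(int(p * n_bins), n_bins - 1)  # p == 1.0 goes in the last bin
        bins[index].append((p, o))
    error = 0.0
    for members in bins:
        if not members:
            continue
        mean_forecast = sum(p for p, _ in members) / len(members)
        observed_rate = sum(o for _, o in members) / len(members)
        error += len(members) / len(probs) * abs(mean_forecast - observed_rate)
    return error


if __name__ == "__main__":
    # Made-up forecasts and outcomes, not data from the golf dataset.
    forecasts = [0.8, 0.2, 0.6, 0.1]
    results = [1, 0, 1, 0]
    baseline = [0.5, 0.5, 0.5, 0.5]
    print("Brier score:", brier_score(forecasts, results))
    print("Brier Skill Score:", brier_skill_score(forecasts, results, baseline))
    print("Calibration error:", expected_calibration_error(forecasts, results))
```

Lower is better for Brier score and calibration error, while higher is better for Brier Skill Score, which is why the headline figures above are stated as reductions for the first two and as a gain for the third.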

What we did


Read more

Primary artifacts for this case study.