A compact golf forecaster that beats GPT-5 on held-out golf questions
We built a public golf forecasting dataset and trained Golf-Forecaster on resolved tournament questions. The model card reports better Brier score, Brier Skill Score, and calibration than GPT-5 on temporally held-out golf questions.
Primary artifacts for this case study.