A compact clinical forecaster trained directly from raw patient notes
We tuned GPT-OSS-120B into a compact clinical forecaster that predicts clinical events from raw MIMIC-III notes, using later patient records to resolve the outcomes. The paper reports a Brier Skill Score of 27%, about 70% lower calibration error than the base model, and a slightly better Brier score than GPT-5.
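For readers unfamiliar with the metrics quoted above, the following is a minimal sketch of how they are commonly computed for binary outcomes. It is not the paper's evaluation code. Two assumptions are made here: the Brier Skill Score uses a constant base-rate forecast as its reference, and calibration error is measured as expected calibration error (ECE) over equal-width bins. The paper may use a different reference forecast or binning scheme. All function names are illustrative.

```python
from typing import Sequence


def brier_score(probs: Sequence[float], outcomes: Sequence[int]) -> float:
    """Mean squared error between forecast probabilities and 0/1 outcomes."""
    return sum((p - y) ** 2 for p, y in zip(probs, outcomes)) / len(probs)


def brier_skill_score(probs: Sequence[float], outcomes: Sequence[int]) -> float:
    """1 - BS / BS_ref, where BS_ref is a constant base-rate forecast.

    Assumption: the base rate is taken from the evaluation outcomes themselves.
    """
    base_rate = sum(outcomes) / len(outcomes)
    reference = brier_score([base_rate] * len(outcomes), outcomes)
    if reference == 0.0:
        raise ValueError("all outcomes are identical; skill score is undefined")
    return 1.0 - brier_score(probs, outcomes) / reference


def expected_calibration_error(
    probs: Sequence[float], outcomes: Sequence[int], n_bins: int = 10
) -> float:
    """Weighted mean gap between average forecast and observed frequency per bin.

    Assumption: equal-width bins on [0, 1]; a forecast of 1.0 goes in the last bin.
    """
    bins = [[] for _ in range(n_bins)]
    for p, y in zip(probs, outcomes):
        index = min(int(p * n_bins), n_bins - 1)
        bins[index].append((p, y))

    total = len(probs)
    ece = 0.0
    for members in bins:
        if not members:
            continue
        mean_forecast = sum(p for p, _ in members) / len(members)
        observed_rate = sum(y for _, y in members) / len(members)
        ece += (len(members) / total) * abs(mean_forecast - observed_rate)
    return ece


# Toy data, invented purely for illustration (not from MIMIC-III).
toy_probs = [0.9, 0.8, 0.2, 0.1]
toy_outcomes = [1, 1, 0, 0]
toy_bs = brier_score(toy_probs, toy_outcomes)
toy_bss = brier_skill_score(toy_probs, toy_outcomes)
toy_ece = expected_calibration_error(toy_probs, toy_outcomes)
```

Under this definition, a Brier Skill Score of 27% means the forecaster's Brier score is 27% lower than that of the reference forecast, and a lower calibration error means the stated probabilities track observed event frequencies more closely.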
Primary artifacts for this case study.