About — Lightning Rod Labs

Apr 1, 2026 Research

Forecasting supply chain disruptions with foresight learning

Foresight learning trains LLMs to generate calibrated probability forecasts of rare supply chain disruptions, outperforming GPT-5 in accuracy, calibration, and precision — with structured probabilistic reasoning emerging from training alone.

Calibration reliability diagram — trained model vs GPT-5 and base model

Mar 2026 Benchmark

Foresight-v3 becomes the #1 AI forecaster

Foresight-v3 ranks first overall on ProphetArena — an independent AI forecasting benchmark from UChicago — by Brier score, outperforming GPT-5, Gemini 3 Pro, and every frontier model. Also #1 in Sports.

ProphetArena overall leaderboard — Foresight V3 #1 by Brier Score

Feb 2026 Benchmark

#1 on ProphetArena Sports

Foresight-32B beats every other model at predicting sports outcomes on ProphetArena, a live prediction market leaderboard — with 105.9% Market Return, ahead of GPT-5.2, Minimax M2, Gemini 3 Pro, and Qwen3-235B.

ProphetArena Sports leaderboard — Foresight V1 32B #1

Jan 29, 2026 Benchmark

Foresight-32B outperforms frontier models on ForecastBench

Top 5 on the ForecastBench tournament, outperforming Gemini 3 Pro, Claude Sonnet 4.5, and o3.

Jan 27, 2026 Research

Foresight-tuned 32B model outperforms GPT-5 at predicting public company risks

Foresight learning on raw SEC filings trains a 32B parameter model to beat GPT-5 in accuracy & calibration at predicting public company risks. Deployable on a single GPU for maximum data privacy.

SEC Risk: Brier Score, Brier Skill Score, ECE, and Calibration Reliability Diagram

Jan 9, 2026 Core Method

Future-as-Label enables scalable RL

We show that AI can learn directly from real-world outcomes at unlimited scale, no human annotation required. The future itself becomes the training signal. Improved Brier scores 27% and halved calibration error, outperforming Qwen3-235B with a 32B model.