New Research Tools Enhance Forecasting Accuracy for Agents

Published on April 30, 2026

Forecasting in fields like economics and politics has long relied on historical data and intuitive judgment. Conventional benchmarks focus mainly on accuracy, often sidelining the underlying reasoning that leads to precise forecasts. This gap creates challenges in understanding why some forecasting agents consistently outperform others.

Recent developments have introduced Bench to the Future 2 (BTF-2), a comprehensive benchmarking tool featuring 1,417 pastcasting questions. This innovative framework allows agents to utilize a static 15 million-document research corpus to generate forecasts while documenting their reasoning processes. In doing so, BTF-2 uncovers subtle accuracy differences among agents, identifying strengths and weaknesses in research and judgment.

The results are striking; BTF-2 highlighted a variance of 0.004 Brier score among agents, indicating significant disparities in forecasting capabilities. One advanced forecaster outperformed all known frontier agents by 0.011 Brier. The study reveals that superior accuracy stems from thorough pre-mortem analysis and a better understanding of unexpected events, or black swans.

This research not only establishes a new standard for evaluating forecasting agents but also exposes strategic reasoning failures in existing models. Experts noted deficiencies in assessing the motivations of political and business leaders, suggesting that better training and methods could significantly enhance the reliability of forecasts in complex environments.

Related News