🌟 Today's Headline
Fields Medalist says ChatGPT 5.5 Pro delivered "PhD-level" math research in under two hours with zero human help
Fields Medalist Timothy Gowers used ChatGPT 5.5 Pro on open number theory problems, with the model improving an exponential bound to polynomial in under an hour using what an MIT researcher called 'completely original' reasoning, demonstrating AI's capability for independent cutting-edge mathematical contributions.
💬 Editor's Note
The breakthrough isn't speed—it's legitimacy. When mathematics' highest authority validates AI's original, independent thinking, we're not celebrating a tool anymore. AI has graduated to 'researcher,' and the field's entire framework for what's possible must shift.
New Product
Zyphra presents ZAYA1-8B, a reasoning-focused mixture-of-experts model with 700M active parameters from 8B total, trained on AMD infrastructure. It matches or exceeds DeepSeek-R1-0528 on math and coding benchmarks despite having under 1B active parameters.
Claude Code released version 2.1.138, a routine maintenance update focused on internal improvements and stability enhancements. This point release does not introduce any new user-facing features or major functionality changes.
Ollama v0.30.0-rc11 release candidate brings critical fixes for Windows build systems and developer workflows. Specifically, it resolves issues where compiler paths containing spaces would cause build failures, a widespread problem on Windows machines where default installation directories often include spaces in their names. These path issues have prevented successful compilation for many users.
Opinion
DBMSolver is a training-free sampler for Diffusion Bridge Models that accelerates image-to-image translation by exploiting semi-linear SDE/ODE structure through exponential integrators, achieving 1st and 2nd-order solutions while significantly reducing required function evaluations (NFEs).
Introduces horizon-constrained Rashomon sets, a theoretical framework characterizing how model multiplicity evolves with prediction horizon in chaotic systems, showing exponential growth unlike static prediction tasks, providing new insights into forecasting under uncertainty.
Examines robustness of Graph Self-Supervised Learning (GSSL) methods trained on automatically extracted knowledge graphs from text containing real-world noise, filling a gap in prior research that assumed clean, curated graph data.
Tutorial
BioMedArena is an open-source toolkit that simplifies building and evaluating biomedical deep research agents by providing unified evaluation harness and tool registry, reducing per-paper engineering overhead and enabling more efficient foundation model integration.
MTL-MAD uses multiple self-supervised and pseudo-labeling tasks within a Mixture-of-Experts framework for medical image anomaly detection without anomaly samples during training, achieving state-of-the-art performance through proxy task integration.
Develops gossip-based algorithms for decentralized learning on resource-constrained edge devices that are communication-efficient and robust to data corruption, combining benefits of prior methods that previously required tradeoffs.