Xiaohu AI Daily — 2026-05-10

🌟 Today's Headline

Fields Medalist says ChatGPT 5.5 Pro delivered "PhD-level" math research in under two hours with zero human help

Fields Medalist Timothy Gowers used ChatGPT 5.5 Pro on open number theory problems, with the model improving an exponential bound to polynomial in under an hour using what an MIT researcher called 'completely original' reasoning, demonstrating AI's capability for independent cutting-edge mathematical contributions.

💬 Editor's Note

The breakthrough isn't speed—it's legitimacy. When mathematics' highest authority validates AI's original, independent thinking, we're not celebrating a tool anymore. AI has graduated to 'researcher,' and the field's entire framework for what's possible must shift.

Nvidia has already committed $40B to equity AI deals this year

9/10 News

Nvidia continues to be a big investor in the AI ecosystem.

Google 把 Fitbit Air 的全新 Google Health API 直接开放了！昨天 Fitbit Air 刚刚发布，但更重磅的是它自带了全新的 @googlehealth AP…

9/10 New Product

Google 随新款 Fitbit Air 发布了全新的 Health API 并向开发者开放。该 API 提供了涵盖运动、睡眠、心率、血氧等维度的 31 种健康数据点，支持 Webhooks 实时数据推送、精细的读写权限控制以及按时间范围查询和汇总数据。

Introducing Pareto Code： a new， free， experimental coding router Set `min_coding_score` in your req…

9/10 New Product

推出帕累托代码：一款全新、免费、实验性的编码路由工具在请求中设置 `min_coding_score`，即可路由至符合您标准且成本最低的编码模型，排名由 @ArtificialAnlys 提供。实时查看帕累托前沿的变化👇

Ranked No. 1 in benchmarks. Lightning speed. Native A/V sync. The era of waiting in line for AI vi…

9/10 New Product

基准测试排名第一。闪电速度。原生音视频同步。排队等待AI视频的时代结束了。HappyHorse现已在阿里云Model Studio上线。当别人还在渲染时，你已完成。立即构建：https：//int.alibabacloud.com/m/1000412167/

"OncoAgent： A Dual-Tier Multi-Agent Framework for Privacy-Preserving Oncology Clinical Decision Support"

7/10 Tech

研究团队发布了开源肿瘤临床决策支持系统OncoAgent。该系统采用双层多智能体框架，结合LangGraph拓扑与四阶段Corrective RAG流程，检索超过70份权威临床指南。系统根据查询复杂度，将任务路由至9B参数的速度优化模型或27B参数的深度推理模型，两者均通过QLoRA在AMD MI3…

SpaceXAI 正式官宣了！ @xFreeze 贴出 USPTO 商标申请截图，"SpaceXAI" 已于 2026 年 5 月 6 日正式提交申请（序列号 99808217），目前处于 LIV…

7/10 Industry

商标申请文件显示，"SpaceXAI"已于2026年5月6日提交申请，目前状态为待审查。该日期与Elon Musk宣布将xAI并入SpaceX的时间点吻合，标志着xAI的AI能力将与SpaceX的航天业务进行品牌统一，旨在将打造多行星文明与发展超级智能两大目标合并于单一实体之下。

New Product

ZAYA1-8B Technical Report

Zyphra presents ZAYA1-8B, a reasoning-focused mixture-of-experts model with 700M active parameters from 8B total, trained on AMD infrastructure. It matches or exceeds DeepSeek-R1-0528 on math and coding benchmarks despite having under 1B active parameters.

v2.1.138

Claude Code released version 2.1.138, a routine maintenance update focused on internal improvements and stability enhancements. This point release does not introduce any new user-facing features or major functionality changes.

v0.30.0-rc11

Ollama v0.30.0-rc11 release candidate brings critical fixes for Windows build systems and developer workflows. Specifically, it resolves issues where compiler paths containing spaces would cause build failures, a widespread problem on Windows machines where default installation directories often include spaces in their names. These path issues have prevented successful compilation for many users.

Opinion

DBMSolver: A Training-free Diffusion Bridge Sampler for High-Quality Image-to-Image Translation

DBMSolver is a training-free sampler for Diffusion Bridge Models that accelerates image-to-image translation by exploiting semi-linear SDE/ODE structure through exponential integrators, achieving 1st and 2nd-order solutions while significantly reducing required function evaluations (NFEs).

Horizon-Constrained Rashomon Sets for Chaotic Forecasting

Introduces horizon-constrained Rashomon sets, a theoretical framework characterizing how model multiplicity evolves with prediction horizon in chaotic systems, showing exponential growth unlike static prediction tasks, providing new insights into forecasting under uncertainty.

Robustness of Graph Self-Supervised Learning to Real-World Noise: A Case Study on Text-Driven Biomedical Graphs

Examines robustness of Graph Self-Supervised Learning (GSSL) methods trained on automatically extracted knowledge graphs from text containing real-world noise, filling a gap in prior research that assumed clean, curated graph data.

Industry

Auction-Based Regulation for Artificial Intelligence

Proposes a rigorous mathematical framework for AI regulation based on auction mechanisms, addressing gaps in regulatory approaches to AI safety, bias, and legal compliance, offering structured methodology for governing AI deployment.

Intelligent CCTV for Urban Design: AI-Based Analysis of Soft Infrastructure at Intersections

Leverages existing CCTV networks and computer vision to measure real-world impact of urban design interventions (temporary pedestrian refuges, curb extensions) on vehicle speed and safety. Deep learning models enabled perspective-corrected speed analysis before and after each intervention.

Tutorial

BioMedArena: An Open-source Toolkit for Building and Evaluating Biomedical Deep Research Agents

BioMedArena is an open-source toolkit that simplifies building and evaluating biomedical deep research agents by providing unified evaluation harness and tool registry, reducing per-paper engineering overhead and enabling more efficient foundation model integration.

MTL-MAD: Multi-Task Learners are Effective Medical Anomaly Detectors

MTL-MAD uses multiple self-supervised and pseudo-labeling tasks within a Mixture-of-Experts framework for medical image anomaly detection without anomaly samples during training, achieving state-of-the-art performance through proxy task integration.

Fast and Efficient Gossip Algorithms for Robust and Non-smooth Decentralized Learning

Develops gossip-based algorithms for decentralized learning on resource-constrained edge devices that are communication-efficient and robust to data corruption, combining benefits of prior methods that previously required tradeoffs.

📭Skip Today

Auto-filtered. Here's why — so you know you're not missing out:

DBMSolver: A Training-free Diffusion Bridge Sampler for High-Quality Image-to-Image Translation
→ Single-source paper, low reader value
0.131.0-alpha.4
→ Minor alpha/beta/rc release, no new feature
rust-v0.131.0-alpha.3
→ Minor alpha/beta/rc release, no new feature
0.131.0-alpha.2
→ Minor alpha/beta/rc release, no new feature
BioMedArena: An Open-source Toolkit for Building and Evaluating Biomedical Deep Research Agents
→ Single-source paper, low reader value
Horizon-Constrained Rashomon Sets for Chaotic Forecasting
→ Single-source paper, low reader value
Robustness of Graph Self-Supervised Learning to Real-World Noise: A Case Study on Text-Driven Biomedical Graphs
→ Single-source paper, low reader value
Steering Visual Generation in Unified Multimodal Models with Understanding Supervision
→ Single-source paper, low reader value

Subscribe to Xiaohu AI Daily