Xiaohu AI Daily — 2026-05-15

🌟 Today's Headline

Claude Launches Metered API Pricing with Monthly Subscription Credits

Anthropic has restructured Claude's pricing to separate interactive use from programmatic use. Under the new policy, every Claude subscription includes monthly API token credits equal to the subscription's dollar value. For example, a $200/month subscriber receives access to Claude on Anthropic-owned platforms (Claude.ai, Claude Code) with full interactive usage limits, plus $200 in API credits for programmatic use on third-party platforms such as OpenClaw. Although positioned as offering clearer value, this is a significant shift from the earlier arrangement, under which subscribers effectively received 70-90% discounts off standard API rates. The change standardizes limits across platforms, replacing earlier restrictions that targeted specific harnesses. Some users see this as a cut to prior subsidies, but the official policy brings a transparency and consistency that the earlier selective targeting lacked.
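To make the size of the shift concrete, here is a rough back-of-the-envelope sketch. It assumes that a "70-90% discount" means programmatic usage was effectively billed at 10-30% of standard API rates; that reading, and the function name, are illustrative assumptions rather than anything stated by Anthropic.

```python
def standard_rate_equivalent(spend_usd: float, discount: float) -> float:
    """Standard-rate API value that `spend_usd` buys at a fractional discount.

    Assumption: a discount of 0.70 means paying 30% of the standard rate.
    """
    if not 0 <= discount < 1:
        raise ValueError("discount must be in [0, 1)")
    return spend_usd / (1 - discount)


subscription = 200.0
new_credits = subscription  # new policy: credits equal the subscription price

for discount in (0.70, 0.90):
    old_value = standard_rate_equivalent(subscription, discount)
    print(
        f"{discount:.0%} discount: about ${old_value:,.0f} of standard-rate usage "
        f"before vs ${new_credits:,.0f} in credits now"
    )
```

Under that assumption, the same $200 previously stretched to somewhere between roughly three and ten times its face value in programmatic usage, which is why some users read the change as a subsidy cut.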

💬 Editor's Note

Bundling API credits into subscriptions helps Anthropic retain subscribers while signaling that API pricing may be restructured next. It is a smart move for protecting paid users, but API-only developers should brace for compensatory rate changes.

Product

Anthropic Releases Opus 4.7 Fast Mode with 2.5x Speed Improvement

10/10 New Product

Anthropic released fast mode for Claude Opus 4.7, achieving 2.5x faster performance while maintaining the model's depth of reasoning. Internal testing at Every reveals that Opus 4.7 has become noticeably sharper—it proactively suggests workflow optimizations (like using multiple terminals for parallel work) and excels at creative writing and planning tasks.

New Claude Mythos becomes the first AI model to clear all cyberattack simulations from Britain's AI safety agency

9/10 News

The UK AI Security Institute has revised its estimate of the doubling time for AI cyber capability twice, the first time from 8 months to 4.7 months. Anthropic's Claude Mythos Preview and OpenAI's GPT-5.5 have now outpaced even the accelerated timeline.

Ollama v0.30.0

9/10 New Product

Ollama v0.30.0 restructures its architecture to directly support llama.cpp instead of building on GGML, enabling full GGUF file format compatibility. MLX is integrated for Apple Silicon acceleration, providing performance improvements and optimized memory utilization. The update includes performance testing and stability improvements.

Ollama v0.24.0

9/10 Tutorial

Ollama v0.24.0 introduces improved application restart functionality for the Codex integration, enhancing stability when deploying Codex models within the Ollama framework. This maintenance release addresses reliability concerns in long-running sessions and improves error recovery.

Announcing Genkit Middleware: Intercept, extend, and harden your agentic apps

9/10 New Product

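Google's open-source framework Genkit has launched its core middleware system, designed to make agentic AI applications more reliable and controllable. The system lets developers intercept calls at the generation, model, and tool layers to inject custom behavior such as retries, model fallback, and human-in-the-loop tool approval. By creating and stacking custom middleware, developers can exert deterministic control over model output.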
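The stacking idea can be illustrated with a small, framework-agnostic sketch. This is not Genkit's API: every name below (`with_retry`, `with_fallback`, `stack`, `make_flaky`, `backup_model`) is hypothetical, and a "model call" is reduced to a plain function from prompt to string.

```python
from typing import Callable

# A "model call" is any function from a prompt to a response string.
ModelCall = Callable[[str], str]
Middleware = Callable[[ModelCall], ModelCall]


def with_retry(max_attempts: int) -> Middleware:
    """Retry the wrapped call up to max_attempts times on any exception."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")

    def middleware(call: ModelCall) -> ModelCall:
        def wrapped(prompt: str) -> str:
            last_error: Exception | None = None
            for _ in range(max_attempts):
                try:
                    return call(prompt)
                except Exception as err:  # broad on purpose for the sketch
                    last_error = err
            assert last_error is not None
            raise last_error
        return wrapped
    return middleware


def with_fallback(backup: ModelCall) -> Middleware:
    """If the wrapped call fails, answer with the backup model instead."""
    def middleware(call: ModelCall) -> ModelCall:
        def wrapped(prompt: str) -> str:
            try:
                return call(prompt)
            except Exception:
                return backup(prompt)
        return wrapped
    return middleware


def stack(call: ModelCall, *middlewares: Middleware) -> ModelCall:
    """Wrap `call` so that the first middleware listed is the outermost layer."""
    for middleware in reversed(middlewares):
        call = middleware(call)
    return call


def make_flaky(failures: int) -> ModelCall:
    """Toy primary model that fails `failures` times before succeeding."""
    remaining = {"count": failures}

    def call(prompt: str) -> str:
        if remaining["count"] > 0:
            remaining["count"] -= 1
            raise RuntimeError("transient failure")
        return f"primary: {prompt}"
    return call


def backup_model(prompt: str) -> str:
    return f"backup: {prompt}"


# Fallback is the outer layer, retry the inner one.
recovers = stack(make_flaky(2), with_fallback(backup_model), with_retry(3))
gives_up = stack(make_flaky(5), with_fallback(backup_model), with_retry(3))
print(recovers("hello"))
print(gives_up("hello"))
```

Ordering matters in this design: with retry as the inner layer, the backup model is consulted only after the primary has exhausted its attempts.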

inclusionAI/ARGenSeg-8B

9/10 New Product

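The inclusionAI team has released the ARGenSeg-8B model as part of its effort to advance and broaden access to AI through open source and open science. The release emphasizes democratizing the technology so that a wider community can take part in AI research and application. The open-source strategy is meant to foster collaborative innovation, speed the adoption of AI tools across varied scenarios, lower technical barriers, and push the industry ecosystem toward openness.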

🕐 ~9 min read · Industry 9/10

Anthropic Overtakes OpenAI as #1 Business AI Platform

💡 Industry trends and analysis

According to Ramp's May 2026 AI Index, Anthropic has officially surpassed OpenAI in enterprise adoption for the first time. Anthropic reached 34.4% adoption among Ramp's tracked U.S. businesses, exceeding OpenAI's 32.3%. This marks a dramatic reversal from May 2025, when Anthropic held only 8% adoption while OpenAI led with 32%. The surge is attributed primarily to Claude Code's expansion beyond technical teams into finance, legal, and research workflows. Ramp tracks payments from 50,000+ U.S. businesses, providing a reliable spending signal. However, Ramp noted risks facing Anthropic despite the trend, including recent Claude outages and cost comparisons showing Anthropic becoming more expensive than OpenAI and open-source alternatives. Despite these headwinds, the adoption swing reflects significant market confidence in Claude's capabilities and deployment options.
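The size of the swing is easier to judge with the arithmetic spelled out. The sketch below uses only the adoption figures quoted above; the helper names are illustrative.

```python
# Adoption among Ramp-tracked U.S. businesses, in percent (figures from the text).
anthropic_2025, anthropic_2026 = 8.0, 34.4
openai_2025, openai_2026 = 32.0, 32.3


def point_change(before: float, after: float) -> float:
    """Change in percentage points."""
    return after - before


def growth_multiple(before: float, after: float) -> float:
    """How many times larger the later share is than the earlier one."""
    return after / before


print(f"Anthropic: {point_change(anthropic_2025, anthropic_2026):+.1f} points, "
      f"{growth_multiple(anthropic_2025, anthropic_2026):.1f}x its May 2025 share")
print(f"OpenAI:    {point_change(openai_2025, openai_2026):+.1f} points")
print(f"Current lead: {point_change(openai_2026, anthropic_2026):.1f} points")
```

Read this way, OpenAI's share was essentially flat over the year, so the reversal comes almost entirely from Anthropic's growth, and the resulting lead is narrow.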

🕐 ~3 min read · Industry 7/10

PwC is deploying Claude to build technology, execute deals, and reinvent enterprise functions for clients

💡 Industry trends and analysis

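PwC and Anthropic announced an expanded strategic alliance that will roll out Claude AI tools to hundreds of thousands of employees worldwide. The two will jointly establish a center of excellence and train and certify 30,000 professionals. The partnership focuses on three high-leverage areas: building agentic technology, AI-native deal execution, and reinventing enterprise functions. PwC has already set up a Claude-based finance business group. In practice, Claude has cut delivery times by up to 70% in areas including insurance underwriting and cybersecurity, for example compressing the insurance underwriting cycle from ten weeks to ten days.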

🕐 ~3 min read · Tutorial 7/10

Accelerating on-device AI: A look at Arm and Google AI Edge optimization

💡 Can be adapted into tutorial material

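Arm's second-generation Scalable Matrix Extension (SME2), integrated with the Google AI Edge software stack, turns the CPU into a powerful matrix-compute accelerator, enabling high-performance on-device generative AI. Using Stability AI's "stable-audio-open-small" model as an example, the article describes an automated "convert, optimize, deploy" hardware-acceleration pipeline built on LiteRT, XNNPACK, and KleidiAI. On Arm-based mobile devices and laptops, the approach delivers more than 2x faster audio generation and 4x lower memory use while preserving high audio quality. The integration offers a practical path to running complex AI models efficiently on resource-constrained edge devices.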

🕐 ~3 min read · Industry 7/10

With more than $100 billion spent on OpenAI, Nadella says "no one was willing to bet" when Microsoft first invested

💡 Industry trends and analysis

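In the Musk v. Altman trial, Microsoft's head of corporate development confirmed that Microsoft's cumulative spending on OpenAI has exceeded $100 billion, including the original $13 billion investment and substantial Azure infrastructure costs. The partnership has brought Microsoft roughly $30 billion in revenue. CEO Nadella said Microsoft took on the risk when "no one was willing to bet." The two companies have renewed a non-exclusive agreement under which Microsoft no longer pays a revenue share and OpenAI's revenue share is capped at a cumulative $38 billion through 2030, saving about $97 billion relative to the original agreement. Microsoft is also evaluating acquisitions of AI startups to strengthen its talent base and is shifting resources toward in-house models and superintelligence.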

🕐 ~3 min read · Tutorial 7/10

http://x.com/i/article/2054823397448712192

💡 Can be adapted into tutorial material

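Senior developers and business teams differ fundamentally in how they see the work. Business teams live in a loop of "eliminating uncertainty": they want fast trial-and-error validation, and what matters most is speed. Senior developers live in a loop of "managing complexity": their core duty is keeping a paid service stable over the long term, so they are highly wary of anything that adds complexity to the system. Communication breaks down when developers reject a request on the grounds of "controlling complexity" without addressing the business side's urgent need to "eliminate uncertainty." The fix is for developers to package their professional skills, such as trimming requirements and reusing code, as ways to help the business "get answers faster," for example by asking, "Can we try a faster way?" Although AI can generate code quickly, the irreplaceable value of senior developers lies in "taking responsibility" for the system's long-term stability.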

New Product

Figure AI's Humanoid Robots Complete Autonomous 8-Hour Factory Shift

Figure AI successfully livestreamed an autonomous 8-hour factory shift using multiple Figure 03 humanoid robots powered by the Helix-02 system. The robots performed package sorting tasks by detecting barcodes, picking up packages, and placing them on conveyors at human speed (approximately 3 seconds per package).
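For a sense of scale, the quoted pace can be turned into a per-robot ceiling. This is an idealized upper bound that assumes one robot works the full shift without pauses; the source does not report actual package counts, and the function name is illustrative.

```python
def packages_per_shift(shift_hours: float, seconds_per_package: float) -> int:
    """Upper bound on packages handled by one robot working without pauses."""
    if seconds_per_package <= 0:
        raise ValueError("seconds_per_package must be positive")
    return int(shift_hours * 3600 // seconds_per_package)


# 8-hour shift at roughly 3 seconds per package (figures from the text).
print(packages_per_shift(8, 3))
```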

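.@neilsonks just open-sourced a complete 3D generation toolkit built specifically for Claude Code. Give it a single image and it automatically breaks the whole scene down into an interactive 3D world: environment, meshes, physics, lighting, audio…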

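Developer @neilsonks has open-sourced a complete 3D generation toolkit designed for Claude Code. The tool automatically decomposes a single input image and generates a full interactive 3D scene with environment, meshes, physics, lighting, and audio. Its pipeline first uses image and 3D generation techniques to extract objects and produce high-quality meshes, then removes the objects to obtain a static background, and finally adds physics simulation, real-time light…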

OpenCode x Qwen 3.6 Plus - free, again. Last time y'all treated our capacity like an all-you-can-eat…

OpenCode x Qwen 3.6 Plus is free again. Last time, everyone treated our capacity like an all-you-can-eat buffet. We found more GPUs. Round two.

Opinion

Tokenizer Fertility and Zero-Shot Performance of Foundation Models on Ukrainian Legal Text: A Comparative Study

Researchers benchmark seven foundation models from five providers on 273 Ukrainian legal documents from the state registry (EDRSR). Key finding: tokenizer efficiency varies 1.6x across models, with Qwen3 consuming 60% more tokens than Llama-family models on identical legal text. This directly impacts inference cost, latency, and operational efficiency.
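Tokenizer fertility is commonly defined as the average number of tokens produced per word. The sketch below shows how that metric and its cost implication can be computed; the two chunk-based tokenizers are toy stand-ins invented for illustration, not the tokenizers evaluated in the paper, and the sample sentence is not from its dataset.

```python
import math
from typing import Callable

Tokenizer = Callable[[str], list[str]]


def make_chunk_tokenizer(chunk_size: int) -> Tokenizer:
    """Toy tokenizer (hypothetical): split each word into fixed-size pieces."""
    def tokenize(text: str) -> list[str]:
        tokens = []
        for word in text.split():
            pieces = math.ceil(len(word) / chunk_size)
            tokens.extend(
                word[i * chunk_size:(i + 1) * chunk_size] for i in range(pieces)
            )
        return tokens
    return tokenize


def fertility(tokenize: Tokenizer, text: str) -> float:
    """Average number of tokens per whitespace-separated word."""
    words = text.split()
    if not words:
        raise ValueError("text contains no words")
    return len(tokenize(text)) / len(words)


def relative_cost(fertility_a: float, fertility_b: float) -> float:
    """Cost of A relative to B on the same text at the same per-token price."""
    return fertility_a / fertility_b


sample = "Суд ухвалив рішення у справі"  # illustrative Ukrainian legal phrase
coarse = make_chunk_tokenizer(4)  # stands in for a more efficient tokenizer
fine = make_chunk_tokenizer(2)    # stands in for a less efficient tokenizer

f_coarse = fertility(coarse, sample)
f_fine = fertility(fine, sample)
print(f"fertility: coarse={f_coarse:.2f}, fine={f_fine:.2f}")
print(f"relative token cost: {relative_cost(f_fine, f_coarse):.2f}x")
```

Because API pricing and context limits are both counted in tokens, a fertility ratio like the reported 1.6x carries over directly to per-document cost when per-token prices are equal.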

Self-Distilled Agentic Reinforcement Learning

Researchers propose On-Policy Self-Distillation (OPSD) to enhance reinforcement learning for LLM agents by providing dense token-level guidance from a privileged teacher branch augmented with contextual information. OPSD complements coarse trajectory-level RL signals to improve multi-turn agent stability and address compounding instability in extended interactions.

AI Safety Landscape for Large Language Models: Taxonomy, State-of-the-art, and Future Directions

Comprehensive taxonomy and survey of AI safety for LLMs spanning design, development, adoption, and deployment phases. Addresses emerging challenges in public safety and national security as generative AI proliferates, serving as foundational reference for the field.

Industry

Runway is Coming to Japan

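Generative AI company Runway announced that it is establishing a headquarters in Tokyo, formally entering the Japanese market, with plans to invest an initial $40 million to expand its business. Japan has become one of Runway's fastest-growing markets and is its third-largest market worldwide for enterprise and self-serve customers. Over the past year, the number of Japanese enterprise customers grew 300%, and Japan accounted for one third of Runway's total sales in Asia.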

WSJ: Anthropic's Mythos helped researchers find 2 unknown macOS kernel bugs and turn them into a wor…

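According to The Wall Street Journal, Anthropic's Mythos AI tool helped researchers find two previously unknown macOS kernel vulnerabilities in just five days and chain them into a complete privilege-escalation exploit. The attack targets the lowest-level core of the operating system and, by combining multiple vulnerabilities and techniques, bypasses Apple's memory integrity protections to reach system regions that should be protected.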

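OpenEvidence already reaches 65% of U.S. doctors and covered 27 million clinical encounters in April alone, which works out to about 41 uses per doctor per month, basically every working day. I always assumed it came through hospital system integrations. It turns out doctors sign up themselves with their license…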

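OpenEvidence now reaches 65% of U.S. doctors, with 27 million clinical uses in April alone, an average of 41 uses per doctor per month. Doctors register individually on their phones using their license numbers, and hospitals were initially unaware. Mount Sinai's head of AI called this shadow AI, saying it had long since spread at the grassroots level.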
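The three figures above can be cross-checked against each other. The sketch below assumes the 41-uses average is taken over the doctors the platform reaches, and uses a round 21 working days per month; both are assumptions for illustration.

```python
monthly_uses = 27_000_000   # April clinical uses (from the text)
uses_per_doctor = 41        # average monthly uses per doctor (from the text)
coverage = 0.65             # share of U.S. doctors reached (from the text)
working_days = 21           # assumption: typical working days in a month

# Doctors implied by dividing total uses by the per-doctor average.
active_doctors = monthly_uses / uses_per_doctor

# Total U.S. doctor population implied if those doctors are 65% of the total.
implied_total_doctors = active_doctors / coverage

uses_per_working_day = uses_per_doctor / working_days

print(f"implied doctors on the platform: {active_doctors:,.0f}")
print(f"implied total U.S. doctors:      {implied_total_doctors:,.0f}")
print(f"uses per doctor per working day: {uses_per_working_day:.1f}")
```

The implied total of roughly one million doctors suggests the three numbers are at least internally consistent, and the per-day figure works out to about two uses per working day.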

Tutorial

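Folks, this one is good! Go install it now! Kevin Lin, an Oxford postdoc and former Meta and Microsoft researcher, just released Violin, an open-source video translation Skill. Video is already the absolutely dominant form of content on the internet…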

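Oxford postdoc Kevin Lin has open-sourced Violin, a video translation tool that aims to break down language barriers for high-quality video content. It combines speech recognition, LLM translation, and speech synthesis into an automated pipeline, supports translation between multiple languages, and allows the translation style to be customized, for example turning an academic talk into a version children can follow. Users can also converse directly with the video content and get relevant answers.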

Building a safe, effective sandbox to enable Codex on Windows

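OpenAI has built a secure sandbox for Codex on Windows. By strictly controlling file access permissions and enforcing network restrictions, the sandbox keeps code generation and execution safe. This lets Codex-based coding assistants run efficiently and under control, providing strong programming assistance while isolating potential risks and protecting the user's system.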

Awesome! Hats off to Yetone.

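Developer Yetone turned an article on desktop app development "best practices" into an Agent Skill called "native-feel-skill." The Skill is meant to help developers use a Coding Agent to refactor or build cross-platform desktop apps whose performance and feel come very close to native applications.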

📭Skip Today

Auto-filtered. Here's why — so you know you're not missing out:

Tokenizer Fertility and Zero-Shot Performance of Foundation Models on Ukrainian Legal Text: A Comparative Study
→ Single-source paper, low reader value
Self-Distilled Agentic Reinforcement Learning
→ Single-source paper, low reader value
AI Safety Landscape for Large Language Models: Taxonomy, State-of-the-art, and Future Directions
→ Single-source paper, low reader value
Adapting Vision-Language Models for Neutrino Event Classification in High-Energy Physics
→ Single-source paper, low reader value
An Agentic LLM-Based Framework for Population-Scale Mental Health Screening
→ Single-source paper, low reader value
Streaming Speech-to-Text Translation with a SpeechLLM
→ Single-source paper, low reader value
DisaBench: A Participatory Evaluation Framework for Disability Harms in Language Models
→ Single-source paper, low reader value
Formal Conjectures: An Open and Evolving Benchmark for Verified Discovery in Mathematics
→ Single-source paper, low reader value
