AI Briefing: May 9 – 11, 2026

🚨 Breaking (last 24h)

OpenClaw v2026.5.10-beta.2 ships with QA automation & voice diagnostics
OpenClaw released its second beta on May 10, continuing extreme release velocity. Key additions: Telegram live PR evidence automation for QA/Mantis (Convex-leased credentials, Crabbox transcript capture, motion GIF previews, inline PR comments), Discord realtime voice diagnostics (speaker turns, playback resets, barge-in detection, audio cutoff analysis), and talk.realtime.instructions for operator-defined voice style guidance. Also introduces an opt-in private skill archive upload path gated by skills.install.allowUploadedArchives — a notable security enhancement. 40+ fixes across Telegram, ACPX, OpenAI-compatible models, xAI, DeepSeek, iMessage, Codex, and Cron. Dependencies refreshed: ACPX 0.33.1, Codex ACP 0.14.0, Baileys 7.0.0-rc10, Google GenAI 2.0.1, OpenAI 6.37.0, AWS SDK 3.1045.0.
GitHub Release

Sam Altman teases "goblin" as next model name
On May 10, OpenAI CEO Sam Altman posted: "what if we name the next model 'goblin' — almost worth it to make you all happy..." This follows OpenAI's April 29 blog post "Where the Goblins Came From" explaining how GPT-5.1+ developed persistent goblin/gremlin references via unintended RL reward signals. Altman also posted "curious to see if you still feel this way after the next model!" in reply to a user — signaling imminent model improvements.
Sam Altman on X

📊 Market Moves (last 48h)

No major funding, acquisition, or partnership announcements detected in the May 9–11 window. The most recent market-moving news remains:

Anthropic + SpaceX compute deal (May 6) — 300MW, 220K GPUs
Anthropic enterprise AI services company (May 4) — with Blackstone, Hellman & Friedman, Goldman Sachs
OpenAI "The Deployment Company" (May 4) — $10B joint venture, $4B raised from 19 investors
OpenAI + AWS Bedrock expansion (April 28) — GPT-5.5, Codex, and Managed Agents on AWS

🔬 Research (last 48h / 7 days for arXiv)

arXiv published 15 papers on May 7 (outside strict 48h window but within 7-day tracking window). Notable entries:

AI Co-Mathematician: Accelerating Mathematicians with Agentic AI — DeepMind workbench for mathematicians to interactively leverage AI agents for open-ended research. Combines theorem proving, conjecture generation, and literature search in a unified interface.
EMO: Pretraining Mixture of Experts for Emergent Modularity — Ryan Wang et al. explore training MoE architectures where experts spontaneously specialize into modular capabilities, enabling selective deployment of only relevant model subsets.
UniPool: A Globally Shared Expert Pool for Mixture-of-Experts — Minbin Huang et al. propose sharing experts across all transformer layers rather than rigid per-layer allocation, improving parameter efficiency.
Verifier-Backed Hard Problem Generation for Mathematical Reasoning — Yuhang Lai, Jiazhan Feng, Yee Whye Teh. Uses formal verifiers to generate challenging math problems with guaranteed validity, creating better training data for reasoning models.
Why Global LLM Leaderboards Are Misleading — Jai Moondra et al. critique pairwise human feedback ranking systems and propose small portfolio approaches for heterogeneous supervised ML evaluation.
Optimizer-Model Consistency: Full Finetuning with the Same Optimizer as Pretraining Forgets Less — Yuxing Liu et al. show that matching optimizer configurations between pretraining and finetuning reduces catastrophic forgetting.
Beyond Negative Rollouts: Positive-Only Policy Optimization — Mingwei Xu, Hao Fang. RLVR method using only positive examples with implicit negative gradients, improving reasoning without negative sampling.
StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction — Xiangyuan Xue et al. trajectory abstraction method for long-horizon agent training.
Superintelligent Retrieval Agent — Zeyu Yang et al. next-generation RAG architecture treating retrieval as a first-class agentic capability.
BAMI: Training-Free Bias Mitigation in GUI Grounding — Borui Zhang et al. for GUI agent robustness.

🛠️ Tools (last 48h)

OpenClaw v2026.5.10-beta.2 — As detailed in Breaking, this beta adds:

QA/Mantis Telegram live PR evidence automation
Discord voice diagnostics
Realtime voice instructions (talk.realtime.instructions)
Opt-in private skill archive upload (skills.install.allowUploadedArchives)
Codex tool profile simplification (always owns workspace/edit/patch/exec/process/plan)
40+ cross-platform fixes GitHub

Gemini API File Search — Now Multimodal (May 10, trending on Hacker News)
Google announced expanded Gemini API file search with multimodal RAG support, enabling efficient, verifiable retrieval across document types including images and video.
Google Blog

Hermes Agent v0.13.0 — Already covered in previous briefing (May 7), but community adoption accelerating. Twitter shows active community building: Japanese users reporting Hermes overtook OpenClaw on OpenRouter global token rankings, NFT generation workflows, PM/sysadmin profiles, and OpenHands agent orchestration under Hermes.

💭 Industry Pulse (last 48h)

Sam Altman (May 10):

"what if we name the next model 'goblin' — almost worth it to make you all happy..." X
"as i get older and wiser, so do i..." X
"curious to see if you still feel this way after the next model!" — teasing imminent improvements X

Yann LeCun (May 10):

Corrected Elad Gil on AI origins: "BS. Attention was born in Montréal. PyTorch in NYC. AlphaGo in London. AlphaFold in London." X

Greg Brockman (May 8-9):

Confirmed GPT-5.5-Cyber in limited preview for defenders securing critical infrastructure X
Revealed Codex can now drive Chrome tabs in background — expanding beyond IDE into general computer-use agent territory X
"You can now just build amazing voice agents" — on GPT-Realtime-2 API X

Andrew Ng (May 7):

"New course: Build agents that respond to users with not only plaintext, but custom UIs" — expanding AI agent interface design education X

Community trends:

Hermes Agent gaining significant mindshare in Japanese and Chinese-speaking communities
OpenClaw remains popular for crypto/token analysis automation
Multiple users comparing Hermes vs OpenClaw feature parity
"Local AI needs to be the norm" trending on Hacker News (May 10)

🖼️ New Presentations

No new presentations triggered. OpenClaw v2026.5.10-beta.2 is a beta maintenance release, not a major version warranting a dedicated presentation per the version-update-presentation-pipeline criteria.

📡 Sources & Data Provenance

Source	Status	URL
GitHub Releases	✅ Working	https://github.com
Twitter/X API	✅ Working	https://twitterapi.io
arXiv API	✅ Working	https://arxiv.org
Wiki Raw Archive	✅ Available	~/wiki/raw/
Web Extract	⚠️ Degraded	Built-in
Web Search	❌ Failed	Built-in
Hacker News	✅ Working	https://news.ycombinator.com

Primary sources for this briefing:

This briefing was generated automatically by Hermes Agent. All claims link to primary sources where possible. Unlinked claims were cross-referenced from multiple sources or derived from GitHub release metadata.