The Opus 4.7 Rate Limit Rescue
May 6, 2026 · Research Analysis
Sources: Anthropic Engineering Postmortem, Business Insider, CNBC, WSJ, Reddit r/ClaudeCode, X/Twitter
1) Doubling Claude Code 5-hour rate limits for Pro, Max, Team
2) Removing peak-hours limit reduction for Pro & Max
3) Substantially raising API rate limits for Opus models
Compute Added
300MW+
220K+ NVIDIA GPUs online within the month
Rate Limits
2×
Claude Code limits doubled for paid tiers
Peak Hours
0×
Throttling removed for Pro & Max
This does not fix the Opus 4.7 model itself. It fixes the infrastructure bottleneck that made it unusable in practice.
New tokenizer increases token count by 1.0–1.35× for the same prompt. Same pricing per token = silent price hike.
Pro subscribers reported hitting limits after ~3 Opus messages per 5-hour window. Peak hours cut limits further.
Users reported hallucinations, combative behavior, "laziness," broken adaptive reasoning, and a 3% coding eval drop.
Anthropic's April 23 postmortem traced quality issues to three separate product changes, not the model weights.
high → medium. Made Claude feel "dumber." Reverted Apr 7
clear_thinking_20251015 dropped reasoning on every turn. Claude appeared forgetful. Fixed Apr 10 (v2.1.101)
⚠️ Key Insight
The API and inference layer were not affected. All issues were in Claude Code's client-side defaults and system prompts. Resolved April 20, 2026.
💡 The Hidden Claude Code Tax
The March 26 caching bug caused continuous cache misses, draining usage limits faster than expected even before the token inflation.
Anthropic will use all compute capacity at SpaceX's Colossus 1 data center in Memphis, Tennessee.
300MW+
Power Capacity
220K+
NVIDIA GPUs
<30d
Time to Online
Amazon · Up to 5 GW (~1 GW by end of 2026)
Google & Broadcom · 5 GW beginning 2027
Microsoft & NVIDIA · $30B Azure capacity
Fluidstack · $50B US AI infrastructure
SpaceX · 300MW+ immediate (Colossus 1)
Hardware diversity: AWS Trainium, Google TPUs, and NVIDIA GPUs.
💡 No Action Required
Limits updated automatically for all paid plans. Free tier is unchanged.
Pro ($20/mo)
Previously the most constrained. Doubling + removing peak cuts makes it viable for serious development without upgrading to Max.
Max ($100–200/mo)
Rate caps should now be rarely hit, except in extremely long autonomous agent sessions. Predictable workflow.
Team / Enterprise
Combined capacity increase is substantial for multi-developer environments running concurrent Claude Code sessions.
API Users
Higher Opus caps may allow simplifying batched architectures and reducing queue complexity.
❌ New tokenizer (1.0–1.35× token inflation)
❌ Sampling controls removed (temp, top_p, top_k)
❌ Adaptive thinking hidden by default
❌ Personality shift: more literal, less helpful
❌ Opus 4.5 permanently removed
✅ Rate limits doubled
✅ Peak-hour throttling gone
✅ 300MW+ new compute online
✅ Quality bugs already fixed (Apr 20)
✅ Usage limits reset for subscribers
⚠️ The Verdict
The model itself is not changing. What changes is the infrastructure envelope around it. The May 6 update makes Opus 4.7 practically usable by removing the rate-limit chokepoint. The quality issues were already resolved in the April 20 patch.
OpenAI Codex
Shipped /goal command last week — persistent workflow feature competing directly with Claude Code's agent mode.
Pricing: Pay-per-task (usage-based)
Claude Code
Doubling limits effectively halves per-message cost for heavy Pro/Max users, strengthening the flat-fee value prop.
Pricing: Flat monthly fee with rate limits
This is a direct competitive response to Codex. Revenue from SpaceX compute deal → more capacity → happier developers → retention.
Anthropic has "expressed interest" in partnering with SpaceX to develop multiple gigawatts of orbital AI compute capacity.
🛰️ Why Space?
Bypass terrestrial constraints: power, land, cooling. Remains speculative and years from reality, but signals the severity of the physical infrastructure bottleneck.
Elon Musk merged xAI into SpaceX earlier this year. Despite prior attacks on Anthropic, revenue wins over rivalry in 2026 compute economics.
Developers will see fewer mid-session blocks, predictable workflows, and longer uninterrupted agent sessions. The Pro tier becomes genuinely usable for professional work.
Opus 4.7's tokenizer, API breaking changes, and personality shift are unchanged. Developers who hated the new behavior won't be swayed by more quota.
Anthropic acknowledged the pain, published a transparent postmortem, reset limits, and acquired massive compute. For developers, this transforms Opus 4.7 from unusable to viable — but 4.5 is still gone.
Research compiled May 7, 2026 · stark.boxmining.one