Anthropic × SpaceX

The Opus 4.7 Rate Limit Rescue

May 6, 2026 · Research Analysis

Sources: Anthropic Engineering Postmortem, Business Insider, CNBC, WSJ, Reddit r/ClaudeCode, X/Twitter

What @claudeai Announced

@claudeai (1.3M followers) ❤️ 111K · 🔄 10K · 👁 17M
"We've agreed to a partnership with @SpaceX that will substantially increase our compute capacity. This, along with our other recent compute deals, means we've been able to increase our usage limits for Claude Code and the Claude API."
EFFECTIVE IMMEDIATELY

1) Doubling Claude Code 5-hour rate limits for Pro, Max, Team
2) Removing peak-hours limit reduction for Pro & Max
3) Substantially raising API rate limits for Opus models

Executive Summary

Compute Added

300MW+

220K+ NVIDIA GPUs online within the month

Rate Limits

Claude Code limits doubled for paid tiers

Peak Hours

Throttling removed for Pro & Max

This does not fix the Opus 4.7 model itself. It fixes the infrastructure bottleneck that made it unusable in practice.

The Opus 4.7 Crisis

TOKEN INFLATION

New tokenizer increases token count by 1.0–1.35× for the same prompt. Same pricing per token = silent price hike.

RATE LIMIT HELL

Pro subscribers reported hitting limits after ~3 Opus messages per 5-hour window. Peak hours cut limits further.

QUALITY REGRESSION

Users reported hallucinations, combative behavior, "laziness," broken adaptive reasoning, and a 3% coding eval drop.

Root Cause: The Three Bugs

Anthropic's April 23 postmortem traced quality issues to three separate product changes, not the model weights.

Mar 4 · Default reasoning effort lowered from highmedium. Made Claude feel "dumber." Reverted Apr 7
Mar 26 · Caching bug: clear_thinking_20251015 dropped reasoning on every turn. Claude appeared forgetful. Fixed Apr 10 (v2.1.101)
Apr 16 · Verbosity prompt: ≤25 words between tool calls. Caused a 3% drop on coding evals. Reverted Apr 20 (v2.1.116)

⚠️ Key Insight

The API and inference layer were not affected. All issues were in Claude Code's client-side defaults and system prompts. Resolved April 20, 2026.

The Rate Limit Nightmare

Pro Tier ($20/mo)
~45 Opus messages per 5-hour window. With token inflation, real usage felt like 3 messages before hitting the wall.
Peak Hours Penalty
Weekdays 5am–11am PT: limits were cut in half. Exactly when most developers work.
Max Tier ($100–200/mo)
Even high-tier users reported exhausting 5× quota in 1.5 hours of moderate agentic usage.

💡 The Hidden Claude Code Tax

The March 26 caching bug caused continuous cache misses, draining usage limits faster than expected even before the token inflation.

The SpaceX Deal: Colossus 1

CONFIRMED

Anthropic will use all compute capacity at SpaceX's Colossus 1 data center in Memphis, Tennessee.

300MW+

Power Capacity

220K+

NVIDIA GPUs

<30d

Time to Online

@elonmusk (X/Twitter)
"Everyone I met was highly competent and cared a great deal about doing the right thing. No one set off my evil detector. So long as they engage in critical self-examination, Claude will probably be good."

Anthropic's Compute Portfolio

Amazon · Up to 5 GW (~1 GW by end of 2026)

Google & Broadcom · 5 GW beginning 2027

Microsoft & NVIDIA · $30B Azure capacity

Fluidstack · $50B US AI infrastructure

SpaceX · 300MW+ immediate (Colossus 1)

Hardware diversity: AWS Trainium, Google TPUs, and NVIDIA GPUs.

The Three Fixes (Live Now)

Doubled 5-Hour Limits — Pro, Max, Team, Enterprise now get 2× messages per rolling window (~45 → ~90 Opus messages for Pro). LIVE
Peak-Hours Throttling Removed — Pro & Max no longer suffer reduced limits during business hours. 3 AM = 3 PM. LIVE
Higher Opus API Limits — Developers using the Claude API get raised rate limits across all tiers. LIVE

💡 No Action Required

Limits updated automatically for all paid plans. Free tier is unchanged.

Developer Impact by Tier

BIGGEST WIN

Pro ($20/mo)

Previously the most constrained. Doubling + removing peak cuts makes it viable for serious development without upgrading to Max.

POWER USER

Max ($100–200/mo)

Rate caps should now be rarely hit, except in extremely long autonomous agent sessions. Predictable workflow.

TEAM SCALE

Team / Enterprise

Combined capacity increase is substantial for multi-developer environments running concurrent Claude Code sessions.

API DEVS

API Users

Higher Opus caps may allow simplifying batched architectures and reducing queue complexity.

Does This Fix Opus 4.7?

MODEL ISSUES

❌ New tokenizer (1.0–1.35× token inflation)
❌ Sampling controls removed (temp, top_p, top_k)
❌ Adaptive thinking hidden by default
❌ Personality shift: more literal, less helpful
❌ Opus 4.5 permanently removed

INFRA FIXES

✅ Rate limits doubled
✅ Peak-hour throttling gone
✅ 300MW+ new compute online
✅ Quality bugs already fixed (Apr 20)
✅ Usage limits reset for subscribers

⚠️ The Verdict

The model itself is not changing. What changes is the infrastructure envelope around it. The May 6 update makes Opus 4.7 practically usable by removing the rate-limit chokepoint. The quality issues were already resolved in the April 20 patch.

Competitive Context

OpenAI Codex

Shipped /goal command last week — persistent workflow feature competing directly with Claude Code's agent mode.

Pricing: Pay-per-task (usage-based)

Claude Code

Doubling limits effectively halves per-message cost for heavy Pro/Max users, strengthening the flat-fee value prop.

Pricing: Flat monthly fee with rate limits

This is a direct competitive response to Codex. Revenue from SpaceX compute deal → more capacity → happier developers → retention.

Community Sentiment

@Shpigford (Josh Pigford, Founder)
"opus 4.7 is the first time i've thought 'anthropic may be moving too fast'. just feels sloppy. every interaction i'm having with 4.7 across every input... they're all having substantial issues that 4.6 simply doesn't encounter."
@jeremyphoward (Jeremy Howard, fast.ai)
"the first model that 'gets' what I'm doing when I'm working."
r/ClaudeCode (Reddit, 2,300 upvotes)
"Claude Opus 4.7 is a serious regression, not an upgrade... Please open back support for Opus 4.5. 4.6 is unusable and 4.7 eats usage like nuclear reactor."

The Orbital Compute Footnote

FUTURE

Anthropic has "expressed interest" in partnering with SpaceX to develop multiple gigawatts of orbital AI compute capacity.

🛰️ Why Space?

Bypass terrestrial constraints: power, land, cooling. Remains speculative and years from reality, but signals the severity of the physical infrastructure bottleneck.

Elon Musk merged xAI into SpaceX earlier this year. Despite prior attacks on Anthropic, revenue wins over rivalry in 2026 compute economics.

Key Takeaways

1 The rate-limit pain was real and severe. Pro users were getting ~3 Opus messages per window. That's now ~90.
2 Removing peak-hour throttling is arguably more important than doubling. Developers work during business hours.
3 Token inflation remains. Same tasks still cost 1.0–1.35× more. The extra quota compensates but doesn't reverse it.
4 The quality bugs were fixed April 20. The May 6 announcement addresses compute, not model weights.
5 This is a competitive play against OpenAI Codex. Infrastructure is the new moat.

Will This Make Things Better?

YES — FOR INFRASTRUCTURE

Developers will see fewer mid-session blocks, predictable workflows, and longer uninterrupted agent sessions. The Pro tier becomes genuinely usable for professional work.

PARTIALLY — FOR THE MODEL

Opus 4.7's tokenizer, API breaking changes, and personality shift are unchanged. Developers who hated the new behavior won't be swayed by more quota.

THE BOTTOM LINE

Anthropic acknowledged the pain, published a transparent postmortem, reset limits, and acquired massive compute. For developers, this transforms Opus 4.7 from unusable to viable — but 4.5 is still gone.

Research compiled May 7, 2026 · stark.boxmining.one