What was the most-used model on OpenRouter in June 2026?

By average daily token volume, DeepSeek V4 Flash leads at roughly 619B tokens, followed by Tencent Hy3 Preview (451B) and MiniMax M3 (447B).

Is DeepSeek better than Claude?

Volume leadership is not quality leadership. Claude Opus 4.8 still ranks #1 on the Artificial Analysis Intelligence Index at 61.4. DeepSeek wins on cost efficiency and daily throughput for agent batch workloads.

Which frontier models are expected in Q3 2026?

High-confidence windows include GPT-6 (August–September), Claude Opus 5 (around September), Gemini 4, DeepSeek V5 (open weights, roughly 1T parameters), already-shipped GLM 5.2, and Grok 4.3+.

OpenRouter June 2026 Rankings: Chinese Models Own 61% of Developer Traffic — H2 2026 Decision Guide

In June 2026, Claude Fable 5 vanished globally under export controls, OpenAI and Anthropic both signaled IPO intent, and Chinese models on OpenRouter crossed 61% of developer traffic. This guide reads the June leaderboard as billing truth—not vendor marketing—using OpenRouter live volume, Artificial Analysis quality indices, and SWE-bench Pro to deliver company and model tables, a quality-versus-volume split, Q3 release forecasts, an eight-scenario picker matrix, and a five-step OpenClaw routing playbook for H2 2026.

1. June 2026 leaderboard: companies and models

OpenRouter aggregates real production calls from millions of developers worldwide. The board reflects what code actually routes in the wild—not a one-off benchmark run. Data is current through June 2026; live rankings are at openrouter.ai/rankings.

Company rankings (weekly token volume)

Rank	Company	Origin	Weekly tokens	Market share
1	DeepSeek	China	5.13T	17.6%
2	Anthropic	United States	4.34T	14.8%
3	Google	United States	3.66T	12.5%
4	OpenAI	United States	2.46T	8.4%
5	Xiaomi	China	2.42T	8.3%
6	MiniMax	China	2.37T	8.1%
7	Tencent	China	2.36T	8.1%
8	Alibaba Qwen	China	1.26T	4.3%

Chinese vendors in the top eight alone account for roughly 46% of weekly tokens among named leaders. When long-tail Chinese open-weight routes are included, developer-traffic share crosses 61%—the headline shift driving H2 planning conversations.

Model rankings (average daily tokens, Top 10)

Rank	Model	Vendor	Daily tokens
1	DeepSeek V4 Flash	DeepSeek	619B
2	Hy3 Preview	Tencent	451B
3	MiniMax M3	MiniMax	447B
4	MiMo-V2.5	Xiaomi	327B
5	DeepSeek V4 Pro	DeepSeek	300B
6	Claude Opus 4.7	Anthropic	263B
7	Claude Opus 4.8	Anthropic	~200B
8	Claude Sonnet 4.6	Anthropic	178B
9	Gemini 3 Flash Preview	Google	156B
10	Kimi K2.6	Moonshot AI	~150B

This table answers a different question than SWE-bench: which models developers trust enough to bill every day in production agents, copilots, and batch pipelines.

2. One-year reversal: US models 70% to 30%

Bloomberg-cited OpenRouter and Exponential View data frames the macro story in one sentence: US model share (Google + OpenAI + Anthropic combined) fell from about 70% in June 2025 to roughly 30% in June 2026. Forty percentage points migrated to Chinese routes in twelve months—and the shift is not regional patriotism. OpenRouter's user base is global, with heavy representation from the United States, Europe, and India.

A San Diego developer quoted in industry coverage put the economics plainly:

"Coding with Claude runs about ten dollars an hour. With DeepSeek, under fifty cents."

For the majority of daily workloads, June 2026 is an economics story before it is a capability story. Teams that still route everything through a single US frontier model are paying a premium most production traffic no longer justifies.

3. Volume leader is not quality leader

Confusing "most used" with "best" is the fastest way to misread the June board. On the Artificial Analysis Intelligence Index (through late May 2026):

Model	Quality index	SWE-bench Pro	Notes
Claude Opus 4.8	61.4 (#1)	69.2%	Leads long-context and agent reasoning
GPT-5.5	59–60	63.1%	Strong ecosystem and tool-call latency
Gemini 3.1 Pro	57	—	Hard reasoning tasks
Qwen 3.7 Max	57	—	Top-tier Chinese closed model
Claude Sonnet 4.6	—	80.8% (Verified)	Writing and instruction following

One engineer ran twenty identical tasks across frontier models: Claude Opus 4.8 won 16, GPT-5.5 won 5, and Gemini 3.1 Pro won 4. On long-context workloads Opus was effectively uncatchable at the time of testing.

Claude Fable 5 adds a separate access variable. It earned perfect quality ratings (100/100) on several leaderboards, then was pulled globally in mid-June 2026 under US export License Exception AI740 restrictions. Status remains uncertain. The episode proves US frontier labs can still lead on raw capability while availability—not benchmark score—becomes the binding constraint.

4. Three reasons Chinese models dominate daily work

Price. MiniMax M3 API input pricing is $0.60/M tokens versus Claude Opus 4.8 at $5.00/M—roughly one-eighth the cost for comparable daily coding assistance, translation, and summarization.
Good enough. For routine programming help, completion, and document work, Chinese models deliver roughly 80–90% of frontier quality at a fraction of the bill.
Open weights. DeepSeek V4, MiniMax M3, and peers ship open-weight checkpoints enterprises can self-host, removing data-residency objections that block cloud routing.

A Dallas developer described the stack that now looks mainstream: "Complex work—about five hundred dollars a month on Claude plus ChatGPT. Ninety percent of daily coding and speech-to-text on MiniMax, Kimi, and MiMo—around two hundred dollars." Route by complexity, optimize by invoice. That is the 2026 default, not the exception.

5. Eight-scenario model picker matrix (June 2026)

Scenario	Recommended model	Why
Complex code / agents	Claude Opus 4.8	Highest composite index; strongest long context
Daily coding assistance	DeepSeek V4 Flash / MiMo-V2.5	Extreme cost efficiency and speed
Ultra-low-cost API	MiniMax M3	$0.60/M, open weights, self-hostable
Long-context processing	Kimi K2.6 (1M context)	Massive window at reasonable price
Google ecosystem integration	Gemini 3.5 Flash	Native Google Workspace support
Real-time web search	Grok 4.3	Live X/Twitter content access
Self-hosted deployment	GLM 5.2 / Kimi K2.6	Top-tier open-weight options
Image generation	ChatGPT Images 2.0	Best text rendering in generated images

6. Three selection pitfalls for platform teams

Betting on a single weekly leader. DeepSeek V4 Flash at 619B daily tokens does not mean your compliance program can route through Chinese endpoints. Fortune 500 procurement still faces data-residency and congressional scrutiny caps.
Optimizing benchmarks while ignoring invoices. Claude Opus 4.8 at index 61.4 is the quality ceiling, but an agent burning millions of tokens per day can cost multiples of a DeepSeek plus MiniMax blend with negligible user-visible regression.
Perfect model choice, unstable gateway. OpenClaw on a sleeping laptop yields silent channels regardless of leaderboard rank. Always-on remote Mac hosting and openclaw channels status --probe acceptance are part of model ROI, not an afterthought.

7. Q3 2026 release forecast

Q3 2026 may be the densest frontier release quarter on record. Treat dates as confidence windows, not guarantees:

Model	Vendor	Expected window	Key angle
GPT-6	OpenAI	August–September 2026	Rumored 1.5M context, stronger agent tooling
Claude Opus 5	Anthropic	~September 2026	Successor to Opus 4.8; long-horizon agents
Gemini 4	Google	Q3 2026	Multimodal upgrade; video and audio input
DeepSeek V5	DeepSeek	Q3 2026	Open weights, ~1T parameters, closed-model parity target
GLM 5.2	Z.ai	Already released	Top open-weight coding model
Grok 4.3+	xAI	Q3 2026	1M context, enhanced live web

Multiple flagships may land inside a six-week window from mid-August through late September. Benchmark leadership will rotate faster than quarterly planning cycles—another argument for model-agnostic routing rather than annual single-vendor contracts.

8. Five macro predictions for H2 2026

Competition shifts from "strongest model" to "best fit for this scenario." With five major labs shipping inside ninety days, rational architecture sends the hardest 5% to closed US frontiers and the remaining 95% of daily volume to Chinese open-weight routes.
Chinese share keeps rising, but enterprise compliance is the ceiling. Individual developer adoption shows no sign of slowing; Fortune 500 procurement constrained by data security and US congressional oversight may keep enterprise Chinese share below 30% for years.
Agents are the real battlefield. 2026 is the year agents move from experiment to production. Anthropic's 2026 Agent Status Report cites nearly 44% of Claude API calls originating from math and computer-science workloads—exactly the agent and codegen surface OpenRouter measures.
Dual IPO pressure from OpenAI and Anthropic. June 2026 IPO signals reprice the entire sector. Public-market scrutiny could force more transparent pricing and accelerate price wars with Chinese vendors.
Local inference crosses 80% SWE-bench on consumer hardware by 2027. Expect 32GB consumer GPUs running local checkpoints to exceed SWE-bench Verified 80% within roughly eighteen months—undermining commercial API economics for routine coding even if frontier labs keep the quality crown.

9. Five steps to a switchable multi-model architecture

Archive the June baseline. Snapshot company and model Top 10, document the US 70%→30% inflection, and maintain a weekly comparison sheet. Cross-reference our OpenRouter weekly rankings guide for rolling deltas.
Layer routes by complexity. Agent batch jobs → DeepSeek V4 Flash; enterprise hard reasoning → Claude Opus 4.8; ultra-long documents → Kimi K2.6; multimodal Google workflows → Gemini 3.5 Flash.
Write primary and fallback chains in openclaw.json. OpenRouter model IDs include vendor prefixes; store keys via SecretRef; enable automatic fallback on HTTP 429 per the channels probe and 429 runbook.
Deploy an always-on remote Mac gateway. Run openclaw gateway install under launchd; sync workspaces with SFTP or rsync so agent state survives laptop sleep.
Review weekly; shorten gray windows in Q3. Pass openclaw channels status --probe before promoting a new model; when GPT-6 or Opus 5 ships, re-evaluate primary and fallback within 48 hours.

10. Frequently asked questions

Who won June 2026 on OpenRouter—DeepSeek or Claude? By token volume DeepSeek V4 Flash leads; by composite quality index Claude Opus 4.8 remains #1. Production routing should use both tables.

Can I still use Claude Fable 5? It was withdrawn globally in mid-June 2026 under export controls; status is unresolved. Plan migration paths on Opus 4.8 or Sonnet 4.6 until Anthropic publishes restoration criteria.

Which H2 releases matter most? GPT-6 and Claude Opus 5 likely collide in the August–September window. Build vendor-neutral routing now so you are not rewriting agents under launch-week pressure.

11. Conclusion: margin compression and model-agnostic design

The June story is not "Chinese models won." It is that margins on the model layer are compressing fast. DeepSeek proved in early 2025 that frontier performance need not require frontier capex; Xiaomi, Tencent, MiniMax, and Moonshot drove baseline pricing toward the floor. US labs split strategies: OpenAI on ecosystem, Anthropic on quality, Google on multimodal speed—the expensive middle tier is disappearing.

The durable skill for developers and platform leads is not picking today's #1 model. It is building architecture that switches models without rewriting agents. Q3's release pile-up will prove that again within weeks, not quarters.

If you already run multi-model OpenClaw routing, the bottleneck usually returns to gateway uptime and auditable workspace sync. Intermittent laptops, sleeping Windows hosts, and memory-starved VPS instances waste leaderboard strategy before the first invoice closes. SFTPMAC remote Mac rental targets OpenClaw and agent workflows on Apple Silicon: native launchd supervision, low-latency OpenRouter callbacks, and SFTP/rsync baselines aligned with our gateway and channel-probe runbooks— a better production home for June-ranking strategy than a household machine doubling as an AI gateway.