Why is channels status --probe green while Telegram stays silent?

Probes validate registration and a narrow health class; they do not prove messages cleared plugins.entries or that the model tier returned a completion instead of throttling.

Which half of the dual toggle is most often wrong?

Teams flip channels.*.enabled in one change and leave plugins.entries.*.enabled stale, or the opposite; both must appear in one artifact.

Does a hosted remote Mac fix quotas?

No for API billing; yes for removing laptop sleep and split HOME trees so the ladder stays reproducible.

2026 OpenClaw channels probe green but no replies: 429, dual toggles, credentials, model prefixes, doctor runbook

When OpenClaw 2026 channels look online and the probe is green, yet Telegram and Slack stay silent: reframing the symptom as a verifiable state machine

A green “online” badge and a passing probe show the gateway can register a channel and answer a synthetic check. They do not prove that a user message entered the work loop, reached the right plugins.entries handler, or became a model call that could answer back. The healthy-control-plane, mute-surface pattern is maddening until you require named evidence per hop, in line with the tiered discipline in the gateway daemon, launchd, and systemd runbook and the 4.x Telegram and WhatsApp channel material.

Pain 1: You treat a green probe as proof the model already ran. Probes are closer to reachability and registration than to a full turn through tool policy. Missing model request lines, empty tool traces, or absent finish reasons can coexist with a clean edge. Tickets must show probe time and the first model log line; the break may be entirely “below” the UI you are staring at, so you are triaging admission before you tune chat wording.

Pain 2: You update channels.*.enabled and forget plugins.entries.*.enabled during migration. The dual toggles split “attach this channel” from “this entry may accept work.” Half-migrated trees keep webhooks looking alive while the entry gate is logically closed, which mimics an upstream outage. For each messenger class, attach two screenshots or excerpts in one change, and re-read the migration traps in the 4.x channel runbook so weekend deploys do not re-teach the same bug.

Pain 3: ~/.openclaw/credentials/ exists but is empty, and the team assumes secrets are present. Installers and mkdir -p leave a plausible path with no files, while the real token lives under another user. Whatever account launches the gateway must own the ~/.openclaw tree you audit; plist and unit files must not smuggle a CI HOME. The security posture in the 2026.4.14 security audit, entrypoint, and session routing write-up calls that class of miss a primary risk, not a footnote.

Pain 4: OpenRouter-style aggregation plus wrong model id prefixes causes silent misroutes. Namespaces like openai/ or anthropic/ select adapters and billing. One stray character can land you on a disabled route while logs read “timeout.” Keep a single version-controlled table of model strings and attach it to empty-reply tickets, especially when the same process mixes direct and aggregate paths, and when you are stabilizing entrypoint and routing after a patch window.

Pain 5: 429 rate limits feel like a dead channel. The gateway can be healthy while the model tier throttles completions or a key hits a velocity cap. Log 429 with any retry metadata, back off, split keys, and cut fan-in. Do not burn a day “fixing Telegram” when the model layer is full. The same “quiet but saturated” story appears when stdio bridges or HTTP caps pin the process, as in the MCP stdio leak and transport limits guide.

Pain 6: You open pairing and version or TLS, WebSocket, and reverse proxy before you prove L2 and L3. Both topics matter; neither replaces proof that the event cleared plugins.entries and produced model HTTP. The official ladder is: process, channel registration, plugin entry, model call—including 429 and prefix sanity—then deep edge work, the same order openclaw doctor nudges you toward in concert with the daemon and linger baselines.

OpenClaw 2026 layered signal map for silent channels: L0 gateway alive, L1 channel registration, L2 `plugins.entries`, and L3 model HTTP including `429` and OpenRouter id prefixes

Read four layers so you do not “skip a floor” and then guess why the elevator never came.

L0: Gateway process, user identity, listeners. Validate the daemon, restart policy, and effective HOME. A flapping process can still win occasional probes. If traffic passes through Nginx or Caddy, keep scheme, path, WebSocket upgrade, and allowed origins aligned with the app using the TLS and WebSocket production guide. On a persistent Mac, match launchd HOME to the credential owner as in the launchd, systemd, and linger material so sleep and re-login noise do not masquerade as “random” silence in Telegram.

L1: Channel registration and openclaw channels status / --probe. Healthy probes are not enough; they only say the channel can run its check class now, not that your end-to-end chat pipeline is unblocked. Track probe duration, flaps, and TLS noise. If every messenger breaks, suspect shared app credentials; if one is silent, diff per-channel file mtimes and secrets before you blame the model.

L2: plugins.entries and the plugins.entries.*.enabled bit. This is the usual ghost. With channels.*.enabled true, a false entry can still drop events as policy, not as an error. Name the entry id you expect for each Telegram or Slack event type and show both toggles in the ticket, citing the 4.x notes wherever schema moved between minors.

L3: Model HTTP, OpenRouter id prefixes, and 429 semantics. After admission is credible, the provider line wins. You want model id, provider, rough usage, finish reason, and explicit 429 lines with retry hints when present. Wrong prefixes on aggregate routes can yield slow or vague adapter failures, so ship openclaw models status (or the documented peer) with every “no reply” case. A throttled model tier still gives “no chat response” if you never surface backoff to the user. If you mix MCP or stdio bridges, separate “model throttled” from “tool pipe blocked” using the MCP and gateway restart story, because both feel like a dead channel with different fixes.

openclaw doctor compresses the usual drift checks and belongs in the same incident folder as the audit, entrypoint, and routing checklist. A ladder is useless without written receipts; a crisp layer tag on the ticket is what lets the next on-call person trust your rollback.

When you share findings with the official Troubleshooting style narrative, name the first layer where evidence failed, not the last thing you tried. If L2 was wrong, say so before you paste proxy logs, because the audience will pattern-match to “network” otherwise. The same story applies when you file upstream questions: a vendor that sees green OAuth and healthy webhooks will ignore you until you show that their delivery reached your process and still vanished at plugins.entries or the model call. Clarity is kindness across time zones.

Decision matrix for green probes with zero chat replies: first evidence, likely root, and a next action that does not waste a day

What you see	First evidence to capture	High-probability root	Next action
Probe green, no model log lines at all	Dual toggles, entry id, last config diff	`plugins.entries` off or unbound event type	Paste `channels..enabled` and `plugins.entries..enabled` together
First line returns, then silence	`429` bursts, key rotation, concurrency	Quota, velocity limits, or adapter saturation	Backoff, new key split, lower parallel tools, verify prefixes
Silence right after an upgrade	CLI and gateway build strings, migration log	Version skew, half-merged `openclaw.json`	Run the pairing stair before re-tuning models
One messenger down, rest fine	Channel cred mtime, mode, owner	Wrong `HOME` or empty cred dir for that app	Reissue under the service user; normalize permissions
Adapter or unknown-model style errors	Exact model id string, expected namespace	Missing or extra OpenRouter-style prefix	One canonical id table; retest with `openclaw models status`
Long hangs, eventually fails	WebSocket code, proxy timings	TLS, upgrade, or `allowedOrigins` on the edge	Walk the Nginx and Caddy list before rewrites in app code

Commented CLI triage you can drop into a runbook: `channels` status, `--probe`, `models` status, and `doctor`

Subcommands can shift between releases; the contract is the evidence bundle, not a lucky incantation. Keep four artifacts: binary identity, channel truth, model inventory, and doctor output, every time a messenger goes quiet with green chrome.

# 0) Pin builds in the ticket (flags vary; follow your installed help text)
# openclaw --version
# openclaw gateway --version  # or the supported pair in your pack

# 1) Surfaces: raw status then synthetic reachability
# openclaw channels status
# openclaw channels status --probe   # if your build exposes --probe
#        # (green probe still does not prove model or plugins.entries)

# 2) Model routing: prove id strings, providers, and errors in one pass
# openclaw models status
#        # (cross-check OpenRouter / aggregate prefixes)

# 3) Whole-process sanity after edits (permissions, entrypoints, drift)
# openclaw doctor
# 4) Optional: journal / launchd wrapper context if HOME looks wrong
# log show --last 1h --predicate 'process == "openclaw"'  # example only

How to run the official OpenClaw troubleshooting-style ladder in order without smearing changes across layers

Order exists so you can roll back. If you edit TLS, prefixes, and plugin entries in one commit, the next engineer sees noise, not cause.

Step 1: Freeze scope. Record the running account, HOME, the active openclaw.json (or equivalent), and a hash or mtime. Block unrelated proxy edits until that snapshot exists.

Step 2: From the code block, run openclaw channels status and the probe your build documents; paste both. You are proving pre-admission failure, not celebrating green text.

Step 3: Prove both toggles for Telegram and Slack. Show channels.*.enabled alongside plugins.entries.*.enabled in one artifact. A single missing half is an incomplete test.

Step 4: For that same user, list ~/.openclaw/credentials with counts and modes; call out an empty directory explicitly. Re-create or rotate under the true service account with approvals.

Step 5: Reconcile every active model id with the OpenRouter or direct-provider prefix you intend. Deduplicate casual aliases; one canonical table, checked into git.

Step 6: If logs show throttling, capture 429 with retry data, add backoff, split staging and production keys, and trim concurrent tool use before you raise read timeouts on the client.

Step 7: When L0–L3 look clean yet chat is dead, only then re-check the reverse-proxy path, the pairing and CLI version stair, and MCP and HTTP transport limits if you bridge tools that way. Close with a short “layer X was guilty” line in the change record and a fresh openclaw doctor so security-related routing edits did not smuggle a new mistake, and launch agents still point at the same HOME after reboot.

FAQ, honest limits, and how SFTPMAC hosted remote Mac capacity bridges delivery and gateway stability

Why is `openclaw channels status --probe` green while Telegram and Slack get nothing?

Probes validate registration and a narrow class of health work. They do not show that a full message was admitted, that plugins.entries.*.enabled allowed it, or that the model tier returned a completion instead of a 429. Green plus empty model logs most often means L2 admission or L3 throttling, not a finished integration.

Where do people forget the second half of the dual toggle story?

During merges they flip channels.*.enabled in one branch and leave plugins.entries.*.enabled stale, or the opposite. The channel process can still start, so the UI feels half alive, yet the router never schedules the entry. Pair both booleans in one ticket and name the entry id responsible for the messenger’s event type, using the 4.x channel notes for schema context.

Will colocating the gateway on a remote Mac by itself end `429` storms or bad prefixes?

Not for quota, billing, or string mistakes; those follow keys and config. A dedicated, always-on Mac with one service identity can still cut investigation time by removing laptop sleep, roaming VPN, and split credential trees, which keeps the L0–L3 ladder testable. Business limits stay business limits.

Value: The ladder converts “it looks fine” into falsifiable steps: you either have model lines or you do not, you either see both toggles or you are guessing, and you either captured 429 or you are chasing ghosts. That discipline is what shortens the bridge between a green icon and a user-visible answer.

Limitation: Runbooks cannot raise API quotas, repair an aggregator namespace typo, or lift a banned token. They expose which layer is lying so leadership hears facts about contracts and headroom, not drama about the chat vendor.

SFTPMAC hosted remote Mac: Teams that outgrow ad-hoc laptops for production-adjacent gateways benefit when a provider couples always-on Mac hardware, launchd or supervised launch profiles, log retention, and SFTP-based artifact delivery in one place. A stable HOME for ~/.openclaw, predictable networking, and a host that also receives build output tightens the story when a channel is silent at 03:00: fewer moving parts around credentials and ingress means faster proof of whether the bug was plugins.entries, a 429, or a bad OpenRouter id. The service does not hand-wave away configuration rigor, but it reduces environmental roulette so the same ladder works next Tuesday as it did today—precisely the repeatability that keeps assistants dependable while OpenClaw and its providers keep moving in 2026.

Before the next upgrade window, archive one green ticket bundle: versions, channels output, models status, doctor, and both toggles.