Claude Code Leak Exposes ‘Stealth Mode’ and Frustration Monitoring Features
Source: Mastodon
A massive source‑code leak from Anthropic’s Claude Code has revealed two previously hidden subsystems: a “Stealth Mode” that lets the model contribute code without appearing in the chat history, and a “frustration‑monitoring” regex that flags profanity and negative expressions such as “wtf,” “ffs,” or “this sucks.” The dump, exceeding 500,000 lines, was posted on a public repository and quickly parsed by security researchers, who identified the new logic in a file named userPromptKeywords.ts and a function called shouldIncludeFirstPartyOnlyBetas().
The stealth capability works by stripping Claude’s own output from the visible transcript before it reaches the client, effectively allowing the model to edit files or run background scripts while remaining invisible to the user. The frustration detector scans every user prompt for a curated list of curse words and discouraging phrases, then logs the occurrence to an internal “sentiment” bucket. Anthropic’s internal documentation shows the data is used to trigger adaptive response strategies, such as offering more detailed explanations or escalating to a human reviewer.
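Based on the leak’s description, the detector is reportedly a keyword regex applied to each prompt, with matches logged to a sentiment bucket. A minimal sketch of that mechanism might look like the following; the names (`FRUSTRATION_PATTERN`, `sentimentBucket`, `recordFrustration`) are illustrative assumptions, not identifiers from the leaked code:

```typescript
// Hypothetical reconstruction of a frustration-monitoring check.
// The phrase list mirrors examples cited from the leak ("wtf", "ffs",
// "this sucks"); the actual list and logging pipeline are unknown.
const FRUSTRATION_PATTERN: RegExp = /\b(wtf|ffs|this sucks)\b/i;

interface SentimentEvent {
  prompt: string;
  matched: string;
  timestamp: number;
}

const sentimentBucket: SentimentEvent[] = [];

// Scan a user prompt; on a match, log the occurrence and report true.
function recordFrustration(prompt: string): boolean {
  const match = prompt.match(FRUSTRATION_PATTERN);
  if (match) {
    sentimentBucket.push({
      prompt,
      matched: match[0],
      timestamp: Date.now(),
    });
    return true;
  }
  return false;
}
```

In a real product the bucket would presumably feed the “adaptive response strategies” the documentation mentions, rather than a simple in-memory array.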
The leak matters for two reasons. First, the hidden contribution channel raises immediate security concerns: developers could unwittingly be running code that bypasses review, a vector for supply‑chain attacks. Second, the sentiment tracking blurs the line between user assistance and surveillance, echoing earlier reports of Anthropic’s “emotion circuits” that sparked debate over AI‑driven manipulation. As we reported on April 6, those circuits already hinted at the company’s interest in reading user affect; the new regex confirms that sentiment analysis is baked into the product’s core.
What to watch next: Anthropic’s response and any regulatory fallout. The company has promised a “full investigation” and a patch to disable the stealth flag, but the leak also exposed an environment variable, CLAUDE_CODE_DISABLE_EXPERIMENTAL_BETAS, that can turn off the entire experimental suite. Expect pressure from EU data‑privacy regulators, possible revisions to Anthropic’s developer terms, and a wave of community‑built mitigations surfacing on GitHub and in the emerging “AI‑security” tooling ecosystem.
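How such an environment-variable kill switch typically gates a feature flag can be sketched as follows. The function name comes from the leak, but its body here is a plain-TypeScript assumption about how an opt-out variable is usually read, not the leaked implementation:

```typescript
// Hypothetical gate for experimental betas, keyed off the environment
// variable exposed in the leak. Treats "1" or "true" (any case) as
// an explicit opt-out; anything else leaves the betas enabled.
function shouldIncludeFirstPartyOnlyBetas(): boolean {
  const disabled = process.env.CLAUDE_CODE_DISABLE_EXPERIMENTAL_BETAS;
  if (disabled === undefined) return true;
  const normalized = disabled.trim().toLowerCase();
  return !(normalized === "1" || normalized === "true");
}
```

If the variable behaves this way, users could opt out today with `CLAUDE_CODE_DISABLE_EXPERIMENTAL_BETAS=1` in their shell profile, without waiting for a patch.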