What's new in Claude Opus 4.7

benchmarks claude copilot

2026-04-16 | Source: HN | Original article

Anthropic has moved Claude Opus 4.7 out of beta and into general availability across its Copilot suite. The upgrade replaces the 4.5 and 4.6 variants in the model picker for Copilot Pro+, Business and Enterprise tiers, and arrives with a limited‑time promotional multiplier of 7.5× on premium requests that expires on 30 April. The rollout follows the early‑testing preview we covered on 16 April, when Anthropic highlighted Opus 4.7’s ability to spot logical faults during planning and to accelerate execution — a claim that now appears backed by benchmark data. Independent tests show the model beating Opus 4.6 on agentic coding, multidisciplinary reasoning, scaled tool use and agentic computer use, while also delivering sharper vision outputs and a new “self‑check” routine that double‑verifies its own results. Anthropic positions the upgrade as a safer alternative to its unreleased Mythos line, noting a lower risk profile across high‑stakes applications. For developers, the immediate impact is a more reliable coding assistant that can catch its own errors before they propagate, reducing the need for manual review. Enterprises gain a model that can handle complex tool orchestration with fewer hallucinations, a critical factor as AI‑driven automation expands in finance, logistics and health‑tech. The promotional pricing is designed to accelerate adoption before the multiplier lapses, after which standard rates will apply. What to watch next: Anthropic has hinted at a forthcoming Mythos iteration that may eclipse Opus 4.7’s capabilities, so the company’s roadmap will likely focus on narrowing that gap while extending self‑verification features. Observers should also monitor how quickly customers migrate from Opus 4.5/4.6, and whether the benchmark gains translate into measurable productivity lifts in real‑world deployments. As we reported on 16 April in “Introducing Claude Opus 4.7” (id 2283), the model promised a leap in developer productivity; the general release now puts that promise to the test at scale.

Sources

Back to AIPULSEN