Anthropic says its most capable cyber model is too dangerous to release
anthropic
Source: Mastodon
Anthropic announced that its latest AI‑driven cyber model, internally dubbed “Glasswing,” is the most capable system it has ever built for network‑security tasks, but the company has decided to withhold it from public release. The model, described as a “cyber‑focused large language model” capable of generating sophisticated exploit code, scanning for vulnerabilities and even orchestrating multi‑stage attacks, was deemed too dangerous to release without unprecedented safeguards. Instead, Anthropic has confined the technology to a tightly controlled research environment called Project Glasswing, where a small team can probe its limits while enforcing strict isolation, audit trails and human‑in‑the‑loop approvals.
The move underscores a growing tension between AI advancement and security risk. As we reported on 8 April, Anthropic’s discovery of zero‑day exploits in its own infrastructure highlighted the dual‑use nature of powerful models. By acknowledging the threat posed by Glasswing, the firm joins OpenAI and Google in publicly grappling with model‑copying and misuse concerns that have dominated recent headlines. Keeping the model internal may stave off immediate misuse, but it also raises questions about transparency, accountability and the broader industry’s ability to set safety standards for AI‑enabled cyber tools.
What to watch next is whether Anthropic will publish safety‑research findings from Glasswing, invite external auditors, or seek regulatory guidance on AI‑driven cyber capabilities. Competitors are likely to accelerate their own defensive AI programs, and governments in the EU and US are expected to tighten oversight of dual‑use AI. The next few weeks could reveal whether Project Glasswing becomes a benchmark for responsible AI security research or a cautionary tale of technology held too close to the chest.