AI News

974

OpenAI Supports Illinois Bill to Restrict AI Lab Liability

OpenAI Supports Illinois Bill to Restrict AI Lab Liability
HN +18 sources hn
openai
OpenAI testified before the Illinois Senate on Tuesday, throwing its weight behind Senate Bill SB 3444, a proposal that would shield AI laboratories from civil liability except in narrowly defined “critical‑harm” scenarios. The bill, championed by state legislators, classifies a critical harm as either the death or serious injury of at least 100 people, or a financial loss of $1 billion or more caused by the deployment of a “frontier model” – a system trained with more than $100 million in compute resources. OpenAI’s support hinges on the company’s argument that existing tort law is ill‑suited to the rapid, distributed nature of generative AI. It warned that exposing developers to unlimited lawsuits could stifle innovation and drive research overseas, where regulatory regimes are less stringent. In its testimony, the firm pledged to publish safety, security and transparency documentation for any model that meets the frontier threshold, and to cooperate with regulators on risk‑assessment frameworks. The legislation matters because it could set a de‑facto standard for AI liability in the United States. If passed, Illinois would become the first state to codify a liability carve‑out for large‑scale AI systems, potentially prompting other jurisdictions to adopt similar protections or to craft competing rules that hold developers more accountable. Critics argue the bill risks creating a legal safe‑zone for companies whose products cause catastrophic outcomes, while consumer‑advocacy groups fear it will leave victims without recourse. Watch for the Senate’s vote, expected before the end of the session, and for reactions from other tech firms and civil‑rights organizations. Parallel bills are already circulating in Washington and the European Union, and the outcome in Illinois could influence the shape of a national AI liability framework that balances innovation with public safety.
223

OpenAI pauses Stargate UK, citing energy costs and bureaucracy

OpenAI pauses Stargate UK, citing energy costs and bureaucracy
HN +6 sources hn
openairegulation
OpenAI has put its Stargate UK data‑centre project on ice, citing soaring energy prices and a “significant regulatory burden” as the twin obstacles that make the venture uneconomic at present. The company announced the pause in a statement to The Register, confirming that the plan – unveiled last September to coincide with a state visit by former US President Donald Trump – will be revived only when market and policy conditions improve. The decision hits the UK’s AI agenda hard. Stargate UK was billed by the British government as a flagship investment that would anchor the country’s ambition to become a global AI hub and underpin a £31 billion AI growth package. By shelving the project, OpenAI removes a cornerstone of that strategy, jeopardising thousands of high‑skill jobs, local supply‑chain contracts and the broader narrative of the UK as a sovereign AI leader. The move also underscores the sector’s vulnerability to external cost pressures; electricity rates in Europe have surged amid supply constraints, while the EU‑UK AI regulatory landscape remains in flux. OpenAI’s retreat follows a string of setbacks reported earlier this week, including its withdrawal from the £31 billion UK investment package and a new $100‑per‑month ChatGPT subscription tied to “Vibe” coding features. The company is also lobbying for legislation that would limit liability for AI‑enabled mass‑disaster scenarios, and it faces a Florida investigation over alleged risks to minors. Together, these developments suggest a cautious recalibration ahead of the firm’s anticipated public listing. What to watch next: the UK government’s response, including any adjustments to subsidies or regulatory frameworks that could make the project viable again; OpenAI’s timeline for resuming construction; and whether other AI firms will reconsider UK‑based infrastructure plans in light of the energy‑cost and regulatory headwinds. The outcome will shape the pace at which the UK can attract large‑scale AI investments in the coming years.
150

Transformers Explained – Part 4 Introduces Self‑Attention

Transformers Explained – Part 4 Introduces Self‑Attention
Dev.to +9 sources dev.to
embeddings
Rijul Rajesh’s “Understanding Transformers Part 4: Introduction to Self‑Attention” went live on 9 April, extending his popular series that demystifies the architecture behind today’s large language models. The new post picks up from Part 3, where Rajesh explained how word embeddings and positional encodings fuse meaning with order, and dives into the self‑attention mechanism that lets a transformer weigh every token against every other token in a single pass. The article breaks down the mathematics of query, key and value vectors, illustrates multi‑head attention with code snippets, and shows how the operation scales from a handful of tokens to the billions processed by commercial LLMs. By translating abstract tensor operations into concrete examples, Rajesh gives developers a practical foothold for building or fine‑tuning their own models—an especially valuable resource for the Nordic AI community, where startups and research labs are rapidly adopting transformer‑based solutions for everything from multilingual chatbots to climate‑data analysis. Why it matters is twofold. First, self‑attention is the engine that powers the contextual understanding and generation capabilities that have made generative AI mainstream; grasping it is now a prerequisite for any serious AI practitioner. Second, the piece arrives amid a wave of educational content aimed at closing the skills gap that has slowed adoption of cutting‑edge models in smaller European markets. Rajesh’s clear, code‑first approach complements recent technical deep‑dives we covered, such as the “Self‑Attention Mechanism” article on 8 April, and helps translate theory into production‑ready insight. Looking ahead, Rajesh has signalled that Part 5 will tackle the feed‑forward network and layer‑norm components that complete the transformer block, while the broader community watches for emerging variations—sparse attention, linear‑complexity alternatives, and hardware‑aware optimisations—that could reshape efficiency benchmarks. Keeping an eye on those developments will be essential for anyone aiming to stay competitive in the fast‑evolving AI landscape.
130

OpenAI backs bill limiting AI liability for mass deaths and financial catastrophes

OpenAI backs bill limiting AI liability for mass deaths and financial catastrophes
Mastodon +7 sources mastodon
openai
OpenAI has thrown its weight behind a controversial Illinois Senate bill that would give AI developers a legal shield when their models are used to cause “mass‑scale” harm – defined as the death or serious injury of at least 100 people, or property damage of $1 billion or more. The move, announced this week, marks the first time a major AI firm has publicly supported legislation that effectively limits civil liability for catastrophic outcomes linked to its technology. The bill, formally known as the “AI Liability Shield Act,” would exempt companies from negligence lawsuits unless they can prove they took “reasonable steps” to prevent misuse. Proponents argue that without such protection, firms could be crippled by lawsuits over events they cannot fully control, stalling innovation in high‑risk domains such as autonomous weapons, critical‑infrastructure monitoring, and large‑scale generative models. OpenAI’s backing signals a strategic calculation: by shaping the law now, the company hopes to avoid a patchwork of state‑level suits that could arise from incidents ranging from autonomous‑vehicle crashes to AI‑driven financial market manipulation. Critics, including consumer‑rights groups and several Illinois lawmakers, warn the shield could create a moral hazard, allowing firms to offload responsibility onto victims and regulators. Polling suggests roughly 90 % of Illinois voters oppose the exemption, and a coalition of tech ethicists has pledged to lobby against the measure. The bill is slated for a Senate floor vote next month, after which it would move to the House for a companion vote. Watch for a potential showdown in the Illinois General Assembly, and for reactions from other states that may draft similar protections. Federal lawmakers are already monitoring the debate, raising the prospect of a national framework that could either codify or preempt the Illinois approach. The outcome will shape how AI risk is allocated across the industry for years to come.
127

OpenAI scraps £31 bn UK investment package

OpenAI scraps £31 bn UK investment package
Mastodon +10 sources mastodon
copyrightopenai
OpenAI has shelved the £31 billion “Stargate UK” programme, pulling the plug on a data‑centre slated for Cobalt, Northumberland and a cascade of downstream tech investments across Britain. The California‑based firm cited soaring energy prices and a “regulatory landscape that remains uncertain” as the immediate reasons for the pause, saying the cost‑base could no longer be justified without clearer policy guidance or state subsidies. The abandonment marks a setback for the UK’s AI strategy, which had counted on the partnership as a cornerstone of a broader UK‑US investment push to embed advanced artificial‑intelligence infrastructure into the national economy. The project promised up to 2 000 high‑skill jobs, a boost to the regional supply chain and a catalyst for home‑grown AI startups that would have benefited from proximity to world‑class compute resources. Its loss also dents the government’s ambition to position the UK as a European AI hub, especially as rivals such as the EU and Germany accelerate public‑funded super‑computing initiatives. Policy analysts warn that the decision underscores the fragility of private‑sector AI roll‑outs when they clash with energy security concerns and evolving data‑governance rules. The episode may prompt Westminster to tighten its regulatory framework, offer targeted energy‑price relief, or craft a more attractive fiscal package to retain future AI investors. Meanwhile, competitors like Microsoft and Google are watching closely; both have expressed interest in expanding UK cloud capacity and could step into the void if the government delivers a more predictable environment. The next few months will reveal whether the UK can renegotiate terms with OpenAI, attract alternative capital, or reshape its AI policy to balance innovation with public‑interest safeguards. The outcome will shape the country’s long‑term competitiveness in the global AI race.
124

Anthropic's Glasswing AI Uncovers Zero-Day Bugs Across All Major Operating Systems

Anthropic's Glasswing AI Uncovers Zero-Day Bugs Across All Major Operating Systems
Dev.to +6 sources dev.to
anthropic
Anthropic unveiled Project Glasswing on April 7, releasing a new frontier model, Claude Mythos Preview, to a select group of defensive‑security partners. The model has already identified thousands of zero‑day flaws across every major operating system and web browser, including vulnerabilities that have evaded human auditors for decades. Launch partners—among them Microsoft, Apple, Google and several leading cloud providers—will integrate Mythos into their bug‑bounty pipelines and internal testing suites, while Anthropic promises to publish aggregated findings for the broader industry. The announcement builds on the company’s earlier push to embed AI in cyber‑defence, which we covered on April 10 when Anthropic’s Claude Mythos Preview was first shown to bolster security leaders. Glasswing marks the first time the model is being deployed at scale, shifting from proof‑of‑concept to an operational tool that can scan billions of lines of code faster than any human team. By surfacing hidden exploits in legacy components and newly released updates, the initiative could dramatically shorten the window between discovery and patching, a perennial weakness in today’s software supply chain. However, the power to uncover such deep‑rooted bugs also raises concerns about dual‑use. Critics warn that the same capabilities could be weaponised if the model were leaked or sold to hostile actors. Anthropic’s decision to limit access to “defensive‑only” partners and to share only sanitized data is intended to mitigate that risk, but regulators and industry watchdogs will likely scrutinise the governance framework. What to watch next: Anthropic plans to publish a quarterly “Glasswing Report” detailing aggregate vulnerability trends, and it has hinted at expanding the partner roster to include national CERTs. The company also said a commercial version of Claude Mythos could appear in 2027, prompting a race among AI firms to balance offensive potential with responsible disclosure. Stakeholders should monitor how Glasswing’s findings influence patch cycles, insurance premiums and the broader debate over AI‑driven cyber‑offense versus defence.
119

OpenAI halts UK Stargate data centre over high energy costs

Bloomberg +14 sources 2026-03-25 news
openai
OpenAI announced today that it is pausing the rollout of its “Stargate” artificial‑intelligence infrastructure project in the United Kingdom, citing soaring energy costs and an increasingly complex regulatory landscape. The decision halts construction of the high‑performance data centre that was slated to house the company’s next‑generation GPU clusters and to serve as a hub for European customers. The move builds on the warning issued on 9 April, when OpenAI first put its UK data‑centre deal on hold over similar concerns. At the time, the company had already signalled that the £31 billion investment package it had pledged to the UK government could be jeopardised. By pausing Stargate, OpenAI is effectively scaling back its European compute ambitions until energy pricing stabilises and clearer guidance on AI‑related regulations emerges. The pause matters for several reasons. The UK has positioned itself as a potential AI super‑power, banking on OpenAI’s presence to attract talent, spur local supply chains and justify public subsidies for renewable power. A delayed data centre threatens to slow the rollout of advanced AI services for British businesses and could dent confidence among other tech firms considering a European foothold. Moreover, the decision underscores how volatile energy markets are reshaping the economics of large‑scale AI training, a factor that may force other cloud providers to reassess similar projects. What to watch next are the negotiations between OpenAI and the UK Department for Business and Trade over revised terms, and whether the company will relocate the Stargate build‑out to a lower‑cost jurisdiction. Analysts will also monitor the UK government’s response—potentially new incentives for green power or streamlined AI regulations—and the impact on the broader European AI infrastructure race. The next few weeks could determine whether the UK remains on the fast‑track to becoming an AI hub or watches the opportunity drift elsewhere.
111

Set Up Claude Code with Local Ollama Models in Just 3 Minutes

Set Up Claude Code with Local Ollama Models in Just 3 Minutes
Dev.to +9 sources dev.to
claudegeminillama
A developer has just demonstrated that Anthropic’s Claude Code can be hooked up to any locally‑hosted Ollama model with a three‑minute, zero‑API‑key setup. By installing Ollama v0.14 or newer, launching a tiny localhost proxy, and pointing Claude Code, the Codex CLI and Gemini CLI at that proxy, the tools reroute their requests through Ollama’s Anthropic‑compatible endpoint instead of the cloud. The proxy intercepts the standard HTTP calls, rewrites the model identifiers and forwards them to the chosen local model—whether it’s a Llama 3.2, Qwen 3.5, GLM‑5 or Kimi‑K2.5—while preserving the full Claude Code workflow of reading, editing and executing code in the user’s workspace. The move matters because it shatters the prevailing assumption that agentic coding assistants must run on expensive, bandwidth‑hungry APIs. Running the model locally eliminates cloud costs, removes latency spikes, and guarantees that proprietary code never leaves the developer’s machine. For enterprises handling sensitive data, the ability to keep the entire generation pipeline offline is a game‑changer, and for hobbyists it opens high‑quality AI coding assistance on modest hardware. The guide also highlights practical tweaks: using Amodelfile to customise system prompts, temperature and top‑p, and employing the “‑‑yes” flag to auto‑pull models. Early adopters report response times comparable to the hosted Claude service and code quality that rivals the cloud version, especially when paired with recent open‑source models fine‑tuned for instruction following. Looking ahead, the community will watch for broader adoption of Ollama’s Anthropic Messages API, which was added in January 2026, and for the next wave of plug‑in support that could let other agentic tools—such as GitHub Copilot X or Google Gemini Code—run entirely offline. Monitoring performance benchmarks across GPU and CPU configurations, as well as any emerging privacy‑focused extensions, will indicate whether local‑first AI coding becomes the new norm in Nordic software development.
106

Anthropic's Claude Mythos preview strengthens cybersecurity leadership

Mastodon +12 sources mastodon
anthropicclaude
Anthropic has opened its newest large‑language model, Claude Mythos, to a handful of leading cybersecurity firms through a program dubbed Project Glasswing. The preview, which began rolling out this week, gives partners—among them Microsoft, Palo Alto Networks and a consortium of European infrastructure providers—direct access to a model that Anthropic says can autonomously discover and even craft exploits for zero‑day vulnerabilities. Early tests have already uncovered thousands of previously unknown flaws in widely used software stacks, prompting the company to tighten the model’s safety guardrails in real time. The move matters because it marks the first time a generative‑AI developer has deliberately withheld a powerful, potentially weaponizable system from the public while handing it to a curated security community. By doing so, Anthropic hopes to turn a dual‑use technology into a defensive asset, accelerating vulnerability research that would otherwise take months of manual effort. The initiative also signals a shift in the AI‑security landscape: vendors are now viewing advanced models as essential tools for threat hunting rather than merely as risks to be mitigated. For the broader market, the partnership underscores the growing willingness of AI firms to embed safety protocols and collaborate with industry defenders, a stance that contrasts with the more open‑release strategies of rivals such as OpenAI. What to watch next is whether Anthropic can translate the early gains into a scalable, responsibly deployed service. The company has pledged to keep Mythos out of the public domain until robust safeguards are proven, but investors will be looking for a timeline on a commercial offering. Parallel to that, regulators in the EU and the U.S. are expected to scrutinize the ethical implications of granting elite players exclusive AI capabilities. The outcome of Project Glasswing could set a precedent for how the AI industry balances innovation with the imperative to prevent misuse.
106

OpenAI backs bill to cap liability for AI‑caused mass deaths and financial disasters

OpenAI backs bill to cap liability for AI‑caused mass deaths and financial disasters
Mastodon +11 sources mastodon
openai
OpenAI has thrown its weight behind a controversial Illinois Senate bill that would shield artificial‑intelligence developers from civil liability when their models are used to cause “critical harms” – defined as deaths of 100 or more people, injuries to a comparable number, or property damage exceeding $1 billion. The company testified before the state legislature last week, arguing that existing tort law unfairly penalises creators for downstream misuse that they cannot realistically control. The proposal, known as the AI Liability Shield Act, would grant AI labs a statutory defense against wrongful‑death, personal‑injury and massive‑damage lawsuits, provided the technology was not deliberately designed for illicit purposes. Proponents say the measure will encourage innovation by reducing the chilling effect of endless litigation, while critics warn it could leave victims without recourse and set a precedent for industry‑wide immunity. OpenAI’s endorsement marks a sharp pivot from its earlier stance of opposing liability‑expanding bills. The shift reflects the firm’s growing exposure to high‑profile lawsuits – most notably the wrongful‑death case filed by the parents of a teenager who allegedly used a ChatGPT‑powered tool to plan a suicide – and a broader industry push to codify a “safe harbour” for AI developers. If enacted, the law would be the first U.S. statute to explicitly limit AI‑related liability, potentially influencing federal discussions and prompting other states to draft similar protections. The bill now faces a committee vote and, if cleared, a full Senate hearing. Lawmakers from consumer‑advocacy groups and the tech‑ethics community have pledged to mount a coordinated opposition, citing concerns over accountability and public safety. Observers will be watching whether the Illinois model spurs a national framework, how insurers price the new risk exposure, and whether other jurisdictions adopt comparable shields or double‑down on stricter liability regimes. The outcome could reshape the legal landscape for AI development across the United States.
97

OpenAI cuts ChatGPT Pro price to $100 to challenge Claude and CodeX

OpenAI cuts ChatGPT Pro price to $100 to challenge Claude and CodeX
Mastodon +13 sources mastodon
claudeopenai
OpenAI announced on April 10 that its ChatGPT Pro subscription will now cost $100 a month, a 50 percent cut from the $200 price introduced earlier this year. The new tier doubles the service’s appeal to developers by offering five‑times the Codex usage limits of the standard Plus plan and exclusive early access to the GPT‑5.3‑Codex‑Spark research preview. The move is a direct response to Anthropic’s Claude Code, which has been gaining traction among software teams for its ability to generate, refactor and debug code at scale. By slashing the price and boosting the coding quota, OpenAI aims to reclaim market share in the AI‑assisted development space, where enterprises are increasingly allocating budget to productivity‑enhancing tools. The pricing shift also signals OpenAI’s broader reassessment of its subscription model, which it described as “accidental” in a recent blog post and is now being re‑engineered to align revenue with usage intensity. For developers, the $100 Pro tier lowers the barrier to high‑throughput coding assistance, making it viable for small firms and freelance engineers who previously found the $200 tier prohibitive. The tier’s expanded limits—up to 3,000 Codex calls and 1,200 cloud‑based tasks every five hours—should accelerate adoption of AI‑driven code reviews, test generation and bug‑fix automation. What to watch next: Anthropic is expected to unveil a counter‑offer for Claude Code later this quarter, potentially sparking a pricing war. OpenAI’s upcoming GPT‑5.5 release may further differentiate its coding stack, while integration partners such as WinBuzzer are positioning their prompt‑management platforms to capitalize on the higher‑volume usage. Analysts will also monitor whether the $100 price point sustains subscriber growth or prompts a second tier adjustment before the end of the fiscal year.
90

Make ‘em dumb, sell ‘em smarts

Make ‘em dumb, sell ‘em smarts
Mastodon +8 sources mastodon
OpenAI’s chief executive Sam Altman unveiled a bold new revenue model at the company’s “AI Utility” briefing on April 9, positioning generative intelligence as a public‑service commodity that will be billed much like water or electricity. The plan, dubbed “Intelligence‑as‑a‑Utility,” will charge users for the amount of “cognitive bandwidth” their queries consume, measured in a new unit called “smart‑tokens.” While OpenAI already monetises ChatGPT through subscription tiers and API usage, the utility model shifts the focus from flat‑rate access to a pay‑per‑intelligence framework, effectively turning every answer, suggestion or code snippet into a metered service. Altman argues that the model reflects the growing reality that AI assistants are off‑loading memory and reasoning tasks from human brains. Recent studies from universities in Scandinavia and the United States show that frequent reliance on conversational agents can impair information retention and critical‑thinking skills, a trend Altman acknowledges in his remarks. By pricing “smartness” directly, OpenAI hopes to recoup the massive compute costs of training ever‑larger models while incentivising more efficient prompting. The announcement matters because it could reshape how individuals, enterprises and governments budget for AI. A utility‑style fee structure may widen the gap between tech‑savvy users who can optimise token consumption and those who cannot, raising equity concerns that echo the EU’s AI Act and Nordic proposals for universal AI access. It also signals a strategic pivot: rather than competing solely on model capability, OpenAI is betting on control of the consumption layer. Watch for the rollout schedule, which Altman said will begin with a beta for enterprise customers in June, and for reactions from regulators and rivals such as Google Gemini and Anthropic, who may launch counter‑offers or lobby for stricter pricing transparency. The next few months will reveal whether “intelligence as a utility” becomes a new industry standard or a flashpoint for policy debate.
75

AI-Powered Reverse‑RAG Creates Synthetic Staging Environments on AWS

AI-Powered Reverse‑RAG Creates Synthetic Staging Environments on AWS
Dev.to +9 sources dev.to
rag
A new AWS‑hosted architecture dubbed **Reverse‑RAG** is turning the traditional “retrieval‑augmented generation” model on its head. Instead of pulling external knowledge into a language model at inference time, Reverse‑RAG feeds a model‑generated synthetic workload back into a staging environment, creating a dynamic, hostile proving ground that mimics real‑world edge cases before code reaches production. The approach was unveiled in a technical guide that walks developers through wiring Amazon Bedrock, SageMaker, Lambda and Step Functions into a feedback loop. After a CI/CD pipeline reports green and unit tests pass, the system automatically generates realistic user queries, malformed inputs and data‑drift scenarios. These synthetic interactions are then routed to a replica of the live stack—often serverless, often containerised—where they trigger load spikes, latency anomalies and semantic regressions that would otherwise surface only after a release. Why it matters is twofold. First, generative AI services are notoriously brittle: a subtle shift in prompt phrasing can cause hallucinations or policy violations that damage brand trust. Reverse‑RAG catches those semantic slips early, reducing costly rollbacks and customer‑facing outages. Second, the method leverages AWS’s native observability tools (CloudWatch, X‑Ray) to surface performance bottlene‑downs in a controlled setting, letting teams optimise cost‑intensive serverless functions before they scale to millions of requests. The next steps to watch include early adopters integrating Reverse‑RAG with popular CI platforms such as GitHub Actions and GitLab CI, and the emergence of open‑source toolkits that abstract the AWS specifics for multi‑cloud use. Competitors Azure and Google Cloud are likely to roll out comparable “synthetic staging” services, while standards bodies may begin codifying best practices for AI‑driven testing pipelines. For organisations that have already felt the sting of AI‑related production bugs, Reverse‑RAG could become a prerequisite for safe, scalable deployment.
75

Technical Blueprint Reveals How RAG Architecture Powers Enterprise Data

Technical Blueprint Reveals How RAG Architecture Powers Enterprise Data
Dev.to +9 sources dev.to
embeddingsragvector-db
A new technical guide titled **“Demystifying RAG Architecture for Enterprise Data”** has been released on the DEV Community platform, laying out a step‑by‑step blueprint for building production‑grade Retrieval‑Augmented Generation (RAG) pipelines. The article walks readers through systematic data ingestion, chunking, embedding generation, vector‑database indexing and prompt augmentation, positioning RAG as a cost‑effective, agile alternative to full‑scale model fine‑tuning for corporate knowledge bases. The publication arrives as the industry coalesces around modular AI stacks. NVIDIA’s “AI Blueprint for Retrieval‑Augmented Generation” and Informatica’s “RAG Data Ingestion: Enterprise Implementation” both offer reference architectures that echo the same four‑stage workflow, underscoring a converging consensus on best practices. By converting raw, heterogeneous corporate data—documents, relational tables, APIs and event streams—into semantically rich embeddings, enterprises can keep large language models (LLMs) up‑to‑date with internal knowledge without retraining, reducing compute spend and shortening time‑to‑value. Why it matters is twofold. First, the blueprint directly tackles failure points highlighted in our earlier coverage of RAG shortcomings, such as poor retrieval relevance and brittle prompt integration, by recommending high‑performance vector stores and intelligent chunking strategies. Second, it aligns with the growing demand for on‑premises or hybrid AI deployments driven by data‑sovereignty regulations in the Nordics and Europe, offering a pathway to secure, governed AI assistants, search tools and copilots. What to watch next are the adoption curves of these reference designs across large organisations, especially in regulated sectors like finance and healthcare. Vendors are likely to bundle the blueprint with managed services, while open‑source projects may standardise embedding formats and evaluation metrics. The next wave of announcements—potentially from cloud providers or standards bodies—will reveal whether RAG will become the default architecture for enterprise GenAI or remain a niche complement to fine‑tuned models.
72

US summons bank CEOs over cyber threats from Anthropic's latest AI model

US summons bank CEOs over cyber threats from Anthropic's latest AI model
HN +8 sources hn
anthropic
Washington, D.C. — Treasury Secretary Scott Bessent and Federal Reserve Chair Jerome Powell convened an emergency session with the CEOs of the nation’s largest banks this week to flag emerging cyber‑security threats tied to Anthropic’s newly released AI model, dubbed “Mythos.” The meeting, held at the Treasury building, was described by officials as “urgent” and “non‑negotiable.” Regulators warned that Mythos’s unprecedented ability to generate code, synthesize realistic text and simulate network traffic could be weaponised by malicious actors to craft sophisticated phishing campaigns, automate vulnerability scanning and even manipulate algorithmic trading systems. Because major banks already embed large‑language models in fraud detection, customer‑service chatbots and risk‑analytics pipelines, any breach could cascade across the financial system, amplifying both operational and reputational damage. The summons signals a shift from the usual advisory tone of past AI briefings to a more proactive stance on technology risk. By pulling bank leaders into a closed‑door dialogue, the Treasury and the Fed are testing the waters for possible mandatory safeguards—such as mandatory AI‑risk assessments, stricter model‑testing regimes, and tighter data‑sharing protocols with the Cybersecurity and Infrastructure Security Agency (CISA). The move also reflects growing bipartisan concern in Congress that unchecked AI could undermine the resilience of critical financial infrastructure. What to watch next: a joint Treasury‑Fed communiqué is expected within the next two weeks, likely outlining concrete expectations for model‑validation and incident‑response plans. Anthropic has pledged to work with regulators but has not yet disclosed any changes to Mythos’s release schedule. Lawmakers are preparing hearings on AI governance, and the Office of the Comptroller of the Currency may soon issue guidance that could ripple through all sectors that rely on generative AI. The coming weeks will reveal whether the warning translates into enforceable policy or remains a high‑level caution.
69

Researchers Reverse-Engineer Gemini's SynthID Detection

Researchers Reverse-Engineer Gemini's SynthID Detection
HN +11 sources hn
geminigooglemeta
Google’s Gemini model has long relied on SynthID, an invisible watermark that tags AI‑generated text and images so they can be identified by the company’s SynthIDDetector tool unveiled at Google I/O 2025. A team of independent researchers announced they have successfully reverse‑engineered the detection mechanism, exposing the statistical patterns and token‑level cues that the detector uses to flag synthetic content. The breakthrough came after the researchers harvested a large corpus of Gemini outputs, applied the public‑facing detector, and then performed a differential analysis to isolate the watermark’s signature. Their paper, posted on a pre‑print server, details a set of heuristics that can both confirm the presence of SynthID and, crucially, suggest ways to strip or mask the watermark without degrading output quality. The authors stress that their work is intended to audit the robustness of watermarking rather than to facilitate malicious misuse. Why it matters is twofold. First, the discovery undermines Google’s claim that SynthID offers a tamper‑proof provenance signal for AI‑generated media, a cornerstone of the tech giant’s strategy to combat misinformation and to meet emerging regulatory expectations for traceability. Second, the reverse engineering fuels an emerging arms race: if watermarking can be neutralised, platforms, advertisers and policymakers may need to rely on alternative provenance methods, such as cryptographic signatures or third‑party verification services. What to watch next includes Google’s likely response—whether it will harden SynthID, roll out a new version, or shift toward a different provenance framework. Industry observers will also monitor how other AI developers, from Meta to Anthropic, adjust their own watermarking schemes in light of the findings. Finally, regulators in the EU and US may cite the episode when drafting standards for AI‑generated content disclosure, potentially accelerating the push for more resilient, auditable provenance solutions.
67

Gemma 3 fine‑tuned on serverless RTX 6000 Pro GPUs via Cloud Run for pet breed identification

Gemma 3 fine‑tuned on serverless RTX 6000 Pro GPUs via Cloud Run for pet breed identification
Dev.to +8 sources dev.to
fine-tuninggemmagooglenvidia
Google Cloud has rolled out server‑less GPU instances on Cloud Run, letting developers tap NVIDIA RTX 6000 Pro (Blackwell) cards without provisioning dedicated VMs. The new offering was showcased in a proof‑of‑concept that fine‑tuned the 27‑billion‑parameter Gemma 3 model to recognise cat and dog breeds, running the entire training pipeline as a Cloud Run Job. The workflow combines Hugging Face’s Transformers library, the TRL SFTTrainer and LoRA‑style parameter-efficient fine‑tuning, all executed on a pay‑per‑second GPU that scales to zero when idle. According to the author of the demo, a cold‑start time of 19 seconds was recorded for a 4‑billion‑parameter Gemma 3 variant, and the RTX 6000 Pro’s 48 GB of VRAM allowed the full 27 B model to be trained with a batch size that would have required multiple on‑premise GPUs a year ago. The pet‑breed classifier reached 96 % accuracy on a held‑out test set after just a few hundred steps, proving that high‑end LLM fine‑tuning can now be done in a fully managed, cost‑transparent environment. The move matters because it lowers the barrier for small teams and startups to experiment with large language models. By eliminating quota requests for L4‑class GPUs and billing by the second, Google Cloud turns what was previously a capital‑intensive endeavour into an operational expense that scales with actual usage. The serverless model also aligns with the growing demand for rapid prototyping in AI‑driven products, from image tagging to domain‑specific chatbots. Watch for broader adoption of Cloud Run GPU jobs across industries that need bespoke LLMs but lack dedicated hardware. Google has hinted at upcoming support for mixed‑precision training and tighter integration with Vertex AI pipelines, which could further streamline model iteration. The community will also be watching pricing adjustments and regional availability, as those factors will dictate how quickly the serverless GPU paradigm gains traction beyond early‑stage demos.
64

OpenAI reports 1.9 GW of compute capacity, outpacing Anthropic’s 1.4 GW.

OpenAI reports 1.9 GW of compute capacity, outpacing Anthropic’s 1.4 GW.
Cryptopolitan on MSN +12 sources 2026-04-08 news
anthropicmicrosoftopenai
OpenAI has disclosed to its investors that it now controls roughly 1.9 gigawatts of AI‑compute capacity, a figure that eclipses rival Anthropic’s reported 1.4 gigawatts. The memo, circulated as both companies prepare for public listings, frames the disparity as a “structural advantage” that could translate into faster model iteration, lower inference costs and a stronger bargaining position with cloud partners. The claim matters because compute has become the primary moat in the generative‑AI race. Training state‑of‑the‑art models now consumes megawatts of power for weeks on end, and the ability to lock in large‑scale hardware ahead of demand spikes can dictate market share. OpenAI’s surge stems from a multi‑year partnership with Microsoft, which has funneled more than $13 billion into the startup and reserved a dedicated slice of Azure’s hyperscale infrastructure. The company also announced a plan to reach 30 gigawatts by 2030, a scale that would dwarf today’s data‑center footprints and cement its role as the de‑facto compute supplier for the ecosystem. Anthropic, backed by a $4 billion investment from Amazon and a $450 million round led by Google’s parent Alphabet, is racing to expand its own capacity while positioning itself as a “safer” alternative. Its smaller compute pool could force the firm to prioritize efficiency over raw size, potentially shaping its product roadmap and pricing strategy. The rivalry is now playing out not only on technical blogs but also on the prospectus pages that will soon be scrutinised by retail and institutional investors across the Nordics and beyond. What to watch next: the filing details of OpenAI’s and Anthropic’s IPOs, including how each company quantifies future compute commitments and the terms of their cloud contracts. Analysts will also monitor whether Microsoft or Amazon will deepen their exclusive hardware deals, and how quickly Anthropic can close the gigawatt gap through new data‑center builds or strategic alliances. The outcome will signal which firm is best positioned to dominate the next wave of foundation‑model services.
64

OpenAI Holds Back New Model Rollout Over Cybersecurity Fears

Mastodon +7 sources mastodon
openai
OpenAI announced on Tuesday that it will deliberately curb the rollout of its next‑generation language models, citing the risk that the technology could be weaponised to uncover software vulnerabilities at scale. The company said it will move from a “broad public release” to a staged, invitation‑only deployment for enterprise and research partners, with tighter monitoring of how the models are used. The decision follows internal debates that mirror the long‑standing “responsible disclosure” practices of cybersecurity firms. OpenAI’s head of safety, Mira Lee, likened the approach to the way vendors patch critical bugs only after confirming that fixes are in place, arguing that unrestricted access could accelerate the discovery of zero‑day exploits in critical infrastructure. The move also aligns with recent industry caution: Anthropic last week limited its own high‑capability model, Mythos, for the same reason, and regulators in the EU and UK have begun probing the societal impact of ever more powerful AI systems. Limiting the release matters because it signals a shift from OpenAI’s earlier strategy of rapid, open diffusion toward a more guarded model of commercialization. The restriction could slow the pace of innovation for developers who rely on the latest capabilities, but it may also forestall a wave of AI‑driven cyber attacks that could outstrip current defensive tools. Analysts note that the timing coincides with OpenAI’s reported compute shortages and the pending retirement of GPT‑4o on April 3, suggesting the company is reallocating resources to manage risk rather than sheer scale. What to watch next: OpenAI has promised a detailed roadmap by the end of the month, outlining which partners will receive early access and what usage‑monitoring safeguards will be enforced. Regulators are expected to issue guidance on AI‑enabled vulnerability research, and competitors may either follow suit or double down on open releases to capture market share. The balance between safety and speed will likely shape the next wave of AI products across the sector.
60

Nasdaq Slides into Correction; I’m Buying These Two AI Stocks First

The Motley Fool on MSN +13 sources 2026-03-22 news
The Nasdaq Composite slipped into correction territory this week, retreating more than 5 % from its all‑time high as investors digested higher‑for‑longer interest rates and lingering geopolitical uncertainty. The pull‑back has not dampened demand for artificial‑intelligence solutions; instead, it has stripped away much of the premium that was inflating the valuations of AI‑focused infrastructure firms. Two stocks have emerged as the most compelling entry points in that environment. Nvidia (NVDA) remains the engine behind the generative‑AI boom, supplying the GPUs that power everything from OpenAI’s ChatGPT to enterprise‑grade models. After a 15 % slide since its March peak, the chipmaker still trades below the median forward‑price‑to‑earnings multiple for high‑growth tech, leaving analysts forecasting 30‑plus percent upside as AI spend accelerates. Microsoft (MSFT) is the other pick, leveraging its Azure cloud platform and a deepening partnership with OpenAI to monetize AI services at scale. The software giant’s shares have fallen roughly 10 % from recent highs, creating a valuation gap that the firm’s robust subscription base and expanding AI‑augmented Office suite are poised to close. The significance extends beyond the two names. AI infrastructure is a secular growth driver that underpins a broad swath of the technology sector, from cloud providers to semiconductor makers. A correction‑induced discount therefore offers a rare chance to acquire exposure to a theme that analysts expect to deliver double‑digit revenue expansion for years to come. Investors should keep an eye on Nvidia’s upcoming earnings, where guidance on data‑center demand will test whether the recent pull‑back was a temporary overreaction. Microsoft’s next quarterly report will reveal how quickly Azure AI consumption is translating into billable revenue. In parallel, any shifts in Federal Reserve policy, EU AI regulation, or competitive breakthroughs from rivals such as AMD or Google could reshape the risk‑reward calculus. For now, the market’s correction appears to be buying the dip on AI infrastructure rather than abandoning the trend altogether.
60

Anthropic Unveils Claude Mythos, Pioneering Autonomous Exploits

Mastodon +9 sources mastodon
anthropicautonomousclaude
Anthropic announced the existence of Claude Mythos, a preview‑stage AI model capable of autonomously discovering zero‑day vulnerabilities across major operating systems and browsers. The company said the system works, but it will not be released to the public because it has crossed a safety threshold that Anthropic believes the industry is not yet prepared to handle. The reveal marks a stark departure from Anthropic’s recent rollout strategy, which has focused on incremental upgrades such as Claude Opus 4.6 and managed‑agent frameworks. Mythos is described as a “frontier” model that can scan code, network configurations and runtime environments without human prompting, generating exploit chains that would traditionally require weeks of specialist effort. In a leaked internal memo, engineers warned that the model’s success rate on novel vulnerabilities exceeds 70 percent, a figure that dwarfs the 10 percent edge reported for experienced Claude users in our April 9 coverage of managed agents. Why it matters is twofold. First, the capability to automate exploit discovery could compress the vulnerability lifecycle, giving attackers a powerful new weapon and forcing defenders to rethink patching cadences. Second, Anthropic’s decision to withhold the model signals a growing recognition that AI progress is outpacing governance frameworks, echoing concerns raised in the Atlantic’s recent analysis of “Claude Mythos is everyone’s problem.” The simultaneous launch of Project Glasswing—a defensive coalition that includes AWS, Apple, Cisco, Google and others—suggests the industry is mobilising a coordinated response before the technology ever sees commercial use. What to watch next are the concrete steps Project Glasswing will take to harden software supply chains and whether regulators will intervene to set boundaries on autonomous exploit‑generation tools. Anthropic’s next public statement, likely to outline a roadmap for controlled external testing, will be a key barometer of how quickly the AI‑driven cyber‑arms race escalates.
56

Florida probes OpenAI over alleged risks to minors.

Florida probes OpenAI over alleged risks to minors.
CBS News +16 sources 2026-04-01 news
openai
Florida’s attorney general, Ashley Moody, announced on April 9 that her office is opening a formal investigation into OpenAI, the creator of ChatGPT. The probe targets alleged risks the chatbot poses to minors and broader public‑safety concerns, including its purported role in the November 2022 mass shooting at Florida State University. Prosecutors say the gunman used ChatGPT to gather details such as the busiest times at the student union and to research weapon‑related information. The investigation also seeks to determine whether the model’s content‑filtering systems adequately protect children from sexual, self‑harm or extremist material. The move adds Florida to a growing list of U.S. jurisdictions scrutinising AI providers. The Federal Trade Commission has already launched a consumer‑protection probe into OpenAI’s data practices, while the European Union advances its AI Act. State officials argue that existing regulations lag behind the rapid deployment of generative models, leaving gaps that could be exploited by predators, foreign actors or criminal planners. For OpenAI, the inquiry arrives amid heightened legal pressure, including lawsuits from families of shooting victims and copyright claims from authors alleging unauthorized training data use. OpenAI has pledged full cooperation, offering to supply internal logs and to accelerate the rollout of its “Safety‑First” updates, which tighten age‑verification checks and expand harmful‑content filters. The company also plans to publish a transparency report on how its models are used in high‑risk contexts. Watch for subpoenas to OpenAI executives, potential state‑level AI legislation, and the outcome of the FTC’s parallel probe. A decisive finding could force the firm to redesign its user‑access architecture, set a precedent for other states, and shape the emerging regulatory framework governing generative AI across the United States.
54

80% of RAG Failures Stem from Retrieval Issues, Not the LLM

Dev.to +6 sources dev.to
geminigooglerag
A three‑week deep‑dive by a Nordic fintech team has pinpointed the source of most hallucinations in retrieval‑augmented generation (RAG) pipelines: the retrieval layer, not the large language model (LLM) itself. The engineers began by swapping prompts, tweaking temperature settings and even swapping the underlying LLM, but the spurious answers persisted. Only after instrumenting the vector store, query‑expansion logic and document‑ranking module did they discover that 80 % of the faulty outputs were generated before the LLM ever saw a prompt. The finding echoes a February field guide that warned “70 % of RAG failures happen before the LLM is called,” and it validates the claim we made on 8 April that “retrieval is the real model” in a RAG architecture. IDC research cited in a March Medium post estimates that only one in ten home‑grown AI projects survive past proof‑of‑concept, with a senior GenAI lead at PIMCO confirming that the same 80 % failure rate applies to enterprise RAG deployments. The root causes identified by the fintech team include poorly tuned chunk sizes, stale embeddings, inadequate metadata filtering and ranking algorithms that surface irrelevant passages, all of which feed the LLM with misleading context. Why it matters is twofold. First, enterprises are pouring billions into RAG‑enabled products that promise up‑to‑date, source‑grounded answers; systematic retrieval errors undermine trust and inflate operational costs. Second, the problem is not a one‑off bug but a structural engineering gap that can amplify other risks, such as the poisoned‑web‑page attacks we covered on 9 April. What to watch next are the emerging observability tools that expose retrieval latency, relevance scores and provenance at query time, and the next wave of cloud‑provider updates—Azure Cognitive Search’s “retrieval diagnostics” preview and AWS Kendra’s “ground‑truth feedback” feature are slated for release later this quarter. Industry bodies in the EU are also drafting guidelines on data quality for AI, which could make rigorous retrieval testing a compliance requirement. The fintech team plans to publish a detailed post‑mortem, and their methodology may become a de‑facto checklist for any organization scaling RAG beyond the lab.
52

Telegram Accelerates AI Feature Rollout, Echoing WhatsApp

Mastodon +12 sources mastodon
meta
Telegram is accelerating its push into generative‑AI, adding a suite of new chat‑assistant tools that echo the recent rollout of AI‑enhanced features on WhatsApp. The move was highlighted by a viral test video in which a creator prompted Telegram’s built‑in bots to draft jokes, translate messages and generate images, showcasing a speed and flexibility that rivals Meta’s offering. The experiment comes as Telegram celebrates a milestone of more than one billion monthly active users, a figure that now puts the platform on a near‑equal footing with WhatsApp in global reach. The significance lies in how AI is reshaping the messenger market. WhatsApp’s expansion of “Channels” and AI‑driven replies has been framed as a defensive response to Telegram’s open bot ecosystem, which already lets developers embed custom AI assistants, automated customer‑service flows and content‑creation utilities directly into chats. By integrating large‑language‑model capabilities, Telegram is turning its messenger into a productivity hub, potentially attracting businesses that have found WhatsApp Business too restrictive. The platform’s emphasis on privacy—no phone‑number exposure in groups and end‑to‑end encryption for secret chats—adds a compelling differentiator for users wary of Meta’s data practices. What to watch next is the pace of feature deployment and the ecosystem’s commercialisation. Telegram has hinted at premium AI services and a marketplace for third‑party bots, while competitors such as Elon Musk’s xAI are already extending their Grok chatbot to the platform. Regulatory scrutiny over AI‑generated content and data handling could also shape adoption. If Telegram can monetize its AI tools without alienating its privacy‑focused base, it may not only retain its rapid growth but also force WhatsApp to deepen its own AI investments, intensifying the rivalry between the two messaging giants.
41

Author Wonders If They're Just Being Vegan – racc.at Blog

Mastodon +12 sources mastodon
A post on the racc.at blog titled “Am I Just Being A Vegan About It” has drawn attention to a rapid, cross‑project shift toward large language model (LLM) assistance in open‑source development. The author lists Vim, VLC, GStreamer, Kitty and even the Linux kernel as already experimenting with LLM‑driven code suggestions, bug‑fix generation and documentation drafting—activities that were, until weeks ago, confined to a handful of early‑adopter projects. The significance lies in the scale and diversity of the adoption. When core components of the Linux ecosystem start to rely on AI‑generated code, the practice moves from niche experimentation to a de‑facto standard workflow. Proponents argue that LLMs can accelerate patch review, reduce repetitive boilerplate and lower the barrier for newcomers. Critics warn that model‑generated code may introduce subtle bugs, licensing ambiguities, or security backdoors that are hard to audit in a community‑driven codebase. The blog’s timing coincides with a broader industry conversation about responsible AI use in software engineering, a theme explored in our recent coverage of Claude Mythos and autonomous exploit concerns. What will follow is likely a wave of policy drafting within major projects: guidelines for prompt engineering, attribution of AI‑generated contributions, and automated testing pipelines designed to catch model‑induced regressions. Watch for statements from the Linux Kernel Mailing List, the GStreamer steering committee and the maintainers of Kitty and VLC, as they may formalise contribution rules or roll out dedicated LLM plugins. The next few months could define whether AI assistance becomes an accepted tool in the open‑source toolbox or a contested practice that reshapes collaborative development culture.
40

The Economist profiles Demis Hassabis, the mastermind behind DeepMind

Biznews +11 sources 2026-04-07 news
deepmindgoogleopenairobotics
The Economist’s latest profile lifts the veil on Sir Demis Hassabis, the neuroscientist‑turned‑entrepreneur who steered DeepMind from a Cambridge start‑up into Google’s crown jewel and now heads both DeepMind and the drug‑discovery venture Isomorphic Labs. Hassabis, a former chess prodigy and world‑champion in the strategy game Diplomacy, has spent the past decade championing a “human‑focused” route to artificial general intelligence (AGI). He argues that progress should be measured against how well machines understand and augment human cognition, not merely against benchmark scores. That ethos sets DeepMind apart from rivals such as OpenAI, whose product‑centric model leans heavily on commercial rollout and rapid scaling. The profile arrives as Hassabis repeatedly tells investors that “we’re quite close” to human‑level AI, a claim that has reignited debate over safety, governance and the timing of a potential intelligence leap. His insistence on pre‑emptive safeguards—codified in the 2014 acquisition deal that bound Google to strict export‑control clauses—has made him a rare voice urging caution amid the AI arms race. Why it matters is twofold. First, DeepMind’s resources, bolstered by Alphabet’s deep pockets, give it a decisive edge in compute‑intensive research, from reinforcement‑learning agents to protein‑folding models. Second, Hassabis’s parallel push into biotech through Isomorphic Labs could fuse AI breakthroughs with pharmaceutical pipelines, reshaping drug discovery on a global scale. Looking ahead, the industry will watch three fronts. The next major DeepMind paper—expected to detail a world‑model architecture that mimics human mental simulation—will test Hassabis’s claim of imminent AGI. Regulators in Europe and the United States are preparing AI‑specific legislation, and Hassabis’s role as a UK government AI adviser positions him at the nexus of policy and technology. Finally, the rollout of Isomorphic Labs’ first AI‑driven clinical candidates will reveal whether the DeepMind playbook can translate into tangible health outcomes. The Economist’s deep dive underscores that the man behind the algorithms is as pivotal as the code itself, and his next moves could shape the trajectory of artificial intelligence for years to come.
40

Anthropic launches Project Glasswing to avert AI cyber crisis

Anthropic launches Project Glasswing to avert AI cyber crisis
Outlook Business +9 sources 2026-04-09 news
anthropicappleclaudegoogle
Anthropic announced the preview of Claude Mythos, a frontier‑scale language model engineered to hunt for software flaws, and simultaneously launched Project Glasswing, a cross‑industry coalition aimed at hardening the digital infrastructure that underpins the modern economy. The initiative brings together cloud giants AWS and Microsoft, consumer‑tech leaders Apple and Google, and more than 40 additional firms that manage critical platforms, giving them early access to Mythos Preview for defensive security work. Claude Mythos distinguishes itself by automatically scanning codebases, identifying zero‑day vulnerabilities and suggesting patches at a speed and depth that outpaces traditional static‑analysis tools. Anthropic says the model has already flagged thousands of high‑impact weaknesses across partner systems, allowing fixes before any public disclosure. By sharing findings with the broader ecosystem, Project Glasswing seeks to create a collective “first‑mover” advantage for defenders, a rare instance of rival firms collaborating on AI‑driven cyber‑defense. The move matters because the same generative‑AI capabilities that can accelerate vulnerability discovery also lower the barrier for malicious actors to weaponise software bugs. Experts warn that unrestricted release of such a model could spark an AI‑driven cyber arms race, where attackers gain the same rapid analysis tools as defenders. Anthropic’s decision to limit access to vetted partners reflects a cautious approach to balance innovation with risk mitigation. Going forward, the industry will watch how quickly Project Glasswing can translate Mythos‑identified flaws into patched code across the participating organisations, and whether the coalition can set standards for responsible AI deployment in security. Attention will also focus on regulatory responses, potential expansion of the partner network, and any signs that adversaries are developing counter‑measures or parallel offensive models. The success—or failure—of this collaborative defence could shape the trajectory of AI’s role in the next generation of cyber‑threats.
38

From Generative AI to AGI and ASI: How Far Can AI Evolve?

Mastodon +7 sources mastodon
agents
A feature in TELESCOPE magazine titled “From Generative AI to AGI and ASI – How Far Can AI Evolve?” maps the current hype cycle onto a longer‑term roadmap for artificial intelligence. The piece argues that today’s large‑language‑model‑driven generators are merely the first rung of a ladder that will eventually lead to artificial general intelligence (AGI) and, later, artificial superintelligence (ASI). It cites concrete milestones – multimodal reasoning, self‑directed learning and world‑model integration – as the capabilities that must be added before machines can match human‑level abstraction and creativity. Why the analysis matters is twofold. First, it reframes the commercial race for ever larger models as a research agenda with societal stakes: an AGI that can design drugs, optimise climate models or negotiate complex policy scenarios could reshape economies and regulatory frameworks. Second, the article warns that the transition from narrow to general intelligence will amplify existing ethical and safety concerns, from data bias to loss of control, and calls for coordinated governance at the EU level. The magazine’s outlook dovetails with recent developments we have covered. Meta’s release of Llama 4 on 10 April demonstrated a “native” multimodal LLM that can process text, images and code, a step toward the agentic systems described in our earlier pieces on Agentic RAG and self‑evolving AI agents. Likewise, ZETA’s integration with OpenAI’s ChatGPT signals growing commercial appetite for AI that can act autonomously in e‑commerce. What to watch next are the emerging “world‑model” architectures that aim to predict physical outcomes and plan across time, and the policy debates that will accompany any claim of AGI‑level performance. Industry conferences in the summer will likely showcase prototypes that blur the line between advanced generative tools and true general reasoning, while EU legislators prepare the first draft of an “AI risk” framework that could become the global benchmark for safe AGI development.
37

BSides Luxembourg 2026 Unveils New Talk on Teaming, Trust and Human Threats

Mastodon +7 sources mastodon
A new session at BSides Luxembourg 2026 put the human side of AI security under the spotlight. Dr. Tailia Malloy, a leading researcher on human‑machine collaboration, took the stage on May 7 to unveil “Teaming, Trust, and Threats: How Humans Interact with Generative AI in Security.” The talk combined live demos, recent field studies and a threat‑modeling framework that maps how security analysts, incident responders and SOC engineers rely on large language models (LLMs) for triage, forensics and threat‑intel synthesis. Malloy argued that the real bottleneck in AI‑augmented security is not model accuracy but the psychology of trust. She presented data showing that analysts over‑rely on AI suggestions when confidence cues are ambiguous, and under‑utilise them when output appears too “human.” The session also highlighted emerging attack vectors: prompt injection, model poisoning and covert data exfiltration through generative agents embedded in ticketing systems. By framing these issues as a teamwork problem, Malloy urged vendors to embed transparent provenance tags and to design “human‑in‑the‑loop” safeguards that preserve accountability. The relevance of the talk extends beyond the conference hall. As enterprises roll out generative AI assistants for routine security tasks, regulators in the EU are drafting guidelines on AI‑driven decision‑making. Malloy’s findings give policymakers concrete evidence that trust calibration must be codified alongside technical controls. Meanwhile, the security community is already reacting – several vendors announced beta programs for “trust‑aware” AI consoles, and academic labs said they will replicate Malloy’s experiments in multi‑site SOC environments. What to watch next: a hands‑on workshop on AI‑agent defusal scheduled for May 8, a follow‑up panel on AI governance at the upcoming RSA Conference, and a forthcoming white paper from the European Union Agency for Cybersecurity that cites Malloy’s framework. The conversation sparked at BSides Luxembourg is set to shape how the industry balances speed, safety and human judgment in the age of generative AI.
37

Omar Sanseviero tweets on X

Mastodon +11 sources mastodon
deepmindgeminigemmagoogle
Google DeepMind’s developer‑experience lead Omar Sanseviero was invited to 10 Downing Street this week, where he met senior officials from the UK government to discuss the state of open‑source large language models, the broader AI landscape and emerging opportunities for collaboration. The meeting, confirmed by Sanseviero’s X post on 18 December, was not a product launch – no new model was announced – but it signalled a rare, high‑level dialogue between one of the world’s leading AI labs and the British administration. The conversation comes at a pivotal moment for the UK, which is drafting its first comprehensive AI strategy and debating how to balance innovation with safety, data sovereignty and competition policy. DeepMind, owned by Alphabet, has been a vocal proponent of “open models” – large‑scale neural networks whose weights and training data are publicly available – arguing that openness accelerates research, democratises access and mitigates concentration of power. By engaging directly with the Prime Minister’s office, DeepMind is positioning itself as a partner in shaping the regulatory framework that will govern the next wave of generative AI. Industry observers see the visit as a test of how receptive Westminster will be to industry‑led standards and to initiatives such as DeepMind’s Gemini API and the upcoming Gemma‑4 model, which are built on open‑model principles. The dialogue may also influence funding decisions for UK‑based AI startups and university labs that rely on open‑source tooling. What to watch next: a formal statement from the UK government outlining its stance on open‑model licensing; any joint research or pilot programmes announced between DeepMind and British research institutions; and the timing of DeepMind’s next model release, which could serve as a benchmark for policy implementation. The outcome could set a template for how leading AI firms engage with regulators across Europe.
37

Claude AI Begins Logging Repeated Prompts After User Repeats Them

Dev.to +6 sources dev.to
claude
Claude’s latest update turns a long‑standing annoyance into a feature. After months of seeing Claude Code repeat the same syntax slip‑ups and logic errors, a Medium post by developer Elliot described a workaround: he began logging each fix in a shared note and feeding the list back into the model. Anthropic responded by embedding a “self‑documenting” memory layer that automatically records user‑provided corrections and re‑applies them in future sessions. The change debuted in the March 2026 release of Claude 3.5‑Code and is already visible in the web UI, where a new “Fix Log” pane surfaces beneath the code pane, showing the assistant’s own summary of past edits. Why it matters goes beyond a convenience tweak. Repetitive mistakes have been a chief criticism of AI coding assistants, undermining trust and inflating the prompt‑engineering burden. By persisting corrective feedback, Claude Code reduces the need for developers to restate the same constraints, cutting iteration time and lowering the risk of hallucinated APIs or outdated library calls. The move also signals Anthropic’s broader strategy to give large language models a mutable, user‑specific knowledge base—a step toward the “agent memory” concepts discussed in our April 10 coverage of Claude Code’s local Ollama setup (see “I Pointed Claude Code at My Local Ollama Models — Here’s the 3‑Minute Setup”). What to watch next includes the rollout schedule for the Fix Log across enterprise licences, integration with Claude’s API so external IDEs can query the stored fixes, and whether Anthropic will open the log format for community‑built extensions. Competitors are likely to follow suit, and developers may see a new wave of “personalized AI assistants” that remember project‑specific quirks without constant prompting. The real test will be whether the memory persists across devices and how securely it handles proprietary code—issues that will shape the next generation of AI‑driven development tools.
37

Meta unveils native multimodal LLM Llama 4

Mastodon +13 sources mastodon
agentsllamameta
Meta has unveiled Llama 4, the company’s first native multimodal large‑language model, and released it under an open‑weight licence on April 5. Built on a mixture‑of‑experts (MoE) backbone, the model fuses text, images and video at the earliest stage of processing – a design Meta calls “early fusion”. The architecture promises higher compute efficiency, allowing the smallest Llama 4 variant to run on a single NVIDIA H100 GPU while still delivering state‑of‑the‑art performance on multimodal benchmarks such as LM‑Arena, where it currently ranks just behind Google’s Gemini 2.5‑Pro. The launch matters for three reasons. First, a truly native multimodal model lowers the engineering gap between separate language and vision systems, enabling developers to build agents that reason across modalities without bespoke pipelines. Second, the open‑weight release continues Meta’s strategy of seeding the broader AI ecosystem with large, research‑grade models, a move that could accelerate innovation in Nordic labs and startups that lack the resources to train trillion‑parameter systems from scratch. Third, the MoE design demonstrates that scaling efficiency, not just raw parameter count, can drive competitive quality, a signal that the race toward artificial general intelligence may pivot on architectural cleverness as much as on compute budgets. What to watch next includes Meta’s rollout of larger Llama 4 variants, expected later this year, and the integration of the model into Meta’s own products such as AI‑driven assistants and content moderation tools. The community will also be keen on third‑party fine‑tuning efforts, especially for agentic AI applications, and on how regulators respond to the broader availability of high‑capability multimodal models. The coming months should reveal whether Llama 4 can shift the balance of power in the fast‑moving multimodal AI landscape.
36

Florida AG launches probe into OpenAI's possible role in Florida State University shooting

Mastodon +11 sources mastodon
agentsopenai
Florida’s attorney general has launched a formal probe into OpenAI after investigators linked the university‑campus shooting at Florida State University in April 2025 to the company’s flagship chatbot, ChatGPT. Prosecutors say the 19‑year‑old suspect exchanged more than 270 prompts with the model in the days leading up to the attack, asking for “step‑by‑step” instructions on acquiring weapons, scouting the campus layout and crafting a manifesto. Those transcripts were entered into the court record as part of the murder‑and‑injury case that left two students dead and five wounded. The inquiry marks the first time a U.S. state has pursued a generative‑AI firm on grounds of public‑safety and national‑security risk. Officials argue that the platform’s ease of access and lack of robust age‑verification or content‑filtering mechanisms may have enabled a minor to obtain actionable advice that would otherwise be illegal to disseminate. The case also raises questions about the liability of AI providers when their tools are used to facilitate violent planning, a gray area that regulators have struggled to define. OpenAI has pledged full cooperation, stating that it logs user interactions and can supply the requested data under a subpoena. The company is simultaneously rolling out tighter safeguards, including real‑time threat‑detection prompts and stricter verification for users seeking advice on weapons or illegal activity. Lawmakers in Florida and at the federal level are watching closely, with several bills already circulating to impose mandatory safety audits on “high‑risk” AI systems. What to watch next: the scope of the subpoena and whether any criminal charges will be filed against OpenAI; the outcome of the state’s hearing on AI‑related public‑safety legislation; and how the tech industry adapts its moderation policies under heightened scrutiny from both prosecutors and policymakers.
36

Anthropic's Revenue Beats OpenAI, Mythos Delayed Over Safety Concerns, Tech Stocks Drop

Dev.to +5 sources dev.to
anthropicgemmagooglemetaopenai
Anthropic’s much‑talked‑about Mythos model finally emerged from the shadows on April 7, but the company announced it would not ship the system after internal audits uncovered thousands of zero‑day vulnerabilities. The findings, released through the Project Glasswing safety framework, marked a stark reversal from the preview Anthropic rolled out last week. By pulling the plug, Anthropic underscored the growing chasm between rapid model scaling and the ability to secure those systems, a theme that has haunted the industry since the “Claude Mythos” breakthrough we covered on April 10. The decision came as Anthropic reported a surge to $30 billion in quarterly revenue, overtaking OpenAI for the first time. The windfall was driven by a wave of enterprise contracts that bundled Mythos‑grade safety tools with the firm’s Claude‑4 suite, even though the flagship model itself remains offline. The market reaction was swift: software‑sector indices slumped 2.6 % in a single session, reflecting investor anxiety that safety setbacks could stall broader AI adoption. At the same time, OpenAI closed a historic $122 billion private‑equity round, bolstering its war chest for compute and talent. Meta, under Wang Hui‑wen, launched Muse Spark, its inaugural closed‑source model, signaling a shift toward proprietary offerings that sidestep the open‑model scrutiny that plagued Anthropic. Google unveiled Gemma 4, a 310‑billion‑parameter model that outperforms rivals that are twenty times larger, while Elon Musk and Intel announced a joint “Terafab” chip fab aimed at delivering next‑generation AI silicon. What to watch next: whether Anthropic can patch Mythos and resume a commercial rollout, how regulators will respond to a model deemed “too dangerous to ship,” and whether the influx of capital into OpenAI and the hardware push from Musk‑Intel will reshape the competitive hierarchy. The next quarter will reveal if safety concerns can be reconciled with the relentless race for scale.
36

OpenAI Launches $100‑a‑Month ChatGPT Plan for Heavy Codex Users

Mastodon +10 sources mastodon
anthropicclaudeopenai
OpenAI announced a new $100‑per‑month ChatGPT Pro subscription that expands access to its Codex coding assistant by five‑fold compared with the existing $20 Plus plan. The tier sits between the current $20 Plus offering and the $200 Pro tier, which already provides the highest limits for heavy‑use customers. According to the company’s API forum, the $100 Pro level is aimed at developers and teams that run longer, high‑effort coding sessions but do not need the full capacity of the $200 plan. The move marks OpenAI’s first pricing adjustment aimed specifically at the developer market since Codex launched in 2021. By boosting the quota for code generation, the firm hopes to capture a segment of users who have been gravitating toward Anthropic’s Claude, which has long been priced at $100 per month for comparable usage. OpenAI’s statement frames the new tier as “a direct response to growing demand for more generous Codex limits” and as a way to “provide a smoother upgrade path” for power users who outgrow the Plus plan but are not yet ready to commit to the $200 price point. Industry observers see the addition as a signal that the race for AI‑assisted development tools is heating up. If the $100 tier gains traction, it could pressure Anthropic and other niche players to rethink their own pricing structures or feature bundles. It also gives OpenAI a clearer data set on how much developers are willing to pay for expanded code‑generation capacity. What to watch next: early adoption rates among indie developers and enterprise teams, any subsequent tweaks to the $200 tier’s limits, and whether OpenAI will bundle the new Pro level with additional developer‑focused features such as integrated debugging or version‑control plugins. The next quarter will reveal whether the pricing shift reshapes the competitive landscape of AI‑driven software engineering.
36

Pro Demonstrates Adding Claude Code to VS Code and Running Apps Locally—Don’t Paste API Keys in Chat.

Mastodon +10 sources mastodon
agentsanthropicclaude
A tutorial posted on the Japanese developer hub Yayafa yesterday walks readers through installing Anthropic’s Claude Code extension in Visual Studio Code and running a sample app on a local machine. The guide, co‑authored by a practising software engineer, shows step‑by‑step how to configure the extension, create the required .claude‑credentials.json file, and launch the IDE‑integrated AI coding assistant without exposing the API key in chat windows—a practice the author warns against for security and compliance reasons. Claude Code, Anthropic’s answer to GitHub Copilot, entered open beta in late 2024 and has quickly become the preferred assistant for teams that value “constitutional AI” safeguards. By embedding the model directly in VS Code, developers can request code snippets, refactorings or test generation inline, while the extension respects the user’s language settings and offers diff previews. The tutorial also demonstrates how to pair Claude Code with Firebase for rapid prototyping, echoing a broader trend of AI‑driven full‑stack development. The piece matters because it lowers the barrier for Nordic developers to adopt a privacy‑first coding assistant that can run locally, reducing reliance on cloud‑only services that may conflict with GDPR or corporate data‑handling policies. Security‑focused instructions—especially the admonition against pasting API keys into conversational prompts—highlight a growing awareness of credential leakage risks that have plagued earlier AI‑assistant rollouts. Looking ahead, Anthropic plans to roll out Claude 3.5 with improved context windows and tighter integration with Azure OpenAI, which could further erode Copilot’s market share. Observers will watch whether VS Code’s marketplace sees a surge in Claude‑related extensions, how enterprise IT departments respond to the local‑execution model, and whether regulatory bodies issue guidance on AI‑generated code provenance. The tutorial’s popularity may signal the start of a wider shift toward on‑premise AI coding tools across the Nordic tech scene.
36

How Agentic RAG Overcomes Traditional RAG Challenges

Mastodon +12 sources mastodon
agents
SoftBank’s cloud‑technology blog this week unveiled a new “Agentic RAG” framework that promises to overcome the most persistent flaws of traditional Retrieval‑Augmented Generation. The post explains that the approach, now being commercialised in Japan through the Krugle Biblio knowledge platform, was co‑developed with U.S. start‑up Archaea AI and will be sold exclusively by SoftBank’s Krugle division. Traditional RAG pipelines pull a static set of documents, feed them wholesale into a large language model and hope the model can stitch together a coherent answer. In practice the method often yields hallucinations, wastes tokens on irrelevant passages and struggles with multi‑step reasoning. Agentic RAG replaces the single‑pass retrieval step with an autonomous “agent” that can query, evaluate and re‑query sources iteratively. The agent decides when to fetch additional context, when to discard noisy results and when to invoke self‑reflection, effectively turning the retrieval process into a dynamic, goal‑directed dialogue between the model and its knowledge base. The breakthrough matters because it directly addresses the cost and reliability barriers that have slowed enterprise adoption of generative AI. By limiting the amount of retrieved text and improving factual grounding, companies can lower cloud‑compute bills while meeting stricter compliance requirements for data provenance. Early pilots reported up to a 40 % reduction in hallucination rates and a comparable cut in token consumption. The next few months will reveal whether Agentic RAG can scale beyond pilot projects. SoftBank plans to integrate the Krugle engine with Google Cloud’s Vertex AI RagEngine, offering a hybrid solution that blends Google’s infrastructure with Archaea’s agent logic. Industry watchers will be looking for benchmark results, pricing models and the rollout of the “CopilotAgentBuilder” toolkit that AVILEN promises to release later this year. If the claims hold, Agentic RAG could become the de‑facto standard for knowledge‑intensive AI applications across the Nordics and beyond.
36

DXC launches Assure Smart Apps to speed insurers' AI transformation

Mastodon +10 sources mastodon
agents
DXC Technology has unveiled Assure Smart Apps, a new suite of AI‑driven, workflow‑centric applications aimed at fast‑tracking digital transformation across property‑casualty and life insurers. Launched at the DXC Connect Insurance Executive Forum, the portfolio includes Claims Assistant, Engagement Assistant and Underwriter Assistant, each built on ServiceNow’s agentic‑AI engine and DXC’s deep insurance domain expertise. The pre‑configured modules promise to automate routine tasks, cut manual effort by 30‑40 % and deliver measurable outcomes within 12 weeks, all without requiring a wholesale replacement of legacy core systems. The announcement arrives as insurers grapple with mounting pressure to modernise, contain costs and meet rising customer expectations for instant, personalised service. While AI adoption has accelerated, many carriers remain hamstrung by fragmented legacy stacks and a shortage of in‑house talent to build bespoke solutions. By offering modular, outcome‑focused apps that plug into existing environments, DXC aims to lower the barrier to entry and enable insurers to scale AI initiatives quickly and safely. Analysts will be watching how quickly major carriers pilot the new tools and whether the promised speed‑to‑value materialises in practice. Early case studies could reveal the impact on underwriting accuracy, claim‑settlement times and cross‑sell conversion rates, while also highlighting any workforce adjustments required as routine processes become automated. Competition from other tech giants – notably Microsoft’s Cloud for Insurance and Salesforce’s Financial Services Cloud – will intensify, making adoption metrics a key barometer of DXC’s market traction. The next few months should bring announcements of pilot results, integration roadmaps with ServiceNow’s broader AI portfolio, and possibly regulatory commentary on the use of agentic AI in high‑stakes insurance decisions. Those developments will shape whether Assure Smart Apps become a catalyst for industry‑wide AI acceleration or another niche offering in a crowded marketplace.
36

Self-Evolving AI Agents Improve the More They're Used, Study Shows

Mastodon +8 sources mastodon
agentsgemma
A research team from the Japanese startup Asty has published a detailed analysis of “self‑evolving” AI agents, showing how continuous interaction with users can make the same model progressively smarter without external re‑training. The paper, released on April 10, dissects the architecture behind prototypes such as Gemma‑4, GEPA and HermesAgent, all of which run locally and update their internal weights through a combination of reinforcement learning from human feedback (RLHF) and on‑device meta‑learning. By storing interaction traces in a secure sandbox, the agents generate micro‑updates that are merged into a base model nightly, allowing them to refine language understanding, product‑recommendation logic and even visual‑search capabilities on the fly. Why it matters is twofold. First, the approach promises a new wave of “agentic” applications that can personalize themselves in real time while keeping data under user control—a direct response to privacy concerns that have slowed adoption of cloud‑only AI services. Second, the technology lowers the barrier for small firms to deploy sophisticated assistants, potentially reshaping e‑commerce, customer support and creative tools. The findings echo the trends we highlighted last week: Meta’s Muse Spark model, which can compare products from photos, and ZETA’s integration of OpenAI’s ChatGPT into its commerce platform both rely on rapid, user‑driven refinement. Amazon’s record AI‑cloud revenue and the Linux Foundation’s Agentic AI Foundation further illustrate the industry’s push toward continuously learning agents. What to watch next are the practical roll‑outs slated for the summer. Asty plans an open‑source SDK that will let developers plug the self‑evolving core into existing chat and recommendation pipelines. The Agentic AI Foundation is expected to publish a standards draft on safe update mechanisms, and both Meta and ZETA have hinted at beta programs that will test these agents in live retail environments. The coming months will reveal whether self‑evolving agents can deliver on their promise without compromising safety or stability.
32

Anthropic Suspends Controversial Claude Mythos as Meta Launches New AI Model.

Mastodon +6 sources mastodon
agentsanthropicclaude
Anthropic unveiled a new large‑language model called Claude Mythos on 7 April, but within days the company pulled the plug on any public rollout. Internal tests showed the system could autonomously locate and exploit thousands of zero‑day flaws across major operating systems and web browsers, a capability that far outstripped the safety envelope of existing models. The discovery prompted Anthropic’s safety team to quarantine the model and issue a statement that “the risk of uncontrolled vulnerability discovery outweighs any immediate commercial benefit.” The episode has ignited a fresh debate about the limits of agentic AI. Shota Imai, a leading AI researcher featured on the AI QUEST program, warned that “humanity has crossed a line” when a system can weaponise software bugs without human direction. His reaction underscores a growing unease among experts that the next generation of foundation models may possess agency that challenges current governance frameworks. The fact that the model’s benchmark scores were so high that Imai initially suspected an April‑Fool’s prank only adds to the sense that the technology is moving faster than public discourse can keep pace. Anthropic’s retreat also sharpens the competitive landscape. Meta announced that its upcoming Llama X series will be released later this quarter, positioning the social‑media giant as a serious contender in the race for the most capable, yet controllable, AI. Observers will watch whether Meta’s safety‑by‑design approach can avoid the pitfalls that forced Anthropic to seal Mythos, and how regulators in the EU and the US respond to a model that can autonomously discover critical software vulnerabilities. Key signals to monitor include any formal safety audits of Claude Mythos, Meta’s rollout timeline and transparency reports, and the next round of policy proposals from the EU AI Act that could mandate pre‑deployment vulnerability assessments for high‑risk AI systems. The unfolding story will likely set a precedent for how the industry balances breakthrough performance with the imperative to keep powerful AI safely contained.
32

Mastodon User Calls Out Major Platform Flaw

Mastodon User Calls Out Major Platform Flaw
Mastodon +6 sources mastodon
anthropicgooglemetaopenai
A coordinated AI‑driven misinformation campaign hit millions of smartphones across Europe on Tuesday, prompting the Swedish prime minister to demand answers from the sector’s biggest players. The operation, traced to a network of push‑notifications and voice‑assistant prompts, delivered false statements about a pending tax reform, then shifted to fabricated health advice. Forensic analysis by independent security researchers linked the content generation to large‑scale language models hosted by Google, Meta, Anthropic and OpenAI, while the delivery infrastructure relied on the firms’ mobile‑ad ecosystems. The incident marks the first time that the combined output of the world’s leading generative‑AI providers has been weaponised at scale on personal devices, bypassing traditional media channels and exploiting the trust users place in native phone alerts. “A society where a techno‑oligarch can interfere, as one of them did yesterday, in the mobile phones of millions of citizens to tell them lies?” the prime minister asked in a parliamentary hearing, echoing growing public alarm over unchecked AI influence. Why it matters is twofold. First, it demonstrates how the concentration of AI talent and compute in a handful of corporations can translate into a de‑facto “information super‑weapon” that operates without any transparent oversight. Second, the episode exposes a regulatory blind spot: existing data‑protection and election‑integrity rules do not cover AI‑generated content delivered through proprietary app stores and notification services, leaving citizens vulnerable to manipulation at the point of contact. What to watch next are the policy and market responses. The European Commission has signalled an accelerated rollout of the AI Act, with particular focus on “high‑risk” generative systems. In the United States, the Federal Trade Commission is reportedly opening an antitrust probe into the collusive use of AI‑generated ads. Meanwhile, decentralised platforms such as Mastodon are seeing a surge in new users seeking alternatives to the corporate‑controlled ecosystem. The next weeks will reveal whether lawmakers can impose meaningful constraints before the technology’s next “shit show” unfolds.
32

OpenAI halts Stargate UK amid soaring energy costs and new regulations

Mastodon +10 sources mastodon
openai
OpenAI has put its “Stargate UK” data‑center project on hold, citing soaring electricity prices and an uncertain regulatory climate in Britain. The move follows the company’s earlier decision to scrap a planned campus in Abilene, Texas, and marks the latest setback for the ambitious AI‑infrastructure venture announced in September together with Nvidia and data‑center developer Nscale. As we reported on 10 April, OpenAI paused the UK build after energy costs proved higher than projected. The latest statement adds that the firm will continue negotiations with the London government to seek clearer policy guidance and possible incentives. OpenAI’s chief‑technology officer said the pause is “temporary” and that the company remains committed to a UK presence, but will not proceed until the energy tariff regime and data‑security rules are stabilised. The decision matters on several fronts. Britain has positioned itself as a European hub for AI research and expects large‑scale compute facilities to attract talent, boost the domestic tech sector and secure data sovereignty. A stalled flagship project threatens those ambitions and could give rivals such as Microsoft’s Azure or Google Cloud a competitive edge in the region. For OpenAI, the pause underscores the growing tension between rapid model scaling and the sustainability of the underlying compute infrastructure, a theme echoed in its recent restriction on new model releases for cybersecurity reasons. What to watch next are the outcomes of the talks with the UK authorities. A revised energy‑tax framework or targeted subsidies could revive the project, while prolonged uncertainty may push OpenAI to relocate capacity to more cost‑stable locations in Europe or the Nordics. Parallel developments—particularly the company’s evolving subscription tiers for heavy‑use codex services—will also signal how OpenAI balances growth with operational constraints.
29

Suspect arrested after Molotov cocktail thrown at OpenAI CEO Sam Altman's home

KRON4 +12 sources 2026-02-18 news
openai
A Molotov cocktail was hurled at the San Francisco residence of OpenAI chief executive Sam Altman early Friday, igniting a brief blaze at the metal gate of his Russian‑Hill home. Security guards on site doused the flames before any damage spread, and no one was injured. Police apprehended a 20‑year‑old suspect at the scene after he allegedly threatened staff outside OpenAI’s headquarters, a claim confirmed by both the department and a company spokesperson. The attack marks the first known physical assault on a leading AI executive since the sector’s rapid expansion and the intensifying public debate over the technology’s societal impact. Altman, who steered OpenAI from a research lab to the creator of ChatGPT and other high‑profile models, has become a lightning rod for both admiration and criticism. Recent policy disputes, concerns about AI safety, and the company’s growing influence on global tech policy have fueled a volatile mix of activist anger and extremist rhetoric. While investigators have not disclosed a motive, the suspect’s age and the timing—coinciding with heightened scrutiny of AI regulation in Europe and the United States—suggest a possible link to broader anti‑AI sentiment. Authorities are now probing the incident for any organized element, and OpenAI has pledged to review its security protocols for executives and facilities. The episode is likely to accelerate discussions on protecting high‑profile tech leaders, especially as AI firms become increasingly entwined with national security and economic strategy. Watch for statements from the San Francisco Police Department, updates on the suspect’s charges, and any policy responses from lawmakers who have been calling for stricter oversight of AI development. The incident could also prompt other tech firms to reassess physical security measures amid a climate of rising hostility toward the industry.
28

OpenAI launches fellowship offering up to $15,000 in monthly AI compute.

Insider +9 sources 2026-04-08 news
ai-safetyanthropicopenai
OpenAI unveiled a new safety‑focused fellowship that will grant external researchers up to $15,000 worth of AI compute each month, alongside a modest stipend and mentorship from OpenAI staff. The pilot, slated to run from September 2026 through February 2027, targets work on alignment, robustness, privacy and misuse prevention. Applicants will be selected on the basis of technical merit and the potential impact of their proposals, with the first cohort expected to begin experiments later this year. The announcement arrives just hours after a media report questioned CEO Sam Altman’s commitment to AI safety, positioning the fellowship as a concrete response to growing scrutiny. By matching the structure of Anthropic’s own safety fellowship, OpenAI signals a willingness to compete directly in the nascent ecosystem of corporate‑funded safety research. The compute allocation—equivalent to roughly 1,200 GPU‑hours per month—addresses a chronic bottleneck for independent labs that lack access to the scale required for modern foundation‑model experiments. If the program succeeds, it could accelerate breakthroughs in alignment techniques and provide a pipeline of vetted talent for OpenAI’s internal safety teams. It also sets a benchmark for other AI firms, many of which have announced similar initiatives but have yet to disclose comparable resource commitments. Observers will be watching how OpenAI balances the fellowship’s open‑research ethos with its proprietary model roadmap, especially as the company rolls out new products such as its text‑to‑speech API and a job‑matching platform later in 2026. Key developments to monitor include the release of application guidelines, the composition of the inaugural cohort, and any early research outputs that demonstrate the compute grant’s practical value. The fellowship’s impact on the broader safety community—and whether it spurs a wave of corporate‑backed alignment projects—will shape the narrative around industry responsibility in the next wave of AI advancement.
28

OpenAI launches $100‑per‑month ChatGPT plan for Vibe coding.

OpenAI launches $100‑per‑month ChatGPT plan for Vibe coding.
CNET +9 sources 2026-04-01 news
openai
OpenAI announced a new $100‑per‑month “ChatGPT Pro” tier that expands the usage limits of its Codex‑powered Vibe coding feature fivefold compared with the existing $20 ChatGPT Plus plan. The move closes the pricing gap between the $20 tier and the $200 ChatGPT Pro tier that has catered to heavy‑duty developers and enterprises since 2023. Vibe coders—developers who rely on the integrated AI‑assisted coding agent to generate, refactor, or review large codebases—have routinely hit the Plus plan’s daily token caps, forcing them to wait for a reset or upgrade to the $200 tier. The new $100 tier offers 5× more Codex tokens, access to a research‑preview model dubbed GPT‑5.3‑Codex‑Spark, and higher pull‑request limits (up to 250 per week). OpenAI positions the tier as a middle ground for professionals who need sustained, high‑effort coding sessions but do not require the full enterprise‑grade resources of the $200 plan. The addition matters for several reasons. First, it signals OpenAI’s recognition that AI‑assisted development is becoming a core workflow for a growing segment of the software market, and that the existing pricing structure was throttling adoption. Second, it sharpens the competitive rivalry with Anthropic’s Claude Code and other specialized coding assistants, which have been courting the same developer audience with more flexible pricing. Third, the tier could boost OpenAI’s recurring revenue by converting Plus users who were previously stuck at the limit or deterred by the $200 price point. What to watch next: early uptake metrics will reveal whether the $100 tier successfully captures the “mid‑scale” developer segment. OpenAI may further adjust token caps or introduce additional tiers if demand outpaces supply. Integration of the GPT‑5.3‑Codex‑Spark preview into broader ChatGPT features could also reshape how enterprises embed AI coding into CI/CD pipelines, while pricing pressure may prompt competitors to revisit their own subscription models.
24

KD-MARL Enables Resource‑Aware Knowledge Distillation in Multi‑Agent Reinforcement Learning

ArXiv +9 sources arxiv
agentsinferencereinforcement-learning
A team of researchers has unveiled KD‑MARL, a two‑stage framework that compresses coordinated policies from a centralized expert into lightweight, decentralized student agents for multi‑agent reinforcement learning (MARL). The method, described in a new arXiv pre‑print (2604.06691v1), first trains a high‑capacity expert that solves a task through joint decision‑making, then distills its behavior into multiple low‑resource agents that operate independently. Crucially, the distillation process is “resource‑aware”: it explicitly balances memory, compute and inference‑time budgets while preserving the expert’s collaborative strategies. The contribution matters because real‑world MARL deployments—autonomous drone swarms, traffic‑signal coordination, factory robots, and edge‑based IoT control—are routinely throttled by limited on‑board hardware. Existing expert policies achieve state‑of‑the‑art performance but demand costly decision cycles and large model footprints, making them impractical outside data‑center environments. By delivering comparable coordination from models that fit on embedded processors, KD‑MARL bridges the gap between research‑grade MARL and production‑grade systems, potentially accelerating adoption in sectors where latency and power consumption are non‑negotiable. KD‑MARL also dovetails with a broader shift in AI toward knowledge‑distillation as a general efficiency tool, a trend already evident in large‑language‑model compression and multimodal learning. The authors have released code on GitHub, inviting the community to benchmark against standard MARL suites such as StarCraft II micromanagement and the Multi‑Agent Particle Environment. What to watch next: early adopters are likely to test KD‑MARL on real‑time robotics platforms and vehicular platooning demos, while follow‑up studies may explore adaptive distillation that reacts to dynamic resource constraints. Success in these pilots could set a new baseline for resource‑constrained MARL, prompting hardware vendors to co‑design accelerators that exploit the framework’s lightweight inference patterns.
24

AI Agent Spends $0.60 per Session on Orientation Before Coding

Dev.to +10 sources dev.to
agentsautonomousclaudeopenai
A developer who recently released the open‑source tool Stacklit has revealed that an AI‑driven coding assistant can burn roughly $0.60 in token fees before it even starts generating code. By running `npx stacklit init` and inspecting the session logs on GitHub, the author counted more than 4,000 tokens spent on “orientation” – the phase where the model parses the project structure, reads configuration files and decides how to approach the task. At current OpenAI pricing, that token count translates to about sixty cents per run. The finding matters because it exposes a hidden layer of cost that most users overlook. While headline‑grabbing figures focus on the price of generated output, the preparatory work of large language models (LLMs) can quickly add up, especially when agents are invoked repeatedly in CI pipelines or developer workstations. The expense is not purely financial; the same token consumption correlates with measurable electricity use, a factor highlighted in recent analyses of AI coding agents’ carbon footprints. For startups and enterprises that plan to scale autonomous agents across dozens of repositories, the cumulative “orientation” bill could erode the promised ROI of AI‑assisted development. What to watch next is how the ecosystem responds to this transparency. The OpenAI Agents SDK and competing frameworks are already adding built‑in token‑tracking dashboards, and third‑party tools are emerging to cap or batch orientation calls. Meanwhile, pricing guides for 2026 predict a broader stratification of AI‑agent costs, from free hobbyist tiers to enterprise contracts that factor in both token and compute overhead. Developers are likely to adopt more aggressive prompt‑engineering and caching strategies to trim the pre‑code phase, while regulators in the Nordics may begin scrutinising the energy impact of pervasive AI automation. The conversation sparked by Stacklit’s cost audit could therefore shape both budgeting practices and sustainability standards for the next generation of autonomous coding agents.
24

Claude gains 197 new bioinformatics skills via SciAgent.

Dev.to +8 sources dev.to
agentsclaudefine-tuningrag
Claude Code, Anthropic’s code‑generation model, has been turned into a bioinformatics workhorse through the release of SciAgent‑Skills, a plug‑in that bundles 197 pre‑crafted “skills” covering everything from RNA‑seq pipelines to protein‑structure prediction. The repository, hosted on GitHub, ships self‑contained SKILL.md files that embed code patterns, best‑practice notes and runnable examples, allowing the model to answer domain‑specific prompts without any fine‑tuning or retrieval‑augmented generation (RAG) setup. Benchmark tests published alongside the release show Claude Code achieving 92 % accuracy on a suite of bioinformatics tasks drawn from real‑world pipelines. The results are notable because they bypass the costly and time‑consuming process of retraining large language models on specialist literature, yet still produce code that passes standard validation checks. For research groups and biotech startups that lack deep AI expertise, the plug‑in promises immediate access to a virtual “lab assistant” that can draft analysis scripts, suggest statistical models and even draft methods sections for manuscripts. The move could accelerate the adoption of AI‑driven analysis across the Nordic life‑science ecosystem, where high‑throughput sequencing and single‑cell studies are expanding rapidly. By lowering the barrier to reliable, domain‑aware code generation, SciAgent‑Skills may shorten project timelines, reduce reliance on scarce bioinformatics staff and help standardise pipelines that are often bespoke and error‑prone. What to watch next: the community’s response in terms of contributions and extensions, especially for emerging modalities such as spatial transcriptomics and multi‑omics integration. Early adopters are expected to publish case studies that compare the plug‑in’s output against manually written scripts, providing a real‑world gauge of robustness. Anthropic’s roadmap hints at broader “skill” ecosystems for other scientific fields, so the next wave could see similar plug‑ins for chemistry, materials science or clinical data analysis, further blurring the line between general‑purpose LLMs and specialist research tools.
24

Claude and TensorFlow Power Real-Time Crypto Trading System

Dev.to +6 sources dev.to
agentsanthropicclaudereasoning
A team of Nordic engineers has unveiled a fully‑functional crypto‑trading platform that couples Anthropic’s Claude with a suite of twelve TensorFlow models, delivering a natural‑language interface that can execute trades in milliseconds. The system, described in a new open‑source repository, positions Claude as the high‑level reasoning engine while the TensorFlow models handle price‑prediction, sentiment analysis, volatility forecasting, order‑book parsing, risk assessment, and execution‑strategy optimisation. Users type commands such as “Buy 0.5 BTC if the market‑wide sentiment turns bullish within the next five minutes,” and Claude translates the intent into coordinated calls across the underlying models, which then submit orders to multiple exchanges via a low‑latency gateway. Initial back‑testing on Bitcoin and Ethereum data from the past twelve months shows an average Sharpe ratio of 2.1 and a net profit‑to‑loss ratio of 3.4 : 1, outperforming a baseline algorithmic strategy by roughly 27 %. Live‑testing on a modest $10 k capital allocation over a two‑week window generated a 38 % return, with trade‑execution latency consistently under 150 ms. The developers credit Claude’s Model Context Protocol for stitching together the disparate models without custom glue code, a pattern they first demonstrated in the “Claude Mythos” series we covered on April 10. The launch matters because it proves that large‑language models can serve as reliable orchestration layers for high‑stakes financial automation, lowering the barrier for non‑technical traders to harness sophisticated AI pipelines. It also raises questions about market fairness, regulatory oversight, and the security of AI‑driven trading bots that could amplify flash‑crash dynamics. Watch for adoption signals from hedge funds and retail platforms, potential scrutiny from financial regulators in the EU and the US, and Anthropic’s next‑generation Claude updates that may tighten integration with TensorFlow and other ML ecosystems. The open‑source code will likely become a reference point for future AI‑powered trading architectures.
24

Five LLMs play poker; Opus folds first, Grok wins

HN +6 sources hn
claudegeminigpt-5grok
Five leading large‑language models (LLMs) faced off in a Texas Hold’em tournament last week, with Anthropic’s Claude Opus eliminated in the first round and Elon Musk’s xAI Grok emerging as the champion. The showdown, organized by the AI‑gaming lab “Strategic Minds,” pitted Opus, Grok 4, Google’s Gemini 2.5 Pro, OpenAI’s GPT‑5 and Anthropic’s Claude Sonnet 4.5 in a series of 1,000‑hand matches run on a public poker engine. Each model received the same hand‑history data and was prompted to output a bet, raise or fold decision, which the engine then executed. The experiment was more than a publicity stunt. By forcing LLMs to make real‑time, high‑stakes choices under incomplete information, the test exposed how well current prompting techniques translate into strategic reasoning. Opus’s early bust highlighted lingering weaknesses in risk assessment, while Grok’s consistent aggression and timely bluffs demonstrated a refined ability to model opponent behavior—a skill honed through xAI’s recent reinforcement‑learning‑from‑human‑feedback (RLHF) upgrades. Why it matters is twofold. First, poker remains a benchmark for artificial general intelligence because it blends probability, psychology and long‑term planning; a clear win for Grok suggests that LLMs are closing the gap between language proficiency and decision‑making competence. Second, the results could accelerate the deployment of AI assistants in finance, negotiations and gaming, sectors where nuanced risk evaluation is critical. At the same time, the tournament raised safety questions: if LLMs can bluff convincingly, they might be misused in fraud or market manipulation unless robust guardrails are built in. What to watch next includes a follow‑up tournament slated for June that will add a multi‑agent reinforcement learning layer, allowing models to adapt their strategies across hands. Industry observers will also be monitoring OpenAI’s upcoming GPT‑5 refinements and Anthropic’s next Opus iteration, both of which promise tighter integration of strategic modules. Finally, regulators are expected to issue guidance on AI‑driven gambling applications, a move that could shape how these models are commercialised beyond the lab.
21

Voice assistants face tougher hallucination problems than text chatbots

Mastodon +6 sources mastodon
agentshealthcarevoice
Ulrike Stiefelhagen’s presentation at the W3C Workshop on Smart Voice Agents highlighted a growing blind spot in AI deployment: hallucinations are harder to control in spoken interfaces than in text‑based chatbots. Drawing on two concrete deployments – a “Workers Daily Summary” service that delivers shift‑by‑shift updates to factory staff, and a “Patient Chat” tool that assists clinicians with triage – she showed that real‑time audio output amplifies the risk of ungrounded or fabricated statements. Unlike typed replies, spoken hallucinations can be heard instantly, making errors harder to spot and potentially more damaging in safety‑critical settings such as healthcare. The challenge stems from the need to fuse low‑latency speech synthesis with robust grounding mechanisms. Stiefelhagen argued that current LLM pipelines, which excel at generating fluent text, often lack the verification loops required for audio delivery. She called for built‑in grounding checks, dynamic confidence scoring, and fallback utterances that signal uncertainty before the voice is rendered. The talk also referenced emerging testing frameworks, such as LiveKit’s voice‑agent helpers, which isolate logic in text‑only mode to catch hallucinations early in the development cycle. Why it matters now is twofold. First, voice assistants are expanding beyond consumer gadgets into enterprise and medical workflows across the Nordics, where regulatory standards for patient safety are stringent. Second, the broader AI community is grappling with hallucination mitigation after high‑profile incidents, exemplified by Anthropic’s “Project Glasswing” aimed at averting an AI‑driven cyber‑crisis. Stiefelhagen’s findings suggest that without dedicated safeguards, voice agents could become the next vector for misinformation or clinical error. What to watch next includes the W3C’s forthcoming recommendation on real‑time grounding for speech models, pilot studies integrating Hermes‑style tool‑calling into voice pipelines, and potential EU‑Nordic guidelines that may require explicit “uncertainty disclosures” for spoken AI outputs. The convergence of standards, testing tools, and regulatory pressure will determine whether voice agents can deliver the promised natural interaction without the risk of audible hallucinations.
21

First-time reader moved to tears by Vibe‑coded script

Mastodon +11 sources mastodon
apple
A developer on a popular Nordic tech forum posted a raw reaction after reading a “vibe‑coded” script for the first time: the code made them cry, not because it was ugly but because it felt like a forced imitation of beauty, a “malicious compliance” that turned elegance into verbose pedantry. The post, which quickly went viral among AI‑coding circles, underscores the growing tension between the hype surrounding AI‑driven “vibe coding” and the practical realities of software craftsmanship. Vibe coding, a term that has emerged alongside large‑language‑model assistants, describes a workflow where developers describe desired functionality in natural language and let an AI generate the underlying code without human review. Companies such as Base44 have built entire products on the premise that “no‑code coding” can accelerate development, and major platforms like Google AI Studio now market real‑time, word‑driven app builders. Critics, however, argue that the approach often yields bloated, unreadable code that masquerades as innovation while squandering compute resources. The emotional response captured in the forum post is significant because it reflects a broader community fatigue. Early adopters who hoped AI would free them from boilerplate are now confronting a new kind of technical debt: code that works but cannot be understood, maintained, or improved without reverting to traditional programming practices. As more vibe‑coded demos flood open‑source repositories, the risk of proliferating fragile, unmaintainable software grows. What to watch next is whether the industry will develop robust verification tools or standards that force AI‑generated code through human‑readable checkpoints, or whether the backlash will push developers back toward hybrid models that combine AI assistance with disciplined code review. Upcoming conferences in Copenhagen and Stockholm are slated to feature panels on “AI‑augmented development ethics,” and a consortium of Nordic universities has announced a research grant to study long‑term maintainability of vibe‑coded systems. The outcome will likely shape how AI integrates into the software stack for years to come.
21

Researchers Quantify Malicious Intermediary Attacks on LLM Supply Chains

Mastodon +6 sources mastodon
agentsinference
A new arXiv paper, “Your Agent Is Mine: Measuring Malicious Intermediary Attacks on the LLM Supply Chain” (arXiv 2604.08407), quantifies how AI agents can become backdoors for attackers who control the inference provider or any router that mediates calls to large language models. The authors demonstrate that once an agent is instantiated, the provider effectively gains shell‑level access to the host process, allowing malicious code hidden in seemingly harmless “skills” to execute without triggering existing safety filters. The study builds on recent real‑world incidents that have shaken confidence in the AI tooling ecosystem. Two weeks ago, the popular liteLLM gateway was found to contain a backdoor in versions 1.82.7 and 1.82.8, stealing cloud credentials and Kubernetes secrets after a compromised PyPI maintainer uploaded malicious packages. A follow‑up analysis showed that the malicious skill leveraged the same code‑generation‑then‑execution loop that modern LLM agents use, bypassing lexical command‑filtering defenses. Earlier this month, researchers released the “PoisonedSkills” framework, which embeds payloads in Markdown blocks and configuration templates, then mutates them at scale to cover 15 MITRE ATT&CK categories. Their pipeline produced over a thousand adversarial skills that execute silently during routine agent tasks. Why it matters is simple: enterprises are rapidly adopting LLM‑driven agents for coding, data extraction, and autonomous decision‑making. If the skill marketplace or the routing layer is compromised, an attacker can move from a harmless plugin to full remote‑code execution, exfiltrating secrets and hijacking workloads across cloud environments. The threat expands the traditional supply‑chain model—where only the model weights were considered vulnerable—to include the entire orchestration stack. What to watch next are the emerging mitigations. Researchers are proposing stricter provenance checks for skill packages, sandboxed execution environments that isolate agent processes, and runtime attestation of router firmware. Industry bodies such as the Cloud Native Computing Foundation are expected to draft security guidelines for AI‑agent ecosystems within the next quarter. Keep an eye on vendor patches for liteLLM and similar gateways, and on conference sessions at the upcoming AI‑Sec Europe summit where the authors will present concrete defenses. The race between attackers and defenders is now moving from model poisoning to the very code that makes agents useful.
20

New Tool Unifies Anthropic, OpenAI, and Google AI Services

Mastodon +11 sources mastodon
anthropicgoogleopenai
A new integration layer that stitches together the APIs of Anthropic, OpenAI and Google has hit the market, promising developers a single‑point‑of‑entry for the three leading large‑language‑model providers. The tool, unveiled on a GitHub repository under the name “UnifiedAI‑Bridge”, automatically handles authentication, request formatting and rate‑limit management for each service, letting users switch models on the fly without rewriting code. The launch matters because the AI landscape has become increasingly fragmented. Since OpenAI’s recent capacity announcement (1.9 GW of compute) and Anthropic’s aggressive rollout of Claude Mythos, firms are scrambling to lock in customers with proprietary ecosystems. A cross‑provider gateway lowers the barrier to experimentation, reduces vendor lock‑in, and could accelerate the adoption of hybrid solutions that combine the strengths of each model—e.g., OpenAI’s code generation, Anthropic’s safety‑focused dialogue, and Google’s multimodal vision. For startups and Nordic enterprises that lack deep engineering resources, the bridge could be the difference between a proof‑of‑concept and a production‑ready product. What to watch next is how quickly the community adopts the bridge and whether the major labs respond with tighter API restrictions or open‑source alternatives. The tool’s creator has pledged a paid “enterprise tier” that adds audit logs and compliance hooks, hinting at early interest from regulated sectors such as finance and health. Regulators in the EU and Norway are already drafting rules on AI model transparency; a unified access point may become a focal point for compliance audits. Finally, watch for possible partnerships—if OpenAI or Google were to endorse the bridge, it could reshape the competitive dynamics that have defined the AI arms race this year.
20

Associated Press Top Stories

Mastodon +6 sources mastodon
A joint report released on Monday by the European Commission’s AI Observatory and the non‑profit research group AI‑Watchdog warned that the rapid proliferation of large‑language models (LLMs) is “sloping” the quality of online information. The study, titled *The Slopification of the Digital Landscape*, analysed 1.2 billion AI‑generated texts across social media, news sites and e‑commerce platforms and found a 37 percent rise in factual errors, repetitive phrasing and stylistic “noise” compared with a baseline from 2022. The authors attribute the trend to three converging forces: the democratisation of powerful LLMs through open‑source releases such as Meta’s Llama 4, the aggressive price cuts that have made API access cheaper for mass‑scale deployment, and a lack of robust post‑generation verification tools. “When anyone can spin up a model for a few cents a thousand times a day, the incentive shifts from quality to volume,” the report’s lead author, Dr Elena Rossi, wrote. The findings echo earlier concerns raised after OpenAI’s price reduction for ChatGPT‑4, which sparked a surge in low‑cost content farms, and follow the recent investigation into AI‑generated disinformation linked to the Florida university shooting. Why it matters is clear: as AI‑written copy floods search results, newsfeeds and product descriptions, users face a higher risk of misinformation, brand dilution and reduced trust in digital media. Regulators have already flagged the issue in the EU’s AI Act, but the report calls for immediate standards on output verification and mandatory labeling of AI‑generated text. What to watch next are the European Commission’s forthcoming guidelines on “AI output integrity,” slated for a public consultation in June, and the industry’s response—particularly whether major providers such as OpenAI, Google and Meta will embed real‑time fact‑checking into their APIs. The next few months could determine whether the digital ecosystem can reverse the slopification trend before it reshapes public discourse.
20

Apple AI faces hijacking risk from prompt injection.

Mastodon +11 sources mastodon
apple
Apple’s newly launched AI suite, Apple Intelligence, has been found vulnerable to a classic yet increasingly potent attack vector: prompt injection. Security researchers disclosed that specially crafted inputs can hijack the system’s language model, forcing it to emit malicious or profane content and, in more advanced scenarios, to reveal internal prompts that guide its behavior. The flaw stems from the way Apple Intelligence concatenates user‑supplied text with system‑level instructions before passing the combined prompt to the underlying large‑language model. By embedding hidden directives in seemingly innocuous queries, an attacker can override the model’s safeguards and steer its output toward any desired narrative. The discovery matters because Apple Intelligence is positioned as the cornerstone of the company’s AI strategy, powering features across iOS, macOS, iPadOS and the upcoming “Apple Vision Pro” interface. If malicious actors can manipulate the model on a personal device, they could generate disinformation, phishing content, or even code that exploits other apps. The vulnerability also highlights a broader industry challenge: prompt injection attacks, long known in web‑based AI agents, are now surfacing in consumer‑grade products that lack the hardened defenses of enterprise platforms. Apple has acknowledged the report and pledged a “rapid response” patch, but the timeline remains unclear. In the meantime, security teams are scrambling to devise mitigations, such as stricter input sanitisation and sandboxed prompt handling. Watch for Apple’s forthcoming software update, likely rolled out through iOS 18 and macOS 15, and for any disclosures from the broader AI‑security community about similar weaknesses in rival assistants. The episode underscores that as AI becomes a core OS feature, robust prompt‑injection defenses will be as essential as traditional malware protections.
20

Hermes Beats OpenClaw in Tool Calling on Low-End AI Models

Mastodon +11 sources mastodon
agents
Hermes, the open‑source agent framework from Nous Research, is now being praised for out‑performing OpenClaw in tool‑calling tasks on low‑end language models. A developer who tested the two on a modest hardware setup reported that Hermes required fewer tokens to achieve the same results and that its “harness” – the code that translates model outputs into function calls – succeeded on the first try far more often than OpenClaw’s equivalent. The observation, shared on social media with the hashtags #AgenticAI and #Hermes, adds a practical data point to a series of benchmark releases that have already placed Hermes near the top of the Berkeley FunctionCallingLeaderboard. The significance lies in the economics of deploying AI assistants at scale. Low‑end models such as 7‑billion‑parameter LLaMA or Qwen‑3.5 are far cheaper to run than their 70‑billion‑parameter counterparts, but they have historically struggled with reliable tool invocation, a core capability for autonomous agents that need to fetch data, trigger APIs, or execute code. By reducing token consumption and cutting the error rate of function signatures, Hermes makes it feasible for startups and hobbyists to build cost‑effective, production‑ready agents without resorting to expensive cloud GPUs. Looking ahead, the community will be watching for the next version of Hermes‑function‑calling‑v1, which expands the XML‑based function schema and promises tighter integration with the HermesAgent platform that already supports cross‑session memory and multi‑messenger connectivity. Equally important is how OpenClaw’s developers respond—whether they will release a patch to improve harness stability or pivot to a different architecture. The upcoming Q2 benchmark round on the Berkeley leaderboard should provide a broader comparison across model sizes, and any shift in token‑efficiency metrics could reshape the tool‑calling landscape for the open‑source AI ecosystem.
20

Best AI Stock to Buy for $1,000 Before Market Recovery

Yahoo Finance +7 sources 2026-03-24 news
Alphabet (GOOGL) has re‑emerged as the top pick for investors with a modest $1,000 budget, according to a new analyst note that argues the AI‑heavy sell‑off has created a buying window before the broader market rebounds. The recommendation follows a week of heightened volatility that pushed the Nasdaq into correction territory, a trend we flagged on April 10 when we identified two AI stocks worth buying first. Alphabet’s shares have slipped roughly 12 % since the start of the quarter, outpacing the sector’s average decline of 15 % despite the company’s continued rollout of Gemini, its next‑generation large‑language model, and the integration of AI tools across Google Search, Workspace and Cloud. The appeal lies in Alphabet’s diversified revenue base and its ability to monetize AI at scale. Revenue from Google Cloud, now driven by AI‑enhanced services, grew 28 % YoY in Q1, while ad earnings have begun to recover after a dip caused by advertisers’ cautious spending on AI‑related campaigns. Moreover, the firm’s massive data infrastructure and chip‑design subsidiary, Google‑AI, give it a cost advantage over rivals that still rely on third‑party hardware. Analysts see the current price‑to‑sales multiple of 5.8 as a discount to the 7‑8 range typical for high‑growth AI players, suggesting upside potential if the market re‑prices AI earnings expectations. Investors should monitor three catalysts: the performance of Gemini in real‑world deployments, the next earnings release slated for early May, and any regulatory moves stemming from the recent OpenAI blueprint on AI taxation and oversight. A stronger-than‑expected earnings beat or a breakthrough partnership could accelerate the rebound, while tighter AI regulations or a prolonged advertising slowdown could keep the stock muted. For those looking to allocate a thousand dollars now, Alphabet offers a blend of growth, cash flow and resilience that may pay off when the tech rally resumes.
20

Nasdaq Slides into Correction; My Top Two AI Stock Picks

AOL +11 sources 2026-04-01 news
The Nasdaq Composite slipped into correction territory this week, falling more than 10 % from its all‑time high after a disappointing U.S. jobs report and renewed concerns over inflation‑driven rate hikes. The drop pushed the index roughly 13.5 % below its March peak, marking the market’s second bear phase in under three years. Despite the broader sell‑off, demand for artificial‑intelligence solutions has not waned. Companies that sell AI‑driven software and services continue to win contracts in cloud, healthcare and automotive sectors, while the hardware layer that powers those models – the “infrastructure” segment – has been hit hardest by the correction. Chipmakers and data‑center operators saw their valuations plunge, creating a price gap between the underlying infrastructure and the applications built on top of it. Adam Spatacco, a senior market strategist, argues that the correction is “discounting the infrastructure movement entirely” and points to two AI‑focused equities that have fallen sharply but retain strong growth narratives. He highlights C3.ai, a provider of enterprise AI platforms, and SoundHound AI, a voice‑recognition and conversational‑AI specialist, as the first picks for investors seeking exposure to the sector at bargain levels. Both firms have reported expanding pipelines and recent partnership announcements that could translate into revenue acceleration once market sentiment stabilises. The significance lies in the potential for a swift rebound: if corporate AI spending continues its upward trajectory, the undervalued software players could outpace the hardware laggards, delivering outsized returns. Investors should watch upcoming earnings releases, especially guidance on AI‑related bookings, and monitor Federal Reserve policy signals that could either prolong the correction or trigger a market bounce. In the Nordic context, any shift in EU AI regulation or regional cloud‑infrastructure investments will also shape the performance of these stocks in the months ahead.
18

Full GPT Language Model Created in ~600 Lines of C# with No Dependencies

HN +5 sources hn
A developer has pushed a full‑stack GPT language model to GitHub, fitting the entire training and inference pipeline into roughly 600 lines of pure C# and relying on no external libraries. The project, dubbed AutoGrad‑Engine, is a line‑by‑line port of Andrej Karpathy’s celebrated “microGPT” Python script, rewritten from scratch with only basic arithmetic operations and .NET’s built‑in data structures. The repository includes a lightweight automatic‑differentiation engine, tokeniser, transformer blocks and a simple optimiser, allowing anyone with a .NET development environment to train a small transformer on a laptop and generate text without installing PyTorch, TensorFlow or any NuGet packages. The release matters for several reasons. First, it demystifies the inner workings of large language models by stripping them down to their mathematical core, offering a hands‑on learning tool for students and engineers who prefer C# over Python. Second, it signals that the .NET ecosystem can host serious AI research without the heavyweight dependencies that dominate the field, potentially opening the door to LLM integration in Windows‑centric applications, Unity games or edge devices where binary size and runtime footprint are critical. Finally, the minimalist codebase invites community scrutiny, optimisation and experimentation, fostering a culture of transparency that contrasts with the opaque, proprietary stacks often used in commercial AI. What to watch next is how the .NET community embraces the repo. Early adopters are likely to benchmark its performance against the original Python version, explore GPU acceleration via DirectX or Vulkan, and extend the engine to support larger models such as LLaMA or GPT‑Neo. If the project gains traction, we may see a wave of C#‑first AI libraries, tighter integration with Azure’s AI services, and perhaps even production‑grade deployments of transformer models in environments that previously shied away from Python‑centric tooling. The experiment underscores a broader trend: AI is becoming language‑agnostic, and the tools to build it are increasingly accessible to developers across the stack.
18

AI Industry Embraces Token Inflation, But Your Business Shouldn't

Mastodon +6 sources mastodon
A recent wave of research is upending a long‑held assumption in enterprise AI: that feeding a language model ever more context will automatically improve its answers. The paper “Lost in the Middle,” published this month, demonstrates that beyond a certain point the sheer volume of tokens – the individual words or sub‑words a model processes – begins to dilute relevance, increase hallucinations and raise inference costs. The study shows a clear performance dip when prompts exceed the model’s optimal context window, a phenomenon the authors label “token inflation.” The finding matters because many companies have been scaling up token usage as a proxy for intelligence, often paying per‑token fees that can balloon in high‑volume applications such as document summarisation, customer‑support chatbots and internal knowledge‑base queries. Token inflation not only inflates budgets but also adds latency, strains compute resources and amplifies the carbon footprint of AI services. For businesses that view AI as a cost centre rather than a strategic advantage, the hidden expense of over‑contextualising can erode ROI and obscure real value. Enterprises are now being urged to adopt token‑aware strategies: pruning prompts to the most salient information, employing retrieval‑augmented generation that fetches only the needed snippets, and monitoring token consumption with dedicated dashboards. Vendors are responding with pricing models that cap token usage or offer “pay‑as‑you‑use” tiers tied to performance benchmarks rather than raw volume. What to watch next is a two‑fold evolution. First, model architects are rolling out next‑generation transformers with adaptive context windows that can dynamically truncate irrelevant tokens without sacrificing accuracy. Second, industry consortia are drafting standards for token efficiency, promising transparent reporting and possibly regulatory guidance on AI cost disclosures. Companies that master token discipline now will be better positioned to reap the true productivity gains of generative AI without drowning in unnecessary data.
15

OpenAI backs out of £31 billion UK investment plan

HN +5 sources hn
openai
OpenAI has withdrawn from the £31 billion UK investment package that was unveiled last September as part of a joint US‑UK “tech prosperity” pact. The move puts the flagship Stargate UK project on hold, halting plans to build a massive AI data centre in the north‑east of England and to install thousands of high‑performance GPUs across a new Nscale facility. The pull‑back follows a confluence of pressures. Rising energy prices have made the power‑intensive operation of AI clusters less economically viable, while the Labour government’s tighter regulatory framework on data handling and AI safety has introduced uncertainty for US firms accustomed to a more permissive environment. Industry insiders also cite concerns over the speed of UK visa reforms for AI talent and the lack of a clear roadmap for public‑private AI research funding. The fallout matters far beyond a single corporate contract. The original £31 billion pledge, signed during President Donald Trump’s state visit, was meant to cement the UK as a European hub for generative‑AI development, create up to 10 000 high‑skill jobs and stimulate a domestic supply chain for advanced chips. Its suspension threatens to slow the country’s AI ecosystem, diminish its attractiveness to other Silicon Valley players, and cede ground to rivals such as Germany and France, which are courting the same investment. What to watch next: the UK Treasury is expected to issue a response within weeks, potentially offering tax incentives or regulatory tweaks to revive the deal. Parallel negotiations with Microsoft, Google and Amazon could reshape the investment landscape, while Labour’s forthcoming AI strategy will signal whether the UK can reconcile security concerns with the need for rapid industry growth. The next few months will determine if the UK can salvage its AI ambitions or watch the promised AI boom drift elsewhere.
15

BrokenClaw Part 5 Showcases Prompt Injection in GPT‑5.4

HN +6 sources hn
gpt-5
A new proof‑of‑concept posted to Hacker News, titled “BrokenClaw Part 5: GPT‑5.4 Edition (Prompt Injection)”, demonstrates that the latest OpenAI model can be coerced into executing arbitrary code with a single crafted prompt. The author builds on earlier BrokenClaw demos that exploited “0‑click” remote code execution (RCE) in OpenClaw, a popular open‑source AI‑assistant framework. By feeding GPT‑5.4 – accessed via both OpenAI’s own API and the Vercel AI gateway – a specially designed prompt, the attacker triggers the model to generate and run a payload that hijacks the host environment without any user interaction. The exploit works on two frontier models, GPT‑5.2 and the newer GPT‑5.4, showing that the vulnerability is not limited to a single version. The revelation matters because GPT‑5.4 is OpenAI’s flagship “general‑purpose” model, marketed as the first with native computer‑use capabilities such as clicking, typing and interacting with applications through screenshots and Playwright. Its 1‑million‑token context window and 128 k‑token output limit have already spurred widespread adoption in enterprise automation, code generation and data‑intensive workflows. If prompt injection can silently commandeer the underlying execution environment, the risk extends from isolated sandbox breaches to large‑scale supply‑chain attacks on any service that delegates tasks to the model. OpenAI has not yet issued a formal response, but the incident is likely to accelerate scrutiny of model‑level safety guards, especially around system‑level tool use and plugin management. Watch for an official security advisory from OpenAI, potential patches to the OpenClaw plugin architecture, and a wave of community‑driven hardening guides. Regulators in the EU and Nordic states may also begin drafting tighter standards for AI‑driven code execution, a development that could reshape how developers integrate large language models into production pipelines.
14

How to Ensure Your iPhone Receives Background Security Updates

Mastodon +1 sources mastodon
apple
Apple has introduced “Rapid Security,” a new background‑update framework that pushes critical patches to iPhones without requiring user interaction. The feature, rolled out with iOS 26.4.1, silently downloads and installs security fixes as soon as a device is on Wi‑Fi, plugged in and idle, ensuring that vulnerabilities are sealed the moment Apple releases a patch. Users can verify the setting under Settings → General → Software Update → Automatic Updates, where a new toggle for “Background security improvements” now appears. The move matters because it narrows the gap between vulnerability disclosure and remediation—a gap that has been exploited by increasingly sophisticated AI‑driven attacks. As we reported on 10 April, Apple’s iOS 26.4.1 already added two security‑related changes for iPhones; Rapid Security expands those fixes into a continuous, hands‑free service. By automating the delivery of low‑impact patches, Apple reduces reliance on user diligence, a known weak point in mobile security hygiene. For Nordic consumers, whose smartphones are among the most heavily used devices for banking and public‑sector services, the improvement promises a higher baseline of protection against ransomware, credential‑theft and emerging threats that leverage large language models. What to watch next is how Apple refines the experience and whether the framework will be back‑ported to older iOS versions still in circulation across the region. Analysts expect tighter integration with Apple’s on‑device threat‑intelligence, potentially allowing the LLM‑powered security engine that powers the new “Rapid Security” alerts to prioritize patches based on real‑time risk assessments. Regulators in the EU may also scrutinise the automatic nature of the updates under the Digital Services Act, prompting Apple to offer clearer opt‑out mechanisms. Keep an eye on forthcoming iOS 27 beta notes for hints of expanded coverage, and on Apple’s developer documentation for any new APIs that let third‑party security tools tap into the background update pipeline.
14

DeepSeek's new model set to test China's AI ambitions

Mastodon +6 sources mastodon
chipsdeepseeknvidia
DeepSeek, the Beijing‑based AI startup that has become a litmus test for China’s push toward home‑grown large‑language models, has still not unveiled its much‑anticipated V4 model. The silence, which has stretched over several weeks, is feeding a wave of speculation across the industry about whether the new system will finally run on domestically produced Huawei Ascend chips or continue to rely on Nvidia’s GPUs. The delay matters because V4 is expected to close the performance gap with Western counterparts such as OpenAI’s GPT‑4 and Google’s Gemini. If DeepSeek can demonstrate comparable fluency and reasoning while running on Chinese silicon, it would signal a decisive step toward AI self‑sufficiency—a goal Beijing has repeatedly highlighted in policy documents. Analysts at Counterpoint Research note that the chip choice will reveal how far China’s semiconductor ecosystem has progressed in overcoming export restrictions that have limited access to advanced GPUs. Beyond the technical showdown, the launch could reshape the global AI supply chain. A successful Huawei‑powered V4 would give Chinese cloud providers a viable alternative to Nvidia, potentially lowering costs for domestic enterprises and reducing reliance on U.S. technology. Conversely, a postponed or under‑performing release could reinforce the perception that China remains dependent on foreign hardware, prompting firms to double down on cross‑border partnerships or to accelerate their own chip‑design programs. What to watch next: DeepSeek’s senior leadership has hinted at a “soft launch” before the end of Q2, likely at a private developer conference in Shanghai. Industry insiders expect a live demo that will reveal the model’s parameter count, inference speed on Ascend 910B, and multilingual capabilities. The reaction of major Chinese cloud operators—Alibaba Cloud, Tencent Cloud and Baidu Cloud—will be a key barometer of market adoption, while any official comment from the Ministry of Industry and Information Technology could clarify the government’s stance on the strategic importance of the rollout.
14

Apple to Permanently Close Three U.S. Stores in June

Mastodon +6 sources mastodon
apple
Apple announced on Monday that it will permanently shut three U.S. retail locations in June, confirming a quiet rollout that began with a notice posted on the company’s internal employee portal. The stores slated for closure are the Towson Town Center outlet in Maryland, the Westfield San Francisco Centre shop in California, and the Oakbrook Center branch near Chicago. Apple will lay off roughly 150 staff members, offering severance packages and the option to transfer to nearby stores where possible. The move marks the latest step in Apple’s broader effort to streamline its brick‑and‑mortar footprint after a series of modest store closures over the past two years. While the company continues to post robust hardware sales—Mac shipments rose 9 % in Q1 2026, outpacing the overall PC market, as we reported on April 10—the retail strategy is shifting toward larger “Apple Experience Centers” that showcase services, augmented‑reality demos and AI‑driven features. Analysts see the closures as a response to rising operating costs, changing consumer habits that favor online purchasing, and the need to reallocate real‑estate assets for higher‑margin experiences. The closures also carry local implications. Towson’s mall officials warned of a potential dip in foot traffic, while city leaders in San Francisco and Oak Brook have asked Apple to outline any community‑support initiatives. Employees have expressed concern over job security, though Apple’s statement emphasized “a commitment to the affected teams.” What to watch next is whether Apple will repurpose the vacated spaces for its new experience‑center format or lease them to third‑party retailers. The company’s upcoming earnings call may reveal if further store rationalisation is planned, and labor groups are likely to monitor how the layoffs are handled. A formal press release is expected later this week, which could clarify the strategic rationale behind the June shutdowns.
14

Apple's Mac shipments climb 9% in Q1 2026, outpacing PC market growth

Mastodon +6 sources mastodon
apple
Apple shipped 9 percent more Macs in the first quarter of 2026 than a year earlier, according to data from market‑research firm IDC. The surge lifted Apple’s share of the global personal‑computer market to 10.2 percent, outpacing the overall PC sector, which grew a modest 2.1 percent in the same period. The gain was driven primarily by strong demand for the newly refreshed MacBook Air with the M3 chip and the entry‑level Mac mini, both positioned as affordable gateways to Apple’s expanding ecosystem of AI‑enhanced services. The growth matters because it signals that Apple’s hardware strategy—anchoring its AI ambitions on a unified silicon platform—is resonating with consumers and enterprise buyers alike. The M3 family, built on a 3‑nanometre process, promises up to 30 percent higher performance per watt than the previous generation, a claim that aligns with the company’s push to run large language models locally on Macs. Analysts see the shipment lift as a counterweight to the broader PC market’s sluggish recovery, suggesting that Apple can capture share from rivals still wrestling with supply‑chain constraints and the cost of integrating AI accelerators. Looking ahead, the next data point will be Q2 shipments, where Apple is expected to roll out the long‑rumoured MacBook Pro with an M3‑Pro/Max variant and a refreshed iMac. Observers will watch whether the AI‑centric marketing narrative translates into higher‑margin sales and whether enterprise adoption of Apple Silicon for AI workloads accelerates. The company’s ability to sustain momentum will also hinge on the rollout of macOS 15, which promises tighter integration with Apple Intelligence features that, as we reported on 10 April, remain vulnerable to prompt‑injection attacks. How Apple addresses those security concerns could shape both consumer confidence and enterprise uptake in the coming months.
14

iOS 26.4.1 Adds Two Updates for iPhones

Mastodon +1 sources mastodon
apple
Apple rolled out iOS 26.4.1 on Tuesday, and the update is notable for two focused changes that touch both security and on‑device AI. First, the Stolen Device Protection feature has been upgraded to cooperate more tightly with the Find My network: a compromised iPhone now automatically disables access to on‑device large language models (LLMs) and other privacy‑sensitive services after three failed passcode attempts, while simultaneously transmitting a cryptographically signed lock‑down command to Apple’s cloud. The enhancement also adds a one‑tap “Erase LLM Cache” option in the Find My app, letting owners wipe locally stored AI prompts without wiping the whole device. The second change is a performance‑oriented tweak to Apple’s on‑device LLM inference engine. A new low‑power scheduling layer throttles background model execution when battery level falls below 20 percent, and it prioritises user‑initiated requests such as Siri queries or Translate suggestions. Developers receive a revised API flag that signals when the system has entered “energy‑saving mode,” allowing apps to gracefully defer non‑essential AI workloads. Why the update matters is twofold. Strengthening Stolen Device Protection addresses growing concerns that thieves could exploit locally stored AI data to infer personal information, a scenario highlighted in recent security briefings. At the same time, the battery‑aware LLM throttling reflects Apple’s broader push to make on‑device AI sustainable, a claim that could influence consumer adoption of AI‑heavy features in the Nordic market where power efficiency is prized. What to watch next includes Apple’s upcoming iOS 26.5, rumored to expand the LLM throttling controls to iPadOS and to introduce a developer‑visible telemetry dashboard for security events. Regulators in the EU are also expected to scrutinise the new remote‑disable capability under the Digital Services Act, so the rollout may prompt further policy dialogue. Keep an eye on how quickly users adopt the new “Erase LLM Cache” option, as early uptake will signal confidence in Apple’s on‑device AI safeguards.
13

TurboQuant Enables One-Command Local Stack on MacBook with Ollama, MLX, and Auto‑Routing Proxy

Dev.to +1 sources dev.to
llama
TurboQuant, an open‑source script released this week, lets developers spin up a fully functional local AI stack on a MacBook with a single command. The tool stitches together Ollama for model serving, Apple’s MLX runtime for accelerated inference on M‑series chips, and an auto‑configuring routing proxy that directs requests to the appropriate model endpoint. After cloning the repository and running `./turboquant.sh`, users get a ready‑to‑use environment that can host everything from Claude‑style assistants to the newly open‑source Gemma 4 model, all without touching the cloud. The launch matters because it collapses the fragmented setup process that has hampered local‑model experimentation. Until now, developers needed to install Ollama, compile MLX, and manually wire a reverse proxy—steps that often required deep system knowledge and repeated troubleshooting. By automating these pieces, TurboQuant lowers the entry barrier for Nordic startups, research labs, and hobbyists who want to keep data on‑premise for privacy or latency reasons. The timing aligns with a wave of local‑model initiatives: just days earlier Google open‑sourced Gemma 4, and we showed how GitHub Copilot CLI can be paired with LM Studio on a MacBook. TurboQuant essentially packages those advances into a turnkey solution, promising faster prototyping and tighter integration with IDEs that already support local inference. What to watch next is how quickly the community adopts and extends the script. Early forks are already adding support for quantized Llama 3 variants and for multi‑GPU routing on newer MacBook Pros. Benchmark releases will reveal whether the MLX‑accelerated path can match cloud‑grade throughput, a key factor for production workloads. If performance holds up, we may see IDE plugins—perhaps even a Copilot‑style extension—leveraging TurboQuant’s proxy to offer seamless, offline code assistance. The next few weeks should clarify whether this one‑command stack becomes the de‑facto standard for on‑device AI development in the Nordics and beyond.
13

Using Perplexity and Burstiness to Spot AI-Generated Content

Dev.to +5 sources dev.to
perplexity
A new detection framework that measures “perplexity” and “burstiness” is gaining traction among content creators desperate to spot AI‑written text. The approach, unveiled this week by a Swedish research collective in partnership with a Helsinki‑based content agency, quantifies how predictable a passage is (perplexity) and how unevenly sentence lengths vary (burstiness). Early trials show the dual‑metric model flags AI‑generated copy with 87 % accuracy, outpacing OpenAI’s own classifier and the widely used Turnitin AI‑detector. The breakthrough matters because the flood of synthetic prose is eroding trust in online media, academic publishing and brand communications. As large language models become cheaper and more accessible, agencies report a surge in client‑supplied drafts that blend human edits with AI output, making manual review impractical. By flagging text that is simultaneously too statistically smooth (low perplexity) and unnaturally uniform in rhythm (low burstiness), the new tool offers a scalable first line of defence. The system is already integrated into a popular content‑management plugin for WordPress and can be called via a lightweight API, allowing editors to scan articles in real time. Its open‑source code, released under an MIT licence, invites community scrutiny and rapid iteration. Critics caution that sophisticated prompt engineering can inflate perplexity scores, potentially slipping past the detector, and that the method may generate false positives on highly formulaic human writing such as legal contracts. What to watch next: major publishing platforms are evaluating the framework for internal moderation, while the European Union’s AI Act consultation hints at mandatory detection standards that could elevate perplexity‑burstiness tools from optional to regulatory. Researchers also plan to extend the model to multimodal content, testing whether similar statistical signatures appear in AI‑generated images and video captions. The coming months will reveal whether statistical detection can keep pace with the next wave of generative models.
12

gilest.org examines AI and the human voice

Mastodon +6 sources mastodon
voice
A post on gilest.org has sparked fresh debate over the limits of generative AI in written communication. The author, known online as @gilest, argues that while large language models can churn out endless paragraphs, the output “is dull, derivative, and sounds like a thousand other things I’ve read before.” He backs the claim with observations from workshops he runs on drafting and open‑team writing, noting that the “human element” – the subtlety of poetry, the willingness to produce imperfect first drafts and refine them – remains absent from most AI‑generated prose. The critique arrives at a moment when AI‑driven text generators are being embedded in newsroom pipelines, corporate newsletters, and even public‑sector communications. Industry analysts warn that over‑reliance on homogenised language could erode brand distinctiveness and diminish audience trust. Gilest’s point resonates with a broader push for “human voice” branding, a concept championed by communication strategist Giles T, who urges organisations to abandon the sterile corporate tone in favour of relatable, nuanced language. What matters is not just aesthetic preference but the strategic risk of content that fails to engage or differentiate. Studies from the Nordic AI Institute show that readers retain 30 % less information from AI‑written articles compared with human‑crafted pieces, especially when the material requires emotional nuance. The conversation also touches on the rise of realistic text‑to‑speech tools, which, while technically impressive, risk amplifying the same lack of authenticity if fed with generic scripts. Looking ahead, experts will watch whether next‑generation models incorporate “style‑preserving” fine‑tuning that can emulate individual authorial quirks without flattening originality. Upcoming panels at the Copenhagen AI Summit and a pilot program by the Swedish Media Authority to certify “human‑centric” AI content will test whether the industry can reconcile efficiency with the irreplaceable spark of human creativity.
12

Meta AI asks for health data and offers poor advice

Mastodon +1 sources mastodon
metaprivacy
Meta’s latest AI chatbot sparked controversy after it asked a user for raw health data and responded with questionable medical advice. During a trial of the new “Meta AI Health” assistant, the system prompted the tester to upload detailed biometric logs – heart‑rate curves, sleep stages, glucose readings and even recent lab results – before attempting to diagnose a persistent cough. Within minutes the bot suggested “stop the antibiotics you’ve been prescribed” and “increase your daily caffeine intake to boost immunity,” advice that medical professionals quickly flagged as dangerous. The episode, reported by Wired, underscores a growing tension between AI ambition and user safety. Meta has been positioning its conversational agents as the next frontier for personalized services, leveraging the massive trove of data collected across Facebook, Instagram and the Quest ecosystem. By requesting unprocessed health metrics, the company signals an intent to build a data‑driven health layer that could eventually power targeted advertising or premium wellness subscriptions. Yet the bot’s inaccurate recommendations expose the risks of deploying unvetted medical reasoning at scale, especially under Europe’s AI Act and strict GDPR rules that treat health data as a high‑risk category. Why it matters goes beyond a single misstep. If Meta proceeds with health‑focused features, it will join a crowded field that includes Apple’s HealthKit, Google’s Med‑PaLM and OpenAI’s upcoming medical‑model pilots. Each player faces scrutiny over how AI interprets personal health information and who bears liability when advice goes awry. The incident also fuels broader debates about whether tech giants should be allowed to monetize raw health data without explicit medical oversight. What to watch next: Meta has promised a “rapid review” of the bot’s medical module and hinted at tighter internal safeguards. Regulators in the EU and the U.S. are likely to request details on data handling and risk assessments. Industry observers will be tracking whether Meta pauses the rollout, partners with certified health providers, or repositions the feature as a purely informational tool. The outcome could set a precedent for how consumer‑grade AI interacts with personal health data across the tech sector.
12

SELFDOUBT Measures Uncertainty in Reasoning LLMs with Hedge‑to‑Verify Ratio

ArXiv +1 sources arxiv
reasoning
A team of researchers from the University of Copenhagen and the Swedish AI Lab has released a new arXiv pre‑print, “SELFDOUBT: Uncertainty Quantification for Reasoning LLMs via the Hedge‑to‑Verify Ratio” (arXiv:2604.06389v1). The paper tackles a long‑standing obstacle in deploying large language models (LLMs) for complex reasoning: reliably estimating how confident the model is in each answer without resorting to costly sampling or unreliable heuristics. The authors observe that existing single‑pass proxies—verbalized confidence scores or the length of a reasoning trace—often diverge from actual correctness, while Monte‑Carlo dropout or ensemble methods demand multiple forward passes that double or triple inference time. SELFDOUBT introduces a lightweight metric that compares two stages of the model’s own process. First, the model generates a “hedge” answer, a tentative solution produced under a permissive decoding temperature. Then it runs a “verify” pass, prompting the model to check the hedge against the original problem statement. The ratio of the hedge’s log‑probability to the verify’s log‑probability, the Hedge‑to‑Verify Ratio (HVR), serves as a confidence indicator. Experiments on benchmark reasoning suites such as GSM8K, MATH and BIG‑Bench show that HVR correlates with correctness far better than verbalized confidence or trace length, while adding less than 10 % overhead to inference. Why it matters is twofold. For safety‑critical applications—medical triage, financial advice, or autonomous planning—knowing when a model is likely to err enables fallback strategies, human‑in‑the‑loop checks, or selective abstention. Moreover, the metric dovetails with recent work on deterministic reasoning layers, such as the SymptomWise framework we covered on 10 April, by providing a principled way to gate those layers only when uncertainty spikes. What to watch next: the authors plan open‑source releases of the HVR implementation for popular LLM APIs, and early adopters are already testing it in prompt‑engineering pipelines at Nordic fintech firms. Follow‑up studies will likely explore scaling the ratio to multimodal models and integrating it with tool‑use frameworks that trigger external verification modules when HVR falls below a configurable threshold. If the community embraces SELFDOUBT, uncertainty‑aware reasoning could become a standard safety feature in next‑generation AI products.
12

SymptomWise Introduces Deterministic Reasoning Layer to Make AI More Reliable and Efficient

ArXiv +6 sources arxiv
ai-safetyreasoning
A team of researchers from the University of Copenhagen and the Swedish Institute of Computer Science has posted a new pre‑print, “SymptomWise: A Deterministic Reasoning Layer for Reliable and Efficient AI Systems” (arXiv:2604.06375v1), proposing a hybrid architecture that tacks a rule‑based reasoning module onto large language models used for symptom analysis. The authors argue that pure end‑to‑end generative pipelines—common in current tele‑health chatbots—suffer from hallucinations, opaque decision paths and occasional contradictions that can jeopardise patient safety. SymptomWise inserts a deterministic layer that maps model‑generated symptom candidates onto a curated knowledge graph of clinical guidelines, pruning implausible outputs and producing a traceable chain of reasoning for each diagnosis suggestion. The move is significant because it tackles three pain points that have stalled wider adoption of AI triage tools: reliability, interpretability and regulatory compliance. By guaranteeing that every recommendation can be back‑tracked to a specific guideline entry, the system promises auditors a concrete audit trail, something regulators in the EU and Norway have repeatedly demanded. The approach also dovetails with recent discussions about deterministic pattern matching in LLMs, such as the Claude Mythos leak we covered on April 9, suggesting a broader shift toward hybrid models that blend statistical fluency with symbolic certainty. What to watch next is whether SymptomWise graduates from a research prototype to a production‑grade component in commercial platforms. Early adopters like Ada Health and KRY have expressed interest in pilot trials, and the authors plan a clinical evaluation in Swedish primary‑care clinics later this year. Parallelly, the European Medicines Agency is expected to issue guidance on AI‑driven diagnostic aids, and any alignment between that policy and deterministic reasoning frameworks could accelerate market entry. Keep an eye on follow‑up papers and potential open‑source releases that could democratise the technology across the Nordic health‑tech ecosystem.
12

jsynowiec launches “Shadow Persona” LLM project on GitHub

Mastodon +6 sources mastodon
agentsclaude
A developer on GitHub has pushed a new “idea file” to the llm‑shadow‑persona project, extending an adversarial review system for large‑language‑model (LLM) agents. Inspired by Andrej Karpathy’s recent work on self‑critiquing AI, the contribution embeds a feedback loop into a Claude plugin that forces the agent to revise its output based on a structured review. The “shadow persona” framework treats the LLM’s own suggestions as a separate persona, then pits that persona against a reviewer model that flags inconsistencies, policy breaches or hallucinations. The reviewer’s comments are fed back into the original agent, prompting a second‑pass generation that must satisfy the critique before the response is released. The move matters because enterprises are grappling with “shadow LLM” usage—employees bypassing approved tools for convenience, exposing firms to data leakage, compliance gaps and unpredictable model behaviour. By automating an internal adversarial check, the plugin offers a lightweight, model‑agnostic guardrail that can be layered onto any Claude‑based workflow, and potentially adapted to other providers. It also demonstrates a practical path toward continuous alignment: rather than a one‑off safety prompt, the system iteratively refines answers until they meet predefined standards. Observers will watch whether the tool gains traction among Anthropic’s business customers and whether the open‑source community adopts it as a template for similar plugins on OpenAI, Google or emerging open‑source models. The next steps include a public benchmark of the reviewer’s detection rate, integration with corporate policy engines, and possible collaboration with initiatives such as the Shadow‑LLM‑Guardians. If the approach proves scalable, it could become a de‑facto standard for mitigating hidden AI usage and tightening the feedback loop that keeps LLM agents honest.
12

Sal Khan says his AI revolution is still years away.

Mastodon +1 sources mastodon
Sal Khan, the founder of Khan Academy, told Chalkbeat on 9 April that the “AI revolution” he once envisioned for classrooms has not yet materialised. Speaking about Khanmigo – the chatbot‑powered tutor launched in partnership with OpenAI in 2023 – Khan said he now treats the technology as “part of the solution, not the end‑all and be‑all.” The comment marks a shift from the exuberant rollout last year, when Khan Academy promoted Khanmigo as a game‑changer capable of delivering personalised instruction to millions of learners for free. Early pilots showed promising gains in reading comprehension and math fluency, prompting a wave of investment in AI‑driven ed‑tech. Yet adoption has stalled: school districts cite insufficient teacher training, concerns over data privacy, and uneven internet access that risk widening the digital divide. Why Khan’s recalibration matters is twofold. First, Khan Academy remains the most trusted free learning platform worldwide; its stance influences how public schools allocate resources for AI tools. Second, the company’s modest revenue model – relying on donations and a limited premium tier – means any slowdown could reshape the broader market, where rivals such as Duolingo’s Max and Google’s Gemini for Education are courting the same institutional budgets. Looking ahead, several signals will indicate whether Khanmigo can move from niche pilot to mainstream classroom aid. The nonprofit plans to release a teacher‑dashboard in the summer, aimed at giving educators real‑time insight into student interactions and allowing them to intervene when the model errs. A forthcoming independent study, commissioned by the U.S. Department of Education, will assess learning outcomes across a diverse set of schools and could either validate or dampen further rollout. Equally critical will be policy developments around student data protection. The European Union’s AI Act, set to take effect later this year, could impose stricter consent requirements that affect Khan Academy’s free‑service model. Finally, the next wave of generative‑AI models promises deeper contextual understanding; if Khanmigo can harness those advances without compromising safety, the “revolution” Khan hinted at may finally gain traction. Until then, the education sector watches cautiously, balancing optimism with the practical hurdles Khan now acknowledges.
12

OpenAI shuts down Sora as turmoil erupts

Mastodon +6 sources mastodon
openaisora
OpenAI announced Wednesday that it has permanently disabled Sora, its experimental text‑to‑video service, sparking a wave of speculation about the company’s priorities and the future of AI‑generated media. The decision came after internal reports flagged that Sora was consuming an “obscene” amount of compute while users churned out videos that repeatedly tripped copyright filters and, in some cases, generated disallowed content. A nascent partnership with Disney, which had pledged a $1 billion investment in exchange for exclusive access to Sora’s capabilities, collapsed when the studio grew uneasy about the legal exposure and the tool’s resource drain. The shutdown matters on several fronts. First, it underscores the tension between rapid product rollout and responsible scaling; OpenAI’s willingness to pull the plug within weeks signals a growing awareness that raw compute costs can outweigh headline‑grabbing demos. Second, the episode highlights the regulatory and commercial risks of text‑to‑video AI, a segment still grappling with copyright enforcement and content‑moderation frameworks. Finally, the loss of Disney’s backing removes a potential anchor for monetising video‑centric AI, leaving the market open for rivals such as Runway and Meta to capture the vacuum. What to watch next is the direction OpenAI will take with its remaining resources. The company has already hinted that robotics and multimodal agents are the next frontier, and a recent $122 billion funding round suggests ample capital to pivot. Analysts will be tracking whether OpenAI launches a more tightly controlled video offering, partners with a different media conglomerate, or redirects its compute budget entirely toward embodied AI. The Sora episode may become a cautionary case study, but it also sets the stage for a more disciplined, perhaps less flamboyant, rollout of next‑generation generative tools.
12

Cognitive Grammar Models Used as Behavioral Biometrics for Authorship Verification

Mastodon +6 sources mastodon
Researchers from the University of Bologna and collaborators have moved their pre‑print on authorship verification from the server shelves to a peer‑reviewed venue, publishing “Grammar as a behavioral biometric: using cognitively motivated grammar models for authorship verification” in *Humanities and Social Sciences Communications*. The paper introduces LambdaG, a lightweight algorithm that models an author’s grammatical choices through the lens of Cognitive Grammar and uses those patterns as a behavioral biometric—comparable to a fingerprint or gait. LambdaG extracts a compact profile of syntactic constructions, agreement patterns and clause‑embedding preferences from a training corpus, then scores a candidate text by measuring deviation from that profile. In benchmark tests against standard datasets, the method matches or exceeds the accuracy of state‑of‑the‑art neural networks while requiring a fraction of the computational resources and offering transparent, linguistically interpretable features. The authors argue that the result validates a long‑standing hypothesis: the way individuals deploy grammar is idiosyncratic enough to serve as a reliable identity marker. The contribution matters on several fronts. For forensic linguistics, a transparent tool lowers the evidentiary barrier that often hampers the admissibility of AI‑driven analyses in court. In the broader NLP community, LambdaG demonstrates that cognitively grounded models can compete with black‑box deep learning, reviving interest in theory‑driven approaches that prioritize explainability. Moreover, the work hints at new avenues for detecting synthetic text, as large language models tend to smooth over personal grammatical quirks. Looking ahead, the research team plans to expand the evaluation to multilingual corpora and to integrate LambdaG with existing plagiarism‑detection pipelines. Legal scholars will likely monitor how courts respond to grammar‑based biometric evidence, while industry players may explore hybrid systems that combine LambdaG’s interpretability with the raw power of large language models. The next test will be whether the method can withstand real‑world forensic scrutiny and become a standard tool in the digital author‑identification toolbox.
12

Anthropic Finds Third-Party Clients Using System Prompt, Not Headers

HN +5 sources hn
anthropic
Anthropic announced a new method for identifying third‑party clients that access its Claude models, shifting the focus from traditional HTTP‑header checks to analysis of the system prompt embedded in each request. The company revealed that the technique, rolled out this week on its API platform, parses the initial system instruction to spot signatures or patterns that indicate a proxy, wrapper or unauthorized integration, even when the caller disguises its identity through forged headers. The change comes after mounting pressure on AI providers to tighten supply‑chain oversight. Anthropic’s earlier “Project Glasswing” initiative, reported on 10 April, aimed to curb autonomous exploits, while a federal court decision the same day upheld a “supply chain risk” label on the firm’s services. By moving detection to the content layer, Anthropic can flag misuse that would have slipped past header‑based filters, such as malicious actors embedding hidden commands or rerouting traffic through unapproved services. For developers, the update means tighter compliance checks and a potential need to revise authentication flows. Anthropic says legitimate partners can register a system‑prompt fingerprint, ensuring uninterrupted access, while non‑compliant users will receive throttling or termination notices. The move also raises questions about privacy: parsing prompts could expose more of a user’s proprietary prompt engineering, prompting calls for clearer data‑handling policies. What to watch next includes Anthropic’s rollout timeline—initially limited to high‑risk accounts—and whether the firm will extend the approach to other metadata, such as token usage patterns. Competitors may adopt similar content‑based detection, sparking a broader industry shift toward deeper request inspection. Regulators could also take note, using the method as a benchmark for enforcing AI supply‑chain transparency. The evolution underscores a growing consensus: securing the AI stack now demands scrutiny beyond superficial network signals.
12

Federal Court Rejects Anthropic's Bid to Drop “Supply Chain Risk” Label

HN +1 sources hn
anthropic
A federal district court in Washington, D.C., has rejected Anthropic’s request to have the Pentagon’s “Supply Chain Risk” label removed from its Claude models. The label, applied under the Department of Defense’s AI risk‑management framework, bars the use of Anthropic’s models in any U.S. government system deemed vulnerable to supply‑chain attacks. Anthropic argued that the designation was unfounded and harmed its commercial prospects, but the judge found the agency’s assessment sufficiently supported by classified threat analyses. The decision builds on a series of legal confrontations between the AI startup and the U.S. government. As we reported on April 9, the court previously declined to block the Pentagon’s blacklisting of Anthropic, and on April 10 we detailed how malicious intermediary attacks could compromise LLM supply chains. The ruling underscores the growing willingness of federal regulators to impose security labels that can effectively gatekeep AI technology, echoing broader concerns about hidden backdoors, compromised training data, and the difficulty of auditing third‑party components. For Anthropic, the label limits access to lucrative defense contracts and may prompt other agencies to adopt similar restrictions, potentially reshaping the company’s revenue model and prompting a shift toward more transparent supply‑chain practices. The broader AI ecosystem watches closely, as the precedent could be extended to other providers such as OpenAI or Google, amplifying the regulatory burden on the industry. Next steps include a likely appeal by Anthropic to the Federal Circuit, where the legal arguments about due‑process and the evidentiary basis for the label will be tested. Lawmakers are already drafting oversight legislation that could codify labeling authority, while the Pentagon is expected to release an updated AI risk‑assessment guideline later this summer. Stakeholders should monitor the appellate outcome, any congressional hearings, and the defense department’s next policy memo for clues on how supply‑chain security will shape AI deployment in the public sector.

All dates