Alexander Embiricos (@embirico) on X
agents openai
| Source: Mastodon | Original article
OpenAI’s Codex has received a major upgrade that gives the model a far more sophisticated “computer‑use” ability, according to a tweet from Alexander Embiricos, the product lead behind the service. Embiricos, who oversees a Codex product line that now processes trillions of tokens weekly, said the new feature ranks at the top of every test he’s run on large language models (LLMs) and desktop‑agent frameworks. The enhancement lets Codex not only generate code but also interact directly with a user’s operating system—moving the mouse, typing, opening applications and manipulating files—without any additional scripting layer.
The development matters because it pushes AI agents from passive code suggestion into active execution. Developers could hand a single prompt to Codex and watch it assemble a development environment, run builds, debug failures, or even automate routine office tasks. For enterprises, the capability promises to shrink the time needed to integrate new software, lower the barrier for non‑technical staff to automate workflows, and accelerate the broader push toward “agentic” AI that can act on behalf of users across the desktop. At the same time, the power to control a computer raises safety and security questions; OpenAI will need robust sandboxing, permission controls and audit trails to prevent unintended actions or malicious exploitation.
What to watch next is the rollout plan. OpenAI is expected to publish detailed documentation and benchmark results in the coming days, and to open the feature to a limited set of Codex API customers. Integration with GitHub Copilot and other developer tools could follow, turning the upgrade into a mainstream productivity boost. Industry observers will also be tracking how competitors such as Anthropic and Google respond—whether they will accelerate their own agent‑type offerings or introduce safeguards that shape the next wave of autonomous AI. The coming weeks will reveal whether Codex’s new computer‑use skill becomes a catalyst for widespread desktop automation or a niche capability confined to early adopters.
Sources
Back to AIPULSEN