OpenAI shows why coding agents need real Windows sandboxes

OpenAI's Windows sandbox work shows that coding agents are now a host-machine security problem, not just a model problem.

The important part of OpenAI's latest Codex update is not that Windows was difficult. Everyone building developer tools already knows that. The important part is that an AI coding agent which can read files, run local commands and edit a real workspace cannot be treated like another chat window.

In a May 13 technical breakdown, OpenAI said Codex for Windows previously lacked a proper native sandbox, leaving users with two poor choices: approve almost every command, including reads, or switch into full access and let the agent run without meaningful restrictions. That is not a small product inconvenience. It is the difference between an assistant that feels useful and one that asks a security team to trust a cloud-directed process on a developer laptop.

Codex's default operating model is deliberately practical. It can read broadly across the machine, write inside the current workspace and run local commands, while network access is restricted unless the user allows it. That setup only works if the operating system enforces the boundary. A prompt cannot do that. A policy screen cannot do that. The host machine has to say no when a child process tries to write where it should not, or reach the internet when it has not been granted that right.

OpenAI's point was direct: macOS and Linux already had tools that were closer to the problem. Codex can lean on Seatbelt on macOS, while Linux has mechanisms such as seccomp and bubblewrap for controlling process behavior, filesystem access and networking. Windows, by contrast, did not offer one clean primitive that mapped to a safe autonomous coding agent.

The company looked at the obvious Windows options and found each one wrong in a different way. AppContainer offered a real isolation model, but it was built for applications that know their permissions in advance. Codex is not that kind of app. It drives shells, Git, Python, package managers, build tools and whatever else a developer workflow requires.

Windows Sandbox had the opposite problem. It gave a stronger disposable environment, but that environment was too separate from the user's actual checkout, tools and configuration. A coding agent needs to work on the real project in front of the developer, not inside a throwaway desktop that has to be bridged back to the host. It also is not available on Windows Home, which matters for any tool trying to reach a broad developer market.

Mandatory Integrity Control looked cleaner at first. OpenAI could have run Codex at low integrity and relabeled writable roots so Windows would block writes elsewhere. But that would have changed the trust model of the user's actual files, making the workspace writable by low-integrity processes in general. For a real developer machine, that is a heavy price to pay for convenience.

The final design is infrastructure, not polish

OpenAI's current Windows design uses restricted tokens, synthetic security identifiers, two local users called CodexSandboxOffline and CodexSandboxOnline, firewall rules, an elevated setup binary and a separate command runner called codex-command-runner.exe. That sounds like plumbing because it is. But this is exactly where the market is moving.

The restricted token approach lets Windows perform an extra check before writes succeed. The synthetic SID marks the places where Codex is allowed to write, including the current working directory and any additional writable roots. Sensitive paths such as .git and .codex can remain protected even when they sit inside a writable workspace. That matters because breaking a repository is not the only risk. Quietly altering metadata, credentials or tool configuration can be worse.

The network side forced a bigger redesign. OpenAI found that Windows firewall rules could not neatly target a restricted token's synthetic identity across the whole process tree. Blocking codex.exe alone would not stop Git, Python or another child process from reaching out. The solution was to run commands under separate local principals, with the offline sandbox user targeted by firewall rules and the online one used when access is allowed.

That setup required elevation at configuration time. Codex creates the sandbox users, stores their credentials locally using Windows data protection, applies firewall rules and adds read permissions where needed so the sandbox can still inspect the places a developer expects it to inspect. The main Codex process remains unelevated, while codex-windows-sandbox-setup.exe handles privileged setup and codex-command-runner.exe launches the restricted child process.

For enterprises, this is the real story. Coding agents are not just competing on model scores or editor integrations anymore. They are competing on whether security teams can understand and control what happens when an agent runs tests, installs packages, edits files and follows instructions across a messy local environment.

That also changes the startup playbook. A thin wrapper around a frontier model may be enough for demos, but it is not enough for local agents that touch source code, secrets and build systems. Teams building in this space now need operating-system knowledge, policy design, audit logs and a realistic answer for Windows fleets. The next wave of agent infrastructure will be judged less by how magical it looks in a screen recording and more by how calmly it behaves on the machines where companies actually work.

Also read: Americans now prefer nearby nuclear plants to AI data centers • Overworked AI agents are giving founders a new alignment warning • Cerebras IPO puts a public price on the AI infrastructure boom