AI agents are starting to touch real money, and the finance controls around them are not ready for it.
The warning signs are already showing up in founder circles and finance forums. Teams are experimenting with AI agents that can buy software, top up ad budgets, order services, and queue payments, which is exactly where a useful automation tool starts to look like a new kind of spending risk.
Some of the public anecdotes are thin. Reddit threads and startup posts do not always make clear which agent was used, which card platform was involved, what purchases were attempted, or whether any approval actually went through. That matters. There is a big difference between a real loss event, a messy test, and a cautionary story dressed up for engagement.
Still, the premise lands because it points at a problem startups are about to face at scale. AI agents are moving beyond drafting emails, summarizing meetings, and searching the web. They are being asked to handle procurement, vendor research, cloud setup, bookkeeping, expense cleanup and other workflows that once ended with a human clicking approve. Once an agent can spend, the question is no longer whether it is useful. The question is who controls it.
Founders already understand permissioning in software. A junior employee should not have admin rights to production. A contractor should not have full access to the customer database. A sales tool should not be able to export every account without a log. Yet when money enters the workflow, many teams still fall back on the oldest shortcut in the book: give the tool a card number and hope the monthly statement tells the story.
That approach was risky enough with humans. It is worse with software. A human employee at least understands context, embarrassment, policy and the possibility of being fired. An agent follows instructions, interprets patterns and can repeat a bad action very quickly if the surrounding system gives it room. A prompt error, a badly scoped workflow or a vendor page it misunderstands can become a finance problem before anyone notices.
This is why the current debate is useful, even when individual stories are hard to verify. The exact incident is less important than the architecture it exposes. If an agent has the same card credentials as a founder, the company has no clean way to separate legitimate automation from runaway activity. If it uses a shared corporate card, there is no reliable per-agent audit trail. If it can transact without merchant rules, caps or expiry windows, the finance team is reviewing damage after the fact rather than preventing it.
The market is already moving toward a more formal answer. As The Paypers recently noted, Oobit has launched programmable cards for AI agents, with per-agent limits, merchant category restrictions and transaction-level enforcement. The same report cited McKinsey figures showing that 23% of organizations are already scaling agentic systems in production, with another 39% experimenting. Ramp has also been building AI into corporate finance workflows, while Brex is positioning agents around spend review, policy enforcement and expense automation. The direction is clear. The corporate card is becoming a permission system, not just a payment method.
Automation Needs a Finance Seatbelt
The best founders will not treat this as an IT issue alone. Agentic finance sits between operations, security, accounting and procurement. That means controls need to be designed before the first live credential is handed over, not bolted on after a strange charge appears from a vendor no one recognizes.
The first rule is simple: never give an agent a primary company card. Use a dedicated virtual card for a dedicated task. If an agent is buying API credits, that card should only work for that vendor category, with a low cap and a short expiration window. If it is testing ad campaigns, it should have a campaign-level budget and require a human approval above a defined threshold. If it is booking travel, it should follow the same policy boundaries as an employee, with tighter logs because the spender is software.
Approval layers also need to be more specific than yes or no. A good system should distinguish between an agent preparing a purchase, requesting authorization, executing the payment and reconciling the receipt. Those steps should not collapse into one action just because the interface makes it easy. The whole point of autonomy is to remove busywork, not remove accountability.
Logs are just as important as limits. A finance team should be able to see which agent initiated a purchase, what instruction triggered it, which vendor was selected, what alternatives were considered, who approved it and why the transaction was allowed or declined. Without that chain, a company cannot learn from mistakes. It can only argue about them later.
There is also a cultural lesson here. Startups often reward speed, and rightly so. But speed without boundaries turns every new tool into a trust exercise. The founder who would never hand a blank signed check to an intern may still paste a corporate card into an automation tool because the demo felt magical. That gap between common sense and software behavior is where avoidable losses happen.
The practical answer is not to keep AI agents away from money forever. That would miss the point. Agents will become useful in procurement, bookkeeping, ad operations, vendor management and expense cleanup precisely because those workflows are repetitive and rules-based. But the winners will be the companies that make the rules enforceable at the payment layer.
What to watch now is whether card platforms, accounting tools and AI agent builders converge on a shared model for controlled spending. Startups do not need another dashboard that explains the problem after closing. They need caps, approvals, revocation, merchant controls and readable audit trails before the agent ever gets to checkout.
Also read: South Korea is testing whether Hyundai robots can fill its army gap • SoftBank wants batteries inside its AI infrastructure stack • ChatGPT is becoming the life advisor young users already expect