Baseten raises $1.5 billion at a $13 billion valuation as inference becomes AI's most contested infrastructure layer

Baseten's new $1.5 billion round says something blunt about AI infrastructure: the money has moved from model hype to the cost and reliability of running those models every day.

Baseten is no longer being valued like a useful tool for AI teams. It is being valued like part of the machinery those teams can't afford to get wrong. The Wall Street Journal reported on June 18 that the company is finalizing a $1.5 billion fundraising round with a dual structure, with some investors coming in at an $11 billion valuation and others at $13 billion. Five months after a reported $5 billion valuation, that is a serious jump.

You don't need to worship the valuation to understand the signal. Inference, the work of actually running AI models after they're trained, has become one of the most contested parts of the AI stack. It is also one of the least glamorous. Nobody launches a product demo by bragging about latency, cloud routing, or GPU utilization. Then the product gets real customers, the bill arrives, response times wobble, and suddenly that boring layer decides whether the business works.

That is the business Baseten is chasing. The company, co-founded by Tuhin Srivastava, provides software and computing capacity for companies using AI models, especially cheaper open-source models that need serious infrastructure wrapped around them. The Journal reported that Baseten sources computing from more than 20 cloud providers and counts Cursor and Mercor among its customers. Those names matter. A coding assistant and an AI recruiting company both live or die on whether model responses come back quickly and reliably. There isn't much room for drama.

Frankly, this is where a lot of AI spending is becoming more honest. Paying OpenAI or Anthropic for access to a closed model is simple, and for many companies it still makes sense. But Srivastava told the Journal that when customers go to OpenAI or Anthropic, they get the model and the inference infrastructure together. Baseten is betting that more companies want to split those pieces apart, use lower-cost models where they can, and control the serving layer themselves.

That bet has teeth because open-source models have improved fast. DeepSeek and Moonshot AI's Kimi, both named in the Journal's report, have made the point hard to ignore: if the model gap narrows, the argument shifts to cost, control, and reliability. You may not care which model sits underneath a feature if the answer is good enough and the bill is 70% lower, which the Journal said some Baseten customers can achieve when they use open-source models where appropriate.

The investor list shows how crowded this fight is getting. Altimeter Capital, Conviction, Spark Capital, Sands Capital, and Wellington Management are backing the round, according to the Journal. Wellington's involvement is useful texture because the firm is not usually the name you reach for when describing speculative AI froth. It is a long-term asset manager walking into inference because enterprise AI spending is moving from experiments to production workloads.

Baseten isn't alone, and you shouldn't read this round as a coronation. Fireworks AI raised money in October at a $4 billion valuation, the Journal reported, while Cerebras has also become part of the same inference conversation. The field is still messy. Some companies will want model hosting, some will want optimization, some will want access to cheaper compute across clouds, and some will decide the simplest answer is to stay with a hyperscaler until the bill hurts badly enough.

Here is the practical point for founders: model choice is becoming less permanent than infrastructure choice. You can swap from one model family to another if the product layer has been built cleanly. Moving the system that handles your traffic, latency, scaling, routing, and cost controls is harder. Once an AI company has tuned its product around a serving stack, ripping it out is not a weekend job.

That is why the $13 billion number is both bold and understandable. It prices Baseten as more than a vendor between clouds and models. It prices the company as a control point in the AI economy. Maybe that proves too rich if customers consolidate around AWS, Google, Microsoft, OpenAI, and Anthropic. Maybe the opposite happens, and AI teams end up running inference through multiple specialists the same way companies already run more than one cloud.

For now, the real fact is simple. The companies building AI products are discovering that the expensive part is not only intelligence. It is delivering intelligence quickly, cheaply, and thousands or millions of times a day. Baseten's new round is current, verified, and worth covering because that is where the AI market is putting real money.

Also read: Insilico Medicine signs a $2.5 billion AI drug discovery deal with SK Biopharmaceuticals for neuroimmune disorders • Oracle cut 21,000 jobs to fund its $50 billion AI buildout and workers say they trained their own replacements • Qualcomm is buying Modular for $4 billion to take on Nvidia where it actually counts