Groq raises $650 million to become a neocloud after Nvidia paid $20 billion for its soul

Groq is trying to sell investors a second company after Nvidia licensed its core inference technology and hired away key leaders. The new $650 million raise only works if customers buy GroqCloud as more than the cloud arm left behind.

Groq's next act is no longer about proving that its Language Processing Unit can embarrass a GPU on speed. Nvidia already paid for that idea. The harder job now is convincing you that the remaining company can turn GroqCloud into a serious inference provider while Nvidia folds Groq's own architecture into its AI factory roadmap.

Axios reported on May 28 that Groq was raising up to $650 million from existing investors after the Nvidia agreement delivered payouts to backers and pulled much of the senior team out of the startup. That makes the round current enough to matter. It also makes it awkward. Investors are not funding the same Groq that raised $750 million at a $6.9 billion valuation in September 2025. They are funding the business left after the most valuable technical story was licensed to the largest AI chip company in the world.

The December deal was not a clean acquisition. MarketWatch reported at the time that Nvidia entered a nonexclusive licensing agreement for Groq's inference technology, while Groq founder and CEO Jonathan Ross, president Sunny Madra and other personnel would join Nvidia. Groq said it would remain independent, GroqCloud would keep operating, and CFO Simone Edwards would become CEO. CNBC had earlier reported the transaction at about $20 billion, which is the figure that turned the deal from an industry partnership into a warning shot.

You can see why Nvidia wanted it. Training made Nvidia the toll collector for the first phase of the AI boom, but inference is where every chatbot reply, coding assistant response and enterprise agent call has to run again and again. Latency and cost per token are not investor-deck details once you are serving real customers at scale. They decide whether a product feels instant or sluggish, and whether the margin survives usage.

Groq built its reputation on that narrow, valuable problem. Its LPUs were designed for fast, predictable inference, not general-purpose GPU work. The company turned that into GroqCloud, an API service that lets developers run models on Groq hardware without buying the chips. It also announced a $1.5 billion commitment from Saudi Arabia in February 2025 to expand LPU-based inference infrastructure, tied to a data center in Dammam, according to Reuters. Those are not small facts. They are the reason the company still has a story after Nvidia took the headline asset.

But the story has changed. Before the Nvidia deal, Groq could tell customers and investors that it owned a distinctive chip architecture built for the inference era. After the deal, Nvidia can tell the same story with more supply chain muscle, deeper customer relationships and a platform buyers already know how to procure. When your biggest validation also becomes your biggest channel risk, you don't get to talk like nothing happened.

Here's the thing: a neocloud is not just a clever label for rented compute. CoreWeave built its position around scarce Nvidia GPU capacity and large AI customers. Lambda has spent years selling GPU access to developers and research teams. The hyperscalers, AWS, Google Cloud and Microsoft Azure, can cut prices, bundle services and push their own silicon when they need to defend accounts. GroqCloud has to compete in that market with a new leadership structure and with its own original technology now licensed into Nvidia's future products.

That doesn't make Groq dead. It makes the bar higher. If you're building an inference stack, you care about speed, uptime, model support, data residency, pricing and whether the provider will still be there when your usage climbs. Groq can still win business if its service is fast enough, available enough and cheaper enough to make customers ignore the corporate drama. Developers are practical. They will use the API that works.

Frankly, the $650 million raise confirms confidence from insiders more than it proves demand from the market. Existing investors already know the company, already benefited from the Nvidia transaction, and now have a chance to finance the cloud business that stayed outside it. That is rational. It is not the same as outside customers voting with production workloads.

Groq's best argument now is operational, not mythic. More data center capacity. More reliable GroqCloud service. More enterprise customers willing to put inference traffic on LPUs instead of GPUs. The company does not need to beat Nvidia in a press release. It needs to prove that the piece Nvidia did not buy can stand on its own.

Also read: Five Eyes agencies warn that AI-powered cyberattacks are months away as the US moves to lock down frontier models • SpaceX lands a $6.3 billion computing deal with Reflection AI and is quietly becoming the infrastructure backbone of frontier AI • Meta buys into CRED and bets its WhatsApp future on an Indian founder