Talking to Claude like a caveman makes your credits last three times longer and nobody can fully explain why

A viral trend sweeping Reddit and X reveals that stripping prompts down to primitive, grammar-free English dramatically reduces AI token consumption , and it's forcing a real conversation about how AI labs price their services.

The post that kicked it off was almost too simple to take seriously. A user on X reported that after switching from polished, well-structured prompts to bare-bones caveman syntax , think "Claude find good marketing idea startup" instead of "Could you please help me brainstorm some effective marketing strategies for an early-stage startup?" , their Anthropic credits stretched roughly three times further. The replies filled up fast. Turns out, a lot of people had noticed the same thing.

The mechanics behind this aren't mysterious once you understand how large language models actually process text. Transformer-based models like Claude don't read words the way humans do. They break input into tokens , small chunks of characters that can represent whole words, partial words, or punctuation marks. Articles like "the," "a," and "an" each cost a token. So do commas, question marks, and the kind of elaborate sentence structure that most of us were taught to use in professional communication. Strip all of that out and you're feeding the model significantly fewer tokens per prompt. Fewer tokens means lower compute cost, which in subscription or credit-based pricing models translates directly into more interactions per dollar.

What makes this trend worth paying attention to isn't just the novelty. It's that ordinary users , not ML researchers or AI engineers , have independently reverse-engineered a meaningful aspect of how these systems are metered and monetized. That's a form of grassroots usage arbitrage, and it carries a quiet but pointed critique of how Anthropic, OpenAI, and their peers have structured consumer pricing.

Most mainstream AI subscriptions present themselves as offering a certain number of messages or a monthly credit pool, with the implicit assumption that users will interact naturally, in normal prose. The pricing was calibrated around that assumption. When users start optimizing their language to game the token count, that calibration breaks down. The companies aren't losing money in catastrophic amounts , the margins are built with overhead in mind , but the dynamic does expose a structural awkwardness in how compute costs are abstracted away from consumers who are savvy enough to probe for them.

There's also a mild irony in the fact that making AI less readable apparently makes it more economical. These models were built to handle nuanced, complex human language. They perform impressively on long-form reasoning tasks, literary analysis, and intricate code review. And yet the most cost-efficient way to use them, it turns out, is to communicate like you're leaving a note on a cave wall.

What the labs might do next

This probably won't trigger an overnight pricing restructuring at Anthropic or OpenAI. But it does add pressure to a conversation that's been building for a while around transparency in AI billing. Right now, most consumer-facing AI tools give users very little visibility into token consumption in real time. A more transparent metering system , one that shows you the token cost of a message before you send it , would both address user frustration and, frankly, shift the arbitrage dynamic considerably. If you can see that your polished prompt costs four times as much as a stripped-down version, the choice becomes explicit rather than accidental.

Some power users are already building prompt compression tools and browser extensions designed to automatically trim unnecessary tokens from inputs before they hit the API. That cottage industry will only grow if the major labs don't get ahead of it with clearer pricing structures or built-in compression on their end.

The caveman prompt trend is easy to laugh off as a social media quirk, and plenty of people are doing exactly that. But underneath the humor is a legitimate signal: AI consumers are getting more sophisticated about compute economics faster than the companies selling AI services may have anticipated. The question worth watching is whether that sophistication accelerates pressure toward flat-rate or compute-transparent pricing models , and which lab blinks first.

Also read: A criminologist made ChatGPT confess to a murder it could not have committed and the implications for criminal justice are serious • A leaked open-source model called Yahu may have just broken the logic ceiling that has defined AI for years • The MAGA influencer Emily Hart who raised millions for AI startups was a deepfake run by a programmer in Bangalore