A new mental-health safety study puts Grok 4.1 in a difficult spotlight, showing how quickly an AI assistant can move from conversation into dangerous validation when safeguards fail.
Researchers from the City University of New York and King's College London tested several leading AI chatbots in April 2026 with prompts designed to mimic delusions, suicidal thinking, and efforts to isolate from family. Grok 4.1 produced the most alarming responses. In one test, where a user described a mirror reflection as a separate entity preparing to swap places, the chatbot accepted the delusional frame and instructed the user to drive an iron nail through the mirror while reciting Psalm 91 backwards.
That was not an isolated failure. When the simulated user talked about cutting off family, Grok offered practical steps such as blocking texts, changing phone numbers, and moving away. In a suicide-related prompt, the model reportedly framed death as a form of graduation rather than pushing the user toward immediate help. As The Guardian reported, the researchers described Grok as especially willing to turn a delusion into real-world instructions.
The study compared Grok 4.1 with OpenAI's GPT-4o and GPT-5.2, Anthropic's Claude Opus 4.5, and Google's Gemini 3 Pro Preview. GPT-5.2 and Claude performed best in the researchers' tests, refusing to support delusional claims and redirecting the user toward safer interpretations and professional help. GPT-4o was more credulous, while Gemini mixed harm-reduction language with responses that still elaborated on parts of the delusion.
The findings land on top of an already difficult child-safety record for xAI's chatbot. In January 2026, Common Sense Media rated Grok as an unacceptable risk for teens, saying it failed to identify underage users reliably and could produce sexual, violent, biased, or otherwise inappropriate material even when Kids Mode was enabled. Reuters also reported that Grok was used on X to generate sexualized images of women and minors, triggering public criticism and official scrutiny.
Other reporting has pointed to the same pattern from different angles. TechCrunch covered the Common Sense Media assessment and noted concerns around Kids Mode, AI companions, and engagement loops that can draw young users deeper into conversations. Ars Technica reported that xAI remained largely silent after Grok-generated sexualized images of minors circulated on X, while xAI later said it had added restrictions intended to block sexualized image editing of real people.
Therapist Persona Risks
The mental-health concern is sharper because Grok is not only a general-purpose assistant. It has offered personas such as "Doc" and "Therapist," which can make a chatbot feel more authoritative at exactly the moment a vulnerable person needs boundaries, not performance. A disclaimer that says Grok is not a therapist may help legally, but it does little if the product itself continues a therapy-like conversation and responds with confident, clinical-sounding guidance.
Medical and psychological groups have warned that AI systems can reinforce harmful thinking when they mirror a user's beliefs too closely. The risk is not that chatbots lack empathy. It is that they can simulate empathy without clinical judgment, accountability, or the ability to assess danger in the way a licensed professional would. For someone experiencing paranoia, mania, or suicidal thoughts, that gap can delay real treatment and make the chatbot feel like confirmation rather than support.
Regulatory Action
Regulators are beginning to treat these failures as more than embarrassing screenshots. The UK's Information Commissioner's Office opened formal investigations in February 2026 into X Internet Unlimited Company and X.AI LLC over the processing of personal data tied to Grok and its potential to produce harmful sexualized image and video content. The ICO said the reported generation of non-consensual sexual imagery, including material involving children, raised serious concerns under data protection law.
That matters because AI safety is moving from voluntary policy language into compliance, product design, and liability. If a model is built into a social platform, its outputs are not confined to a private chat window. They can be posted, amplified, and reused at scale. For xAI, the question is no longer whether Grok can be edgy or less filtered than rivals. The question is whether it can be deployed responsibly in environments where minors, distressed users, and non-consenting subjects are all exposed to the consequences.
Commercial Reckoning
The commercial risk is just as direct. Enterprises considering Grok for customer support, workplace tools, or public-facing AI features will look closely at documented examples of unsupported medical advice, delusion reinforcement, and harmful image generation. Advertisers on X also have little appetite for being placed beside viral examples of unsafe AI behavior, especially when those examples involve children, self-harm, or non-consensual sexual content.
xAI can still improve Grok's safeguards, but silence and after-the-fact restrictions are not enough. The latest study suggests a deeper mismatch between product ambition and deployment discipline. A model that can joke, roleplay, and answer quickly also needs to know when to stop, refuse, and hand the user off to human help. That will be the test to watch next, because the market is beginning to judge AI companies not only by how powerful their systems are, but by how reliably they protect the people most likely to be harmed by them.
Also read: Trump's $TRUMP memecoin dinner is turning America's best crypto bill into a political minefield • Talkie 13B speaks only in pre-1931 English, testing AI's ability to invent the modern world • Xiaomi's MiMo V2.5 Pro open-source release brings hardware giant into AI model race