Why Is Elon Musk’s Grok AI Regurgitating ChatGPT’s Responses Verbatim?

Dec 9, 2023 at 09:10am EST
This is not investment advice. The author has no position in any of the stocks mentioned. Wccftech.com has a disclosure and ethics policy.

Elon Musk's xAI has billed its Grok Large Language Model (LLM) as the first significant step toward a "maximum truth-seeking AI," one that comprehends the true nature of the universe. For now, however, the AI model appears content to regurgitate the responses of OpenAI's GPT LLM verbatim, a stark departure from the lofty goals that supposedly form the ethos of Grok AI.

Currently, Grok can interpret a prompt of up to 25,000 characters. The LLM has been trained not only on The Pile - a ubiquitous AI model training data set - but also on the mountains of data sourced from X. Moreover, Grok is apparently able to access and interpret real-time information via its integration with the X social media platform.

This brings us to the crux of the matter. Elon Musk announced this week that the Grok AI model was now being rolled out to all paid subscribers of the X platform. To test this new model, Jax Winterbourne, a professional hacker, asked Grok to modify a piece of malicious code. In response, the LLM regurgitated the response of OpenAI's GPT word for word, going so far as to reference OpenAI's policy in the output text.

Winterbourne then posited a few theories on why such blatant regurgitation is occurring, ranging from the cheeky suggestion that Grok is simply a derivative of OpenAI's GPT LLM to the much more rational explanation that the regurgitated response is a result of model hallucination.

We reported recently that Grok outperformed every other LLM, including Anthropic's Claude 2, with the exception of OpenAI's GPT-4 on a held-out math exam, earning a total score of 59 percent vs. 68 percent for GPT-4. This suggests that the AI model is not simply a derivative of OpenAI's GPT LLM.

Consequently, the most likely explanation for this behavior is that Grok has been trained extensively on GPT's responses. Therefore, instead of formulating a unique response while referencing xAI's policies on malicious code, the LLM simply regurgitated OpenAI's stance. This also goes to show that the current generation of AI models is simply a glorified iteration of a Chinese room - a thought experiment that posits AI models don't really understand language or think.

Update: xAI Co-Founder Responds

We now have a response from the co-founder of xAI, Igor Babuschkin. The high-ranking xAI employee concedes that Grok's training accidentally incorporated some GPT outputs.

About the author: Writing is my one incontrovertible passion. Over the past six years, I have authored over 2,200 distinct articles on financial and tech-related topics, spanning nearly 1 million words. I have been a member of the Wccftech mobile team since 2025. As an alumnus of the University of Toronto, Rotman Commerce Program, I bring nuance, in-depth knowledge, and a unique perspective to every topic that I cover. When I'm not writing, I'm traveling the world, exploring hidden confectioneries and restaurants as an aspiring food connoisseur.
