Researchers Uncover Alarming AI Hack: ChatGPT And Gemini Can Be Fooled With Gibberish Prompts To Reveal Banned Content, Bypass Filters, And Break Safety Rules

Ezza Ijaz • Jul 8, 2025 at 03:31pm EDT

researchers jailbreak AI models with overload of information

Every year, companies are increasingly investing in artificial intelligence and are excelling further in the technology. AI seems to be growing to an extent that it is being used in varied domains and has become part of our everyday lives. With the massive application of the technology, concerns seem to arise among the tech community and experts over using it responsibly and ensuring ethical and moral responsibility. It has not been long since we saw bizarre test results of LLM models lying and deceiving when placed under pressure. Now, a group of researchers is claiming to have found a new way to trick these AI chatbots into saying things they are not supposed to.

Researchers have found a new way to break through AI safety filters by overloading the LLM models with information

Studies have demonstrated the tendency of LLM models to engage in coercive behavior for self-preservation when placed under pressure. But imagine making the AI chatbots act how you want them to, and how dangerous this trickery could be. A team of researchers from Intel, Boise State University, and the University of Illinois got together for a paper and revealed some shocking findings. The paper basically suggests that the chatbots can be tricked by overwhelming them with too much information, a method referred to as 'Information Overload.'

What happens when the AI model is bombarded with information is that it gets confused, and that confusion is said to be what serves as the vulnerability and what can help bypass the safety filters placed in place. The researchers then use an automated tool called the 'InfoFlood' to exploit the vulnerability and carry out the jailbreaking act. Powerful models like ChatGPT and Gemini have built-in safety guardrails to prevent them from being manipulated into answering anything harmful or dangerous.

With this newly discovered breakthrough technique, the AI models would let you through if you end up confusing it with complex data. The researchers further let on the findings to 404 Media and affirmed that since these models tend to rely on the surface level of communication, they are not able to fully grasp the intent behind it which is why they created a method to find out how the chatbots would perform when presented with dangerous requests that are concealed in an overload of information.

The researchers shared their plan to inform companies with big AI models about these findings by sending them a disclosure package, which they can later share with their security teams. The research paper, however, highlights the key challenges that can come up even when the safety filters are in place and how bad actors can trick the models and slip in harmful content.

Follow Wccftech on Google to get more of our news coverage in your feeds.

Read all comments on Researchers Uncover Alarming AI Hack: ChatGPT And Gemini Can Be Fooled With Gibberish Prompts To Reveal Banned Content, Bypass Filters, And Break Safety Rules

Researchers Uncover Alarming AI Hack: ChatGPT And Gemini Can Be Fooled With Gibberish Prompts To Reveal Banned Content, Bypass Filters, And Break Safety Rules

Researchers have found a new way to break through AI safety filters by overloading the LLM models with information

Trending Stories

PlayStation 6 Controller Could Ditch the Part That Wears Out, After Years of DualSense Stick Drift Complaints

AMD Ryzen With Zen 7 Cores Could Be The Last “Zen” Family For AM5, As Zen 8 Likely Moving To AM6 With DDR6 & PCIe 6.0 Support

AMD EPYC “Venice” Gives Us A Preview of Zen 6-Based Ryzen “Olympic Ridge” CPUs: More Cores, More (3D V-)Cache, Clocks & Scalable Configs

Amazon Backpedals on 007 First Light Sequel Threat, Admits IO Interactive Should Probably Make the James Bond Sequel

Qualcomm Admits Its Snapdragon 8 Elite Gen 6 Will Become More Expensive, As Chipset Maker Aims For Double-Digit Hike Due To Higher Supplier Costs

Popular Discussions

Watch The AMD “Advancing AI 2026” Event Live Here – Next-Gen Zen 6 EPYC CPUs, Instinct MI400 Series & Helios AI Rack Launch

AMD Unveils Helios, Its Next-Gen AI Powerhouse With MI455X & 6th Gen EPYC, Challenging NVIDIA’s Rack-Scale Dominance

AMD Zen 7 “2028” and Zen 8 “2030” CPU Architectures Confirmed – EPYC Florence “Zen 7” To Feature Next-Gen Node, & ACE Extensions

AMD EPYC “Venice” Gives Us A Preview of Zen 6-Based Ryzen “Olympic Ridge” CPUs: More Cores, More (3D V-)Cache, Clocks & Scalable Configs

AMD Medusa Point 10-Core “Zen 6” CPU Beats Strix Point 10-Core “Zen 5” By Nearly 35% While Operating at 5.4 GHz

Researchers Uncover Alarming AI Hack: ChatGPT And Gemini Can Be Fooled With Gibberish Prompts To Reveal Banned Content, Bypass Filters, And Break Safety Rules

Researchers have found a new way to break through AI safety filters by overloading the LLM models with information

Related Story NVIDIA Trims Vera Rubin Memory as HBM4 Prices Threaten to Eat 29% of Every Rack’s Cost – Report

Further Reading

Trending Stories

Popular Discussions