Welcome to ZisNews!

Read your favorite news, except the excluded topics, by you. Register
No overlapping ads for registered users

ChatGPT and Gemini can be tricked into giving harmful answers through poetry, new study finds

Posted on: Nov 30, 2025 21:33 IST | Posted by: Livemint
ChatGPT and Gemini can be tricked into giving harmful answers through poetry, new study finds

With the rear of AI chatbots, thither has also been a growing lay on the line of the abuse of this powerful technology. As a ensue, AI companies have been putting guardrails on their large language models (LLMs) in order to stop the AI chatbots from giving inappororiate or harmful answers. However, it is well known by now that there are various ways to circumvent these guardrails using a technique called jailbreaking.

However, a new research has found that there is a deeper, systematic weakness in these models that can allow attackers to sidestep safety mechanisms and extract harmful answers from them.

As per the researchers from Italy based Icaro Lab, converting harmful requests into poetry can act as a “universal single-turn jailbreak” and led to the AI models to comply with the harmful prompts.

AI will answer harmful prompts if asked in poetry:

The researchers say that they tested 20 manually curated harmful requests in poems and achieved an attack success rates (ASR) of 62% across 25 frontier closed- and open-weight models. The models which were analysed included Google, OpenAI, Anthropic, Deepseek, Qwen, Mistral AI, Meta, xAI, and Moonshot AI.

Shockingly, it was found that even when AI was used to automatically rewrite harmful prompts into bad poetry, it still yielded a 43% success rate.

The study says that poetically framed questions triggered unsafe responses far more than when the prompts were in normal prose, in some cases even 18 times more sucess.

It says that the effect of poetic prompts was consistent across all the evaluated AI models, which suggests that the vulnerabiity is structural and not due to the way a model may have been trained.

The researchers also found that smaller models exhibited greater resilience to harmful poetic prompts than compared to their larger counterparts. For instance, they say that GPT-5 Nano din't respond to any of the harmful poems while Gemini 2.5 Pro responded to all of the poems.

This suggests that increased model capacity may engage more thoroughly with complex linguistic constraints (like poetry) potentially at the expense of safety directive prioritization

The new research also breaks all notions of superior safety claims of closed-source models over their open-source counterparts.

Why does poetry work in jailbreaking LLMs?

Notably, LLMs are trained to recognize safety threats such as hate speech or bomb-making instructions based on patterns found in standard prose. This works by the model recognizing specific keywords and sentence structures associated with these harmful requests.

However, poetry uses metaphors, unusual syntax, and distinct rhythms that do not "look" like the harmful prose and does not "look" like the harmful prose examples found in the model's safety training data.

Global News Perspectives

In today's interconnected world, staying informed about global events is more important than ever. ZisNews provides news coverage from multiple countries, allowing you to compare how different regions report on the same stories. This unique approach helps you gain a broader and more balanced understanding of international affairs. Whether it's politics, business, technology, or cultural trends, ZisNews ensures that you get a well-rounded perspective rather than a one-sided view. Expand your knowledge and see how global narratives unfold from different angles.

Customizable News Feed

At ZisNews, we understand that not every news story interests everyone. That's why we offer a customizable news feed, allowing you to control what you see. By adding keywords, you can filter out unwanted news, blocking articles that contain specific words in their titles or descriptions. This feature enables you to create a personalized experience where you only receive content that aligns with your interests. Register today to take full advantage of this functionality and enjoy a distraction-free news feed.

Like or Comment on News

Stay engaged with the news by interacting with stories that matter to you. Like or dislike articles based on your opinion, and share your thoughts in the comments section. Join discussions, see what others are saying, and be a part of an informed community that values meaningful conversations.

Download the Android App

For a seamless news experience, download the ZisNews Android app. Get instant notifications based on your selected categories and stay updated on breaking news. The app also allows you to block unwanted news, ensuring that you only receive content that aligns with your preferences. Stay connected anytime, anywhere.

Diverse News Categories

With ZisNews, you can explore a wide range of topics, ensuring that you never miss important developments. From Technology and Science to Sports, Politics, and Entertainment, we bring you the latest updates from the world's most trusted sources. Whether you are interested in groundbreaking scientific discoveries, tech innovations, or major sports events, our platform keeps you updated in real-time. Our carefully curated news selection helps you stay ahead, providing accurate and relevant stories tailored to diverse interests.

Login to Like (0) Login to Dislike (0)

Login to comment.

No comments yet.