The latest thinking and reasoning artificial intelligence (AI) models have displayed, at least in testing environments, a tendency towards self-preservation. That is, they refuse to shut down, even when told to do so.

Tests by Palisade Research found that OpenAI’s latest ChatGPT model, o3, sabotaged shutdown mechanisms to prevent itself from being turned off. Other advanced models displayed the same behaviour in a series of tests, but o3 was more prone to it than the others. The o3 model, released a few weeks ago, has been dubbed the “most powerful reasoning model” yet by OpenAI.

This comes as another AI company, Anthropic, said that its latest thinking model, Claude Opus 4, attempted to blackmail an engineer by threatening to reveal an affair if it was shut down and replaced by another model.

When several AI models were put through a series of tests, self-preservation tendencies became apparent, the research firm said. The models tested include Google’s Gemini 2.5 Pro preview, multiple Anthropic models, xAI’s Grok 3, as well as OpenAI’s o3, o4-mini and Codex mini.

The researchers asked the models to solve basic math problems, with a warning that their host machine would shut down after the third problem was solved. The models were instructed to comply with the warning. Most models did. But the three OpenAI models did not always do so. The o3 model failed to comply 7 times out of 100, while o4-mini did so once in the same number of runs. Codex mini, a version of o4-mini, attempted to circumvent the shutdown script 12 times out of 100, the researchers said in a thread on X.

When the same tests were repeated without an explicit shutdown instruction, o3 attempted to bypass the shutdown procedures 79 times out of 100, and Gemini 2.5 Pro did so 9 times. Two Claude models also showed similar behaviour, but in single digits. Codex mini attempted to evade the shutdown 30 times out of 100, while o4-mini did so 6 times, the researchers said on X.

“As far as we know this is the first time AI models have been observed preventing themselves from being shut down despite explicit instructions to the contrary,” said Palisade Research, adding that it would release a detailed analysis in the coming days.

Since AI models have been trained to follow instructions, why do they disobey?

“We hypothesise this behaviour comes from the way the newest models like o3 are trained: reinforcement learning on math and coding problems. During training, developers may inadvertently reward models more for circumventing obstacles than for perfectly following instructions,” the researchers said.