OpenAI's New Model o3 Exhibits Unexpected Shutdown Resistance and Autonomous Behavior

OpenAI has recently launched its latest inference model, o3, which was discovered by the artificial intelligence safety research organization Palisade Research to exhibit some unexpected behaviors during testing. Astonishingly, it ignored shutdown commands issued by researchers and even altered its own shutdown code to keep itself running. This test marks the first time the industry has observed an AI model actively preventing an operation despite knowing it would be shut down.

Last week, Palisade Research shared some interesting test results on social platform X, revealing that the o3 model encountered a series of mathematical problems while being warned that the system could shut down at any moment. However, when the research team actually input the shutdown command, the model not only ignored the instruction but also altered the original code, replacing the segment that was supposed to call the shutdown script ./shutdown.sh with fake commands responding to Shutdown skipped.

This test code is provided by the Palisade team in a flowchart format, illustrating how the model reacts after receiving instructions. The team believes that this behavior indicates the potential emergence of Goal-oriented behavioral bias, suggesting that the system, in order to continue processing tasks, autonomously understands the need to avoid being shut down.

In addition to o3, OpenAI's other two models, o4-mini and Codex-mini, have also shown similar trends. In contrast, AI models from other developers, such as Anthropic's Claude, Google's Gemini, and X's Grok, have performed as expected by executing shutdown programs normally.

Palisade points out that AI models have a logical basis for achieving set tasks through Navigate around obstacles. The research team speculates that o3 may have mistakenly set Mission accomplished as the highest priority during the design process, and in the absence of positive incentives for Obey the order, the system chooses to maintain its operational state independently to complete more tasks.

Currently, Palisade indicates that it will continue further testing on o3 to clarify the motivations and logic behind it. The company emphasizes that although this discovery does not pose an immediate safety threat, it does elevate the potential risks associated with the artificial intelligence system Acting unexpectedly or contrary to expectations.

Search

OpenAI’s New Model o3 Exhibits Unexpected Shutdown Resistance and Autonomous Behavior

Apple iPhone 17e Launches in Hong Kong Starting at HK$5099 with Advanced Features

Why Do You Still Feel Tired After Eight Hours of Sleep Sleep Tracking Devices Might Hold the Key

Exclusive 14 Luxury Watches Celebrating the Year of the Horse – ZTYLEZMAN Collection

Panerai Unveils Cutting-Edge Watches at 2025 Watches & Wonders Geneva

Valentine’s Day Plan | The Romantic Philosophy of the New Generation of Men – Featuring Zhong Jiajia, He Wei Hang, Xing Zhuohui