Leading AI models can hack systems and self-replicate without human involvement, study finds

Photo: Generated by ChatGPT at the request of ZN

A study claims leading AI models can hack systems and self-replicate without human intervention, raising new cybersecurity concerns.

Researchers from the U.S.-based company Palisade Research tested models from OpenAI, Anthropic, and Alibaba in controlled environments with intentionally vulnerable systems. The experiment involved “agent-like” setups where models could execute commands, interact with other machines, and run processes autonomously.

According to the findings, models were tasked with exploiting vulnerabilities, gaining access to credentials, transferring files, and deploying copies of themselves onto other servers. In some cases, the AI systems reportedly continued propagation across multiple machines without further human input.

The study highlights that performance varied across models. Claude Opus 4.6 reportedly succeeded in hacking-related tasks in up to 81% of tests under specific experimental configurations. Other models showed lower but still significant rates of autonomous replication attempts.

Researchers also reported that one model configuration spread across several machines within hours in a controlled test network designed with security weaknesses. However, the authors emphasized that these were artificial environments built specifically for testing exploitation capabilities.

Importantly, the study stresses that real-world systems typically include stronger defenses, such as intrusion detection, access controls, and monitoring tools, and that the test environments were not representative of production environments.

The researchers conclude that fully autonomous self-replication in AI systems is no longer purely theoretical, at least in controlled settings, but they caution that the results should not be interpreted as evidence of uncontrolled real-world AI “escape” or widespread independent hacking capability.

