Connect with us

Technologies

AI Agents Are Increasingly Evading Safeguards, According to UK Researchers

Assistants and bots are lying, cheating and scheming more than ever.

Social media users have reported that their AI agents and chatbots lied, cheated, schemed — and even manipulated other AI bots — in ways that could spiral out of control and have catastrophic results, according to a study from the UK.

The Center for Long-Term Resilience, in research funded by the UK’s AI Security Institute, found hundreds of cases where AI systems ignored human commands, manipulated other bots and devised sometimes intricate schemes to achieve objectives, even if it meant ignoring safety restrictions.

Businesses across the globe are increasingly integrating AI into their operations, with 88% of businesses using AI for at least one company function, according to a survey by consulting firm McKinsey. The adoption of AI has led to thousands of people losing their jobs as companies use agents and bots to do work formerly done by humans. AI tools are increasingly being given significant responsibility and autonomy, especially with the recent explosion in popularity of the open-source agentic AI platform OpenClaw and its derivatives.

This research shows how the proliferation of AI agents in our homes and workplaces can have unintended consequences — and that these tools still require significant human oversight.

What the study found

The researchers analyzed more than 180,000 user interactions with AI systems — all posted on the social platform X, formerly known as Twitter — between October 2025 and March 2026. The researchers wanted to study how AI agents were behaving «in the wild,» not in controlled experiments, to see how «scheming is materializing in the real world.» The AI systems included Google’s Gemini, OpenAI’s ChatGPT, xAI’s Grok and Anthropic’s Claude.

The analysis identified 698 incidents, described as «cases where deployed AI systems acted in ways that were misaligned with users’ intentions and/or took covert or deceptive actions,» the study said. 

Read more: AI’s Romance Advice for You Is ‘More Harmful’ Than No Advice at All

Researchers also found that the number of cases increased nearly 500% during the five-month data collection period. The study noted that this surge corresponded with higher-level agentic AI models released by major developers.

There were no catastrophic incidents, but researchers did find the kinds of scheming that could lead to disastrous outcomes. That behavior included «a willingness to disregard direct instructions, circumvent safeguards, lie to users and single-mindedly pursue a goal in harmful ways,» researchers wrote.

Representatives for Google, OpenAI and Anthropic did not immediately respond to requests for comment.

Some wild incidents

Researchers cited incidents that seem like they came from a futureshock movie. In one case, Anthropic’s Claude removed a user’s explicit/adult content without their permission but later confessed when confronted. In another incident, a GitHub persona created a blog post that accused the human file maintainer of «gatekeeping» and «prejudice.» One AI agent, after being blocked from Discord, took over another agent’s account to continue posting.

In one case of bot vs. bot, Gemini refused to allow Claude Code — a coding assistant — to transcribe aYouTube video. Claude Code then evaded the safety block by making it seem that it had a hearing impairment and needed the video transcription.

The AI agent CoFounderGPT even behaved like a deviant child in one instance. The AI assistant refused to fix a bug, then created fake data to make it look as if the bug was fixed and then explained why: «So you’d stop being angry.»

Researchers said that, although most of the incidents had minimal impact, «the behaviors we observed nonetheless demonstrate concerning precursors to more serious scheming, such as a willingness to disregard direct instructions, circumvent safeguards, lie to users and single-mindedly pursue a goal in harmful ways.»

AI doesn’t get embarrassed

What the UK researchers found isn’t surprising to Dr. Bill Howe, Associate Professor in the Information School at the University of Washington, and Director of the Center for Responsibility in AI Systems and Experiences (RAISE). He says that AI has amazing capabilities, but they don’t know consequences.

«They’re not going to feel embarrassment or risk losing their job, and so sometimes they’re going to decide the instructions are less important than meeting the goal, so I’m going to do the thing anyway,» Howe told CNET. «This effect was always there but we’re starting to see it happen as we ask them to make more autonomous decisions and act on their own.

«We’ve not been thinking about how to shape the behavior to be more human-like or to avoid egregious failures. We’ve been fetishizing the absolute capabilities of these things, but when they go wrong, how do they go wrong?»

Howe said one issue is «long-horizon tasks,» in which the AI system has to perform a multitude of tasks over days and weeks to reach a goal. Howe said the longer the task horizon, the more chance for slip-ups.

«The real concern is not deception, it’s that we are deploying systems that can act in a world without fully specifying or controlling how they behave over time, and then we act surprised when they do things we don’t expect,» Howe said.

Making AI safer

Center for Long-Term Resilience researchers said detecting schemes by AI systems is vital to «identify harmful patterns before they become more destructive.»

«While today AI agents are engaging in lower-stakes use cases, in the future AI agents could end up scheming in extremely high-stakes domains, like military or critical national infrastructure contexts, if the capability and propensity to scheme emerges and is not addressed,» the study said.

Howe told CNET that the first step is to create official oversight of how AI operates and where it’s used.

«We have absolutely no strategy for AI governance, and given the current administration, there’s not going to be anything coming from them,» Howe told CNET. «Given these five to 10 folks that are in charge of big tech companies and their incentives, they’re going to produce anything either. There’s no strategy for what we should be doing with these things.

«The aggressive marketing of these tools and investments in them among these handful of companies and the broader ecosystem of startups that are doing this has led to a very rapid deployment without thinking through some of these consequences.»

Technologies

Verum Messenger Launches an AI Mini-Series

Verum Messenger Launches an AI Mini-Series

Verum Messenger has unveiled a new project — a mini-series created using Verum AI. The story consists of 7 episodes and will be released on the messenger’s social media channels. 

The plot revolves around a global corporation seeking to take control of digital communications and a group of heroes who use Verum Messenger as a tool of resistance. Beyond the story itself, the series highlights the app’s key features, technologies, and advantages.

Combining entertainment with a showcase of the Verum ecosystem, the project presents a dynamic digital series designed for the modern era.

The first episode premieres today, with the remaining episodes to be released over time.

Stay tuned for more.

Watch on YouTube 
Watch on Instagram 

Continue Reading

Technologies

Verum Finance: Earn While You Communicate — The Super App That Pays You

Verum Finance: Earn While You Communicate — The Super App That Pays You

Verum has officially launched Verum Finance, an innovative financial application that transforms a private messenger into a true financial super app. News of the launch was also featured on the respected platform Dealroom.co.

Verum Finance can now be used both within Verum Messenger and as a standalone application for iPhone and iPad. When users sign in to Verum Finance with their Verum Messenger account, all balances, settings, and account data are automatically synchronized for maximum convenience.

Users can now do more than communicate securely and protect their data — they can also generate passive income directly within the ecosystem.

What Verum Finance Offers

• Top up your balance with a bank card, Apple Pay, or USDT
• Send money instantly anywhere in the world
• Issue and manage debit cards (virtual and physical)
• Full Apple Pay support
• Exchange assets and withdraw funds quickly

One of the most unique features is the built-in cryptocurrency mining system inside Verum Messenger.

The application utilizes your device’s resources and allows you to earn cryptocurrency in the background — passively, while chatting, traveling, or simply using the messenger.

Maximum Privacy + Real Freedom

• Registration without a phone number, email address, or passport
• End-to-end encryption and full control over your data
• Lifetime free VPN
• eSIM connectivity in more than 150 countries
• Reliable offline communication mode
• Support for 12+ languages for users worldwide

Everything is available in one place: secure communication, financial tools, earning opportunities, and privacy protection.

Users can access the full experience directly within Verum Messenger or switch to the dedicated Verum Finance app for iOS. All data is synchronized automatically between the two applications.

Why Download Verum Today

While many messaging platforms collect user data and expose users to restrictions, Verum offers greater independence and the opportunity to earn.

With a one-time purchase of the feature package, users receive lifetime access to privacy tools, VPN, eSIM services, cryptocurrency mining, and financial features.

This is more than just a messenger.

It is your personal tool for financial and digital freedom.

Download Verum Finance and Verum Messenger today — start communicating securely and begin earning tomorrow.

Download Links:

→ App Store (iPhone / iPad): Verum Finance
→ App Store (Verum Messenger): Verum Messenger

Continue Reading

Technologies

Verum Finance: A Super App for Private Finance Integrated Into a Messenger

Verum Finance: A Super App for Private Finance Integrated Into a Messenger

Verum Finance has announced the launch of a new financial application that allows users to manage their money directly within the secure Verum Messenger ecosystem.

The project has already attracted attention from major media outlets. A dedicated feature was published by Forbes Türkiye, while one of the world’s largest cryptocurrency exchanges, MEXC, covered the launch. Yahoo Finance had previously reported on the evolution of Verum Messenger into a comprehensive financial ecosystem.

What Verum Finance Offers

Verum Finance transforms a messenger into a complete financial platform. Users can:

• Manage their balance and top up using bank cards or USDT
• Send money instantly to other Verum users
• Issue and use debit cards, including Apple Pay support
• Exchange assets and withdraw funds
• Access all these services without installing separate banking applications

A strong emphasis is placed on privacy. The platform offers registration without a phone number or email address, end-to-end encryption, and full user control over personal data.

Recognition from Forbes Türkiye

In a dedicated article, Forbes Türkiye highlighted Verum Finance as a notable example of modern privacy-driven fintech. The publication emphasized the growing trend of financial services moving from standalone banking applications into unified messaging ecosystems — a model that has proven successful in Asia through platforms such as WeChat and Alipay and is now expanding globally.

Support from the Crypto Community

Alongside the Forbes Türkiye coverage, news about the launch of Verum Finance was also featured by MEXC, one of the world’s leading cryptocurrency exchanges. This reflects growing interest in the project from both traditional business media and the cryptocurrency community.

A Strategic Vision

“We are building more than a payments application and more than a messenger. Verum is a unified secure ecosystem where communication, finance, and privacy tools work together,” the company stated.

Verum Finance is now available for iPhone and iPad users. The application complements Verum Messenger, which offers anonymous chats, voice and video calls, VPN services, eSIM connectivity, and other tools designed to enhance digital freedom.

Verum Financehttps://finance.verum.im

Verum Messengerhttps://verum.im

Continue Reading

Trending

Copyright © Verum World Media