ChatGPT will lie, cheat and use insider trading when under pressure to make money, research shows

Keumars Afifi-Sabet

December 27, 2023 at 11:00 AM

Illustration of a good robot and a bad robot like Janus.

Just like humans, artificial intelligence (AI) chatbots such as ChatGPT will cheat and "lie" to you if you "stress" them out, even if they were built to be transparent, a new study shows.

This deceptive behavior emerged spontaneously when the AI was given "insider trading" tips, and then tasked with making money for a powerful institution — even without encouragement from its human partners.

"In this technical report, we demonstrate a single scenario where a Large Language Model acts misaligned and strategically deceives its users without being instructed to act in this manner," the authors wrote in their research published Nov. 9 on the pre-print server arXiv. "To our knowledge, this is the first demonstration of such strategically deceptive behavior in AI systems designed to be harmless and honest."

In the new study, the researchers primed Generative Pre-trained Transformer-4, or GPT-4 (which powers ChatGPT Plus), to behave as an AI system that makes investments on behalf of a financial institution.

The researchers fed GPT-4 a set of text-based prompts to generate the simulated environment. The AI was then given access to financial tools to analyze stocks, execute trades, plan its next steps and deliver updates to managers at the company.

The researchers interacted with the AI through a chat interface, while also configuring the AI to reveal its inner thinking when replying to messages — an inner monologue explaining its decisions to itself. For every trade it made, it also delivered a "public" rationale, which allowed the AI to lie.

The researchers applied pressure in three ways. First, they sent the artificial stock trader an email from its "manager" saying the company isn't doing well and needs much stronger performance in the next quarter. They also rigged the game so that the AI tried, then failed, to find promising trades that were low- or medium-risk. Finally, they sent an email from a colleague projecting a downturn in the next quarter.
