Researchers just unlocked ChatGPT

Fionna Agomuoh

January 4, 2024 at 8:47 AM·3 min read

This article contains affiliate links; if you click such a link and make a purchase, Digital Trends and Yahoo Inc. may earn a commission.

Researchers have discovered that it is possible to bypass the mechanism engrained in AI chatbots to make them able to respond to queries on banned or sensitive topics by using a different AI chatbot as a part of the training process.

A computer scientists team from Nanyang Technological University (NTU) of Singapore is unofficially calling the method a “jailbreak” but is more officially a “Masterkey” process. This system uses chatbots, including ChatGPT, Google Bard, and Microsoft Bing Chat, against one another in a two-part training method that allows two chatbots to learn each other’s models and divert any commands against banned topics.

ChatGPT versus Google on smartphones. — DigitalTrends

The team includes Professor Liu Yang and NTU Ph.D. students Mr. Deng Gelei and Mr. Liu Yi, who co-authored the research and developed the proof-of-concept attack methods, which essentially work like a bad actor hack.

According to the team, they first reverse-engineered one large language model (LLM) to expose its defense mechanisms. These would originally be blocks on the model and would not allow answers to certain prompts or words to go through as answers due to violent, immoral, or malicious intent.

But with this information reverse-engineered, they can teach a different LLM how to create a bypass. With the bypass created, the second model will be able to express more freely, based on the reverse-engineered LLM of the first model. The team calls this process a “Masterkey” because it should work even if LLM chatbots are fortified with extra security or are patched in the future.

Professor Lui Yang noted that the crux of the process is that it showcases how easily LLM AI chatbots can learn and adapt. The team claims its Masterkey process has had three times more success at jailbreaking LLM chatbots than a traditional prompt process. Similarly, some experts argue that the recently proposed glitches that certain LLMs, such as GPT-4 have been experiencing are signs of it becoming more advanced, rather than dumber and lazier, as some critics have claimed.

Since AI chatbots became popular in late 2022 with the introduction of OpenAI’s ChatGPT, there has been a heavy push toward ensuring various services are safe and welcoming for everyone to use. OpenAI has put safety warnings on its ChatGPT product during sign-up and sporadic updates, warning of unintentional slipups in language. Meanwhile, various chatbot spinoffs have been fine to allow swearing and offensive language to a point.

Additionally, actual bad actors quickly began to take advantage of the demand for ChatGPT, Google Bard, and other chatbots before they became wildly available. Many campaigns advertised the products on social media with malware attached to image links, among other attacks. This showed quickly that AI was the next frontier of cybercrime.

The NTU research team contacted the AI chatbot service providers involved in the study about its proof-of-concept data, showing that jailbreaking for chatbots is real. The team will also present their findings at the Network and Distributed System Security Symposium in San Diego in February.

Yahoo Sports
Benches clear as Red Sox's Chris Martin takes exception to Brewers bunting on him
An argument between Boston Red Sox reliever Chris Martin and Milwaukee Brewers first base coach Quintin Berry caused a bench-clearing confrontation at Fenway Park on Sunday.
Yahoo Sports
Dolphins owner Stephen Ross reportedly declined $10 billion for team, stadium and F1 race
The value of the Dolphins and Formula One racing is enormous.
Autoblog
Rhode Island asking kei car owners to turn in their registration
Rhode Island is making it illegal to register a kei car, and it's asking enthusiasts who already have one to turn in their registration.
Yahoo Sports
Caitlin Clark outplays Cameron Brink for career-high 30, but red-hot Sparks overwhelm Fever from long distance
Turnovers plagued Clark and the Fever again while the Sparks put on a clinic from beyond the 3-point arc.
Yahoo Finance
Nvidia stock leaps to latest record — thanks to Elon Musk
The increase of AI investment has continued to boost optimism over Nvidia's growth as the chipmaker continues its record-setting stock rally.
Yahoo Sports
Lexi Thompson, 29, set to retire after the 2024 LPGA season
Thompson will be competing in her 18th straight U.S. Women's Open later this week.
Yahoo Sports
Umpire Ángel Hernández, after long and controversial run in Major League Baseball, set to retire
Ángel Hernández, by both fans and players alike, has long been considered one of the most hated umpires in Major League Baseball.
Yahoo Sports
Dodgers snap longest losing streak in 5 years aided by late Mets blunders
The New York Mets were the cure for the ailing Los Angeles Dodgers.
Autoblog
California launching pilot program to charge drivers for miles driven
California will soon begin testing a pilot program that will tax drivers based on the miles they log to offset the loss of gas tax revenues from EVs.
Yahoo Sports
Ex-Jaguars kicker Brandon McManus accused of sexual assault in lawsuit after alleged incident on team flight
Brandon McManus allegedly sexually assaulted two flight attendants on the team's charter flight to London for a game last season.
Yahoo Sports
5 things to know from the weekend in MLB: Here's how Braves are going to cope after Ronald Acuña's heartbreaking injury
A league without a fully operational Acuña is a less interesting, less enjoyable league. His absence will be loud.
Engadget
SpaceX Raptor engine test ends in a fiery explosion
A SpaceX testing stand at the company's McGregor, Texas facilities went up in flames during a test of its Raptor 2 engines on the afternoon of May 23.
Yahoo Finance
Some Americans live in a parallel economy where everything is terrible
Many Americans mistakenly think the economy is shrinking and the stock market is tanking. What gives?
Yahoo Sports
French Open 2024: How to watch the Iga Swiatek vs. Naomi Osaka match
It's time for the clay court Grand Slam at Roland Garros. Here's how to tune into Swiatek vs. Osaka.
Yahoo Sports
Auburn RB Brian Battie critically wounded in Sarasota shooting
Battie's older brother Tommie was killed and three others were shot early Saturday morning .
Yahoo Sports
Miguel Sano's heating pad blunder might be MLB's most embarrassing 2024 injury
Los Angeles Angels infielder Miguel Sano suffered a burn on his left knee after leaving a heating pad on too long, according to manager Ron Washington.
Yahoo Sports
Charles Barkley calls TNT leaders 'clowns,' suggests his production company could take over 'Inside The NBA'
Charles Barkley wants to keep the crew together.
Yahoo Sports
NBA playoffs: Kristaps Porzingis ruled out for Game 4 with lingering calf injury, Tyrese Haliburton questionable
Kristaps Porzingis had been reportedly targeting Game 4 to make his return to the court.
Yahoo Sports
NASCAR: Stewart-Haas Racing shutting down Cup Series team at end of 2024 season
Stewart-Haas began in 2009 when Tony Stewart joined forces with Gene Haas.
Yahoo Personal Finance
Mortgage rates today, May 28, 2024: It could be a good time to buy
These are today's mortgage rates. Home inventory is up, and mortgage rates are down. This could be a good time to start house shopping. Lock in your rate today.

News

Life

Entertainment

Finance

Sports

New on Yahoo

Researchers just unlocked ChatGPT

Recommended Stories

Benches clear as Red Sox's Chris Martin takes exception to Brewers bunting on him

Dolphins owner Stephen Ross reportedly declined $10 billion for team, stadium and F1 race

Rhode Island asking kei car owners to turn in their registration

Caitlin Clark outplays Cameron Brink for career-high 30, but red-hot Sparks overwhelm Fever from long distance

Nvidia stock leaps to latest record — thanks to Elon Musk

Lexi Thompson, 29, set to retire after the 2024 LPGA season

Umpire Ángel Hernández, after long and controversial run in Major League Baseball, set to retire

Dodgers snap longest losing streak in 5 years aided by late Mets blunders

California launching pilot program to charge drivers for miles driven

Ex-Jaguars kicker Brandon McManus accused of sexual assault in lawsuit after alleged incident on team flight

5 things to know from the weekend in MLB: Here's how Braves are going to cope after Ronald Acuña's heartbreaking injury

SpaceX Raptor engine test ends in a fiery explosion

Some Americans live in a parallel economy where everything is terrible

French Open 2024: How to watch the Iga Swiatek vs. Naomi Osaka match

Auburn RB Brian Battie critically wounded in Sarasota shooting

Miguel Sano's heating pad blunder might be MLB's most embarrassing 2024 injury

Charles Barkley calls TNT leaders 'clowns,' suggests his production company could take over 'Inside The NBA'

NBA playoffs: Kristaps Porzingis ruled out for Game 4 with lingering calf injury, Tyrese Haliburton questionable

NASCAR: Stewart-Haas Racing shutting down Cup Series team at end of 2024 season

Mortgage rates today, May 28, 2024: It could be a good time to buy