An AI Easily Beat Humans in the Moral Turing Test


  • For decades, the Turing Test—named after its creator, computing legend Alan Turing—was a simple test designed to measure the ability of a program to mimic a human.

  • In the age of large language models (LLMs), this test is relatively antiquated, but researchers are applying a similar approach to test these new AI systems’ abilities to answer moral questions.

  • A new study recently examined ChatGPT’s answers to a modified Moral Turing Test (m-MTT) and found that the AI outperformed human-provided answers across nearly all metrics.


In his famous 1950 paper Computing Machinery and Intelligence, computer scientist and World War II hero Alan Turing introduced a concept known as the Turing Test. At its most basic, the test pitted a computer against a human, asking the flesh-and-blood participant to question both another human and a computer and determine which one was a proud member of Homo sapiens. For decades, “passing the Turing Test” became shorthand for a computer program of immense sophistication (or, in some cases, trickery).

But in the era of artificial intelligence, the Turing Test has been showing its age. And while other methods have been put forward to test AI systems’ “intelligence,” the overall scientific approach Turing initiated nearly 75 years ago remains relevant when examining artificial morality.



A new study by scientists at Georgia State University used a set-up similar to the classic Turing Test, but instead asked its human participants which answer to an ethically complicated question they preferred: one generated by a large language model (LLM), in this case ChatGPT, or one written by a human. Published late last month in the journal Scientific Reports, the results of this modified Moral Turing Test (m-MTT) showed that the 299 participants largely favored the AI’s responses across all metrics, including virtuousness, intelligence, and trustworthiness.

“Our findings lead us to believe that a computer could technically pass a moral Turing test—that it could fool us in its moral reasoning,” Georgia State associate professor and study co-author Eyal Aharoni said in a press statement. “People will interact with these tools in ways that have moral implications…we should understand how they operate, their limitations and that they’re not necessarily operating in the way we think when we’re interacting with them.”

While this is the first MTT to be used specifically on LLMs (hence “modified”), the idea of these moral tests has been around since at least 2000. And like the Turing Test itself, the idea of using an MTT to evaluate the moral complexity of AI has been scrutinized, with one 2016 study saying that “MTT-based evaluations are vulnerable to deception, inadequate reasoning, and inferior moral performance.”



Passing the m-MTT doesn’t mean an AI is moral, just as passing the Turing Test doesn’t mean it’s sentient. But the researchers at Georgia State University argue that the overwhelming preference for ChatGPT’s answers over human ones is a development of only the last couple of years.

“The twist is that the reason people could tell the difference appears to be because they rated ChatGPT’s responses as superior,” Aharoni said in a press statement. “If we had done this study five to 10 years ago, then we might have predicted that people could identify the AI because of how inferior its responses were. But we found the opposite—that the AI, in a sense, performed too well.”

The morality of AI is an obsession of technologists, AI programmers, and an unending litany of doomsday sci-fi writers, and passing a moral Turing Test with flying colors certainly speaks to the impressive complexity of new LLMs.

Of course, the one big question remains: Will the AI take its own advice?
