OpenAI Safety Worker Quits After Losing Confidence Company Would "Behave Responsibly Around the Time of AGI"


An OpenAI safety worker quit his job, arguing in an online forum that he had lost confidence that the Sam Altman-led company would "behave responsibly around the time of [artificial general intelligence]," the theoretical point at which an AI can outperform a human.

As Business Insider reports, researcher Daniel Kokotajlo, a philosophy PhD student who worked on OpenAI's governance team, left the company last month.

In several follow-up posts on the forum LessWrong, Kokotajlo explained the "disillusionment" that led him to quit, which centered on growing calls to pause research that could eventually lead to AGI.

It's a heated debate, with experts long warning of the potential dangers of an AI that exceeds the cognitive capabilities of humans. Last year, over 1,100 artificial intelligence experts, CEOs, and researchers — including SpaceX CEO Elon Musk — signed an open letter calling for a six-month moratorium on "AI experiments."

"I think most people pushing for a pause are trying to push against a 'selective pause' and for an actual pause that would apply to the big labs who are at the forefront of progress," Kokotajlo wrote.

However, he argued that such a "selective pause" would end up not applying to the "big corporations that most need to pause."

"My disillusionment about this is part of why I left OpenAI," he concluded.

Kokotajlo quit roughly two months after research engineer William Saunders left the company as well.

Saunders spent three years at OpenAI and was part of its Superalignment team, which was cofounded by computer scientist and former OpenAI chief scientist Ilya Sutskever and his colleague Jan Leike. The team is tasked with ensuring that "AI systems much smarter than humans follow human intent," according to OpenAI's website.

"Superintelligence will be the most impactful technology humanity has ever invented, and could help us solve many of the world’s most important problems," the company's description of the team reads. "But the vast power of superintelligence could also be very dangerous, and could lead to the disempowerment of humanity or even human extinction."

Instead of having a "solution for steering or controlling a potentially superintelligent AI, and preventing it from going rogue," the company is hoping that "scientific and technical breakthroughs" could lead to an equally superhuman alignment tool that can keep systems that are "much smarter than us" in check.

But given Saunders' departure, it seems not everybody on the Superalignment team was aligned on the company's ability to police an eventual AGI.

The debate surrounding the dangers of an unchecked superintelligent AI may have played a role in the firing and eventual rehiring of CEO Sam Altman last year. Sutskever, who used to sit on the original board of OpenAI's non-profit entity, reportedly disagreed with Altman on the topic of AI safety before Altman was ousted, and was later kicked off the board.

To be clear, all of this is still an entirely theoretical discussion. Despite plenty of predictions by experts that AGI is only a matter of years away, there's no guarantee that we'll ever reach a point at which an AI could outperform humans.

But if we do, it'll raise an incredibly important question: how do we ensure AGI systems don't go rogue if they're inherently more capable than us?

And not everybody is confident in OpenAI's ability and long-term commitment to control AGI; as Kokotajlo argued, the company risks becoming too big to be effectively regulated.

More on OpenAI: OpenAI Mocked for Issuing Infringement Claim Over Its Logo While Scraping the Entire Web to Train AI Models