OpenAI Reportedly Dissolves Its Existential AI Risk Team

Photo: Kent Nishimura (Getty Images)

OpenAI’s Superalignment team, charged with heading off the existential dangers of superhuman AI systems, has been disbanded, Wired reported on Friday. The news comes just days after the team’s co-leads, Ilya Sutskever and Jan Leike, both quit the company.

Wired reports that OpenAI’s Superalignment team, launched in July 2023 to keep future superhuman AI systems from going rogue, is no more. The group’s work will be absorbed into OpenAI’s other research efforts, with research on the risks of more powerful AI models now led by OpenAI cofounder John Schulman, according to Wired. Sutskever and Leike were among OpenAI’s top scientists focused on AI risk.

Leike posted a long thread on X Friday vaguely explaining why he left OpenAI. He says he had been disagreeing with OpenAI leadership about the company’s core priorities for some time, and reached a breaking point this week. Leike noted the Superalignment team has been “sailing against the wind,” struggling to get enough compute for crucial research. He thinks OpenAI needs to be more focused on security, safety, and alignment.

OpenAI’s press team directed us to Sam Altman’s tweet when asked whether the Superalignment team was disbanded. Altman says he’ll have a longer post in the next couple of days and OpenAI has “a lot more to do.”

Later, an OpenAI spokesperson clarified that “Superalignment is now going to be more deeply ingrained across research, which will help us better achieve our Superalignment objectives.” The company says this integration started “weeks ago” and will ultimately move Superalignment’s team members and projects to other teams.

“Currently, we don’t have a solution for steering or controlling a potentially superintelligent AI, and preventing it from going rogue,” said the Superalignment team in an OpenAI blog post when it launched in July. “But humans won’t be able to reliably supervise AI systems much smarter than us, and so our current alignment techniques will not scale to superintelligence. We need new scientific and technical breakthroughs.”

It’s now unclear if the same attention will be put into those technical breakthroughs. Undoubtedly, there are other teams at OpenAI focused on safety. Schulman’s team, which is reportedly absorbing Superalignment’s responsibilities, is currently responsible for fine-tuning AI models after training. However, Superalignment focused specifically on the most severe outcomes of a rogue AI. As Gizmodo noted yesterday, several of OpenAI’s most outspoken AI safety advocates have resigned or been fired in the last few months.

Earlier this year, the group released a notable research paper about controlling large AI models with smaller AI models, considered a first step towards controlling superintelligent AI systems. It’s unclear who will take the next steps on these projects at OpenAI.
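The general recipe that paper explores is often described as weak-to-strong supervision: a deliberately weak model labels a training set, a stronger model learns only from those imperfect labels, and researchers check whether the student ends up more accurate than its supervisor. The toy sketch below is a hypothetical illustration of that idea, not OpenAI’s code; the synthetic task, feature counts, and scikit-learn model choices are all assumptions made for the example.

```python
# Toy sketch of weak-to-strong supervision (illustrative only).
# A shallow "weak" model labels the data; a "strong" model trains on
# those noisy labels; both are scored against the true labels.
import numpy as np
from sklearn.tree import DecisionTreeClassifier
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Synthetic task: the true label depends on all 20 features.
X = rng.normal(size=(4000, 20))
y = (X.sum(axis=1) > 0).astype(int)
X_train, X_test = X[:3000], X[3000:]
y_train, y_test = y[:3000], y[3000:]

# "Weak" supervisor: a depth-2 tree that only sees the first 2 features.
weak = DecisionTreeClassifier(max_depth=2).fit(X_train[:, :2], y_train)
weak_labels = weak.predict(X_train[:, :2])  # imperfect training labels

# "Strong" student: full feature access, but trained only on weak labels.
strong = LogisticRegression(max_iter=1000).fit(X_train, weak_labels)

print("weak supervisor accuracy:", weak.score(X_test[:, :2], y_test))
print("strong student accuracy: ", strong.score(X_test, y_test))
# If the student beats its supervisor, it has recovered signal the weak
# labels only partially captured -- the effect the paper studies.
```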

Sam Altman’s AI startup kicked off the week by unveiling GPT-4 Omni (GPT-4o), the company’s latest frontier model, which features ultra-low-latency voice responses that sound more human than ever. Many OpenAI staffers remarked on how the latest model felt closer than ever to something out of science fiction, specifically the movie Her.
