OpenAI created a team to control 'superintelligent' AI — then let it wither, source says

Kyle Wiggers

Updated May 20, 2024 at 1:17 PM·4 min read

OpenAI's Superalignment team, responsible for developing ways to govern and steer "superintelligent" AI systems, was promised 20% of the company's compute resources, according to a person from that team. But requests for a fraction of that compute were often denied, blocking the team from doing their work.

That issue, among others, pushed several team members to resign this week, including co-lead Jan Leike, a former DeepMind researcher who, while at OpenAI, was involved with the development of ChatGPT, GPT-4 and ChatGPT's predecessor, InstructGPT.

Leike went public with some reasons for his resignation on Friday morning. "I have been disagreeing with OpenAI leadership about the company's core priorities for quite some time, until we finally reached a breaking point," Leike wrote in a series of posts on X. "I believe much more of our bandwidth should be spent getting ready for the next generations of models, on security, monitoring, preparedness, safety, adversarial robustness, (super)alignment, confidentiality, societal impact, and related topics. These problems are quite hard to get right, and I am concerned we aren't on a trajectory to get there."

https://twitter.com/janleike/status/1791498183543251017

OpenAI did not immediately return a request for comment about the resources promised and allocated to that team.

OpenAI formed the Superalignment team last July, and it was led by Leike and OpenAI co-founder Ilya Sutskever, who also resigned from the company this week. It had the ambitious goal of solving the core technical challenges of controlling superintelligent AI in the next four years. Joined by scientists and engineers from OpenAI’s previous alignment division, as well as researchers from other orgs across the company, the team was to contribute research informing the safety of both in-house and non-OpenAI models and, through initiatives including a research grant program, solicit from and share work with the broader AI industry.

The Superalignment team did manage to publish a body of safety research and funnel millions of dollars in grants to outside researchers. But, as product launches began to take up an increasing amount of OpenAI leadership's bandwidth, the Superalignment team found itself having to fight for more upfront investments -- investments it believed were critical to the company's stated mission of developing superintelligent AI for the benefit of all humanity.

"Building smarter-than-human machines is an inherently dangerous endeavor," Leike continued. "But over the past years, safety culture and processes have taken a backseat to shiny products."

Sutskever's battle with OpenAI CEO Sam Altman served as a major added distraction.

Sutskever, along with OpenAI's old board of directors, moved to abruptly fire Altman late last year over concerns that Altman hadn't been "consistently candid" with the board's members. Under pressure from OpenAI's investors, including Microsoft, and many of the company's own employees, Altman was eventually reinstated, much of the board resigned and Sutskever reportedly never returned to work.

According to the source, Sutskever was instrumental to the Superalignment team -- not only contributing research but also serving as a bridge to other divisions within OpenAI. He would also serve as an ambassador of sorts, impressing upon key OpenAI decision-makers the importance of the team's work.

After Leike's departure, Altman wrote on X that he agreed there is "a lot more to do," and that they are "committed to doing it." He hinted at a longer explanation, which co-founder Greg Brockman supplied Saturday morning:

https://twitter.com/gdb/status/1791869138132218351

Though there is little concrete in Brockman's response as far as policies or commitments, he said that "we need to have a very tight feedback loop, rigorous testing, careful consideration at every step, world-class security, and harmony of safety and capabilities."

Following the departures of Leike and Sutskever, John Schulman, another OpenAI co-founder, has moved to head up the type of work the Superalignment team was doing, but there will no longer be a dedicated team -- instead, it will be a loosely associated group of researchers embedded in divisions throughout the company. An OpenAI spokesperson described it as "integrating [the team] more deeply."

The fear is that, as a result, OpenAI's AI development won't be as safety-focused as it could've been.

We're launching an AI newsletter! Sign up here to start receiving it in your inboxes on June 5.

TechCrunch
Anthropic hires former OpenAI safety lead to head up new team
Jan Leike, a leading AI researcher who earlier this month resigned from OpenAI before publicly criticizing the company's approach to AI safety, has joined OpenAI rival Anthropic to lead a new "superalignment" team. In a post on X, Leike said that his team at Anthropic will focus on various aspects of AI safety and security, specifically "scalable oversight," "weak-to-strong generalization" and automated alignment research. In many ways, Leike's team sounds similar in mission to OpenAI's recently dissolved Superalignment team.
TechCrunch
This Week in AI: OpenAI moves away from safety
This week in AI, OpenAI once again dominated the news cycle (despite Google's best efforts) with not only a product launch, but also with some palace intrigue. The company unveiled GPT-4o, its most capable generative model yet, and just days later effectively disbanded a team working on the problem of developing controls to prevent "superintelligent" AI systems from going rogue. Reporting -- including ours -- suggests that OpenAI deprioritized the team's safety research in favor of launching new products like the aforementioned GPT-4o, ultimately leading to the resignation of the team's two co-leads, Jan Leike and OpenAI co-founder Ilya Sutskever.
Yahoo News
OpenAI in the crosshairs: Scarlett Johansson's scathing statement, mass exodus of executives are among the company’s latest woes
OpenAI CEO Sam Altman was at one point dubbed "the Oppenheimer of AI." So what's going on under his leadership?
Engadget
The OpenAI team tasked with protecting humanity is no more
In the summer of 2023, OpenAI created a “Superalignment” team whose goal was to steer and control future AI systems that could be so powerful they could lead to human extinction. Less than a year later, that team is dead.
TechCrunch
This Week in AI: OpenAI and publishers are partners of convenience
This week in AI, OpenAI announced that it reached a deal with News Corp, the new publishing giant, to train OpenAI-developed generative AI models on articles from News Corp brands, including The Wall Street Journal, Financial Times and MarketWatch. The agreement, which the companies describe as "multi-year" and "historic," also gives OpenAI the right to display News Corp mastheads within apps like ChatGPT in response to certain questions -- presumably in cases where the answers are sourced partly or in whole from News Corp publications. News Corp gets an infusion of cash for its content -- over $250 million, reportedly -- at a time when the media industry's outlook is even grimmer than usual.
TechCrunch
This Week in AI: OpenAI considers allowing AI porn
This week in AI, OpenAI revealed that it's exploring how to "responsibly" generate AI porn. Announced in a document intended to peel back the curtains and gather feedback on its AI's instructions, OpenAI's new NSFW policy is intended to start a conversation about how -- and where -- the company might allow explicit images and text in its AI products, OpenAI said. "We want to ensure that people have maximum control to the extent that it doesn't violate the law or other peoples' rights," Joanne Jang, a member of the product team at OpenAI, told NPR.
TechCrunch
This Week in AI: Can we (and could we ever) trust OpenAI?
This week in AI, OpenAI launched discounted plans for nonprofits and education customers and drew back the curtains on its most recent efforts to stop bad actors from abusing its AI tools. OpenAI removed one of the voices used by its AI-powered chatbot ChatGPT after users pointed out that it sounded eerily similar to Johansson's. Johansson later released a statement saying that she hired legal counsel to inquire about the voice and get exact details about how it was developed -- and that she'd refused repeated entreaties from OpenAI to license her voice for ChatGPT. Now, a piece in The Washington Post implies that OpenAI didn't in fact seek to clone Johansson's voice and that any similarities were accidental.
TechCrunch
AI training data has a price tag that only Big Tech can afford
Data is at the heart of today's advanced AI systems, but it's costing more and more -- making it out of reach for all but the wealthiest tech companies. Last year, James Betker, a researcher at OpenAI, penned a post on his personal blog about the nature of generative AI models and the datasets on which they're trained. In it, Betker claimed that training data -- not a model's design, architecture or any other characteristic -- was the key to increasingly sophisticated, capable AI systems.
Engadget
Starliner’s first crewed flight gets scrubbed just before launch
The first crewed launch of Boeing’s Starliner capsule has once again been called off, this time after an automatic hold was issued by the rocket’s ground launch sequencer less than four minutes before liftoff.
Autoblog
Junkyard Gem: 1999 Pontiac Firebird Coupe
A 1999 Pontiac Firebird coupe with 3.8-liter Buick V6 and five-speed manual transmission, found in a Colorado wrecking yard.
TechCrunch
Deal Dive: How (Re)vive grew 10x last year by helping retailers recycle and sell returned items
The fashion industry has a huge problem: Despite many returned items being unworn or undamaged, a lot, if not the majority, end up in the trash. An estimated 9.5 billion pounds of returns ended up in landfills in 2022 alone, according to data from return logistics software company Optoro. New York-based (Re)vive wants to help companies find a better ending for their returned items.
Autoblog
2025 Cadillac XT4 drops base trim, adds standard equipment
Cadillac cuts the 2024 XT4's base trim for 2025 and adds standard equipment to the remaining two trims. Prices don't go up as much as expected, though.
Yahoo Life Shopping
The 35+ best Walmart deals this weekend: Save big on summer essentials, tech, appliances and more
Massive markdowns on deck: A Dyson vacuum for 50% off, go-anywhere cotton shorts for $9 and a 4K smart TV for under $400.
Yahoo Entertainment
A timeline of Jennifer Lopez's 'This Is Me … Now' tour: Rumors of marital tension, low ticket sales reported before cancellation
It’s unclear if Lopez’s tour cancellation has anything to do with her marriage to Ben Affleck, but there’s no denying that her new album is inextricably linked to their romance.
Yahoo Finance
Aston Martin owner on why new models, F1 allure will boost British luxury automaker
Billionaire Lawrence Stroll put his money where his mouth is and believes others should do the same with his latest endeavor.
Yahoo Life Shopping
The best Amazon deals this weekend — snag an Apple iPad for its all-time lowest price
Save up to 80% on everything from vacuums to patio sets while scoring super-low prices on top brands like Bissell, Ninja and more.
Yahoo Life Shopping
'Lifesaver for my severe back pain': This bolster pillow is just $23 at Amazon
More than 8,000 shoppers rave about this cushion.
Yahoo Life Shopping
No more 'hat hair': The ingenious adjustable visor you need for vacation is just $20
Your next trip is about to get a lot shadier — in a good way.
Autoblog
Kia's Australian division wants a body-on-frame SUV to take on the Land Cruiser
Kia's Australian division wants a Tasman pickup-based body-on-frame SUV to take on Ford and Toyota, but the company might not get its wish.
Yahoo Life Shopping
'Cuts like a hot knife through butter': These powerful Fiskars pruning shears are down to just $14
Great for trimming in the garden or clipping fresh flowers, these No. 1 bestselling snippers have nearly 35,000 five-star fans.

News

Life

Entertainment

Finance

Sports

New on Yahoo

OpenAI created a team to control 'superintelligent' AI — then let it wither, source says

Recommended Stories

Anthropic hires former OpenAI safety lead to head up new team

This Week in AI: OpenAI moves away from safety

OpenAI in the crosshairs: Scarlett Johansson's scathing statement, mass exodus of executives are among the company’s latest woes

The OpenAI team tasked with protecting humanity is no more

This Week in AI: OpenAI and publishers are partners of convenience

This Week in AI: OpenAI considers allowing AI porn

This Week in AI: Can we (and could we ever) trust OpenAI?

AI training data has a price tag that only Big Tech can afford

Starliner’s first crewed flight gets scrubbed just before launch

Junkyard Gem: 1999 Pontiac Firebird Coupe

Deal Dive: How (Re)vive grew 10x last year by helping retailers recycle and sell returned items

2025 Cadillac XT4 drops base trim, adds standard equipment

The 35+ best Walmart deals this weekend: Save big on summer essentials, tech, appliances and more

A timeline of Jennifer Lopez's 'This Is Me … Now' tour: Rumors of marital tension, low ticket sales reported before cancellation

Aston Martin owner on why new models, F1 allure will boost British luxury automaker

The best Amazon deals this weekend — snag an Apple iPad for its all-time lowest price

'Lifesaver for my severe back pain': This bolster pillow is just $23 at Amazon

No more 'hat hair': The ingenious adjustable visor you need for vacation is just $20

Kia's Australian division wants a body-on-frame SUV to take on the Land Cruiser

'Cuts like a hot knife through butter': These powerful Fiskars pruning shears are down to just $14