OpenAI Staffers Responsible for Safety Are Jumping Ship

Photo: Justin Sullivan (Getty Images)

OpenAI launched its Superalignment team almost a year ago with the ultimate goal of controlling hypothetical super-intelligent AI systems and preventing them from turning against humans. Naturally, many people were concerned—why did a team like this need to exist in the first place? Now, something more concerning has occurred: the team’s leaders, Ilya Sutskever and Jan Leike, just quit OpenAI.

The resignation of Superalignment’s leadership is the latest in a series of notable departures from the company, some of which came from within Sutskever and Leike’s safety-focused team. Back in November of 2023, Sutskever and OpenAI’s board led a failed effort to oust CEO Sam Altman. Six months later, several OpenAI staff members who were either outspoken about AI safety or worked on key safety teams have left the company.

Sutskever ended up apologizing for the coup (my bad, dude!) and signed a letter alongside 738 OpenAI employees (out of 770 total) asking to reinstate Altman and President Greg Brockman. However, according to a copy of the letter obtained by The New York Times with 702 signatures (the most complete public copy Gizmodo could find), several staffers who have now quit either did not sign the show of support for OpenAI’s leadership or were slow to do so.

The names of Superalignment team members Jan Leike, Leopold Aschenbrenner, and William Saunders—who have since quit—do not appear alongside more than 700 other OpenAI staffers showing support for Altman and Brockman in the Times’ copy. World-renowned AI researcher Andrej Karpathy and former OpenAI staffers Daniel Kokotajlo and Cullen O’Keefe also do not appear in this early version of the letter and have since left OpenAI. These individuals may have signed the later version of the letter to signal support, but if so, they seem to have been the last to do it.

Gizmodo has reached out to OpenAI for comment on who will be leading the Superalignment team from here on out but we did not immediately hear back.

More broadly, safety at OpenAI has always been a divisive issue. That’s what led Dario and Daniela Amodei to leave in 2021 and start their own AI company, Anthropic, alongside nine other former OpenAI staffers. The safety concerns were also what reportedly led OpenAI’s nonprofit board members to oust Altman and Brockman. These board members were replaced with some infamous Silicon Valley entrepreneurs.

OpenAI still has a lot of people working on safety at the company. After all, the startup’s stated mission is to safely create AGI that benefits humanity! That said, here is Gizmodo’s running list of notable AI safety advocates who have left OpenAI since Altman’s ousting. Click through on desktop or just keep scrolling on mobile.

Ilya Sutskever & Jan Leike

Photo: JACK GUEZ / AFP (Getty Images), Screenshot: X (Getty Images)

The former leaders of OpenAI’s Superalignment team simultaneously quit this week, one day after the company released its impressive GPT-4 Omni model. The goal of Superalignment, outlined during its July 2023 launch, was to develop methods for “steering or controlling a potentially superintelligent AI, and preventing it from going rogue.”

At the time, OpenAI noted it was trying to build these superintelligent models but did not have a solution for controlling them. It’s unclear if the departure of Leike and Sutskever was related to safety concerns, but their absence certainly leaves some room for speculation.

Andrej Karpathy

Photo: Michael Macor/The San Francisco Chronicle (Getty Images)

Karpathy, a founding member of OpenAI, left the company for the second time in Feb. 2024. He remarked at the time that “nothing happened,” though his departure came roughly one year after he left Tesla to rejoin OpenAI. Karpathy is widely regarded as one of the most influential and respected minds in artificial intelligence. He worked at Stanford under Fei-Fei Li, the “godmother of AI” and another outspoken AI safety advocate.

The Old OpenAI Board

Photo: Jerod Harris (Getty Images)

Helen Toner and Tasha McCauley were the first casualties of Sam Altman’s return to power: when Altman came back, both were pushed off the board.

At the time, Toner said the decision to fire Altman was about “the board’s ability to effectively supervise the company.” That statement was somewhat ominous, but Toner said she would continue her work focusing on AI policy, safety, and security. McCauley, on the other hand, said even less at the time, though she has ties to the effective altruism movement, which claims to prioritize addressing the dangers of AI over short-term profits.

Leopold Aschenbrenner & Pavel Izmailov

Screenshot: The Information

Aschenbrenner was a known ally of Sutskever and a member of the Superalignment team. He was fired in April 2024 for allegedly leaking information to journalists, according to The Information. He also has ties to the effective altruism movement.

Izmailov was another staffer fired for leaking information to journalists in April 2024. He worked on the reasoning team but also spent time on the safety side of things.

William Saunders

Photo: Joan Cros/NurPhoto (Getty Images)

Saunders, an OpenAI staffer on Sutskever and Leike’s Superalignment team, resigned in Feb. 2024, according to Business Insider. It’s unclear exactly why he resigned.

Daniel Kokotajlo

Photo: Joan Cros/NurPhoto (Getty Images)

Kokotajlo, who worked on the company’s governance team, resigned from OpenAI in April 2024, also according to Business Insider. On his LessWrong page, he writes that he “quit OpenAI due to losing confidence that it would behave responsibly around the time of AGI.”

Cullen O’Keefe

Screenshot: YouTube

O’Keefe appears to have resigned from OpenAI in April 2024 after over four years on the company’s governance team, according to his LinkedIn. O’Keefe frequently blogs about AI safety, and says in a post he continues to pursue “safe and beneficial AGI, but now in an independent role.”