It was their job to make sure humans are safe from OpenAI's superintelligence. They just quit.

  • Machine learning researcher Jan Leike and chief scientist Ilya Sutskever just resigned from OpenAI.

  • Their team's job, essentially, was to make sure humans are safe from OpenAI's superintelligence.

  • The team was racing to make sure AI remains aligned with mankind's interests.

It's too soon to say what these major departures at OpenAI mean for the company, or whether Sam Altman plans to replace the people in these roles with new staffers. But this isn't a great day for AI doomsayers.

OpenAI cofounder Ilya Sutskever, the company's chief scientist, said on X on Tuesday that he "made the decision to leave OpenAI." Calling it "an honor and a privilege to have worked together" with Altman and crew, Sutskever bowed out of the role, saying he's "confident that OpenAI will build AGI that is both safe and beneficial."

The more abrupt departure came from Jan Leike, another top OpenAI executive. On Tuesday night, Leike posted a blunt, two-word confirmation of his exit from OpenAI on X: "I resigned."

Leike and Sutskever led the superalignment team at OpenAI, which has seen a number of other departures in recent months.

"We need scientific and technical breakthroughs to steer and control AI systems much smarter than us," OpenAI said of superalignment in a July 5, 2023 post on its website. "To solve this problem within four years, we're starting a new team, co-led by Ilya Sutskever and Jan Leike, and dedicating 20% of the compute we've secured to date to this effort."

So it follows that part of the duo's work was to, in OpenAI's words, "ensure AI systems much smarter than humans follow human intent."

And OpenAI acknowledged in the same July 2023 post that no such controls exist yet.

"Currently, we don't have a solution for steering or controlling a potentially superintelligent AI and preventing it from going rogue. Our current techniques for aligning AI, such as reinforcement learning from human feedback, rely on humans' ability to supervise AI," read OpenAI's post. "But humans won't be able to reliably supervise AI systems much smarter than us, and so our current alignment techniques will not scale to superintelligence. We need new scientific and technical breakthroughs."

Leike, who worked at Google's DeepMind before his gig at OpenAI, had big aspirations for keeping humans safe from the superintelligent AI that humans may one day create.

"It's like we have this hard problem that we've been talking about for years and years and years, and now we have a real shot at actually solving it," Leike said on an August 2023 episode of the "80,000 Hours" podcast.

On his Substack, Leike has outlined how the alignment problem, the risk that machines don't act in accordance with humans' intentions, might be solved and what solving it would require.

"Maybe a once-and-for-all solution to the alignment problem is located in the space of problems humans can solve. But maybe not," Leike wrote in March 2022. "By trying to solve the whole problem, we might be trying to get something that isn't within our reach. Instead, we can pursue a less ambitious goal that can still ultimately lead us to a solution, a minimal viable product (MVP) for alignment: Building a sufficiently aligned AI system that accelerates alignment research to align more capable AI systems."

Sutskever, Leike, and representatives for OpenAI did not immediately respond to requests for comment from Business Insider, sent outside regular business hours.
