Sarah Silverman Leads Class Action Copyright Suit Against ChatGPT

Jon Blistein

July 10, 2023 at 11:48 AM

Sarah Silverman Performs At The Ryman Auditorium - Credit: Jason Kempin/Getty Images

Sarah Silverman is part of a new copyright class action suit against OpenAI, which alleges that ChatGPT was illegally trained on copyrighted books.

Silverman is one of three lead plaintiffs, alongside the authors Christopher Golden and Richard Kadrey. They claim, on behalf of the prospective class, that their books were part of a trove of copyrighted material “copied by OpenAI” and used to train ChatGPT “without consent, without credit, and without compensation.”

The new lawsuit is similar to one brought by authors Paul Tremblay and Mona Awad against OpenAI earlier this year. In fact, Silverman and co. have enlisted the same attorney to represent them, Matthew Butterick of the Joseph Saveri Law Firm.

OpenAI — a research lab with nonprofit and corporate arms — officially launched ChatGPT last year. The software, known as a “large language model” (LLM), is fed copious amounts of text and is thus able to generate human-like responses to text inputs.

Noting that an LLM’s output is “entirely and uniquely reliant on the material in its dataset,” the new lawsuit alleges that “much of the material in OpenAI’s training datasets [came] from copyrighted works.” The suit alleges that books by the three lead plaintiffs — Silverman’s memoir The Bedwetter, Golden’s Ararat, and Kadrey’s Sandman Slim — were among the copyrighted works used to train ChatGPT.

The plaintiffs’ lawyers say they can prove this by using ChatGPT itself: “The reason ChatGPT can accurately summarize a certain copyrighted book is because that book was copied by OpenAI and ingested by the underlying OpenAI Language Model as part of its training data,” the suit alleges. “When ChatGPT was prompted to summarize books written by each of the Plaintiffs, it generated very accurate summaries.”

While the suit acknowledges that the summaries got “some details wrong,” it asserts that such mistakes are “expected” since LLMs combine “expressive material derived from many sources.” Still, the suit claims the overall accuracy suggests “ChatGPT retains knowledge of particular works in the training dataset and is able to output similar textual content.”

The suit also offers some theories as to how OpenAI may have trained ChatGPT on copyrighted works. It notes that a July 2020 paper about an earlier version of ChatGPT said the software was trained on two troves of books, known as Books1 and Books2. Though OpenAI did not specify where the books came from, the suit says statistics mentioned in that OpenAI paper suggest Books1 came from Project Gutenberg, “an online archive of e-books whose copyright has expired,” while Books2 allegedly came from “notorious ‘shadow library’ websites” where e-books can be pirated.

The lawsuit goes on to cite OpenAI’s March 2023 paper introducing GPT-4, saying it “contained no information about its dataset at all.” In that paper, OpenAI wrote that “given both the competitive landscape and the safety implications of large-scale models like GPT-4, this report contains no further details about … dataset construction.”

OpenAI did not immediately return Rolling Stone’s request for comment.

In June, Rolling Stone spoke with several experts about a wave of lawsuits against artificial intelligence companies like OpenAI, which dealt with both privacy and copyright issues (a defamation suit was also brought). Several suggested that the similar copyright suit brought by Tremblay and Awad would face an uphill battle. For instance, Mehtab Khan, a resident fellow at Yale Law School and the lead for the Yale/Wikimedia Initiative on Intermediaries and Information, called the claim that the plaintiffs’ books were used to train ChatGPT “tenuous.” Khan added that the authors will have to prove that their writing was infringed and that there’s “substantial similarity between their works and the output generated by the chatbot.”
