In a first-of-its-kind decision, an AI company wins a copyright infringement lawsuit brought by authors
AI companies could have the legal right to train their large language models on copyrighted works — as long as they obtain copies of those works legally.
That’s the upshot of a first-of-its-kind ruling by a federal judge in San Francisco on Monday in an ongoing copyright infringement case that pits a group of authors against a major AI company.
The ruling is significant because it represents the first substantive decision on how fair use applies to generative AI systems.
Fair use doctrine enables copyrighted works to be used by third parties without the copyright holder’s consent in some circumstances such as illustrating a point in a news article. Claims of fair use are commonly invoked by AI companies trying to make the case for the use of copyrighted works to train their generative AI models. But authors and other creative industry plaintiffs have been pushing back with a slew of lawsuits.
Authors take on Anthropic
In their 2024 class action lawsuit, authors Andrea Bartz, Charles Graeber and Kirk Wallace Johnson alleged Anthropic AI used the contents of millions of digitized copyrighted books to train the large language models behind their chatbot, Claude, including at least two works by each plaintiff. The company also bought some hard copy books and scanned them before ingesting them into its model.
“Rather than obtaining permission and paying a fair price for the creations it exploits, Anthropic pirated them,” the authors’ complaint states.
In Monday’s order, Senior U.S. District Judge William Alsup supported Anthropic’s argument, stating the company’s use of books by the plaintiffs to train their AI model was acceptable.
“The training use was a fair use,” he wrote. “The use of the books at issue to train Claude and its precursors was exceedingly transformative.”
The judge said the digitization of the books purchased in print form by Anthropic could also be considered fair use, “because all Anthropic did was replace the print copies it had purchased for its central library with more convenient space-saving and searchable digital copies for its central library — without adding new copies, creating new works, or redistributing existing copies.”
However, Alsup also acknowledged that not all books were paid for. He wrote Anthropic “downloaded for free millions of copyrighted books in digital form from pirate sites on the internet” as part of its effort “to amass a central library of ‘all the books in the world’ to retain ‘forever,.'”
Alsup did not approve of Anthropic’s view “that the pirated library copies must be treated as training copies,” and is allowing the authors’ piracy complaint to proceed to trial.
“We will have a trial on the pirated copies used to create Anthropic’s central library and the resulting damages, actual or statutory (including for willfulness),” Alsup stated.
Bifurcated responses
Alsup’s bifurcated decision led to similarly divided responses from those involved in the case and industry stakeholders.
In a statement to NPR, Anthropic praised the judge’s recognition that using works to train large language models was “transformative — spectacularly so.” The company added: “Consistent with copyright’s purpose in enabling creativity and fostering scientific progress, Anthropic’s large language models are trained upon works not to race ahead and replicate or supplant them, but to turn a hard corner and create something different.”
However, Anthropic also said it disagrees with the court’s decision to proceed with a trial.
“We believe it’s clear that we acquired books for one purpose only — building large language models — and the court clearly held that use was fair,” the company stated.
A member of the plaintiffs’ legal team declined to speak publicly about the decision.
The Authors’ Guild, a major professional writers’ advocacy group, did share a statement: “We disagree with the decision that using pirated or scanned books for training large language models is fair use,” the statement said.
In an interview with NPR, the guild’s CEO, Mary Rasenberger, added that authors need not be too concerned with the ruling.
”The impact of this decision for book authors is actually quite good,” Rasenberger said. “The judge understood the outrageous piracy. And that comes with statutory damages for intentional copyright infringement, which are quite high per book.”
According to the Copyright Alliance, U.S. copyright law states willful copyright infringement can lead to statutory damages of up to $150,000 per infringed work. The ruling states Anthropic pirated more than 7 million copies of books. So the damages resulting from the upcoming trial could be huge.
The part of the case focused on Anthropic’s liability for using pirated works is scheduled to go to trial in December.
Other cases and a new ruling
Similar lawsuits have been brought by other prominent authors. Ta-Nehisi Coates, Michael Chabon, Junot Díaz and the comedian Sarah Silverman are involved in ongoing cases against AI players.
On Wednesday, U.S. District Judge Vince Chhabria ruled in favor of Meta in one of those cases. A copyright infringement lawsuit was brought by 13 authors including Richard Kadrey and Silverman. They sued Meta for allegedly using pirated copies of their novels to train LLaMA. Meta claimed fair use and won because the authors failed to present evidence that Meta’s use of their books impacted the market for their original work. However, the judge said the ruling applies only to the specific works included in the lawsuit and that in future cases, authors making similar claims could win if they make a stronger case.
“ These rulings are going to help tech companies and copyright holders to see where judges and courts are likely to go in the future,” said Ray Seilie, a lawyer based in Los Angeles with the firm Kinsella Holley Iser Kump Steinsapir, who focuses on AI and creativity. He is not involved with this particular case.
“ I think they can be seen as a victory for the AI community writ large because they create a precedent suggesting that AI companies can use legally-obtained material to train their models,” Seilie said.
But he said this doesn’t mean AI companies can immediately go out and scan whatever books they buy with impunity, since the rulings are likely to be appealed and the cases could potentially wind up before the Supreme Court.
“ Everything could change,” Seilie said.
Trump freezes $18 billion in funding for NYC, home to key Democratic leaders
On the first day of the federal government shutdown, the Trump administration froze "roughly $18 billion" in infrastructure projects for New York City, home to two of Trump's Democratic congressional opponents. It's not the first time Trump has threatened city funding over politics.
When will the government reopen? Here’s how long past shutdowns lasted
Government shutdowns lasting more than a few days were relatively rare — until recently. The 2018-2019 shutdown was the longest in U.S. history, stretching on for five weeks.
Lawsuit filed against immigration authorities after U.S. citizen’s arrests in raids
An Alabama construction worker and U.S. citizen who says he was detained twice by immigration agents within just a few weeks has filed a lawsuit in federal court demanding an end to Trump administration workplace raids targeting industries with large immigrant workforces.
What will Congress do after the shutdown? We asked a Republican and a Democrat
Most of the federal government is shut down after Congress failed to reach a funding agreement. We asked two House members — a Democrat and Republican — where they think talks go from here
The BET Hip Hop Awards cut the cord as DEI dies
The show's suspension comes amid broader efforts to curb diversity at the institutional level. The next attempt to canonize the movement must learn lessons from its successes — and its missteps.
Trump’s controversial nominee drops bid to head vital data-gathering agency
Trump's pick to lead the agency tracking unemployment and inflation has withdrawn after withering criticism from across the political spectrum. The White House says a new nominee will be named soon.