Nvidia's AI Training Under Fire: Lawsuit Claims Use of Copyrighted Books Without Permission.
  • 289 views
  • 2 min read

Nvidia, a leading chip manufacturer, is facing a class-action lawsuit alleging that it used copyrighted books without permission to train its artificial intelligence (AI) models. The lawsuit, initially filed in March 2024, has gained momentum with recent revelations of Nvidia's alleged direct engagement with online repositories of pirated materials.

The plaintiffs, a group of authors including Abdi Nazemian, Brian Keene, Stewart O'Nan, Andre Dubus III, and Susan Orlean, claim that Nvidia utilized their copyrighted works to train AI models like NeMo Megatron and Nemotron-4. They allege that Nvidia downloaded copyrighted material from various "shadow libraries," including LibGen, Sci-Hub, Z-Library, and Anna's Archive. These shadow libraries are known for providing free access to copyrighted materials.

Internal company documents revealed in court filings indicate that Nvidia contacted Anna's Archive to gain high-speed access to copyrighted material for AI training. A member of Nvidia's data strategy team allegedly wrote to Anna's Archive, expressing interest in including the archive's content in pre-training data for large language models (LLMs). Despite warnings from Anna's Archive regarding the illegal nature of its collections, Nvidia management purportedly gave the "green light" to proceed. Anna's Archive then offered access to approximately 500 terabytes of data, including millions of copyrighted books, for a fee.

The lawsuit further claims that Nvidia provided scripts and tools to corporate customers, enabling them to automatically download datasets containing pirated books. Nvidia had previously trained its AI models on the Books3 dataset, which contains approximately 196,640 books copied from the pirate site Bibliotik.

Nvidia defends its actions by arguing that AI training on copyrighted material constitutes fair use under copyright law. The company contends that AI models use books as statistical data rather than reproducing them directly. This argument echoes similar defenses made by other AI companies facing copyright infringement lawsuits.

The case is significant as it marks the first time correspondence between a major US technology company and Anna's Archive has been publicly revealed in court proceedings. The authors are seeking statutory damages, actual damages, and compensation for what they describe as willful copyright violations. Hundreds of additional authors whose works appear in the pirated libraries could join the class-action suit.

This lawsuit is part of a growing wave of copyright litigation against AI companies. Other major AI companies, including Meta and Anthropic, have also faced lawsuits alleging they trained models on pirated books from shadow libraries. These lawsuits highlight the tension between the rapid development of AI technology and the protection of intellectual property rights. The outcome of these cases could have significant implications for the future of AI development and the use of copyrighted material in AI training.

Advertisement

Latest Post


Software is dying. Not the code, the business model. For twenty years, the tech industry has been a giant game of "counting heads," a lucrative racket where companies charge $150 a month for every human being sitting in a chair clicking a mouse. But...
  • 425 views
  • 3 min

The bill finally arrived. It was always coming, tucked under the plate while we were busy marveling at the magic tricks. OpenAI is officially testing advertisements within ChatGPT for a handful of users in the United States. If you’re one of the l...
  • 469 views
  • 3 min

Deloitte wants to sell you a brain. Not a human one—those are expensive, prone to burnout, and insist on things like "weekends" and "labor laws. " No, Deloitte India is pivoting to the only thing the Big Four care about lately: a proprietary AI pl...
  • 155 views
  • 3 min

Another year, another glass rectangle. We’re still months away from the official stage lights of San Francisco or Seoul, but the Samsung Galaxy S26 leaks are already trickling out of the supply chain like a leaky faucet in a house you can't afford t...
  • 221 views
  • 3 min

Advertisement
About   •   Terms   •   Privacy
© 2026 TechScoop360