Authors sue OpenAI for copyright infringement

The inevitable has occurred: A lawsuit seeking class action status on behalf of authors has been filed against OpenAI over ChatGPT. It’s already well established that OpenAI’s model, like others of its kind, has been trained on works protected by copyright. However, because these models continue to operate with little transparency, many unknowns remain. We do know that entire books (some from Smashwords and others likely from pirate sites) have been scraped off the open web without permission.

Critical questions, likely to be debated for years to come, lie at the heart of this case: Is copyright being violated in the training of ChatGPT? Is ChatGPT itself an infringing derivative work? Are the outputs of ChatGPT infringing on copyright? Did OpenAI violate the DMCA? While authors may feel the answers are clear cut, these are complex issues.

The Authors Guild has issued a statement in support of the lawsuit and is “standing by to provide support if requested.” Their statement continues, “Using books and other copyrighted works to build highly profitable generative AI technologies without the consent or compensation of the authors of those works is blatantly unfair—whether or not a court ultimately finds it to be fair use.” Reading between the lines, one might conclude the Authors Guild believes this will be a challenging case to win. Thus far, the Guild has pursued legislation to protect authors in the AI era rather than sue the tech companies. That isn’t so surprising, given the failure of their suit against Google’s book scanning initiative, which was found to be fair use.

Meanwhile, authors are not the only ones suing OpenAI. A different law firm has brought another class action lawsuit on behalf of all internet users, seeking to represent “real people whose information was stolen and commercially misappropriated to create this very powerful technology.”

Jane Friedman

Jane Friedman has spent her entire career working in the publishing industry, with a focus on business reporting and author education. Established in 2015, her newsletter The Bottom Line provides nuanced market intelligence to thousands of authors and industry professionals; in 2023, she was named Publishing Commentator of the Year by Digital Book World.

Jane’s expertise regularly features in major media outlets such as The New York Times, The Atlantic, NPR, The Today Show, Wired, The Guardian, Fox News, and BBC. Her book, The Business of Being a Writer, Second Edition (The University of Chicago Press), is used as a classroom text by many writing and publishing degree programs. She reaches thousands through speaking engagements and workshops at diverse venues worldwide, including NYU’s Advanced Publishing Institute, Frankfurt Book Fair, and numerous MFA programs.
