[nexa] Report: Potential NYT lawsuit could force OpenAI to wipe ChatGPT and start over

Alberto Cammozzo via nexa Fri, 25 Aug 2023 01:26:08 -0700


<https://arstechnica.com/tech-policy/2023/08/report-potential-nyt-lawsuit-could-force-openai-to-wipe-chatgpt-and-start-over/>

Weeks after The New York Times updated its terms of service (TOS) toprohibit AI companies from scraping its articles and images<https://arstechnica.com/information-technology/2023/08/the-new-york-times-prohibits-ai-vendors-from-devouring-its-content/>to train AI models, it appears that the Times may be preparing to sueOpenAI. The result, experts speculate, could be devastating to OpenAI,including the destruction of ChatGPT's dataset and fines up to $150,000<https://www.copyright.gov/title17/92chap5.html> per infringing piece ofcontent.

NPR spoke<https://www.npr.org/2023/08/16/1194202562/new-york-times-considers-legal-action-against-openai-as-copyright-tensions-swirl>to two people "with direct knowledge" who confirmed that the Times'lawyers were mulling whether a lawsuit might be necessary "to protectthe intellectual property rights" of the Times' reporting.

Neither OpenAI nor the Times immediately responded to Ars' request tocomment.

If the Times were to follow through and sue ChatGPT-maker OpenAI, NPRsuggested that the lawsuit could become "the most high-profile" legalbattle yet over copyright protection since ChatGPT's explosively popularlaunch. This speculation comes a month after Sarah Silverman joinedother popular authors suing OpenAI<https://arstechnica.com/information-technology/2023/07/book-authors-sue-openai-and-meta-over-text-used-to-train-ai/>oversimilar concerns, seeking to protect the copyright of their books.

Of course, ChatGPT isn't the only generative AI tool drawing legalchallenges over copyright claims. In April, experts told Ars thatimage-generator Stable Diffusion could be a "legal earthquake"<https://arstechnica.com/tech-policy/2023/04/stable-diffusion-copyright-lawsuits-could-be-a-legal-earthquake-for-ai/>due to copyright concerns.

But OpenAI seems to be a prime target for early lawsuits, and NPRreported that OpenAI risks a federal judge ordering ChatGPT's entiredata set to be completely rebuilt—if the Times successfully proves thecompany copied its content illegally and the court restricts OpenAItraining models to only include explicitly authorized data. OpenAI couldface huge fines for each piece of infringing content, dealing OpenAI amassive financial blow just months after The Washington Post reported<https://www.washingtonpost.com/technology/2023/07/07/chatgpt-users-decline-future-ai-openai/>that ChatGPT has begun shedding users, "shaking faith in AI revolution."Beyond that, a legal victory could trigger an avalanche of similarclaims from other rights holders.

Unlike authors who appear most concerned about retaining the option toremove their books from OpenAI's training models, the Times has otherconcerns about AI tools like ChatGPT. NPR reported that a "top concern"is that ChatGPT could use The Times' content to become a "competitor" by"creating text that answers questions based on the original reportingand writing of the paper's staff."

As of this month, the Times' TOS prohibits any use of its content for"the development of any software program, including, but not limited to,training a machine learning or artificial intelligence (AI) system."

Now it seems clear that this update provides the Times with an extralayer of protection as NPR reports that the media outlet is seeminglyreconsidering a licensing deal with OpenAI. That licensing deal wouldhave ensured that OpenAI paid for NYT content used to train its models.According to NPR, meetings between OpenAI and the Times have become"contentious," making the deal appear increasingly unlikely as the Timesseemingly weighs whether any licensing deal would be worth participatingin, as the resulting product could become its fiercest competitor.

To defend its AI training models, OpenAI would likely have to claim"fair use" of all the web content the company sucked up to train toolslike ChatGPT. In the potential New York Times case, that would meanproving that copying the Times' content to craft ChatGPT responses wouldnot compete with the Times.

Experts told NPR that would be challenging for OpenAI because unlikeGoogle Books—which won a federal copyright challenge in 2015 because itsexcerpts of books did not create a "significant market substitute" forthe actual books—ChatGPT could actually replace for some web users theTimes' website as a source of its reporting.

The Times' lawyers appear to think this is a real risk, and NPR reportedthat, in June, NYT leaders issued a memo to staff that seems like anearly warning of that risk. In the memo, the Times' chief productofficer, Alex Hardiman, and deputy managing editor Sam Dolnick said atop "fear" for the company was "protecting our rights" againstgenerative AI tools.

"How do we ensure that companies that use generative AI respect ourintellectual property, brands, reader relationships, and investments?"the memo asked, echoing a question being raised in newsrooms that arebeginning to weigh the benefits and risks of generative AI.

Last month, the Associated Press became one of the first newsorganizations to strike a licensing deal with OpenAI, but the terms ofthe deal were not disclosed<https://apnews.com/article/openai-chatgpt-associated-press-ap-f86f84c5bcc2f3b98074b38521f5f75a>.Today, AP reported<https://apnews.com/article/artificial-intelligence-guidelines-ap-news-532b417395df6a9e2aed57fd63ad416a>that it had joined other news organizations in developing standards forthe use of AI in newsrooms, acknowledging that many "news organizationsare concerned about their material being used by AI companies withoutpermission or payment."

In April, the News Media Alliance published AI principles<https://www.newsmediaalliance.org/ai-principles/>, seeking to defendpublishers' intellectual property by insisting that generative AI"developers and deployers must negotiate with publishers for the rightto use" publishers' content for AI training, AI tools surfacinginformation, and AI tools synthesizing information.

Ars could not immediately reach the News Media Alliance for comment onthe potential impact of the NYT case.



       reader comments

347 with<https://arstechnica.com/tech-policy/2023/08/report-potential-nyt-lawsuit-could-force-openai-to-wipe-chatgpt-and-start-over/?comments=1>

<https://arstechnica.com/author/ashleybelanger>

Ashley Belanger <https://arstechnica.com/author/ashleybelanger> Ashleyis a senior policy reporter for Ars Technica, dedicated to trackingsocial impacts of emerging policies and new technologies. She is aChicago-based journalist with 20 years of experience.


Advertisement

_______________________________________________
nexa mailing list
nexa@server-nexa.polito.it
https://server-nexa.polito.it/cgi-bin/mailman/listinfo/nexa

[nexa] Report: Potential NYT lawsuit could force OpenAI to wipe ChatGPT and start over

Reply via email to