Leading Authors of Today's Magazine
  • Home
  • Editorial
  • Featured New Authors
  • Anthologies
    • Moguls Unleashed
      • Dr. Dashnay Holmes is a Dynamic Entrepreneur!
      • Dr. Jane Mukami
      • Dr. Demaryl Roberts-Singleton
      • Dr. Desirie Sykes
      • Dr. Terry Golightly
      • Dr. Shontae Davidson
      • Dr. Adrienne Velazquez
      • Dr. Nichole Pettway
      • Dr. Daniela Peel: Corporate Wellness
  • News and Updates
  • More
    • Multimedia
    • Author of the Month
    • Book Reviews
    • Interviews and Conversations
    • Community and Engagement
    • Writing Resources
    • Genre Explorations
No Result
View All Result
Leading Authors Of Today's Magazine
No Result
View All Result

New Lawsuit Accuses OpenAI Of Using Copyrighted Books Illegally For AI Model Training

May 23, 2024
in How-to
0
Home How-to
0
SHARES
0
VIEWS
Share on FacebookShare on Twitter
New Lawsuit Accuses OpenAI Of Using Copyrighted Books Illegally For AI Model Training


OpenAI seems to be in deep waters after being sent a new lawsuit in its direction by The Authors Guild.

They’re accusing the AI giant of making use of a massive collection of copyrighted books illegally for the sake of training AI models.

The newly published unsealed papers spoke about how the AI startup even went on to delete datasets used for training of GPT-3 which featured a wide number of book collections. Moreover, other shocking facts worth a mention include how researchers that made datasets were removed from the organization and therefore no longer work there today.

The class action legal case delineated the datasets as Books 1 and Books 2 which were used for training the older versions of GPT like GPT-3. As revealed in the court filings recently, there were close to 100k published books that were unlawfully used for this purpose, knowing very well how they were supported by copyright terms and conditions.

Lawyers on this front have spoken about how the Authors Guild tried to attain more data from OpenAI regarding this. And while the firm did offer major resistance at the start, citing reasons such as confidentiality, the truth did eventually come out about how copies of the datasets were in face deleted as mentioned in the latest report by Business Insider.

The material used for training was of the highest standards and today stand as an integral part of the firm’s AI models who are revolutionizing the world as we speak. The company and many others made use of plenty of data found online such as books to better curate and refine its models.

But a lot of firms that created the material claim it’s not fair to unjustly use material that is under the ownership of others without any form of consent of compensation provided. Intelligence is being used and righty so, they should be paid for it. Now, the courts are fighting many such battles and it appears to be a long legal woe with no end in sight soon.

Meanwhile, other similar stories on this front relate to a white paper rolled out in 2020 where the AI giant called the datasets as books based on the internet and they ended up making just 16% of data used for training models like GPT-3.

The white paper highlighted how 67 billion tokens featuring data or close to 50 billion words were used. And that’s a lot of content when you come to think of it.

The letter that’s now unsealed from lawyers of OpenAI mentioned how such datasets in question that were used for training were discontinued during the latter part of 2021.

After that, they were deleted for not being in use. And then the letter that’s dubbed very confidential adds how no other kinds of data which were used for training of models were deleted. So the company did offer lawyers from The Authors Guild to access them and other kinds of datasets too.

The documents are now unsealed and they reveal how several researchers who gave rise to the databooks in question are now working in the firm so this is why they will not be revealing their identities either.

Right now, we can confirm how their identification was give for the sake of the investigation by the Authors Guild but no public revealing was or will be done right now. Moreover, the firm continues to petition inside court how the employees names and their datasets would be under seal and remain in that way.

But as one can expect, this information was not taken well by The Authors Guild who argued and opposed this. They felt the public had every right to know and now, the matter is an ongoing dispute.

For now, OpenAI remains very clear and bold on its stance. It says the models that power its GPT and API were not created through the use of such datasets. This was the statement rolled out publicly by the company on Tuesday. They were last said to be used in the year 2021 and since then, have been deleted as they were produced by employees who are no longer working with the AI giant.

Image: DIW-Aigen

Read next: Google’s Future Focuses on AI, Says Former CEO





Read More

Previous Post

Orion Magazine – An Interview with Tommy Orange

Next Post

Bestselling children’s author David Walliams to attend Franschhoek Literary Festival in May

Next Post
Bestselling children’s author David Walliams to attend Franschhoek Literary Festival in May

Bestselling children's author David Walliams to attend Franschhoek Literary Festival in May

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Random News

Bihar Board Matric Topper Interview 2025 | BSEB Class10th Topper Interview 2025 | 10th Topper Name

Bihar Board Matric Topper Interview 2025 | BSEB Class10th Topper Interview 2025 | 10th Topper Name

...

How non-fiction inspires reluctant readers and boosts reading for pleasure

How non-fiction inspires reluctant readers and boosts reading for pleasure

...

The Special Forces and War that Forged Special Operations – “The Unvanquished” by Patrick O’Donnell

The Special Forces and War that Forged Special Operations – “The Unvanquished” by Patrick O’Donnell

...

Picatrix (Ghayat al-Hakim): The World’s Most Famous Book of Magic?

Picatrix (Ghayat al-Hakim): The World’s Most Famous Book of Magic?

...

I asked Meta’s A.I. chatbot what it thought of my books. What I learned was deeply worrying.

I asked Meta’s A.I. chatbot what it thought of my books. What I learned was deeply worrying.

...

3 Books That Will Change Your Life

3 Books That Will Change Your Life

...

About us

Today's Author Magazine

Welcome to Today's Author Magazine, the go-to destination for discovering fresh talent in the literary world. We shine a light on new authors and captivating anthologies, providing readers with a diverse array of stories and insights. Here's a look at the vibrant categories that make up our magazine

RecentNews

Dr. Donald Variste

The Power of Voice: Rev. Dr. Serena J. Rowan’s Journey of Leadership and Influence

Dr. Janie Melinda Cauthorne

Dr. Tracy Banks Carr

Betrayed by George R. R. Martin | Tolarian Community College X Dragonsteel | #brandonsanderson

Categories

  • Anthologies
  • Author of the Month
  • Book Reviews
  • Community and Engagement
  • Editorial
  • Featured
  • Featured New Authors
  • Genre Explorations
  • How-to
  • Interviews and Conversations
  • Multimedia
  • News and Updates
  • Other
  • Writing Resources

RandomNews

Former World Leader Boldly Proclaims Christian Faith in New Book, ‘Plans for Your Good’

Developing AI Like Raising Kids

2021 Westerville writing contest winners announced

On the Making of That New York Times Best Books of the Century List ‹ Literary Hub

Bestselling author to unpack Britain’s biggest murder probe in Evesham

  • Home
  • About
  • Privacy
  • Terms
  • Contact

© 2024 Today's Author Magazine. All Rights Are Reserved.

No Result
View All Result
  • About
  • Contact
  • Home
  • Moguls Unleashed
  • Privacy
  • Terms

© 2024 Today's Author Magazine. All Rights Are Reserved.