16
submitted 10 months ago by btp@kbin.social to c/tech@kbin.social

The New York Times is suing OpenAI and Microsoft for copyright infringement, claiming the two companies built their AI models by “copying and using millions” of the publication’s articles and now “directly compete” with its content as a result.

As outlined in the lawsuit, the Times alleges OpenAI and Microsoft’s large language models (LLMs), which power ChatGPT and Copilot, “can generate output that recites Times content verbatim, closely summarizes it, and mimics its expressive style.” This “undermine[s] and damage[s]” the Times’ relationship with readers, the outlet alleges, while also depriving it of “subscription, licensing, advertising, and affiliate revenue.”

The complaint also argues that these AI models “threaten high-quality journalism” by hurting the ability of news outlets to protect and monetize content. “Through Microsoft’s Bing Chat (recently rebranded as “Copilot”) and OpenAI’s ChatGPT, Defendants seek to free-ride on The Times’s massive investment in its journalism by using it to build substitutive products without permission or payment,” the lawsuit states.

The full text of the lawsuit can be found here

you are viewing a single comment's thread
view the rest of the comments
[-] Zima@kbin.social 1 points 10 months ago

Ok i believe that you believe that. It’s ok. I have professional experience in this space so you’re either not reading carefully or you don’t understand much about the topic.

Perhaps you might want to reconsider this in more abstract terms. The engine example you ignored could help you with that.

Do you really think that the fact that we have language models that don’t memorize and are simple enough that we can know for certain is not all we need to show that language models don’t necessarily have to memorize? You keep repeating the same (illogical) argument and ignore the simpler arguments that disprove your claim.

[-] EvilMonkeySlayer@kbin.social 1 points 10 months ago

So, now it's gone from "reasonable effort" to most definitely you can say without any doubt that all the trained models contain no copyrighted data at all?

Come on. Make up your mind.

[-] Zima@kbin.social 1 points 10 months ago

You still haven’t backed up your claim. Once again just because you don’t know it doesn’t mean it’s not possible to do something.

[-] EvilMonkeySlayer@kbin.social 1 points 10 months ago

My man, now you're just trying to put the onus on me.

Which is it?

Is it they don't retain or they do?

You made the claim. 🤷‍♂️

[-] Zima@kbin.social 1 points 10 months ago

Lol. You already forgot you claimed that they need to retain the training data first.

[-] EvilMonkeySlayer@kbin.social 1 points 10 months ago

Oh, I've broken you.

[-] Zima@kbin.social 1 points 10 months ago

Lol. You already forgot you claimed that they need to retain the training data first.

[-] EvilMonkeySlayer@kbin.social 1 points 10 months ago

Pointing out your arguments inconsistency is forgetting?

Are you okay?

this post was submitted on 27 Dec 2023
16 points (100.0% liked)

Technology

165 readers
1 users here now

This magazine is dedicated to discussions on the latest developments, trends, and innovations in the world of technology. Whether you are a tech enthusiast, a developer, or simply curious about the latest gadgets and software, this is the place for you. Here you can share your knowledge, ask questions, and engage in discussions on topics such as artificial intelligence, robotics, cloud computing, cybersecurity, and more. From the impact of technology on society to the ethical considerations of new technologies, this category covers a wide range of topics related to technology. Join the conversation and let's explore the ever-evolving world of technology together!

founded 2 years ago