darkphotonstudio ,

I think people would have less issues with AI training if it was non-profit and for the common good. And there are open source AI projects, many in fact. But yeah, these deals by companies like this are sleazy.

NeatNit ,

OpenAI was literally that until it wasn't

darkphotonstudio ,

I don't think OpenAI actually released any FOSS code, did they?

hagar ,

StackOverflow: *grabs money on monetizing massive amounts of user-contributed content without consulting or compensating the users in any way*

Users: *try to delete it all to prevent it*

StackOverflow: *your contributions belong to the community, you can't do that*

Pretty fucked-up laws. A lot of lawsuits going on right now against AI companies for similar issues. In this case, StackOverflow is entitled to be compensated for its partnership, and because the answers are all CC BY-SA 3.0, no one can complain. Now, that SA? Whatever.

jubilationtcornpone ,

Data Rule Numero Uno:

Garbage in, garbage out.

Have fun training your LLM on a big steaming pile of hot garbage. That's 80% of Stack Overflows content.

helenslunch ,
@helenslunch@feddit.nl avatar

Would be a shame if someone used ChatGPT to generate bad answers and a short script to resubmit them back to Stackoverflow. So awful.

henfredemars ,

I feel like this content craze is going to evaporate soon because all the new content from here forward is sure to be polluted by LLM output already. AI is fast becoming a snake eating its own tail.

That reminds me. I should go update my licenses to spit in the face of AI training companies.

davel ,
@davel@lemmy.ml avatar

Good luck with the deleting. It often just means UPDATE comments SET is_deleted = 1 WHERE ID = 666;.

chiisana ,
@chiisana@lemmy.chiisana.net avatar

There was similar things done on Reddit during the big exit. I doubt it achieved what people expected it to achieve. Even if they’re not visible externally, I’m sure they can easily access (thereby make deals to license) the data out of their backend / backup; just a matter of how hard they want to try (hint: it’s really not very hard).

duncesplayed ,

Yeah during the reddit exodus, people were recommending to overwrite your comment with garbage before deleting it.
This (probably) forces them to restore your comment from backup.
But realistically they were always going to harvest the comments stored in backup anyway, so I don't think it caused them any more work.

If anything, this probably just makes reddit's/SO's partnership more valuable because your comments are now exclusive to reddit's/SO's backend, and other companies can't scrape it.

Lemongrab ,

It was to make the data inaccessible to general people, therefore removing the reason people visit reddit. Even if reddit could still get the data, regular people would be inconvenienced (in theory) and look somewhere else.

CaptObvious ,

Stack Overflow just earned a place under Reddit in the hosts block list.

stembolts ,

This is similar to when I heard reddit was doing the API lockdown, I wrote an automation bot over the weekend that self-destructed my subreddit and the entire post history. The bot also automatically downloaded and archived all of the content on my local machine.

It was annoying because at first I couldn't get access to older posts since at the time reddit had changed their API to only show the first X posts (100 or 1,000 or whatever). So I told my bot to delete the posts as it archived them so as I deleted content, reddit had no choice but to populate the page with the older posts.

And that's how I archived my subreddit. Reddit banned me two days later for automation, lol. I did not break any of the reddit or reddit api ToS during this process but I guess I upset someone.

GBU_28 ,

Unfortunately they still have everything. It's good for the "human" visibility (lack of) but they have the data still

SuckMyWang ,

We can’t even communicate with out being leeched upon. Fuck this is grim

  • All
  • Subscribed
  • Moderated
  • Favorites
  • random
  • technology@lemmy.ml
  • test
  • worldmews
  • mews
  • All magazines