AI is learning from what you said on Reddit, Stack Overflow or Facebook. Are you OK with that? ( apnews.com )

n3m37h , 2 days ago

I've said so many wrong and stupid things on reddit over the years both purposely and not. I wouldn't trust anything trained on reddit. And facebook shouldn't even be thought if as a training source as its mainly bots, idiots and propaganda.

Stack overflow, I've never used but seems to be mostly smart people.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

ZDL , 2 days ago

Cool! So they're training AI on the worst humanity has to offer: Young Nazis for Freedom, "No Means Try Again Later" Techbrodudes, and Old Nazis who Don't Care About Freedom.

Cool cool cool!

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

werefreeatlast , 2 days ago

Plus rants.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

cows_are_underrated , 21 hours ago

You're forgetting the enormous amounts of shitposts

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

conciselyverbose , 2 days ago

If they hadn't made it impossible to use a functional app to browse their site, I probably wouldn't care.

But the fact that they blocked 100 apps that would, if you merged the absolute worst of all of them, still blow the actual Reddit app out of the water? Nope. Fuck 'em.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

BradleyUffner , 2 days ago

Yep. I knew it was a public when I posted it. The thing I have a problem with is when my non-public content is used by people I didn't intend.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

Fades , 2 days ago

Exactly, but it's far more easier to spread panic with generalizations than it is to speak about the specifics.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

thericofactor , 2 days ago

According to gdpr it is not allowed to use people's data for purposes other than the ones they agreed to.
I had an e-mail discussion with them and I filed a complaint with my countries gdpr enforcement agency.
They take a while to investigate and react, but hopefully if enough people complain they will take it seriously.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

Cagi , 2 days ago

An AI using what I say to train it will only make AI more like me. As far as I'm concerned, that's an improvement that may help others while not affecting my life one iota. It's not like it remembers who said what, it all just tweaks the algorithm.

Advertisers have been recording and storing what you say word for word to build a profile specific to you, containing your deepest, darkest truths so they can trick you into giving them more of your money. Who cares about AI training on publicly posted comments, it's just taking energy away from the real privacy issues.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

Lost_My_Mind , 2 days ago

In 2022 I was among the 1% of most karma awarded on reddit. Hadn't been banned anywhere.

Then in 2024 their IPO went public. A week later it was said that AI bots were now scrubbing the site to sell your data to google.

Within 3 weeks I was permanently banned, for 3 posts that had nothing wrong with them. One of them I understand how an AI would flag if its just using keywords. In context of the conversation nothing was wrong.

The other 2 didn't even have keywords which an AI would pick up on.

So my assumption is the AI scrubbed my data, started acting like me, which is to say an absurdist, and google didn't like the results of my data. So reddit banned me.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

Fades , 2 days ago

Exactly!!! All this fear about AI being trained as if our data hasn't been being vacuumed for years and years now. Tracking profiles, fingerprinting, etc. There are legitimate things to get energized and concerned over, no need to make up fear-based narratives

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

Tarkcanis , 2 days ago

I don't use any of them, but it sucks for the people that do.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

TheColonel , 2 days ago

No, that’s why I deleted my fucking account of 17 years.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

nickwitha_k , 2 days ago

It was all soft deletes.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

TheColonel , 2 days ago

Dunno, I used one of those overwrite services before I did.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

DannyBoy , 2 days ago

Did you delete your regular account as well?

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

henfredemars , 2 days ago

Jokes on them they paid for my drunk shitposts.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

harrys_balzac , 2 days ago

Exactly. I think about 90% of what I posted was Grade A sarcastic bullshit.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

I_Has_A_Hat , 2 days ago

Yes.

What? You were expecting me to give a shit about what happens to my comments on a public forum?

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

sunzu , 2 days ago

It is public... I have a bigger issue with reddit thinking they own it.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

rob_t_firefly , 2 days ago

According to the TOS you agree to when signing up to reddit, while they don't outright own your content they do own a license to do pretty much anything they want with your content.

You retain any ownership rights you have in Your Content, but you grant Reddit the following license to use that Content:

When Your Content is created with or submitted to the Services, you grant us a worldwide, royalty-free, perpetual, irrevocable, non-exclusive, transferable, and sublicensable license to use, copy, modify, adapt, prepare derivative works of, distribute, store, perform, and display Your Content and any name, username, voice, or likeness provided in connection with Your Content in all media formats and channels now known or later developed anywhere in the world. This license includes the right for us to make Your Content available for syndication, broadcast, distribution, or publication by other companies, organizations, or individuals who partner with Reddit. You also agree that we may remove metadata associated with Your Content, and you irrevocably waive any claims and assertions of moral rights or attribution with respect to Your Content.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

el_bhm , 2 days ago

No no. They own it up until the point it is illegal. Then, it is your content and your fault.

What I am getting at, double standards.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

professionalvirgin , 21 hours ago

exactly

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

Rentlar , 2 days ago

I can't wait to become a chess master with the help of AI powered by the smart folks at r/anarchychess (!anarchychess is the Lemmy equivalent)

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

FlyingSquid , 2 days ago

That is exactly the sort of community I wish I was smart enough to be a part of because it looks awesome.

On my side, I joined Chess Club in middle school and lost my first game in two moves. Because I suck that much at chess.

Think three moves ahead? I can't even think three moves behind.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

zephorah , 2 days ago

How much glue should you put in your pizza?

Seriously. Eating Reddit means some truly strange shit is going to emerge. We all know the pattern of fun replies on Reddit, but the AI does not. Hence the glue thing.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

ArbitraryValue , 2 days ago

I don't think that's an impossible problem. Existing models can reliability distinguish between, for example, different languages. Most of their training data is presumably in English but while this may make them better at generating English text, it doesn't make them randomly switch from other languages to English. A sufficiently advanced model would likewise distinguish between descriptions of reality and shit-posts because the content of shit-posts would not be useful for predicting descriptions of reality. Some fine tuning would teach it to produce just the descriptions of reality.

Or look at it this way: the folks developing these LLMs aren't ignorant of the fact that Reddit content is often false and meant to be funny. They're not going to make the sort of silly mistake that someone who isn't an expert can still easily predict and they're not going to train their LLMs on that content if it makes the LLMs worse, although we're still going to see some glue on pizza while the technology continues to develop.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

SpacetimeMachine , 2 days ago

It can cross check a language with tons of other words and examples of that language already in its data set. There is no such data for whether or not something confirms with reality. That simply doesn't exist and really won't ever exist. They are not similar problems. One is immensely more challenging to solve than the other.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

Lost_My_Mind , 2 days ago

Glue is still a more valid pizza topping than PINEAPPLE!!!

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

elbarto777 , 2 days ago

Bring back anchovies.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

Lost_My_Mind , 2 days ago

Found Zoidberg!

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

n3m37h , 2 days ago

Pineapple and bacon is the best toppings for pizza to ever exist

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

Snowpix , 20 hours ago

Pineapple and jalapeno together on pizza is pretty dope as well.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

galloFino , 2 days ago

In my opinion stack overflow has vague answers 80% of the time. If AI helps filtering all the bullshit, I’m ok with that. And I don’t know how Reddit or Facebook bot comments are a good source of information.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

FMT99 , 2 days ago

I mean whatever I posted there I posted for public use. Whatever it can learn from my few ancient Stack Overflow posts, it's welcome to it.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

thericofactor , 2 days ago

They are using your knowledge and talents to profit immensely by selling or using it. Selling it right back to you and everyone else.
I have a problem with that.

It would be fine if it was free.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

pubquiz , 2 days ago

Consider the source of the Reddit comments, Stack Overflow questions and FB photos: a substantial number of them are poorly written, irrelevant, re-posts and were already churned out by AI. So the ouroboros is the master of and source of its own fate. GIGO.

So fill yer boots, AI, it's your own shit you're eating.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

FlyingSquid , 2 days ago

Hence Google's AI giving supposedly real advice based on Reddit joke posts.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...