Lemvi,

LLMs are trained to see parts of a document and reproduce the others; that is why they are called "language models".

For example, they might learn that the words "strawberries are" are often followed by "delicious", "red", or "fruits", but never by "airplanes", "bottles", or "are".
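A toy sketch of this idea, with made-up counts for the prefix "strawberries are" (the numbers and words are purely illustrative, not from any real corpus):

```python
from collections import Counter

# Hypothetical counts of words seen after "strawberries are" in training data.
continuations = Counter({"delicious": 50, "red": 30, "fruits": 20})

total = sum(continuations.values())
# A language model effectively learns these conditional probabilities.
probs = {word: count / total for word, count in continuations.items()}

print(probs["delicious"])            # 0.5
# A word never seen after the prefix gets (near-)zero probability:
print(probs.get("airplanes", 0.0))   # 0.0
```

Real models smooth these probabilities and condition on far longer contexts, but the underlying objective is the same: predict likely continuations from observed frequencies.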

Likewise, they learn to mimic the reasoning contained in their training data. They learn the words and structures involved in an argument, but they also learn the conclusions they should arrive at. If the training dataset consists of 80 documents arguing for something and 20 arguing against it (assuming nothing else, such as length, differentiates those documents), the LLM will adopt the standpoint of the 80 documents and argue for that thing. If those 80 documents contain flawed logic, so will the LLM's reasoning.
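The 80/20 example above boils down to a simple frequency argument; a minimal sketch (the numbers come from the example, the code is purely illustrative):

```python
from collections import Counter

# 80 training documents argue "for" a claim, 20 argue "against" it.
stances = ["for"] * 80 + ["against"] * 20
counts = Counter(stances)

# A model imitating this corpus assigns stance probabilities proportional
# to frequency, so the most likely output follows the majority view.
p_for = counts["for"] / len(stances)
predicted_stance = counts.most_common(1)[0][0]

print(p_for)             # 0.8
print(predicted_stance)  # for
```

The model is not weighing the arguments; it is reproducing the distribution of conclusions it was trained on.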

Of course, you could train an LLM on a carefully curated selection of documents free of logical fallacies. Perhaps such a model would be capable of actual logical reasoning (though it would still be biased towards the conclusions contained in the training dataset).

But to train an LLM you need vast amounts of data. Filtering out documents containing flawed logic not only requires a lot of effort, it also shrinks the training dataset.

Of course, that is exactly what the big companies are currently researching, and I am confident that LLMs will only get better over time. But the LLMs of today are trained on large datasets rather than perfect ones, and their architecture and training prioritize language modelling, not logical reasoning.
