Activity - llama2 gguf with 2bit quantisation only needs ~5gb vram. 8bits need >9gb....

Mike1576218 , 17 days ago

llama2 gguf with 2bit quantisation only needs ~5gb vram. 8bits need >9gb. Anything inbetween is possible. There are even 1.5bit and even 1bit options (not gguf AFAIK). Generally fewer bits means worse results though.

Reply

Report

Activity

Open original URL

Copy original URL

Copy Mbin URL

Loading...

/m/artificial_intel@lemmy.ml

Threads (227)

Microblog (0)

People

Magazines

Thread

makeasnek

@makeasnek@lemmy.ml

Added: 20 days ago
Views: 9
Ratio: 0

Magazine

Artificial intelligence (AI) is intelligence demonstrated by machines, unlike the natural intelligence displayed by humans and animals, which involves consciousness and emotionality. The distinction between the former and the latter categories is often revealed by the acronym chosen.

Created: 5 months ago
Subscribers: 3912