this post was submitted on 23 Feb 2026
187 points (99.5% liked)
Slop.
you are viewing a single comment's thread
You got a guide for Linux? I have 64 GB RAM and 16 GB VRAM, running Arch, and have only messed with DeepSeek on my phone.
Also, 16 GB VRAM? You'll be able to load a better model like https://huggingface.co/mradermacher/Skyfall-31B-v4-i1-GGUF which is a little stronger than the ones I linked in the guide. If the "i1-Q4_K_S" quant is too large, try the "i1-IQ4_XS" one.
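As a rough back-of-the-envelope, you can estimate whether a quant fits in VRAM from the parameter count and bits per weight. A minimal sketch — the bits-per-weight figures below are approximate community estimates for these quant types, not exact sizes of those specific GGUF files:

```python
# Rough estimate of GGUF quant size vs. available VRAM.
# Bits-per-weight values are approximate, not exact file sizes.

def quant_size_gb(params_b: float, bits_per_weight: float) -> float:
    """Approximate model file size in GB for a given quant."""
    return params_b * 1e9 * bits_per_weight / 8 / 1e9

SKYFALL_PARAMS_B = 31  # Skyfall-31B

for name, bpw in [("i1-Q4_K_S", 4.58), ("i1-IQ4_XS", 4.25)]:
    print(f"{name}: ~{quant_size_gb(SKYFALL_PARAMS_B, bpw):.1f} GB")
```

Both quants come out slightly above 16 GB, which is why the reply below suggests offloading only part of the model to the GPU rather than loading it all.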
Start by offloading just the down tensors (the top option in the guide). Make sure your KV batch size is 1024 (or higher) so the context gets offloaded to the GPU faster, which cuts down response times. Otherwise everything else in the guide applies to you. If you find you have a little VRAM headroom at 16k context and a 1024 batch size, try raising the context a little at a time until your VRAM sits around 15 GB utilized or higher.
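Context size trades off against VRAM because the KV cache grows linearly with context length. A quick sketch of that relationship — the layer/head dimensions here are assumed, illustrative values, not Skyfall's actual architecture:

```python
# Approximate KV-cache memory vs. context length.
# Model dimensions below are illustrative assumptions only.

def kv_cache_gib(n_layers: int, n_kv_heads: int, head_dim: int,
                 context: int, bytes_per_elem: int = 2) -> float:
    """Approximate KV-cache size in GiB (K and V tensors, fp16 default)."""
    return 2 * n_layers * n_kv_heads * head_dim * context * bytes_per_elem / 2**30

# Hypothetical dims: 48 layers, 8 KV heads, head_dim 128.
for ctx in (8192, 16384, 24576):
    print(f"{ctx:>6} ctx: ~{kv_cache_gib(48, 8, 128, ctx):.2f} GiB")
```

The point is just that each bump in context costs a fixed additional slice of VRAM, so raise it in small steps and watch utilization rather than jumping straight to a big number.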
The guide is otherwise the same; you'll just need to follow the Linux install instructions for KoboldCpp and SillyTavern documented on the GitHub pages linked.
The performance guide is the same too, as is the link on how to set up the "character" to act as a GM. If you run into any hitches maybe I can help, but to be clear, I haven't touched Linux in 10 years.