What does an MCP server do?
The point in time after the first qubit-based supercomputers transitioned from theoretical abstraction to physically proven reality, thus opening up the can of worms of feasibly cracking classical cryptographic encryption like an egg within human-acceptable time frames instead of longer-than-the-universe's-lifespan time frames. Thanks, superposition-based probabilistic parallel computation.
Thank you for deciding to engage with our community here! You're in good company.
Kobold just released a bunch of tools for quant making you may want to check out.
I have not made my own quants. I usually just find whatever imatrix GGUF bartowski or the other top makers on HF release.
I too am in the process of upgrading my homelab and opening up my model engine as a semi-public service. The biggest performance gains I've found are from using CUDA and loading everything in VRAM. So far I've just been working with my old Nvidia 1070 Ti 8GB card.
Haven't tried the vLLM engine, just Kobold. I hear good things about vLLM, so it will be something to look into sometime. I'm happy and comfortable with my model engine system since I've got everything set up just the way I want it, but I'm always open to performance optimization.
If you haven't already, try running vLLM with its CPU niceness set to the highest priority (that means the lowest nice value). If vLLM can use flash attention, try that too.
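Here's a rough sketch of what I mean by the niceness part, assuming a Linux box where the `nice` command is available. The `vllm serve my-model` command line is just a placeholder for however you actually launch your server, and `with_niceness` is a helper name I made up. Keep in mind that negative nice values (higher priority) normally need root.

```python
import shlex


def with_niceness(command: list[str], niceness: int) -> list[str]:
    """Prefix a command so it starts under the given nice value.

    On Linux the range is -20 (highest priority) to 19 (lowest).
    """
    if not -20 <= niceness <= 19:
        raise ValueError("niceness must be between -20 and 19")
    return ["nice", "-n", str(niceness), *command]


# Placeholder launch command; swap in your real one.
server_cmd = ["vllm", "serve", "my-model"]

# Print the full command line so it can be pasted into a shell
# (run it with sudo if the nice value is negative).
print(shlex.join(with_niceness(server_cmd, -10)))
```

If you'd rather launch it from Python directly, the same list can be handed to `subprocess.run`.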
I'm just enough of a computer nerd to get the gist of technical things and set everything up on the software/networking side. Bought a domain name, set up a web server and hardened it. Kobold's webUI didn't come with HTTPS SSL/TLS cert handling, so I needed to get a reverse proxy working to get the connection properly encrypted.
I am really passionate about this even though so much of the technical nitty-gritty under the hood of models goes over my head. I was inspired enough to buy a Tesla P100 16GB and try shoving it into an old gaming desktop, which is my current homelab project. I don't have a lot of money, so this was months of saving for the used server-class GPU and a PSU that can run it plus, later, the 1070 Ti 8GB I have.
The PC/server building hardware side scares me, but I'm working on it. I'm not used to swapping parts out at all. When I tried to build my own PC a decade ago it didn't last long before something blew, so there's a bit of residual trauma there. I'm worried about things not fitting right in the case, or destroying something, or the card not working and all that.
Those are unhealthy worries when I'm trying to apply myself to this cutting-edge stuff. I'm really trying to work past that anxiety and just try my best to install the stupid GPU. I figure if I fail, I fail. That's life, and it will be a learning experience either way.
I want to document the upgrade journey on my new self-hosted site. I also want to open my Kobold service to public use by fellow hobbyists. I'm not quite confident in sharing my domain on the public web just yet though, I'm still cooking.
Coincidentally the same name as my geometry themed experimental grunge rock band
The first thought I had was this same scenario but all grown up. Imagine two fully grown 700 lb bovines crammed in your kitchen staring down your dishes lol. They're all cute until they become living flesh tanks. Then they're still cute, but hella bulky and slightly intimidating.
nods and continues to use the original Doom WADs with the red cross design for health pickups because the green one from the BFG Edition looks like shit
Being an alternate protocol nerd is a trip. Most people have no clue what Gopher/Gemini/Spartan/Finger are or how they differ from the web. The handful on this planet that do are just other nerds who like to blogspam tech nerd things. It would be nice if the web enshittified so much that even the average non-techie was put in a position to look into these alternatives.
Right now THCA mail-order is under fire from goons in the House and Senate, so if you're gonna order in bulk legally you may want to do it soon. The lawmaking could go either way. I recommend eight horse hemp for cheap mid bulk and wnc-cbd for the top-shelf premium.
Have you by chance checked out the kobold.cpp lite webUI? It allows some of what you're asking for, like RAG for worldbuilding, adding images for the LLM to describe and work into the story, easy editing of input and output, and lots of customization in settings. I have a public instance of the Kobold webUI set up on my website and I'm cool with fellow hobbyists using my compute to experiment with things. If you're interested in trying it out to see if it's more what you're looking for, feel free to send me a PM and I'll send you the address and an API key/password.
In an ideal world, what exactly would you want an AI-integrated text editor to do? Depending on what you need to have happen in your workflow, you can automate the copy-pasting and the output logging with Python scripts and your engine's API.
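To give a rough idea of the scripting side, here's a minimal sketch that sends a prompt to the engine and appends the prompt/output pair to a log file. It assumes a local KoboldCpp instance on its default port with the KoboldAI-style `/api/v1/generate` endpoint and a `{"results": [{"text": ...}]}` reply; other engines will need a different URL and payload. The log file name and function names are just ones I picked.

```python
import json
import urllib.request
from datetime import datetime, timezone
from pathlib import Path

# Assumption: local KoboldCpp on its default port; change for your engine.
API_URL = "http://localhost:5001/api/v1/generate"
LOG_PATH = Path("generation_log.jsonl")  # hypothetical log file name


def build_payload(prompt: str, max_length: int = 200) -> dict:
    """Request body for the generate endpoint (assumed KoboldAI-style fields)."""
    return {"prompt": prompt, "max_length": max_length}


def extract_text(response: dict) -> str:
    """Pull the generated text out of the assumed response shape."""
    return response["results"][0]["text"]


def append_log(prompt: str, output: str, log_path: Path = LOG_PATH) -> dict:
    """Append one prompt/output pair to the log as a single JSON line."""
    record = {
        "time": datetime.now(timezone.utc).isoformat(),
        "prompt": prompt,
        "output": output,
    }
    with log_path.open("a", encoding="utf-8") as log_file:
        log_file.write(json.dumps(record, ensure_ascii=False) + "\n")
    return record


def generate_and_log(prompt: str, api_url: str = API_URL,
                     log_path: Path = LOG_PATH) -> str:
    """Send the prompt to the engine, log the exchange, return the text."""
    body = json.dumps(build_payload(prompt)).encode("utf-8")
    request = urllib.request.Request(
        api_url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=300) as reply:
        text = extract_text(json.load(reply))
    append_log(prompt, text, log_path)
    return text
```

With the engine running, something like `generate_and_log("Once upon a time")` would replace the manual copy-paste, and every exchange ends up as one line in the log file that you can grep or load later.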
Editing and auditing stories isn't that much different from auditing codebases. It all boils down to the understanding and correct use of language to convey abstraction. I bet tweaking some agentic personalities and goals in VS Code + Roo could get you somewhere.
Check if your country has a national weather service and use their direct website.