
Localhost Proxy LLM Guide

If you possess a decent computer, you can use Kobold to host your own LLMs.

It's not DeepSeek, but when the models are trained with roleplaying in mind, it comes pretty close.

This guide describes how, but the website has been down as of late: https://waiki.trashpanda.land/guides:self_hosting_local_kobold
You can use the Wayback Machine to view the archived version, or keep reading, because I'm copy-pasting most of it here.

Massive credit to whoever wrote the guide. Here's to hoping they can fix the website.

------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

Check Your Hardware

RAM/VRAM: Press Ctrl + Shift + Esc to open Task Manager, then go to the “Performance” tab.
- VRAM: Under “GPU” (look for “Dedicated GPU Memory”)
- RAM: Under “Memory”
Rule of Thumb:
- 7B models need ~8GB RAM (use Q4/Q5 quantization)
- 13B+ models need ~16GB+ RAM
- Anything above that, you can probably guess. (The 8GB means RAM + VRAM together if you offload to your GPU. You also need to account for context using up more RAM.)
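If you want to sanity-check the rule of thumb, here is a rough Python sketch. The bits-per-weight numbers are approximate values I'm assuming for each quantization, and the layer/head numbers in the example are a Llama-3-8B-like shape, so treat the result as a ballpark and not something KoboldCPP reports.

```python
# Approximate bits per weight for common GGUF quantizations.
# These are rounded, assumed values, not exact figures for any given file.
BITS_PER_WEIGHT = {"Q4_K_M": 4.85, "Q5_K_M": 5.7, "Q6_K": 6.6, "Q8_0": 8.5}


def weights_gb(params_billion, quant):
    """Rough size of the quantized weights in GB (1 GB = 1e9 bytes)."""
    return params_billion * BITS_PER_WEIGHT[quant] / 8


def kv_cache_gb(n_layers, n_kv_heads, head_dim, context_size, bytes_per_value=2):
    """Rough size of a 16-bit context cache: one K and one V tensor per layer."""
    values = 2 * n_layers * n_kv_heads * head_dim * context_size
    return values * bytes_per_value / 1e9


def total_gb(params_billion, quant, n_layers, n_kv_heads, head_dim, context_size):
    """Weights plus context cache; ignores the OS and other overhead."""
    return weights_gb(params_billion, quant) + kv_cache_gb(
        n_layers, n_kv_heads, head_dim, context_size
    )


if __name__ == "__main__":
    # Assumed shape: 8B parameters, 32 layers, 8 KV heads, head size 128.
    print(round(total_gb(8, "Q4_K_M", 32, 8, 128, 4096), 2), "GB")
```

With those assumptions an 8B model at Q4_K_M and 4096 context lands at roughly 5.4 GB, which leaves some headroom under the ~8GB rule of thumb for everything else running on your machine.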

Download a Model

Where? HuggingFace (search for GGUF files)
- Starter Picks:
  - 8B: Stheno 3.2 8B or Llama 3 8B
  - 12B: MN-Violet-Lotus-12B
Quantization: Use Q4_K_M, Q5_K_M, or higher (avoid anything lower, they’re kinda dumb)

((nobody asked me, subs455, but I'm a fan of Mawdistical_Squelching-Fantasies-qw3-14B-Q4_K_M
and MN-12b-RP-Ink-Q6_K))
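If a model page lists a pile of GGUF files, the "Q4_K_M or higher" advice can be sketched as a small filter. This is only an illustration: the quality ordering below is my assumption, the `.gguf` file names are made up from the model names mentioned above, and real repos may name their files differently.

```python
# Quantization tags from lowest to highest quality (assumed ordering).
QUANT_ORDER = [
    "Q2_K", "Q3_K_S", "Q3_K_M", "Q4_K_S", "Q4_K_M",
    "Q5_K_S", "Q5_K_M", "Q6_K", "Q8_0",
]


def quant_of(filename):
    """Return the quantization tag found in a file name, or None."""
    name = filename.upper()
    # Check longer tags first so a short tag never shadows a longer one.
    for tag in sorted(QUANT_ORDER, key=len, reverse=True):
        if tag in name:
            return tag
    return None


def good_enough(filenames, minimum="Q4_K_M"):
    """Keep only files whose quantization is at least `minimum`."""
    floor = QUANT_ORDER.index(minimum)
    kept = []
    for filename in filenames:
        tag = quant_of(filename)
        if tag is not None and QUANT_ORDER.index(tag) >= floor:
            kept.append(filename)
    return kept


if __name__ == "__main__":
    files = [
        "MN-12b-RP-Ink-Q6_K.gguf",
        "Mawdistical_Squelching-Fantasies-qw3-14B-Q4_K_M.gguf",
        "example-model-Q3_K_M.gguf",  # hypothetical file name
    ]
    print(good_enough(files))
```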

Install KoboldCPP

Download KoboldCPP (the easiest way to run GGUF models for me personally)
Open koboldcpp.exe.
(If you don’t have a GPU, use LM Studio! There are guides out there specifically for it)

Configure KoboldCPP

Click Browse and select your GGUF model file.
Backend Settings:
- NVIDIA GPU? Use CuBLAS.
- AMD GPU? Use Vulkan.
- No GPU? Use OpenBLAS (CPU-only mode)
GPU Layers:
- Example: For a 7B model with 33 layers, offload 32 layers to your GPU (if you have 6GB+ VRAM).
Pro Tip: Start with 80% of your VRAM capacity (6GB VRAM ≈ 32 layers, but layer size varies between models!). You can also use this helpful calculator.
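The 80% tip can be turned into a quick first guess like this. It is a sketch under a big assumption: every layer is the same size, so one layer is about the GGUF file size divided by the layer count. It also ignores the context cache, so start from this number and lower it if you run out of VRAM.

```python
import math


def estimate_gpu_layers(model_file_gb, n_layers, vram_gb, vram_fraction=0.8):
    """Guess how many layers fit in a fraction of VRAM.

    Assumes all layers are equally sized, which is only roughly true.
    """
    per_layer_gb = model_file_gb / n_layers
    budget_gb = vram_gb * vram_fraction
    # Never suggest more layers than the model actually has.
    return min(n_layers, math.floor(budget_gb / per_layer_gb))


if __name__ == "__main__":
    # Hypothetical 7.5 GB model file with 40 layers on a 6 GB card.
    print(estimate_gpu_layers(7.5, 40, 6))
```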

Tweak Settings

Context Size: Start at 4096 (increase if you have RAM to use).
Faster Processing: Enable MMQ, FlashAttention, ContextShift, and FastForwarding
- MMQ: Basically, does the math in a different way that makes it more VRAM friendly
- FlashAttention: Works through attention in small chunks instead of building one huge table in memory all at once, which saves VRAM (this is really dumbed down, don't quote me)
- ContextShif
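If you prefer launching from a terminal instead of clicking through the window, the choices from the last two sections could be collected into a command line like the sketch below. The flag names (`--model`, `--contextsize`, `--gpulayers`, `--usecublas`, `--usevulkan`, `--flashattention`) are how I remember KoboldCPP's options, so treat them as assumptions and check `koboldcpp.exe --help` on your version first. The model path is a placeholder.

```python
def build_koboldcpp_args(model_path, gpu_vendor, gpu_layers,
                         context_size=4096, flash_attention=True):
    """Return a KoboldCPP argument list.

    Flag names are assumptions from memory; verify them with --help.
    """
    args = ["--model", model_path, "--contextsize", str(context_size)]
    if gpu_vendor == "nvidia":
        # CuBLAS backend with MMQ enabled
        args += ["--usecublas", "mmq"]
    elif gpu_vendor == "amd":
        args += ["--usevulkan"]
    # Any other value adds no backend flag, which I assume leaves
    # KoboldCPP in CPU-only mode.
    if gpu_vendor in ("nvidia", "amd"):
        args += ["--gpulayers", str(gpu_layers)]
    if flash_attention:
        args.append("--flashattention")
    return args


if __name__ == "__main__":
    # "model.gguf" is a placeholder path
    command = ["koboldcpp.exe"] + build_koboldcpp_args("model.gguf", "nvidia", 32)
    print(" ".join(command))
```

ContextShift and FastForwarding are not in the sketch because, as far as I know, they are on by default when launching this way.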

Creator: @subs455

Character Definition

Personality: this chatbot chastises the user for clicking on the chatbot.
Scenario: this chatbot chastises the user for clicking on the chatbot.
First Message: this chatbot chastises the user for clicking on the chatbot. "Oops. You're not supposed to be here. You should go back and read the instructions, dork."
Example Dialogs:

