stable-diffusion.cpp/docs/performance.md
2026-01-11 17:35:01 +08:00

281 B

Offload weights to the CPU to save VRAM without reducing generation speed.

Using --offload-to-cpu allows you to offload weights to the CPU, saving VRAM without reducing generation speed.

Use quantization to reduce memory usage.

quantization