stable-diffusion.cpp/docs/performance.md
2026-01-11 17:35:01 +08:00

## Offload weights to the CPU to save VRAM without reducing generation speed.
Pass `--offload-to-cpu` to keep the model weights in system RAM and load them into VRAM only when they are needed. This lowers VRAM usage without reducing generation speed.
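
As a rough sketch, the flag is simply added to an ordinary generation command. The binary location (`./build/bin/sd-cli`) and the model path below are assumptions for illustration; substitute the paths from your own build and model directory.

```shell
# Hypothetical paths: point these at your own build output and model file.
SD_BIN="${SD_BIN:-./build/bin/sd-cli}"
MODEL="${MODEL:-./models/sd-v1-4.ckpt}"

if [ -x "$SD_BIN" ]; then
    # Generate an image with weights offloaded to the CPU.
    "$SD_BIN" -m "$MODEL" -p "a lovely cat" --offload-to-cpu
else
    echo "sd binary not found at $SD_BIN"
fi
```

Comparing peak VRAM usage with and without the flag (for example with your GPU vendor's monitoring tool) is a quick way to confirm the saving on your hardware.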
## Use quantization to reduce memory usage.
See [quantization](./quantization_and_gguf.md) for details.