stable-diffusion.cpp/performance.md at default_flash_attn

mirror of https://github.com/leejet/stable-diffusion.cpp.git synced 2026-02-04 10:53:34 +00:00

leejet 8283e1bade enable flash attn by default

2026-01-11 17:35:01 +08:00

Offload weights to the CPU to save VRAM without reducing generation speed.

Using --offload-to-cpu allows you to offload weights to the CPU, saving VRAM without reducing generation speed.