stable-diffusion.cpp

mirror of https://github.com/leejet/stable-diffusion.cpp.git synced 2026-02-04 10:53:34 +00:00

Author	SHA1	Message	Date
leejet	dd75fc081c	refactor: unify the naming style of ggml extension functions (#921 ) master-343-dd75fc0	2025-10-28 23:26:48 +08:00
stduhpf	77eb95f8e4	docs: fix taesd direct download link (#917 )	2025-10-28 23:26:23 +08:00
Wagner Bruna	8a45d0ff7f	chore: clean up stb includes (#919 ) master-341-8a45d0f	2025-10-28 23:25:45 +08:00
leejet	9e28be6479	feat: add chroma radiance support (#910 ) * add chroma radiance support * fix ci * simply generate_init_latent * workaround: avoid ggml cuda error * format code * add chroma radiance doc master-340-9e28be6	2025-10-25 23:56:14 +08:00
akleine	062490aa7c	feat: add SSD1B and tiny-sd support (#897 ) * feat: add code and doc for running SSD1B models * Added some more lines to support SD1.x with TINY U-Nets too. * support SSD-1B.safetensors * fix sdv1.5 diffusers format loader --------- Co-authored-by: leejet <leejet714@gmail.com> master-339-062490a	2025-10-25 23:35:54 +08:00
stduhpf	faabc5ad3c	feat: allow models to run without all text encoder(s) (#645 ) master-338-faabc5a	2025-10-25 22:00:56 +08:00
leejet	69b9511ce9	sync: update ggml	2025-10-24 00:32:45 +08:00
stduhpf	917f7bfe99	fix: support `--flow-shift` for flux models with default pred (#913 ) master-336-917f7bf	2025-10-23 21:35:18 +08:00
leejet	48e0a28ddf	feat: add shift factor support (#903 ) master-335-48e0a28	2025-10-23 01:20:29 +08:00
leejet	d05e46ca5e	chore: add .clang-tidy configuration and apply modernize checks (#902 ) master-334-d05e46c	2025-10-18 23:23:40 +08:00
Wagner Bruna	64a7698347	chore: report number of Qwen layers as info (#901 ) master-333-64a7698	2025-10-18 23:22:01 +08:00
leejet	0723ee51c9	refactor: optimize option printing (#900 ) master-332-0723ee5	2025-10-18 17:50:30 +08:00
leejet	90ef5f8246	feat: add auto-resize support for reference images (was Qwen-Image-Edit only) (#898 ) master-331-90ef5f8	2025-10-18 16:37:09 +08:00
leejet	db6f4791b4	feat: add wtype stat (#899 ) master-330-db6f479	2025-10-17 23:40:32 +08:00
leejet	b25785bc10	sync: update ggml	2025-10-17 21:46:39 +08:00
leejet	0585e2609d	docs: split README sections (build, performance, etc.) into separate docs	2025-10-16 23:22:06 +08:00
leejet	683d6d08a8	chore: add github issue template	2025-10-16 21:04:41 +08:00
leejet	40a6a8710e	fix: resolve precision issues in SDXL VAE under fp16 (#888 ) * fix: resolve precision issues in SDXL VAE under fp16 * add --force-sdxl-vae-conv-scale option * update docs master-326-40a6a87	2025-10-15 23:01:00 +08:00
Daniele	e3702585cb	feat: added prediction argument (#334 ) master-325-e370258	2025-10-15 23:00:10 +08:00
cmdr2	a7d6d296c7	chore: allow building ggml as a separate shared lib (#468 ) master-324-a7d6d29	2025-10-15 22:10:26 +08:00
leejet	2e9242e37f	feat: add Qwen Image Edit support (#877 ) * add ref latent support for qwen image * optimize clip_preprocess and fix get_first_stage_encoding * add qwen2vl vit support * add qwen image edit support * fix qwen image edit pipeline * add mmproj file support * support dynamic number of Qwen image transformer blocks * set prompt_template_encode_start_idx every time * to_add_out precision fix * to_out.0 precision fix * update docs master-323-2e9242e	2025-10-13 23:17:18 +08:00
Wagner Bruna	c64994dc1d	fix: better progress display for second-order samplers (#834 ) master-322-c64994d	2025-10-13 22:12:48 +08:00
Wagner Bruna	5436f6b814	fix: correct canny preprocessor (#861 ) master-321-5436f6b	2025-10-13 22:02:35 +08:00
leejet	1c32fa03bc	fix: avoid generating black images when running T5 on the GPU (#882 ) master-320-1c32fa0	2025-10-13 00:01:06 +08:00
Wagner Bruna	9727c6bb98	fix: resolve VAE tiling problem in Qwen Image (#873 ) master-319-9727c6b	2025-10-12 23:45:53 +08:00
leejet	beb99a2de2	feat: add Qwen Image support (#851 ) * add qwen tokenizer * add qwen2.5 vl support * mv qwen.hpp -> qwenvl.hpp * add qwen image model * add qwen image t2i pipeline * fix qwen image flash attn * add qwen image i2i pipline * change encoding of vocab_qwen.hpp to utf8 * fix get_first_stage_encoding * apply jeffbolz f32 patch https://github.com/leejet/stable-diffusion.cpp/pull/851#issuecomment-3335515302 * fix the issue that occurs when using CUDA with k-quants weights * optimize the handling of the FeedForward precision fix * to_add_out precision fix * update docs master-318-beb99a2	2025-10-12 23:23:19 +08:00
Wagner Bruna	aa68b875b9	refactor: deal with default img-cfg-scale at the library level (#869 ) master-317-aa68b87	2025-10-12 23:17:52 +08:00
Wagner Bruna	5b261b9cee	feat: add a stand-alone upscale mode (#865 ) * feat: add a stand-alone upscale mode * fix prompt option check * format code * update README.md --------- Co-authored-by: leejet <leejet714@gmail.com> master-316-5b261b9	2025-10-12 23:10:02 +08:00
Pedrito	e70d0205ca	feat: add support for more esrgan models & x2 & x1 models (#855 ) master-315-e70d020	2025-10-12 22:53:31 +08:00
leejet	02af48a97f	chore: fix vulkan ci (#878 ) master-314-02af48a	2025-10-11 00:40:57 +08:00
leejet	e12d5e0aaf	fix: ensure directory iteration results are sorted by filename (#858 )	2025-10-11 00:18:39 +08:00
Serkan Sahin	940a2018e1	chore: fix dockerfile libgomp1 dependency + improvements (#852 )	2025-10-11 00:17:45 +08:00
Sharuzzaman Ahmat Raslan	b451728b2f	docs: update README.md (#866 )	2025-10-11 00:11:10 +08:00
stduhpf	11f436c483	feat: add support for Flux Controls and Flex.2 (#692 )	2025-10-11 00:06:57 +08:00
leejet	35843c77ea	fix: optimize the handling of embedding weight (#859 ) master-309-35843c7	2025-09-25 23:09:59 +08:00
leejet	6ad46bb700	sync: update ggml	2025-09-25 21:57:43 +08:00
leejet	1ba30ce005	sync: update ggml	2025-09-25 00:38:38 +08:00
leejet	2abe9451c4	fix: optimize the handling of CLIP embedding weight (#840 ) master-306-2abe945	2025-09-25 00:28:20 +08:00
Wagner Bruna	f3140eadbb	fix: tensor loading thread count (#854 ) master-305-f3140ea	2025-09-25 00:26:38 +08:00
Stefan-Olt	98ba155fc6	docs: HipBLAS / ROCm build instruction fix (#843 )	2025-09-25 00:03:05 +08:00
Wagner Bruna	513f36d495	docs: include Vulkan compatibility for LoRA quants (#845 )	2025-09-25 00:01:10 +08:00
rmatif	1e0d2821bb	fix: correct tensor deduplication logic (#844 ) master-302-1e0d282	2025-09-24 23:22:40 +08:00
leejet	fd693ac6a2	refactor: remove unused --normalize-input parameter (#835 ) master-301-fd693ac	2025-09-18 00:12:53 +08:00
Wagner Bruna	171b2222a5	fix: avoid segfault for pix2pix models without reference images (#766 ) * fix: avoid segfault for pix2pix models with no reference images * fix: default to empty reference on pix2pix models to avoid segfault * use resize instead of reserve * format code --------- Co-authored-by: leejet <leejet714@gmail.com> master-300-171b222	2025-09-18 00:11:38 +08:00
leejet	567f9f14f0	fix: avoid multithreading issues in the model loader master-299-567f9f1	2025-09-18 00:00:15 +08:00
leejet	1e5f207006	chore: fix workflow (#836 ) master-298-1e5f207	2025-09-17 22:11:55 +08:00
leejet	79426d578e	chore: set release tag by commit count	2025-09-16 23:24:36 +08:00
vmobilis	97ad3e7ff9	refactor: simplify DPM++ (2S) Ancestral (#667 ) master-97ad3e7	2025-09-16 23:05:25 +08:00
Erik Scholz	8909523e92	refactor: move tiling cacl and debug print into the tiling code branch (#833 ) master-8909523	2025-09-16 22:46:56 +08:00
rmatif	8376dfba2a	feat: add sgm_uniform scheduler, simple scheduler, and support for NitroFusion (#675 ) * feat: Add timestep shift and two new schedulers * update readme * fix spaces * format code * simplify SGMUniformSchedule * simplify shifted_timestep logic * avoid conflict --------- Co-authored-by: leejet <leejet714@gmail.com> master-8376dfb	2025-09-16 22:42:09 +08:00

1 2 3 4 5 ...

343 Commits