Daniele
6448430dbb
feat: add break pseudo token support ( #422 )
...
---------
Co-authored-by: Urs Ganse <urs.ganse@helsinki.fi>
master-359-6448430
2025-11-16 14:45:20 +08:00
leejet
347710f68f
feat: support applying LoRA at runtime ( #969 )
master-358-347710f
2025-11-13 21:48:44 +08:00
lcy
59ebdf0bb5
chrore: enable Windows ROCm(HIP) build release ( #956 )
...
* build: fix missing commit sha in macOS and Ubuntu build zip name
The build workflows for macOS and Ubuntu incorrectly check for the
"main" branch instead of "master" when retrieving the commit hash for
naming the build artifacts.
* build: correct Vulkan SDK installation condition in build workflow
* build: Enable Windows ROCm(HIP) build release
Refer to the build workflow of llama.cpp to add a Windows ROCm (HIP)
build release to the workflow.
Since there are many differences between the HIP build and other
builds, this commit add a separate "windows-latest-cmake-hip" job,
instead of enabling the ROCm matrix entry in the existing Windows
build job.
Main differences include:
- Install ROCm SDK from AMD official installer.
- Add a cache step for ROCm installation and a ccache step for build
processing, since the HIP build takes much longer time than other
builds.
- Include the ROCm/HIP artifact in the release assets.
master-357-59ebdf0
2025-11-12 00:28:55 +08:00
Flavio Bizzarri
4ffcbcaed7
fix: specify enum modifier in sd_set_preview_callback signature ( #959 )
master-356-4ffcbca
2025-11-12 00:27:23 +08:00
leejet
694f0d9235
refactor: optimize the logic for name conversion and the processing of the LoRA model ( #955 )
master-355-694f0d9
2025-11-10 00:12:20 +08:00
stduhpf
8ecdf053ac
feat: add image preview support ( #522 )
master-354-8ecdf05
2025-11-10 00:12:02 +08:00
leejet
ee89afc878
fix: resolve issue with pmid ( #957 )
master-353-ee89afc
2025-11-09 22:47:53 +08:00
akleine
d2d3944f50
feat: add support for SD2.x with TINY U-Nets ( #939 )
master-352-d2d3944
2025-11-09 22:47:37 +08:00
akleine
0fa3e1a383
fix: prevent core dump in PM V2 in case of incomplete cmd line ( #950 )
master-351-0fa3e1a
2025-11-09 22:36:43 +08:00
leejet
c2d8ffc22c
fix: compatibility for models with modified tensor shapes ( #951 )
master-350-c2d8ffc
2025-11-07 23:04:41 +08:00
stduhpf
fb748bb8a4
fix: TAE encoding ( #935 )
master-349-fb748bb
2025-11-07 22:58:59 +08:00
leejet
8f6c5c217b
refactor: simplify the model loading logic ( #933 )
...
* remove String2GGMLType
* remove preprocess_tensor
* fix clip init
* simplify the logic for reading weights
master-348-8f6c5c2
2025-11-03 21:21:34 +08:00
leejet
6103d86e2c
refactor: introduce GGMLRunnerContext ( #928 )
...
* introduce GGMLRunnerContext
* add Flash Attention enable control through GGMLRunnerContext
* add conv2d_direct enable control through GGMLRunnerContext
master-347-6103d86
2025-11-02 02:11:04 +08:00
stduhpf
c42826b77c
fix: resolve multiple inpainting issues ( #926 )
...
* Fix inpainting masked image being broken by side effect
* Fix unet inpainting concat not being set
* Fix Flex.2 inpaint mode crash (+ use scale factor)
master-346-c42826b
2025-11-02 02:10:32 +08:00
Wagner Bruna
945d9a9ee3
docs: add Koboldcpp as an available UI ( #930 )
2025-11-02 02:03:01 +08:00
Wagner Bruna
353e708844
docs: update ggml and llama.cpp URLs ( #931 )
2025-11-02 02:02:44 +08:00
leejet
dd75fc081c
refactor: unify the naming style of ggml extension functions ( #921 )
master-343-dd75fc0
2025-10-28 23:26:48 +08:00
stduhpf
77eb95f8e4
docs: fix taesd direct download link ( #917 )
2025-10-28 23:26:23 +08:00
Wagner Bruna
8a45d0ff7f
chore: clean up stb includes ( #919 )
master-341-8a45d0f
2025-10-28 23:25:45 +08:00
leejet
9e28be6479
feat: add chroma radiance support ( #910 )
...
* add chroma radiance support
* fix ci
* simply generate_init_latent
* workaround: avoid ggml cuda error
* format code
* add chroma radiance doc
master-340-9e28be6
2025-10-25 23:56:14 +08:00
akleine
062490aa7c
feat: add SSD1B and tiny-sd support ( #897 )
...
* feat: add code and doc for running SSD1B models
* Added some more lines to support SD1.x with TINY U-Nets too.
* support SSD-1B.safetensors
* fix sdv1.5 diffusers format loader
---------
Co-authored-by: leejet <leejet714@gmail.com>
master-339-062490a
2025-10-25 23:35:54 +08:00
stduhpf
faabc5ad3c
feat: allow models to run without all text encoder(s) ( #645 )
master-338-faabc5a
2025-10-25 22:00:56 +08:00
leejet
69b9511ce9
sync: update ggml
2025-10-24 00:32:45 +08:00
stduhpf
917f7bfe99
fix: support --flow-shift for flux models with default pred ( #913 )
master-336-917f7bf
2025-10-23 21:35:18 +08:00
leejet
48e0a28ddf
feat: add shift factor support ( #903 )
master-335-48e0a28
2025-10-23 01:20:29 +08:00
leejet
d05e46ca5e
chore: add .clang-tidy configuration and apply modernize checks ( #902 )
master-334-d05e46c
2025-10-18 23:23:40 +08:00
Wagner Bruna
64a7698347
chore: report number of Qwen layers as info ( #901 )
master-333-64a7698
2025-10-18 23:22:01 +08:00
leejet
0723ee51c9
refactor: optimize option printing ( #900 )
master-332-0723ee5
2025-10-18 17:50:30 +08:00
leejet
90ef5f8246
feat: add auto-resize support for reference images (was Qwen-Image-Edit only) ( #898 )
master-331-90ef5f8
2025-10-18 16:37:09 +08:00
leejet
db6f4791b4
feat: add wtype stat ( #899 )
master-330-db6f479
2025-10-17 23:40:32 +08:00
leejet
b25785bc10
sync: update ggml
2025-10-17 21:46:39 +08:00
leejet
0585e2609d
docs: split README sections (build, performance, etc.) into separate docs
2025-10-16 23:22:06 +08:00
leejet
683d6d08a8
chore: add github issue template
2025-10-16 21:04:41 +08:00
leejet
40a6a8710e
fix: resolve precision issues in SDXL VAE under fp16 ( #888 )
...
* fix: resolve precision issues in SDXL VAE under fp16
* add --force-sdxl-vae-conv-scale option
* update docs
master-326-40a6a87
2025-10-15 23:01:00 +08:00
Daniele
e3702585cb
feat: added prediction argument ( #334 )
master-325-e370258
2025-10-15 23:00:10 +08:00
cmdr2
a7d6d296c7
chore: allow building ggml as a separate shared lib ( #468 )
master-324-a7d6d29
2025-10-15 22:10:26 +08:00
leejet
2e9242e37f
feat: add Qwen Image Edit support ( #877 )
...
* add ref latent support for qwen image
* optimize clip_preprocess and fix get_first_stage_encoding
* add qwen2vl vit support
* add qwen image edit support
* fix qwen image edit pipeline
* add mmproj file support
* support dynamic number of Qwen image transformer blocks
* set prompt_template_encode_start_idx every time
* to_add_out precision fix
* to_out.0 precision fix
* update docs
master-323-2e9242e
2025-10-13 23:17:18 +08:00
Wagner Bruna
c64994dc1d
fix: better progress display for second-order samplers ( #834 )
master-322-c64994d
2025-10-13 22:12:48 +08:00
Wagner Bruna
5436f6b814
fix: correct canny preprocessor ( #861 )
master-321-5436f6b
2025-10-13 22:02:35 +08:00
leejet
1c32fa03bc
fix: avoid generating black images when running T5 on the GPU ( #882 )
master-320-1c32fa0
2025-10-13 00:01:06 +08:00
Wagner Bruna
9727c6bb98
fix: resolve VAE tiling problem in Qwen Image ( #873 )
master-319-9727c6b
2025-10-12 23:45:53 +08:00
leejet
beb99a2de2
feat: add Qwen Image support ( #851 )
...
* add qwen tokenizer
* add qwen2.5 vl support
* mv qwen.hpp -> qwenvl.hpp
* add qwen image model
* add qwen image t2i pipeline
* fix qwen image flash attn
* add qwen image i2i pipline
* change encoding of vocab_qwen.hpp to utf8
* fix get_first_stage_encoding
* apply jeffbolz f32 patch
https://github.com/leejet/stable-diffusion.cpp/pull/851#issuecomment-3335515302
* fix the issue that occurs when using CUDA with k-quants weights
* optimize the handling of the FeedForward precision fix
* to_add_out precision fix
* update docs
master-318-beb99a2
2025-10-12 23:23:19 +08:00
Wagner Bruna
aa68b875b9
refactor: deal with default img-cfg-scale at the library level ( #869 )
master-317-aa68b87
2025-10-12 23:17:52 +08:00
Wagner Bruna
5b261b9cee
feat: add a stand-alone upscale mode ( #865 )
...
* feat: add a stand-alone upscale mode
* fix prompt option check
* format code
* update README.md
---------
Co-authored-by: leejet <leejet714@gmail.com>
master-316-5b261b9
2025-10-12 23:10:02 +08:00
Pedrito
e70d0205ca
feat: add support for more esrgan models & x2 & x1 models ( #855 )
master-315-e70d020
2025-10-12 22:53:31 +08:00
leejet
02af48a97f
chore: fix vulkan ci ( #878 )
master-314-02af48a
2025-10-11 00:40:57 +08:00
leejet
e12d5e0aaf
fix: ensure directory iteration results are sorted by filename ( #858 )
2025-10-11 00:18:39 +08:00
Serkan Sahin
940a2018e1
chore: fix dockerfile libgomp1 dependency + improvements ( #852 )
2025-10-11 00:17:45 +08:00
Sharuzzaman Ahmat Raslan
b451728b2f
docs: update README.md ( #866 )
2025-10-11 00:11:10 +08:00
stduhpf
11f436c483
feat: add support for Flux Controls and Flex.2 ( #692 )
2025-10-11 00:06:57 +08:00