leejet
694f0d9235
refactor: optimize the logic for name conversion and the processing of the LoRA model ( #955 )
2025-11-10 00:12:20 +08:00
leejet
8f6c5c217b
refactor: simplify the model loading logic ( #933 )
...
* remove String2GGMLType
* remove preprocess_tensor
* fix clip init
* simplify the logic for reading weights
2025-11-03 21:21:34 +08:00
leejet
6103d86e2c
refactor: introduce GGMLRunnerContext ( #928 )
...
* introduce GGMLRunnerContext
* add Flash Attention enable control through GGMLRunnerContext
* add conv2d_direct enable control through GGMLRunnerContext
2025-11-02 02:11:04 +08:00
leejet
dd75fc081c
refactor: unify the naming style of ggml extension functions ( #921 )
2025-10-28 23:26:48 +08:00
leejet
9e28be6479
feat: add chroma radiance support ( #910 )
...
* add chroma radiance support
* fix ci
* simply generate_init_latent
* workaround: avoid ggml cuda error
* format code
* add chroma radiance doc
2025-10-25 23:56:14 +08:00
leejet
d05e46ca5e
chore: add .clang-tidy configuration and apply modernize checks ( #902 )
2025-10-18 23:23:40 +08:00
Wagner Bruna
64a7698347
chore: report number of Qwen layers as info ( #901 )
2025-10-18 23:22:01 +08:00
leejet
40a6a8710e
fix: resolve precision issues in SDXL VAE under fp16 ( #888 )
...
* fix: resolve precision issues in SDXL VAE under fp16
* add --force-sdxl-vae-conv-scale option
* update docs
2025-10-15 23:01:00 +08:00
leejet
2e9242e37f
feat: add Qwen Image Edit support ( #877 )
...
* add ref latent support for qwen image
* optimize clip_preprocess and fix get_first_stage_encoding
* add qwen2vl vit support
* add qwen image edit support
* fix qwen image edit pipeline
* add mmproj file support
* support dynamic number of Qwen image transformer blocks
* set prompt_template_encode_start_idx every time
* to_add_out precision fix
* to_out.0 precision fix
* update docs
2025-10-13 23:17:18 +08:00
leejet
beb99a2de2
feat: add Qwen Image support ( #851 )
...
* add qwen tokenizer
* add qwen2.5 vl support
* mv qwen.hpp -> qwenvl.hpp
* add qwen image model
* add qwen image t2i pipeline
* fix qwen image flash attn
* add qwen image i2i pipline
* change encoding of vocab_qwen.hpp to utf8
* fix get_first_stage_encoding
* apply jeffbolz f32 patch
https://github.com/leejet/stable-diffusion.cpp/pull/851#issuecomment-3335515302
* fix the issue that occurs when using CUDA with k-quants weights
* optimize the handling of the FeedForward precision fix
* to_add_out precision fix
* update docs
2025-10-12 23:23:19 +08:00