rmatif
0e52afc651
feat: enable vae tiling for vid gen ( #1152 )
...
* enable vae tiling for vid gen
* format code
* eliminate compilation warning
---------
Co-authored-by: leejet <leejet714@gmail.com>
2026-01-08 23:23:05 +08:00
Flavio Bizzarri
dfe6d6c664
fix: missing newline after seed in sd_img_gen_params_to_str ( #1183 )
2026-01-08 22:52:22 +08:00
leejet
b90b1ee9cf
chore: eliminate compilation warnings under MSVC ( #1170 )
2026-01-04 22:26:57 +08:00
Wagner Bruna
d0d836ae74
feat: support mmap for model loading ( #1059 )
2025-12-28 22:38:29 +08:00
leejet
a2d83dd0c8
refactor: move pmid condition logic into get_pmid_condition ( #1148 )
2025-12-27 16:48:15 +08:00
leejet
ccb6b0ac9d
feat: add __index_timestep_zero__ support ( #1146 )
2025-12-26 22:07:40 +08:00
leejet
860a78e248
fix: avoid crash when using taesd for preview only ( #1141 )
2025-12-24 23:30:12 +08:00
leejet
a0adcfb148
feat: add support for qwen image edit 2511 ( #1096 )
2025-12-24 23:00:08 +08:00
rmatif
298b11069f
feat: add more caching methods ( #1066 )
2025-12-22 23:52:11 +08:00
Phylliida Dev
50ff966445
feat: add seamless texture generation support ( #914 )
...
* global bool
* reworked circular to global flag
* cleaner implementation of tiling support in sd cpp
* cleaned rope
* working simplified but still need wraps
* Further clean of rope
* resolve flux conflict
* switch to pad op circular only
* Set ggml to most recent
* Revert ggml temp
* Update ggml to most recent
* Revert unneded flux change
* move circular flag to the GGMLRunnerContext
* Pass through circular param in all places where conv is called
* fix of constant and minor cleanup
* Added back --circular option
* Conv2d circular in vae and various models
* Fix temporal padding for qwen image and other vaes
* Z Image circular tiling
* x and y axis seamless only
* First attempt at chroma seamless x and y
* refactor into pure x and y, almost there
* Fix crash on chroma
* Refactor into cleaner variable choices
* Removed redundant set_circular_enabled
* Sync ggml
* simplify circular parameter
* format code
* no need to perform circular pad on the clip
* simplify circular_axes setting
* unify function naming
* remove unnecessary member variables
* simplify rope
---------
Co-authored-by: Phylliida <phylliidadev@gmail.com>
Co-authored-by: leejet <leejet714@gmail.com>
2025-12-21 18:06:47 +08:00
stduhpf
60abda56e0
feat: select vulkan device with env variable ( #629 )
2025-12-21 15:35:38 +08:00
stduhpf
23fce0bd84
feat: add support for Chroma Radiance x0 ( #1091 )
...
* Add x0 Flux pred (+prepare for others)
* Fix convert models with empty tensors
* patch_32 exp support attempt
* improve support for patch_32
* follow official pipeline
---------
Co-authored-by: leejet <leejet714@gmail.com>
2025-12-20 00:55:57 +08:00
Wagner Bruna
7c88c4765c
chore: give feedback about cfg values smaller than 1 ( #1088 )
2025-12-19 23:41:52 +08:00
Wagner Bruna
78e15bd4af
feat: default to LCM scheduler for LCM sampling ( #1109 )
...
* feat: default to LCM scheduler for LCM sampling
* fix bug and attempt to get default scheduler for vid_gen when none is set
---------
Co-authored-by: leejet <leejet714@gmail.com>
2025-12-18 21:43:39 +08:00
Daniele
97cf2efe45
feat: add KL Optimal scheduler ( #1098 )
2025-12-18 21:02:55 +08:00
leejet
bda7fab9f2
chore: remove unused debug code
2025-12-17 23:43:37 +08:00
leejet
c2e18c86e8
fix: make flash attn work with high noise diffusion model ( #1111 )
2025-12-17 23:28:59 +08:00
stduhpf
9fa7f415df
feat: add taehv support for Wan/Qwen ( #937 )
2025-12-16 22:57:34 +08:00
Wagner Bruna
e687913bf1
chore: remove lora_model_dir parameter ( #1100 )
2025-12-16 22:37:45 +08:00
rmatif
8f05f5bc6e
feat: add support for custom scheduler ( #694 )
...
---------
Co-authored-by: leejet <leejet714@gmail.com>
2025-12-13 16:20:02 +08:00
leejet
8823dc48bc
feat: align the spatial size to the corresponding multiple ( #1073 )
2025-12-10 23:15:08 +08:00
leejet
d939f6e86a
refactor: optimize the handling of LoRA models ( #1070 )
2025-12-10 00:26:07 +08:00
stduhpf
583a02e29e
feat: add Flux.2 VAE proj matrix for previews ( #1017 )
2025-12-09 22:00:45 +08:00
leejet
96c3e64057
refactor: optimize the handling of embedding ( #1068 )
...
* optimize the handling of embedding
* support case-insensitive embedding names
2025-12-08 23:59:04 +08:00
leejet
2f0bd31a84
feat: add ovis image support ( #1057 )
2025-12-07 12:32:56 +08:00
leejet
bfbb929790
feat: do not convert bf16 to f32 ( #1055 )
2025-12-06 23:55:51 +08:00
leejet
985aedda32
refactor: optimize the handling of pred type ( #1048 )
2025-12-04 23:31:55 +08:00
leejet
3f3610b5cd
chore: optimize lora log ( #1047 )
2025-12-04 22:44:58 +08:00
stduhpf
bcc9c0d0b3
feat: handle ggml compute failures without crashing the program ( #1003 )
...
* Feat: handle compute failures more gracefully
* fix Unreachable code after return
Co-authored-by: idostyle <idostyl3@googlemail.com>
* adjust z_image.hpp
---------
Co-authored-by: idostyle <idostyl3@googlemail.com>
Co-authored-by: leejet <leejet714@gmail.com>
2025-12-04 22:04:27 +08:00
leejet
5865b5e703
refactor: split SDParams to SDCliParams/SDContextParams/SDGenerationParams ( #1032 )
2025-12-03 22:31:46 +08:00
Wagner Bruna
e4c50f1de5
chore: add sd_ prefix to a few functions ( #967 )
2025-12-01 22:43:52 +08:00
rmatif
0743a1b3b5
fix: fix vae tiling for flux2 ( #1025 )
2025-12-01 22:41:56 +08:00
leejet
34a6fd4e60
feat: add z-image support ( #1020 )
...
* add z-image support
* use flux_latent_rgb_proj for z-image
* fix qwen3 rope type
* add support for qwen3 4b gguf
* add support for diffusers format lora
* fix nan issue that occurs when using CUDA with k-quants weights
* add z-image docs
2025-12-01 22:39:43 +08:00
leejet
20eb674100
fix: avoid crash when the lora file is not found using immediately mode ( #1022 )
2025-11-30 12:19:37 +08:00
Wagner Bruna
0249509a30
refactor: add user data pointer to the image preview callback ( #1001 )
2025-11-30 11:34:17 +08:00
leejet
52b67c538b
feat: add flux2 support ( #1016 )
...
* add flux2 support
* rename qwenvl to llm
* add Flux2FlowDenoiser
* update docs
2025-11-30 11:32:56 +08:00
leejet
20345888a3
refactor: optimize the handling of sample method ( #999 )
2025-11-22 14:00:25 +08:00
Wagner Bruna
45c46779af
feat: add LCM scheduler ( #983 )
2025-11-22 13:53:31 +08:00
leejet
869d023416
refactor: optimize the handling of scheduler ( #998 )
2025-11-22 12:48:53 +08:00
akleine
e9bc3b6c06
fix: check the PhotoMaker id_embeds tensor ONLY in PhotoMaker V2 mode ( #987 )
2025-11-22 12:47:40 +08:00
leejet
5498cc0d67
feat: add Wan2.1-I2V-1.3B(SkyReels) support ( #988 )
2025-11-19 23:56:46 +08:00
rmatif
a14e2b321d
feat: add easycache support ( #940 )
2025-11-19 23:19:32 +08:00
leejet
b88cc32346
fix: avoid using same type but diff instances for rng and sampler_rng ( #982 )
2025-11-16 23:37:14 +08:00
leejet
d5b05f70c6
feat: support independent sampler rng ( #978 )
2025-11-16 17:11:02 +08:00
akleine
6d6dc1b8ed
fix: make PhotoMakerV2 more robust by image count check ( #970 )
2025-11-16 17:10:48 +08:00
Wagner Bruna
199e675cc7
feat: support for --tensor-type-rules on generation modes ( #932 )
2025-11-16 17:07:32 +08:00
leejet
742a7333c3
feat: add cpu rng ( #977 )
2025-11-16 14:48:15 +08:00
Wagner Bruna
aa44e06890
fix: avoid crash with LoRAs and type override ( #974 )
2025-11-16 14:47:36 +08:00
leejet
347710f68f
feat: support applying LoRA at runtime ( #969 )
2025-11-13 21:48:44 +08:00
leejet
694f0d9235
refactor: optimize the logic for name conversion and the processing of the LoRA model ( #955 )
2025-11-10 00:12:20 +08:00