leejet
6e66a1a4a4
fix: allow oversized Vulkan parameter tensors ( #1662 )
master-704-6e66a1a
2026-06-15 23:18:52 +08:00
leejet
bb90bfa00f
feat: support backend-specific max-vram budgets
master-703-bb90bfa
2026-06-14 22:46:32 +08:00
leejet
517abc777d
sync: update ggml ( #1656 )
2026-06-14 20:45:05 +08:00
leejet
6f00939f75
docs: refresh README guide links
2026-06-14 17:58:58 +08:00
stduhpf
c2df4e1228
feat: add RPC support ( #1629 )
master-700-c2df4e1
2026-06-14 17:30:23 +08:00
leejet
9838264c49
refactor: simplify ControlNet output caching ( #1655 )
master-699-9838264
2026-06-14 16:58:37 +08:00
leejet
17d70b91e6
docs: replace example option lists with help commands
2026-06-14 16:55:15 +08:00
leejet
5db680c2c7
refactor: route cpu placement through backend specs ( #1654 )
master-697-5db680c
2026-06-14 15:52:24 +08:00
leejet
749186c0eb
refactor: remove vae_decode_only context flag ( #1653 )
master-696-749186c
2026-06-14 15:23:29 +08:00
leejet
bdb431ad95
feat: support disk params backend ( #1651 )
master-695-bdb431a
2026-06-14 14:48:50 +08:00
leejet
276025e054
fix: mark LoKR w2_a tensor as applied ( #1650 )
master-694-276025e
2026-06-14 02:11:02 +08:00
leejet
8d4c7af95b
refactor: route all runner params through model manager ( #1649 )
master-693-8d4c7af
2026-06-14 02:05:23 +08:00
leejet
9b0fceb41b
refactor: manage upscaler params through model manager ( #1645 )
master-692-9b0fceb
2026-06-13 15:39:57 +08:00
leejet
563137a592
refactor: centralize runner weight staging and cleanup ( #1644 )
master-691-563137a
2026-06-13 13:19:13 +08:00
Wyatt Caldwell
3a54597776
fix: SD3 conditioning crash when clip_l text encoder is missing ( #1638 )
master-690-3a54597
2026-06-13 13:16:59 +08:00
Cyberhan123
1365008348
chore: add script for automatic code formatting ( #1636 )
2026-06-13 13:13:07 +08:00
Cyberhan123
1fb6b22850
feat: add free_sd_images function to manage memory for C API ( #1633 )
master-688-1fb6b22
2026-06-13 13:08:14 +08:00
stduhpf
c20769b2c8
feat: add circular RoPE support for ideogram4 ( #1627 )
master-687-c20769b
2026-06-13 13:06:34 +08:00
RapidMark
1b702a51e7
fix: correct mask shape for masked flash attention ( #1625 )
master-686-1b702a5
2026-06-13 13:01:20 +08:00
RapidMark
19bdfe22d2
feat: set tensor names on block params ( #1622 )
master-685-19bdfe2
2026-06-08 23:25:52 +08:00
stduhpf
138da14cc3
apg: normalize diff_norm calculation by tensor size ( #1620 )
master-684-138da14
2026-06-08 21:56:15 +08:00
fszontagh
17a2b4a315
perf: cap planner budget when model dwarfs the streaming budget ( #1612 )
master-683-17a2b4a
2026-06-08 21:53:54 +08:00
leejet
b3d56d0ba1
refactor: split model loader from model definitions ( #1619 )
master-682-b3d56d0
2026-06-07 23:20:12 +08:00
leejet
2a07540c2a
refactor: move photomaker into generation extension ( #1618 )
master-681-2a07540
2026-06-07 22:40:02 +08:00
Wagner Bruna
81abfb2548
chore: rename and reformat gits_noise.inl ( #1617 )
master-680-81abfb2
2026-06-07 22:30:20 +08:00
leejet
f3fd359b58
refactor: reorganize src model layout ( #1615 )
master-679-f3fd359
2026-06-07 03:21:12 +08:00
leejet
dfb2390dd4
refactor: extract Wan VAE implementation ( #1614 )
master-678-dfb2390
2026-06-07 01:33:49 +08:00
leejet
cfbc19d186
refactor: unify model config detection ( #1613 )
master-677-cfbc19d
2026-06-07 01:05:12 +08:00
leejet
b9254dda0d
feat: add ideogram4 support ( #1609 )
master-676-b9254dd
2026-06-06 16:34:16 +08:00
fszontagh
0648f4426b
perf: ratchet streaming budget so plan stops re-merging every step ( #1611 )
master-675-0648f44
2026-06-06 16:32:03 +08:00
YOSHIDA Keiji
74f513d512
fix: Suppress spurious error output for --help ( #1607 ) ( #1608 )
...
Signed-off-by: kei-g <km.8k6ce+github@gmail.com>
master-674-74f513d
2026-06-06 16:23:44 +08:00
fszontagh
064001b524
perf: allocate CPU-offloaded params from runtime device pinned host buffer ( #1601 )
master-673-064001b
2026-06-06 16:22:18 +08:00
leejet
1f9ee88e09
fix: zero Wan2.2 TI2V timesteps for fixed frames ( #1604 )
master-672-1f9ee88
2026-06-03 23:32:31 +08:00
fszontagh
a7f2e03da4
perf: keep chunk-K residency engaged with runtime LoRA ( #1598 )
master-671-a7f2e03
2026-06-03 23:12:00 +08:00
stduhpf
4513e3fda9
refactor: img-cond->img_uncond ( #1594 )
...
* refactor: img-cond->img_uncond
* align APG and CFG++ with img-uncond CFG
* set default img_cfg to 1.f
---------
Co-authored-by: leejet <leejet714@gmail.com>
master-670-4513e3f
2026-06-03 22:57:42 +08:00
leejet
2d40a8b2ad
feat: make Wan2.2 5B FLF2V work ( #1110 )
master-669-2d40a8b
2026-06-02 23:16:09 +08:00
leejet
9c7f9a20b3
chore: embed server web UI in Docker images ( #1597 )
master-668-9c7f9a2
2026-06-02 22:46:25 +08:00
fszontagh
ed74577c40
feat: --stream-layers for streaming weights from CPU during generation ( #1576 )
master-667-ed74577
2026-06-02 22:35:28 +08:00
RapidMark
7948df8ac1
fix(cmake): build HIP backend with PIC so the static-lib PIE link succeeds ( #1593 )
master-666-7948df8
2026-06-02 00:07:48 +08:00
Wagner Bruna
02f06370a7
refactor: call CPU backend functions dynamically ( #1591 )
...
Co-authored-by: leejet <leejet714@gmail.com>
master-665-02f0637
2026-06-01 23:41:21 +08:00
stduhpf
f8935d6f25
feat: support img-cfg for edit models ( #929 )
...
Co-authored-by: leejet <leejet714@gmail.com>
master-664-f8935d6
2026-06-01 22:54:25 +08:00
stduhpf
be65ac7511
feat: add support for APG (adaptive projected guidance) + unconditionnal SLG ( #593 )
master-663-be65ac7
2026-06-01 00:55:49 +08:00
leejet
20901f6d8e
fix: remove kv padding from flash attention wrapper ( #1453 )
master-662-20901f6
2026-05-31 23:23:19 +08:00
leejet
0982807139
feat: add PiD support ( #1585 )
master-661-0982807
2026-05-31 22:38:39 +08:00
leejet
d2797b8667
fix: correct Gemma3 rope settings and vram limit propagation ( #1583 )
master-660-d2797b8
2026-05-30 22:23:49 +08:00
leejet
d3b2cb047e
fix: split tokens before normalization ( #1582 )
master-659-d3b2cb0
2026-05-30 18:38:46 +08:00
akleine
b4ba55d8d7
fix: prevent crash in case of a mem alloc error and graceful exit ( #1566 )
master-658-b4ba55d
2026-05-30 18:34:07 +08:00
Wagner Bruna
b54bd83a3f
fix: explicitly exclude f8, f64 and i64 tensors from mmap ( #1575 )
master-657-b54bd83
2026-05-30 18:31:08 +08:00
Wagner Bruna
0e4ee04488
fix: correct tae for models that use the flux2 vae ( #1571 )
master-656-0e4ee04
2026-05-28 09:13:16 +08:00
leejet
29ab511fc7
fix: resolve LLM norm tensor names by architecture ( #1570 )
master-655-29ab511
2026-05-28 00:36:16 +08:00