703 Commits

Author SHA1 Message Date
leejet
bb90bfa00f feat: support backend-specific max-vram budgets master-703-bb90bfa 2026-06-14 22:46:32 +08:00
leejet
517abc777d
sync: update ggml (#1656) 2026-06-14 20:45:05 +08:00
leejet
6f00939f75 docs: refresh README guide links 2026-06-14 17:58:58 +08:00
stduhpf
c2df4e1228
feat: add RPC support (#1629) master-700-c2df4e1 2026-06-14 17:30:23 +08:00
leejet
9838264c49
refactor: simplify ControlNet output caching (#1655) master-699-9838264 2026-06-14 16:58:37 +08:00
leejet
17d70b91e6 docs: replace example option lists with help commands 2026-06-14 16:55:15 +08:00
leejet
5db680c2c7
refactor: route cpu placement through backend specs (#1654) master-697-5db680c 2026-06-14 15:52:24 +08:00
leejet
749186c0eb
refactor: remove vae_decode_only context flag (#1653) master-696-749186c 2026-06-14 15:23:29 +08:00
leejet
bdb431ad95
feat: support disk params backend (#1651) master-695-bdb431a 2026-06-14 14:48:50 +08:00
leejet
276025e054
fix: mark LoKR w2_a tensor as applied (#1650) master-694-276025e 2026-06-14 02:11:02 +08:00
leejet
8d4c7af95b
refactor: route all runner params through model manager (#1649) master-693-8d4c7af 2026-06-14 02:05:23 +08:00
leejet
9b0fceb41b
refactor: manage upscaler params through model manager (#1645) master-692-9b0fceb 2026-06-13 15:39:57 +08:00
leejet
563137a592
refactor: centralize runner weight staging and cleanup (#1644) master-691-563137a 2026-06-13 13:19:13 +08:00
Wyatt Caldwell
3a54597776
fix: SD3 conditioning crash when clip_l text encoder is missing (#1638) master-690-3a54597 2026-06-13 13:16:59 +08:00
Cyberhan123
1365008348
chore: add script for automatic code formatting (#1636) 2026-06-13 13:13:07 +08:00
Cyberhan123
1fb6b22850
feat: add free_sd_images function to manage memory for C API (#1633) master-688-1fb6b22 2026-06-13 13:08:14 +08:00
stduhpf
c20769b2c8
feat: add circular RoPE support for ideogram4 (#1627) master-687-c20769b 2026-06-13 13:06:34 +08:00
RapidMark
1b702a51e7
fix: correct mask shape for masked flash attention (#1625) master-686-1b702a5 2026-06-13 13:01:20 +08:00
RapidMark
19bdfe22d2
feat: set tensor names on block params (#1622) master-685-19bdfe2 2026-06-08 23:25:52 +08:00
stduhpf
138da14cc3
apg: normalize diff_norm calculation by tensor size (#1620) master-684-138da14 2026-06-08 21:56:15 +08:00
fszontagh
17a2b4a315
perf: cap planner budget when model dwarfs the streaming budget (#1612) master-683-17a2b4a 2026-06-08 21:53:54 +08:00
leejet
b3d56d0ba1
refactor: split model loader from model definitions (#1619) master-682-b3d56d0 2026-06-07 23:20:12 +08:00
leejet
2a07540c2a
refactor: move photomaker into generation extension (#1618) master-681-2a07540 2026-06-07 22:40:02 +08:00
Wagner Bruna
81abfb2548
chore: rename and reformat gits_noise.inl (#1617) master-680-81abfb2 2026-06-07 22:30:20 +08:00
leejet
f3fd359b58
refactor: reorganize src model layout (#1615) master-679-f3fd359 2026-06-07 03:21:12 +08:00
leejet
dfb2390dd4
refactor: extract Wan VAE implementation (#1614) master-678-dfb2390 2026-06-07 01:33:49 +08:00
leejet
cfbc19d186
refactor: unify model config detection (#1613) master-677-cfbc19d 2026-06-07 01:05:12 +08:00
leejet
b9254dda0d
feat: add ideogram4 support (#1609) master-676-b9254dd 2026-06-06 16:34:16 +08:00
fszontagh
0648f4426b
perf: ratchet streaming budget so plan stops re-merging every step (#1611) master-675-0648f44 2026-06-06 16:32:03 +08:00
YOSHIDA Keiji
74f513d512
fix: Suppress spurious error output for --help (#1607) (#1608)
Signed-off-by: kei-g <km.8k6ce+github@gmail.com>
master-674-74f513d
2026-06-06 16:23:44 +08:00
fszontagh
064001b524
perf: allocate CPU-offloaded params from runtime device pinned host buffer (#1601) master-673-064001b 2026-06-06 16:22:18 +08:00
leejet
1f9ee88e09
fix: zero Wan2.2 TI2V timesteps for fixed frames (#1604) master-672-1f9ee88 2026-06-03 23:32:31 +08:00
fszontagh
a7f2e03da4
perf: keep chunk-K residency engaged with runtime LoRA (#1598) master-671-a7f2e03 2026-06-03 23:12:00 +08:00
stduhpf
4513e3fda9
refactor: img-cond->img_uncond (#1594)
* refactor: img-cond->img_uncond

* align APG and CFG++ with img-uncond CFG

* set default img_cfg to 1.f

---------

Co-authored-by: leejet <leejet714@gmail.com>
master-670-4513e3f
2026-06-03 22:57:42 +08:00
leejet
2d40a8b2ad
feat: make Wan2.2 5B FLF2V work (#1110) master-669-2d40a8b 2026-06-02 23:16:09 +08:00
leejet
9c7f9a20b3
chore: embed server web UI in Docker images (#1597) master-668-9c7f9a2 2026-06-02 22:46:25 +08:00
fszontagh
ed74577c40
feat: --stream-layers for streaming weights from CPU during generation (#1576) master-667-ed74577 2026-06-02 22:35:28 +08:00
RapidMark
7948df8ac1
fix(cmake): build HIP backend with PIC so the static-lib PIE link succeeds (#1593) master-666-7948df8 2026-06-02 00:07:48 +08:00
Wagner Bruna
02f06370a7
refactor: call CPU backend functions dynamically (#1591)
Co-authored-by: leejet <leejet714@gmail.com>
master-665-02f0637
2026-06-01 23:41:21 +08:00
stduhpf
f8935d6f25
feat: support img-cfg for edit models (#929)
Co-authored-by: leejet <leejet714@gmail.com>
master-664-f8935d6
2026-06-01 22:54:25 +08:00
stduhpf
be65ac7511
feat: add support for APG (adaptive projected guidance) + unconditionnal SLG (#593) master-663-be65ac7 2026-06-01 00:55:49 +08:00
leejet
20901f6d8e
fix: remove kv padding from flash attention wrapper (#1453) master-662-20901f6 2026-05-31 23:23:19 +08:00
leejet
0982807139
feat: add PiD support (#1585) master-661-0982807 2026-05-31 22:38:39 +08:00
leejet
d2797b8667
fix: correct Gemma3 rope settings and vram limit propagation (#1583) master-660-d2797b8 2026-05-30 22:23:49 +08:00
leejet
d3b2cb047e
fix: split tokens before normalization (#1582) master-659-d3b2cb0 2026-05-30 18:38:46 +08:00
akleine
b4ba55d8d7
fix: prevent crash in case of a mem alloc error and graceful exit (#1566) master-658-b4ba55d 2026-05-30 18:34:07 +08:00
Wagner Bruna
b54bd83a3f
fix: explicitly exclude f8, f64 and i64 tensors from mmap (#1575) master-657-b54bd83 2026-05-30 18:31:08 +08:00
Wagner Bruna
0e4ee04488
fix: correct tae for models that use the flux2 vae (#1571) master-656-0e4ee04 2026-05-28 09:13:16 +08:00
leejet
29ab511fc7
fix: resolve LLM norm tensor names by architecture (#1570) master-655-29ab511 2026-05-28 00:36:16 +08:00
leejet
55c2aed52c
refactor: simplify diffusion model runner params (#1569) 2026-05-28 00:12:35 +08:00