Wagner Bruna | 710bc91c8f | fix: correct conversion from sd_type_t to ggml_type (#1519) | 2026-06-16 23:54:42 +08:00
leejet | bb90bfa00f | feat: support backend-specific max-vram budgets | 2026-06-14 22:46:32 +08:00
stduhpf | c2df4e1228 | feat: add RPC support (#1629) | 2026-06-14 17:30:23 +08:00
leejet | 9838264c49 | refactor: simplify ControlNet output caching (#1655) | 2026-06-14 16:58:37 +08:00
leejet | 5db680c2c7 | refactor: route cpu placement through backend specs (#1654) | 2026-06-14 15:52:24 +08:00
leejet | bdb431ad95 | feat: support disk params backend (#1651) | 2026-06-14 14:48:50 +08:00
leejet | 8d4c7af95b | refactor: route all runner params through model manager (#1649) | 2026-06-14 02:05:23 +08:00
leejet | 563137a592 | refactor: centralize runner weight staging and cleanup (#1644) | 2026-06-13 13:19:13 +08:00
RapidMark | 1b702a51e7 | fix: correct mask shape for masked flash attention (#1625) | 2026-06-13 13:01:20 +08:00
RapidMark | 19bdfe22d2 | feat: set tensor names on block params (#1622) | 2026-06-08 23:25:52 +08:00
fszontagh | 17a2b4a315 | perf: cap planner budget when model dwarfs the streaming budget (#1612) | 2026-06-08 21:53:54 +08:00
leejet | f3fd359b58 | refactor: reorganize src model layout (#1615) | 2026-06-07 03:21:12 +08:00