diff --git a/README.md b/README.md index cef0bac..fe64cea 100644 --- a/README.md +++ b/README.md @@ -21,6 +21,7 @@ API and command-line option may change frequently.*** - [SD3/SD3.5](./docs/sd3.md) - [Flux-dev/Flux-schnell](./docs/flux.md) - [Chroma](./docs/chroma.md) + - [Qwen Image](./docs/qwen_image.md) - Image Edit Models - [FLUX.1-Kontext-dev](./docs/kontext.md) - Video Models @@ -296,6 +297,7 @@ arguments: --clip_g path to the clip-g text encoder --clip_vision path to the clip-vision encoder --t5xxl path to the t5xxl text encoder + --qwen2vl path to the qwen2vl text encoder --vae [VAE] path to vae --taesd [TAESD_PATH] path to taesd. Using Tiny AutoEncoder for fast decoding (low quality) --control-net [CONTROL_PATH] path to control net model @@ -464,6 +466,7 @@ Thank you to all the people who have already contributed to stable-diffusion.cpp ## References - [ggml](https://github.com/ggerganov/ggml) +- [diffusers](https://github.com/huggingface/diffusers) - [stable-diffusion](https://github.com/CompVis/stable-diffusion) - [sd3-ref](https://github.com/Stability-AI/sd3-ref) - [stable-diffusion-stability-ai](https://github.com/Stability-AI/stablediffusion) diff --git a/assets/qwen/example.png b/assets/qwen/example.png new file mode 100644 index 0000000..f51bbd5 Binary files /dev/null and b/assets/qwen/example.png differ diff --git a/docs/qwen_image.md b/docs/qwen_image.md new file mode 100644 index 0000000..a6f26c5 --- /dev/null +++ b/docs/qwen_image.md @@ -0,0 +1,23 @@ +# How to Use + +## Download weights + +- Download Qwen Image + - safetensors: https://huggingface.co/Comfy-Org/Qwen-Image_ComfyUI/tree/main/split_files/diffusion_models + - gguf: https://huggingface.co/QuantStack/Qwen-Image-GGUF/tree/main +- Download vae + - safetensors: https://huggingface.co/Comfy-Org/Qwen-Image_ComfyUI/tree/main/split_files/vae +- Download qwen_2.5_vl 7b + - safetensors: https://huggingface.co/Comfy-Org/Qwen-Image_ComfyUI/tree/main/split_files/text_encoders + - gguf: https://huggingface.co/mradermacher/Qwen2.5-VL-7B-Instruct-GGUF/tree/main + +## Examples + +``` +.\bin\Release\sd.exe --diffusion-model ..\..\ComfyUI\models\diffusion_models\qwen-image-Q8_0.gguf --vae ..\..\ComfyUI\models\vae\qwen_image_vae.safetensors --qwen2vl ..\..\ComfyUI\models\text_encoders\Qwen2.5-VL-7B-Instruct-Q8_0.gguf -p '一个穿着"QWEN"标志的T恤的中国美女正拿着黑色的马克笔面相镜头微笑。她身后的玻璃板上手写体写着 “一、Qwen-Image的技术路线: 探索视觉生成基础模型的极限,开创理解与生成一体化的未来。二、Qwen-Image的模型特色:1、复杂文字渲染。支持中英渲染、自动布局; 2、精准图像编辑。支持文字编辑、物体增减、风格变换。三、Qwen-Image的未来愿景:赋能专业内容创作、助力生成式AI发展。”' --cfg-scale 2.5 --sampling-method euler -v --offload-to-cpu -H 1024 -W 1024 --diffusion-fa --flow-shift 3 +``` + +qwen example + + +