This is a workflow I made in ComfyUI. It can run:
Wan2.1 T2V
Wan2.1 I2V
on just an RTX 3050 Laptop GPU with 4 GB of VRAM.
I am a beginner, but here is what I did:
Used the GGUF custom nodes and models
Nodes (or use ComfyUI Manager to install the custom nodes):
https://github.com/city96/ComfyUI-GGUF
https://github.com/kijai/ComfyUI-WanVideoWrapper
https://github.com/BlenderNeko/ComfyUI_TiledKSampler
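If you are not using ComfyUI Manager, a manual install usually means cloning each repo into the custom_nodes folder. Here is a minimal sketch that only builds the git commands for the three repos above. The folder path "ComfyUI/custom_nodes" and the helper name clone_commands are my assumptions, so adjust them to your install. Some node packs also need their Python requirements installed, so check each README.

```python
from pathlib import Path

# The three custom node repos listed above.
NODE_REPOS = [
    "https://github.com/city96/ComfyUI-GGUF",
    "https://github.com/kijai/ComfyUI-WanVideoWrapper",
    "https://github.com/BlenderNeko/ComfyUI_TiledKSampler",
]


def clone_commands(custom_nodes_dir):
    """Build one `git clone` command per repo (hypothetical helper).

    custom_nodes_dir is assumed to be the ComfyUI custom_nodes folder.
    Nothing is executed here; the commands are only returned.
    """
    root = Path(custom_nodes_dir)
    commands = []
    for url in NODE_REPOS:
        # Use the last part of the URL as the target folder name.
        name = url.rstrip("/").split("/")[-1]
        commands.append(["git", "clone", url, str(root / name)])
    return commands
```

To actually run them, pass each command to subprocess.run(cmd, check=True), then restart ComfyUI so it picks up the new nodes.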
Models (take only the GGUF models from this repo):
https://huggingface.co/calcuis/wan-gguf
Install the VAE, CLIP, CLIP Vision, and text encoder from here:
https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/tree/main/split_files
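A rough sketch of where the downloaded files usually go. The folder names below are what I understand the default ComfyUI layout to be (GGUF models under models/unet, as the ComfyUI-GGUF README describes), but treat them as assumptions and check your own install. Older ComfyUI versions may use models/clip instead of models/text_encoders. The file names in the usage note are placeholders, not real file names.

```python
from pathlib import Path

# Assumed default ComfyUI model folders for each kind of file.
MODEL_FOLDERS = {
    "gguf_model": "models/unet",
    "vae": "models/vae",
    "clip_vision": "models/clip_vision",
    "text_encoder": "models/text_encoders",
}


def target_path(comfy_root, kind, filename):
    """Return where a downloaded file should be placed (hypothetical helper)."""
    if kind not in MODEL_FOLDERS:
        raise ValueError(f"unknown model kind: {kind}")
    return Path(comfy_root) / MODEL_FOLDERS[kind] / filename
```

For example, target_path("ComfyUI", "gguf_model", "example.gguf") points into ComfyUI/models/unet, and target_path("ComfyUI", "vae", "example.safetensors") points into ComfyUI/models/vae.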
You should also use tools like xFormers, SageAttention, or Triton, or similar settings, to speed it up a bit.
Specs:
It uses the Tiled KSampler and a tiled VAE decoder, so everything is tiled. The output is about 480p quality, but with an upscaler I can get a pretty good result.
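To give an idea of why tiling helps with low VRAM: instead of processing the whole frame at once, the frame is split into smaller overlapping boxes that are processed one at a time. This is only a generic sketch of the idea, not the actual code of the Tiled KSampler or the tiled VAE decode nodes. The tile size of 256 and overlap of 32 are numbers I picked for illustration, and the real nodes work on latents rather than pixels.

```python
def tile_boxes(width, height, tile, overlap):
    """Split a width x height area into overlapping (x0, y0, x1, y1) boxes.

    Generic illustration only; tile and overlap are hypothetical settings.
    """
    if overlap >= tile:
        raise ValueError("overlap must be smaller than tile")
    stride = tile - overlap  # how far each new tile is shifted
    boxes = []
    y = 0
    while True:
        y1 = min(y + tile, height)
        x = 0
        while True:
            x1 = min(x + tile, width)
            boxes.append((x, y, x1, y1))
            if x1 >= width:
                break
            x += stride
        if y1 >= height:
            break
        y += stride
    return boxes


# An 848 wide by 480 tall frame with 256 px tiles and 32 px overlap.
boxes = tile_boxes(848, 480, tile=256, overlap=32)
print(len(boxes), "tiles")
```

Each box is much smaller than the full frame, so less memory is needed at any one moment, at the cost of doing many more passes. That trade-off is a big part of why the generation is so slow.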
Now to the results:
Generating 53 frames at 480x848 with 25 steps, plus an upscale to 1080p, took 5.32 hours.
Yes, it takes forever and forever. However, it does not give you an OOM error, and that's good enough for me.
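To put that number in perspective, here is the arithmetic as a small sketch. The 5.32 hours includes the upscale, so the per-step figure is an upper bound on the sampling time alone. The 16 fps playback rate is my assumption for Wan2.1 output, so change it if your workflow saves at a different rate.

```python
TOTAL_HOURS = 5.32   # total time reported above (sampling + upscale)
FRAMES = 53
STEPS = 25
ASSUMED_FPS = 16     # assumption: playback rate of the saved video

total_seconds = TOTAL_HOURS * 3600
seconds_per_step = total_seconds / STEPS     # about 766 s, roughly 12.8 minutes
seconds_per_frame = total_seconds / FRAMES   # about 361 s, roughly 6 minutes
clip_seconds = FRAMES / ASSUMED_FPS          # about 3.3 s of video

print(f"{seconds_per_step:.0f} s per step")
print(f"{seconds_per_frame:.0f} s per frame")
print(f"{clip_seconds:.1f} s of video")
```

So under these assumptions it is over five hours of compute for a bit more than three seconds of video.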
Notes:
If you only want to do text-to-video, use a Hunyuan empty latent video as the latent input. I liked these custom nodes for that.
I also like their sampler, as you can use it with the custom GGUF nodes as well.
Go ahead and upload yours!