This is a workflow i made in comfy ui it can run
Wan2.1 T2V
Wan2.1 I2V
On just a RTX3050 Laptop edition 4gb Vram
I am a beginner but here is what I did:
Used the GGUF custom nodes and models
Nodes: (or use comfyui manager to install custom nodes)
https://github.com/city96/ComfyUI-GGUF
https://github.com/kijai/ComfyUI-WanVideoWrapper
https://github.com/BlenderNeko/ComfyUI_TiledKSampler
only models
Models:https://huggingface.co/calcuis/wan-gguf
install the vae,clip,clip vision,Text encoder from here
https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/tree/main/split_files
You should also use tools like Xformer,sageatten,triton or settings to speed it up a bit.
Specs:
It uses Tiled Ksampler tiled vae decoder everything tiled and about 480p quality but then using an upscaller i can get a pretty good result.
Now to the results:
Generation for 53 frames at 480x848 plus an upscaller to 1080p at 25 steps took 5.32 hours
yes it takes forever and forever however it does not give you a oom error thats good enough for me.
Notes:
If you want to do just text use a hunyuan empty latent video I liked these custom nodes for that
I also like there sampler as you can use it aswell with custom gguf nodes.
All the features
Upscaller and Frame interpolaration
Lora Loader (Hunyuan Loras should mostly work)
In this version I uploaded the workflow image so you can have all the groupings.
the workflow has updated notes and groups aswell as applys rifle x to improve the speed and length you can create from 5sec to 8sec I still use Rifle47 to add frames after.
Extends the potential frame count of HunyuanVideo using this method: https://github.com/thu-ml/RIFLEx
Go ahead and upload yours!
Your query returned no results – please try removing some filters or trying a different term.