Welcome to the Unlimited Length I2V workflow for ComfyUI, pushing the boundaries of video generation by leveraging the FramePack system to produce videos of virtually unlimited length (i.e. the frame count is no longer limited to the 96 frames of previous Hunyuan or even Wan implementations).
Just a few weeks ago, this kind of output would have been impossible; now, it's a matter of a few nodes.
⚠️ This is a first working draft. Expect massive improvements soon (see below).
This workflow uses FramePack to perform image-to-video (I2V) generation with long, coherent sequences. By combining the original FramePack I2V architecture with the modular flexibility of ComfyUI and support from native models, this setup opens new creative possibilities for animating images far beyond the usual frame count limitations.
It currently features:
automatic resizing of the input image to the nearest supported format
end-frame support
any input resolution accepted (it will be rounded to the nearest valid one; see the sketch after this list)
LLM-based image description
TeaCache support
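To make the resizing step concrete, here is a minimal Python sketch of how an arbitrary resolution can be snapped to a supported one: scale to a fixed pixel budget while keeping the aspect ratio, then round both sides to a multiple of 16. The pixel budget and the multiple are assumptions for illustration only; the wrapper's actual bucket list may differ.

```python
# Illustrative sketch only (not the wrapper's exact code): snap an arbitrary
# input resolution to a "valid" one by keeping the aspect ratio, targeting a
# fixed pixel budget, and rounding both sides to a multiple of 16.
def round_to_supported(width: int, height: int,
                       pixel_budget: int = 704 * 544, multiple: int = 16):
    scale = (pixel_budget / (width * height)) ** 0.5
    new_w = max(multiple, round(width * scale / multiple) * multiple)
    new_h = max(multiple, round(height * scale / multiple) * multiple)
    return new_w, new_h

print(round_to_supported(1920, 1080))  # -> (832, 464) with these assumptions
```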
I also tried to explain each setting with a note directly in the workflow. No need to keep this page open when using it!
To run this workflow, you need the following:
An NVIDIA GPU from the RTX 30XX, 40XX, or 50XX series that supports fp16 and bf16. GTX 10XX/20XX cards are untested.
6 GB of VRAM (yes, only! It can work on your laptop!)
Kijai's FramePack Wrapper for ComfyUI
→ https://github.com/kijai/ComfyUI-FramePackWrapper
At the time of writing, it cannot be installed from the ComfyUI Manager's node list. You can install it through Manager > Install via Git URL > https://github.com/kijai/ComfyUI-FramePackWrapper.git
1. Native models (text encoders, VAE, sigclip):
2. Transformer (FramePack) model:
🔧 Autodownload (recommended):
From HuggingFace: lllyasviel/FramePackI2V_HY
→ Place in: ComfyUI/models/diffusers/lllyasviel/FramePackI2V_HY
🔧 Manual download (single safetensors files):
Place in: ComfyUI/models/diffusion_models/
FramePackI2V_HY_fp8_e4m3fn.safetensors
(FP8 quantized; optimized for low-VRAM GPUs and better compatibility.)
FramePackI2V_HY_bf16.safetensors
(Better suited for high-memory GPUs, offering higher fidelity thanks to BF16 precision.)
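If you are not sure which file to pick, a quick VRAM check can guide the choice. The snippet below is only a sketch: the 16 GB cutoff is my own rule of thumb, not an official threshold.

```python
# Sketch: pick a FramePack checkpoint based on total VRAM. The 16 GB threshold
# is an assumption, not an official recommendation.
import torch

def pick_framepack_checkpoint(threshold_gb: float = 16.0) -> str:
    if not torch.cuda.is_available():
        raise RuntimeError("FramePack needs a CUDA-capable NVIDIA GPU.")
    total_gb = torch.cuda.get_device_properties(0).total_memory / 1024**3
    if total_gb >= threshold_gb:
        return "FramePackI2V_HY_bf16.safetensors"      # higher fidelity
    return "FramePackI2V_HY_fp8_e4m3fn.safetensors"    # lower memory footprint

print(pick_framepack_checkpoint())
```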
TeaCache is a smart caching system for diffusion models: it stores intermediate computation results and reuses them across denoising steps whose inputs barely change, skipping part of the transformer work. This drastically speeds up generation, which is especially welcome for long videos built from many segments.
The workflow includes a switch to enable or disable TeaCache, depending on your memory availability and whether you're prioritizing speed or maximum fidelity (the cached shortcuts can cost a little quality).
TeaCache boost: up to 2x faster generation.
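For the curious, here is a deliberately simplified sketch of the caching idea, not the actual TeaCache implementation (which estimates change from timestep-embedding-modulated inputs): when the model input barely changes between consecutive denoising steps, the previously computed residual is reused instead of running the full transformer.

```python
# Simplified illustration of the TeaCache idea; not the real implementation.
import torch

class TeaCacheLite:
    def __init__(self, model, rel_threshold: float = 0.15):
        self.model = model                  # callable: (x, t) -> denoised output
        self.rel_threshold = rel_threshold  # higher = more skipping, more drift
        self.prev_input = None
        self.prev_residual = None

    def __call__(self, x: torch.Tensor, t: torch.Tensor) -> torch.Tensor:
        if self.prev_input is not None and self.prev_residual is not None:
            rel_change = ((x - self.prev_input).abs().mean()
                          / self.prev_input.abs().mean()).item()
            if rel_change < self.rel_threshold:
                self.prev_input = x
                return x + self.prev_residual   # skip the expensive forward pass
        out = self.model(x, t)                  # full forward pass
        self.prev_input = x
        self.prev_residual = out - x
        return out
```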
Tested on my "old" RTX 3090:
Resolution: 704x544
Length: 150 frames
Generation time: 11 minutes
Another test:
384x448, 600 frames generated in 15 minutes.
The original project claims that a desktop RTX 4090 generates at about 2.5 seconds/frame (unoptimized) or 1.5 seconds/frame (with TeaCache).
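As a rough sanity check, total generation time is simply frames × seconds per frame; the tiny helper below reproduces the figures above (illustrative arithmetic only).

```python
def estimated_minutes(num_frames: int, seconds_per_frame: float) -> float:
    """Rough total generation time in minutes."""
    return num_frames * seconds_per_frame / 60

print(estimated_minutes(150, 4.4))  # ~11 min: my 704x544 RTX 3090 test works out to ~4.4 s/frame
print(estimated_minutes(600, 1.5))  # 15 min: the 384x448 test corresponds to ~1.5 s/frame
print(estimated_minutes(150, 1.5))  # ~3.8 min for 150 frames on a 4090 with TeaCache, per the claim above
```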
This release is a second draft. It is mostly working and "straight to the point".
This is also my VERY FIRST WORKFLOW CONTRIBUTION on Civit.ai! Please be gentle in your comments.
Next steps are:
LoRA support (needs some Python scripting; I'll get this done in the next few days)
Upscaling (coming very soon too)
FramePack was originally developed by lllyasviel. This workflow wraps it in ComfyUI thanks to Kijai's work, with additional optimizations and user-friendly features.
@lllyasviel for the original FramePack architecture
@Kijai for the ComfyUI node wrapper
Comfy-Org for the models and pipeline integration
Everyone in the ComfyUI community for testing and feedback
The default settings are based on my RTX 3090 (24 GB of VRAM). If you have less and run into memory issues, first switch the FramePack model to the fp8 version; if that's not enough, try lowering the VAE batch parameters.
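Expressed as code, that fallback order looks like this; it is only a sketch, and the key names and the starting batch size of 64 are placeholders rather than the exact widget names or defaults in the workflow.

```python
# Sketch of the out-of-memory fallback order described above. Key names and the
# starting batch size are placeholders, not the workflow's exact settings.
def memory_fallbacks(default_vae_batch: int = 64):
    # 1. Switch the transformer to the FP8 checkpoint.
    yield {"framepack_model": "FramePackI2V_HY_fp8_e4m3fn.safetensors"}
    # 2. Then keep halving the VAE decode batch size until generation fits in VRAM.
    vae_batch = default_vae_batch
    while vae_batch > 1:
        vae_batch //= 2
        yield {"vae_batch_size": vae_batch}

for override in memory_fallbacks():
    print(override)  # apply one override at a time, then retry the generation
```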
Please post the videos you make with my workflow here; I really want to see what you are doing with it!