Pretty simple workflow for LTXV with 3 modes so far and quite an early version. There is no prompt enchancer; in my opinion, it's useless garbage as a tool, and text LLM work mostly works very slow in ComfyUI. There is no latent upscaling. ~~Almost the same useless garbage~~. Maybe this is useful for T2V and that all.

Setting the full resolution will be better than half resolution and latent upscaling with an interpolated result, even it requires resources like second generation. If your system is not powerful enough for the full resolution you set, almost certainly the server will run out of memory on the latent upscale stage. And that's even worse in IMG2video cases with the pixel upscaler itself.

The workflow works fine on my 8GB VRAM videocard at 720p; I haven't tried higher resolution yet on FP8 dev model with distilled lora. Controlling it is the same as any other my workflow. If you try even one of my workflows, it will be similar.

Put your attention on the note under V2V group. I also tried OpenPose and canny; they are much worse than Depth mask.

I tried ID Lora, maybe it's good but extremely slow

Description

LTX-2.3, simple workflow. T2V, I2V, I2V external audio, FFLF, V2V (IC lora + depth mask)

Model Details

Available Files

Tags

Versions

Related Models

Model Information