vrgamedevgirl
4 months ago

πŸŒ€ Wan2.1_14B_FusionX β€” Merged models for Faster, Richer Motion & Detail in as little as 8 steps!

πŸ“Œ Important Details- Please read the full description below because small changes to settings will provide totally different results in a bad way! I have been testing and already found better settings so just please read below! Thank you :)

πŸ’‘Workflows can be found HERE (This is a wip and more will be added soon.)

πŸ› οΈUpdates section has been moved to the end of the description.

A high-performance text-to-video model built on top of the base WAN 2.1 14B T2V model β€” carefully merged with multiple research-grade models to enhance motion quality, scene consistency, and visual detail, comparable to some of the many close source models.

## πŸ“’ Join The Community!

We're building a friendly space to chat, share creations, and get support.

πŸ‘‰ Click here to join the Discord!

Come say hi in #welcome, check out the rules, and show off your creations! 🎨🧠

πŸ’‘ What’s Inside this base model:

  • 🧠 CausVid – Causal motion modeling for better scene flow and dramatic speed boot

  • 🎞️ AccVideo – Improves temporal alignment and realism along with speed boot

  • 🎨 MoviiGen1.1 – Brings cinematic smoothness and lighting

  • 🧬 MPS Reward LoRA – Tuned for motion dynamics and detail

  • ✨ Custom LoRAs (by me) – Focused on texture, clarity, and facial details.


πŸ”₯ Highlights:

  • πŸ“ Accepts standard prompt + negative prompt setup

  • πŸŒ€ Tuned for high temporal coherence and expressive, cinematic scenes

  • πŸ” Drop-in replacement for WAN 2.1 T2V β€” just better

  • πŸš€ Renders up to 50% faster than the base model (especially with SageAttn enabled)

  • 🧩 Fully compatible with VACE

  • 🧠 Optimized for use in ComfyUI, especially with the Kaji Wan Wrapper


πŸ“Œ Important Details for text to video:

  • πŸ”§ CGF must be set to 1 β€” anything higher will not provide acceptable results.

  • πŸ”§ Shift - Results can vary based on Resolution. 1024x576 should start at 1 and if using 1080x720 start at 2. Note: For more realism lower shift values is what you need. If your looking for a more stylized look then test higher shift values between 3-9

  • Scheduler: Most of my examples used Uni_pc but you can get different results using others. Is really all about experimenting. I noticed depending on the prompt that the flowmatch_causvid works well too and helps with small details.

πŸ“Œ Important Details for image to video:

  • πŸ”§ CGF must be set to 1 β€” anything higher will not provide acceptable results.

  • πŸ”§ Shift - For image to video I found that 2 is best but you can experiment.

  • Scheduler: Most of my examples used dmp++_sde/beta and seems to work best but you can experiment.

  • After testing, to get more motion and reduce the slow-mo look, set your frame count to 121 and frames per second to 24. This can provide up to a 50% motion speed boost.

πŸ“ŒOther Important Details:

  • ⚑ Video generation works with as few as 6 steps, but 8–10 steps yield the best quality. Lower steps are great for fast drafts with huge speed gains.

  • 🧩 Best results using the Kaji Wan Wrapper custom node:
    https://github.com/kijai/ComfyUI-WanVideoWrapper

  • πŸ§ͺ Also tested with the native WAN workflow, generation time is a bit longer but results match wrapper.

  • ❗ Do not re-add CausVid, AccVideo, or MPS LoRAs β€” they’re already baked into the model and may cause unwanted results.

  • 🎨 You can use other LoRAs for additional styling β€” feel free to experiment.

  • πŸ“½οΈ All demo videos were generated at 1024x576, 81 frames, using only this model β€” no upscaling, interpolation, or extra LoRAs.

  • πŸ–₯️ Rendered on an RTX 5090 β€” each video takes around 138 seconds with the listed settings.

  • 🧠 If you run out of VRAM, enable block swapping β€” start at 5 blocks and adjust as needed.

  • πŸš€ SageAttn was enabled, providing up to a 30% speed boost. (Wrapper only)

  • Workflows for each model can be found here: HERE

  • 🚫 Do not use teacache β€” it’s unnecessary due to the low step count.

  • πŸ” β€œEnhance a video” and β€œSLG” features were not tested β€” feel free to explore on your own. -- Edit. I did test "Enhance a video" and you can get more vibrant results with this turned on. Settings between 2-4. Experiment! SLG has not been tested much.

  • πŸ’¬ Have questions? You’re welcome to leave a message or join the community:

  • πŸ“ Want better prompts? All my example video prompts were created using this custom GPT:
    🎬 WAN Cinematic Video Prompt Generator
    Try asking it to add extra visual and cinematic details β€” it makes a noticeable difference.


⚠️ Disclaimer:

  • Videos generated using this model are intended for personal, educational, or experimental use only, unless you’ve completed your own legal due diligence.

  • This model is a merge of multiple research-grade sources, and is not guaranteed to be free of copyrighted or proprietary data.

  • You are solely responsible for any content you generate and how it is used.

  • If you choose to use outputs commercially, you assume all legal liability for copyright infringement, misuse, or violation of third-party rights.

When in doubt, consult a qualified legal advisor before monetizing or distributing any generated content.


⚠️ More gguf models of the main model.

Can be found HERE


⚠️ Native VACE models for use in the Native VACE WF only.

Non gguf versions can be found HERE

gguf versions can be found HERE


⚠️ More gguf models of the image to video version!

Can be found here: HERE



πŸ“Œgguf comparisons!
I'm slowly adding to this list, but here you can see how the models compare against the main model.

Text to video:

--------

πŸ› οΈUpdate 6/8/2025 - Image to video model is published! Settings that I use in the example videos: Steps = 10 / cfg = 1 / shift = 2 / schedular = dmp++_sde i'll post a WF soon.

πŸ› οΈUpdate 6/7/2025 - Published a i2v phantom model that can take up to 4 reference images and combine them into a video. Posting workflow soon

Phantom WF is getting uploaded soon.

πŸ› οΈUpdate 6/6/2025 - Added a new gguf model! If you want the highest quality and have enough VRAM get the V1.0 model otherwise gguf is the next best thing! When using the gguf's it will take longer to generate even on an RTX 5090.

Read more...

What is Wan2.1_14B_FusionX?

Wan2.1_14B_FusionX is a highly specialized Image generation AI Model of type Safetensors / Checkpoint AI Model created by AI community user vrgamedevgirl. Derived from the powerful Stable Diffusion (Wan Video 14B i2v 720p) model, Wan2.1_14B_FusionX has undergone an extensive fine-tuning process, leveraging the power of a dataset consisting of images generated by other AI models or user-contributed data. This fine-tuning process ensures that Wan2.1_14B_FusionX is capable of generating images that are highly relevant to the specific use-cases it was designed for, such as base model, merge, wan.

With a rating of 0 and over 0 ratings, Wan2.1_14B_FusionX is a popular choice among users for generating high-quality images from text prompts.

Can I download Wan2.1_14B_FusionX?

Yes! You can download the latest version of Wan2.1_14B_FusionX from here.

How to use Wan2.1_14B_FusionX?

To use Wan2.1_14B_FusionX, download the model checkpoint file and set up an UI for running Stable Diffusion models (for example, AUTOMATIC1111). Then, provide the model with a detailed text prompt to generate an image. Experiment with different prompts and settings to achieve the desired results. If this sounds a bit complicated, check out our initial guide to Stable Diffusion – it might be of help. And if you really want to dive deep into AI image generation and understand how set up AUTOMATIC1111 to use Safetensors / Checkpoint AI Models like Wan2.1_14B_FusionX, check out our crash course in AI image generation.

Download (6.76 GB) Download available on desktop only
You'll need to use a program like A1111 to run this – learn how in our crash course

Popularity

2k ~10

Info

Base model: Wan Video 14B i2v 720p

Latest version (FusionX_i2v_gguf_Q3_K_S): 1 File

To download these files, please visit this page from a desktop computer.

5 Versions

πŸ˜₯ There are no Wan2.1_14B_FusionX FusionX_i2v_gguf_Q3_K_S prompts yet!

Go ahead and upload yours!

No results

Your query returned no results – please try removing some filters or trying a different term.