Deepseek Janus Pro 1B / 7B [Safetensors]

Support files only
8 months ago

https://huggingface.co/deepseek-ai/Janus-Pro-1B

https://huggingface.co/deepseek-ai/Janus-Pro-7B

Note: The CY-CHENYUE/ComfyUI-Janus-Pro nodes don't support .safetensors.

So I forked model_loader.py and updated it to download the model automatically and to support .safetensors. Renaming the files did not work, so you need to keep them named model.safetensors.

For the 7B version, I could not get shard-merging to work, so it stays sharded in 3 parts.
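If the 7B shards follow the usual Hugging Face sharded-checkpoint convention (an assumption; this page does not state the shard names or whether an index file ships with them), a small script can list the shard files an index refers to and report any that are missing. The index name model.safetensors.index.json and its "weight_map" key come from that convention, not from this upload.

```python
import json
from pathlib import Path


def list_shards(index_path):
    """Return the sorted, de-duplicated shard file names named in an index file."""
    index = json.loads(Path(index_path).read_text(encoding="utf-8"))
    # "weight_map" maps each tensor name to the shard file that stores it.
    return sorted(set(index["weight_map"].values()))


def missing_shards(model_dir, index_name="model.safetensors.index.json"):
    """Return shard files listed in the index but absent from model_dir."""
    model_dir = Path(model_dir)
    # index_name is an assumed default; pass the real name if it differs.
    return [
        name
        for name in list_shards(model_dir / index_name)
        if not (model_dir / name).is_file()
    ]
```

An empty list from missing_shards means every shard named in the index is present in the folder.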

Installation instructions

  • Install ComfyUI

  • Install the CY-CHENYUE/ComfyUI-Janus-Pro node-pack

  • Manually overwrite the model_loader.py in ComfyUI\custom_nodes\ComfyUI-Janus-Pro\nodes\model_loader.py with the one above

  • You can use the ComfyUI Workflow above

  • The updated model_loader script will automatically download the model and place it in the correct folder

  • To do it manually, unzip the files for your desired version in the model list above so that the folder structure looks something like the screenshot below.

So the model path for the 1B version should be:

ComfyUI/models/Janus-Pro/Janus-Pro-1B/model.safetensors

But remember that you also need the config and the rest of the files, which is why it's uploaded as a .zip
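A minimal sketch for checking the manual layout of the 1B version. It builds the path shown above and reports which expected files are missing. The support-file list is an assumption: only "config.json" is guessed here, and the real set is whatever the .zip contains.

```python
from pathlib import Path

# Assumed support files; replace with the actual contents of the .zip.
ASSUMED_SUPPORT_FILES = ("config.json",)


def janus_model_dir(comfy_root, model_name):
    """Build <comfy_root>/models/Janus-Pro/<model_name>."""
    return Path(comfy_root) / "models" / "Janus-Pro" / model_name


def check_layout(comfy_root, model_name, support_files=ASSUMED_SUPPORT_FILES):
    """Return the expected files that are missing (an empty list means OK)."""
    model_dir = janus_model_dir(comfy_root, model_name)
    expected = ("model.safetensors",) + tuple(support_files)
    return [name for name in expected if not (model_dir / name).is_file()]
```

For example, check_layout("ComfyUI", "Janus-Pro-1B") returns the names that still need to be copied into the folder. The sharded 7B version has no single model.safetensors, so this check only fits the 1B layout.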

There's also a version that is just the support-files, if you would rather combine that with the original .bin checkpoint models.

Congratulations!

With a 3090 (24 GB), you can enjoy speedy 8-minute generations of a 384x384 image that looks much worse than anything Stable Diffusion 1.5 spits out in 0.5 seconds.

Janus-Pro is a novel autoregressive framework that unifies multimodal understanding and generation. It addresses the limitations of previous approaches by decoupling visual encoding into separate pathways, while still utilizing a single, unified transformer architecture for processing. The decoupling not only alleviates the conflict between the visual encoder’s roles in understanding and generation, but also enhances the framework’s flexibility. Janus-Pro surpasses previous unified models and matches or exceeds the performance of task-specific models. The simplicity, high flexibility, and effectiveness of Janus-Pro make it a strong candidate for next-generation unified multimodal models.
Janus-Pro is a unified understanding and generation MLLM, which decouples visual encoding for multimodal understanding and generation. Janus-Pro is constructed based on the DeepSeek-LLM-1.5b-base/DeepSeek-LLM-7b-base.
For multimodal understanding, it uses the SigLIP-L as the vision encoder, which supports 384 x 384 image input. For image generation, Janus-Pro uses the tokenizer from here with a downsample rate of 16.
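As a quick check of those numbers: assuming a square 384 x 384 generated image (the size mentioned earlier on this page) and one token per 16 x 16 cell, a downsample rate of 16 gives a 24 x 24 grid, or 576 image tokens. The function name below is only illustrative.

```python
def image_token_grid(image_size=384, downsample_rate=16):
    """Return (tokens per side, total tokens) for a square image."""
    side = image_size // downsample_rate
    return side, side * side


side, tokens = image_token_grid()
print(side, tokens)  # prints "24 576"
```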

This is the converted .safetensors version of the model.
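To confirm that a downloaded file really is a .safetensors file, you can read its header without loading any weights. The format starts with an 8-byte little-endian length, followed by that many bytes of JSON describing each tensor. This is a sketch using only the standard library, and the function name is made up for illustration.

```python
import json
import struct


def read_safetensors_header(path):
    """Return {tensor name: {"dtype", "shape", "data_offsets"}} for a file."""
    with open(path, "rb") as f:
        # First 8 bytes: unsigned 64-bit little-endian size of the JSON header.
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len).decode("utf-8"))
    # "__metadata__" is an optional free-form entry, not a tensor.
    header.pop("__metadata__", None)
    return header
```

For example, looping over read_safetensors_header("model.safetensors").items() and printing each name with its "dtype" and "shape" lists the stored tensors. For the sharded 7B version, each shard file has its own header.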

The original 7B ones can be found here: https://huggingface.co/deepseek-ai/Janus-Pro-7B/tree/e6ac502c7931490e5b56b0ff2d30413f2a21b887


What is Deepseek Janus Pro 1B / 7B [Safetensors]?

Deepseek Janus Pro 1B / 7B [Safetensors] is a Safetensors / Checkpoint upload by community user mnemic, tagged base model, pro, checkpoint. It is a .safetensors conversion of the DeepSeek Janus-Pro models described above, not a Stable Diffusion fine-tune.


Download (2.68 MB)

Popularity

260 ~10

Info

Base model: Other

Latest version (Support files only): 1 File


5 Versions
