Amateur Photography [Flux Dev]

v3.0
peterkickasspeter
about 1 year ago

New Changes in v2:

  1. Adjusted the dataset (note that you may still see some bias towards white people, but I suggest prompting for exactly what you want instead of just saying "woman" or "man")

  2. Tagged the race, ethnicity and physical attributes of the subjects, so it should control the bias towards plus-size people

  3. Training dataset captions are now ~200 words per image (instead of 45-70 in v1). T5XXL is no joke lol. That means it can generate complex scenes, and you can position people and objects where you want (the base model can already do this; this lora just adds the realism and clutter to it). It may or may not work, so do some experimentation

  4. It can also generate some high-quality background-blur pictures if you are into that. Prompt it using "cinematic feel" at weight 0.5 or 0.6, or other words that work for you

  5. I may have messed up fingers in v1. I think v2 corrects this (if the base model's image has bad fingers, this lora tends to follow it)

  6. Realism kicks in at weights between "0.5-0.6". If you want to stay close to the output the base model generates without this lora, I suggest staying between "0.5-0.6". Maximum realism is between 0.8 and 1.0, but be prepared to see some horrors lol (you can experiment yourself; these are just my observations based on my limited testing)
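If you want to run the weight experiment from point 6 in an orderly way, a small sweep helper is one option. This is only a sketch: `describe_weight` and `sweep` are made-up names, the 0.1 step is an assumption, and the labels just restate the observations above (weights outside the two mentioned ranges are marked as not described).

```python
# Hypothetical helpers for planning a LoRA weight sweep.
# The two bands come from the notes above; everything else is an assumption.

def describe_weight(weight: float) -> str:
    """Return the behaviour the notes above report for a given LoRA weight."""
    w = round(weight, 2)
    if 0.5 <= w <= 0.6:
        return "realism kicks in, output stays close to the base model"
    if 0.8 <= w <= 1.0:
        return "maximum realism, be prepared for some broken images"
    return "not described in the notes, experiment yourself"


def sweep(start: float = 0.5, stop: float = 1.0, step: float = 0.1) -> list:
    """List the weights from start to stop (inclusive) in fixed steps."""
    count = int(round((stop - start) / step)) + 1
    # Round each value so float noise does not leak into the weights.
    return [round(start + i * step, 2) for i in range(count)]


if __name__ == "__main__":
    for w in sweep():
        print(f"{w:.1f}: {describe_weight(w)}")
```

Generate the same prompt and seed at each listed weight and compare the results side by side; how you pass the weight to the lora depends on the UI or library you use.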

I would like to thank everyone who donated buzz for v1. I was skeptical at first about buying $5 of buzz, and now I have been able to train 6 Flux loras on civitai. BTW a 400 epoch v3 is cooking. Let's see how it goes haha

Download (285 MB)

Popularity

3k ~10

Info

Base model: Flux.1 D

Version v3.0: 1 File

About this version: v3.0

Quality improvements over v2 (I think it's good; you be the judge). It likes >200 word prompts (it behaves badly if you use SD 1.5 style prompts. If you want good quality from this lora, please use the format from my examples and the ChatGPT prompt I provided below). If you want the everyday-looking photo quality this lora offers, don't use "beauty" prompts (see my examples and the GPT4O prompt to understand what I mean).

Experiment with the lora. If you think it's good, like the post (if you think it's bad, comment what's wrong and I will try to fix the issues in the next version) and if possible tip some buzz. I used way too much buzz for v3 training lol. Anyway, always check my example images' metadata to understand how I generated those images, and start from there. Thanks

Updated GPT4O Prompt to squeeze this lora dry:

"I am training a LoRA for the Flux 1D text-to-image model that utilizes the T5XXL transformer in its architecture. To enhance this process, I require your assistance in generating detailed, natural language prompts based on uploaded images. Each prompt should begin with "Amateur photography of" and conclude with "on flickr in 2007, 2005 blog, 2007 blog," all within a single, cohesive paragraph.

Do not use words like 'sharp,' 'blur,' 'focus,' 'depth of field,' or 'bokeh' in the prompt. Always provide the prompt without explicitly mentioning focus-related terms. Emphasize the clarity and vividness of the entire scene. Incorporate the use of a camera flash if used.

Format:

Subject Description: Provide a comprehensive description of the main subjects in the image, covering aspects such as race, ethnicity, and physical characteristics (e.g., height, build, skin tone, hair color). Include detailed facial features (e.g., smiling with teeth visible, eyes closed, timid expression), specific expressions (e.g., joyful grin, focused gaze), and poses (e.g., side profile, upper body shot, full body shot, hands resting naturally at the sides). Specify their body type (e.g., plus-size, medium build, slim, petite) and their placement within the frame (e.g., positioned on the left, center, or right). If there are additional people in the background, summarize their presence and briefly describe their activities or interactions.

Scene Description: Describe the actions and interactions of the main subjects, detailing what they are doing and the context of their activities. Provide a vivid description of the setting, whether urban or rural, indoor or outdoor, and highlight background elements such as buildings, landscapes, or furniture. Include any visible text in the image (e.g., signs, posters) and specify its location within the frame. Mention any objects the subjects interact with and describe the overall atmosphere or mood of the scene.

Image Quality Tags: Emphasize uniform clarity and detail across the image. Describe the scene as filled with rich detail where nothing is obscured or lost, suggesting that every aspect is vivid and equally prominent. Highlight the lighting that brings out intricate details across both subjects and the background, creating a crisp, clearly defined image. Incorporate descriptive tags like vivid colors, consistent natural light, detailed textures, overexposure, cluttered background, warm tones, bright natural light, high contrast and harmonious clarity to subtly imply sharpness and focus throughout the scene.

The final output should seamlessly integrate these elements into a detailed, coherent prompt that accurately reflects the image content.

If you are ready, reply "Ok" and I will start uploading the images."
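If you want to sanity-check a generated prompt against the rules in the GPT4O prompt above before using it, something like the following could work. It is a rough sketch, not part of the lora: `check_prompt` is a made-up name, the 200-word minimum is an assumption taken from the ">200 word prompts" note, and the trailing comma after "2007 blog" is treated as optional because it is unclear whether it belongs to the required ending.

```python
import re

# Rules taken from the GPT4O prompt above; the names are hypothetical.
PREFIX = "Amateur photography of"
SUFFIX = "on flickr in 2007, 2005 blog, 2007 blog"
BANNED = ("sharp", "blur", "focus", "depth of field", "bokeh")
MIN_WORDS = 200  # assumption based on the ">200 word prompts" note


def check_prompt(prompt: str) -> list:
    """Return a list of problems found; an empty list means the prompt passes."""
    problems = []
    text = prompt.strip()
    if not text.startswith(PREFIX):
        problems.append("does not start with the required prefix")
    # Accept the ending with or without trailing punctuation.
    if not text.rstrip(" ,.").endswith(SUFFIX):
        problems.append("does not end with the required suffix")
    if "\n" in text:
        problems.append("is not a single paragraph")
    for term in BANNED:
        # Whole-word match only, so "focused gaze" is not flagged.
        if re.search(rf"\b{re.escape(term)}\b", text, flags=re.IGNORECASE):
            problems.append(f"contains banned term: {term}")
    words = len(text.split())
    if words <= MIN_WORDS:
        problems.append(f"only {words} words, expected more than {MIN_WORDS}")
    return problems


if __name__ == "__main__":
    print(check_prompt("a woman at the beach, bokeh, sharp focus"))
```

The whole-word match is a deliberate choice: the template itself uses "focused gaze" as an example expression, so only the exact listed terms are rejected. Variants such as "blurry" pass the check and would need to be added to `BANNED` if you want them caught.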

7 Versions
