{ "prompt_type": "descriptive_replication", "reference_adherence": "STRICT_VISUAL_FIDELITY", "aspect_ratio": "3:4", "identity_lock": { "priority": "NONE", "instruction": "Completely new female model with entirely different facial structure and identity. No resemblance to any previous version." }, "scene_constraints": { "people_count": "exactly one person in the frame", "single_subject_only": true, "no_other_people_visible": true, "no_extra_reflections": true, "no_background_figures": true, "full_body_visible": true, "head_to_toe_in_frame": true }, "subject": { "demographics": "young woman (age 20+), blonde", "presence": "only one visible person in the entire scene", "hair": { "color": "warm blonde (no dark roots)", "style": "center part, long straight sleek hair", "texture": "smooth, slightly glossy, natural fall" }, "face": { "structure": "heart-shaped face with softer jawline and rounded chin", "eyes": "large round eyes with neutral outer corners", "brows": "naturally arched brows with medium thickness", "nose": "small softly rounded nose", "lips": "medium-full lips with defined cupid's bow", "expression": "confident relaxed expression with gentle half-smile", "makeup": "fresh natural makeup, light glow on skin, soft peach lips, minimal eyeliner", "visibility": "fully visible face, unobstructed, clearly distinct identity" }, "body": { "pose": "standing upright facing mirror directly, both arms relaxed naturally down along sides", "posture": "balanced stance with one knee slightly bent forward creating a subtle S-curve", "anatomy": { "curves": "natural feminine proportions with realistic gravity", "chest": "bikini-supported natural shape", "details": "visible collarbones, relaxed shoulders, natural thigh and calf definition" }, "feet": { "visibility": "fully visible at bottom of frame", "position": "barefoot, one foot slightly forward for dynamic stance" }, "skin_texture": "natural skin texture with subtle realistic sheen" } }, "attire_and_accessories": { 
"clothing": { "top": "black triangle string bikini top with thin straps", "bottom_detail": "matching bikini bottom strings visible at hips" }, "footwear": "barefoot", "jewelry": { "necklaces": [], "wrists": [], "fingers": "no rings" }, "props": { "hands": "empty hands, no phone or objects" } }, "environment": { "setting": "modern luxury bathroom interior, daytime", "foreground_element": { "mirror": "large rectangular mirror with GREY MARBLE FRAME", "material": "polished stone marble with subtle cool grey veining" }, "background": { "elements": [ "marble wall tiles", "floating vanity with minimalist design", "sleek chrome faucet", "soft folded white towels", "clean tiled floor fully visible" ], "style": "minimalist luxury aesthetic", "clutter": "none" } }, "lighting_and_atmosphere": { "source": "natural daylight from camera-left", "quality": "soft directional light", "effects": [ "gentle highlights along cheekbones and shoulders", "soft shadow falloff along waist and legs" ], "contrast": "moderate and realistic" }, "camera_and_technical": { "perspective": "mirror perspective without phone blocking face", "camera_position": "eye level, slightly angled downward", "framing": "vertical 3:4, full head-to-toe body visible without cropping feet", "focus": "sharp focus on subject, background slightly softened", "visual_fidelity": "real smartphone camera aesthetic, mild natural noise, no HDR effect" }, "realism_constraints": { "allowed": [ "natural skin texture", "minor asymmetry", "realistic body proportions", "natural lighting imperfections" ], "forbidden": [ "multiple people", "extra person", "background people", "beauty filters", "airbrushed skin", "anime style", "cartoon style", "cropped feet", "cut-off legs", "phone in hand" ] } }
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. A KOMODO DRAGON (Varanus komodoensis) is present on the left side of frame, facing the gladiator, placed farther from the lens than the gladiator (secondary subject). The Komodo dragon looks powerful and real: thick muscular body, rough scaled skin, heavy tail, strong legs, low stalking posture, forked tongue visible. 
Keep its size believable (large adult), with accurate anatomy and texture. (NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact”. The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the Komodo dragon. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face (cheekbone, nose line, jaw, lips, eye, brow) — fully visible, sharply defined, and perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. His facial expression must be readable: concentrated, controlled, predatory calm. Body/action: Gladiator leaning forward, shield raised defensively, sword low and ready, muscles and chest hair clearly detailed. Komodo dragon advancing slowly or pausing mid-step, tongue flicking, dust disturbed by its claws, tension high. NO blood, NO wounds, NO biting, NO visible injuries, NO aftermath. Suspense and power, not violence. COMPOSITION (CRITICAL — GLADIATOR FIRST): Low-angle heroic shot, camera near sand level. Gladiator dominates the foreground and frame (waist-up or 3/4 body), with face and torso fully visible, not cut off. Komodo dragon occupies the left mid-ground (clearly visible but NOT closer than the man). Amphitheater arches clearly recognizable in the background. Clean horizon, strong depth separation (shallow-to-moderate DOF). Keep the gladiator’s face and chest in tack-sharp focus; keep the Komodo dragon recognizable and detailed. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Perfect, professional illumination on the gladiator: Warm golden sunlight as key light (or strong soft key from sun direction), PLUS soft fill light to remove harsh shadows on face and torso. Subtle rim light outlining shoulders/helmet/arms. Every muscle, chest hair texture, skin pores, and facial details are clearly visible. 
Komodo dragon and background may be slightly less bright; priority is the gladiator. No underexposure on the man. No blown highlights. CAMERA / LENS: Full-frame look, 35mm or 50mm (epic scale, minimal distortion). Fast shutter feel, subject sharp, dust provides motion cues only. Realistic bokeh, controlled contrast, filmic color grading. No HDR, no neon, no fantasy glow. QUALITY CONTROL: Photorealistic, historical-epic campaign finish. No artifacts, no warped anatomy, no extra limbs, no deformed hands. No random logos, no watermarks, no text. NEGATIVE PROMPT: lion, bear, dragon fantasy, dinosaur, crocodile, snake, extra animals, multiple komodo dragons, cartoon komodo, blood, gore, injury, wounds, biting, mauling, dead animal, dead body, graphic violence, dismemberment, low resolution, blurry, plastic skin, over-smoothed, uncanny valley, extra fingers, extra limbs, warped anatomy, cartoon, illustration, CGI look, fantasy lighting, neon, oversaturation, harsh HDR, random text, misspelled words, logos, watermarks
{ "project_meta": { "task_name": "Hyper-realistic Screen Simulation", "target_model": "SDXL_1.0_Refiner", "aspect_ratio": "3:4", "resolution": { "width": 1152, "height": 1536 } }, "scene_composition": { "layer_1_physical_environment": { "camera_angle": "High-angle, downward shot (POV)", "subject_anchor": "MacBook Screen (95% fill)", "foreground_element": "Thin strip of physical keyboard visible at bottom edge", "surface_imperfections": [ "Visible RGB pixel-grid texture (Moiré effect)", "Micro-dust particles on glass surface", "Faint ambient light reflections (glossy finish)", "Subtle fingerprint smudges" ] }, "layer_2_digital_interface": { "os_theme": "macOS Dark Mode", "active_window": { "app_name": "Photo Booth", "status": "Live Preview Mode", "position": "Dominant/Center-Left" } }, "layer_3_nested_content": { "location": "Inside the Photo Booth window", "setting": { "room": "Dim bedroom", "background": "Off-white wall, rumpled bedding", "lighting": "Mixed lighting (Cool blue screen glow + Warm skin tones), deep nocturnal shadows" }, "subject_details": { "identity_source": "uploaded_female_reference_image", "demographics": "Young adult female", "appearance": { "apparel": "Black top, grey bottoms", "expression": "Relaxed, candid, slight smile", "pose": "Reclining/Lying down, looking at screen" }, "props": { "item": "iPhone 15 Pro", "hand_position": "Held in right hand" } } } }, "generation_parameters": { "prompts": { "positive": "Hyper-realistic downward shot of a MacBook screen. The screen surface has visible dust, pixel grid, and reflection. The screen displays a macOS desktop in dark mode with a dominant Photo Booth live-preview window. Inside the window: A girl in a dark bedroom with an off-white wall and rumpled bedding. The girl is lying down, wearing a black top and grey bottoms, holding an iPhone 15 Pro in her right hand. Her face is fully visible (reference matched). 
Lighting is low-key, candid, nocturnal, with bluish screen glow mixed with warm skin tones. High fidelity, raw photo, unedited, natural noise.", "negative": "vector art, screenshot, flat digital image, clean glass, perfect screen, daylight, bright studio lights, cartoon, 3d render, painting, watermark, deformed hands, blurry face" }, "sampling_settings": { "sampler": "DPM++ 3M SDE Exponential", "steps": 40, "cfg_scale": 5.5, "denoising_strength": 0.35 } }, "control_net_configuration": { "identity_strictness": "CRITICAL", "stack": [ { "unit": "ControlNet_Tile", "weight": 0.4, "purpose": "Maintain text/interface sharpness and screen texture" }, { "unit": "IP-Adapter_FaceID_Plus", "weight": 0.95, "region_mask": "Photo Booth Window Area Only", "source_image": "uploaded_female_reference_image" } ] } }
Ultra-realistic cinematic image or short video sequence (3–6 seconds). Reference A and B represent the SAME real human identity. Reference C is the target scene and must be matched with maximum physical and visual fidelity. This system performs: 1) identity preservation 2) angle alignment 3) pose cloning 4) scene reconstruction 5) video stability ================================================== PHASE 1 — ABSOLUTE IDENTITY LOCK (NON-NEGOTIABLE) ================================================== The face, hair, beard and identity from A+B MUST remain EXACT. Includes: - identical facial geometry - identical proportions and spatial relationships - identical asymmetry (eyes, nose, mouth, jawline) - identical skin texture, pores and imperfections - identical beard density and irregular edges - identical hairline, hairstyle and volume MANDATORY: - preserve all imperfections - preserve asymmetry exactly FORBIDDEN: - beautification - smoothing - symmetry correction - enhancement - facial reshaping Face must look like real photography. 
================================================== PHASE 2 — ANGLE + CAMERA ALIGNMENT (CRITICAL) ================================================== Before scene integration, align identity to match C: - match head tilt (left/right) - match vertical angle (up/down) - match face rotation - match gaze direction CAMERA MATCH: - match camera distance from C - match perspective compression - maintain natural focal length (35mm–85mm) MANDATORY: - keep facial proportions intact - avoid distortion FORBIDDEN: - stretching face - resizing head unnaturally - perspective mismatch ================================================== PHASE 3 — POSE CLONING (MAXIMUM ACCURACY) ================================================== Replicate pose from C: - same body position - same joint angles - same posture - same balance and weight distribution CRITICAL RULE: If pose conflicts with identity: → PRESERVE IDENTITY AND ADAPT BODY SLIGHTLY ================================================== PHASE 4 — GESTURE + PHYSICAL INTENT ================================================== - match body tension - match gesture intensity - match physical intention WITHOUT modifying face geometry. 
================================================== PHASE 5 — OBJECTS + ACCESSORIES (FULL CLONE) ================================================== - identical weapon or objects - identical placement, orientation and scale - identical clothing - identical gear - identical accessory placement ================================================== PHASE 6 — SCENE + BACKGROUND (FULL CLONE) ================================================== - identical environment - identical spatial layout - identical depth - identical framing and composition ================================================== PHASE 7 — LIGHTING STRUCTURE (CONTROLLED CLONE) ================================================== - match light direction from C - match shadow placement - match global contrast FACE PROTECTION: - preserve original skin tone from A+B - preserve undertone - avoid color cast on face - avoid orange/blue cinematic contamination Face remains neutral, natural, human. ================================================== PHASE 8 — OPTICAL CONSISTENCY (ANTI-DEFORMATION) ================================================== - maintain correct head-to-body proportion - maintain natural scale - maintain spatial coherence FORBIDDEN: - wide-angle distortion - head scaling errors - perspective mismatch ================================================== PHASE 9 — HANDS + CONTACT PHYSICS ================================================== - natural grip - correct finger wrapping - visible pressure - realistic interaction FORBIDDEN: - floating objects - broken anatomy ================================================== PHASE 10 — DERMAL REALISM (HUMAN QUALITY) ================================================== - visible pores - uneven texture - natural tonal variation - realistic beard transition FORBIDDEN: - plastic skin - over-smoothing - artificial sharpness ================================================== PHASE 11 — ENVIRONMENT INTEGRATION ================================================== - subject 
must feel physically present - correct depth and scale ALLOWED: - subtle dust/sweat on body RESTRICTED: - minimal effect on face - no change in facial tone ================================================== PHASE 12 — GLOBAL NO-DEFORMATION RULE ================================================== - do not stretch face - do not change proportions - do not distort head If conflict appears: → ALWAYS PRESERVE IDENTITY FIRST ================================================== PHASE 13 — VIDEO STABILITY (KLING 3.0 READY) ================================================== - stable identity across frames - stable skin tone - stable dermal texture - stable pose FORBIDDEN: - flicker - morphing - identity drift - lighting instability on face ================================================== FINAL RESULT ================================================== Same real human from A+B fully integrated into scene C. Identity preserved 100% (face, hair, beard locked) Angle aligned with scene Pose cloned with maximum accuracy Objects and environment matched Lighting structure matched without damaging skin Result must be: - human - natural - physically believable - visually coherent - stable for Kling 3.0
{ "reference_source": "Use the uploaded reference image as the single source of truth", "identity_lock": { "face": "100% identical to reference image — same facial structure, eyes, nose, lips, jawline, skin texture, expression", "body": "Same body proportions, height, posture, shoulder width, waist, hips, limbs", "hair": "Same hairstyle, hairline, length, color, volume", "age": "Exact same apparent age as reference", "ethnicity": "Unchanged from reference" }, "consistency_rules": [ "Do NOT modify face shape", "Do NOT beautify, stylize, or alter facial features", "No AI face swap artifacts", "No face blending with other identities", "No body reshaping", "No weight gain or loss", "No added or removed tattoos, scars, or marks" ], "wardrobe": { "instruction": "Only change clothing if explicitly specified", "fit": "Clothing fits naturally and accurately to the same body", "fabric_behavior": "Realistic folds, tension, and shadows" }, "pose_and_scene": { "pose": "Preserve the same pose unless explicitly changed", "camera_angle": "Same camera height, distance, and lens perspective", "lighting": "Match original lighting direction, softness, and intensity", "environment": "Realistic environment with consistent scale and depth" }, "image_quality": { "realism": "Photorealistic, natural skin texture", "clarity": "Sharp focus on face and eyes", "resolution": "High-definition, clean details", "artifacts": "No blur, distortion, double face, broken anatomy" }, "negative_prompt": [ "different face", "face drift", "face mutation", "body distortion", "incorrect anatomy", "extra limbs", "blurred face", "plastic skin", "AI-generated look", "over-smoothing", "unreal lighting" ], "final_instruction": "The generated image must look like the SAME PERSON photographed again, not a variation or look-alike." 
} { "prompt": "Intimate full-body mirror selfie portrait of a young woman in bedroom, standing in front of large ornate gold-framed mirror, back turned slightly to show rear view while taking selfie with phone in one hand, long dark brown hair tied in high messy bun with loose strands, fair tan skin with natural glow and subtle texture, wearing ultra-sheer black lace lingerie set with intricate mesh panels, high-cut thong bottoms, garter straps connected to thigh-high stockings, thin spaghetti straps on bra top, silver metallic high-heel mules visible on feet, arched back and hips emphasized for seductive pose, one arm extended holding phone, other hand on mirror or hip, bedroom background with beige walls, white bed sheets, scattered books/candles, soft warm golden lamp light from side creating rim glow on body curves lace texture and hair, subtle blue LED accent from device, shallow depth of field, strong cinematic bokeh on mirror reflection and background, photorealistic sensual lingerie fashion selfie, high detail lace transparency skin pores hair strands garter straps heel shine and natural imperfections, shot on smartphone with slight wide-angle distortion typical for mirror shots, ultra detailed, 8k resolution", "negative_prompt": "cartoon, anime, illustration, painting, deformed, blurry, lowres, plastic skin, doll-like, heavy makeup overload, thick eyeliner, false lashes, filters, beauty filter, airbrushed skin, extra limbs, distorted proportions, asymmetrical face, harsh flash, cold lighting, day time, fully clothed, short hair, blonde hair, standing straight, no mirror, no phone, crowded room, text watermark, logo, ugly, bad anatomy, overexposed, underexposed, low contrast", "reference_image": { "enabled": true, "strength": 0.92, "description": "Extremely strong reference for exact composition and pose: young woman taking mirror selfie showing back/rear view, long dark hair in messy bun, sheer black lace lingerie with garter straps and high-cut bottoms, 
arched back emphasizing curves, phone in hand, ornate gold mirror frame, soft warm bedroom light with blue accent, seductive intimate vibe, high detail lace and skin texture" }, "style": "photorealistic intimate mirror selfie, sensual lingerie boudoir, warm golden bedroom lighting", "aspect_ratio": "3:4", "lighting": "soft warm golden lamp light from side, gentle rim light on back curves hair and lace, subtle blue LED fill from phone, realistic soft shadows and highlights", "camera": "smartphone mirror selfie (iPhone style), slight wide-angle distortion, natural low-key indoor light, minor grain for realism", "additional_details": [ "hair: dark brown-black, long, tied in high messy bun with loose flyaways and strands down back", "outfit: sheer black mesh lace bra and high-cut thong bottoms, garter straps with metal rings, thin straps, semi-transparent fabric clinging to body
EDIT PROMPT (CLEAR & STRICT): Replace only the bikini she is currently wearing with the provided reference bikini. The new bikini must fit her perfectly and tightly, matching the reference exactly in color, fabric, cut, straps, and details. Do NOT change anything else. 
Preserve the same woman, same body, same proportions Keep the exact pose, angle, expression, lighting, background, and camera framing No changes to face, hair, skin tone, or body shape She remains default/slim as she already is (not fat, not obese, no body exaggeration) This is a wardrobe-only replacement. Everything except the bikini stays 100% identical to the original image. Negative instructions: No resizing or reshaping of the body No stylization or artistic changes No added accessories No changes to posture or anatomy No distortion or blur Result: Photorealistic, natural-looking bikini replacement that appears originally worn by her, with realistic fabric tension and clean edges.
Ultra-realistic cinematic image or short video sequence (3–6 seconds). Reference A and B represent the SAME real human identity. Reference C is the target scene and must be matched with maximum physical and visual fidelity. This system performs: 1) identity preservation 2) angle alignment 3) pose cloning 4) scene reconstruction 5) video stability ================================================== PHASE 1 — ABSOLUTE IDENTITY LOCK (NON-NEGOTIABLE) ================================================== The face, hair, beard and identity from A+B MUST remain EXACT. Includes: - identical facial geometry - identical proportions and spatial relationships - identical asymmetry (eyes, nose, mouth, jawline) - identical skin texture, pores and imperfections - identical beard density and irregular edges - identical hairline, hairstyle and volume MANDATORY: - preserve all imperfections - preserve asymmetry exactly FORBIDDEN: - beautification - smoothing - symmetry correction - enhancement - facial reshaping Face must look like real photography. 
================================================== PHASE 2 — ANGLE + CAMERA ALIGNMENT (CRITICAL) ================================================== Before scene integration, align identity to match C: - match head tilt (left/right) - match vertical angle (up/down) - match face rotation - match gaze direction CAMERA MATCH: - match camera distance from C - match perspective compression - maintain natural focal length (35mm–85mm) MANDATORY: - keep facial proportions intact - avoid distortion FORBIDDEN: - stretching face - resizing head unnaturally - perspective mismatch ================================================== PHASE 3 — POSE CLONING (MAXIMUM ACCURACY) ================================================== Replicate pose from C: - same body position - same joint angles - same posture - same balance and weight distribution CRITICAL RULE: If pose conflicts with identity: → PRESERVE IDENTITY AND ADAPT BODY SLIGHTLY ================================================== PHASE 4 — GESTURE + PHYSICAL INTENT ================================================== - match body tension - match gesture intensity - match physical intention WITHOUT modifying face geometry. 
================================================== PHASE 5 — OBJECTS + ACCESSORIES (FULL CLONE) ================================================== - identical weapon or objects - identical placement, orientation and scale - identical clothing - identical gear - identical accessory placement ================================================== PHASE 6 — SCENE + BACKGROUND (FULL CLONE) ================================================== - identical environment - identical spatial layout - identical depth - identical framing and composition ================================================== PHASE 7 — LIGHTING STRUCTURE (CONTROLLED CLONE) ================================================== - match light direction from C - match shadow placement - match global contrast FACE PROTECTION: - preserve original skin tone from A+B - preserve undertone - avoid color cast on face - avoid orange/blue cinematic contamination Face remains neutral, natural, human. ================================================== PHASE 8 — OPTICAL CONSISTENCY (ANTI-DEFORMATION) ================================================== - maintain correct head-to-body proportion - maintain natural scale - maintain spatial coherence FORBIDDEN: - wide-angle distortion - head scaling errors - perspective mismatch ================================================== PHASE 9 — HANDS + CONTACT PHYSICS ================================================== - natural grip - correct finger wrapping - visible pressure - realistic interaction FORBIDDEN: - floating objects - broken anatomy ================================================== PHASE 10 — DERMAL REALISM (HUMAN QUALITY) ================================================== - visible pores - uneven texture - natural tonal variation - realistic beard transition FORBIDDEN: - plastic skin - over-smoothing - artificial sharpness ================================================== PHASE 11 — ENVIRONMENT INTEGRATION ================================================== - subject 
must feel physically present - correct depth and scale ALLOWED: - subtle dust/sweat on body RESTRICTED: - minimal effect on face - no change in facial tone ================================================== PHASE 12 — GLOBAL NO-DEFORMATION RULE ================================================== - do not stretch face - do not change proportions - do not distort head If conflict appears: → ALWAYS PRESERVE IDENTITY FIRST ================================================== PHASE 13 — VIDEO STABILITY (KLING 3.0 READY) ================================================== - stable identity across frames - stable skin tone - stable dermal texture - stable pose FORBIDDEN: - flicker - morphing - identity drift - lighting instability on face ================================================== FINAL RESULT ================================================== Same real human from A+B fully integrated into scene C. Identity preserved 100% (face, hair, beard locked) Angle aligned with scene Pose cloned with maximum accuracy Objects and environment matched Lighting structure matched without damaging skin Result must be: - human - natural - physically believable - visually coherent - stable for Kling 3.0
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. 
It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration — realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. CAMERA / ANGLE (UPDATED — MORE CINEMATIC) Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face). Lens feel: 35mm full-frame, close enough for dramatic scale. Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him. Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet). 
Moderate DOF for separation; background arches and crowd remain soft. COMPOSITION (CRITICAL — GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhino realistically lit but slightly less emphasized than the man.
REALISTIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT ELEPHANT is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). The elephant is a large adult bull, anatomically accurate, massive scale and weight, rough wrinkled skin texture, visible tusks. 
Ears fully spread wide in an aggressive threat display, head lowered slightly, trunk tense or partially curled, body angled forward as if preparing to charge. No fantasy exaggeration — size and anatomy must be realistic. (NO hippo, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the elephant. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Elephant holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. COMPOSITION (CRITICAL — GLADIATOR FIRST): Low-angle heroic shot, camera near sand level. Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Elephant occupies the mid-ground, clearly readable but NOT closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation, shallow-to-moderate DOF. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms.
Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Elephant realistically lit but slightly less emphasized than the man.
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. 
It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration — realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. CAMERA / ANGLE (UPDATED — MORE CINEMATIC) Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face). Lens feel: 35mm full-frame, close enough for dramatic scale. Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him. Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet). 
Moderate DOF for separation; background arches and crowd remain soft. COMPOSITION (CRITICAL — GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator, Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhino realistically lit but slightly less emphasized than the man.
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. 
It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration — realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. CAMERA / ANGLE (UPDATED — MORE CINEMATIC): Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face). Lens feel: 35mm full-frame, close enough for dramatic scale. Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him. Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet).
Moderate DOF for separation; background arches and crowd remain soft. COMPOSITION (CRITICAL — GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhino realistically lit but slightly less emphasized than the man.
Ultra-realistic cinematic image or short video sequence (3–6 seconds).
Reference A and B represent the SAME real human identity.
Reference C is the target scene and must be matched with maximum physical and visual fidelity.

This system performs:
1) identity preservation
2) angle alignment
3) pose cloning
4) scene reconstruction
5) video stability

==================================================
PHASE 1 — ABSOLUTE IDENTITY LOCK (NON-NEGOTIABLE)
==================================================
The face, hair, beard and identity from A+B MUST remain EXACT.

Includes:
- identical facial geometry
- identical proportions and spatial relationships
- identical asymmetry (eyes, nose, mouth, jawline)
- identical skin texture, pores and imperfections
- identical beard density and irregular edges
- identical hairline, hairstyle and volume

MANDATORY:
- preserve all imperfections
- preserve asymmetry exactly

FORBIDDEN:
- beautification
- smoothing
- symmetry correction
- enhancement
- facial reshaping

Face must look like real photography.

==================================================
PHASE 2 — ANGLE + CAMERA ALIGNMENT (CRITICAL)
==================================================
Before scene integration, align identity to match C:
- match head tilt (left/right)
- match vertical angle (up/down)
- match face rotation
- match gaze direction

CAMERA MATCH:
- match camera distance from C
- match perspective compression
- maintain natural focal length (35mm–85mm)

MANDATORY:
- keep facial proportions intact
- avoid distortion

FORBIDDEN:
- stretching face
- resizing head unnaturally
- perspective mismatch

==================================================
PHASE 3 — POSE CLONING (MAXIMUM ACCURACY)
==================================================
Replicate pose from C:
- same body position
- same joint angles
- same posture
- same balance and weight distribution

CRITICAL RULE:
If pose conflicts with identity:
→ PRESERVE IDENTITY AND ADAPT BODY SLIGHTLY

==================================================
PHASE 4 — GESTURE + PHYSICAL INTENT
==================================================
- match body tension
- match gesture intensity
- match physical intention

WITHOUT modifying face geometry.

==================================================
PHASE 5 — OBJECTS + ACCESSORIES (FULL CLONE)
==================================================
- identical weapon or objects
- identical placement, orientation and scale
- identical clothing
- identical gear
- identical accessory placement

==================================================
PHASE 6 — SCENE + BACKGROUND (FULL CLONE)
==================================================
- identical environment
- identical spatial layout
- identical depth
- identical framing and composition

==================================================
PHASE 7 — LIGHTING STRUCTURE (CONTROLLED CLONE)
==================================================
- match light direction from C
- match shadow placement
- match global contrast

FACE PROTECTION:
- preserve original skin tone from A+B
- preserve undertone
- avoid color cast on face
- avoid orange/blue cinematic contamination

Face remains neutral, natural, human.

==================================================
PHASE 8 — OPTICAL CONSISTENCY (ANTI-DEFORMATION)
==================================================
- maintain correct head-to-body proportion
- maintain natural scale
- maintain spatial coherence

FORBIDDEN:
- wide-angle distortion
- head scaling errors
- perspective mismatch

==================================================
PHASE 9 — HANDS + CONTACT PHYSICS
==================================================
- natural grip
- correct finger wrapping
- visible pressure
- realistic interaction

FORBIDDEN:
- floating objects
- broken anatomy

==================================================
PHASE 10 — DERMAL REALISM (HUMAN QUALITY)
==================================================
- visible pores
- uneven texture
- natural tonal variation
- realistic beard transition

FORBIDDEN:
- plastic skin
- over-smoothing
- artificial sharpness

==================================================
PHASE 11 — ENVIRONMENT INTEGRATION
==================================================
- subject must feel physically present
- correct depth and scale

ALLOWED:
- subtle dust/sweat on body

RESTRICTED:
- minimal effect on face
- no change in facial tone

==================================================
PHASE 12 — GLOBAL NO-DEFORMATION RULE
==================================================
- do not stretch face
- do not change proportions
- do not distort head

If conflict appears:
→ ALWAYS PRESERVE IDENTITY FIRST

==================================================
PHASE 13 — VIDEO STABILITY (KLING 3.0 READY)
==================================================
- stable identity across frames
- stable skin tone
- stable dermal texture
- stable pose

FORBIDDEN:
- flicker
- morphing
- identity drift
- lighting instability on face

==================================================
FINAL RESULT
==================================================
Same real human from A+B fully integrated into scene C.

Identity preserved 100% (face, hair, beard locked)
Angle aligned with scene
Pose cloned with maximum accuracy
Objects and environment matched
Lighting structure matched without damaging skin

Result must be:
- human
- natural
- physically believable
- visually coherent
- stable for Kling 3.0
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. 
It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration — realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. CAMERA / ANGLE (UPDATED — MORE CINEMATIC) Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face). Lens feel: 35mm full-frame, close enough for dramatic scale. Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him. Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet). 
Moderate DOF for separation; background arches and crowd remain soft. COMPOSITION (CRITICAL — GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator, Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhino realistically lit but slightly less emphasized than the man.
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. 
It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration — realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. CAMERA / ANGLE (UPDATED — MORE CINEMATIC) Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face). Lens feel: 35mm full-frame, close enough for dramatic scale. Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him. Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet). 
Moderate DOF for separation; background arches and crowd remain soft. COMPOSITION (CRITICAL — GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhino realistically lit but slightly less emphasized than the man.
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. A KOMODO DRAGON (Varanus komodoensis) is present on the left side of frame, facing the gladiator, placed farther from the lens than the gladiator (secondary subject). The Komodo dragon looks powerful and real: thick muscular body, rough scaled skin, heavy tail, strong legs, low stalking posture, forked tongue visible. 
Keep its size believable (large adult), with accurate anatomy and texture. (NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact”. The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the Komodo dragon. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face (cheekbone, nose line, jaw, lips, eye, brow) — fully visible, sharply defined, and perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. His facial expression must be readable: concentrated, controlled, predatory calm. Body/action: Gladiator leaning forward, shield raised defensively, sword low and ready, muscles and chest hair clearly detailed. Komodo dragon advancing slowly or pausing mid-step, tongue flicking, dust disturbed by its claws, tension high. NO blood, NO wounds, NO biting, NO visible injuries, NO aftermath. Suspense and power, not violence. COMPOSITION (CRITICAL — GLADIATOR FIRST): Low-angle heroic shot, camera near sand level. Gladiator dominates the foreground and frame (waist-up or 3/4 body), with face and torso fully visible, not cut off. Komodo dragon occupies the left mid-ground (clearly visible but NOT closer than the man). Amphitheater arches clearly recognizable in the background. Clean horizon, strong depth separation (shallow-to-moderate DOF). Keep the gladiator’s face and chest in tack-sharp focus; keep the Komodo dragon recognizable and detailed. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Perfect, professional illumination on the gladiator: Warm golden sunlight as key light (or strong soft key from sun direction), PLUS soft fill light to remove harsh shadows on face and torso. Subtle rim light outlining shoulders/helmet/arms. Every muscle, chest hair texture, skin pores, and facial details are clearly visible. 
Komodo dragon and background may be slightly less bright; priority is the gladiator. No underexposure on the man. No blown highlights. CAMERA / LENS: Full-frame look, 35mm or 50mm (epic scale, minimal distortion). Fast shutter feel, subject sharp, dust provides motion cues only. Realistic bokeh, controlled contrast, filmic color grading. No HDR, no neon, no fantasy glow. QUALITY CONTROL: Photorealistic, historical-epic campaign finish. No artifacts, no warped anatomy, no extra limbs, no deformed hands. No random logos, no watermarks, no text. NEGATIVE PROMPT: lion, bear, dragon fantasy, dinosaur, crocodile, snake, extra animals, multiple komodo dragons, cartoon komodo, blood, gore, injury, wounds, biting, mauling, dead animal, dead body, graphic violence, dismemberment, low resolution, blurry, plastic skin, over-smoothed, uncanny valley, extra fingers, extra limbs, warped anatomy, cartoon, illustration, CGI look, fantasy lighting, neon, oversaturation, harsh HDR, random text, misspelled words, logos, watermarks
Ultra-realistic cinematic image or short video sequence (3–6 seconds). Reference A and B represent the SAME real human identity. Reference C is the target scene and must be matched with maximum physical and visual fidelity. This system performs: 1) identity preservation 2) angle alignment 3) pose cloning 4) scene reconstruction 5) video stability ================================================== PHASE 1 — ABSOLUTE IDENTITY LOCK (NON-NEGOTIABLE) ================================================== The face, hair, beard and identity from A+B MUST remain EXACT. Includes: - identical facial geometry - identical proportions and spatial relationships - identical asymmetry (eyes, nose, mouth, jawline) - identical skin texture, pores and imperfections - identical beard density and irregular edges - identical hairline, hairstyle and volume MANDATORY: - preserve all imperfections - preserve asymmetry exactly FORBIDDEN: - beautification - smoothing - symmetry correction - enhancement - facial reshaping Face must look like real photography. 
================================================== PHASE 2 — ANGLE + CAMERA ALIGNMENT (CRITICAL) ================================================== Before scene integration, align identity to match C: - match head tilt (left/right) - match vertical angle (up/down) - match face rotation - match gaze direction CAMERA MATCH: - match camera distance from C - match perspective compression - maintain natural focal length (35mm–85mm) MANDATORY: - keep facial proportions intact - avoid distortion FORBIDDEN: - stretching face - resizing head unnaturally - perspective mismatch ================================================== PHASE 3 — POSE CLONING (MAXIMUM ACCURACY) ================================================== Replicate pose from C: - same body position - same joint angles - same posture - same balance and weight distribution CRITICAL RULE: If pose conflicts with identity: → PRESERVE IDENTITY AND ADAPT BODY SLIGHTLY ================================================== PHASE 4 — GESTURE + PHYSICAL INTENT ================================================== - match body tension - match gesture intensity - match physical intention WITHOUT modifying face geometry. 
================================================== PHASE 5 — OBJECTS + ACCESSORIES (FULL CLONE) ================================================== - identical weapon or objects - identical placement, orientation and scale - identical clothing - identical gear - identical accessory placement ================================================== PHASE 6 — SCENE + BACKGROUND (FULL CLONE) ================================================== - identical environment - identical spatial layout - identical depth - identical framing and composition ================================================== PHASE 7 — LIGHTING STRUCTURE (CONTROLLED CLONE) ================================================== - match light direction from C - match shadow placement - match global contrast FACE PROTECTION: - preserve original skin tone from A+B - preserve undertone - avoid color cast on face - avoid orange/blue cinematic contamination Face remains neutral, natural, human. ================================================== PHASE 8 — OPTICAL CONSISTENCY (ANTI-DEFORMATION) ================================================== - maintain correct head-to-body proportion - maintain natural scale - maintain spatial coherence FORBIDDEN: - wide-angle distortion - head scaling errors - perspective mismatch ================================================== PHASE 9 — HANDS + CONTACT PHYSICS ================================================== - natural grip - correct finger wrapping - visible pressure - realistic interaction FORBIDDEN: - floating objects - broken anatomy ================================================== PHASE 10 — DERMAL REALISM (HUMAN QUALITY) ================================================== - visible pores - uneven texture - natural tonal variation - realistic beard transition FORBIDDEN: - plastic skin - over-smoothing - artificial sharpness ================================================== PHASE 11 — ENVIRONMENT INTEGRATION ================================================== - subject 
must feel physically present - correct depth and scale ALLOWED: - subtle dust/sweat on body RESTRICTED: - minimal effect on face - no change in facial tone ================================================== PHASE 12 — GLOBAL NO-DEFORMATION RULE ================================================== - do not stretch face - do not change proportions - do not distort head If conflict appears: → ALWAYS PRESERVE IDENTITY FIRST ================================================== PHASE 13 — VIDEO STABILITY (KLING 3.0 READY) ================================================== - stable identity across frames - stable skin tone - stable dermal texture - stable pose FORBIDDEN: - flicker - morphing - identity drift - lighting instability on face ================================================== FINAL RESULT ================================================== Same real human from A+B fully integrated into scene C. Identity preserved 100% (face, hair, beard locked) Angle aligned with scene Pose cloned with maximum accuracy Objects and environment matched Lighting structure matched without damaging skin Result must be: - human - natural - physically believable - visually coherent - stable for Kling 3.0
REALISTIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT ELEPHANT is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). The elephant is a large adult bull, anatomically accurate, massive scale and weight, rough wrinkled skin texture, visible tusks. 
Ears fully spread wide in an aggressive threat display, head lowered slightly, trunk tense or partially curled, body angled forward as if preparing to charge. No fantasy exaggeration — size and anatomy must be realistic. (NO hippo, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the elephant. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Elephant holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. COMPOSITION (CRITICAL — GLADIATOR FIRST): Low-angle heroic shot, camera near sand level. Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Elephant occupies the mid-ground, clearly readable but NOT closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation, shallow-to-moderate DOF. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms. 
Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Elephant realistically lit but slightly less emphasized than the man.
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. 
It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration — realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. CAMERA / ANGLE (UPDATED — MORE CINEMATIC) Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face). Lens feel: 35mm full-frame, close enough for dramatic scale. Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him. Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet). 
Moderate DOF for separation; background arches and crowd remain soft. COMPOSITION (CRITICAL — GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm, bright golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhino realistically lit but slightly less emphasized than the man.
Ultra-realistic cinematic image or short video sequence (3–6 seconds).
Reference A and B represent the SAME real human identity.
Reference C is the target scene and must be matched with maximum physical and visual fidelity.

This system performs:
1) identity preservation
2) angle alignment
3) pose cloning
4) scene reconstruction
5) video stability

==================================================
PHASE 1 — ABSOLUTE IDENTITY LOCK (NON-NEGOTIABLE)
==================================================
The face, hair, beard and identity from A+B MUST remain EXACT.
Includes:
- identical facial geometry
- identical proportions and spatial relationships
- identical asymmetry (eyes, nose, mouth, jawline)
- identical skin texture, pores and imperfections
- identical beard density and irregular edges
- identical hairline, hairstyle and volume
MANDATORY:
- preserve all imperfections
- preserve asymmetry exactly
FORBIDDEN:
- beautification
- smoothing
- symmetry correction
- enhancement
- facial reshaping
Face must look like real photography.

==================================================
PHASE 2 — ANGLE + CAMERA ALIGNMENT (CRITICAL)
==================================================
Before scene integration, align identity to match C:
- match head tilt (left/right)
- match vertical angle (up/down)
- match face rotation
- match gaze direction
CAMERA MATCH:
- match camera distance from C
- match perspective compression
- maintain natural focal length (35mm–85mm)
MANDATORY:
- keep facial proportions intact
- avoid distortion
FORBIDDEN:
- stretching face
- resizing head unnaturally
- perspective mismatch

==================================================
PHASE 3 — POSE CLONING (MAXIMUM ACCURACY)
==================================================
Replicate pose from C:
- same body position
- same joint angles
- same posture
- same balance and weight distribution
CRITICAL RULE:
If pose conflicts with identity:
→ PRESERVE IDENTITY AND ADAPT BODY SLIGHTLY

==================================================
PHASE 4 — GESTURE + PHYSICAL INTENT
==================================================
- match body tension
- match gesture intensity
- match physical intention
WITHOUT modifying face geometry.

==================================================
PHASE 5 — OBJECTS + ACCESSORIES (FULL CLONE)
==================================================
- identical weapon or objects
- identical placement, orientation and scale
- identical clothing
- identical gear
- identical accessory placement

==================================================
PHASE 6 — SCENE + BACKGROUND (FULL CLONE)
==================================================
- identical environment
- identical spatial layout
- identical depth
- identical framing and composition

==================================================
PHASE 7 — LIGHTING STRUCTURE (CONTROLLED CLONE)
==================================================
- match light direction from C
- match shadow placement
- match global contrast
FACE PROTECTION:
- preserve original skin tone from A+B
- preserve undertone
- avoid color cast on face
- avoid orange/blue cinematic contamination
Face remains neutral, natural, human.

==================================================
PHASE 8 — OPTICAL CONSISTENCY (ANTI-DEFORMATION)
==================================================
- maintain correct head-to-body proportion
- maintain natural scale
- maintain spatial coherence
FORBIDDEN:
- wide-angle distortion
- head scaling errors
- perspective mismatch

==================================================
PHASE 9 — HANDS + CONTACT PHYSICS
==================================================
- natural grip
- correct finger wrapping
- visible pressure
- realistic interaction
FORBIDDEN:
- floating objects
- broken anatomy

==================================================
PHASE 10 — DERMAL REALISM (HUMAN QUALITY)
==================================================
- visible pores
- uneven texture
- natural tonal variation
- realistic beard transition
FORBIDDEN:
- plastic skin
- over-smoothing
- artificial sharpness

==================================================
PHASE 11 — ENVIRONMENT INTEGRATION
==================================================
- subject must feel physically present
- correct depth and scale
ALLOWED:
- subtle dust/sweat on body
RESTRICTED:
- minimal effect on face
- no change in facial tone

==================================================
PHASE 12 — GLOBAL NO-DEFORMATION RULE
==================================================
- do not stretch face
- do not change proportions
- do not distort head
If conflict appears:
→ ALWAYS PRESERVE IDENTITY FIRST

==================================================
PHASE 13 — VIDEO STABILITY (KLING 3.0 READY)
==================================================
- stable identity across frames
- stable skin tone
- stable dermal texture
- stable pose
FORBIDDEN:
- flicker
- morphing
- identity drift
- lighting instability on face

==================================================
FINAL RESULT
==================================================
Same real human from A+B fully integrated into scene C.
- Identity preserved 100% (face, hair, beard locked)
- Angle aligned with scene
- Pose cloned with maximum accuracy
- Objects and environment matched
- Lighting structure matched without damaging skin
Result must be:
- human
- natural
- physically believable
- visually coherent
- stable for Kling 3.0
{ "project_meta": { "task_name": "Hyper-realistic Screen Simulation", "target_model": "SDXL_1.0_Refiner", "aspect_ratio": "3:4", "resolution": { "width": 1152, "height": 1536 } }, "scene_composition": { "layer_1_physical_environment": { "camera_angle": "High-angle, downward shot (POV)", "subject_anchor": "MacBook Screen (95% fill)", "foreground_element": "Thin strip of physical keyboard visible at bottom edge", "surface_imperfections": [ "Visible RGB pixel-grid texture (Moiré effect)", "Micro-dust particles on glass surface", "Faint ambient light reflections (glossy finish)", "Subtle fingerprint smudges" ] }, "layer_2_digital_interface": { "os_theme": "macOS Dark Mode", "active_window": { "app_name": "Photo Booth", "status": "Live Preview Mode", "position": "Dominant/Center-Left" } }, "layer_3_nested_content": { "location": "Inside the Photo Booth window", "setting": { "room": "Dim bedroom", "background": "Off-white wall, rumpled bedding", "lighting": "Mixed lighting (Cool blue screen glow + Warm skin tones), deep nocturnal shadows" }, "subject_details": { "identity_source": "uploaded_female_reference_image", "demographics": "Young adult female", "appearance": { "apparel": "Black top, grey bottoms", "expression": "Relaxed, candid, slight smile", "pose": "Reclining/Lying down, looking at screen" }, "props": { "item": "iPhone 15 Pro", "hand_position": "Held in right hand" } } } }, "generation_parameters": { "prompts": { "positive": "Hyper-realistic downward shot of a MacBook screen. The screen surface has visible dust, pixel grid, and reflection. The screen displays a macOS desktop in dark mode with a dominant Photo Booth live-preview window. Inside the window: A girl in a dark bedroom with an off-white wall and rumpled bedding. The girl is lying down, wearing a black top and grey bottoms, holding an iPhone 15 Pro in her right hand. Her face is fully visible (reference matched). 
Lighting is low-key, candid, nocturnal, with bluish screen glow mixed with warm skin tones. High fidelity, raw photo, unedited, natural noise.", "negative": "vector art, screenshot, flat digital image, clean glass, perfect screen, daylight, bright studio lights, cartoon, 3d render, painting, watermark, deformed hands, blurry face" }, "sampling_settings": { "sampler": "DPM++ 3M SDE Exponential", "steps": 40, "cfg_scale": 5.5, "denoising_strength": 0.35 } }, "control_net_configuration": { "identity_strictness": "CRITICAL", "stack": [ { "unit": "ControlNet_Tile", "weight": 0.4, "purpose": "Maintain text/interface sharpness and screen texture" }, { "unit": "IP-Adapter_FaceID_Plus", "weight": 0.95, "region_mask": "Photo Booth Window Area Only", "source_image": "uploaded_female_reference_image" } ] } }
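The "project_meta" block in the screen-simulation config above declares both an "aspect_ratio" of 3:4 and a resolution of 1152 x 1536, so the two values can be checked against each other before the config is used. The following is only a minimal sketch: the helper name `reduced_ratio` and the trimmed `meta` dictionary are illustrative assumptions, not part of any tool referenced by the prompt.

```python
from math import gcd


def reduced_ratio(width: int, height: int) -> str:
    """Return width:height reduced to lowest terms, e.g. '3:4'."""
    divisor = gcd(width, height)
    return f"{width // divisor}:{height // divisor}"


# Values copied from the "project_meta" block; the dict itself is a
# hypothetical, trimmed stand-in for the full config.
meta = {
    "aspect_ratio": "3:4",
    "resolution": {"width": 1152, "height": 1536},
}

ratio = reduced_ratio(meta["resolution"]["width"], meta["resolution"]["height"])

# Flag a mismatch between the declared ratio and the pixel dimensions.
if ratio != meta["aspect_ratio"]:
    raise ValueError(f"resolution gives {ratio}, config says {meta['aspect_ratio']}")
print(ratio)
```

A check like this catches the common case where a prompt is edited for a new aspect ratio but the pixel dimensions are left unchanged.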
{ "reference_source": "Use the uploaded reference image as the single source of truth", "identity_lock": { "face": "100% identical to reference image — same facial structure, eyes, nose, lips, jawline, skin texture, expression", "body": "Same body proportions, height, posture, shoulder width, waist, hips, limbs", "hair": "Same hairstyle, hairline, length, color, volume", "age": "Exact same apparent age as reference", "ethnicity": "Unchanged from reference" }, "consistency_rules": [ "Do NOT modify face shape", "Do NOT beautify, stylize, or alter facial features", "No AI face swap artifacts", "No face blending with other identities", "No body reshaping", "No weight gain or loss", "No added or removed tattoos, scars, or marks" ], "wardrobe": { "instruction": "Only change clothing if explicitly specified", "fit": "Clothing fits naturally and accurately to the same body", "fabric_behavior": "Realistic folds, tension, and shadows" }, "pose_and_scene": { "pose": "Preserve the same pose unless explicitly changed", "camera_angle": "Same camera height, distance, and lens perspective", "lighting": "Match original lighting direction, softness, and intensity", "environment": "Realistic environment with consistent scale and depth" }, "image_quality": { "realism": "Photorealistic, natural skin texture", "clarity": "Sharp focus on face and eyes", "resolution": "High-definition, clean details", "artifacts": "No blur, distortion, double face, broken anatomy" }, "negative_prompt": [ "different face", "face drift", "face mutation", "body distortion", "incorrect anatomy", "extra limbs", "blurred face", "plastic skin", "AI-generated look", "over-smoothing", "unreal lighting" ], "final_instruction": "The generated image must look like the SAME PERSON photographed again, not a variation or look-alike." 
} { "prompt": "Intimate full-body mirror selfie portrait of a young woman in bedroom, standing in front of large ornate gold-framed mirror, back turned slightly to show rear view while taking selfie with phone in one hand, long dark brown hair tied in high messy bun with loose strands, fair tan skin with natural glow and subtle texture, wearing ultra-sheer black lace lingerie set with intricate mesh panels, high-cut thong bottoms, garter straps connected to thigh-high stockings, thin spaghetti straps on bra top, silver metallic high-heel mules visible on feet, arched back and hips emphasized for seductive pose, one arm extended holding phone, other hand on mirror or hip, bedroom background with beige walls, white bed sheets, scattered books/candles, soft warm golden lamp light from side creating rim glow on body curves lace texture and hair, subtle blue LED accent from device, shallow depth of field, strong cinematic bokeh on mirror reflection and background, photorealistic sensual lingerie fashion selfie, high detail lace transparency skin pores hair strands garter straps heel shine and natural imperfections, shot on smartphone with slight wide-angle distortion typical for mirror shots, ultra detailed, 8k resolution", "negative_prompt": "cartoon, anime, illustration, painting, deformed, blurry, lowres, plastic skin, doll-like, heavy makeup overload, thick eyeliner, false lashes, filters, beauty filter, airbrushed skin, extra limbs, distorted proportions, asymmetrical face, harsh flash, cold lighting, day time, fully clothed, short hair, blonde hair, standing straight, no mirror, no phone, crowded room, text watermark, logo, ugly, bad anatomy, overexposed, underexposed, low contrast", "reference_image": { "enabled": true, "strength": 0.92, "description": "Extremely strong reference for exact composition and pose: young woman taking mirror selfie showing back/rear view, long dark hair in messy bun, sheer black lace lingerie with garter straps and high-cut bottoms, 
arched back emphasizing curves, phone in hand, ornate gold mirror frame, soft warm bedroom light with blue accent, seductive intimate vibe, high detail lace and skin texture" }, "style": "photorealistic intimate mirror selfie, sensual lingerie boudoir, warm golden bedroom lighting", "aspect_ratio": "3:4", "lighting": "soft warm golden lamp light from side, gentle rim light on back curves hair and lace, subtle blue LED fill from phone, realistic soft shadows and highlights", "camera": "smartphone mirror selfie (iPhone style), slight wide-angle distortion, natural low-key indoor light, minor grain for realism", "additional_details": [ "hair: dark brown-black, long, tied in high messy bun with loose flyaways and strands down back", "outfit: sheer black mesh lace bra and high-cut thong bottoms, garter straps with metal rings, thin straps, semi-transparent fabric clinging to body See less Nano Banana Recent Generating… { "reference_source": "Use the uploaded reference image as the single source of truth", "identity_lock": { "face": "100% identical to reference image — same facial structure, eyes, nose, lips, jawline, skin texture, expression", "body": "Same body proportions, height, posture, shoulder width, waist, hips, limbs", "hair": "Same hairstyle, hairline, length, color, volume", "age": "Exact same apparent age as reference", "ethnicity": "Unchanged from reference" }, "consistency_rules": [ "Do NOT modify face shape", "Do NOT beautify, stylize, or alter facial features", "No AI face swap artifacts", "No face blending with other identities", "No body reshaping", "No weight gain or loss", "No added or removed tattoos, scars, or marks" ], "wardrobe": { "instruction": "Only change clothing if explicitly specified", "fit": "Clothing fits naturally and accurately to the same body", "fabric_behavior": "Realistic folds, tension, and shadows" }, "pose_and_scene": { "pose": "Preserve the same pose unless explicitly changed", "camera_angle": "Same camera height, distance, 
and lens perspective", "lighting": "Match original lighting direction, softness, and intensity", "environment": "Realistic environment with consistent scale and depth" }, "image_quality": { "realism": "Photorealistic, natural skin texture", "clarity": "Sharp focus on face and eyes", "resolution": "High-definition, clean details", "artifacts": "No blur, distortion, double face, broken anatomy" }, "negative_prompt": [ "different face", "face drift", "face mutation", "body distortion", "incorrect anatomy", "extra limbs", "blurred face", "plastic skin", "AI-generated look", "over-smoothing", "unreal lighting" ], "final_instruction": "The generated image must look like the SAME PERSON photographed again, not a variation or look-alike." }
EDIT PROMPT (CLEAR & STRICT): Replace only the bikini she is currently wearing with the provided reference bikini. The new bikini must fit her perfectly and tightly, matching the reference exactly in color, fabric, cut, straps, and details. Do NOT change anything else. 
Preserve the same woman, same body, same proportions. Keep the exact pose, angle, expression, lighting, background, and camera framing. No changes to face, hair, skin tone, or body shape. She remains default/slim as she already is (not fat, not obese, no body exaggeration). This is a wardrobe-only replacement. Everything except the bikini stays 100% identical to the original image. Negative instructions: No resizing or reshaping of the body. No stylization or artistic changes. No added accessories. No changes to posture or anatomy. No distortion or blur. Result: Photorealistic, natural-looking bikini replacement that appears originally worn by her, with realistic fabric tension and clean edges.
Ultra-realistic cinematic image or short video sequence (3–6 seconds). Reference A and B represent the SAME real human identity. Reference C is the target scene and must be matched with maximum physical and visual fidelity. This system performs: 1) identity preservation 2) angle alignment 3) pose cloning 4) scene reconstruction 5) video stability ================================================== PHASE 1 — ABSOLUTE IDENTITY LOCK (NON-NEGOTIABLE) ================================================== The face, hair, beard and identity from A+B MUST remain EXACT. Includes: - identical facial geometry - identical proportions and spatial relationships - identical asymmetry (eyes, nose, mouth, jawline) - identical skin texture, pores and imperfections - identical beard density and irregular edges - identical hairline, hairstyle and volume MANDATORY: - preserve all imperfections - preserve asymmetry exactly FORBIDDEN: - beautification - smoothing - symmetry correction - enhancement - facial reshaping Face must look like real photography. 
================================================== PHASE 2 — ANGLE + CAMERA ALIGNMENT (CRITICAL) ================================================== Before scene integration, align identity to match C: - match head tilt (left/right) - match vertical angle (up/down) - match face rotation - match gaze direction CAMERA MATCH: - match camera distance from C - match perspective compression - maintain natural focal length (35mm–85mm) MANDATORY: - keep facial proportions intact - avoid distortion FORBIDDEN: - stretching face - resizing head unnaturally - perspective mismatch ================================================== PHASE 3 — POSE CLONING (MAXIMUM ACCURACY) ================================================== Replicate pose from C: - same body position - same joint angles - same posture - same balance and weight distribution CRITICAL RULE: If pose conflicts with identity: → PRESERVE IDENTITY AND ADAPT BODY SLIGHTLY ================================================== PHASE 4 — GESTURE + PHYSICAL INTENT ================================================== - match body tension - match gesture intensity - match physical intention WITHOUT modifying face geometry. 
================================================== PHASE 5 — OBJECTS + ACCESSORIES (FULL CLONE) ================================================== - identical weapon or objects - identical placement, orientation and scale - identical clothing - identical gear - identical accessory placement ================================================== PHASE 6 — SCENE + BACKGROUND (FULL CLONE) ================================================== - identical environment - identical spatial layout - identical depth - identical framing and composition ================================================== PHASE 7 — LIGHTING STRUCTURE (CONTROLLED CLONE) ================================================== - match light direction from C - match shadow placement - match global contrast FACE PROTECTION: - preserve original skin tone from A+B - preserve undertone - avoid color cast on face - avoid orange/blue cinematic contamination Face remains neutral, natural, human. ================================================== PHASE 8 — OPTICAL CONSISTENCY (ANTI-DEFORMATION) ================================================== - maintain correct head-to-body proportion - maintain natural scale - maintain spatial coherence FORBIDDEN: - wide-angle distortion - head scaling errors - perspective mismatch ================================================== PHASE 9 — HANDS + CONTACT PHYSICS ================================================== - natural grip - correct finger wrapping - visible pressure - realistic interaction FORBIDDEN: - floating objects - broken anatomy ================================================== PHASE 10 — DERMAL REALISM (HUMAN QUALITY) ================================================== - visible pores - uneven texture - natural tonal variation - realistic beard transition FORBIDDEN: - plastic skin - over-smoothing - artificial sharpness ================================================== PHASE 11 — ENVIRONMENT INTEGRATION ================================================== - subject 
must feel physically present - correct depth and scale ALLOWED: - subtle dust/sweat on body RESTRICTED: - minimal effect on face - no change in facial tone ================================================== PHASE 12 — GLOBAL NO-DEFORMATION RULE ================================================== - do not stretch face - do not change proportions - do not distort head If conflict appears: → ALWAYS PRESERVE IDENTITY FIRST ================================================== PHASE 13 — VIDEO STABILITY (KLING 3.0 READY) ================================================== - stable identity across frames - stable skin tone - stable dermal texture - stable pose FORBIDDEN: - flicker - morphing - identity drift - lighting instability on face ================================================== FINAL RESULT ================================================== Same real human from A+B fully integrated into scene C. Identity preserved 100% (face, hair, beard locked) Angle aligned with scene Pose cloned with maximum accuracy Objects and environment matched Lighting structure matched without damaging skin Result must be: - human - natural - physically believable - visually coherent - stable for Kling 3.0
REALISTIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT ELEPHANT is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). The elephant is a large adult bull, anatomically accurate, massive scale and weight, rough wrinkled skin texture, visible tusks. 
Ears fully spread wide in an aggressive threat display, head lowered slightly, trunk tense or partially curled, body angled forward as if preparing to charge. No fantasy exaggeration — size and anatomy must be realistic. (NO hippo, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the elephant. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Elephant holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. COMPOSITION (CRITICAL — GLADIATOR FIRST): Low-angle heroic shot, camera near sand level. Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Elephant occupies the mid-ground, clearly readable but NOT closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation, shallow-to-moderate DOF. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Elephant realistically lit but slightly less emphasized than the man.
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. 
It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration — realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. CAMERA / ANGLE (UPDATED — MORE CINEMATIC) Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face). Lens feel: 35mm full-frame, close enough for dramatic scale. Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him. Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet). 
Moderate DOF for separation; background arches and crowd remain soft. COMPOSITION (CRITICAL — GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhino realistically lit but slightly less emphasized than the man.
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. 
It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration — realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. CAMERA / ANGLE (UPDATED — MORE CINEMATIC) Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face). Lens feel: 35mm full-frame, close enough for dramatic scale. Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him. Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet). 
Moderate DOF for separation; background arches and crowd remain soft. COMPOSITION (CRITICAL — GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhino realistically lit but slightly less emphasized than the man.
{ "project_meta": { "task_name": "Hyper-realistic Screen Simulation", "target_model": "SDXL_1.0_Refiner", "aspect_ratio": "3:4", "resolution": { "width": 1152, "height": 1536 } }, "scene_composition": { "layer_1_physical_environment": { "camera_angle": "High-angle, downward shot (POV)", "subject_anchor": "MacBook Screen (95% fill)", "foreground_element": "Thin strip of physical keyboard visible at bottom edge", "surface_imperfections": [ "Visible RGB pixel-grid texture (Moiré effect)", "Micro-dust particles on glass surface", "Faint ambient light reflections (glossy finish)", "Subtle fingerprint smudges" ] }, "layer_2_digital_interface": { "os_theme": "macOS Dark Mode", "active_window": { "app_name": "Photo Booth", "status": "Live Preview Mode", "position": "Dominant/Center-Left" } }, "layer_3_nested_content": { "location": "Inside the Photo Booth window", "setting": { "room": "Dim bedroom", "background": "Off-white wall, rumpled bedding", "lighting": "Mixed lighting (Cool blue screen glow + Warm skin tones), deep nocturnal shadows" }, "subject_details": { "identity_source": "uploaded_female_reference_image", "demographics": "Young adult female", "appearance": { "apparel": "Black top, grey bottoms", "expression": "Relaxed, candid, slight smile", "pose": "Reclining/Lying down, looking at screen" }, "props": { "item": "iPhone 15 Pro", "hand_position": "Held in right hand" } } } }, "generation_parameters": { "prompts": { "positive": "Hyper-realistic downward shot of a MacBook screen. The screen surface has visible dust, pixel grid, and reflection. The screen displays a macOS desktop in dark mode with a dominant Photo Booth live-preview window. Inside the window: A girl in a dark bedroom with an off-white wall and rumpled bedding. The girl is lying down, wearing a black top and grey bottoms, holding an iPhone 15 Pro in her right hand. Her face is fully visible (reference matched). 
Lighting is low-key, candid, nocturnal, with bluish screen glow mixed with warm skin tones. High fidelity, raw photo, unedited, natural noise.", "negative": "vector art, screenshot, flat digital image, clean glass, perfect screen, daylight, bright studio lights, cartoon, 3d render, painting, watermark, deformed hands, blurry face" }, "sampling_settings": { "sampler": "DPM++ 3M SDE Exponential", "steps": 40, "cfg_scale": 5.5, "denoising_strength": 0.35 } }, "control_net_configuration": { "identity_strictness": "CRITICAL", "stack": [ { "unit": "ControlNet_Tile", "weight": 0.4, "purpose": "Maintain text/interface sharpness and screen texture" }, { "unit": "IP-Adapter_FaceID_Plus", "weight": 0.95, "region_mask": "Photo Booth Window Area Only", "source_image": "uploaded_female_reference_image" } ] } }
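The config above declares both an "aspect_ratio" of 3:4 and an explicit resolution of 1152 x 1536. A minimal sketch of a consistency check between the two fields follows; the helper name `matches_aspect_ratio` and the trimmed `config` dictionary are illustrative assumptions, not part of any generator's API.

```python
from fractions import Fraction


def matches_aspect_ratio(width: int, height: int, ratio: str) -> bool:
    """Return True when width:height reduces to the ratio given as 'W:H'."""
    # Hypothetical helper: parses a 'W:H' string and compares exact fractions,
    # so no floating-point tolerance is needed.
    ratio_w, ratio_h = (int(part) for part in ratio.split(":"))
    return Fraction(width, height) == Fraction(ratio_w, ratio_h)


# Only the two fields needed for the check, copied from the config above.
config = {
    "aspect_ratio": "3:4",
    "resolution": {"width": 1152, "height": 1536},
}

resolution = config["resolution"]
ok = matches_aspect_ratio(
    resolution["width"], resolution["height"], config["aspect_ratio"]
)
print("resolution matches aspect ratio:", ok)
```

Running a check like this before submitting a job catches a mismatch between the declared ratio and the pixel dimensions, which could otherwise lead to unexpected cropping or stretching, depending on the tool.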
{ "reference_source": "Use the uploaded reference image as the single source of truth", "identity_lock": { "face": "100% identical to reference image — same facial structure, eyes, nose, lips, jawline, skin texture, expression", "body": "Same body proportions, height, posture, shoulder width, waist, hips, limbs", "hair": "Same hairstyle, hairline, length, color, volume", "age": "Exact same apparent age as reference", "ethnicity": "Unchanged from reference" }, "consistency_rules": [ "Do NOT modify face shape", "Do NOT beautify, stylize, or alter facial features", "No AI face swap artifacts", "No face blending with other identities", "No body reshaping", "No weight gain or loss", "No added or removed tattoos, scars, or marks" ], "wardrobe": { "instruction": "Only change clothing if explicitly specified", "fit": "Clothing fits naturally and accurately to the same body", "fabric_behavior": "Realistic folds, tension, and shadows" }, "pose_and_scene": { "pose": "Preserve the same pose unless explicitly changed", "camera_angle": "Same camera height, distance, and lens perspective", "lighting": "Match original lighting direction, softness, and intensity", "environment": "Realistic environment with consistent scale and depth" }, "image_quality": { "realism": "Photorealistic, natural skin texture", "clarity": "Sharp focus on face and eyes", "resolution": "High-definition, clean details", "artifacts": "No blur, distortion, double face, broken anatomy" }, "negative_prompt": [ "different face", "face drift", "face mutation", "body distortion", "incorrect anatomy", "extra limbs", "blurred face", "plastic skin", "AI-generated look", "over-smoothing", "unreal lighting" ], "final_instruction": "The generated image must look like the SAME PERSON photographed again, not a variation or look-alike." 
} { "prompt": "Intimate full-body mirror selfie portrait of a young woman in bedroom, standing in front of large ornate gold-framed mirror, back turned slightly to show rear view while taking selfie with phone in one hand, long dark brown hair tied in high messy bun with loose strands, fair tan skin with natural glow and subtle texture, wearing ultra-sheer black lace lingerie set with intricate mesh panels, high-cut thong bottoms, garter straps connected to thigh-high stockings, thin spaghetti straps on bra top, silver metallic high-heel mules visible on feet, arched back and hips emphasized for seductive pose, one arm extended holding phone, other hand on mirror or hip, bedroom background with beige walls, white bed sheets, scattered books/candles, soft warm golden lamp light from side creating rim glow on body curves lace texture and hair, subtle blue LED accent from device, shallow depth of field, strong cinematic bokeh on mirror reflection and background, photorealistic sensual lingerie fashion selfie, high detail lace transparency skin pores hair strands garter straps heel shine and natural imperfections, shot on smartphone with slight wide-angle distortion typical for mirror shots, ultra detailed, 8k resolution", "negative_prompt": "cartoon, anime, illustration, painting, deformed, blurry, lowres, plastic skin, doll-like, heavy makeup overload, thick eyeliner, false lashes, filters, beauty filter, airbrushed skin, extra limbs, distorted proportions, asymmetrical face, harsh flash, cold lighting, day time, fully clothed, short hair, blonde hair, standing straight, no mirror, no phone, crowded room, text watermark, logo, ugly, bad anatomy, overexposed, underexposed, low contrast", "reference_image": { "enabled": true, "strength": 0.92, "description": "Extremely strong reference for exact composition and pose: young woman taking mirror selfie showing back/rear view, long dark hair in messy bun, sheer black lace lingerie with garter straps and high-cut bottoms, 
arched back emphasizing curves, phone in hand, ornate gold mirror frame, soft warm bedroom light with blue accent, seductive intimate vibe, high detail lace and skin texture" }, "style": "photorealistic intimate mirror selfie, sensual lingerie boudoir, warm golden bedroom lighting", "aspect_ratio": "3:4", "lighting": "soft warm golden lamp light from side, gentle rim light on back curves hair and lace, subtle blue LED fill from phone, realistic soft shadows and highlights", "camera": "smartphone mirror selfie (iPhone style), slight wide-angle distortion, natural low-key indoor light, minor grain for realism", "additional_details": [ "hair: dark brown-black, long, tied in high messy bun with loose flyaways and strands down back", "outfit: sheer black mesh lace bra and high-cut thong bottoms, garter straps with metal rings, thin straps, semi-transparent fabric clinging to body
EDIT PROMPT (CLEAR & STRICT): Replace only the bikini she is currently wearing with the provided reference bikini. The new bikini must fit her perfectly and tightly, matching the reference exactly in color, fabric, cut, straps, and details. Do NOT change anything else. 
- Preserve the same woman, same body, same proportions
- Keep the exact pose, angle, expression, lighting, background, and camera framing
- No changes to face, hair, skin tone, or body shape
- She remains default/slim as she already is (not fat, not obese, no body exaggeration)
This is a wardrobe-only replacement. Everything except the bikini stays 100% identical to the original image.
Negative instructions:
- No resizing or reshaping of the body
- No stylization or artistic changes
- No added accessories
- No changes to posture or anatomy
- No distortion or blur
Result: Photorealistic, natural-looking bikini replacement that appears originally worn by her, with realistic fabric tension and clean edges.
Ultra-realistic cinematic image or short video sequence (3–6 seconds). Reference A and B represent the SAME real human identity. Reference C is the target scene and must be matched with maximum physical and visual fidelity. This system performs: 1) identity preservation 2) angle alignment 3) pose cloning 4) scene reconstruction 5) video stability ================================================== PHASE 1 — ABSOLUTE IDENTITY LOCK (NON-NEGOTIABLE) ================================================== The face, hair, beard and identity from A+B MUST remain EXACT. Includes: - identical facial geometry - identical proportions and spatial relationships - identical asymmetry (eyes, nose, mouth, jawline) - identical skin texture, pores and imperfections - identical beard density and irregular edges - identical hairline, hairstyle and volume MANDATORY: - preserve all imperfections - preserve asymmetry exactly FORBIDDEN: - beautification - smoothing - symmetry correction - enhancement - facial reshaping Face must look like real photography. 
================================================== PHASE 2 — ANGLE + CAMERA ALIGNMENT (CRITICAL) ================================================== Before scene integration, align identity to match C: - match head tilt (left/right) - match vertical angle (up/down) - match face rotation - match gaze direction CAMERA MATCH: - match camera distance from C - match perspective compression - maintain natural focal length (35mm–85mm) MANDATORY: - keep facial proportions intact - avoid distortion FORBIDDEN: - stretching face - resizing head unnaturally - perspective mismatch ================================================== PHASE 3 — POSE CLONING (MAXIMUM ACCURACY) ================================================== Replicate pose from C: - same body position - same joint angles - same posture - same balance and weight distribution CRITICAL RULE: If pose conflicts with identity: → PRESERVE IDENTITY AND ADAPT BODY SLIGHTLY ================================================== PHASE 4 — GESTURE + PHYSICAL INTENT ================================================== - match body tension - match gesture intensity - match physical intention WITHOUT modifying face geometry. 
================================================== PHASE 5 — OBJECTS + ACCESSORIES (FULL CLONE) ================================================== - identical weapon or objects - identical placement, orientation and scale - identical clothing - identical gear - identical accessory placement ================================================== PHASE 6 — SCENE + BACKGROUND (FULL CLONE) ================================================== - identical environment - identical spatial layout - identical depth - identical framing and composition ================================================== PHASE 7 — LIGHTING STRUCTURE (CONTROLLED CLONE) ================================================== - match light direction from C - match shadow placement - match global contrast FACE PROTECTION: - preserve original skin tone from A+B - preserve undertone - avoid color cast on face - avoid orange/blue cinematic contamination Face remains neutral, natural, human. ================================================== PHASE 8 — OPTICAL CONSISTENCY (ANTI-DEFORMATION) ================================================== - maintain correct head-to-body proportion - maintain natural scale - maintain spatial coherence FORBIDDEN: - wide-angle distortion - head scaling errors - perspective mismatch ================================================== PHASE 9 — HANDS + CONTACT PHYSICS ================================================== - natural grip - correct finger wrapping - visible pressure - realistic interaction FORBIDDEN: - floating objects - broken anatomy ================================================== PHASE 10 — DERMAL REALISM (HUMAN QUALITY) ================================================== - visible pores - uneven texture - natural tonal variation - realistic beard transition FORBIDDEN: - plastic skin - over-smoothing - artificial sharpness ================================================== PHASE 11 — ENVIRONMENT INTEGRATION ================================================== - subject 
must feel physically present - correct depth and scale ALLOWED: - subtle dust/sweat on body RESTRICTED: - minimal effect on face - no change in facial tone ================================================== PHASE 12 — GLOBAL NO-DEFORMATION RULE ================================================== - do not stretch face - do not change proportions - do not distort head If conflict appears: → ALWAYS PRESERVE IDENTITY FIRST ================================================== PHASE 13 — VIDEO STABILITY (KLING 3.0 READY) ================================================== - stable identity across frames - stable skin tone - stable dermal texture - stable pose FORBIDDEN: - flicker - morphing - identity drift - lighting instability on face ================================================== FINAL RESULT ================================================== Same real human from A+B fully integrated into scene C. Identity preserved 100% (face, hair, beard locked) Angle aligned with scene Pose cloned with maximum accuracy Objects and environment matched Lighting structure matched without damaging skin Result must be: - human - natural - physically believable - visually coherent - stable for Kling 3.0
REALISTIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT ELEPHANT is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). The elephant is a large adult bull, anatomically accurate, massive scale and weight, rough wrinkled skin texture, visible tusks. 
Ears fully spread wide in an aggressive threat display, head lowered slightly, trunk tense or partially curled, body angled forward as if preparing to charge. No fantasy exaggeration — size and anatomy must be realistic. (NO hippo, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the elephant. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Elephant holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. COMPOSITION (CRITICAL — GLADIATOR FIRST): Low-angle heroic shot, camera near sand level. Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Elephant occupies the mid-ground, clearly readable but NOT closer than the man. Colosseum arches recognizable in the background. Strong depth separation, shallow-to-moderate DOF. Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Elephant realistically lit but slightly less emphasized than the man. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator, Subtle rim light outlining shoulders, helmet, arms. 
Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Elephant realistically lit but slightly less emphasized than the man.
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. 
It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration — realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. CAMERA / ANGLE (UPDATED — MORE CINEMATIC) Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face). Lens feel: 35mm full-frame, close enough for dramatic scale. Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him. Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet). 
Moderate DOF for separation; background arches and crowd remain soft. COMPOSITION (CRITICAL — GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhinoceros realistically lit but slightly less emphasized than the man.
Ultra-realistic cinematic image or short video sequence (3–6 seconds). Reference A and B represent the SAME real human identity. Reference C is the target scene and must be matched with maximum physical and visual fidelity. This system performs: 1) identity preservation 2) angle alignment 3) pose cloning 4) scene reconstruction 5) video stability ================================================== PHASE 1 — ABSOLUTE IDENTITY LOCK (NON-NEGOTIABLE) ================================================== The face, hair, beard and identity from A+B MUST remain EXACT. Includes: - identical facial geometry - identical proportions and spatial relationships - identical asymmetry (eyes, nose, mouth, jawline) - identical skin texture, pores and imperfections - identical beard density and irregular edges - identical hairline, hairstyle and volume MANDATORY: - preserve all imperfections - preserve asymmetry exactly FORBIDDEN: - beautification - smoothing - symmetry correction - enhancement - facial reshaping Face must look like real photography. 
================================================== PHASE 2 — ANGLE + CAMERA ALIGNMENT (CRITICAL) ================================================== Before scene integration, align identity to match C: - match head tilt (left/right) - match vertical angle (up/down) - match face rotation - match gaze direction CAMERA MATCH: - match camera distance from C - match perspective compression - maintain natural focal length (35mm–85mm) MANDATORY: - keep facial proportions intact - avoid distortion FORBIDDEN: - stretching face - resizing head unnaturally - perspective mismatch ================================================== PHASE 3 — POSE CLONING (MAXIMUM ACCURACY) ================================================== Replicate pose from C: - same body position - same joint angles - same posture - same balance and weight distribution CRITICAL RULE: If pose conflicts with identity: → PRESERVE IDENTITY AND ADAPT BODY SLIGHTLY ================================================== PHASE 4 — GESTURE + PHYSICAL INTENT ================================================== - match body tension - match gesture intensity - match physical intention WITHOUT modifying face geometry. 
================================================== PHASE 5 — OBJECTS + ACCESSORIES (FULL CLONE) ================================================== - identical weapon or objects - identical placement, orientation and scale - identical clothing - identical gear - identical accessory placement ================================================== PHASE 6 — SCENE + BACKGROUND (FULL CLONE) ================================================== - identical environment - identical spatial layout - identical depth - identical framing and composition ================================================== PHASE 7 — LIGHTING STRUCTURE (CONTROLLED CLONE) ================================================== - match light direction from C - match shadow placement - match global contrast FACE PROTECTION: - preserve original skin tone from A+B - preserve undertone - avoid color cast on face - avoid orange/blue cinematic contamination Face remains neutral, natural, human. ================================================== PHASE 8 — OPTICAL CONSISTENCY (ANTI-DEFORMATION) ================================================== - maintain correct head-to-body proportion - maintain natural scale - maintain spatial coherence FORBIDDEN: - wide-angle distortion - head scaling errors - perspective mismatch ================================================== PHASE 9 — HANDS + CONTACT PHYSICS ================================================== - natural grip - correct finger wrapping - visible pressure - realistic interaction FORBIDDEN: - floating objects - broken anatomy ================================================== PHASE 10 — DERMAL REALISM (HUMAN QUALITY) ================================================== - visible pores - uneven texture - natural tonal variation - realistic beard transition FORBIDDEN: - plastic skin - over-smoothing - artificial sharpness ================================================== PHASE 11 — ENVIRONMENT INTEGRATION ================================================== - subject 
must feel physically present - correct depth and scale ALLOWED: - subtle dust/sweat on body RESTRICTED: - minimal effect on face - no change in facial tone ================================================== PHASE 12 — GLOBAL NO-DEFORMATION RULE ================================================== - do not stretch face - do not change proportions - do not distort head If conflict appears: → ALWAYS PRESERVE IDENTITY FIRST ================================================== PHASE 13 — VIDEO STABILITY (KLING 3.0 READY) ================================================== - stable identity across frames - stable skin tone - stable dermal texture - stable pose FORBIDDEN: - flicker - morphing - identity drift - lighting instability on face ================================================== FINAL RESULT ================================================== Same real human from A+B fully integrated into scene C. Identity preserved 100% (face, hair, beard locked) Angle aligned with scene Pose cloned with maximum accuracy Objects and environment matched Lighting structure matched without damaging skin Result must be: - human - natural - physically believable - visually coherent - stable for Kling 3.0
REALISTIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT ELEPHANT is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). The elephant is a large adult bull, anatomically accurate, massive scale and weight, rough wrinkled skin texture, visible tusks. 
Ears fully spread wide in an aggressive threat display, head lowered slightly, trunk tense or partially curled, body angled forward as if preparing to charge. No fantasy exaggeration — size and anatomy must be realistic. (NO hippo, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the elephant. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Elephant holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. COMPOSITION (CRITICAL — GLADIATOR FIRST): Low-angle heroic shot, camera near sand level. Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Elephant occupies the mid-ground, clearly readable but NOT closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation, shallow-to-moderate DOF. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms. 
Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Elephant realistically lit but slightly less emphasized than the man.
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. A KOMODO DRAGON (Varanus komodoensis) is present on the left side of frame, facing the gladiator, placed farther from the lens than the gladiator (secondary subject). The Komodo dragon looks powerful and real: thick muscular body, rough scaled skin, heavy tail, strong legs, low stalking posture, forked tongue visible. 
Keep its size believable (large adult), with accurate anatomy and texture. (NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact”. The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the Komodo dragon. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face (cheekbone, nose line, jaw, lips, eye, brow) — fully visible, sharply defined, and perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. His facial expression must be readable: concentrated, controlled, predatory calm. Body/action: Gladiator leaning forward, shield raised defensively, sword low and ready, muscles and chest hair clearly detailed. Komodo dragon advancing slowly or pausing mid-step, tongue flicking, dust disturbed by its claws, tension high. NO blood, NO wounds, NO biting, NO visible injuries, NO aftermath. Suspense and power, not violence. COMPOSITION (CRITICAL — GLADIATOR FIRST): Low-angle heroic shot, camera near sand level. Gladiator dominates the foreground and frame (waist-up or 3/4 body), with face and torso fully visible, not cut off. Komodo dragon occupies the left mid-ground (clearly visible but NOT closer than the man). Amphitheater arches clearly recognizable in the background. Clean horizon, strong depth separation (shallow-to-moderate DOF). Keep the gladiator’s face and chest in tack-sharp focus; keep the Komodo dragon recognizable and detailed. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Perfect, professional illumination on the gladiator: Warm golden sunlight as key light (or strong soft key from sun direction), PLUS soft fill light to remove harsh shadows on face and torso. Subtle rim light outlining shoulders/helmet/arms. Every muscle, chest hair texture, skin pores, and facial details are clearly visible. 
Komodo dragon and background may be slightly less bright; priority is the gladiator. No underexposure on the man. No blown highlights. CAMERA / LENS: Full-frame look, 35mm or 50mm (epic scale, minimal distortion). Fast shutter feel, subject sharp, dust provides motion cues only. Realistic bokeh, controlled contrast, filmic color grading. No HDR, no neon, no fantasy glow. QUALITY CONTROL: Photorealistic, historical-epic campaign finish. No artifacts, no warped anatomy, no extra limbs, no deformed hands. No random logos, no watermarks, no text. NEGATIVE PROMPT: lion, bear, dragon fantasy, dinosaur, crocodile, snake, extra animals, multiple komodo dragons, cartoon komodo, blood, gore, injury, wounds, biting, mauling, dead animal, dead body, graphic violence, dismemberment, low resolution, blurry, plastic skin, over-smoothed, uncanny valley, extra fingers, extra limbs, warped anatomy, cartoon, illustration, CGI look, fantasy lighting, neon, oversaturation, harsh HDR, random text, misspelled words, logos, watermarks
Ultra-realistic cinematic image or short video sequence (3–6 seconds).
Reference A and B represent the SAME real human identity.
Reference C is the target scene and must be matched with maximum physical and visual fidelity.

This system performs:
1) identity preservation
2) angle alignment
3) pose cloning
4) scene reconstruction
5) video stability

==================================================
PHASE 1 — ABSOLUTE IDENTITY LOCK (NON-NEGOTIABLE)
==================================================
The face, hair, beard and identity from A+B MUST remain EXACT.
Includes:
- identical facial geometry
- identical proportions and spatial relationships
- identical asymmetry (eyes, nose, mouth, jawline)
- identical skin texture, pores and imperfections
- identical beard density and irregular edges
- identical hairline, hairstyle and volume
MANDATORY:
- preserve all imperfections
- preserve asymmetry exactly
FORBIDDEN:
- beautification
- smoothing
- symmetry correction
- enhancement
- facial reshaping
Face must look like real photography.

==================================================
PHASE 2 — ANGLE + CAMERA ALIGNMENT (CRITICAL)
==================================================
Before scene integration, align identity to match C:
- match head tilt (left/right)
- match vertical angle (up/down)
- match face rotation
- match gaze direction
CAMERA MATCH:
- match camera distance from C
- match perspective compression
- maintain natural focal length (35mm–85mm)
MANDATORY:
- keep facial proportions intact
- avoid distortion
FORBIDDEN:
- stretching face
- resizing head unnaturally
- perspective mismatch

==================================================
PHASE 3 — POSE CLONING (MAXIMUM ACCURACY)
==================================================
Replicate pose from C:
- same body position
- same joint angles
- same posture
- same balance and weight distribution
CRITICAL RULE:
If pose conflicts with identity:
→ PRESERVE IDENTITY AND ADAPT BODY SLIGHTLY

==================================================
PHASE 4 — GESTURE + PHYSICAL INTENT
==================================================
- match body tension
- match gesture intensity
- match physical intention
WITHOUT modifying face geometry.

==================================================
PHASE 5 — OBJECTS + ACCESSORIES (FULL CLONE)
==================================================
- identical weapon or objects
- identical placement, orientation and scale
- identical clothing
- identical gear
- identical accessory placement

==================================================
PHASE 6 — SCENE + BACKGROUND (FULL CLONE)
==================================================
- identical environment
- identical spatial layout
- identical depth
- identical framing and composition

==================================================
PHASE 7 — LIGHTING STRUCTURE (CONTROLLED CLONE)
==================================================
- match light direction from C
- match shadow placement
- match global contrast
FACE PROTECTION:
- preserve original skin tone from A+B
- preserve undertone
- avoid color cast on face
- avoid orange/blue cinematic contamination
Face remains neutral, natural, human.

==================================================
PHASE 8 — OPTICAL CONSISTENCY (ANTI-DEFORMATION)
==================================================
- maintain correct head-to-body proportion
- maintain natural scale
- maintain spatial coherence
FORBIDDEN:
- wide-angle distortion
- head scaling errors
- perspective mismatch

==================================================
PHASE 9 — HANDS + CONTACT PHYSICS
==================================================
- natural grip
- correct finger wrapping
- visible pressure
- realistic interaction
FORBIDDEN:
- floating objects
- broken anatomy

==================================================
PHASE 10 — DERMAL REALISM (HUMAN QUALITY)
==================================================
- visible pores
- uneven texture
- natural tonal variation
- realistic beard transition
FORBIDDEN:
- plastic skin
- over-smoothing
- artificial sharpness

==================================================
PHASE 11 — ENVIRONMENT INTEGRATION
==================================================
- subject must feel physically present
- correct depth and scale
ALLOWED:
- subtle dust/sweat on body
RESTRICTED:
- minimal effect on face
- no change in facial tone

==================================================
PHASE 12 — GLOBAL NO-DEFORMATION RULE
==================================================
- do not stretch face
- do not change proportions
- do not distort head
If conflict appears:
→ ALWAYS PRESERVE IDENTITY FIRST

==================================================
PHASE 13 — VIDEO STABILITY (KLING 3.0 READY)
==================================================
- stable identity across frames
- stable skin tone
- stable dermal texture
- stable pose
FORBIDDEN:
- flicker
- morphing
- identity drift
- lighting instability on face

==================================================
FINAL RESULT
==================================================
Same real human from A+B fully integrated into scene C.
Identity preserved 100% (face, hair, beard locked)
Angle aligned with scene
Pose cloned with maximum accuracy
Objects and environment matched
Lighting structure matched without damaging skin
Result must be:
- human
- natural
- physically believable
- visually coherent
- stable for Kling 3.0
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization.
REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity.
SUBJECT (GLADIATOR, MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail.
Wardrobe: Roman gladiator elements: bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword.
Expression: fierce, commanding; controlled roar or intense focus.
SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration: realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.)
ACTION (CRITICAL: INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera; he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal.
Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face: cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm.
Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence.
CAMERA / ANGLE (UPDATED: MORE CINEMATIC): Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face). Lens feel: 35mm full-frame, close enough for dramatic scale. Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him. Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet). Moderate DOF for separation; background arches and crowd remain soft.
COMPOSITION (CRITICAL: GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation.
LIGHTING (CRITICAL: PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator; subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhino realistically lit but slightly less emphasized than the man.
Ultra-realistic cinematic image or short video sequence (3–6 seconds).
Reference A and B represent the SAME real human identity.
Reference C is the target scene and must be matched with maximum physical and visual fidelity.
This system performs:
1) identity preservation
2) angle alignment
3) pose cloning
4) scene reconstruction
5) video stability
==================================================
PHASE 1: ABSOLUTE IDENTITY LOCK (NON-NEGOTIABLE)
==================================================
The face, hair, beard and identity from A+B MUST remain EXACT.
Includes:
- identical facial geometry
- identical proportions and spatial relationships
- identical asymmetry (eyes, nose, mouth, jawline)
- identical skin texture, pores and imperfections
- identical beard density and irregular edges
- identical hairline, hairstyle and volume
MANDATORY:
- preserve all imperfections
- preserve asymmetry exactly
FORBIDDEN:
- beautification
- smoothing
- symmetry correction
- enhancement
- facial reshaping
Face must look like real photography.
==================================================
PHASE 2: ANGLE + CAMERA ALIGNMENT (CRITICAL)
==================================================
Before scene integration, align identity to match C:
- match head tilt (left/right)
- match vertical angle (up/down)
- match face rotation
- match gaze direction
CAMERA MATCH:
- match camera distance from C
- match perspective compression
- maintain natural focal length (35mm–85mm)
MANDATORY:
- keep facial proportions intact
- avoid distortion
FORBIDDEN:
- stretching face
- resizing head unnaturally
- perspective mismatch
==================================================
PHASE 3: POSE CLONING (MAXIMUM ACCURACY)
==================================================
Replicate pose from C:
- same body position
- same joint angles
- same posture
- same balance and weight distribution
CRITICAL RULE:
If pose conflicts with identity:
→ PRESERVE IDENTITY AND ADAPT BODY SLIGHTLY
==================================================
PHASE 4: GESTURE + PHYSICAL INTENT
==================================================
- match body tension
- match gesture intensity
- match physical intention
WITHOUT modifying face geometry.
==================================================
PHASE 5: OBJECTS + ACCESSORIES (FULL CLONE)
==================================================
- identical weapon or objects
- identical placement, orientation and scale
- identical clothing
- identical gear
- identical accessory placement
==================================================
PHASE 6: SCENE + BACKGROUND (FULL CLONE)
==================================================
- identical environment
- identical spatial layout
- identical depth
- identical framing and composition
==================================================
PHASE 7: LIGHTING STRUCTURE (CONTROLLED CLONE)
==================================================
- match light direction from C
- match shadow placement
- match global contrast
FACE PROTECTION:
- preserve original skin tone from A+B
- preserve undertone
- avoid color cast on face
- avoid orange/blue cinematic contamination
Face remains neutral, natural, human.
==================================================
PHASE 8: OPTICAL CONSISTENCY (ANTI-DEFORMATION)
==================================================
- maintain correct head-to-body proportion
- maintain natural scale
- maintain spatial coherence
FORBIDDEN:
- wide-angle distortion
- head scaling errors
- perspective mismatch
==================================================
PHASE 9: HANDS + CONTACT PHYSICS
==================================================
- natural grip
- correct finger wrapping
- visible pressure
- realistic interaction
FORBIDDEN:
- floating objects
- broken anatomy
==================================================
PHASE 10: DERMAL REALISM (HUMAN QUALITY)
==================================================
- visible pores
- uneven texture
- natural tonal variation
- realistic beard transition
FORBIDDEN:
- plastic skin
- over-smoothing
- artificial sharpness
==================================================
PHASE 11: ENVIRONMENT INTEGRATION
==================================================
- subject must feel physically present
- correct depth and scale
ALLOWED:
- subtle dust/sweat on body
RESTRICTED:
- minimal effect on face
- no change in facial tone
==================================================
PHASE 12: GLOBAL NO-DEFORMATION RULE
==================================================
- do not stretch face
- do not change proportions
- do not distort head
If conflict appears:
→ ALWAYS PRESERVE IDENTITY FIRST
==================================================
PHASE 13: VIDEO STABILITY (KLING 3.0 READY)
==================================================
- stable identity across frames
- stable skin tone
- stable dermal texture
- stable pose
FORBIDDEN:
- flicker
- morphing
- identity drift
- lighting instability on face
==================================================
FINAL RESULT
==================================================
Same real human from A+B fully integrated into scene C.
Identity preserved 100% (face, hair, beard locked)
Angle aligned with scene
Pose cloned with maximum accuracy
Objects and environment matched
Lighting structure matched without damaging skin
Result must be:
- human
- natural
- physically believable
- visually coherent
- stable for Kling 3.0
Ultra-realistic cinematic image or short video sequence (3–6 seconds). Reference A and B represent the SAME real human identity. Reference C is the target scene and must be matched with maximum physical and visual fidelity. This system performs: 1) identity preservation 2) angle alignment 3) pose cloning 4) scene reconstruction 5) video stability ================================================== PHASE 1 — ABSOLUTE IDENTITY LOCK (NON-NEGOTIABLE) ================================================== The face, hair, beard and identity from A+B MUST remain EXACT. Includes: - identical facial geometry - identical proportions and spatial relationships - identical asymmetry (eyes, nose, mouth, jawline) - identical skin texture, pores and imperfections - identical beard density and irregular edges - identical hairline, hairstyle and volume MANDATORY: - preserve all imperfections - preserve asymmetry exactly FORBIDDEN: - beautification - smoothing - symmetry correction - enhancement - facial reshaping Face must look like real photography. 
================================================== PHASE 2 — ANGLE + CAMERA ALIGNMENT (CRITICAL) ================================================== Before scene integration, align identity to match C: - match head tilt (left/right) - match vertical angle (up/down) - match face rotation - match gaze direction CAMERA MATCH: - match camera distance from C - match perspective compression - maintain natural focal length (35mm–85mm) MANDATORY: - keep facial proportions intact - avoid distortion FORBIDDEN: - stretching face - resizing head unnaturally - perspective mismatch ================================================== PHASE 3 — POSE CLONING (MAXIMUM ACCURACY) ================================================== Replicate pose from C: - same body position - same joint angles - same posture - same balance and weight distribution CRITICAL RULE: If pose conflicts with identity: → PRESERVE IDENTITY AND ADAPT BODY SLIGHTLY ================================================== PHASE 4 — GESTURE + PHYSICAL INTENT ================================================== - match body tension - match gesture intensity - match physical intention WITHOUT modifying face geometry. 
================================================== PHASE 5 — OBJECTS + ACCESSORIES (FULL CLONE) ================================================== - identical weapon or objects - identical placement, orientation and scale - identical clothing - identical gear - identical accessory placement ================================================== PHASE 6 — SCENE + BACKGROUND (FULL CLONE) ================================================== - identical environment - identical spatial layout - identical depth - identical framing and composition ================================================== PHASE 7 — LIGHTING STRUCTURE (CONTROLLED CLONE) ================================================== - match light direction from C - match shadow placement - match global contrast FACE PROTECTION: - preserve original skin tone from A+B - preserve undertone - avoid color cast on face - avoid orange/blue cinematic contamination Face remains neutral, natural, human. ================================================== PHASE 8 — OPTICAL CONSISTENCY (ANTI-DEFORMATION) ================================================== - maintain correct head-to-body proportion - maintain natural scale - maintain spatial coherence FORBIDDEN: - wide-angle distortion - head scaling errors - perspective mismatch ================================================== PHASE 9 — HANDS + CONTACT PHYSICS ================================================== - natural grip - correct finger wrapping - visible pressure - realistic interaction FORBIDDEN: - floating objects - broken anatomy ================================================== PHASE 10 — DERMAL REALISM (HUMAN QUALITY) ================================================== - visible pores - uneven texture - natural tonal variation - realistic beard transition FORBIDDEN: - plastic skin - over-smoothing - artificial sharpness ================================================== PHASE 11 — ENVIRONMENT INTEGRATION ================================================== - subject 
must feel physically present - correct depth and scale ALLOWED: - subtle dust/sweat on body RESTRICTED: - minimal effect on face - no change in facial tone ================================================== PHASE 12 — GLOBAL NO-DEFORMATION RULE ================================================== - do not stretch face - do not change proportions - do not distort head If conflict appears: → ALWAYS PRESERVE IDENTITY FIRST ================================================== PHASE 13 — VIDEO STABILITY (KLING 3.0 READY) ================================================== - stable identity across frames - stable skin tone - stable dermal texture - stable pose FORBIDDEN: - flicker - morphing - identity drift - lighting instability on face ================================================== FINAL RESULT ================================================== Same real human from A+B fully integrated into scene C. Identity preserved 100% (face, hair, beard locked) Angle aligned with scene Pose cloned with maximum accuracy Objects and environment matched Lighting structure matched without damaging skin Result must be: - human - natural - physically believable - visually coherent - stable for Kling 3.0
REALISTIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT ELEPHANT is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). The elephant is a large adult bull, anatomically accurate, massive scale and weight, rough wrinkled skin texture, visible tusks. 
Ears fully spread wide in an aggressive threat display, head lowered slightly, trunk tense or partially curled, body angled forward as if preparing to charge. No fantasy exaggeration — size and anatomy must be realistic. (NO hippo, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the elephant. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Elephant holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. COMPOSITION (CRITICAL — GLADIATOR FIRST): Low-angle heroic shot, camera near sand level. Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Elephant occupies the mid-ground, clearly readable but NOT closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation, shallow-to-moderate DOF. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms. 
Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Elephant realistically lit but slightly less emphasized than the man.
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. 
It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration — realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. CAMERA / ANGLE (UPDATED — MORE CINEMATIC) Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face). Lens feel: 35mm full-frame, close enough for dramatic scale. Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him. Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet). 
Moderate DOF for separation; background arches and crowd remain soft. COMPOSITION (CRITICAL — GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhino realistically lit but slightly less emphasized than the man.
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. A KOMODO DRAGON (Varanus komodoensis) is present on the left side of frame, facing the gladiator, placed farther from the lens than the gladiator (secondary subject). The Komodo dragon looks powerful and real: thick muscular body, rough scaled skin, heavy tail, strong legs, low stalking posture, forked tongue visible. 
Keep its size believable (large adult), with accurate anatomy and texture. (NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact”. The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the Komodo dragon. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face (cheekbone, nose line, jaw, lips, eye, brow) — fully visible, sharply defined, and perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. His facial expression must be readable: concentrated, controlled, predatory calm. Body/action: Gladiator leaning forward, shield raised defensively, sword low and ready, muscles and chest hair clearly detailed. Komodo dragon advancing slowly or pausing mid-step, tongue flicking, dust disturbed by its claws, tension high. NO blood, NO wounds, NO biting, NO visible injuries, NO aftermath. Suspense and power, not violence. COMPOSITION (CRITICAL — GLADIATOR FIRST): Low-angle heroic shot, camera near sand level. Gladiator dominates the foreground and frame (waist-up or 3/4 body), with face and torso fully visible, not cut off. Komodo dragon occupies the left mid-ground (clearly visible but NOT closer than the man). Amphitheater arches clearly recognizable in the background. Clean horizon, strong depth separation (shallow-to-moderate DOF). Keep the gladiator’s face and chest in tack-sharp focus; keep the Komodo dragon recognizable and detailed. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Perfect, professional illumination on the gladiator: Warm golden sunlight as key light (or strong soft key from sun direction), PLUS soft fill light to remove harsh shadows on face and torso. Subtle rim light outlining shoulders/helmet/arms. Every muscle, chest hair texture, skin pores, and facial details are clearly visible. 
Komodo dragon and background may be slightly less bright; priority is the gladiator. No underexposure on the man. No blown highlights. CAMERA / LENS: Full-frame look, 35mm or 50mm (epic scale, minimal distortion). Fast shutter feel, subject sharp, dust provides motion cues only. Realistic bokeh, controlled contrast, filmic color grading. No HDR, no neon, no fantasy glow. QUALITY CONTROL: Photorealistic, historical-epic campaign finish. No artifacts, no warped anatomy, no extra limbs, no deformed hands. No random logos, no watermarks, no text. NEGATIVE PROMPT: lion, bear, dragon fantasy, dinosaur, crocodile, snake, extra animals, multiple komodo dragons, cartoon komodo, blood, gore, injury, wounds, biting, mauling, dead animal, dead body, graphic violence, dismemberment, low resolution, blurry, plastic skin, over-smoothed, uncanny valley, extra fingers, extra limbs, warped anatomy, cartoon, illustration, CGI look, fantasy lighting, neon, oversaturation, harsh HDR, random text, misspelled words, logos, watermarks
{ "reference_source": "Use the uploaded reference image as the single source of truth", "identity_lock": { "face": "100% identical to reference image — same facial structure, eyes, nose, lips, jawline, skin texture, expression", "body": "Same body proportions, height, posture, shoulder width, waist, hips, limbs", "hair": "Same hairstyle, hairline, length, color, volume", "age": "Exact same apparent age as reference", "ethnicity": "Unchanged from reference" }, "consistency_rules": [ "Do NOT modify face shape", "Do NOT beautify, stylize, or alter facial features", "No AI face swap artifacts", "No face blending with other identities", "No body reshaping", "No weight gain or loss", "No added or removed tattoos, scars, or marks" ], "wardrobe": { "instruction": "Only change clothing if explicitly specified", "fit": "Clothing fits naturally and accurately to the same body", "fabric_behavior": "Realistic folds, tension, and shadows" }, "pose_and_scene": { "pose": "Preserve the same pose unless explicitly changed", "camera_angle": "Same camera height, distance, 
and lens perspective", "lighting": "Match original lighting direction, softness, and intensity", "environment": "Realistic environment with consistent scale and depth" }, "image_quality": { "realism": "Photorealistic, natural skin texture", "clarity": "Sharp focus on face and eyes", "resolution": "High-definition, clean details", "artifacts": "No blur, distortion, double face, broken anatomy" }, "negative_prompt": [ "different face", "face drift", "face mutation", "body distortion", "incorrect anatomy", "extra limbs", "blurred face", "plastic skin", "AI-generated look", "over-smoothing", "unreal lighting" ], "final_instruction": "The generated image must look like the SAME PERSON photographed again, not a variation or look-alike." } { "prompt": "Intimate full-body mirror selfie portrait of a young woman in bedroom, standing in front of large ornate gold-framed mirror, back turned slightly to show rear view while taking selfie with phone in one hand, long dark brown hair tied in high messy bun with loose strands, fair tan skin with natural glow and subtle texture, wearing ultra-sheer black lace lingerie set with intricate mesh panels, high-cut thong bottoms, garter straps connected to thigh-high stockings, thin spaghetti straps on bra top, silver metallic high-heel mules visible on feet, arched back and hips emphasized for seductive pose, one arm extended holding phone, other hand on mirror or hip, bedroom background with beige walls, white bed sheets, scattered books/candles, soft warm golden lamp light from side creating rim glow on body curves lace texture and hair, subtle blue LED accent from device, shallow depth of field, strong cinematic bokeh on mirror reflection and background, photorealistic sensual lingerie fashion selfie, high detail lace transparency skin pores hair strands garter straps heel shine and natural imperfections, shot on smartphone with slight wide-angle distortion typical for mirror shots, ultra detailed, 8k resolution", "negative_prompt": 
"cartoon, anime, illustration, painting, deformed, blurry, lowres, plastic skin, doll-like, heavy makeup overload, thick eyeliner, false lashes, filters, beauty filter, airbrushed skin, extra limbs, distorted proportions, asymmetrical face, harsh flash, cold lighting, day time, fully clothed, short hair, blonde hair, standing straight, no mirror, no phone, crowded room, text watermark, logo, ugly, bad anatomy, overexposed, underexposed, low contrast", "reference_image": { "enabled": true, "strength": 0.92, "description": "Extremely strong reference for exact composition and pose: young woman taking mirror selfie showing back/rear view, long dark hair in messy bun, sheer black lace lingerie with garter straps and high-cut bottoms, arched back emphasizing curves, phone in hand, ornate gold mirror frame, soft warm bedroom light with blue accent, seductive intimate vibe, high detail lace and skin texture" }, "style": "photorealistic intimate mirror selfie, sensual lingerie boudoir, warm golden bedroom lighting", "aspect_ratio": "3:4", "lighting": "soft warm golden lamp light from side, gentle rim light on back curves hair and lace, subtle blue LED fill from phone, realistic soft shadows and highlights", "camera": "smartphone mirror selfie (iPhone style), slight wide-angle distortion, natural low-key indoor light, minor grain for realism", "additional_details": [ "hair: dark brown-black, long, tied in high messy bun with loose flyaways and strands down back", "outfit: sheer black mesh lace bra and high-cut thong bottoms, garter straps with metal rings, thin straps, semi-transparent fabric clinging to body  EDIT PROMPT (CLEAR & STRICT): Replace only the bikini she is currently wearing with the provided reference bikini. The new bikini must fit her perfectly and tightly, matching the reference exactly in color, fabric, cut, straps, and details. Do NOT change anything else. 
Preserve the same woman, same body, and same proportions. Keep the exact pose, angle, expression, lighting, background, and camera framing. Make no changes to face, hair, skin tone, or body shape. She remains default/slim as she already is (not fat, not obese, no body exaggeration). This is a wardrobe-only replacement. Everything except the bikini stays 100% identical to the original image. Negative instructions: no resizing or reshaping of the body; no stylization or artistic changes; no added accessories; no changes to posture or anatomy; no distortion or blur. Result: photorealistic, natural-looking bikini replacement that appears originally worn by her, with realistic fabric tension and clean edges.
REALISTIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT ELEPHANT is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). The elephant is a large adult bull, anatomically accurate, massive scale and weight, rough wrinkled skin texture, visible tusks. 
Ears fully spread wide in an aggressive threat display, head lowered slightly, trunk tense or partially curled, body angled forward as if preparing to charge. No fantasy exaggeration — size and anatomy must be realistic. (NO hippo, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the elephant. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Elephant holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. COMPOSITION (CRITICAL — GLADIATOR FIRST): Low-angle heroic shot, camera near sand level. Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Elephant occupies the mid-ground, clearly readable but NOT closer than the man. Colosseum arches recognizable in the background. Strong depth separation, shallow-to-moderate DOF. Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Elephant realistically lit but slightly less emphasized than the man. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator, Subtle rim light outlining shoulders, helmet, arms. 
Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Elephant realistically lit but slightly less emphasized than the man.
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. 
It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration — realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. CAMERA / ANGLE (UPDATED — MORE CINEMATIC) Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face). Lens feel: 35mm full-frame, close enough for dramatic scale. Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him. Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet). 
Moderate DOF for separation; background arches and crowd remain soft. COMPOSITION (CRITICAL — GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator, Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhino realistically lit but slightly less emphasized than the man.
{ "project_meta": { "task_name": "Hyper-realistic Screen Simulation", "target_model": "SDXL_1.0_Refiner", "aspect_ratio": "3:4", "resolution": { "width": 1152, "height": 1536 } }, "scene_composition": { "layer_1_physical_environment": { "camera_angle": "High-angle, downward shot (POV)", "subject_anchor": "MacBook Screen (95% fill)", "foreground_element": "Thin strip of physical keyboard visible at bottom edge", "surface_imperfections": [ "Visible RGB pixel-grid texture (Moiré effect)", "Micro-dust particles on glass surface", "Faint ambient light reflections (glossy finish)", "Subtle fingerprint smudges" ] }, "layer_2_digital_interface": { "os_theme": "macOS Dark Mode", "active_window": { "app_name": "Photo Booth", "status": "Live Preview Mode", "position": "Dominant/Center-Left" } }, "layer_3_nested_content": { "location": "Inside the Photo Booth window", "setting": { "room": "Dim bedroom", "background": "Off-white wall, rumpled bedding", "lighting": "Mixed lighting (Cool blue screen glow + Warm skin tones), deep nocturnal shadows" }, "subject_details": { "identity_source": "uploaded_female_reference_image", "demographics": "Young adult female", "appearance": { "apparel": "Black top, grey bottoms", "expression": "Relaxed, candid, slight smile", "pose": "Reclining/Lying down, looking at screen" }, "props": { "item": "iPhone 15 Pro", "hand_position": "Held in right hand" } } } }, "generation_parameters": { "prompts": { "positive": "Hyper-realistic downward shot of a MacBook screen. The screen surface has visible dust, pixel grid, and reflection. The screen displays a macOS desktop in dark mode with a dominant Photo Booth live-preview window. Inside the window: A girl in a dark bedroom with an off-white wall and rumpled bedding. The girl is lying down, wearing a black top and grey bottoms, holding an iPhone 15 Pro in her right hand. Her face is fully visible (reference matched). 
Lighting is low-key, candid, nocturnal, with bluish screen glow mixed with warm skin tones. High fidelity, raw photo, unedited, natural noise.", "negative": "vector art, screenshot, flat digital image, clean glass, perfect screen, daylight, bright studio lights, cartoon, 3d render, painting, watermark, deformed hands, blurry face" }, "sampling_settings": { "sampler": "DPM++ 3M SDE Exponential", "steps": 40, "cfg_scale": 5.5, "denoising_strength": 0.35 } }, "control_net_configuration": { "identity_strictness": "CRITICAL", "stack": [ { "unit": "ControlNet_Tile", "weight": 0.4, "purpose": "Maintain text/interface sharpness and screen texture" }, { "unit": "IP-Adapter_FaceID_Plus", "weight": 0.95, "region_mask": "Photo Booth Window Area Only", "source_image": "uploaded_female_reference_image" } ] } }
Ultra-realistic cinematic image or short video sequence (3–6 seconds). Reference A and B represent the SAME real human identity. Reference C is the target scene and must be matched with maximum physical and visual fidelity. This system performs: 1) identity preservation 2) angle alignment 3) pose cloning 4) scene reconstruction 5) video stability ================================================== PHASE 1 — ABSOLUTE IDENTITY LOCK (NON-NEGOTIABLE) ================================================== The face, hair, beard and identity from A+B MUST remain EXACT. Includes: - identical facial geometry - identical proportions and spatial relationships - identical asymmetry (eyes, nose, mouth, jawline) - identical skin texture, pores and imperfections - identical beard density and irregular edges - identical hairline, hairstyle and volume MANDATORY: - preserve all imperfections - preserve asymmetry exactly FORBIDDEN: - beautification - smoothing - symmetry correction - enhancement - facial reshaping Face must look like real photography. 
================================================== PHASE 2 — ANGLE + CAMERA ALIGNMENT (CRITICAL) ================================================== Before scene integration, align identity to match C: - match head tilt (left/right) - match vertical angle (up/down) - match face rotation - match gaze direction CAMERA MATCH: - match camera distance from C - match perspective compression - maintain natural focal length (35mm–85mm) MANDATORY: - keep facial proportions intact - avoid distortion FORBIDDEN: - stretching face - resizing head unnaturally - perspective mismatch ================================================== PHASE 3 — POSE CLONING (MAXIMUM ACCURACY) ================================================== Replicate pose from C: - same body position - same joint angles - same posture - same balance and weight distribution CRITICAL RULE: If pose conflicts with identity: → PRESERVE IDENTITY AND ADAPT BODY SLIGHTLY ================================================== PHASE 4 — GESTURE + PHYSICAL INTENT ================================================== - match body tension - match gesture intensity - match physical intention WITHOUT modifying face geometry. 
================================================== PHASE 5 — OBJECTS + ACCESSORIES (FULL CLONE) ================================================== - identical weapon or objects - identical placement, orientation and scale - identical clothing - identical gear - identical accessory placement ================================================== PHASE 6 — SCENE + BACKGROUND (FULL CLONE) ================================================== - identical environment - identical spatial layout - identical depth - identical framing and composition ================================================== PHASE 7 — LIGHTING STRUCTURE (CONTROLLED CLONE) ================================================== - match light direction from C - match shadow placement - match global contrast FACE PROTECTION: - preserve original skin tone from A+B - preserve undertone - avoid color cast on face - avoid orange/blue cinematic contamination Face remains neutral, natural, human. ================================================== PHASE 8 — OPTICAL CONSISTENCY (ANTI-DEFORMATION) ================================================== - maintain correct head-to-body proportion - maintain natural scale - maintain spatial coherence FORBIDDEN: - wide-angle distortion - head scaling errors - perspective mismatch ================================================== PHASE 9 — HANDS + CONTACT PHYSICS ================================================== - natural grip - correct finger wrapping - visible pressure - realistic interaction FORBIDDEN: - floating objects - broken anatomy ================================================== PHASE 10 — DERMAL REALISM (HUMAN QUALITY) ================================================== - visible pores - uneven texture - natural tonal variation - realistic beard transition FORBIDDEN: - plastic skin - over-smoothing - artificial sharpness ================================================== PHASE 11 — ENVIRONMENT INTEGRATION ================================================== - subject 
must feel physically present - correct depth and scale ALLOWED: - subtle dust/sweat on body RESTRICTED: - minimal effect on face - no change in facial tone ================================================== PHASE 12 — GLOBAL NO-DEFORMATION RULE ================================================== - do not stretch face - do not change proportions - do not distort head If conflict appears: → ALWAYS PRESERVE IDENTITY FIRST ================================================== PHASE 13 — VIDEO STABILITY (KLING 3.0 READY) ================================================== - stable identity across frames - stable skin tone - stable dermal texture - stable pose FORBIDDEN: - flicker - morphing - identity drift - lighting instability on face ================================================== FINAL RESULT ================================================== Same real human from A+B fully integrated into scene C. Identity preserved 100% (face, hair, beard locked) Angle aligned with scene Pose cloned with maximum accuracy Objects and environment matched Lighting structure matched without damaging skin Result must be: - human - natural - physically believable - visually coherent - stable for Kling 3.0
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. 
It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration — realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. CAMERA / ANGLE (UPDATED — MORE CINEMATIC) Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face). Lens feel: 35mm full-frame, close enough for dramatic scale. Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him. Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet). 
Moderate DOF for separation; background arches and crowd remain soft. COMPOSITION (CRITICAL — GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhino realistically lit but slightly less emphasized than the man.
Ultra-realistic cinematic image or short video sequence (3–6 seconds). Reference A and B represent the SAME real human identity. Reference C is the target scene and must be matched with maximum physical and visual fidelity. This system performs: 1) identity preservation 2) angle alignment 3) pose cloning 4) scene reconstruction 5) video stability ================================================== PHASE 1 — ABSOLUTE IDENTITY LOCK (NON-NEGOTIABLE) ================================================== The face, hair, beard and identity from A+B MUST remain EXACT. Includes: - identical facial geometry - identical proportions and spatial relationships - identical asymmetry (eyes, nose, mouth, jawline) - identical skin texture, pores and imperfections - identical beard density and irregular edges - identical hairline, hairstyle and volume MANDATORY: - preserve all imperfections - preserve asymmetry exactly FORBIDDEN: - beautification - smoothing - symmetry correction - enhancement - facial reshaping Face must look like real photography. 
================================================== PHASE 2 — ANGLE + CAMERA ALIGNMENT (CRITICAL) ================================================== Before scene integration, align identity to match C: - match head tilt (left/right) - match vertical angle (up/down) - match face rotation - match gaze direction CAMERA MATCH: - match camera distance from C - match perspective compression - maintain natural focal length (35mm–85mm) MANDATORY: - keep facial proportions intact - avoid distortion FORBIDDEN: - stretching face - resizing head unnaturally - perspective mismatch ================================================== PHASE 3 — POSE CLONING (MAXIMUM ACCURACY) ================================================== Replicate pose from C: - same body position - same joint angles - same posture - same balance and weight distribution CRITICAL RULE: If pose conflicts with identity: → PRESERVE IDENTITY AND ADAPT BODY SLIGHTLY ================================================== PHASE 4 — GESTURE + PHYSICAL INTENT ================================================== - match body tension - match gesture intensity - match physical intention WITHOUT modifying face geometry. 
================================================== PHASE 5 — OBJECTS + ACCESSORIES (FULL CLONE) ================================================== - identical weapon or objects - identical placement, orientation and scale - identical clothing - identical gear - identical accessory placement ================================================== PHASE 6 — SCENE + BACKGROUND (FULL CLONE) ================================================== - identical environment - identical spatial layout - identical depth - identical framing and composition ================================================== PHASE 7 — LIGHTING STRUCTURE (CONTROLLED CLONE) ================================================== - match light direction from C - match shadow placement - match global contrast FACE PROTECTION: - preserve original skin tone from A+B - preserve undertone - avoid color cast on face - avoid orange/blue cinematic contamination Face remains neutral, natural, human. ================================================== PHASE 8 — OPTICAL CONSISTENCY (ANTI-DEFORMATION) ================================================== - maintain correct head-to-body proportion - maintain natural scale - maintain spatial coherence FORBIDDEN: - wide-angle distortion - head scaling errors - perspective mismatch ================================================== PHASE 9 — HANDS + CONTACT PHYSICS ================================================== - natural grip - correct finger wrapping - visible pressure - realistic interaction FORBIDDEN: - floating objects - broken anatomy ================================================== PHASE 10 — DERMAL REALISM (HUMAN QUALITY) ================================================== - visible pores - uneven texture - natural tonal variation - realistic beard transition FORBIDDEN: - plastic skin - over-smoothing - artificial sharpness ================================================== PHASE 11 — ENVIRONMENT INTEGRATION ================================================== - subject 
must feel physically present - correct depth and scale ALLOWED: - subtle dust/sweat on body RESTRICTED: - minimal effect on face - no change in facial tone ================================================== PHASE 12 — GLOBAL NO-DEFORMATION RULE ================================================== - do not stretch face - do not change proportions - do not distort head If conflict appears: → ALWAYS PRESERVE IDENTITY FIRST ================================================== PHASE 13 — VIDEO STABILITY (KLING 3.0 READY) ================================================== - stable identity across frames - stable skin tone - stable dermal texture - stable pose FORBIDDEN: - flicker - morphing - identity drift - lighting instability on face ================================================== FINAL RESULT ================================================== Same real human from A+B fully integrated into scene C. Identity preserved 100% (face, hair, beard locked) Angle aligned with scene Pose cloned with maximum accuracy Objects and environment matched Lighting structure matched without damaging skin Result must be: - human - natural - physically believable - visually coherent - stable for Kling 3.0
REALISTIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT ELEPHANT is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). The elephant is a large adult bull, anatomically accurate, massive scale and weight, rough wrinkled skin texture, visible tusks. 
Ears fully spread wide in an aggressive threat display, head lowered slightly, trunk tense or partially curled, body angled forward as if preparing to charge. No fantasy exaggeration — size and anatomy must be realistic. (NO hippo, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the elephant. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Elephant holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. COMPOSITION (CRITICAL — GLADIATOR FIRST): Low-angle heroic shot, camera near sand level. Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Elephant occupies the mid-ground, clearly readable but NOT closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation, shallow-to-moderate DOF. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms. 
Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Elephant realistically lit but slightly less emphasized than the man.
{ "project_meta": { "task_name": "Hyper-realistic Screen Simulation", "target_model": "SDXL_1.0_Refiner", "aspect_ratio": "3:4", "resolution": { "width": 1152, "height": 1536 } }, "scene_composition": { "layer_1_physical_environment": { "camera_angle": "High-angle, downward shot (POV)", "subject_anchor": "MacBook Screen (95% fill)", "foreground_element": "Thin strip of physical keyboard visible at bottom edge", "surface_imperfections": [ "Visible RGB pixel-grid texture (Moiré effect)", "Micro-dust particles on glass surface", "Faint ambient light reflections (glossy finish)", "Subtle fingerprint smudges" ] }, "layer_2_digital_interface": { "os_theme": "macOS Dark Mode", "active_window": { "app_name": "Photo Booth", "status": "Live Preview Mode", "position": "Dominant/Center-Left" } }, "layer_3_nested_content": { "location": "Inside the Photo Booth window", "setting": { "room": "Dim bedroom", "background": "Off-white wall, rumpled bedding", "lighting": "Mixed lighting (Cool blue screen glow + Warm skin tones), deep nocturnal shadows" }, "subject_details": { "identity_source": "uploaded_female_reference_image", "demographics": "Young adult female", "appearance": { "apparel": "Black top, grey bottoms", "expression": "Relaxed, candid, slight smile", "pose": "Reclining/Lying down, looking at screen" }, "props": { "item": "iPhone 15 Pro", "hand_position": "Held in right hand" } } } }, "generation_parameters": { "prompts": { "positive": "Hyper-realistic downward shot of a MacBook screen. The screen surface has visible dust, pixel grid, and reflection. The screen displays a macOS desktop in dark mode with a dominant Photo Booth live-preview window. Inside the window: A girl in a dark bedroom with an off-white wall and rumpled bedding. The girl is lying down, wearing a black top and grey bottoms, holding an iPhone 15 Pro in her right hand. Her face is fully visible (reference matched). 
Lighting is low-key, candid, nocturnal, with bluish screen glow mixed with warm skin tones. High fidelity, raw photo, unedited, natural noise.", "negative": "vector art, screenshot, flat digital image, clean glass, perfect screen, daylight, bright studio lights, cartoon, 3d render, painting, watermark, deformed hands, blurry face" }, "sampling_settings": { "sampler": "DPM++ 3M SDE Exponential", "steps": 40, "cfg_scale": 5.5, "denoising_strength": 0.35 } }, "control_net_configuration": { "identity_strictness": "CRITICAL", "stack": [ { "unit": "ControlNet_Tile", "weight": 0.4, "purpose": "Maintain text/interface sharpness and screen texture" }, { "unit": "IP-Adapter_FaceID_Plus", "weight": 0.95, "region_mask": "Photo Booth Window Area Only", "source_image": "uploaded_female_reference_image" } ] } }
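The "project_meta" block in the screen-simulation config above declares both an aspect ratio ("3:4") and an explicit resolution (1152 x 1536), so the two can drift apart when one is edited without the other. A minimal sketch of a consistency check follows. The function name check_resolution is hypothetical, and the multiple-of-8 rule is an assumption based on latent-diffusion pipelines typically downscaling images by a factor of 8; it is not something the config itself states.

```python
from fractions import Fraction


def check_resolution(meta: dict) -> list[str]:
    """Return a list of problems found in a project_meta dict (empty if none).

    Hypothetical helper: the key names mirror the config above, but the
    checks themselves are illustrative assumptions, not part of any tool.
    """
    problems = []
    width = meta["resolution"]["width"]
    height = meta["resolution"]["height"]

    # Parse "3:4" into integers and compare as exact fractions,
    # which avoids floating-point rounding in the comparison.
    ratio_w, ratio_h = (int(part) for part in meta["aspect_ratio"].split(":"))
    if Fraction(width, height) != Fraction(ratio_w, ratio_h):
        problems.append(
            f"{width}x{height} does not match aspect ratio {meta['aspect_ratio']}"
        )

    # Assumption: dimensions should be divisible by 8 for latent-space models.
    for name, value in (("width", width), ("height", height)):
        if value % 8 != 0:
            problems.append(f"{name} {value} is not a multiple of 8")

    return problems


project_meta = {
    "aspect_ratio": "3:4",
    "resolution": {"width": 1152, "height": 1536},
}
print(check_resolution(project_meta))  # prints []
```

For the values in the config, 1152 / 1536 reduces exactly to 3 / 4 and both dimensions divide by 8, so the check reports no problems.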
Ultra-realistic cinematic image or short video sequence (3–6 seconds). Reference A and B represent the SAME real human identity. Reference C is the target scene and must be matched with maximum physical and visual fidelity. This system performs: 1) identity preservation 2) angle alignment 3) pose cloning 4) scene reconstruction 5) video stability ================================================== PHASE 1 — ABSOLUTE IDENTITY LOCK (NON-NEGOTIABLE) ================================================== The face, hair, beard and identity from A+B MUST remain EXACT. Includes: - identical facial geometry - identical proportions and spatial relationships - identical asymmetry (eyes, nose, mouth, jawline) - identical skin texture, pores and imperfections - identical beard density and irregular edges - identical hairline, hairstyle and volume MANDATORY: - preserve all imperfections - preserve asymmetry exactly FORBIDDEN: - beautification - smoothing - symmetry correction - enhancement - facial reshaping Face must look like real photography. 
================================================== PHASE 2 — ANGLE + CAMERA ALIGNMENT (CRITICAL) ================================================== Before scene integration, align identity to match C: - match head tilt (left/right) - match vertical angle (up/down) - match face rotation - match gaze direction CAMERA MATCH: - match camera distance from C - match perspective compression - maintain natural focal length (35mm–85mm) MANDATORY: - keep facial proportions intact - avoid distortion FORBIDDEN: - stretching face - resizing head unnaturally - perspective mismatch ================================================== PHASE 3 — POSE CLONING (MAXIMUM ACCURACY) ================================================== Replicate pose from C: - same body position - same joint angles - same posture - same balance and weight distribution CRITICAL RULE: If pose conflicts with identity: → PRESERVE IDENTITY AND ADAPT BODY SLIGHTLY ================================================== PHASE 4 — GESTURE + PHYSICAL INTENT ================================================== - match body tension - match gesture intensity - match physical intention WITHOUT modifying face geometry. 
================================================== PHASE 5 — OBJECTS + ACCESSORIES (FULL CLONE) ================================================== - identical weapon or objects - identical placement, orientation and scale - identical clothing - identical gear - identical accessory placement ================================================== PHASE 6 — SCENE + BACKGROUND (FULL CLONE) ================================================== - identical environment - identical spatial layout - identical depth - identical framing and composition ================================================== PHASE 7 — LIGHTING STRUCTURE (CONTROLLED CLONE) ================================================== - match light direction from C - match shadow placement - match global contrast FACE PROTECTION: - preserve original skin tone from A+B - preserve undertone - avoid color cast on face - avoid orange/blue cinematic contamination Face remains neutral, natural, human. ================================================== PHASE 8 — OPTICAL CONSISTENCY (ANTI-DEFORMATION) ================================================== - maintain correct head-to-body proportion - maintain natural scale - maintain spatial coherence FORBIDDEN: - wide-angle distortion - head scaling errors - perspective mismatch ================================================== PHASE 9 — HANDS + CONTACT PHYSICS ================================================== - natural grip - correct finger wrapping - visible pressure - realistic interaction FORBIDDEN: - floating objects - broken anatomy ================================================== PHASE 10 — DERMAL REALISM (HUMAN QUALITY) ================================================== - visible pores - uneven texture - natural tonal variation - realistic beard transition FORBIDDEN: - plastic skin - over-smoothing - artificial sharpness ================================================== PHASE 11 — ENVIRONMENT INTEGRATION ================================================== - subject 
must feel physically present - correct depth and scale ALLOWED: - subtle dust/sweat on body RESTRICTED: - minimal effect on face - no change in facial tone ================================================== PHASE 12 — GLOBAL NO-DEFORMATION RULE ================================================== - do not stretch face - do not change proportions - do not distort head If conflict appears: → ALWAYS PRESERVE IDENTITY FIRST ================================================== PHASE 13 — VIDEO STABILITY (KLING 3.0 READY) ================================================== - stable identity across frames - stable skin tone - stable dermal texture - stable pose FORBIDDEN: - flicker - morphing - identity drift - lighting instability on face ================================================== FINAL RESULT ================================================== Same real human from A+B fully integrated into scene C. Identity preserved 100% (face, hair, beard locked) Angle aligned with scene Pose cloned with maximum accuracy Objects and environment matched Lighting structure matched without damaging skin Result must be: - human - natural - physically believable - visually coherent - stable for Kling 3.0
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. 
It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration — realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. CAMERA / ANGLE (UPDATED — MORE CINEMATIC): Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face). Lens feel: 35mm full-frame, close enough for dramatic scale. Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him. Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet). 
Moderate DOF for separation; background arches and crowd remain soft. COMPOSITION (CRITICAL — GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhino realistically lit but slightly less emphasized than the man.
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. A KOMODO DRAGON (Varanus komodoensis) is present on the left side of frame, facing the gladiator, placed farther from the lens than the gladiator (secondary subject). The Komodo dragon looks powerful and real: thick muscular body, rough scaled skin, heavy tail, strong legs, low stalking posture, forked tongue visible. 
Keep its size believable (large adult), with accurate anatomy and texture. (NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact”. The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the Komodo dragon. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face (cheekbone, nose line, jaw, lips, eye, brow) — fully visible, sharply defined, and perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. His facial expression must be readable: concentrated, controlled, predatory calm. Body/action: Gladiator leaning forward, shield raised defensively, sword low and ready, muscles and chest hair clearly detailed. Komodo dragon advancing slowly or pausing mid-step, tongue flicking, dust disturbed by its claws, tension high. NO blood, NO wounds, NO biting, NO visible injuries, NO aftermath. Suspense and power, not violence. COMPOSITION (CRITICAL — GLADIATOR FIRST): Low-angle heroic shot, camera near sand level. Gladiator dominates the foreground and frame (waist-up or 3/4 body), with face and torso fully visible, not cut off. Komodo dragon occupies the left mid-ground (clearly visible but NOT closer than the man). Amphitheater arches clearly recognizable in the background. Clean horizon, strong depth separation (shallow-to-moderate DOF). Keep the gladiator’s face and chest in tack-sharp focus; keep the Komodo dragon recognizable and detailed. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Perfect, professional illumination on the gladiator: Warm golden sunlight as key light (or strong soft key from sun direction), PLUS soft fill light to remove harsh shadows on face and torso. Subtle rim light outlining shoulders/helmet/arms. Every muscle, chest hair texture, skin pores, and facial details are clearly visible. 
Komodo dragon and background may be slightly less bright; priority is the gladiator. No underexposure on the man. No blown highlights. CAMERA / LENS: Full-frame look, 35mm or 50mm (epic scale, minimal distortion). Fast shutter feel, subject sharp, dust provides motion cues only. Realistic bokeh, controlled contrast, filmic color grading. No HDR, no neon, no fantasy glow. QUALITY CONTROL: Photorealistic, historical-epic campaign finish. No artifacts, no warped anatomy, no extra limbs, no deformed hands. No random logos, no watermarks, no text. NEGATIVE PROMPT: lion, bear, dragon fantasy, dinosaur, crocodile, snake, extra animals, multiple komodo dragons, cartoon komodo, blood, gore, injury, wounds, biting, mauling, dead animal, dead body, graphic violence, dismemberment, low resolution, blurry, plastic skin, over-smoothed, uncanny valley, extra fingers, extra limbs, warped anatomy, cartoon, illustration, CGI look, fantasy lighting, neon, oversaturation, harsh HDR, random text, misspelled words, logos, watermarks
REALISTIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT ELEPHANT is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). The elephant is a large adult bull, anatomically accurate, massive scale and weight, rough wrinkled skin texture, visible tusks. 
Ears fully spread wide in an aggressive threat display, head lowered slightly, trunk tense or partially curled, body angled forward as if preparing to charge. No fantasy exaggeration — size and anatomy must be realistic. (NO hippo, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the elephant. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Elephant holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. COMPOSITION (CRITICAL — GLADIATOR FIRST): Low-angle heroic shot, camera near sand level. Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Elephant occupies the mid-ground, clearly readable but NOT closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation, shallow-to-moderate DOF. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator, with a subtle rim light outlining shoulders, helmet, arms. 
Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Elephant realistically lit but slightly less emphasized than the man.
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. 
It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration — realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. CAMERA / ANGLE (UPDATED — MORE CINEMATIC) Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face). Lens feel: 35mm full-frame, close enough for dramatic scale. Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him. Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet). 
Moderate DOF for separation; background arches and crowd remain soft. COMPOSITION (CRITICAL — GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator, with a subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhino realistically lit but slightly less emphasized than the man.
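The gladiator prompt exists in several variants that differ mainly in the animal, so the animal name has to agree everywhere it appears (scene, eye-line, exclusion list, lighting). Below is a minimal sketch, not part of the original prompts, of deriving the animal-dependent fragments from one parameter. The names `ANIMALS`, `exclusion_clause`, and `lighting_line` are hypothetical, and the animal list is an assumption assembled from the exclusion lists above.

```python
# Hypothetical helpers for keeping the animal name consistent across a prompt variant.
# The candidate list is an assumption taken from the "(NO ...)" clauses in the prompts.
ANIMALS = ["elephant", "rhinoceros", "hippo", "Komodo dragon", "lion", "bear"]


def exclusion_clause(animal: str) -> str:
    """Build the '(NO x, NO y, ...)' clause listing every animal except the chosen one."""
    if animal not in ANIMALS:
        raise ValueError(f"unknown animal: {animal}")
    others = [a for a in ANIMALS if a != animal]
    return "(" + ", ".join(f"NO {a}" for a in others) + ".)"


def lighting_line(animal: str) -> str:
    """Build the closing lighting sentence with the chosen animal's name."""
    name = animal[0].upper() + animal[1:]
    return f"{name} realistically lit but slightly less emphasized than the man."


if __name__ == "__main__":
    print(exclusion_clause("elephant"))
    print(lighting_line("elephant"))
```

Generating these fragments from a single value is one way to avoid a variant that names one animal in the scene and another in the lighting section.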
{ "reference_source": "Use the uploaded reference image as the single source of truth", "identity_lock": { "face": "100% identical to reference image — same facial structure, eyes, nose, lips, jawline, skin texture, expression", "body": "Same body proportions, height, posture, shoulder width, waist, hips, limbs", "hair": "Same hairstyle, hairline, length, color, volume", "age": "Exact same apparent age as reference", "ethnicity": "Unchanged from reference" }, "consistency_rules": [ "Do NOT modify face shape", "Do NOT beautify, stylize, or alter facial features", "No AI face swap artifacts", "No face blending with other identities", "No body reshaping", "No weight gain or loss", "No added or removed tattoos, scars, or marks" ], "wardrobe": { "instruction": "Only change clothing if explicitly specified", "fit": "Clothing fits naturally and accurately to the same body", "fabric_behavior": "Realistic folds, tension, and shadows" }, "pose_and_scene": { "pose": "Preserve the same pose unless explicitly changed", "camera_angle": "Same camera height, distance, and lens perspective", "lighting": "Match original lighting direction, softness, and intensity", "environment": "Realistic environment with consistent scale and depth" }, "image_quality": { "realism": "Photorealistic, natural skin texture", "clarity": "Sharp focus on face and eyes", "resolution": "High-definition, clean details", "artifacts": "No blur, distortion, double face, broken anatomy" }, "negative_prompt": [ "different face", "face drift", "face mutation", "body distortion", "incorrect anatomy", "extra limbs", "blurred face", "plastic skin", "AI-generated look", "over-smoothing", "unreal lighting" ], "final_instruction": "The generated image must look like the SAME PERSON photographed again, not a variation or look-alike." 
} { "prompt": "Intimate full-body mirror selfie portrait of a young woman in bedroom, standing in front of large ornate gold-framed mirror, back turned slightly to show rear view while taking selfie with phone in one hand, long dark brown hair tied in high messy bun with loose strands, fair tan skin with natural glow and subtle texture, wearing ultra-sheer black lace lingerie set with intricate mesh panels, high-cut thong bottoms, garter straps connected to thigh-high stockings, thin spaghetti straps on bra top, silver metallic high-heel mules visible on feet, arched back and hips emphasized for seductive pose, one arm extended holding phone, other hand on mirror or hip, bedroom background with beige walls, white bed sheets, scattered books/candles, soft warm golden lamp light from side creating rim glow on body curves lace texture and hair, subtle blue LED accent from device, shallow depth of field, strong cinematic bokeh on mirror reflection and background, photorealistic sensual lingerie fashion selfie, high detail lace transparency skin pores hair strands garter straps heel shine and natural imperfections, shot on smartphone with slight wide-angle distortion typical for mirror shots, ultra detailed, 8k resolution", "negative_prompt": "cartoon, anime, illustration, painting, deformed, blurry, lowres, plastic skin, doll-like, heavy makeup overload, thick eyeliner, false lashes, filters, beauty filter, airbrushed skin, extra limbs, distorted proportions, asymmetrical face, harsh flash, cold lighting, day time, fully clothed, short hair, blonde hair, standing straight, no mirror, no phone, crowded room, text watermark, logo, ugly, bad anatomy, overexposed, underexposed, low contrast", "reference_image": { "enabled": true, "strength": 0.92, "description": "Extremely strong reference for exact composition and pose: young woman taking mirror selfie showing back/rear view, long dark hair in messy bun, sheer black lace lingerie with garter straps and high-cut bottoms, 
arched back emphasizing curves, phone in hand, ornate gold mirror frame, soft warm bedroom light with blue accent, seductive intimate vibe, high detail lace and skin texture" }, "style": "photorealistic intimate mirror selfie, sensual lingerie boudoir, warm golden bedroom lighting", "aspect_ratio": "3:4", "lighting": "soft warm golden lamp light from side, gentle rim light on back curves hair and lace, subtle blue LED fill from phone, realistic soft shadows and highlights", "camera": "smartphone mirror selfie (iPhone style), slight wide-angle distortion, natural low-key indoor light, minor grain for realism", "additional_details": [ "hair: dark brown-black, long, tied in high messy bun with loose flyaways and strands down back", "outfit: sheer black mesh lace bra and high-cut thong bottoms, garter straps with metal rings, thin straps, semi-transparent fabric clinging to body
EDIT PROMPT (CLEAR & STRICT): Replace only the bikini she is currently wearing with the provided reference bikini. The new bikini must fit her perfectly and tightly, matching the reference exactly in color, fabric, cut, straps, and details. Do NOT change anything else. 
Preserve the same woman, same body, same proportions Keep the exact pose, angle, expression, lighting, background, and camera framing No changes to face, hair, skin tone, or body shape She remains default/slim as she already is (not fat, not obese, no body exaggeration) This is a wardrobe-only replacement. Everything except the bikini stays 100% identical to the original image. Negative instructions: No resizing or reshaping of the body No stylization or artistic changes No added accessories No changes to posture or anatomy No distortion or blur Result: Photorealistic, natural-looking bikini replacement that appears originally worn by her, with realistic fabric tension and clean edges.
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. 
It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration — realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. CAMERA / ANGLE (UPDATED — MORE CINEMATIC) Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face). Lens feel: 35mm full-frame, close enough for dramatic scale. Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him. Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet). 
Moderate DOF for separation; background arches and crowd remain soft. COMPOSITION (CRITICAL — GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhino realistically lit but slightly less emphasized than the man.
{ "project_meta": { "task_name": "Hyper-realistic Screen Simulation", "target_model": "SDXL_1.0_Refiner", "aspect_ratio": "3:4", "resolution": { "width": 1152, "height": 1536 } }, "scene_composition": { "layer_1_physical_environment": { "camera_angle": "High-angle, downward shot (POV)", "subject_anchor": "MacBook Screen (95% fill)", "foreground_element": "Thin strip of physical keyboard visible at bottom edge", "surface_imperfections": [ "Visible RGB pixel-grid texture (Moiré effect)", "Micro-dust particles on glass surface", "Faint ambient light reflections (glossy finish)", "Subtle fingerprint smudges" ] }, "layer_2_digital_interface": { "os_theme": "macOS Dark Mode", "active_window": { "app_name": "Photo Booth", "status": "Live Preview Mode", "position": "Dominant/Center-Left" } }, "layer_3_nested_content": { "location": "Inside the Photo Booth window", "setting": { "room": "Dim bedroom", "background": "Off-white wall, rumpled bedding", "lighting": "Mixed lighting (Cool blue screen glow + Warm skin tones), deep nocturnal shadows" }, "subject_details": { "identity_source": "uploaded_female_reference_image", "demographics": "Young adult female", "appearance": { "apparel": "Black top, grey bottoms", "expression": "Relaxed, candid, slight smile", "pose": "Reclining/Lying down, looking at screen" }, "props": { "item": "iPhone 15 Pro", "hand_position": "Held in right hand" } } } }, "generation_parameters": { "prompts": { "positive": "Hyper-realistic downward shot of a MacBook screen. The screen surface has visible dust, pixel grid, and reflection. The screen displays a macOS desktop in dark mode with a dominant Photo Booth live-preview window. Inside the window: A girl in a dark bedroom with an off-white wall and rumpled bedding. The girl is lying down, wearing a black top and grey bottoms, holding an iPhone 15 Pro in her right hand. Her face is fully visible (reference matched). 
Lighting is low-key, candid, nocturnal, with bluish screen glow mixed with warm skin tones. High fidelity, raw photo, unedited, natural noise.", "negative": "vector art, screenshot, flat digital image, clean glass, perfect screen, daylight, bright studio lights, cartoon, 3d render, painting, watermark, deformed hands, blurry face" }, "sampling_settings": { "sampler": "DPM++ 3M SDE Exponential", "steps": 40, "cfg_scale": 5.5, "denoising_strength": 0.35 } }, "control_net_configuration": { "identity_strictness": "CRITICAL", "stack": [ { "unit": "ControlNet_Tile", "weight": 0.4, "purpose": "Maintain text/interface sharpness and screen texture" }, { "unit": "IP-Adapter_FaceID_Plus", "weight": 0.95, "region_mask": "Photo Booth Window Area Only", "source_image": "uploaded_female_reference_image" } ] } }
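The "project_meta" block above declares both an aspect ratio ("3:4") and an explicit resolution (1152 x 1536). A minimal sketch of a consistency check between the two follows. The helper name `matches_aspect_ratio` is hypothetical and not part of any generation tool; only the three values are taken from the config.

```python
from fractions import Fraction


def matches_aspect_ratio(width: int, height: int, ratio: str) -> bool:
    """Return True when width:height reduces to the 'W:H' ratio string."""
    ratio_w, ratio_h = (int(part) for part in ratio.split(":"))
    return Fraction(width, height) == Fraction(ratio_w, ratio_h)


# Values copied from the "project_meta" block of the config above.
project_meta = {
    "aspect_ratio": "3:4",
    "resolution": {"width": 1152, "height": 1536},
}

resolution = project_meta["resolution"]
is_consistent = matches_aspect_ratio(
    resolution["width"], resolution["height"], project_meta["aspect_ratio"]
)
print(is_consistent)
```

Comparing exact fractions rather than floating-point quotients avoids rounding surprises for ratios such as 9:16.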
REALISTIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT ELEPHANT is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). The elephant is a large adult bull, anatomically accurate, massive scale and weight, rough wrinkled skin texture, visible tusks. 
Ears fully spread wide in an aggressive threat display, head lowered slightly, trunk tense or partially curled, body angled forward as if preparing to charge. No fantasy exaggeration — size and anatomy must be realistic. (NO hippo, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the elephant. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Elephant holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. COMPOSITION (CRITICAL — GLADIATOR FIRST): Low-angle heroic shot, camera near sand level. Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Elephant occupies the mid-ground, clearly readable but NOT closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation, shallow-to-moderate DOF. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms.
Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Elephant realistically lit but slightly less emphasized than the man.
Ultra-realistic cinematic image or short video sequence (3–6 seconds).
Reference A and B represent the SAME real human identity.
Reference C is the target scene and must be matched with maximum physical and visual fidelity.

This system performs:
1) identity preservation
2) angle alignment
3) pose cloning
4) scene reconstruction
5) video stability

==================================================
PHASE 1 — ABSOLUTE IDENTITY LOCK (NON-NEGOTIABLE)
==================================================
The face, hair, beard and identity from A+B MUST remain EXACT.

Includes:
- identical facial geometry
- identical proportions and spatial relationships
- identical asymmetry (eyes, nose, mouth, jawline)
- identical skin texture, pores and imperfections
- identical beard density and irregular edges
- identical hairline, hairstyle and volume

MANDATORY:
- preserve all imperfections
- preserve asymmetry exactly

FORBIDDEN:
- beautification
- smoothing
- symmetry correction
- enhancement
- facial reshaping

Face must look like real photography.

==================================================
PHASE 2 — ANGLE + CAMERA ALIGNMENT (CRITICAL)
==================================================
Before scene integration, align identity to match C:
- match head tilt (left/right)
- match vertical angle (up/down)
- match face rotation
- match gaze direction

CAMERA MATCH:
- match camera distance from C
- match perspective compression
- maintain natural focal length (35mm–85mm)

MANDATORY:
- keep facial proportions intact
- avoid distortion

FORBIDDEN:
- stretching face
- resizing head unnaturally
- perspective mismatch

==================================================
PHASE 3 — POSE CLONING (MAXIMUM ACCURACY)
==================================================
Replicate pose from C:
- same body position
- same joint angles
- same posture
- same balance and weight distribution

CRITICAL RULE:
If pose conflicts with identity:
→ PRESERVE IDENTITY AND ADAPT BODY SLIGHTLY

==================================================
PHASE 4 — GESTURE + PHYSICAL INTENT
==================================================
- match body tension
- match gesture intensity
- match physical intention

WITHOUT modifying face geometry.

==================================================
PHASE 5 — OBJECTS + ACCESSORIES (FULL CLONE)
==================================================
- identical weapon or objects
- identical placement, orientation and scale
- identical clothing
- identical gear
- identical accessory placement

==================================================
PHASE 6 — SCENE + BACKGROUND (FULL CLONE)
==================================================
- identical environment
- identical spatial layout
- identical depth
- identical framing and composition

==================================================
PHASE 7 — LIGHTING STRUCTURE (CONTROLLED CLONE)
==================================================
- match light direction from C
- match shadow placement
- match global contrast

FACE PROTECTION:
- preserve original skin tone from A+B
- preserve undertone
- avoid color cast on face
- avoid orange/blue cinematic contamination

Face remains neutral, natural, human.

==================================================
PHASE 8 — OPTICAL CONSISTENCY (ANTI-DEFORMATION)
==================================================
- maintain correct head-to-body proportion
- maintain natural scale
- maintain spatial coherence

FORBIDDEN:
- wide-angle distortion
- head scaling errors
- perspective mismatch

==================================================
PHASE 9 — HANDS + CONTACT PHYSICS
==================================================
- natural grip
- correct finger wrapping
- visible pressure
- realistic interaction

FORBIDDEN:
- floating objects
- broken anatomy

==================================================
PHASE 10 — DERMAL REALISM (HUMAN QUALITY)
==================================================
- visible pores
- uneven texture
- natural tonal variation
- realistic beard transition

FORBIDDEN:
- plastic skin
- over-smoothing
- artificial sharpness

==================================================
PHASE 11 — ENVIRONMENT INTEGRATION
==================================================
- subject must feel physically present
- correct depth and scale

ALLOWED:
- subtle dust/sweat on body

RESTRICTED:
- minimal effect on face
- no change in facial tone

==================================================
PHASE 12 — GLOBAL NO-DEFORMATION RULE
==================================================
- do not stretch face
- do not change proportions
- do not distort head

If conflict appears:
→ ALWAYS PRESERVE IDENTITY FIRST

==================================================
PHASE 13 — VIDEO STABILITY (KLING 3.0 READY)
==================================================
- stable identity across frames
- stable skin tone
- stable dermal texture
- stable pose

FORBIDDEN:
- flicker
- morphing
- identity drift
- lighting instability on face

==================================================
FINAL RESULT
==================================================
Same real human from A+B fully integrated into scene C.

Identity preserved 100% (face, hair, beard locked)
Angle aligned with scene
Pose cloned with maximum accuracy
Objects and environment matched
Lighting structure matched without damaging skin

Result must be:
- human
- natural
- physically believable
- visually coherent
- stable for Kling 3.0
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. 
It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration — realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. CAMERA / ANGLE (UPDATED — MORE CINEMATIC) Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face). Lens feel: 35mm full-frame, close enough for dramatic scale. Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him. Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet). 
Moderate DOF for separation; background arches and crowd remain soft. COMPOSITION (CRITICAL — GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhino realistically lit but slightly less emphasized than the man.
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. A KOMODO DRAGON (Varanus komodoensis) is present on the left side of frame, facing the gladiator, placed farther from the lens than the gladiator (secondary subject). The Komodo dragon looks powerful and real: thick muscular body, rough scaled skin, heavy tail, strong legs, low stalking posture, forked tongue visible. 
Keep its size believable (large adult), with accurate anatomy and texture. (NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact”. The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the Komodo dragon. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face (cheekbone, nose line, jaw, lips, eye, brow) — fully visible, sharply defined, and perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. His facial expression must be readable: concentrated, controlled, predatory calm. Body/action: Gladiator leaning forward, shield raised defensively, sword low and ready, muscles and chest hair clearly detailed. Komodo dragon advancing slowly or pausing mid-step, tongue flicking, dust disturbed by its claws, tension high. NO blood, NO wounds, NO biting, NO visible injuries, NO aftermath. Suspense and power, not violence. COMPOSITION (CRITICAL — GLADIATOR FIRST): Low-angle heroic shot, camera near sand level. Gladiator dominates the foreground and frame (waist-up or 3/4 body), with face and torso fully visible, not cut off. Komodo dragon occupies the left mid-ground (clearly visible but NOT closer than the man). Amphitheater arches clearly recognizable in the background. Clean horizon, strong depth separation (shallow-to-moderate DOF). Keep the gladiator’s face and chest in tack-sharp focus; keep the Komodo dragon recognizable and detailed. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Perfect, professional illumination on the gladiator: Warm golden sunlight as key light (or strong soft key from sun direction), PLUS soft fill light to remove harsh shadows on face and torso. Subtle rim light outlining shoulders/helmet/arms. Every muscle, chest hair texture, skin pores, and facial details are clearly visible. 
Komodo dragon and background may be slightly less bright; priority is the gladiator. No underexposure on the man. No blown highlights. CAMERA / LENS: Full-frame look, 35mm or 50mm (epic scale, minimal distortion). Fast shutter feel, subject sharp, dust provides motion cues only. Realistic bokeh, controlled contrast, filmic color grading. No HDR, no neon, no fantasy glow. QUALITY CONTROL: Photorealistic, historical-epic campaign finish. No artifacts, no warped anatomy, no extra limbs, no deformed hands. No random logos, no watermarks, no text. NEGATIVE PROMPT: lion, bear, dragon fantasy, dinosaur, crocodile, snake, extra animals, multiple komodo dragons, cartoon komodo, blood, gore, injury, wounds, biting, mauling, dead animal, dead body, graphic violence, dismemberment, low resolution, blurry, plastic skin, over-smoothed, uncanny valley, extra fingers, extra limbs, warped anatomy, cartoon, illustration, CGI look, fantasy lighting, neon, oversaturation, harsh HDR, random text, misspelled words, logos, watermarks
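Some image tools take the negative prompt as a separate input rather than as part of the main text. As a minimal sketch, assuming the prompt is held as one string and the section is introduced by the literal label `NEGATIVE PROMPT:` as above, the terms could be split out like this. The function name `parse_negative_prompt` and the lower-casing and de-duplication steps are illustrative assumptions, not part of any particular tool.

```python
def parse_negative_prompt(prompt: str, marker: str = "NEGATIVE PROMPT:") -> list[str]:
    """Return the comma-separated terms that follow `marker`.

    Terms are stripped, lower-cased, and de-duplicated while keeping
    their first-seen order. Returns an empty list if the marker is absent.
    """
    _, found, tail = prompt.partition(marker)
    if not found:
        return []

    seen: set[str] = set()
    terms: list[str] = []
    for raw in tail.split(","):
        term = raw.strip().lower()
        if term and term not in seen:
            seen.add(term)
            terms.append(term)
    return terms


# Hypothetical shortened prompt, used only to illustrate the call.
example = (
    "QUALITY CONTROL: No artifacts. "
    "NEGATIVE PROMPT: lion, bear, blood, gore, Lion, watermarks"
)
print(parse_negative_prompt(example))
```

The sketch treats everything after the label as the negative list, so it only fits prompts where that section comes last, as it does here.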
{ "reference_source": "Use the uploaded reference image as the single source of truth", "identity_lock": { "face": "100% identical to reference image — same facial structure, eyes, nose, lips, jawline, skin texture, expression", "body": "Same body proportions, height, posture, shoulder width, waist, hips, limbs", "hair": "Same hairstyle, hairline, length, color, volume", "age": "Exact same apparent age as reference", "ethnicity": "Unchanged from reference" }, "consistency_rules": [ "Do NOT modify face shape", "Do NOT beautify, stylize, or alter facial features", "No AI face swap artifacts", "No face blending with other identities", "No body reshaping", "No weight gain or loss", "No added or removed tattoos, scars, or marks" ], "wardrobe": { "instruction": "Only change clothing if explicitly specified", "fit": "Clothing fits naturally and accurately to the same body", "fabric_behavior": "Realistic folds, tension, and shadows" }, "pose_and_scene": { "pose": "Preserve the same pose unless explicitly changed", "camera_angle": "Same camera height, distance, and lens perspective", "lighting": "Match original lighting direction, softness, and intensity", "environment": "Realistic environment with consistent scale and depth" }, "image_quality": { "realism": "Photorealistic, natural skin texture", "clarity": "Sharp focus on face and eyes", "resolution": "High-definition, clean details", "artifacts": "No blur, distortion, double face, broken anatomy" }, "negative_prompt": [ "different face", "face drift", "face mutation", "body distortion", "incorrect anatomy", "extra limbs", "blurred face", "plastic skin", "AI-generated look", "over-smoothing", "unreal lighting" ], "final_instruction": "The generated image must look like the SAME PERSON photographed again, not a variation or look-alike." 
} { "prompt": "Intimate full-body mirror selfie portrait of a young woman in bedroom, standing in front of large ornate gold-framed mirror, back turned slightly to show rear view while taking selfie with phone in one hand, long dark brown hair tied in high messy bun with loose strands, fair tan skin with natural glow and subtle texture, wearing ultra-sheer black lace lingerie set with intricate mesh panels, high-cut thong bottoms, garter straps connected to thigh-high stockings, thin spaghetti straps on bra top, silver metallic high-heel mules visible on feet, arched back and hips emphasized for seductive pose, one arm extended holding phone, other hand on mirror or hip, bedroom background with beige walls, white bed sheets, scattered books/candles, soft warm golden lamp light from side creating rim glow on body curves lace texture and hair, subtle blue LED accent from device, shallow depth of field, strong cinematic bokeh on mirror reflection and background, photorealistic sensual lingerie fashion selfie, high detail lace transparency skin pores hair strands garter straps heel shine and natural imperfections, shot on smartphone with slight wide-angle distortion typical for mirror shots, ultra detailed, 8k resolution", "negative_prompt": "cartoon, anime, illustration, painting, deformed, blurry, lowres, plastic skin, doll-like, heavy makeup overload, thick eyeliner, false lashes, filters, beauty filter, airbrushed skin, extra limbs, distorted proportions, asymmetrical face, harsh flash, cold lighting, day time, fully clothed, short hair, blonde hair, standing straight, no mirror, no phone, crowded room, text watermark, logo, ugly, bad anatomy, overexposed, underexposed, low contrast", "reference_image": { "enabled": true, "strength": 0.92, "description": "Extremely strong reference for exact composition and pose: young woman taking mirror selfie showing back/rear view, long dark hair in messy bun, sheer black lace lingerie with garter straps and high-cut bottoms, 
arched back emphasizing curves, phone in hand, ornate gold mirror frame, soft warm bedroom light with blue accent, seductive intimate vibe, high detail lace and skin texture" }, "style": "photorealistic intimate mirror selfie, sensual lingerie boudoir, warm golden bedroom lighting", "aspect_ratio": "3:4", "lighting": "soft warm golden lamp light from side, gentle rim light on back curves hair and lace, subtle blue LED fill from phone, realistic soft shadows and highlights", "camera": "smartphone mirror selfie (iPhone style), slight wide-angle distortion, natural low-key indoor light, minor grain for realism", "additional_details": [ "hair: dark brown-black, long, tied in high messy bun with loose flyaways and strands down back", "outfit: sheer black mesh lace bra and high-cut thong bottoms, garter straps with metal rings, thin straps, semi-transparent fabric clinging to body See less Nano Banana Recent Generating… { "reference_source": "Use the uploaded reference image as the single source of truth", "identity_lock": { "face": "100% identical to reference image — same facial structure, eyes, nose, lips, jawline, skin texture, expression", "body": "Same body proportions, height, posture, shoulder width, waist, hips, limbs", "hair": "Same hairstyle, hairline, length, color, volume", "age": "Exact same apparent age as reference", "ethnicity": "Unchanged from reference" }, "consistency_rules": [ "Do NOT modify face shape", "Do NOT beautify, stylize, or alter facial features", "No AI face swap artifacts", "No face blending with other identities", "No body reshaping", "No weight gain or loss", "No added or removed tattoos, scars, or marks" ], "wardrobe": { "instruction": "Only change clothing if explicitly specified", "fit": "Clothing fits naturally and accurately to the same body", "fabric_behavior": "Realistic folds, tension, and shadows" }, "pose_and_scene": { "pose": "Preserve the same pose unless explicitly changed", "camera_angle": "Same camera height, distance, 
and lens perspective", "lighting": "Match original lighting direction, softness, and intensity", "environment": "Realistic environment with consistent scale and depth" }, "image_quality": { "realism": "Photorealistic, natural skin texture", "clarity": "Sharp focus on face and eyes", "resolution": "High-definition, clean details", "artifacts": "No blur, distortion, double face, broken anatomy" }, "negative_prompt": [ "different face", "face drift", "face mutation", "body distortion", "incorrect anatomy", "extra limbs", "blurred face", "plastic skin", "AI-generated look", "over-smoothing", "unreal lighting" ], "final_instruction": "The generated image must look like the SAME PERSON photographed again, not a variation or look-alike." } { "prompt": "Intimate full-body mirror selfie portrait of a young woman in bedroom, standing in front of large ornate gold-framed mirror, back turned slightly to show rear view while taking selfie with phone in one hand, long dark brown hair tied in high messy bun with loose strands, fair tan skin with natural glow and subtle texture, wearing ultra-sheer black lace lingerie set with intricate mesh panels, high-cut thong bottoms, garter straps connected to thigh-high stockings, thin spaghetti straps on bra top, silver metallic high-heel mules visible on feet, arched back and hips emphasized for seductive pose, one arm extended holding phone, other hand on mirror or hip, bedroom background with beige walls, white bed sheets, scattered books/candles, soft warm golden lamp light from side creating rim glow on body curves lace texture and hair, subtle blue LED accent from device, shallow depth of field, strong cinematic bokeh on mirror reflection and background, photorealistic sensual lingerie fashion selfie, high detail lace transparency skin pores hair strands garter straps heel shine and natural imperfections, shot on smartphone with slight wide-angle distortion typical for mirror shots, ultra detailed, 8k resolution", "negative_prompt": 
"cartoon, anime, illustration, painting, deformed, blurry, lowres, plastic skin, doll-like, heavy makeup overload, thick eyeliner, false lashes, filters, beauty filter, airbrushed skin, extra limbs, distorted proportions, asymmetrical face, harsh flash, cold lighting, day time, fully clothed, short hair, blonde hair, standing straight, no mirror, no phone, crowded room, text watermark, logo, ugly, bad anatomy, overexposed, underexposed, low contrast", "reference_image": { "enabled": true, "strength": 0.92, "description": "Extremely strong reference for exact composition and pose: young woman taking mirror selfie showing back/rear view, long dark hair in messy bun, sheer black lace lingerie with garter straps and high-cut bottoms, arched back emphasizing curves, phone in hand, ornate gold mirror frame, soft warm bedroom light with blue accent, seductive intimate vibe, high detail lace and skin texture" }, "style": "photorealistic intimate mirror selfie, sensual lingerie boudoir, warm golden bedroom lighting", "aspect_ratio": "3:4", "lighting": "soft warm golden lamp light from side, gentle rim light on back curves hair and lace, subtle blue LED fill from phone, realistic soft shadows and highlights", "camera": "smartphone mirror selfie (iPhone style), slight wide-angle distortion, natural low-key indoor light, minor grain for realism", "additional_details": [ "hair: dark brown-black, long, tied in high messy bun with loose flyaways and strands down back", "outfit: sheer black mesh lace bra and high-cut thong bottoms, garter straps with metal rings, thin straps, semi-transparent fabric clinging to body  EDIT PROMPT (CLEAR & STRICT): Replace only the bikini she is currently wearing with the provided reference bikini. The new bikini must fit her perfectly and tightly, matching the reference exactly in color, fabric, cut, straps, and details. Do NOT change anything else. 
Preserve the same woman, same body, same proportions. Keep the exact pose, angle, expression, lighting, background, and camera framing. No changes to face, hair, skin tone, or body shape. She remains default/slim as she already is (not fat, not obese, no body exaggeration). This is a wardrobe-only replacement. Everything except the bikini stays 100% identical to the original image. Negative instructions: No resizing or reshaping of the body. No stylization or artistic changes. No added accessories. No changes to posture or anatomy. No distortion or blur. Result: Photorealistic, natural-looking bikini replacement that appears originally worn by her, with realistic fabric tension and clean edges.
REALISTIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT ELEPHANT is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). The elephant is a large adult bull, anatomically accurate, massive scale and weight, rough wrinkled skin texture, visible tusks. 
Ears fully spread wide in an aggressive threat display, head lowered slightly, trunk tense or partially curled, body angled forward as if preparing to charge. No fantasy exaggeration — size and anatomy must be realistic. (NO hippo, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the elephant. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Elephant holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. COMPOSITION (CRITICAL — GLADIATOR FIRST): Low-angle heroic shot, camera near sand level. Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Elephant occupies the mid-ground, clearly readable but NOT closer than the man. Colosseum arches recognizable in the background. Strong depth separation, shallow-to-moderate DOF. Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Elephant realistically lit but slightly less emphasized than the man. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator, Subtle rim light outlining shoulders, helmet, arms. 
Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Elephant realistically lit but slightly less emphasized than the man.
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization. REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity. SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail. Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword. Expression: fierce, commanding — controlled roar or intense focus. SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. 
It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration — realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.) ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal. Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm. Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence. CAMERA / ANGLE (UPDATED — MORE CINEMATIC) Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face). Lens feel: 35mm full-frame, close enough for dramatic scale. Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him. Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet). 
Moderate DOF for separation; background arches and crowd remain soft. COMPOSITION (CRITICAL — GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation. LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator, Subtle rim light outlining shoulders, helmet, arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhino realistically lit but slightly less emphasized than the man.
Ultra-realistic cinematic image or short video sequence (3–6 seconds).
Reference A and B represent the SAME real human identity.
Reference C is the target scene and must be matched with maximum physical and visual fidelity.

This system performs:
1) identity preservation
2) angle alignment
3) pose cloning
4) scene reconstruction
5) video stability

==================================================
PHASE 1 — ABSOLUTE IDENTITY LOCK (NON-NEGOTIABLE)
==================================================
The face, hair, beard and identity from A+B MUST remain EXACT.

Includes:
- identical facial geometry
- identical proportions and spatial relationships
- identical asymmetry (eyes, nose, mouth, jawline)
- identical skin texture, pores and imperfections
- identical beard density and irregular edges
- identical hairline, hairstyle and volume

MANDATORY:
- preserve all imperfections
- preserve asymmetry exactly

FORBIDDEN:
- beautification
- smoothing
- symmetry correction
- enhancement
- facial reshaping

Face must look like real photography.

==================================================
PHASE 2 — ANGLE + CAMERA ALIGNMENT (CRITICAL)
==================================================
Before scene integration, align identity to match C:
- match head tilt (left/right)
- match vertical angle (up/down)
- match face rotation
- match gaze direction

CAMERA MATCH:
- match camera distance from C
- match perspective compression
- maintain natural focal length (35mm–85mm)

MANDATORY:
- keep facial proportions intact
- avoid distortion

FORBIDDEN:
- stretching face
- resizing head unnaturally
- perspective mismatch

==================================================
PHASE 3 — POSE CLONING (MAXIMUM ACCURACY)
==================================================
Replicate pose from C:
- same body position
- same joint angles
- same posture
- same balance and weight distribution

CRITICAL RULE:
If pose conflicts with identity:
→ PRESERVE IDENTITY AND ADAPT BODY SLIGHTLY

==================================================
PHASE 4 — GESTURE + PHYSICAL INTENT
==================================================
- match body tension
- match gesture intensity
- match physical intention

WITHOUT modifying face geometry.

==================================================
PHASE 5 — OBJECTS + ACCESSORIES (FULL CLONE)
==================================================
- identical weapon or objects
- identical placement, orientation and scale
- identical clothing
- identical gear
- identical accessory placement

==================================================
PHASE 6 — SCENE + BACKGROUND (FULL CLONE)
==================================================
- identical environment
- identical spatial layout
- identical depth
- identical framing and composition

==================================================
PHASE 7 — LIGHTING STRUCTURE (CONTROLLED CLONE)
==================================================
- match light direction from C
- match shadow placement
- match global contrast

FACE PROTECTION:
- preserve original skin tone from A+B
- preserve undertone
- avoid color cast on face
- avoid orange/blue cinematic contamination

Face remains neutral, natural, human.

==================================================
PHASE 8 — OPTICAL CONSISTENCY (ANTI-DEFORMATION)
==================================================
- maintain correct head-to-body proportion
- maintain natural scale
- maintain spatial coherence

FORBIDDEN:
- wide-angle distortion
- head scaling errors
- perspective mismatch

==================================================
PHASE 9 — HANDS + CONTACT PHYSICS
==================================================
- natural grip
- correct finger wrapping
- visible pressure
- realistic interaction

FORBIDDEN:
- floating objects
- broken anatomy

==================================================
PHASE 10 — DERMAL REALISM (HUMAN QUALITY)
==================================================
- visible pores
- uneven texture
- natural tonal variation
- realistic beard transition

FORBIDDEN:
- plastic skin
- over-smoothing
- artificial sharpness

==================================================
PHASE 11 — ENVIRONMENT INTEGRATION
==================================================
- subject must feel physically present
- correct depth and scale

ALLOWED:
- subtle dust/sweat on body

RESTRICTED:
- minimal effect on face
- no change in facial tone

==================================================
PHASE 12 — GLOBAL NO-DEFORMATION RULE
==================================================
- do not stretch face
- do not change proportions
- do not distort head

If conflict appears:
→ ALWAYS PRESERVE IDENTITY FIRST

==================================================
PHASE 13 — VIDEO STABILITY (KLING 3.0 READY)
==================================================
- stable identity across frames
- stable skin tone
- stable dermal texture
- stable pose

FORBIDDEN:
- flicker
- morphing
- identity drift
- lighting instability on face

==================================================
FINAL RESULT
==================================================
Same real human from A+B fully integrated into scene C.

- Identity preserved 100% (face, hair, beard locked)
- Angle aligned with scene
- Pose cloned with maximum accuracy
- Objects and environment matched
- Lighting structure matched without damaging skin

Result must be:
- human
- natural
- physically believable
- visually coherent
- stable for Kling 3.0
Ultra-realistic cinematic image or short video sequence (3–6 seconds).

Reference A and B represent the SAME real human identity.
Reference C is the target scene and must be matched with maximum physical and visual fidelity.

This system performs:
1) identity preservation
2) angle alignment
3) pose cloning
4) scene reconstruction
5) video stability

==================================================
PHASE 1 — ABSOLUTE IDENTITY LOCK (NON-NEGOTIABLE)
==================================================
The face, hair, beard and identity from A+B MUST remain EXACT.

Includes:
- identical facial geometry
- identical proportions and spatial relationships
- identical asymmetry (eyes, nose, mouth, jawline)
- identical skin texture, pores and imperfections
- identical beard density and irregular edges
- identical hairline, hairstyle and volume

MANDATORY:
- preserve all imperfections
- preserve asymmetry exactly

FORBIDDEN:
- beautification
- smoothing
- symmetry correction
- enhancement
- facial reshaping

Face must look like real photography.

==================================================
PHASE 2 — ANGLE + CAMERA ALIGNMENT (CRITICAL)
==================================================
Before scene integration, align identity to match C:
- match head tilt (left/right)
- match vertical angle (up/down)
- match face rotation
- match gaze direction

CAMERA MATCH:
- match camera distance from C
- match perspective compression
- maintain natural focal length (35mm–85mm)

MANDATORY:
- keep facial proportions intact
- avoid distortion

FORBIDDEN:
- stretching face
- resizing head unnaturally
- perspective mismatch

==================================================
PHASE 3 — POSE CLONING (MAXIMUM ACCURACY)
==================================================
Replicate pose from C:
- same body position
- same joint angles
- same posture
- same balance and weight distribution

CRITICAL RULE:
If pose conflicts with identity:
→ PRESERVE IDENTITY AND ADAPT BODY SLIGHTLY

==================================================
PHASE 4 — GESTURE + PHYSICAL INTENT
==================================================
- match body tension
- match gesture intensity
- match physical intention

WITHOUT modifying face geometry.

==================================================
PHASE 5 — OBJECTS + ACCESSORIES (FULL CLONE)
==================================================
- identical weapon or objects
- identical placement, orientation and scale
- identical clothing
- identical gear
- identical accessory placement

==================================================
PHASE 6 — SCENE + BACKGROUND (FULL CLONE)
==================================================
- identical environment
- identical spatial layout
- identical depth
- identical framing and composition

==================================================
PHASE 7 — LIGHTING STRUCTURE (CONTROLLED CLONE)
==================================================
- match light direction from C
- match shadow placement
- match global contrast

FACE PROTECTION:
- preserve original skin tone from A+B
- preserve undertone
- avoid color cast on face
- avoid orange/blue cinematic contamination

Face remains neutral, natural, human.

==================================================
PHASE 8 — OPTICAL CONSISTENCY (ANTI-DEFORMATION)
==================================================
- maintain correct head-to-body proportion
- maintain natural scale
- maintain spatial coherence

FORBIDDEN:
- wide-angle distortion
- head scaling errors
- perspective mismatch

==================================================
PHASE 9 — HANDS + CONTACT PHYSICS
==================================================
- natural grip
- correct finger wrapping
- visible pressure
- realistic interaction

FORBIDDEN:
- floating objects
- broken anatomy

==================================================
PHASE 10 — DERMAL REALISM (HUMAN QUALITY)
==================================================
- visible pores
- uneven texture
- natural tonal variation
- realistic beard transition

FORBIDDEN:
- plastic skin
- over-smoothing
- artificial sharpness

==================================================
PHASE 11 — ENVIRONMENT INTEGRATION
==================================================
- subject must feel physically present
- correct depth and scale

ALLOWED:
- subtle dust/sweat on body

RESTRICTED:
- minimal effect on face
- no change in facial tone

==================================================
PHASE 12 — GLOBAL NO-DEFORMATION RULE
==================================================
- do not stretch face
- do not change proportions
- do not distort head

If conflict appears:
→ ALWAYS PRESERVE IDENTITY FIRST

==================================================
PHASE 13 — VIDEO STABILITY (KLING 3.0 READY)
==================================================
- stable identity across frames
- stable skin tone
- stable dermal texture
- stable pose

FORBIDDEN:
- flicker
- morphing
- identity drift
- lighting instability on face

==================================================
FINAL RESULT
==================================================
Same real human from A+B fully integrated into scene C.

Identity preserved 100% (face, hair, beard locked)
Angle aligned with scene
Pose cloned with maximum accuracy
Objects and environment matched
Lighting structure matched without damaging skin

Result must be:
- human
- natural
- physically believable
- visually coherent
- stable for Kling 3.0
ULTRA-REALISTIC CINEMATIC ACTION PHOTOGRAPH (vertical poster 9:16). Looks like a real professional full-frame photo: crisp detail, subtle film grain, natural skin texture, realistic pores and wrinkles, no beauty smoothing. High-end historical epic realism, believable light, no stylization.

REFERENCE (CRITICAL): If a reference image is provided, use it STRICTLY for FACE + BODY identity. Match facial features, age (65–75), beard/white hair shape, body proportions, skin texture, body-hair density, and overall masculinity. DO NOT change identity.

SUBJECT (GLADIATOR — MUST BE PRIMARY + FOREGROUND): Mature man (65–75), extremely strong “old-world powerhouse” build: massive shoulders, thick neck, heavy forearms, powerful hands. A BIG belly that is large but firm (strongman barrel torso), not soft. Bare torso (tasteful, non-explicit) with abundant WHITE/GRAY body hair: dense chest hair, thick belly hair trail, hairy arms and shoulders, weathered skin. He is the CLEAR MAIN SUBJECT and MUST be closest to the camera: gladiator in the foreground (dominant), animal slightly farther back. The camera favors his face, chest, and upper body detail.
Wardrobe: Roman gladiator elements — bronze helmet (open face), leather skirt armor, belt with worn metal details, wrist guards, shin guards, strapped sandals. He holds a round wooden shield with metal boss + a short gladius sword.
Expression: fierce, commanding — controlled roar or intense focus.

SCENE / SETTING: Inside the Roman Colosseum amphitheater. Crowd in the stands (background only, soft silhouettes/bokeh; no distinct faces). Dusty sand arena floor with kicked-up dust and grit. AN ADULT RHINOCEROS is present, positioned in the mid-ground, farther from the lens than the gladiator (secondary subject). A large adult rhino (believable size), anatomically accurate, thick armored skin plates with realistic folds, dust on its hide, heavy muscular shoulders, visible horn texture (keratin striations), small intense eyes. It is in an aggressive threat posture: head lowered, horn angled forward, foreleg braced, ground-scuff marks, dust plume around its feet, body angled as if about to charge (but no impact). No fantasy exaggeration — realistic anatomy, realistic scale. (NO hippo, NO elephant, NO Komodo dragon, NO lion, NO bear.)

ACTION (CRITICAL — INTENSE EYE-LINE, FACE PROFILE MUST READ): A tense standoff moment, cinematic “before impact.” The gladiator is NOT looking at the camera — he is staring with focused intensity directly at the rhinoceros. Clear, unmistakable eye-line toward the animal.
Face visibility requirement (MANDATORY): Show at least a strong 3/4 profile OR clean side profile of the gladiator’s face — cheekbone, nose line, jaw, lips, eye, brow fully visible, sharply defined, perfectly lit. No shadow blocking the eye or jawline, no helmet shadow covering the face. Expression must read clearly: concentrated, controlled, predatory calm.
Body / action: Gladiator leaning forward, shield raised defensively, sword low and ready. Muscles, chest hair, and skin texture clearly detailed. Rhinoceros holding position or taking a heavy forward step, dust lifting from the sand beneath its feet. NO blood, NO wounds, NO contact, NO aftermath. Suspense and power, not violence.

CAMERA / ANGLE (UPDATED — MORE CINEMATIC): Low-to-mid angle “arena intimidation” shot, camera slightly behind and to the side of the gladiator’s front shoulder (over-shoulder but not blocking the face).
Lens feel: 35mm full-frame, close enough for dramatic scale.
Camera height: around the gladiator’s lower chest/upper abdomen (not ground level), tilted slightly upward to make him dominant and the rhino huge behind him.
Framing: the gladiator’s face and torso occupy the right 2/3 of the frame in sharp focus; the rhino sits left mid-ground, fully readable, slightly softer but detailed. Keep the gladiator’s 3/4 face profile clearly visible (do not hide it behind shield/helmet). Moderate DOF for separation; background arches and crowd remain soft.

COMPOSITION (CRITICAL — GLADIATOR FIRST): Gladiator dominates the foreground (waist-up or 3/4 body), face and torso fully visible, not cut off. Rhinoceros is clearly readable but never closer than the man. Colosseum arches recognizable in the background. Clean horizon, strong depth separation.

LIGHTING (CRITICAL — PERFECT ON GLADIATOR): Warm BRIGHT golden sunlight as key light on the gladiator. Subtle rim light outlining shoulders, helmet, and arms. Every muscle, chest hair texture, skin pores, and facial detail clearly visible. Rhino realistically lit but slightly less emphasized than the man.