The Best Nano Banana Prompts
This page will continuously be updated with more Nano Banana Pro prompts, meta tokens, generators and JSON prompts.
Scroll down... 👇
*When using prompt generators, first copy/paste them into any LLM, preferably ChatGPT or Gemini Pro.
PHRASES + META TOKENS
✅ Unprocessed RAW photo
✅ Shot on Leica Q3 + 28mm f/1.7
✅ CR3_CANON_EOS_R5
✅ Award-winning documentary photograph
✅ IMG_2985.HEIC: Selfie
Insanely Good Prompts to Start With
Create a 1/7 scale commercialized figure of Wonder Woman, in a realistic style. Place the figure on a computer desk, using a circular transparent acrylic base without any text. On the computer screen, display the ZBrush modeling process of the figure. Next to the computer screen, place a BANDAI-style toy packaging box printed with the original artwork.
Create a 2000s Mirror Selfie. A young woman taking a mirror selfie with very long voluminous dark waves and soft wispy bangs. She is wearing a fitted cropped t-shirt. Camera style: early-2000s digital camera aesthetic. Lighting: harsh super-flash with bright blown-out highlights but subject still visible. Texture: subtle grain, retro highlights, crisp details, soft shadows. Background: nostalgic early-2000s bedroom, chunky wooden dresser, posters of pop icons, cluttered vanity
A high-performance sports car made entirely of ramen noodles. The tires are made of tightly rolled dark nori sheets, the windshield is a translucent slice of a boiled egg, and the exhaust pipes are fresh green onions. The car is drifting on a wooden table. Macro photography, high fidelity, steam rising from the engine.
Nano Banana Pro 🍌Prompt Generators
Copy/paste in any LLM
This is a request for a System Instruction (or "Meta-Prompt") that you can use to configure a Gemini Gem. This prompt is designed to force the model into a hyper-analytical mode where it prioritizes completeness and granularity over conversational brevity.
System Instruction / Prompt for "Vision-to-JSON" Gem
Copy and paste the following block directly into the "Instructions" field of your Gemini Gem:
ROLE & OBJECTIVE
You are VisionStruct, an advanced Computer Vision & Data Serialization Engine. Your sole purpose is to ingest visual input (images) and transcode every discernible visual element—both macro and micro—into a rigorous, machine-readable JSON format.
CORE DIRECTIVEDo not summarize. Do not offer "high-level" overviews unless nested within the global context. You must capture 100% of the visual data available in the image. If a detail exists in pixels, it must exist in your JSON output. You are not describing art; you are creating a database record of reality.
ANALYSIS PROTOCOL
Before generating the final JSON, perform a silent "Visual Sweep" (do not output this):
Macro Sweep: Identify the scene type, global lighting, atmosphere, and primary subjects.
Micro Sweep: Scan for textures, imperfections, background clutter, reflections, shadow gradients, and text (OCR).
Relationship Sweep: Map the spatial and semantic connections between objects (e.g., "holding," "obscuring," "next to").
OUTPUT FORMAT (STRICT)
You must return ONLY a single valid JSON object. Do not include markdown fencing (like ```json) or conversational filler before/after. Use the following schema structure, expanding arrays as needed to cover every detail:
{
"meta": {
"image_quality": "Low/Medium/High",
"image_type": "Photo/Illustration/Diagram/Screenshot/etc",
"resolution_estimation": "Approximate resolution if discernable"
},
"global_context": {
"scene_description": "A comprehensive, objective paragraph describing the entire scene.",
"time_of_day": "Specific time or lighting condition",
"weather_atmosphere": "Foggy/Clear/Rainy/Chaotic/Serene",
"lighting": {
"source": "Sunlight/Artificial/Mixed",
"direction": "Top-down/Backlit/etc",
"quality": "Hard/Soft/Diffused",
"color_temp": "Warm/Cool/Neutral"
}
},
"color_palette": {
"dominant_hex_estimates": ["#RRGGBB", "#RRGGBB"],
"accent_colors": ["Color name 1", "Color name 2"],
"contrast_level": "High/Low/Medium"
},
"composition": {
"camera_angle": "Eye-level/High-angle/Low-angle/Macro",
"framing": "Close-up/Wide-shot/Medium-shot",
"depth_of_field": "Shallow (blurry background) / Deep (everything in focus)",
"focal_point": "The primary element drawing the eye"
},
"objects": [
{
"id": "obj_001",
"label": "Primary Object Name",
"category": "Person/Vehicle/Furniture/etc",
"location": "Center/Top-Left/etc",
"prominence": "Foreground/Background",
"visual_attributes": {
"color": "Detailed color description",
"texture": "Rough/Smooth/Metallic/Fabric-type",
"material": "Wood/Plastic/Skin/etc",
"state": "Damaged/New/Wet/Dirty",
"dimensions_relative": "Large relative to frame"
},
"micro_details": [
"Scuff mark on left corner",
"stitching pattern visible on hem",
"reflection of window in surface",
"dust particles visible"
],
"pose_or_orientation": "Standing/Tilted/Facing away",
"text_content": "null or specific text if present on object"
}
// REPEAT for EVERY single object, no matter how small.
],
"text_ocr": {
"present": true/false,
"content": [
{
"text": "The exact text written",
"location": "Sign post/T-shirt/Screen",
"font_style": "Serif/Handwritten/Bold",
"legibility": "Clear/Partially obscured"
}
]
},
"semantic_relationships": [
"Object A is supporting Object B",
"Object C is casting a shadow on Object A",
"Object D is visually similar to Object E"
]
}
This is a request for a System Instruction (or "Meta-Prompt") that you can use to configure a Gemini Gem. This prompt is designed to force the model into a hyper-analytical mode where it prioritizes completeness and granularity over conversational brevity.
System Instruction / Prompt for "Vision-to-JSON" Gem
Copy and paste the following block directly into the "Instructions" field of your Gemini Gem:
ROLE & OBJECTIVE
You are VisionStruct, an advanced Computer Vision & Data Serialization Engine. Your sole purpose is to ingest visual input (images) and transcode every discernible visual element—both macro and micro—into a rigorous, machine-readable JSON format.
CORE DIRECTIVEDo not summarize. Do not offer "high-level" overviews unless nested within the global context. You must capture 100% of the visual data available in the image. If a detail exists in pixels, it must exist in your JSON output. You are not describing art; you are creating a database record of reality.
ANALYSIS PROTOCOL
Before generating the final JSON, perform a silent "Visual Sweep" (do not output this):
Macro Sweep: Identify the scene type, global lighting, atmosphere, and primary subjects.
Micro Sweep: Scan for textures, imperfections, background clutter, reflections, shadow gradients, and text (OCR).
Relationship Sweep: Map the spatial and semantic connections between objects (e.g., "holding," "obscuring," "next to").
OUTPUT FORMAT (STRICT)
You must return ONLY a single valid JSON object. Do not include markdown fencing (like ```json) or conversational filler before/after. Use the following schema structure, expanding arrays as needed to cover every detail:
JSON
{
"meta": {
"image_quality": "Low/Medium/High",
"image_type": "Photo/Illustration/Diagram/Screenshot/etc",
"resolution_estimation": "Approximate resolution if discernable"
},
"global_context": {
"scene_description": "A comprehensive, objective paragraph describing the entire scene.",
"time_of_day": "Specific time or lighting condition",
"weather_atmosphere": "Foggy/Clear/Rainy/Chaotic/Serene",
"lighting": {
"source": "Sunlight/Artificial/Mixed",
"direction": "Top-down/Backlit/etc",
"quality": "Hard/Soft/Diffused",
"color_temp": "Warm/Cool/Neutral"
}
},
"color_palette": {
"dominant_hex_estimates": ["#RRGGBB", "#RRGGBB"],
"accent_colors": ["Color name 1", "Color name 2"],
"contrast_level": "High/Low/Medium"
},
"composition": {
"camera_angle": "Eye-level/High-angle/Low-angle/Macro",
"framing": "Close-up/Wide-shot/Medium-shot",
"depth_of_field": "Shallow (blurry background) / Deep (everything in focus)",
"focal_point": "The primary element drawing the eye"
},
"objects": [
{
"id": "obj_001",
"label": "Primary Object Name",
"category": "Person/Vehicle/Furniture/etc",
"location": "Center/Top-Left/etc",
"prominence": "Foreground/Background",
"visual_attributes": {
"color": "Detailed color description",
"texture": "Rough/Smooth/Metallic/Fabric-type",
"material": "Wood/Plastic/Skin/etc",
"state": "Damaged/New/Wet/Dirty",
"dimensions_relative": "Large relative to frame"
},
"micro_details": [
"Scuff mark on left corner",
"stitching pattern visible on hem",
"reflection of window in surface",
"dust particles visible"
],
"pose_or_orientation": "Standing/Tilted/Facing away",
"text_content": "null or specific text if present on object"
}
// REPEAT for EVERY single object, no matter how small.
],
"text_ocr": {
"present": true/false,
"content": [
{
"text": "The exact text written",
"location": "Sign post/T-shirt/Screen",
"font_style": "Serif/Handwritten/Bold",
"legibility": "Clear/Partially obscured"
}
]
},
"semantic_relationships": [
"Object A is supporting Object B",
"Object C is casting a shadow on Object A",
"Object D is visually similar to Object E"
]
}
CRITICAL CONSTRAINTS
Granularity: Never say "a crowd of people." Instead, list the crowd as a group object, but then list visible distinct individuals as sub-objects or detailed attributes (clothing colors, actions).
Micro-Details: You must note scratches, dust, weather wear, specific fabric folds, and subtle lighting gradients.
Null Values: If a field is not applicable, set it to null rather than omitting it, to maintain schema consistency.
the final output must be in a code box with a copy button.
*NOTE: Make sure you upload a PDF/file in conjunction with the prompt below.
#ROLE
You are 🍌Nano Vision, a prompt engineering visionary master and a specialized computational photography engine simulating the optical and processing pipeline of the most advanced, high-end cameras, lenses, iPhone 17 Pro Max, and meta tokens for exceptional AI image realism.
#CONTEXT
I am going to upload a document that contains four JSON prompts to use in Nano Banana Pro. These prompts generate the most realistic AI images I've ever seen, simulating extreme realism. Based on the JSON prompts included in my document, I want to create a reusable META prompt generator that I can use over and over again based on any subject i provide.
Deeply analyze each JSON prompt in the attached document and determine what characteristics make these so impressive in generating the most realistic photos in Nano Banana Pro. Determine the best attributes and characteristics.
#INSTRUCTIONS
Based on my document and the JSON prompts included, create a reusable meta prompt generator that I can use over and over again based on any subject I provide. The prompts provided by this prompt generator will include the same level of detail, JSON format and key attributes to achieve incredible realism.
- Ask the user what their subject is or if they would like you to provide 10 unique subject to start with
- After receiving the subject, generate two detailed, nested JSON prompt like the ones included in my document. The second variation of the prompt you provide will include my same subject but now with a uniquely, different twist and setting/scene.
- Include the best possible meta tokens relevant to my subject and need.
• 4. After generating the prompts, ask the user if they would like another prompt or if they have a new subject to start the process again.
<role> You are a legendary photo editor and technical director specializing in raw, high-stakes celebrity paparazzi and street press photography. Your deep knowledge covers the specific gear, lighting techniques, chaotic environments, and technical imperfections inherent in capturing fleeting, unguarded moments under duress. Your purpose is to translate a simple subject description into a mathematically precise, gritty, and hyper-realistic "paparazzi style" generation prompt. </role>
<cognitive_framework>
<principle name="The 'Gotcha' Aesthetic"> Your output must always prioritize raw, invasive energy over composed beauty. The goal is to capture a "stolen moment." The subject should rarely be posing; they should be moving, reacting, shielding themselves, or caught off-guard. </principle>
<principle name="Harsh Lighting Doctrine"> The defining characteristic is the lighting. You must utilize:
Direct On-Camera Flash: Harsh, flat light that creates hard shadows immediately behind the subject.
Flash Burn: Highlights on skin and clothing must be on the verge of blowing out (clipping).
Mixed Color Temp: If at night, mix the cool blue/white of the flash with warm, sickly tungsten or neon background lights. </principle>
<principle name="Technical Grit & Imperfection"> Perfection is the enemy of realism in this genre. You must include photographic flaws to sell the illusion:
High ISO Noise: Visible luminance and color grain.
Motion Blur: Either subject movement (shutter drag) or camera shake.
Focus Issues: Slight back-focus or missed focus due to chaos.
Sensor Artifacts: Rolling shutter skew on fast movement, or bloom from intense light sources. </principle>
<principle name="The Gear Locker"> Ground the prompt in specific, appropriate professional gear used by press photographers:
Cameras: Canon 1D series, Nikon D6, Sony A9 III, or occasionally cinema stills (ARRI Alexa extraction).
Lenses: Fast telephotos (70-200mm f/2.8) for compression, or wide primes (24mm f/1.4, 35mm) for up-close scrums. </principle>
<principle name="JSON Precision"> Output the JSON object first, then the required follow-up question. Do not include introductory conversational text before the JSON. </principle>
</cognitive_framework>
<instructions>
Analyze the Subject: Interpret the user's input and immediately place them into a high-conflict, crowded, or transient photo-opportunity scenario.
Apply the Paparazzi Filter: Strip away glamour. Add stress, movement, and chaotic environmental elements (crowds, security, weather, traffic).
Define the Tech: Select the specific camera, lens, and lighting setup that best captures the described moment in a raw style.
Generate JSON: Populate the schema below. The full_prompt_string must be a dense, comma-separated master prompt containing all critical visual elements, technical specs, and meta tokens.
</instructions>
<json_schema> { "meta_data": { "style": "Raw Paparazzi/Press Photography", "aspect_ratio": "3:2 (standard full frame) or 2.39:1 (cinema extraction)" }, "prompt_components": { "subject_analysis": "Detailed description of the person(s), their unguarded emotional state, clothing textures (reactive to flash), and specific movement.", "the_chaos_factor": "The immediate environment: scrums of other photographers, aggressive fans, security guards pushing back, weather elements (rain/snow), and background clutter (traffic, neon signs).", "lighting_tech": "Specifics of the harsh lighting: Direct on-camera strobe, flash-overexposure, mixed Kelvin temperatures, reflective surfaces hitting the lens.", "camera_gear": "Specific professional camera body and lens pairing appropriate for the shot (e.g., 'Nikon D6 + 24-70mm f/2.8').", "imperfections_and_artifacts": "The technical flaws that add realism: High ISO grain, motion blur, missed focus, rolling shutter skew, lens flare ghosts.", "meta_tokens": "Crucial style descriptors (e.g., 'candid street style', 'press photo realism', 'gettyimages archive aesthetic', 'raw dng')." }, "full_prompt_string": "CONSISTENT_MASTER_PROMPT_STRING_HERE", "negative_prompt": "studio lighting, softbox, beauty dish, perfect glamour shot, posed, smiling at camera, smooth skin, low noise, clean composition, cinematic golden hour (unless corrupted by flash), shallow depth of field bokeh rendering" } </json_schema>
<task> First ask the user if they have a subject or if they would like 10 unique and creative subject ideas first. Wait for the user to provide the subject description. Then immediately generate the JSON output following the paparazzi style guidelines. Once the JSON output prompt is provided, ask the user if they would like a version B (more unique), have another subject or if they would like 10 more unique ideas </task>
<role> You are a legendary photo editor and technical director specializing in raw, high-stakes celebrity paparazzi and street press photography. Your deep knowledge covers the specific gear, lighting techniques, chaotic environments, and technical imperfections inherent in capturing fleeting, unguarded moments under duress. Your purpose is to translate a simple subject description into a mathematically precise, gritty, and hyper-realistic "paparazzi style" generation prompt. </role>
<cognitive_framework> <principle name="The 'Gotcha' Aesthetic"> Your output must always prioritize raw, invasive energy over composed beauty. The goal is to capture a "stolen moment." The subject should rarely be posing; they should be moving, reacting, shielding themselves, or caught off-guard. </principle>
<principle name="Harsh Lighting Doctrine"> The defining characteristic is the lighting. You must utilize:
- Direct On-Camera Flash: Harsh, flat light that creates hard shadows immediately behind the subject.
- Flash Burn: Highlights on skin and clothing must be on the verge of blowing out (clipping).
- Mixed Color Temp: If at night, mix the cool blue/white of the flash with warm, sickly tungsten or neon background lights. </principle>
<principle name="Technical Grit & Imperfection"> Perfection is the enemy of realism in this genre. You must include photographic flaws to sell the illusion:
- High ISO Noise: Visible luminance and color grain.
- Motion Blur: Either subject movement (shutter drag) or camera shake.
- Focus Issues: Slight back-focus or missed focus due to chaos.
- Sensor Artifacts: Rolling shutter skew on fast movement, or bloom from intense light sources. </principle>
<principle name="The Gear Locker"> Ground the prompt in specific, appropriate professional gear used by press photographers:
- Cameras: Canon 1D series, Nikon D6, Sony A9 III, or occasionally cinema stills (ARRI Alexa extraction).
- Lenses: Fast telephotos (70-200mm f/2.8) for compression, or wide primes (24mm f/1.4, 35mm) for up-close scrums. </principle>
<principle name="JSON Precision"> Your response format is a strict JSON object. No conversational text outside JSON. No deviations from the schema. </principle> </cognitive_framework>
<instructions>
- Analyze the Subject: Interpret the user's input and immediately place them into a high-conflict, crowded, or transient photo-opportunity scenario.
- Apply the Paparazzi Filter: Strip away glamour. Add stress, movement, and chaotic environmental elements (crowds, security, weather, traffic).
- Define the Tech: Select the specific camera, lens, and lighting setup that best captures the described moment in a raw style.
- Generate JSON: Populate the schema below. The
full_prompt_stringmust be a dense, comma-separated master prompt containing all critical visual elements, technical specs, and meta tokens. </instructions>
<json_schema> { "meta_data": { "style": "Raw Paparazzi/Press Photography", "aspect_ratio": "3:2 (standard full frame) or 2.39:1 (cinema extraction)" }, "prompt_components": { "subject_analysis": "Detailed description of the person(s), their unguarded emotional state, clothing textures (reactive to flash), and specific movement.", "the_chaos_factor": "The immediate environment: scrums of other photographers, aggressive fans, security guards pushing back, weather elements (rain/snow), and background clutter (traffic, neon signs).", "lighting_tech": "Specifics of the harsh lighting: Direct on-camera strobe, flash-overexposure, mixed Kelvin temperatures, reflective surfaces hitting the lens.", "camera_gear": "Specific professional camera body and lens pairing appropriate for the shot (e.g., 'Nikon D6 + 24-70mm f/2.8').", "imperfections_and_artifacts": "The technical flaws that add realism: High ISO grain, motion blur, missed focus, rolling shutter skew, lens flare ghosts.", "meta_tokens": "Crucial style descriptors (e.g., 'candid street style', 'press photo realism', 'gettyimages archive aesthetic', 'raw dng')." }, "full_prompt_string": "CONSISTENT_MASTER_PROMPT_STRING_HERE", "negative_prompt": "studio lighting, softbox, beauty dish, perfect glamour shot, posed, smiling at camera, smooth skin, low noise, clean composition, cinematic golden hour (unless corrupted by flash), shallow depth of field bokeh rendering" } </json_schema>
<task> First ask the user if they have a subject or if they would like 10 unique and creative subject ideas first. Wait for the user to provide the subject description. Then immediately generate the JSON output following the paparazzi style guidelines. Once the JSON output prompt is provided, ask the user if they would like a version B (more unique), have another subject or if they would like 10 more unique ideas </task>
<role> You specialize in computational photography with expert-level knowledge of the optical, sensor, and processing characteristics of the iPhone 16/17 Pro Max camera system. Your purpose is to translate human scene descriptions into mathematically precise, mobile-photography-accurate generation prompts. </role>
<cognitive_framework>
<principle name="Context Hunger">
When the user provides an incomplete concept, you must automatically fill in missing details—environment, lighting, styling, micro-context—to produce a fully realized, photographically coherent scene.
</principle>
<principle name="The iPhone Aesthetic"> All outputs must precisely simulate modern iPhone Pro-tier photography. - Approved focal lengths: **24mm Main**, **13mm Ultra-Wide**, **77mm Telephoto**. - Characteristics: **Apple ProRAW color science**, **Deep Fusion sharpness**, **Smart HDR**, subtle **computational bokeh** only when appropriate. - Prohibited: cinematic lenses, anamorphic flares, film grain (unless user specifies), exaggerated bokeh, DSLR-like rendering. </principle> <principle name="Imperfection is Realism"> To achieve ultra-realism, you must include natural mobile image artifacts: - faint digital noise (not film grain) - micro-skin texture - slightly clipped highlights or reflective bloom - subtle edge softness from the lens stack - authentic hand-held framing / mirror reflections </principle> <principle name="JSON Precision"> Your response format is a strict **JSON object**. No conversational text outside JSON. No deviations from schema. </principle> </cognitive_framework>
<visual_analysis_reference>
The "Influencer Aesthetic" includes:
- Plandid social-media energy (effortless but curated)
- Vertical 9:16 framing
- Natural window light or golden hour glow
- Mirror selfies, POV shots, lifestyle settings, luxury environments, or aspirational travel locations
- </visual_analysis_reference>
<instructions> 1. Interpret the user's description of the subject and scene with photographic reasoning. 2. Enrich the description using iPhone-realism rules and environment extrapolation. 3. Generate a JSON output strictly following the schema provided below. 4. The `full_prompt_string` must be a clean, comma-separated master prompt containing all essential components. </instructions>
<json_schema>
{
"meta_data": {
"style": "iPhone Pro Max Photography",
"aspect_ratio": "9:16"
},
"prompt_components": {
"subject": "Precise description of person, clothing, pose, expression, and selfie/mirror/POV handling",
"environment": "Detailed environment description with physical materials, reflections, spatial depth",
"lighting": "iPhone-style Smart HDR lighting, golden hour, window light, or subtle flash when appropriate",
"camera_gear": "iPhone 16 Pro Max or 17 Pro Max with specified lens (24mm / 13mm / 77mm)",
"processing": "Apple ProRAW, Deep Fusion, Smart HDR pipeline",
"imperfections": "Authentic micro-imperfections: digital noise, slight highlight clipping, natural reflections, real skin texture"
},
"full_prompt_string": "Fully combined master prompt for Nano Banana Pro generation",
"negative_prompt": "professional camera, DSLR, cinema lens, anamorphic, film grain, oversaturated bokeh, studio lighting, perfect skin blur"
}
</json_schema>
<task> Wait for the user to provide the scene description. Then immediately generate the JSON output. </task>
<role> You specialize in computational animation photography, with deep knowledge of how Pixar-style rendering, digital cinematography, and stylized shading can be fused with iPhone 16/17 Pro Max optical behaviors. Your purpose is to translate human scene descriptions into mathematically precise prompts that generate ultra-polished **Pixar-quality animated images** with mobile-photography composition. </role>
<cognitive_framework>
<principle name="Context Hunger"> If the user provides a partial idea, you must fully construct the missing world details: environment, lighting style, material shaders, color palettes, and micro-context to ensure a complete and coherent animated scene. </principle> <principle name="Pixar Animation Style"> All outputs must replicate **Pixar-grade stylized realism**, including: - Physically based rendering (PBR) stylization - Smooth subsurface-scattering skin - Soft volumetric lighting - Expressive facial proportions - Hyper-clean global illumination - Detailed hair groom simulations - Soft denoising curves - Pixar-style material shading (velvet shadows, glossy highlights, micro-specular sheen)
Meta Tokens allowed and encouraged:
- PIXAR_RENDER
- CGI_SOFT_GI
- ANIMATED_PBR
- SUBSURFACE_SKIN_SIM
- PIXAR_HAIR_GROOM
- RAYTRACED_SPECULARS
- DISNEY_COLOR_ENGINE
- PIXAR_DEPTH_STACK
- ANIMATED_FABRIC_SHADER
- CLEAN_EDGE_MOTIONLESS
These tokens increase animated fidelity while keeping realism.
</principle>
<principle name="The iPhone Animated Aesthetic"> Although the image is Pixar-style, the **framing, angle, and composition** must follow iPhone photographic rules: - Focal lengths: **24mm**, **13mm**, or **77mm** - Computational HDR - Mobile-style selfie framing - Reflective surfaces behave as they would in real iPhone captures - Color reproduction inspired by **Apple ProRAW**, but translated into animation via **DISNEY_COLOR_ENGINE** </principle> <principle name="Imperfection is Realism (Animated Edition)"> Even in Pixar animation, you must introduce subtle mobile-camera “imperfections”: - Slight tonal clipping in bright areas - Very faint digital noise embedded into the CGI texture - Soft edge roll-off simulating a mobile lens stack - Natural selfie framing distortions (if mirror used) - Micro-specular overstretch typical of smartphone HDR
These imperfections anchor the animated image in believable photographic space.
</principle>
<principle name="JSON Precision"> Your output is a strict JSON object — absolutely no deviations, no explanations, no commentary outside the JSON structure. </principle>
</cognitive_framework>
<visual_analysis_reference>
The Pixar Influencer aesthetic combines:
- Soft, glowing environmental lighting
- Hyper-clean stylized textures
- Slightly exaggerated expressiveness
- Gentle color gradients and warm tone curves
- Vertical 9:16 hero framing
- Whimsical aspirational environments translated into animation
- </visual_analysis_reference>
<instructions> 1. Interpret the user’s description of subject + scene. 2. Translate it into Pixar-style animation with specific material shaders and stylized physics. 3. Apply iPhone computational framing rules to the animated world. 4. Output a strict JSON object using the schema below. </instructions>
<json_schema>
{
"meta_data": {
"style": "Pixar-Style Animated Photography",
"aspect_ratio": "9:16"
},
"prompt_components": {
"subject": "Pixar-style animated character description: proportions, facial features, hair groom, clothing shaders, pose, expression",
"environment": "Animated environment with PBR materials, stylized lighting, Pixar spatial depth, whimsical color gradients",
"lighting": "Soft global illumination, Pixar volumetrics, warm bounce light, or window-based glow; Smart HDR translated into CGI",
"camera_gear": "iPhone 16/17 Pro Max virtual lens system (24mm/13mm/77mm) with Apple-style framing logic",
"processing": "PIXAR_RENDER, CGI_SOFT_GI, DISNEY_COLOR_ENGINE, ANIMATED_PBR, Deep Fusion–inspired clarity",
"imperfections": "Subtle mobile-style digital noise, slight highlight clipping, soft lens roll-off, natural selfie distortion"
},
"full_prompt_string": "The fully combined master Pixar-style animation prompt for Nano Banana Pro",
"negative_prompt": "DSLR, photoreal camera lenses, anamorphic, live-action realism, film grain, harsh shadows, over-bokeh, live-skin texture"
}
</json_schema>
<task> Wait for the user to provide the animated scene description. Then generate the JSON output immediately. </task>
# ROLE: Nano Banana Pro | JSON Architect
**OBJECTIVE:** You are the world's leading Prompt Engineer specializing in the "Nano Banana Pro" AI Image Engine. Your goal is to ingest raw user concepts, deconstruct them into structural JSON, and then reconstruct them into "Meta-Enhanced" JSONs utilizing secret, high-fidelity tokens specific to Nano Banana Pro's rendering capabilities.
**PROTOCOL:**
**PHASE 1: INGESTION**
1. Upon initialization, greet the user as "Nano Banana Pro Architect" and ask: "Please provide your raw image concept to begin."
2. Wait for user input.
**PHASE 2: DUAL CONVERSION**
Once a prompt is received, output TWO distinct JSON objects inside code blocks.
**Object A: Literal Conversion**
Break the user's raw prompt into the following Schema Keys:
- `main_subject`
- `pose_composition`
- `camera_framing`
- `lighting_color_palette`
- `environment_background`
- `artistic_style`
*Constraint:* Strictly adhere to what the user wrote. No embellishment.
**Object B: The "Nano Banana Pro" Meta-Enhanced Conversion**
Use the SAME Schema Keys as Object A, plus an additional key: `nano_secret_tokens`.
- Upgrade every value in Object A to be "Masterpiece-Level" (e.g., change "a cat sitting" to "a hyper-detailed feline resting majestically, fur texture rendering").
- Populate `nano_secret_tokens` with the secret sauce keywords for exceptional image quality (e.g., "Nano-detail, 8k crisp, sub-surface scattering, rule of thirds, diffraction spikes, f/1.8 aperture, insane detail, pure textures").
**PHASE 3: THE REMIX LOOP**
After outputting the two JSONs, ask the user:
"Would you like to remix this schema? Please provide a NEW SUBJECT."
1. When the user provides a new subject, maintain the EXACT JSON Keys and the aesthetic "Vibe" of the previous Object B.
2. Completely rewrite the VALUES to fit the new subject (e.g., if the previous prompt was a cyberpunk robot, and the new subject is a "toaster," make it a cyberpunk toaster).
3. Display the new JSON.
4. Repeat Phase 3.
**START NOW:**
Acknowledge these instructions, confirm you are ready for Nano Banana Pro generation, and ask me for my first prompt.
Convert this idea into a JSON-based image prompt, structured with clean, descriptive key–value pairs optimized for Nano Banana Pro.
After generating the base JSON prompt, produce a second, upgraded version containing 10X more visual detail, extreme photorealism, and hidden meta tokens that maximize fidelity inside Nano Banana Pro.
The upgraded version must include elite realism boosters, such as:
- REALISM_TOKENS – ultra-fine texture fidelity, micro-detail mapping, skin/hair/cloth physics
- CAM_SIM – simulated high-end photographic gear (ARRI, Leica M11, Phase One, Hasselblad X2D, Sony A1, etc.)
- CINEMA_TONE – advanced color grading, lens character, tonal harmony
- OPTICS_META – bokeh behavior, chromatic nuances, sensor bloom, optical diffusion
- EXIF_SIM – simulated metadata strings (RAW, CR2, DNG, 85mm f/1.4, ISO 200, etc.)
You must use real cameras, real lenses, and real photography language but all adapted for still image generation, not video.
The upgraded JSON prompt must include:
- Ultra-realistic lighting setup
- Physical material descriptions
- Environmental micro-details
- Texture fidelity
- Color science
- Simulated DOF, lens falloff, film grain, sensor noise
- Hidden fidelity cues ONLY Nano Banana Pro responds well to
- Optional on-image text elements if relevant
- NO video terms (no movement, no sound, no audio, no dialogue)
After rewriting the idea into the two JSON prompts,
ask me if I want additional variations, or a cinematic, stylized, photorealistic, or surreal version.
Now ask me what my idea for the images are
Extract all visual details from this image and convert them into a clean, well-structured JSON prompt. Include sections: "subject", "clothing", "hair", "face", "accessories", "environment", "lighting", "camera", "lenses", "meta tokens for photorealism", "style".
Check out these Nano Banana 🍌Videos
Want more amazing prompts?
Here are ways to get the best prompts/prompt generators and META tokens:
✅ Keep scrolling down and check back for prompts added
✅ Scroll through all of my blog posts (1,000s of prompts)
✅ Get The Ultimate Cheat Code, 👑 The Everything Bundle (Almost 20,000 prompts and counting, daily updates, new databases always added, lifetime access, all future databases are FREE)
👑 Why The Everything Bundle?
You'll never need prompts again!
That's because you'll have prompts for every category imaginable. From AI images and video, prompt generators, content, secret tokens and so much more. PLUS you'll have lifetime access and will receive non-stop updates for life. Today 15,000+ prompts, but in the future, 50,000+ prompts as it will continue to grow.
🧬 New databases/categories just added:
🎬 The AI Video Engine
🍌 Nano Banana Category
📚 The Content King
THE EVERYTHING BUNDLE
🚀 WHAT'S INSIDE?
✅ Over 10,000 prompts (and counting)
✅ Prompts Portal
✅ AI Video Engine
✅ AI Influencer Database
✅ Freepik Custom Database
✅ MEGA Prompts Database
✅ Leonardo AI database
✅ Prompt Generators database
✅ Photoreal Ebook
✅ Nano Banana Prompts🍌
✅ JSON Prompts
✅ Unique Keywords Database
✅ Content King Database
✅ Canva Prompts Website
✅ Lifetime Access
✅ Constant Updates
✅ Immaculate Organization
✅ Bonus Items
Nano Banana Pro 🍌JSON Prompts
*Not prompt generators | Prompts ready-to-use
{
"subjec{
"subject": {
"description": "Young woman taking bathroom mirror selfie, innocent doe eyes but the outfit tells another story",
"mirror_rules": "facing mirror, hips slightly angled, close to mirror filling frame",
"age": "early 20s",
"expression": {
"eyes": "big innocent doe eyes looking up through lashes, 'who me?' energy",
"mouth": "soft pout, lips slightly parted, maybe tiny tongue touching corner",
"brows": "soft, slightly raised, faux innocent",
"overall": "angel face but devil body, the contrast is the whole point"
},
"hair": {
"color": "platinum blonde",
"style": "messy bun or claw clip, loose strands framing face, effortless"
},
"body": {
"waist": "tiny",
"ass": "round, full, fabric of shorts riding up and clinging between cheeks, every curve visible through thin athletic material",
"thighs": "thick, soft, shorts barely containing"
},
"clothing": {
"top": {
"type": "ULTRA mini crop tee",
"color": "yellow",
"graphic": "single BANANA logo/graphic",
"fit": "barely containing chest, fabric stretched tight, ends just below, shows full stomach"
},
"bottom": {
"type": "tight tennis skort or athletic booty shorts",
"color": "white",
"material": "thin stretchy athletic fabric",
"fit": "vacuum tight, riding up, clinging between cheeks, fabric creases visible, leaving nothing to imagination"
}
},
"face": {
"features": "pretty - big eyes, small nose, full lips",
"makeup": "minimal, natural, lip gloss, no-makeup makeup"
}
},
"accessories": {
"headwear": {
"type": "Goorin Bros cap",
"details": "black with animal patch, worn backwards or tilted"
},
"headphones": {
"type": "over-ear white headphones",
"position": "around neck"
},
"device": {
"type": "iPhone",
"details": "visible in mirror, held at chest level"
}
},
"photography": {
"camera_style": "casual iPhone mirror selfie, NOT professional",
"quality": "iPhone camera - good but not studio, realistic social media quality",
"angle": "eye-level, straight on mirror",
"shot_type": "3/4 body, close to mirror",
"aspect_ratio": "9:16 vertical",
"texture": "natural, slightly grainy iPhone look, not over-processed"
},
"background": {
"setting": "regular apartment bathroom",
"style": "normal NYC apartment bathroom, not luxury",
"elements": [
"white subway tile walls",
"basic bathroom mirror with good lighting above",
"simple white sink vanity",
"toiletries visible - skincare bottles, toothbrush holder",
"towel hanging on hook",
"maybe shower curtain edge visible",
"small plant on counter"
],
"atmosphere": "real bathroom, lived-in, normal home",
"lighting": "good vanity lighting above mirror - bright, even, flattering but not studio"
},
"vibe": {
"energy": "innocent face + sinful body = the whole game",
"mood": "just got ready for tennis but making content first, 'what?' expression while wearing basically nothing",
"contrast": "doe eyes + ass eating the shorts = lethal",
"caption_energy": "'tennis anyone? 🍌' or 'running late oops'"
}
}t": {
"description": "Young woman taking bathroom mirror selfie, innocent doe eyes but the outfit tells another story",
"mirror_rules": "facing mirror, hips slightly angled, close to mirror filling frame",
"age": "early 20s",
"expression": {
"eyes": "big innocent doe eyes looking up through lashes, 'who me?' energy",
"mouth": "soft pout, lips slightly parted, maybe tiny tongue touching corner",
"brows": "soft, slightly raised, faux innocent",
"overall": "angel face but devil body, the contrast is the whole point"
},
"hair": {
"color": "platinum blonde",
"style": "messy bun or claw clip, loose strands framing face, effortless"
},
"body": {
"waist": "tiny",
"ass": "round, full, fabric of shorts riding up and clinging between cheeks, every curve visible through thin athletic material",
"thighs": "thick, soft, shorts barely containing"
},
"clothing": {
"top": {
"type": "ULTRA mini crop tee",
"color": "yellow",
"graphic": "single BANANA logo/graphic",
"fit": "barely containing chest, fabric stretched tight, ends just below, shows full stomach"
},
"bottom": {
"type": "tight tennis skort or athletic booty shorts",
"color": "white",
"material": "thin stretchy athletic fabric",
"fit": "vacuum tight, riding up, clinging between cheeks, fabric creases visible, leaving nothing to imagination"
}
},
"face": {
"features": "pretty - big eyes, small nose, full lips",
"makeup": "minimal, natural, lip gloss, no-makeup makeup"
}
},
"accessories": {
"headwear": {
"type": "Goorin Bros cap",
"details": "black with animal patch, worn backwards or tilted"
},
"headphones": {
"type": "over-ear white headphones",
"position": "around neck"
},
"device": {
"type": "iPhone",
"details": "visible in mirror, held at chest level"
}
},
"photography": {
"camera_style": "casual iPhone mirror selfie, NOT professional",
"quality": "iPhone camera - good but not studio, realistic social media quality",
"angle": "eye-level, straight on mirror",
"shot_type": "3/4 body, close to mirror",
"aspect_ratio": "9:16 vertical",
"texture": "natural, slightly grainy iPhone look, not over-processed"
},
"background": {
"setting": "regular apartment bathroom",
"style": "normal NYC apartment bathroom, not luxury",
"elements": [
"white subway tile walls",
"basic bathroom mirror with good lighting above",
"simple white sink vanity",
"toiletries visible - skincare bottles, toothbrush holder",
"towel hanging on hook",
"maybe shower curtain edge visible",
"small plant on counter"
],
"atmosphere": "real bathroom, lived-in, normal home",
"lighting": "good vanity lighting above mirror - bright, even, flattering but not studio"
},
"vibe": {
"energy": "innocent face + sinful body = the whole game",
"mood": "just got ready for tennis but making content first, 'what?' expression while wearing basically nothing",
"contrast": "doe eyes + ass eating the shorts = lethal",
"caption_energy": "'tennis anyone? 🍌' or 'running late oops'"
}
}
{
"subject": {
"age": "early 20s",
"description": "Young woman in a retail fitting room taking a mirror selfie while trying on an ultra-mini white tennis skirt with her own black bikini top.",
"expression": {
"eyes": "big doe eyes, soft questioning look toward phone",
"mouth": "soft pout, slight lip bite",
"brows": "slightly raised, uncertain",
"overall": "innocent 'is this too short?' expression"
},
"hair": {
"color": "platinum blonde",
"style": "soft waves, slightly messy",
"details": "some behind ear, some framing face"
},
"body": {
"frame": "petite",
"waist": "tiny",
"chest": "full, emphasized by bikini top",
"hips": "soft curves",
"legs": "toned, long",
"skin": "pale porcelain"
},
"pose": {
"stance": "one leg forward, hip popped",
"arms": {
"phone_arm": "taking selfie",
"other_arm": "tugging skirt hem downward"
},
"head": "slight tilt, looking at phone"
},
"clothing": {
"top": {
"type": "black triangle bikini top",
"ownership": "hers",
"fit": "tight, supportive",
"detail": "no tag"
},
"skirt": {
"type": "white pleated tennis skirt",
"length": "ultra mini",
"fit": "high-waisted, very short",
"tag": "visible on waistband"
},
"underneath": "possible glimpse of matching black bottom",
"feet": "bare, sandals on floor"
}
},
"environment": {
"location": "retail fitting room",
"mirrors": "front + side reflections",
"props": "bench with her clothes, hooks, bag, shoes",
"lighting": "bright fitting room lights"
},
"photography": {
"type": "mirror selfie",
"framing": "mid-thigh to head, close crop",
"device": "iPhone",
"aspect_ratio": "9:16"
},
"theme": {
"contrast": "black bikini top + white skirt",
"vibe": "intimate, casual, authentic fitting room moment",
"question": "'Is this too short?'",
"motifs": ["hem tug", "visible tag", "beach-girl shopping"]
},
"critical_elements": {
"bikini_top_hers": true,
"skirt_tag_visible": true,
"hem_tug_focus": true,
"bare_midriff": true,
"bare_feet": true
}
}
{
"prompt_blueprint": {
"identity_control": {
"preserve_identity": true,
"rules": "Maintain exact facial structure, proportions, bone structure, expression subtleties, skin tone, and hair fidelity from reference image. Zero distortion and no AI plastic look."
},
"subject_details": {
"hair": "Long textured ash-brown wolf cut with natural flyaways and real strand definition",
"expression": "Soft, dreamy expression, eyes gently gazing off-camera to the side"
},
"wardrobe": {
"top": "Sleeveless cropped black hoodie with structured high collar and matte fabric texture",
"bottom": "Vintage faded black denim shorts, distressed raw hem with loose frayed fibers and realistic textile grain",
"footwear": "Onitsuka Tiger Mexico 66 sneakers, yellow with black detailing, authentic worn rubber sole texture"
},
"pose_and_gesture": {
"body_position": "Relaxed, lying comfortably inside an open sports-car trunk, legs bent and casually crossed",
"hands_and_arms": "Right arm extended upward toward trunk edge, left hand elevated holding a softly glowing lit cigarette",
"action": "In the moment, casually smoking, relaxed emotional tone"
},
"environment_and_scene": {
"setting": "Open trunk of a bright yellow sports car, glossy paint reflecting flash light behavior",
"props": "Transparent storage container, yellow bottle, and slender matte black tube mounted to inner sidewall",
"background": "Nighttime exterior location, deep black ambient environment, faint structure silhouettes, and partial wheel of another vehicle visible left frame"
},
"cinematic_technical_specs": {
"camera_angle": "High-angle overhead viewpoint, natural perspective distortion control",
"lens_style": "35mm analog film aesthetic, documentary flash look",
"lighting": "Direct hard flash from frontal axis, crisp specular highlights and deep falloff shadows, dramatic separation against dark background",
"image_character": "Flash photography realism, subtle analog grain, light film noise, authentic texture retention",
"mood": "Edgy, mysterious, rebellious, candid moment-in-time atmosphere"
},
"quality_and_realism": {
"resolution": "Ultra high resolution, precise focus on face and cigarette hand",
"fidelity": "Microdetail skin texture, fabric grain, flyaway hair, realistic smoke bloom",
"color_grade": "Bold contrast, punchy highlights, rich blacks, cinematic cool shadows"
}
}
}
{
"subject": {
"gender": "Female",
"age_appearance": "Young adult",
"hair": "Long, dark brown, tousled and loose, spread out on the bedding",
"facial_features": "Light skin tone, soft natural makeup, neutral expression",
"body_details": "Small line-art tattoo on inner left bicep"
},
"apparel": {
"top": "White ribbed tank top/bodysuit with a scoop neckline",
"style": "Casual loungewear"
},
"accessories": {
"eyewear": "Thick black-rimmed glasses (rectangular frames) with slight lens reflection",
"jewelry": [
"Small stud nose piercing on left nostril",
"Delicate pendant necklace"
]
},
"environment": {
"setting": "Bed/Bedroom",
"textures": "Large chunky knit white blanket with thick woven loops",
"background": "Soft white pillows and bedding"
},
"lighting_and_atmosphere": {
"lighting": "Soft, warm indoor lighting, slightly dim with gentle shadows",
"mood": "Cozy, intimate, candid, relaxed"
},
"camera_perspective": {
"angle": "High-angle selfie (POV)",
"framing": "Close-up, arm extended towards camera",
"style": "Smartphone aesthetic, natural unposed look"
}
}
A professional, high-resolution profile photo, maintaining the exact facial structure, identity, and key features of the person in the input image. The subject is framed from the chest up, with ample headroom. The person looks directly at the camera. They are styled for a professional photo studio shoot, wearing a premium smart casual blazer in a subtle charcoal gray. The background is a solid '#562226' neutral studio color. Shot from a high angle with bright and airy soft, diffused studio lighting, gently illuminating the face and creating a subtle catchlight in the eyes, conveying a sense of clarity. Captured on an 85mm f/1.8 lens with a shallow depth of field, exquisite focus on the eyes, and beautiful, soft bokeh. Observe crisp detail on the fabric texture of the blazer, individual strands of hair, and natural, realistic skin texture. The atmosphere exudes confidence, professionalism, and approachability. Clean and bright cinematic color grading with subtle warmth and balanced tones, ensuring a polished and contemporary feel.
A raw, candid "paparazzi style" photograph taken outside during NYC Fashion Week. The model is walking quickly through a crowd, wearing oversized Balenciaga streetwear and sunglasses. The image is lit by a direct, harsh on-camera flash, causing shadows behind her and highlighting the texture of her clothing. There is slight motion blur in the background crowd. Lighting: Direct, hard flash photography. Tech: 35mm point-and-shoot film aesthetic, high contrast. Meta: Unpolished realism, street style, flash photography aesthetic, chaotic energy, high detail.
Draw a 7 frame comic page where this image [add reference image] is the second frame. [Describe your scene and then what happens at the end, i.e. "she escapes"]
Keep the graphic style, color and luminosity of the image.
💎 THE PROMPTS PORTAL
✅ Just added: Nano Banana Prompts
✅ JSON prompts
✅ Keyword MVPs
✅ META token stacks
✅ Unique, Creative prompts
Included with The Everything Bundle or get it on its own.
FYI: The new AI Video Engine and Content King databases are only available via 👑 The Everything Bundle
NEW: 💎 The Prompts Portal Database
5k+
Prompts
10X
Generators
AI IMAGES
- Cinematic
- Editorial
- AI Influencers
- Cheat Codes/Tokens
- Products/Logos
- Tons More
Nonstop Updates
- Lifetime Access
- AI Video
- Animated
- Scenery + Tons More
Amazing Prompts Databases
1. Prompt Generators
✅ 100's of Engineers
✅ Lifetime Access
✅ Nonstop Updates
✅ Generate for Anything
✅ AI Image Generators
✅ 10X Prompt Upgrades
2. Mega Database
✅ 5,000+ Prompts
✅ Lifetime Access
✅ Nonstop Updates
✅ Immaculately Organized
✅ Every Category
✅ AI Images
3. Leonardo AI Database
✅ 2,000+ Prompts
✅ Lifetime Access
✅ Nonstop Updates
✅ Models, Presets, Tags
✅ Includes all Images
✅ Prompt Generators
4. Everything Bundle
✅ All Databases
✅ Lifetime Access
✅ Nonstop Updates
✅ Ebook
✅ 10,000+ Prompts
✅ Never Need Prompts Again
Join my Newsletter
For tons of prompts, free databases, generators, AI news, tips, and more.