ViviNova AI Image Generator

ViviNova (Vivid + Nova) AI image generator transforms your text prompts into stunning, vivid visuals instantly. Create social media graphics, marketing content, or creative artwork — no design skills needed.

30s
4 Credits
Free

Nano Banana 2 — Adjustable Reasoning, Search Grounding, 4K

Generate AI images with Nano Banana 2 — Google Gemini 3.1 Flash. Adjustable thinking depth, image search grounding, 0.5K-4K output, and extreme aspect ratios.

The Most Technically Versatile Nano Banana

Nano Banana 2 runs on Gemini 3.1 Flash — the newest model in the family (February 2026). It is not positioned as the fastest (Nano Banana is faster) or the highest fidelity (Nano Banana Pro is more accurate). What it offers is control: adjustable reasoning depth, real-time visual search grounding, the widest resolution range, the most object reference slots, and aspect ratios that no other Gemini image model supports.

Adjustable Thinking Depth

Set reasoning to minimal (3-4s) or high (6-8s) per generation — exclusive to this model among Gemini image models

Web + Image Search Grounding

Retrieve real-world visual references from Google before generating — not just web text, but actual images

0.5K to 4K + Extreme Ratios

Four resolution tiers and aspect ratios up to 1:8 and 8:1 — the widest format support in the family

131K Input Context

Double the context window of Nano Banana and Pro — room for detailed prompts with multiple references

Thinking Config: Choose Your Reasoning Depth

Every other Nano Banana model has a fixed reasoning pipeline. Nano Banana always generates directly (no reasoning). Nano Banana Pro always runs its World Simulator (3-5 second Reasoning Pause). Nano Banana 2 lets you choose.

Minimal Thinking (default)

At thinking_level: "minimal", Nano Banana 2 generates in 3-4 seconds with lightweight reasoning. The model still processes spatial relationships and style cues, but does not spend time on deep compositional analysis.

When to use: Single subjects, simple backgrounds, rapid iteration, prompt exploration. Any situation where Nano Banana would work but you want access to 4K output or reference images.

High Thinking

At thinking_level: "high", the model spends 2-4 additional seconds reasoning through the prompt before rendering. It plans multi-subject positioning, resolves spatial conflicts, and considers how lighting interacts with materials.

When to use: Multi-subject compositions ("three objects arranged left to right"), scenes with specific spatial constraints, prompts with multiple style and content requirements. You can also set include_thoughts: true to see the model's reasoning process — useful for understanding why a particular output looks the way it does and how to refine your prompt.

The Trade-off vs Pro

Nano Banana 2 at thinking_level: "high" approaches Nano Banana Pro quality but does not match it. Pro's World Simulator performs deeper physics simulation (light refraction, fluid dynamics) and achieves ~94-96% text rendering accuracy vs Nano Banana 2's ~90%. The question is whether the quality gap justifies Pro's slower speed (8-12s vs 6-8s) and higher cost at 4K.

Google Search Grounding: Web + Image

When you prompt for a real-world subject — a specific landmark, a current fashion trend, a recognizable product — the model is limited by what it learned during training. Search grounding removes that limitation.

Nano Banana 2 supports both web and image search grounding — the only Nano Banana model with image search. (Nano Banana Pro supports web search only; Nano Banana supports neither.)

What image search grounding does: Before generating, the model queries Google Image Search for visual references matching relevant terms in your prompt. It uses these retrieved images as implicit context — learning what the subject actually looks like rather than relying on its training data.

Where it matters most:

  • Recent subjects: Anything that appeared or changed after the model's training cutoff (January 2025). A building completed in 2026, a new car model, a recent fashion collection.
  • Specific real-world objects: "The Louvre pyramid at golden hour" generates a more accurate Louvre with image grounding than without, because the model retrieves actual photos rather than reconstructing from memory.
  • Cultural and regional specifics: Traditional clothing, regional architecture, local food presentation styles — subjects where training data may be sparse but Google Images has extensive coverage.

Resolution and Aspect Ratio: The Widest Range

Four Resolution Tiers

Nano Banana 2 offers the most resolution options in the family:

TierPixelsBest For
0.5K~512pxRapid drafts, thumbnail previews, concept exploration at minimal cost
1K~1024pxSocial media, web content, blog images
2K~2048pxCampaign banners, editorial assets, screen-quality deliverables
4K~4096pxPrint, large-format display, crop-heavy workflows

The 0.5K tier is exclusive to Nano Banana 2. Neither Nano Banana (1K only) nor Nano Banana Pro (1K/2K/4K) offer it. At 512px, generation is fast enough for batch exploration — generate 20 concepts in under a minute, then scale the winner to 4K.

Extreme Aspect Ratios

Beyond standard ratios (1:1, 16:9, 9:16, etc.), Nano Banana 2 supports formats that no other Nano Banana model can produce:

  • 4:1 / 1:4 — Website hero banners, vertical mobile story covers
  • 8:1 / 1:8 — Panoramic strips, ultra-tall story formats, cinematic landscape compositions

These generate natively at the target ratio — no cropping a square image. An 8:1 panoramic at 4K produces a 4096×512 image that fills a wide-format display edge to edge.

Reference Images: Most Object Slots in the Family

Nano Banana 2 supports 14 total reference images — structured as up to 10 object fidelity slots and 4 character consistency slots.

This gives it the most object reference capacity of any Nano Banana model: 10 slots vs Nano Banana Pro's 6. For workflows built around product photography, environment reference, or style transfer with many visual inputs, Nano Banana 2 accommodates more reference material per generation.

Pro has more character slots (5 vs 4). For character-heavy work — illustration series, brand mascots, consistent character generation across many outputs — Pro's extra character slot and World Simulator reasoning produce stronger identity consistency.

Developer-Oriented Features

131K Input Context

Nano Banana 2 accepts 131,072 input tokens — double the 65K limit of both Nano Banana and Pro. This is not just about longer prompts. The larger context window accommodates:

  • Multiple high-resolution reference images in a single prompt (each image consumes tokens)
  • Detailed multi-paragraph scene descriptions
  • Long conversation histories for multi-turn generation workflows

Thinking Transparency

Setting include_thoughts: true returns the model's reasoning process alongside the generated image. This reveals how the model interpreted your prompt — which spatial constraints it prioritized, how it resolved ambiguities, why it placed elements where it did. Useful for prompt engineering: if the reasoning shows a misinterpretation, you know exactly which part of the prompt to rewrite.

Practical Tips

Start every session at 0.5K with minimal thinking. Explore directions at the lowest cost and fastest speed. Once you find a composition that works, step up resolution and thinking depth for the final output. This is structurally different from the other tiers: Nano Banana explores at 1K (its only resolution), and Nano Banana Pro explores at full World Simulator cost because reasoning is not adjustable.

Enable image search grounding for any real-world subject. If your prompt names something that exists — a specific city, product, architectural style, fashion trend — image grounding improves accuracy with no other change required.

Use extreme ratios for their intended formats. 8:1 for website hero banners, 1:4 for mobile story covers, 4:1 for email header images. Designing at the native aspect ratio avoids the quality loss of generating square and cropping.

Reserve high thinking for multi-constraint prompts. A single subject on a simple background does not benefit from high thinking — minimal is faster with no visible quality difference. Save high thinking for prompts with spatial instructions ("three items arranged by size"), multiple style requirements ("photorealistic but with illustration-style sky"), or compositional constraints ("rule of thirds, subject at left third").


More AI Image Generators on ViviNova

Ready to create with Nano Banana 2?

Free to use, no signup required. Start creating in seconds.

Get Started Free

FAQ

thinking_config is a parameter exclusive to Nano Banana 2 among Gemini image models. It controls how much reasoning the model applies before generating. At 'minimal' (default), generation takes 3-4 seconds. At 'high', the model reasons through spatial relationships, multi-subject positioning, and compositional constraints before rendering — 6-8 seconds but measurably better output on complex prompts. You can also set include_thoughts=True to see the model's reasoning process.
Nano Banana Pro supports web search grounding only. Nano Banana 2 supports both web AND image search grounding. Image search grounding lets the model retrieve visual references of real-world subjects (landmarks, products, fashion) from Google before generating — producing more accurate depictions than relying solely on training data.
Standard ratios (1:1, 16:9, 9:16, 4:3, 3:2, etc.) plus extreme ratios exclusive to this model: 1:4, 4:1, 1:8, and 8:1. Extreme ratios are useful for website hero banners, vertical story formats, panoramic compositions, and social media cover images.
Nano Banana 2 accepts 131,072 input tokens — double the 65K limit of Nano Banana and Pro. This larger context window allows longer, more detailed prompts and more reference material per generation. It is particularly useful when combining multiple reference images with detailed text instructions.
Yes. All images across all resolution tiers are cleared for commercial use with no watermarks.