Google has recently added Nano Banana Pro to its offerings. This article performs several tests on the API itself and the new capabilities of the model.
Related articles:
API Updates
- The model name is
gemini-3-pro-preview. You can use it with the generateContent or streamGenerateContent endpoints. - This model enforces a mandatory thinking process. The content parts would return the
thoughtSignature, which must be stored. Conversation could not continue without the signature being attached as part of the input. - It is possible to specify the image output size, such as 1K, 2K, or 4K. Sample:
{
......
"generationConfig": {
"maxOutputTokens": 32000,
"imageConfig": {
"aspectRatio": "16:9",
"imageSize": "1K"
},
"thinkingConfig": {
"thinkingBudget": 3000
}
}
}
Test #1 - 4K Image
The first test is a simple, straightforward request to generate an image based on a given prompt. The streaming takes 3 minutes to complete, and a JPEG image is received.

Check the image's metadata, and we can see it produced a 4K image.

Test #2 - With reference input images
The model allows up to 14 input images. In this test, we provide several images as a reference and observe how they are being used in the output.

The output looks great! Especially since it is done in 4K and a 16:9 ratio.
(This is a scaled-down version of the original output.)

Test #3 - Conversation & Image editing
In this test, we first generate an image, then request an edit to it.
Create a teddy bear image, sit behind the glass in a toy store. street view: modern tokyo.
slightly cloudy. only a few ppl on the streets. no cars.

add some Christmas decoration around the bear. keep the street unchanged.

Test #4 - Combination of realistic & imaginary/illustrative
In the last test, we asked the model to generate an image of a computer screen showing a shooting game.

Test #5 Combining Math & Handwriting Imitation
Solve this equation and write the steps with similar handwriting on the paper.

It returns a solution with the correct answer, written in a similar handwriting style.

Things to take note of
- Unlike the original Nano Banana, the new Pro model takes a much longer time (E.g., 2 to 4 minutes) to complete the request. It is recommended to use the streaming endpoint in a background process.
- If you have a restriction on file size (either firewall, server, or app level), remember to increase the value. The 4K image is huge; for example, a 16:9 image is around 8MB each.
- The Pro model utilises Gemini 3 Pro thinking capability. This is mandatory, and it increases the usage of tokens.
- The size of thoughtSignature can be huge. We have seen a range of hundreds of KB to a few MB. If your use case does not require a conversation, you should discard them without saving the signature.