Following Gemma 3 and Gemini Robotics earlier today, Google’s AI news continues with wider access to native image output in Gemini 2.0 Flash that allows for conversational image editing alongside other capabilities.

When Gemini 2.0 Flash was announced in December, Google talked about outputting audio and image in addition to text. It’s part of making Gemini a multimodal model that can accept various inputs and generate similar outputs.

Instead of just providing a prompt and getting back an image, native output allows you to “edit images through many turns of a natural language dialogue.” Context is preserved throughout the conversation. 

Meanwhile, 2.0 Flash can better render images with text, including long sequences. This has been difficult for today’s models.

Advertisement – scroll for more content

window.adSlotsConfig = window.adSlotsConfig || [];

adSlotsConfig.push( {
slotID: ‘/1049447/Outbrain’,
slotName: ‘div-gpt-ad-outbrain-ad-664929’,
sizes: [300, 250],
slotPosition: ‘mid_article’
} );

Compared to other standalone image generation models, this capability in 2.0 Flash “leverages world knowledge and enhanced reasoning to create the right image.” 

This makes it perfect for creating detailed imagery that’s realistic–like illustrating a recipe. While it strives for accuracy, like all language models, its knowledge is broad and general, not absolute or complete.

In the example below, the prompt is: “Give me a recipe for a chocolate chip cookie. Please include an image of each step.”

One example use case of being able to output text and images together is asking 2.0 Flash to tell a story with pictures that keep the “characters and settings consistent throughout.”

Back in December, Gemini 2.0 Flash’s native image output was just for trusted testers. All developers/users can now try it in Google AI Studio with the updated experimental version of Gemini 2.0 Flash (gemini-2.0-flash-exp), or the Gemini API. In the right-hand model picker (on desktop), go to the “preview” section. Set the “output format” to: Images + text. Daily limits are in place.

FTC: We use income earning auto affiliate links. More.

You’re reading 9to5Google — experts who break news about Google and its surrounding ecosystem, day after day. Be sure to check out our homepage for all the latest news, and follow 9to5Google on Twitter, Facebook, and LinkedIn to stay in the loop. Don’t know where to start? Check out our exclusive stories, reviews, how-tos, and subscribe to our YouTube channel