Prompt: Pretty blue shallow ocean with sand. With so many AI chatbots on the market, picking the best one can be challenging. To try and settle the debate, Google DeepMind pitted the leading chatbots against each other and found that users are most impressed by one image generator — Imagen 3.
Also: I just tried Google’s ImageFX AI image generator, and I’m shocked at how good it is
A report , published on Wednesday, details how Google DeepMind evaluated Imagen 3’s performance against its predecessor, Imagen 2, and leading external models, including DALL-E 3 , Midjourney v6, Stable Diffusion 3 Large, and Stable Diffusion XL 1.0, in both human and automatic evaluations.
The human evaluations tested five quality aspects of the text-to-image generation models: preference, prompt-image alignment, visual appeal, detailed prompt-image alignment, and numerical reasoning.
In the overall preference category, which measured how satisfied a user was with the image compared to the input prompt, Imagen 3 won with a significant lead over the competition, as seen in the image below:
Imagen 3 performed competitively in the other human evaluation categories, as well as the automatic evaluations, which tested prompt-image alignment (again) and image quality.
Also: Google’s AI Overviews get three useful updates. Here’s what’s new
"All in all, Imagen 3 clearly leads on prompt–image alignment, especially on detailed prompts and counting abilities; while on visual appeal, Midjourney v6 takes the lead, with Imagen 3 coming in second," concluded the report.
"When considering all the quality aspects, Imagen 3 clearly leads in overall preference, indicating it strikes the best balance of high-quality outputs that respect user intent."Sound too good to be true? Here is how you can test Imagen 3 in ImageFX, a tool in Google Labs that lets people create images with simple text prompts.
Google says its Imagen 3 AI image generator beats DALL-E 3. How to try it for yourself