Image generation

Last updated: June 6, 2025

Our image generation leaderboard evaluates AI models on their ability to generate high-quality images from textual descriptions. We assess various factors based on the project's specific criteria.

Human preference evaluation

Diverse pool of US-based Alignerrs, including generalists and creative artists

Consensus of three Alignerrs per task

Standardized instructions and ontology for consistent evaluations

Carefully curated prompt generation process, balancing creativity and clarity

Examples

PROMPT

Cinematic: A resolute soldier unlocking a mysterious wooden door in a dimly lit room. Dramatic chiaroscuro lighting highlights tension. Close-up perspective, emphasizing the soldier's anxious but righteous expression.

Imagen 3

DALL·E 3

Flux 1.1 Pro

Stable Diffusion 3

Ideogram 2.0

Recraft v3

Want us to evaluate your model?

If you’d like us to consider your model as part of the next set of leaderboard evaluations, contact us at leaderboard@labelbox.com.