
Video generation

Last updated: November 1, 2024

Our video generation leaderboard evaluates AI models on their ability to generate high-quality videos from textual descriptions. We assess factors such as visual quality, adherence to the given text, and creativity.

Human preference evaluation

Diverse pool of US-based Alignerrs, including generalists and creative artists

Consensus of three Alignerrs per task

Standardized instructions and ontology for consistent evaluations

Carefully curated prompt generation process, balancing creativity and clarity
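The page does not say how the three ratings per task are combined. As a minimal sketch, assuming a simple majority vote (an assumption, not the documented method), a consensus step might look like the following. The function name `consensus_preference`, the `min_agreement` parameter, and the model labels are hypothetical.

```python
from collections import Counter
from typing import Optional, Sequence


def consensus_preference(votes: Sequence[str], min_agreement: int = 2) -> Optional[str]:
    """Return the option picked by at least `min_agreement` raters, else None.

    Assumption: consensus means a simple majority among the raters.
    """
    if not votes:
        return None
    # most_common(1) yields the single most frequent option and its count.
    winner, count = Counter(votes).most_common(1)[0]
    return winner if count >= min_agreement else None


# Hypothetical task: three raters each pick the video they prefer.
task_votes = ["model_a", "model_b", "model_a"]
print(consensus_preference(task_votes))
```

With three raters and a threshold of two, a task where every rater picks a different option yields no consensus (`None`), which a real pipeline might route to additional review.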

Overall preference

Prompt alignment

Realism

Examples

PROMPT

In a noir style, a detective unearths a savory dish from a decaying diner, with shadowy angles and a moody greyscale palette emphasizing the mysterious atmosphere

Luma Dream Machine

Pika

Runway Gen-3