Your all-in-one hub for advancing frontier AI. Explore research, product updates, guides, and real-world use cases.
Our latest report examines the emerging expert economy driving frontier AI, detailing the backgrounds and disciplines of these knowledge workers and their impact on cutting-edge AI systems. It also covers their earnings and the high-skill data crucial for the next stage of AI development.
Labelbox•July 17, 2025
We tested rubric-based rewards and GRPO on a real-world e-commerce task and found they outperformed sparse rewards by 300%. This helps validate their effectiveness for complex, multi-step business workflows.
Labelbox•July 1, 2025
Agentic AI is emerging as a new frontier in autonomy, where models can plan, adapt, and take action independently. In this post we highlight three real-world projects with leading AI labs, from multi step tool use to structured reasoning and dynamic instruction following.
Labelbox•June 30, 2025
Enterprises need search-augmented LLMs that deliver fast, trustworthy, and up-to-date answers—not just polished language. Since public benchmarks rarely test for this, the Labelbox research team conducted its own study across three frontier models: Gemini 2.5 Pro, GPT-4.1, and Claude 4.0 Opus.
Labelbox•June 13, 2025
See how Labelbox utilizes custom rubric-based evaluations to help leading AI labs train and assess advanced frontier models with depth and nuance.
Labelbox•May 16, 2025
Discover how modern rubric-based evaluations and human evaluation are crucial for advancing the capabilities of prompt-to-app and AI app generators.
Labelbox•May 15, 2025
Learn how to fill your Reinforcement Learning with Verifiable Rewards (RLVR) pipelines to teach your models effective reasoning, especially for logic, math, and coding problems with clear solutions.
Labelbox•May 6, 2025
Discover Labelbox's redesigned Multimodal Chat editor, offering an intuitive, form-based experience for streamlined AI model evaluation and data generation.
Labelbox•April 24, 2025
Labelbox introduces a new Complex Reasoning Leaderboard, ranking Google's Gemini 2.5 Pro as the top AI model for advanced reasoning tasks.
Labelbox•April 18, 2025
Learn about the new Labelbox Workflow that introduces an interactive, node-based editor to create, manage, and visualize multi-step review workflows.
Labelbox•April 9, 2025
Get started for free or see how Labelbox can fit your specific needs by requesting a demo