Covering everything you need to know in order to build AI products faster.
Labelbox•August 22, 2025
Announcing R-ConstraintBench: A novel way to stress-test LLM reasoning abilities under interacting constraints
We've released a research paper on R-ConstraintBench, a novel benchmark for evaluating LLM reasoning on realistic resource-constrained project scheduling problems (RCPSP), a well-known NP-complete challenge.
Labelbox•August 5, 2025
Introducing Labelbox Evaluation Studio: Drive AGI advancements with real-time feedback on model performance
Labelbox Evaluation Studio unlocks a private, real-time platform where top AI teams unlock tailored insights, instantly spot strengths and weaknesses, and accelerate faster frontier model improvements.
Labelbox•July 17, 2025
An economic report on the human expertise fueling frontier AI
Our latest report examines the emerging expert economy driving frontier AI, detailing the backgrounds and disciplines of these knowledge workers and their impact on cutting-edge AI systems. It also covers their earnings and the high-skill data crucial for the next stage of AI development.
Labelbox•May 16, 2025
Rubric evaluations: Fueling the next wave of reinforcement learning
See how Labelbox utilizes custom rubric-based evaluations to help leading AI labs train and assess advanced frontier models with depth and nuance.
Labelbox•May 15, 2025
Prompt to production: How to improve AI app generators with rubric evals
Discover how modern rubric-based evaluations and human evaluation are crucial for advancing the capabilities of prompt-to-app and AI app generators.
Labelbox•May 6, 2025
How to fill your RLVR pipeline with advanced reasoning data
Learn how to fill your Reinforcement Learning with Verifiable Rewards (RLVR) pipelines to teach your models effective reasoning, especially for logic, math, and coding problems with clear solutions.
Labelbox•April 24, 2025
Reinventing AI evaluation: Discover the simplicity of Labelbox's new form-based MMC editor
Discover Labelbox's redesigned Multimodal Chat editor, offering an intuitive, form-based experience for streamlined AI model evaluation and data generation.
Labelbox•April 18, 2025
New Complex Reasoning Leaderboard: Gemini 2.5 debuts at the top
Labelbox introduces a new Complex Reasoning Leaderboard, ranking Google's Gemini 2.5 Pro as the top AI model for advanced reasoning tasks.
Labelbox•April 9, 2025
Introducing a powerful, new interactive Workflow editor
Learn about the new Labelbox Workflow that introduces an interactive, node-based editor to create, manage, and visualize multi-step review workflows.
Labelbox•April 7, 2025
Q1 spotlight: Accelerating AI development with new products and services
Catch up on Labelbox's latest news from Q1, including expanded Leaderboards, the Alignerr Connect launch, and platform advancements empowering the next generation of AI models.