Blog

Your all-in-one hub for advancing frontier AI. Explore research, product updates, guides, and real-world use cases.

Latest Applied research Releases Announcements Use cases Engineering

Bridging insight and innovation: Introducing Labelbox Applied Research

Today we’re launching Labelbox Applied Research with three flagship pillars: Labelbox Evals for unified model evaluation, Labelbox Agents for building reliable and interpretable agents, and Labelbox Robotics (LBRx) for delivering high-quality training data for advanced robotic manipulation.

Labelbox•November 19, 2025

Introducing Labelbox Evaluation Studio: Drive AGI advancements with real-time feedback on model performance

Labelbox Evaluation Studio unlocks a private, real-time platform where top AI teams unlock tailored insights, instantly spot strengths and weaknesses, and accelerate faster frontier model improvements.

Labelbox•August 5, 2025

Teaching agents to use tools with human supervision: MCP support now available

Meet Labelbox's MMC editor now with MCP support which enables human-in-the-loop evaluation by making it easy to inspect, label, and correct agentic-tool interactions.

Labelbox•July 24, 2025

Benchmarking deep research agents

Introducing Labelbox’s deep research leaderboard: an open, continuously‑updated scorecard that shows showing how top AI agents like OpenAI, Google, and Anthropic perform on long-form research tasks.

Labelbox•July 21, 2025

Benchmarking agentic search

Enterprises need search-augmented LLMs that deliver fast, trustworthy, and up-to-date answers—not just polished language. Since public benchmarks rarely test for this, the Labelbox research team conducted its own study across three frontier models: Gemini 2.5 Pro, GPT-4.1, and Claude 4.0 Opus.

Labelbox•June 13, 2025

Page 1 of 1