Your all-in-one hub for advancing frontier AI. Explore research, product updates, guides, and real-world use cases.
Today we’re launching Labelbox Applied Research with three flagship pillars: Labelbox Evals for unified model evaluation, Labelbox Agents for building reliable and interpretable agents, and Labelbox Robotics (LBRx) for delivering high-quality training data for advanced robotic manipulation.
Labelbox•November 19, 2025
Labelbox Evaluation Studio is a private, real-time platform where top AI teams unlock tailored insights, instantly spot strengths and weaknesses, and accelerate frontier model improvements.
Labelbox•August 5, 2025
Meet Labelbox's MMC editor, now with MCP support, which enables human-in-the-loop evaluation by making it easy to inspect, label, and correct agentic-tool interactions.
Labelbox•July 24, 2025
Introducing Labelbox’s deep research leaderboard: an open, continuously updated scorecard that shows how top AI agents from OpenAI, Google, and Anthropic perform on long-form research tasks.
Labelbox•July 21, 2025
Enterprises need search-augmented LLMs that deliver fast, trustworthy, and up-to-date answers—not just polished language. Since public benchmarks rarely test for this, the Labelbox research team conducted its own study across three frontier models: Gemini 2.5 Pro, GPT-4.1, and Claude 4.0 Opus.
Labelbox•June 13, 2025