Catalog

Vector & traditional search, data exploration, and curation, all in a single place. Ship performant AI systems with raw data, metadata, embeddings, and ground truth labels at your fingertips. 

Catalog eliminates the need to build your own traditional and vector database infrastructure. Build better AI applications by leveraging out-of-the-box search for images, text, videos, conversations, and documents across metadata, vector embeddings, and annotations. 

A new way to search unstructured data

Speed up model development with enriched datasets

Explore datasets using AI insights

Not all data impacts model performance equally. Through our active learning workflows and uncertainty sampling, you can filter for data with low-confidence predictions to curate and label the right data, not just more data.
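
As a rough illustration of uncertainty sampling, the sketch below picks the items whose top predicted class probability falls below a threshold, least confident first. It is a minimal, generic example and not Catalog's implementation; the function name `least_confident`, the data IDs, and the 0.6 threshold are all assumptions made for illustration.

```python
from typing import Dict, List


def least_confident(predictions: Dict[str, List[float]],
                    threshold: float = 0.6) -> List[str]:
    """Return IDs whose top class probability is below `threshold`.

    Hypothetical helper: results are ordered least confident first.
    """
    # Pair each item's highest class probability with its ID.
    scored = [(max(probs), data_id) for data_id, probs in predictions.items()]
    # Sort ascending by confidence and keep only the uncertain items.
    return [data_id for conf, data_id in sorted(scored) if conf < threshold]


# Made-up model outputs: one probability per class for each data row.
predictions = {
    "img_001": [0.98, 0.01, 0.01],
    "img_002": [0.40, 0.35, 0.25],
    "img_003": [0.55, 0.30, 0.15],
    "img_004": [0.70, 0.20, 0.10],
}

to_label = least_confident(predictions, threshold=0.6)
print(to_label)  # ['img_002', 'img_003']
```

Items the model is already confident about are left out, so the labeling queue concentrates on the rows most likely to improve the model.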

Improve models faster with active learning

Use built-in natural language search and pre-computed or custom vector embeddings to find clusters of similar data for automated classification. Optionally send the results to human review for maximal accuracy.
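
The idea behind embedding-based zero-shot classification can be sketched as follows: each item is assigned the class whose reference embedding is closest by cosine similarity, and items with no sufficiently close class are set aside for human review. This is a generic sketch under stated assumptions, not Catalog's API. The function names, the toy 2-D vectors, and the 0.8 similarity cutoff are hypothetical; in practice the embeddings would come from an embedding model.

```python
import math
from typing import Dict, List, Optional, Tuple


def cosine(a: List[float], b: List[float]) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def zero_shot_label(item: List[float],
                    class_embeddings: Dict[str, List[float]],
                    min_similarity: float = 0.8) -> Tuple[Optional[str], float]:
    """Return (label, score); label is None when the item needs human review."""
    best_label, best_score = max(
        ((label, cosine(item, emb)) for label, emb in class_embeddings.items()),
        key=lambda pair: pair[1],
    )
    if best_score < min_similarity:
        return None, best_score  # too ambiguous: route to a reviewer
    return best_label, best_score


# Toy reference embeddings, one per class (assumed for illustration).
class_embeddings = {"cat": [1.0, 0.0], "dog": [0.0, 1.0]}

confident = zero_shot_label([0.9, 0.1], class_embeddings)
ambiguous = zero_shot_label([0.7, 0.7], class_embeddings)
print(confident[0])  # cat
print(ambiguous[0])  # None
```

The cutoff trades automation against accuracy: raising it sends more items to human review, lowering it labels more items automatically.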

Perform zero-shot classification with vector embeddings

View a detailed class distribution of ground truth labels or model inferences to get a better understanding of your data. See how performance metrics like F1 score vary across your data so you can make the most informed decisions when curating data to label.
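
To make the two quantities above concrete, the sketch below computes a class distribution from ground truth labels and a per-class F1 score from model inferences. It is a minimal standard-library example on made-up labels, not how Catalog computes its metrics; the helper name `per_class_f1` is hypothetical.

```python
from collections import Counter
from typing import Dict, List


def per_class_f1(truth: List[str], predicted: List[str]) -> Dict[str, float]:
    """Compute F1 = 2*TP / (2*TP + FP + FN) separately for each class."""
    scores = {}
    for label in sorted(set(truth) | set(predicted)):
        pairs = list(zip(truth, predicted))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        scores[label] = 2 * tp / denom if denom else 0.0
    return scores


# Made-up ground truth labels and model inferences for six data rows.
truth = ["car", "car", "car", "truck", "truck", "bike"]
predicted = ["car", "car", "truck", "truck", "car", "bike"]

distribution = Counter(truth)  # how many ground truth labels per class
f1 = per_class_f1(truth, predicted)

print(distribution.most_common())
for label, score in f1.items():
    print(f"{label}: F1 = {score:.2f}")
```

Classes that are both rare in the distribution and low in F1 are natural candidates to prioritize when curating data to label.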

Analyze your data with metrics that matter

Optimize labeling budget

Don’t let searching for data and edge cases slow your team down or hold up conversations with stakeholders or customers. Instead of relying on one-off query scripts, search and discover data faster inside Catalog.

Share and act on insights faster

“Catalog is huge for us. Pre-Catalog, we only had 50% accuracy on models and would have to rely on tedious manual data selection. With Catalog in Labelbox, we can quickly search and visualize all of our unstructured data and use active learning and weak supervision techniques to target data collection on models quickly – it takes a lot of the time and effort out of the data selection process.”

Testimonial, Catalog, Deque

Google Cloud partners with Labelbox to offer LLM human evaluation services

Google Cloud powers LLM evaluation service with Labelbox  

How a Fortune 500 creative tools company shipped generative AI across its products 

How John Deere's data engine automates data curation and labeling from 1B+ assets 

Turn data into insight and action

Our customers achieve breakthroughs with high-quality data

Try Labelbox today