Labelbox•March 2, 2023
One of the biggest challenges for AI teams is being able to select the right data to train models or for your use case. This month, we are releasing several updates to help you better visualize and explore your data so that you can make more informed decisions on how to organize and prioritize your growing data.
Like many of you, we’ve encountered the challenges of dealing with public datasets — whether you just want to easily browse them to see the data or need to find data within specific parameters or a query. Leveraging Catalog as its foundation, you can now search, explore, and browse large-scale datasets.
Discover, browse, and gain inspiration from innovative AI datasets that can inform your own pipeline. You can rapidly browse, visualize, organize and analyze petabytes of publicly-available datasets across diverse use cases and modalities.
To learn more about public datasets, you can read our latest blog post.
In addition to searching for data rows in Catalog by metadata, annotations, similarity, or dataset, you can now use natural language to find a specific subset of data rows. Rather than flying blind, this lets you easily:
By leveraging natural language search and other filters in Catalog, you can:
Capture the complexity of your use case: Layer natural language search with other structured search filters (such as metadata, annotations, media type, etc.) to better refine and organize your data.
Explore your ever-growing store of data: As your data grows, you can leverage natural language search and other filters in Catalog to query tens of millions of assets at scale.
Quickly gain insights from your data: Return search results on 10 million assets in as quick as 3 seconds.
Powered by CLIP embeddings, simply type an English text query to bring up all instances of images that match the description of the phrase:
You can try natural language search on your own data in Catalog or on one of the available public datasets today.
Making sense of your data across media types and datasets can be a challenging task. You can now use the “Find text” filter in Catalog to surface all instances of exact and partial matches to a word or phrase.
You can try to find specific instances of text on your own data in Catalog or on one of the available public datasets today.