Data row filters

Describes how to create Catalog filters that return data rows based on data row attribute values, such as global key and last activity.

Data row filters select data rows according to the values of various attributes associated with the data row, including External ID, Data row ID, Global key, Created at, and Last activity at.

Here, we describe each attribute, available search options, and other useful details.

To specify a data row filter

To set a Catalog data row filter:

  1. From Catalog, select Data row from the Search menu.

  2. Choose an attribute from the Attribute menu.

  3. Select the search operator

  4. Enter one or more search values.

Searchable attributes

The following sections describe the searchable attributes, the conditions each supports, and provides additional details and context to help you use them effectively.

External ID

External IDs are specified when importing data in a dataset. Not every data row has an external ID; they’re generally set to upload filenames.

To see if a data row has an external ID, open the data row details and review the Data panel.

External ID filters support the following conditions:

ConditionDescription
is anyReturns data rows with one of the specified search values.
is not anyReturns data rows that do not match the specified search values.
containsReturns data rows that contain a search value (not case-sensitive).
does not containReturns data rows that do not contain a search value.
starts withReturns data rows where External ID starts with a search value.
ends withReturns data rows with External ID values ending with a search value.

The use of external IDs as identifiers has been deprecated in favor of global keys. If your workflows currently rely on external IDs, you should transition to global keys.

Data row ID

Data row IDs are assigned when data is added to a dataset; they're read-only and meant to be unique within an organization.

To view the data row ID, review the Data panel of the data row details view.

Data row ID filters support the following conditions:

ConditionDescription
is one ofReturns data rows matching a specified search value.
is not one ofReturns data rows that do not match any specified search value.

Global key

Global keys uniquely identify data rows and can be set when data is loaded into a dataset or updated later (using the SDK).

Global key filters support the following conditions:

ConditionDescription
is anyReturns data rows with global keys matching a specified search value.
is not anyReturns data rows that do not match a specified search value.
containsReturns data rows with global keys containing a search value (not case-sensitive).
does not containReturns data rows with global keys that do not contain a search value.
starts withReturns data rows with global keys that start with a search value.
ends withReturns data rows with global keys that end with a search value.

To view the global key for a data row, use the Data panel in the data row detail view.

Created at

The Create at attribute is a DateTime value (GMT) set when the data row is imported into a dataset.

ConditionDescription
CustomReturns data rows created between two specified DateTime values.
Last X hoursReturns data rows created within the specified number of hours (integer).
Last X daysReturns data rows created within the specified number of days (integer).
Last 7 daysReturns data rows created within the last week.
Last 30 daysReturns data rows created within the last month (roughly).
Last 90 daysReturns data rows created within the last three months (roughly)
Last 365 daysReturns data rows created within the last year (roughly).
Month to dateReturns data rows created within the current month.
Quarter to dateReturns data rows created within the current quarter.
Year to dateReturns data rows created within the current calendar year.

To view the data row's creation date, use the Data panel of the data row detail view.

Last activity at

The Last activity at attribute is a timestamp that reflects the most recent action involving the data row.

This value changes frequently, potentially more often than you might think. It changes when:

  • Changes update a data row's: data (rowData), external ID (externalId), or global key (globalKey)
  • Changes are made to annotations, attachments, embeddings, or metadata
  • Data rows are added to batches
  • Data row labeling tasks change
  • Labels, reviews, comments, or issues are added to a project containing the data row

Data rows in multiple projects update Last activity at when such changes occur in any project containing the data rows.

Because Last activity at is affected by so many actions, it may not be a good candidate for your use case.

Last activity at filters support the following conditions:

ConditionDescription
CustomReturns data rows with activity between two specified DateTime values.
Last X hoursReturns data rows with activity with the specified number of hours (integer).
Last X daysReturns data rows with activity with the specified number of days (integer).
Last 7 daysReturns data rows with activity within the last week.
Last 30 daysReturns data rows with activity within the last month (roughly).
Last 90 daysReturns data rows with activity within the last three months (roughly)
Last 365 daysReturns data rows with activity within the last year (roughly).
Month to dateReturns data rows with activity within the current month.
Quarter to dateReturns data rows with activity within the current quarter.
Year to dateReturns data rows with activity within the current calendar year.

The Last activity at value is available only through the SDK.