Version: Atlas v4.5

Configuring Data Hub

Configure Data Hub from Settings > Element Configuration. The Data Hub configuration area combines dataset management and utilization setup, so Atlas Admins can validate required settings, configure metadata fields, and run historical backfills in a single workflow.

For broader setup details, see Configuring Atlas.

Required Configuration

Configure the following items before Data Hub is fully available:

  • The appropriate metric indexes for License Usage and Data Utilization
  • The permissions required to read the configured indexes
  • The supporting access needed for Atlas to retrieve and display Data Hub information

The configuration area validates both the selected index setup and the current user's access so Atlas Admins can quickly identify issues that need attention.
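
The two checks described above (an index is selected for each role, and the current user can read it) can be pictured with a small sketch. This is only an illustration of the logic, not Atlas code: the function name `validate_data_hub_config`, the dictionary layout, and the index names in the example are all hypothetical, and Atlas performs its own validation in the configuration area.

```python
# Illustrative only: the two roles come from this page; everything else is assumed.
REQUIRED_INDEX_ROLES = ("License Usage", "Data Utilization")


def validate_data_hub_config(selected_indexes, readable_indexes):
    """Return a list of readable issue strings; an empty list means both checks passed.

    selected_indexes: dict mapping a role name to the chosen index name
                      (None or "" when nothing is selected).
    readable_indexes: set of index names the current user is able to read.
    """
    issues = []
    for role in REQUIRED_INDEX_ROLES:
        index = selected_indexes.get(role)
        if not index:
            # Check 1: an index must be selected for every required role.
            issues.append(f"{role}: no metric index selected")
        elif index not in readable_indexes:
            # Check 2: the current user must have read access to that index.
            issues.append(f"{role}: current user cannot read index '{index}'")
    return issues


if __name__ == "__main__":
    # Hypothetical index names, used only to exercise the function.
    selected = {"License Usage": "example_license_metrics", "Data Utilization": None}
    readable = {"main"}
    for issue in validate_data_hub_config(selected, readable):
        print(issue)
```

An empty result corresponds to a configuration with no outstanding index or permission issues. Any returned string corresponds to an item that an Atlas Admin would need to resolve.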

Data Hub Settings

The Data Hub configuration area includes the following settings:

  • Custom Fields: Add metadata fields that appear on datasets alongside the default definition fields. These fields help teams capture environment-specific ownership, classification, and governance details.
  • Backfill Data: Retroactively collect historical utilization activity so Data Hub can report on searches and dashboards that ran before tracking was enabled.

Configure Data Hub

  1. Open Settings and select Element Configuration.
  2. Locate the Data Hub configuration area.
  3. Select the appropriate metric indexes for License Usage and Data Utilization.
  4. Review the validation results and resolve any index or permission issues shown in the status indicators.
  5. Add any custom fields your team wants to use for dataset definitions.
  6. Save the configuration changes.

Run a Backfill

Use Backfill Data when you want Data Hub to collect historical utilization information rather than wait for new activity to accumulate.

  1. Open Backfill Data from the Data Hub configuration area.
  2. Select the time range to process.
  3. Review the estimated processing time.
  4. Start the backfill and allow it to complete before reviewing older utilization data in Data Hub.
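
To reason about how the selected time range affects processing time, it can help to think of a backfill as a series of smaller windows. The sketch below is a rough planning aid under stated assumptions: it assumes the range is processed in fixed-size windows and that each window costs about the same. Atlas shows its own estimate in the Backfill Data dialog, and this page does not describe how that estimate is computed. The function names and the 90-second figure are hypothetical.

```python
from datetime import datetime, timedelta


def split_into_windows(start, end, window=timedelta(days=1)):
    """Split the half-open range [start, end) into consecutive windows.

    Every window is at most `window` long; the last one may be shorter.
    """
    if window <= timedelta(0):
        raise ValueError("window must be positive")
    windows = []
    cursor = start
    while cursor < end:
        upper = min(cursor + window, end)
        windows.append((cursor, upper))
        cursor = upper
    return windows


def estimate_processing_time(windows, seconds_per_window):
    """Linear estimate that assumes every window takes the same time to process."""
    return timedelta(seconds=len(windows) * seconds_per_window)


if __name__ == "__main__":
    # Example: one week of history, processed one day at a time.
    start = datetime(2024, 1, 1)
    end = datetime(2024, 1, 8)
    windows = split_into_windows(start, end)
    print(len(windows), "windows")
    # 90 seconds per window is an assumed figure, not a measured one.
    print("estimated time:", estimate_processing_time(windows, seconds_per_window=90))
```

Under these assumptions, the estimate grows in proportion to the length of the selected range.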