Skip to main content

How to Set Up Collibra

Updated over a week ago

Setup takes about 15 minutes.

Introduction

Connecting Collibra to Foundational allows Foundational to export its lineage into Collibra so that you have the full end-to-end accurate lineage from Foundational within Collibra.

To make the connection, you need to:

  • Create S3 bucket in AWS for use by Collibra OpenLineage

  • Create AWS connection in Collibra and enable the OpenLineage capability for it

  • Connect to Foundational using your S3 bucket used for Collibra OpenLineage


Prerequisites

Ensure you have:

  • Access to AWS to create a bucket and providing access to it

  • Permissions in Collibra to create an AWS connection and adding OpenLineage capability

  • Added Foundational IP addresses to the allowlist, see the article Allowing IP access to Foundational.


Access permissions

The permissions you assign in the S3 bucket allow Foundational to:

  • Upload OpenLineage exports as files to the S3 bucket under the specified path


Create S3 Bucket for Collibra OpenLineage

  1. Follow the steps in this guide to create the S3 bucket with the proper permissions for Collibra.

  2. Create AWS IAM credentials for the S3 bucket that can be used by Foundational to write to the S3 bucket (see this guide to create access keys for IAM users and this guide to allow read and write access to AWS S3).


Configure Collibra OpenLineage via AWS

Step 1: Create AWS Connection In Collibra

Follow the steps in this guide to create AWS connection in Collibra for OpenLineage

Step 2: Add OpenLineage Capability in Collibra

Follow the steps in this guide to add OpenLineage capability in Collibra

In the configuration, set the following values:

  • Name: Foundational OpenLineage Sync

  • Source ID: foundational

  • Shared Storage Connection: the AWS connection you created for Collibra

  • Delete Raw Metadata After Processing: Enable this option

  • Processing Level: Choose Analyze

  • Active: Enable the checkbox


Step 3: Add Synchronization Schedule for OpenLineage

Follow the steps in this guide to schedule the OpenLineage capability configured.

While the guide is titled for Informatica PowerCenter, you can configure it for OpenLineage capability as well.


Connect Foundational to Collibra

  1. In Foundational, open the Connectors & Integrations page.

  2. In Catalogs, select the Collibra card and click Connect.

  3. A screen opens. Click Add Account.

  4. The initial setup screen opens. Click Start Setup.

  5. The Set Up New Connection screen opens.

    Enter the details:

    • Cloud Provider: Pick AWS.

    • S3 Bucket Path: Enter the full S3 bucket and the destination path. This is the S3 path used by Collibra OpenLineage capability.

    • AWS Access Key ID: The IAM Key ID created for Foundational to use for writing to the S3 bucket.

    • AWS Secret Access Key: The IAM secret access key created for Foundational to use for writing to the S3 bucket.

  6. Click Next.

  7. The final setup screen opens. To complete the connection, click Save.

    That’s it. Foundational is now connected to Collibra.

Did this answer your question?