Data lineage
Data lineage is the process of tracking data as it is moved and used by different software tools. Use Manta Data Lineage to increases data pipeline transparency so you can determine data accuracy throughout the models and systems.
- Required services
- IBM Knowledge Catalog with IBM Manta Data Lineage service enabled. For more information how to enable data lineage, see Enable data lineage.
- Required permissions
-
- Manage data lineage or Access data lineage permission.
You can use data lineage in these ways:
- You can understand your data through visual representation on the lineage graph.
- You can track your data and learn where it came from, how it was transformed, and where the data was moved.
- You can check your data in one view for quality scores or transformations.
Before you start using Manta Data Lineage, you need to perform additional tasks to prepare your data. See Preparing data for data lineage.
To get started quickly, choose a learning path. See Quick start: Track data lineage.
Data lineage roles and permissions
How you can use the data lineage depends on your assigned roles and permissions. To determine your roles and permissions, see Determining your roles and permissions.
Permission | Tasks |
---|---|
Manage data lineage | - Run metadata import jobs - Publish assets from metadata jobs to projects or catalogs - View monitor and manage page - Delete lineage from monitor and manage page - View lineage repository page - View lineage graphs for all assets in the repository - Add or delete external agents - Update alias mappings and file system mappings - Select Cloud Object Storage to enable lineage |
Access data lineage | - View lineage repository - View lineage graphs for all assets in the repository |
Role | Permission |
---|---|
Lineage Administrator | - Manage data lineage - Access data lineage - Create data source definitions |
Supported data sources for data lineage
A list of all supported data sources for data lineage. For more information about supported data sources, see Supported data sources for curation and data quality.
Database connectors
Connectors and other data sources specific to metadata import
- IBM DataStage for Cloud Pak for Data
- InfoSphere DataStage
- Microsoft Power BI (Azure)
Learn more
Parent topic: Data governance