Skip to content

Lineage

Lineage shows how data and control flows from its origins to its various destinations. This includes details of the processing along the way. It is used to understand:

  • whether the data used in reports and analytics models has come from the correct sources and has passed through the correct processing (known as traceability of data).

  • what would be the impact on downstream processing and consumers if something was changed (known as impact analysis).

  • whether the operational processes that implement the data flows are executing correctly (known as governance by expectation).

Lineage Management describes how lineage is collected, managed and used in Egeria.

Further information

The open metadata types for lineage relationships are found in the following models:

  • Model 0210 for the DataSetContent relationship.
  • Model 0737 for the ImplementedBy relationship
  • Model 0750 for the DataFlow, ProcessCall and ControlFlow relationships.
  • Model 0755 for the UltimateSource and UltimateDestination relationships.
  • Model 0760 for the BusinessLineage classification.
  • Model 0770 for the LineageMapping and DataMapping relaitonships.

Also see how to set up an information supply chain.


Raise an issue or comment below