Leveraging your Apache Atlas estate

Work In Progress

This page is part of the new function for version 6.0 of Egeria. We aim to release this version in March 2026, so watch this space - new content is on its way :)

Apache Atlas is a metadata catalog originally designed for the Hadoop ecosystem. It offers integration services called Hooks and Bridges that capture the schemas and data sets of data platforms such as Apache Hive, Apache HBase and the Apache Hadoop Distributed File System (HDFS), along with the processes that create and maintain data sets on these platforms. The metadata descriptions of these data sets and processes are linked together using lineage relationships, making it possible to trace how data flows through a Hadoop deployment. Apache Atlas also supports glossaries and a tagging system that can be used both in searches and to control access to data through Apache Ranger (using the TagSync integration).
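The idea of lineage relationships linking data sets and processes can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the process names, data set names and the `upstream` helper are hypothetical, and the dictionary layout is a simplification rather than the Apache Atlas type model or REST API.

```python
from collections import deque

# Hypothetical, simplified lineage model (not the Apache Atlas type system):
# each process records the data sets it reads and the data sets it writes.
PROCESSES = {
    "load_raw_orders": {
        "inputs": ["hdfs:/landing/orders.csv"],
        "outputs": ["hive:raw.orders"],
    },
    "clean_orders": {
        "inputs": ["hive:raw.orders"],
        "outputs": ["hive:curated.orders"],
    },
    "orders_report": {
        "inputs": ["hive:curated.orders", "hive:curated.customers"],
        "outputs": ["hive:reports.daily_orders"],
    },
}


def upstream(data_set, processes):
    """Return every data set that directly or indirectly feeds `data_set`."""
    # Map each data set to the inputs of the processes that write to it.
    producers = {}
    for process in processes.values():
        for output in process["outputs"]:
            producers.setdefault(output, []).extend(process["inputs"])

    # Breadth-first walk backwards along the lineage relationships.
    seen = set()
    queue = deque([data_set])
    while queue:
        current = queue.popleft()
        for source in producers.get(current, []):
            if source not in seen:
                seen.add(source)
                queue.append(source)
    return seen


if __name__ == "__main__":
    for name in sorted(upstream("hive:reports.daily_orders", PROCESSES)):
        print(name)
```

Walking the relationships backwards from a report table in this way is the kind of question lineage metadata is meant to answer: which source files and intermediate tables contributed to a given data set.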

In recent years, Apache Atlas has been embedded in popular data catalogs such as Microsoft Purview and Atlan, increasing interest in integrating with this metadata catalog.

Further information
  • Apache Atlas
  • Comparison between Apache Atlas and Egeria
  • Apache Atlas Connectors from Egeria - The Apache Atlas connectors provide a suite of function that integrates an Apache Atlas server into the open metadata ecosystem.
      • Apache Atlas REST Connector provides a Java interface to the Apache Atlas REST APIs.
      • Apache Atlas Survey Service builds a discovery analysis report on the contents of an Apache Atlas metadata repository.
      • Apache Atlas Integration Connector synchronizes metadata between Apache Atlas and the open metadata ecosystem.