Apache Atlas
is open-source data
governance
and
metadata
management tool.
Apache Atlas
can be easily
integrated
with popular big data tools like
Hadoop
,
spark
Kafka
,
hive
, etc.
It allows data engineers to i
ngest, classify, discover, and govern data
assets from various data sources.
Atlas supports
Lineage
and has a simple UI to view the lineage of data as it moves through various processes
Some of the Capabilities of Apache Atlas are:- 1. Data Classification 2.Search & Lineage 3. Centralized Metadata 4. Security and Policy Engine
It has many pre-defined
types
, and users can add new types based on their requirements.
It supports a
SQL
-like
query engine
to search entities.
Apache atlas also provides many
REST APIs
to access and update lineage
Click to learn more