Apache Atlas

Apache Atlas is an open-source project from the Apache Foundation that aims to optimise data governance and metadata management. The framework allows metadata types management, data classification, data lineage, metadata and data search, as well as data security and masking.

All of these features make Atlas a good framework for strenghtening understanding of data governance by making it easier and more user- friendly.

Atlas was donated to the Apache Foundation by Hortonworks in 2015 and has been a top-level project since June 2017.

Related articles

Cloudera CDP and Cloud migration of your Data Warehouse

Cloudera CDP and Cloud migration of your Data Warehouse

Categories: Big Data, Cloud Computing | Tags: Azure, Cloudera, Data Hub, Data Lake, Data Warehouse

While one of our customer is anticipating a move to the Cloud and with the recent announcement of Cloudera CDP availability mi-september during the Strata conference, it seems like the appropriateā€¦

David WORMS

By David WORMS

Dec 16, 2019

Introduction to OpenLineage

Introduction to OpenLineage

Categories: Big Data, Data Governance, Infrastructure | Tags: Data Engineering, Infrastructure, Atlas, Data Lake, Data lakehouse, Data Warehouse, Data lineage

OpenLineage is an open-source specification for data lineage. The specification is complemented by Marquez, its reference implementation. Since its launch in late 2020, OpenLineage has been a presenceā€¦

Christophe PARREIRA

By Christophe PARREIRA

Dec 19, 2023

Canada - Morocco - France

We are a team of Open Source enthusiasts doing consulting in Big Data, Cloud, DevOps, Data Engineering, Data Scienceā€¦

We provide our customers with accurate insights on how to leverage technologies to convert their use cases to projects in production, how to reduce their costs and increase the time to market.

If you enjoy reading our publications and have an interest in what we do, contact us and we will be thrilled to cooperate with you.

Support Ukrain