The Enterprise Data Catalog

To Nha Notes | June 9, 2023, 11:58 a.m.

You must be aware that data catalogs that crawl IT landscapes (i.e., that pull, not push) come with standard connectors to only a selected set of data sources. So, not everything will be crawlable by the data catalog. Therefore, sometimes, useful assets have to be manually entered, by stewards or other subject-matter experts.

 Table in a data source and how it’s visible as an asset in the data catalog

The data observability component is not necessarily an integrated part of data catalogs, but an add-on component. A streaming-based, open source data catalog is DataHub, with the commercial variant Acryl Data. Other players in the data observability space are AcceldataAnomaloAtaccamaBigeyeKensu, and Monte Carlo.

Reference

https://data.world/

https://datahubproject.io/

https://www.acceldata.io/

https://www.anomalo.com/

https://www.acryldata.io/

https://www.bigeye.com/

https://www.kensu.io/

https://www.montecarlodata.com/