The conceptual architecture of the hub-spoke pattern

To Nha Notes | July 22, 2022, 10:33 a.m.

The hub-spoke pattern

The hub to spoke data sharing workflow

  1. First, the data publishers, who own the data, publish the metadata of their data lakehouse node to the data catalog.
  2. The hub node steward reviews the published catalog to ensure that it is aligned with the governance framework of the hub node.
  3. The steward then approves or rejects the published catalog contents. If the contents are approved, the catalog is updated with the metadata.
  4. When a spoke data requestor requires data from the hub node, the data requestor browses the hub data catalog to identify the data of interest.
  5. Once the data of interest is identified, the data requestor requests the data from the hub through Data Share Service.
  6. The request for data access is routed to the data publisher, who reviews it and either approves or rejects it.
  7. If the request is approved, the data publisher shares the data with the data requestor through Data Share Service, which enables data sharing between the hub and the spoke nodes. The terms of data usage are also clarified.
  8. Finally, the data requestor reviews the terms of data usage. Upon accepting the terms, the data requestor can start consuming the data.
  9. The data publisher continuously monitors the data usage pattern through Data Share Service.
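
The nine steps above can be read as a small state machine, so a rough sketch in Python may help make the hand-offs concrete. This is only an illustration under assumed names: the `Hub` class, its method names, the `RequestState` values, and the in-memory dictionaries are all hypothetical and do not come from the book or from any real Data Share Service API.

```python
from dataclasses import dataclass, field
from enum import Enum, auto


class RequestState(Enum):
    """Hypothetical lifecycle of a single data access request."""
    PENDING = auto()   # waiting for the data publisher's decision
    APPROVED = auto()  # shared, but terms of usage not yet accepted
    REJECTED = auto()
    ACTIVE = auto()    # terms accepted, data can be consumed


@dataclass
class Hub:
    """Toy in-memory stand-in for the hub catalog and the data share service."""
    submissions: dict = field(default_factory=dict)  # dataset -> publisher, awaiting review
    catalog: dict = field(default_factory=dict)      # dataset -> publisher, steward-approved
    requests: dict = field(default_factory=dict)     # (requestor, dataset) -> RequestState
    usage_log: list = field(default_factory=list)    # what the publisher monitors

    def publish(self, publisher, dataset):
        # Step 1: the publisher submits metadata for review.
        self.submissions[dataset] = publisher

    def steward_review(self, dataset, approve):
        # Steps 2-3: the steward approves or rejects; only approved
        # metadata ends up in the catalog.
        publisher = self.submissions.pop(dataset)
        if approve:
            self.catalog[dataset] = publisher

    def request_access(self, requestor, dataset):
        # Steps 4-5: the requestor can only ask for data listed in the catalog.
        if dataset not in self.catalog:
            raise KeyError(f"{dataset!r} is not in the hub catalog")
        self.requests[(requestor, dataset)] = RequestState.PENDING

    def publisher_review(self, requestor, dataset, approve):
        # Steps 6-7: the publisher decides on a pending request.
        key = (requestor, dataset)
        if self.requests.get(key) is not RequestState.PENDING:
            raise ValueError("no pending request to review")
        self.requests[key] = (
            RequestState.APPROVED if approve else RequestState.REJECTED
        )

    def accept_terms(self, requestor, dataset):
        # Step 8: consumption is unlocked only after the terms are accepted.
        key = (requestor, dataset)
        if self.requests.get(key) is not RequestState.APPROVED:
            raise PermissionError("request has not been approved")
        self.requests[key] = RequestState.ACTIVE

    def consume(self, requestor, dataset):
        # Steps 8-9: each read is logged so the publisher can monitor usage.
        key = (requestor, dataset)
        if self.requests.get(key) is not RequestState.ACTIVE:
            raise PermissionError("terms of data usage not accepted")
        self.usage_log.append(key)


# Walk one dataset through the whole workflow (names are made up).
hub = Hub()
hub.publish("sales-team", "orders")
hub.steward_review("orders", approve=True)
hub.request_access("spoke-analytics", "orders")
hub.publisher_review("spoke-analytics", "orders", approve=True)
hub.accept_terms("spoke-analytics", "orders")
hub.consume("spoke-analytics", "orders")
```

Keeping the steward's catalog approval separate from the publisher's access approval mirrors the two distinct gates in the workflow: one governs what is discoverable, the other governs who may consume it.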

Image source: the book Data Lakehouse in Action