Wherobots Data Federation
Wherobots Data Federation enables users to query external data sources in place, eliminating the need for data migration.
This is achieved by creating a Connection, which stores the access credentials and path to the data source.
Once connected, Wherobots presents the external data as a foreign catalog in the Data Hub. You can then query this external catalog's data in a Wherobots Notebook and write your results.
What is a foreign catalog?
When you connect to a foreign catalog, Wherobots acts as a client and data governance takes place at the data's source.
With a foreign catalog you can mirror and manage data from external data systems within Wherobots.
- External Source of Truth: The external system (such as Unity Catalog) is the authoritative source for metadata. Wherobots does not own or control that metadata.
- Read-Only Connection: Wherobots reads metadata from the foreign catalog to understand available tables and schemas but does not write to that specific table or manage its lifecycle.
- No Data Discovery: The external system (e.g., Databricks Unity Catalog) is responsible for discovering data and keeping its own catalog up to date. Wherobots simply reflects the current state of that external catalog.
Benefits of Wherobots Data Federation¶
- Zero-Copy architecture: Read tables managed by external cloud providers, like Databricks, without moving or duplicating data.
- Secure federation: Set up access to an external data source once by creating a single, secure connection.
- Get a unified view of all your Organization's data: All connected tables appear side-by-side in the Wherobots Data Hub. This allows your team to browse, query, and even join data from different sources as if it were all local, dramatically simplifying data discovery and analysis.
- Accelerated innovation on the lakehouse: Take spatial ideas to market faster using Wherobots' 300+ spatial functions, raster inference, and compute for physical world data on your Unity Catalog data.
graph LR
Z["<b>Wherobots Data Federation</b><br><br>A collection of features that enables users to query external data sources in place, eliminating the need for data migration."] --> A
A["<b>Wherobots Data Hub</b><br><br>Presents external data as a catalog once a <b>Connection</b> is established.<br><br><i>A <b>Connection</b> stores the access credentials and path to the data source.</i>"] --> D
D("<b>Wherobots Notebook/Job Run/SQL Session</b><br><br>The external catalog's data can be read and queried directly in a Wherobots Notebook<br><br>You can write results to either a Wherobots-managed table or back to a table in the original source platform, such as Databricks.")
Create a catalog Connection¶
To create a catalog Connection in the Wherobots Data Hub, select your data source from the list below and follow the step-by-step guide: