Unveiling the Secrets: External Tables vs. External Volumes in Azure Databricks Unity Catalog

Nnaemezue Obi-Eyisi
7 min readSep 25, 2023

While reviewing the Databricks documentation about Unity Catalog, I came across a concept that initially seemed a bit perplexing: the distinction between accessing data objects stored in our cloud storage using External Tables versus External Volumes. This inspired me to write an article exploring the different methods for accessing data from the enterprise data lake through Unity Catalog. In this article, I delve into various syntax and nuances, explaining how one can efficiently access the data, particularly for organizations that have already established a data lakehouse within their data lake.

Prerequisites for working with External tables, Managed tables, External Volumes

To create an external location, you must first establish a storage credential using the Databricks Access Connector. These steps are essential whether you are creating a Metastore or enabling Unity Catalog.

Default Metastore location for Managed Tables

By default, every Unity Catalog-enabled workspace comes with a preconfigured default Metastore location linked to the customer’s data lake (ADLS Gen 2) storage container. This location serves as the storage…

--

--

Nnaemezue Obi-Eyisi
Nnaemezue Obi-Eyisi

Written by Nnaemezue Obi-Eyisi

I am passionate about empowering, educating, and encouraging individuals pursuing a career in data engineering. Currently a Senior Data Engineer at Capgemini