Building the Bronze Table in Hive Metastore: Your First Layer of the Medallion
- Jun 2, 2025
In this week's cloud engineering series video, we start our journey through the Medallion Architecture using Hive Metastore by building the Bronze table, your foundational data layer.
Using Spark SQL and a Synthea-generated patient dataset, I walk you through the full table creation process. Think of the Bronze layer as your raw landing zone: everything from your source data goes here first.
This week's tutorial covers:
- How to load and register a raw dataset
- Structuring schemas in Spark SQL
- Common pitfalls to avoid when working in Hive Metastore
- A sneak peek at how this Bronze table feeds into your Silver and Gold tables
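To make the first two items concrete, here is a minimal sketch of one way to register a raw CSV as a Bronze table with Spark SQL. It is not the code from the video. The database name `bronze`, the table name `patients`, the path `/mnt/raw/synthea/patients`, and the helpers `bronze_ddl` and `register_bronze` are all hypothetical, and the column list is an assumed subset of Synthea's patient export. Every column is typed as `STRING` so the Bronze layer keeps the raw values as they arrived.

```python
# Sketch only: all names, paths, and columns below are illustrative assumptions.

# Assumed subset of Synthea patient columns, kept as STRING so nothing
# is cast or cleaned at the Bronze layer.
PATIENT_COLUMNS = [
    ("Id", "STRING"),
    ("BIRTHDATE", "STRING"),
    ("DEATHDATE", "STRING"),
    ("FIRST", "STRING"),
    ("LAST", "STRING"),
    ("GENDER", "STRING"),
]


def bronze_ddl(database, table, columns, csv_path):
    """Build a Spark SQL statement that registers CSV files as an external table."""
    cols = ",\n  ".join(f"`{name}` {dtype}" for name, dtype in columns)
    return (
        f"CREATE TABLE IF NOT EXISTS {database}.{table} (\n  {cols}\n)\n"
        "USING CSV\n"
        "OPTIONS (header = 'true')\n"
        f"LOCATION '{csv_path}'"
    )


def register_bronze(spark, database, table, columns, csv_path):
    """Create the database if needed, then register the table in the metastore.

    `spark` is an existing SparkSession (predefined in a Databricks notebook).
    """
    spark.sql(f"CREATE DATABASE IF NOT EXISTS {database}")
    spark.sql(bronze_ddl(database, table, columns, csv_path))


# Print the statement so it can be reviewed before it is run.
print(bronze_ddl("bronze", "patients", PATIENT_COLUMNS, "/mnt/raw/synthea/patients"))
```

In a notebook with an active session, `register_bronze(spark, "bronze", "patients", PATIENT_COLUMNS, "/mnt/raw/synthea/patients")` would run both statements. Because the table points at a `LOCATION`, dropping it removes only the metastore entry and leaves the raw files in place, which suits a landing zone.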
This is just *one approach* to Medallion design; there's a lot of flexibility depending on your team's needs. And yes, we'll be exploring **Delta Live Tables** soon too!
Follow along weekly as we explore how to build resilient, scalable, and governed cloud data platforms!
If your team is exploring how to modernize your data stack or scale your cloud analytics, reach out! I'd love to talk about how I can help.
#CloudEngineering #DataEngineering #Databricks #PySpark #ETL #ModernDataStack #DataOps #SyntheticData #Synthea #Azure #AWS #MedallionArchitecture




