top of page

šŸ„‰ Building the Bronze Table in Hive Metastore – Your First Layer of the Medallion šŸ› ļø

  • Jun 2, 2025
  • 1 min read

In this week’s cloud engineering series video, we start our journey through the Medallion Architecture using Hive Metastore by building the Bronze table—your foundational data layer.


Using Spark SQL and a Synthea-generated patient dataset, I walk you through the full table creation process. Think of the Bronze layer as your raw landing zone—everything from your source data goes here first.


This week’s tutorial covers:


🧾 How to load and register a raw dataset

šŸ“ Structuring schemas in Spark SQL

⚔ Common pitfalls to avoid when working in Hive Metastore

šŸ” A sneak peek at how this Bronze table feeds into your Silver and Gold tables


This is just *one approach* to Medallion design—there’s a lot of flexibility depending on your team’s needs. And yes, we’ll be exploring **Delta Live Tables** soon too!


šŸ“Š Follow along weekly as we explore how to build resilient, scalable, and governed cloud data platforms!


šŸ‘‰ If your team is exploring how to modernize your data stack or scale your cloud analytics—reach out! I’d love to talk about how I can help.




Ā 
Ā 
Ā 

Comments


Social

  • LinkedIn
  • GitHub
  • Threads

© 2025 Midwest Dataworks. All rights reserved.

Contact us:
midwestdataworks@gmail.com
Grand Rapids, MI

bottom of page