Discover this podcast and so much more

Podcasts are free to enjoy without a subscription. We also offer ebooks, audiobooks, and so much more for just $11.99/month.

Creating Shared Context For Your Data Warehouse With A Controlled Vocabulary

Creating Shared Context For Your Data Warehouse With A Controlled Vocabulary

FromData Engineering Podcast


Creating Shared Context For Your Data Warehouse With A Controlled Vocabulary

FromData Engineering Podcast

ratings:
Length:
61 minutes
Released:
Jan 2, 2022
Format:
Podcast episode

Description

Communication and shared context are the hardest part of any data system. In recent years the focus has been on data catalogs as the means for documenting data assets, but those introduce a secondary system of record in order to find the necessary information. In this episode Emily Riederer shares her work to create a controlled vocabulary for managing the semantic elements of the data managed by her team and encoding it in the schema definitions in her data warehouse. She also explains how she created the dbtplyr package to simplify the work of creating and enforcing your own controlled vocabularies.
Released:
Jan 2, 2022
Format:
Podcast episode

Titles in the series (100)

Weekly deep dives on data management with the engineers and entrepreneurs who are shaping the industry