Discover this podcast and so much more

Podcasts are free to enjoy without a subscription. We also offer ebooks, audiobooks, and so much more for just $11.99/month.

Clean Up Your Data Using Scalable Entity Resolution And Data Mastering With Zingg

Clean Up Your Data Using Scalable Entity Resolution And Data Mastering With Zingg

FromData Engineering Podcast


Clean Up Your Data Using Scalable Entity Resolution And Data Mastering With Zingg

FromData Engineering Podcast

ratings:
Length:
47 minutes
Released:
Nov 7, 2022
Format:
Podcast episode

Description

Despite the best efforts of data engineers, data is as messy as the real world. Entity resolution and fuzzy matching are powerful utilities for cleaning up data from disconnected sources, but it has typically required custom development and training machine learning models. Sonal Goyal created and open-sourced Zingg as a generalized tool for data mastering and entity resolution to reduce the effort involved in adopting those practices. In this episode she shares the story behind the project, the details of how it is implemented, and how you can use it for your own data projects.
Released:
Nov 7, 2022
Format:
Podcast episode

Titles in the series (100)

Weekly deep dives on data management with the engineers and entrepreneurs who are shaping the industry