Discover this podcast and so much more

Podcasts are free to enjoy without a subscription. We also offer ebooks, audiobooks, and so much more for just $11.99/month.

Data Lakehouses and Apache Hudi

Data Lakehouses and Apache Hudi

FromThe Cloudcast


Data Lakehouses and Apache Hudi

FromThe Cloudcast

ratings:
Length:
31 minutes
Released:
Feb 15, 2023
Format:
Podcast episode

Description

Kyle Weller (@KyleJWeller, Head of Product @onehousehq) talks about the latest trends in  OSS Data Lakes, Data Warehouses, and the evolution to “Data Lakehouses” with Apache HudiSHOW: 694CLOUD NEWS OF THE WEEK - http://bit.ly/cloudcast-cnotwNEW TO CLOUD? CHECK OUT - "CLOUDCAST BASICS"SHOW SPONSORS:Datadog Synthetic Monitoring: Frontend and Backend Modern MonitoringEnsure frontend issues don’t impair user experience by detecting user-facing issues with API and browser tests with a free 14 day Datadog trial. Listeners of The Cloudcast will also receive a free Datadog T-shirt. Solve your IAM mess with Strata's Identity Orchestration platformHave an identity challenge you thought was too big, too complicated, or too expensive to fix? Let us solve it for you! Visit strata.io/cloudcast to share your toughest IAM challenge and receive a set of AirPods ProHow to Fix the Internet (A new podcast from the EFF)SHOW NOTES:Onehouse (homepage)Onehouse raises $25M Series A fundingApache Hudi (homepage)Delta Lake (homepage)Apache Iceberg (homepage)​​Apache Hudi vs Delta Lake vs Apache Iceberg - Lakehouse Feature ComparisonTopic 1 - Welcome to the show. Tell us a little bit of your background, and where you focus your efforts at Onehouse?Topic 2 - Your focus is on an emerging open source project, Apache Hudi. Before we dive into the project and technologies, we’re always interested in the background of what drove the creation of new projects. What problems existed before Hudi? Topic 3 - Let’s dive into Hudi. Data lakes, Delta Lakes, Lake houses, Icebergs. What is going on with all these water metaphors?  Topic 4 - Hudi is focused on streaming data lakes. What are some of the things (types of applications) that need a streaming data lake? Where do transactions come into play? Where do data warehouse capabilities come into play?Topic 5 - Stitching together open source projects and platforms can be complicated. How does the Onehouse platform simplify all of this for either data scientists or platform teams?Topic 6 - What are some examples of how companies are using Onehouse and Hudi today? FEEDBACK?Email: show at the cloudcast dot netTwitter: @thecloudcastnet
Released:
Feb 15, 2023
Format:
Podcast episode

Titles in the series (100)

The Cloudcast is the industry's leading, independent Cloud Computing podcast. Since 2011, co-hosts Aaron Delp & Brian Gracely have interviewed technology and business leaders that are shaping the future of computing. Topics will include Cloud Computing | Open Source | AWS | Azure | GCP | Serverless | DevOps | Big Data | ML | AI | Security | Kubernetes | AppDev | SaaS | PaaS | CaaS | IoT.