Speed Up Your Analytics With The Alluxio Distributed Storage System

From Data Engineering Podcast


Length: 60 minutes
Released: Feb 19, 2019
Format: Podcast episode

Description

Distributed storage systems are the foundational layer of any big data stack. There are a variety of implementations that support different specialized use cases and come with associated tradeoffs. Alluxio is a distributed virtual filesystem that integrates with multiple persistent storage systems to provide a scalable, in-memory storage layer, so that computational workloads can scale independently of the size of your data. In this episode, Bin Fan explains how he got involved with the project, how it is implemented, and the use cases for which it is particularly well suited. If your storage and compute layers are too tightly coupled and you want to scale them independently, then Alluxio is the tool for the job.

Titles in the series (100)

Weekly deep dives on data management with the engineers and entrepreneurs who are shaping the industry