Unleash The Power Of Dataframes At Any Scale With Modin

From The Python Podcast.__init__


Length: 39 minutes
Released: Jul 22, 2021
Format: Podcast episode

Description

When you start working on a data project, there are always unknown factors that you have to explore. One of those is the total volume of data that you will eventually need to handle, along with the speed and scale at which it will need to be processed. If you optimize for scale too early, the complexities of distributed systems create a high barrier to entry, but if you invest in a lot of engineering up front, it can be challenging to refactor for scale later. Modin is a project that aims to remove that decision by acting as a drop-in replacement for Pandas, so that your existing code can scale across CPU cores or across a cluster of machines. In this episode, Devin Petersohn explains why he started working on this problem, how Modin is architected to allow for a smooth escalation from small to large volumes of data and compute, and how you can start using it today to accelerate your Pandas workflows.

Titles in the series (100)

The podcast about Python and the people who make it great