#390: High performance and the lowest cost machine learning inference in the cloud with AWS Inferentia

FromAWS Podcast

Start listening View podcast show

#390: High performance and the lowest cost machine learning inference in the cloud with AWS Inferentia

FromAWS Podcast

ratings:

Length:

20 minutes

Released:

Sep 6, 2020

Format:

Podcast episode

Description

AWS Inferentia is custom built by AWS to provide high performance and lowest cost machine learning inference in the cloud. Amazon EC2 Inf1 instances, powered by AWS Inferentia, provide up to 3x higher throughput and up to 40% lower cost per inference over comparable GPU-based instances. In this podcast, learn more about AWS Inferentia, Inf1 instances, and how to get started with Inf1 instances. https://aws.amazon.com/machine-learning/inferentia/

Released:

Sep 6, 2020

Format:

Podcast episode

Titles in the series (100)

The AWS Podcast is the definitive cloud platform podcast for developers, dev ops, and cloud professionals seeking the latest news and trends in storage, security, infrastructure, serverless, and more. Join Simon Elisha and Jeff Barr for regular updates, deep dives and interviews. Whether you’re building machine learning and AI models, open source projects, or hybrid cloud solutions, the AWS Podcast has something for you.

Skip carousel

More Episodes from AWS Podcast

Skip carousel

Related podcast episodes

Skip carousel

Discover this podcast and so much more

#390: High performance and the lowest cost machine learning inference in the cloud with AWS Inferentia

#390: High performance and the lowest cost machine learning inference in the cloud with AWS Inferentia

Description

Titles in the series (100)

More Episodes from AWS Podcast

Related podcast episodes