Collaboration & evaluation for LLM apps
Length:
46 minutes
Released:
Jan 23, 2024
Format:
Podcast episode
Description
Small changes in prompts can create large changes in the output behavior of generative AI models. Add to that the uncertainty around proper evaluation of LLM applications, and you have a recipe for confusion and frustration. Raza and the Humanloop team have been diving into these problems, and, in this episode, Raza helps us understand how non-technical prompt engineers can productively collaborate with technical software engineers while building AI-driven apps.