Pioneering AI Models for Regional Languages // Aleksa Gordić // #203

From MLOps.community


Length: 64 minutes
Released: Jan 12, 2024
Format: Podcast episode

Description

Aleksa Gordić is an ex-Google DeepMind / Microsoft ML engineer currently working on non-English LLMs at OrtusAI, open-sourcing Meta's NLLB (no language left behind) project and YugoGPT.

MLOps podcast #203 with Aleksa Gordić, Founder of OrtusAI, Pioneering AI Models for Regional Languages.

// Abstract
Dive deep into Aleksa's work on YugoGPT, a language model serving Serbian, Croatian, Bosnian, and Montenegrin dialects, and the need for multilingual AI development that it highlights.

Explore the unique language dynamics in the Balkans and Eastern Europe, the potential business opportunities around multilingual models, and the challenges in deploying large language models. Aleksa shares his experience with vision and image models, his collaborations with key tech players, and his use of advanced technologies. Hear about Aleksa Gordić's journey of being active and visible in the tech community and his insights into the world of machine learning and AI. Prepare to have your thinking challenged and horizons widened as we converse about the intriguing and complex world of MLOps.

// Bio
Working on non-English LLMs at OrtusAI, open-sourcing Meta's NLLB (no language left behind) project. Worked at DeepMind on the Flamingo project as a research engineer. Worked at Microsoft on the HoloLens 2 project & next-gen mixed reality glasses.

// MLOps Jobs board
https://mlops.pallet.xyz/jobs

// MLOps Swag/Merch
https://mlops-community.myshopify.com/

// Related Links
Website: https://gordicaleksa.com/
https://github.com/gordicaleksa - I build stuff :)
https://discord.com/invite/peBrCpheKE - active AI Discord server (~6000); I bring the best AI researchers in the world to give talks (James Betker, DALL-E 3 author; Tri Dao, Flash Attention; etc.)
https://gordicaleksa.medium.com/how-i-got-a-job-at-deepmind-as-a-research-engineer-without-a-machine-learning-degree-1a45f2a781de - how I landed a job at DeepMind (and a couple more potentially interesting writings)
Aleksa Gordić's The AI Epiphany YouTube channel: https://www.youtube.com/channel/UCj8shE7aIn4Yawwbo2FceCQ/videos

--------------- ✌️Connect With Us ✌️ -------------
Join our Slack community: https://go.mlops.community/slack
Follow us on Twitter: @mlopscommunity
Sign up for the next meetup: https://go.mlops.community/register
Catch all episodes, blogs, newsletters, and more: https://mlops.community/

Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/
Connect with Aleksa on LinkedIn: https://www.linkedin.com/in/aleksagordic/

Timestamps:
[00:00] Aleksa's preferred coffee
[00:17] Takeaways
[02:51] Humming the GPUs
[06:23] Built Chrome extension for communicating with videos
[08:04] Rig Doubles Throughput Time
[09:32] Vector databases advice
[10:38] Learning from experts, connecting, and gathering insights.
[13:47] Zero to Hero for MLOps
[15:37] Serendipitous moments
[17:52] Depth Over Breaking News
[19:50] Trust in GPT Content
[22:22] Exam Challenges and AI
[26:53] YugoGPT
[31:41] WandB Ad
[33:33] Linguistic Mysteries
[34:52] No Language Left Behind project (NLLB project)
[36:53] YugoGPT Development Overview
[37:49] NLLB vs YugoGPT
[39:35] YugoGPT parameters
[41:16] Opportunities for unsupported languages
[43:08] Diffusion model
[44:39] Generative AI with image generation models
[47:45] AI Challenges and Excitement
[50:32] Challenges in different alphabet characters
[52:10] Need a co-founder
[56:05] Career transition and entrepreneurial mindset
[1:00:20] Big Tech salary misconceptions
[1:03:02] Inspiring wrap up

Titles in the series (100)

Weekly talks and fireside chats about everything that has to do with the new space emerging around DevOps for Machine Learning aka MLOps aka Machine Learning Operations.