Discover this podcast and so much more

Podcasts are free to enjoy without a subscription. We also offer ebooks, audiobooks, and so much more for just $11.99/month.

E6: The Computer Vision Revolution with Junnan Li and Dongxu Li of BLIP and BLIP2

E6: The Computer Vision Revolution with Junnan Li and Dongxu Li of BLIP and BLIP2

From"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis


E6: The Computer Vision Revolution with Junnan Li and Dongxu Li of BLIP and BLIP2

From"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

ratings:
Length:
83 minutes
Released:
Mar 9, 2023
Format:
Podcast episode

Description

(00:00) Preview (01:17) Sponsor (01:35) Intro (05:50) Convergence of AI techniques (07:33) Evolution of BLIP to BLIP-2 (08:12) How BLIP-2 unlocked multimodal functionality(12:43) The size, training dynamics, and optimization function of BLIP (20:15) Practical/Business applications of BLIP (29:43) Efficiency of BLIP-2 compared to other models (41:52) Two-stage pre-training (47:11) Architecture of Blip-2’s connector model (58:52) Language models as the executive function of the brain (01:07:32) Vision for an ultimate multimodal system and democratized pre-training for models (01:12:59) Useful AI tools in these researchers’ day-to-day (01:14:56) Upcoming projects *Thank you Omneky for sponsoring The Cognitive Revolution. Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work, customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off.  Twitter:@CogRev_Podcast@atroyn (Junna Li)@DongxuLi_(Dongxu Li)@labenz (Nathan)@eriktorenberg (Erik) Join 1000's of subscribers of our Substack: https://cognitiverevolution.substack.com/ Websites:Cognitivervolution.aihuggingface.co/spaces/Salesforce/BLIPhuggingface.co/spaces/Salesforce/BLIP2 Show Notes:- Original BLIP demo- BLIP 2 demo- BLIP is the #18 most highly-cited paper in AI- Image captioning comparison tool- Understanding images with AI - for use in language models and image generation- Image Aesthetics - Product & Model Reviews
Released:
Mar 9, 2023
Format:
Podcast episode

Titles in the series (100)

A weekly podcast where hosts Erik Torenberg and Nathan Labenz interview the builders on the edge of AI and explore the dramatic shift it will unlock in the coming years.