Discover this podcast and so much more

Podcasts are free to enjoy without a subscription. We also offer ebooks, audiobooks, and so much more for just $11.99/month.

The Timeline for Realistic 4-D: Devi Parikh from Meta on Research Hurdles for Generative AI in Video and Multimodality

The Timeline for Realistic 4-D: Devi Parikh from Meta on Research Hurdles for Generative AI in Video and Multimodality

FromNo Priors: Artificial Intelligence | Technology | Startups


The Timeline for Realistic 4-D: Devi Parikh from Meta on Research Hurdles for Generative AI in Video and Multimodality

FromNo Priors: Artificial Intelligence | Technology | Startups

ratings:
Length:
40 minutes
Released:
Jul 20, 2023
Format:
Podcast episode

Description

Video dominates modern media consumption, but video creation is still expensive and difficult. AI-generated and edited video is a holy grail of democratized creative expression. This week on No Priors, Sarah Guo and Elad Gil sit down with Devi Parikh. She is a Research Director in Generative AI at Meta and an Associate Professor in the School of Interactive Computing at Georgia Tech. Her work focuses on multimodality and AI for images, audio and video. Recently, she worked on Make a Video 3D, also called MAV3D, which creates animations from text prompts. She is also a talented AI-generated and analog artist herself.
Elad, Sarah and Devi talk about what’s exciting in computer vision, what’s blocking researchers from fully immersive Generative 4-D, and AI controllability.
No Priors is now on YouTube! Subscribe to the channel on YouTube and like this episode.

Show Links:


Devi Parikh - Google Scholar 

Text-To-4D Dynamic Scene Generation named MAV3D (Make-A-Video3D)

Full Research Paper

Website with examples of image to 4 D generation

Devi’s Substack


Sign up for new podcasts every week. Email feedback to show@no-priors.com
Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @DeviParikh

Show Notes:
(0:00:06) - Democratizing Creative Expression With AI-Generated Video
(0:08:31) - Challenges in Video Generation Research
(0:15:57) - Challenges and Implications of Video Processing
(0:20:43) - Control and Multi-Modal Inputs in Video
(0:25:50) - Audio's Role in Visual Content
(0:39:00) - Don't Self-Select & Devi’s tips for young researchers
Released:
Jul 20, 2023
Format:
Podcast episode

Titles in the series (64)

At this moment of inflection in technology, co-hosts Elad Gil and Sarah Guo talk to the world's leading AI engineers, researchers and founders about the biggest questions: How far away is AGI? What markets are at risk for disruption? How will commerce, culture, and society change? What’s happening in state-of-the-art in research? “No Priors” is your guide to the AI revolution. Email feedback to show@no-priors.com. Sarah Guo is a startup investor and the founder of Conviction, an investment firm purpose-built to serve intelligent software, or "Software 3.0" companies. She spent nearly a decade incubating and investing at venture firm Greylock Partners. Elad Gil is a serial entrepreneur and a startup investor. He was co-founder of Color Health, Mixer Labs (which was acquired by Twitter). He has invested in over 40 companies now worth $1B or more each, and is also author of the High Growth Handbook.