Building the howto100m Video Corpus

FromData Skeptic

Start listening View podcast show

Building the howto100m Video Corpus

FromData Skeptic

ratings:

Length:

23 minutes

Released:

Aug 19, 2019

Format:

Podcast episode

Description

Video annotation is an expensive and time-consuming process. As a consequence, the available video datasets are useful but small. The availability of machine transcribed explainer videos offers a unique opportunity to rapidly develop a useful, if dirty, corpus of videos that are "self annotating", as hosts explain the actions they are taking on the screen. This episode is a discussion of the HowTo100m dataset - a project which has assembled a video corpus of 136M video clips with captions covering 23k activities. Related Links The paper will be presented at ICCV 2019 @antoine77340 Antoine on Github Antoine's homepage

Released:

Aug 19, 2019

Format:

Podcast episode

Titles in the series (100)

Data Skeptic is a data science podcast exploring machine learning, statistics, artificial intelligence, and other data topics through short tutorials and interviews with domain experts.

Skip carousel

More Episodes from Data Skeptic

Skip carousel

Related podcast episodes

Skip carousel

Discover this podcast and so much more

Building the howto100m Video Corpus

Building the howto100m Video Corpus

Description

Titles in the series (100)

More Episodes from Data Skeptic

Related podcast episodes