word2vec

FromData Skeptic

Start listening View podcast show

word2vec

FromData Skeptic

ratings:

Length:

31 minutes

Released:

Feb 1, 2019

Format:

Podcast episode

Description

Word2vec is an unsupervised machine learning model which is able to capture semantic information from the text it is trained on. The model is based on neural networks. Several large organizations like Google and Facebook have trained word embeddings (the result of word2vec) on large corpora and shared them for others to use. The key algorithmic ideas involved in word2vec is the continuous bag of words model (CBOW). In this episode, Kyle uses excerpts from the 1983 cinematic masterpiece War Games, and challenges Linhda to guess a word Kyle leaves out of the transcript. This is similar to how word2vec is trained. It trains a neural network to predict a hidden word based on the words that appear before and after the missing location.

Released:

Feb 1, 2019

Format:

Podcast episode

Titles in the series (100)

Data Skeptic is a data science podcast exploring machine learning, statistics, artificial intelligence, and other data topics through short tutorials and interviews with domain experts.

Skip carousel

More Episodes from Data Skeptic

Skip carousel

Related podcast episodes

Skip carousel

Discover this podcast and so much more

word2vec

word2vec

Description

Titles in the series (100)

More Episodes from Data Skeptic

Related podcast episodes