2: data2vec

https://is1-ssl.mzstatic.com/image/thumb/Podcasts211/v4/cb/94/dd/cb94dd8c-3061-a5fc-079f-a47a75bb1a24/mza_8328803272468307441.jpg/600x600bb.jpg

Argmax

Vahe Hagopian, Taka Hasegawa, Farrukh Rahman

17 episodes

9 months ago

In this episode we talk about the paper "Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean.

Mathematics

Science

RSS

All content for Argmax is the property of Vahe Hagopian, Taka Hasegawa, Farrukh Rahman and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.

Mathematics

Science

2: data2vec

Argmax

53 minutes

3 years ago

2: data2vec

Todays paper: data2vec (https://arxiv.org/abs/2202.03555)Summary of the paperA multimodal SSL algorithm that predicts latent representation of different types of input.Highlights of discussionWhat are the motivations of SSL and multimodalHow does the student teacher learning work?What are similarities and differences between ViT, BYOL, and Reinforcement Learning algorithms.