Home
Categories
EXPLORE
True Crime
Comedy
Society & Culture
Business
Sports
Technology
Health & Fitness
About Us
Contact Us
Copyright
© 2024 PodJoint
Podjoint Logo
US
00:00 / 00:00
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts211/v4/4f/1c/b1/4f1cb185-f5bb-229d-2dee-8aeea669a76e/mza_2035931246008308099.jpg/600x600bb.jpg
Future Is Already Here
Eksplain
32 episodes
1 week ago
“The future is already here — it's just not very evenly distributed,” said science fiction writer William Gibson. We agree. Our mission is to help change that. This podcast breaks down advanced technologies and innovations in simple, easy-to-understand ways, making cutting-edge ideas more accessible to everyone. Please note: Some of our content may be AI-generated, including voices, text, images, and videos.
Show more...
Technology
RSS
All content for Future Is Already Here is the property of Eksplain and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
“The future is already here — it's just not very evenly distributed,” said science fiction writer William Gibson. We agree. Our mission is to help change that. This podcast breaks down advanced technologies and innovations in simple, easy-to-understand ways, making cutting-edge ideas more accessible to everyone. Please note: Some of our content may be AI-generated, including voices, text, images, and videos.
Show more...
Technology
https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/42831029/42831029-1744939931749-250385e3389bb.jpg
Beyond Pixels: V-JEPA and the Future of Video AI
Future Is Already Here
17 minutes 55 seconds
8 months ago
Beyond Pixels: V-JEPA and the Future of Video AI

How do we teach AI to truly understand video? V-JEPA offers a new answer: by predicting features, not just pixels. We'll break down this fascinating technique, explaining how it helps AI learn more robust and meaningful visual representations from video. Join us to explore how V-JEPA is pushing the boundaries of video AI.


This paper explores feature prediction as a stand-alone objective for unsupervised learning from video and introduces V-JEPA, a collection of vision models trained solely using a feature prediction objective, without the use of pretrained image encoders, text, negative examples, reconstruction, or other sources of supervision. The models are trained on 2 million videos collected from public datasets and are evaluated on downstream image and video tasks. Our results show that learning by predicting video features leads to versatile visual representations that perform well on both motion and appearance-based tasks, without adaption of the model’s parameters; e.g., using a frozen backbone, our largest model, a ViT-H/16 trained only on videos, obtains 81.9% on Kinetics-400, 72.2% on Something-Something-v2, and 77.9% on ImageNet1K.


References:

This episode draws primarily from the following paper:

 

Revisiting Feature Prediction for Learning VisualRepresentations from Video

Adrien Bardes, Quentin Garrido, Jean Ponce, XinleiChen, Michael Rabbat, Yann LeCun, Mahmoud Assran, Nicolas Ballas

 

The paper references several other important works in this field. Please refer to the full paper for acomprehensive list.


Disclaimer:

Please note that parts or all this episode was generatedby AI. While the content is intended to be accurate and informative, it isrecommended that you consult the original research papers for a comprehensiveunderstanding.

Future Is Already Here
“The future is already here — it's just not very evenly distributed,” said science fiction writer William Gibson. We agree. Our mission is to help change that. This podcast breaks down advanced technologies and innovations in simple, easy-to-understand ways, making cutting-edge ideas more accessible to everyone. Please note: Some of our content may be AI-generated, including voices, text, images, and videos.