Home
Categories
EXPLORE
True Crime
Comedy
Society & Culture
Business
Sports
History
Music
About Us
Contact Us
Copyright
© 2024 PodJoint
00:00 / 00:00
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts211/v4/28/0d/20/280d209a-5b07-9e20-7d7a-c67d4f4e957a/mza_17291545061204122681.jpg/600x600bb.jpg
Talking Machines by SU PARK
Su Park
9 episodes
4 days ago
Join Su Park as she invites various guests to unpack the hottest Artificial Intelligence papers off the press. Each episode dives into the newest discoveries in AI and the sci-fi-slowly-becoming-our-reality era we’re living in.
Show more...
Education
RSS
All content for Talking Machines by SU PARK is the property of Su Park and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
Join Su Park as she invites various guests to unpack the hottest Artificial Intelligence papers off the press. Each episode dives into the newest discoveries in AI and the sci-fi-slowly-becoming-our-reality era we’re living in.
Show more...
Education
https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/43344290/43344290-1743044471027-52ce8605d5ebd.jpg
Tom, Jerry, and the Neural Net: AI’s Leap in Video Storytelling
Talking Machines by SU PARK
23 minutes 10 seconds
7 months ago
Tom, Jerry, and the Neural Net: AI’s Leap in Video Storytelling

In this episode of "Talking Machines by Su Park," the hosts explore a groundbreaking paper focused on generating one-minute videos using a novel approach called Test-Time Training (TTT) layers. This topic is significant as it addresses the limitations of current video generation models, which typically produce only short clips, often around 20 seconds. By leveraging TTT layers, the researchers aim to enhance both the length and narrative complexity of generated videos, showcasing their method through the engaging context of Tom and Jerry cartoons.


Key insights from the discussion include the innovative use of TTT layers to make hidden states more expressive, effectively allowing the model to function like a neural network at critical moments. This enhancement leads to a notable improvement in the coherence of the generated stories, with the researchers reporting a 34% performance boost over existing models. The implications of this work suggest a more advanced capability for AI in video generation, paving the way for richer and more complex visual storytelling.


One-Minute Video Generation with Test-Time Training by NVIDIA: https://arxiv.org/abs/2504.05298

Talking Machines by SU PARK
Join Su Park as she invites various guests to unpack the hottest Artificial Intelligence papers off the press. Each episode dives into the newest discoveries in AI and the sci-fi-slowly-becoming-our-reality era we’re living in.