This episode of The ML Digest covers the paper “Towards a Unified View of Large Language Model Post-Training” from researchers at Tsinghua University, Shanghai AI Lab, and WeChat AI. The authors argue that seemingly distinct approaches—Supervised Fine-Tuning (SFT) with offline demonstrations and Reinforcement Learning (RL) with online rollouts—are in fact instances of a single optimization process.
Link to original paper: https://arxiv.org/pdf/2509.04419
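To give a rough sense of the unification before you listen, here is a minimal sketch in our own notation (not necessarily the paper's exact estimator) of how the SFT update and a REINFORCE-style RL update can be read as the same gradient form, differing only in where the samples come from and how they are weighted:

```latex
% Minimal sketch of the "single optimization process" claim,
% in our own notation (not the paper's exact formulation).
\documentclass{article}
\usepackage{amsmath}
\begin{document}

Both updates fit one policy-gradient template
\[
  g(\theta) \;=\; \mathbb{E}_{y \sim \mu}\!\bigl[\, A(y)\,
    \nabla_\theta \log \pi_\theta(y \mid x) \,\bigr],
\]
differing only in the sampling distribution $\mu$ and the weight $A$.

SFT draws offline demonstrations and weights them uniformly:
\[
  \mu = \pi_{\text{data}}, \qquad A(y) \equiv 1
  \;\;\Longrightarrow\;\;
  g(\theta) = \mathbb{E}_{y \sim \pi_{\text{data}}}
    \!\bigl[\nabla_\theta \log \pi_\theta(y \mid x)\bigr],
\]
which is exactly the (negated) cross-entropy gradient of SFT.

RL draws online rollouts from the current policy and weights them
by an advantage estimate:
\[
  \mu = \pi_\theta, \qquad A(y) = \hat{A}(y)
  \;\;\Longrightarrow\;\;
  g(\theta) = \mathbb{E}_{y \sim \pi_\theta}
    \!\bigl[\hat{A}(y)\, \nabla_\theta \log \pi_\theta(y \mid x)\bigr].
\]

\end{document}
```

The episode discusses how the paper builds on this kind of observation to treat offline demonstrations and online rollouts as two samplings of one underlying objective.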