Hugging Face Trending Papers
Code Coin Cognition LLC
11 episodes
3 days ago
Stay ahead in AI with Hugging Face Trending Papers — your daily digest of trending arXiv research. Hosts Vikas and Roger break down the most talked-about papers in machine learning, LLMs, generative AI, and robotics in just 5–6 minutes. Clear, conversational insights on problems, methods, benchmarks, and real-world impact — no jargon overload. Perfect for researchers, engineers, students, and AI enthusiasts.
Technology
All content for Hugging Face Trending Papers is the property of Code Coin Cognition LLC and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
Episodes (11/11)
Episode 11: Unlocking AI Reasoning: Breakthroughs in Looped Language Models

Papers discussed:


1. [Scaling Latent Reasoning via Looped Language Models](https://arxiv.org/pdf/2510.25741): This paper introduces Ouro, a new family of pre-trained looped language models that improve reasoning by integrating it into the pre-training phase. The models demonstrate superior performance thanks to enhanced knowledge-manipulation capabilities.


2. [Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations](https://arxiv.org/pdf/2510.23607): The Concerto model combines 2D and 3D learning for improved spatial cognition in AI. This integration, involving 3D intra-modal self-distillation with 2D-3D cross-modal joint embedding, has yielded promising results in 3D scene perception and set new benchmarks in scene understanding.


3. [RECODE: Unify Plan and Action for Universal Granularity Control](https://arxiv.org/pdf/2510.23564): RECODE is a new paradigm that unifies planning and action within a single code representation, facilitating dynamic control of decision granularity. This approach has proven effective in enhancing inference performance and training data efficiency.
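A minimal sketch of the looped-LM idea from the first paper: rather than stacking many distinct layers, one shared block is applied repeatedly, so reasoning depth grows with loop count instead of parameter count. The block below is a toy stand-in for illustration, not Ouro's actual architecture.

```python
# Toy sketch of a looped language model: one shared, parameterized block
# applied repeatedly, so extra "reasoning depth" costs compute, not parameters.

def shared_block(state, weight=0.5):
    """Stand-in for a transformer block: a single parameterized update."""
    return [weight * s + (1 - weight) * (s * s) / (abs(s) + 1) for s in state]

def looped_forward(state, num_loops):
    """Apply the same block num_loops times (recurrent depth)."""
    for _ in range(num_loops):
        state = shared_block(state)
    return state

hidden = [1.0, -2.0, 0.5]
shallow = looped_forward(hidden, num_loops=2)
deep = looped_forward(hidden, num_loops=8)  # more compute, same parameters
```

The point of the sketch: `shallow` and `deep` come from the exact same weights; only the loop count differs.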

3 days ago
5 minutes 8 seconds

Episode 10: AI's New Brain: LLM Reasoning, Memory, Agents

**Episode Summary:** This episode dives into cutting-edge advancements for Large Language Models, covering new methods to enhance reasoning reliability and efficiency, and introducing lightweight memory systems for more effective long-term interaction.


**Featured Papers:**
* **A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning**
  * *Key Insight:* Introduces RPC, a novel method that theoretically and empirically improves LLM reasoning by combining self-consistency and perplexity, achieving exponential error convergence and reducing sampling costs by 50%.
  * *Link:* https://arxiv.org/pdf/2510.15444

* **LightMem: Lightweight and Efficient Memory-Augmented Generation**
  * *Key Insight:* Presents LightMem, a human-memory-inspired system that enables LLMs to leverage historical interactions efficiently, significantly reducing token usage, API calls, and runtime while boosting accuracy.
  * *Link:* https://arxiv.org/pdf/2510.18866

* **DeepAnalyze: Agentic Large Language Models for Autonomous Data Science**
  * *Key Insight:* Introduces an agentic LLM framework for autonomous data science, automating the entire process from raw data to analyst-graded research reports using multi-agent collaboration and feedback reasoning.
  * *Link:* https://arxiv.org/pdf/2510.16872
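The RPC idea in the first paper above, combining self-consistency voting with the model's internal probability, can be pictured as a likelihood-weighted majority vote. The weighting below is purely illustrative and not the paper's exact formulation.

```python
import math
from collections import defaultdict

def weighted_self_consistency(samples):
    """samples: list of (answer, avg_log_likelihood) pairs.
    Each sampled answer votes with weight exp(avg_log_likelihood),
    so answers the model itself finds likely count for more than
    low-probability flukes (plain self-consistency weights all equally)."""
    scores = defaultdict(float)
    for answer, avg_ll in samples:
        scores[answer] += math.exp(avg_ll)
    return max(scores, key=scores.get)

# Two moderately confident votes for "42" outweigh one very confident "17".
samples = [("42", -0.2), ("42", -0.9), ("17", -0.1)]
best = weighted_self_consistency(samples)
```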

1 week ago
9 minutes 47 seconds

Episode 9: Boosting AI Problem Solving: Tiny Networks and Early Experience Learning

In this episode of Hugging Face Trending Papers, we discuss three exciting AI research papers: "Less is More: Recursive Reasoning with Tiny Networks", "Agent Learning via Early Experience", and "Paper2Video: Automatic Video Generation from Scientific Papers".


## Papers Discussed
1. **[Less is More: Recursive Reasoning with Tiny Networks](https://arxiv.org/pdf/2510.04871)**: This paper introduces a Tiny Recursive Model that significantly improves accuracy on hard question-answer problems, using a simpler recursive reasoning approach and beating Large Language Models on complex tasks.
2. **[Agent Learning via Early Experience](https://arxiv.org/pdf/2510.08558)**: This research paper presents a new paradigm called "early experience", where AI agents learn from their own actions. The approach improved effectiveness and out-of-domain generalization in diverse environments.
3. **[Paper2Video: Automatic Video Generation from Scientific Papers](https://arxiv.org/pdf/2510.05096)**: This paper presents Paper2Video, a multi-agent framework designed to automate the labor-intensive process of generating academic presentation videos from scientific papers.

## Episode Links

[Paper 1: Less is More: Recursive Reasoning with Tiny Networks](https://arxiv.org/pdf/2510.04871)

[Paper 2: Agent Learning via Early Experience](https://arxiv.org/pdf/2510.08558)

[Paper 3: Paper2Video: Automatic Video Generation from Scientific Papers](https://arxiv.org/pdf/2510.05096)

3 weeks ago
4 minutes 31 seconds

Episode 8: Boosting AI Efficiency: Code Compression, Video Generation, and Experience-based Reasoning

In this episode, we discuss three trending AI research papers. We delve into the challenges and solutions related to code language models, video generation, and reinforcement learning.

## Key Points Discussed
### LongCodeZip: Compress Long Context for Code Language Models
- LongCodeZip is a novel framework for compressing code for Large Language Models (LLMs)
- It addresses the issue of high API costs and generation latency associated with processing long inputs in codebases
- The framework uses a dual-stage compression strategy, enabling it to preserve essential information while reducing context size
- Evaluations show that LongCodeZip consistently outperforms baseline methods
- This research could improve the efficiency and capability of code intelligence applications
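A rough sketch of a dual-stage compression strategy in the spirit of LongCodeZip: first score whole code chunks for relevance, then greedily keep the best ones within a budget. The naive word-overlap scorer and word-count budget here are illustrative stand-ins, not the paper's method.

```python
def relevance(chunk, query):
    """Coarse stage: score a chunk by word overlap with the query
    (a real system would use model-based relevance, e.g. perplexity)."""
    return len(set(query.lower().split()) & set(chunk.lower().split()))

def compress_context(chunks, query, budget):
    """Fine stage: keep highest-scoring chunks while staying under a
    word budget, discarding the rest of the long context."""
    ranked = sorted(chunks, key=lambda c: relevance(c, query), reverse=True)
    kept, used = [], 0
    for chunk in ranked:
        cost = len(chunk.split())
        if used + cost <= budget:
            kept.append(chunk)
            used += cost
    return kept

chunks = [
    "def parse_config(path): load yaml config",
    "def train_model(data): run training loop",
    "def load_yaml(path): read yaml file",
]
kept = compress_context(chunks, query="load yaml config", budget=12)
```

With a 12-word budget, the two config-related chunks survive and the irrelevant training chunk is dropped.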


### Self-Forcing++: Towards Minute-Scale High-Quality Video Generation
- The paper addresses the computational cost of generating long videos with diffusion models
- It proposes an approach that uses teacher models to guide student models through sampled segments from self-generated long videos
- This method allows for video length scaling up to 20× beyond the teacher's capability
- The authors manage to generate videos up to 4 minutes and 15 seconds long, substantially outperforming baseline methods


### EXGRPO: Learning to Reason from Experience
- The paper investigates what makes a reasoning experience valuable in the context of Reinforcement Learning from Verifiable Rewards (RLVR)
- The authors propose a framework that organizes and prioritizes valuable experiences
- The approach aims to balance exploration with experience exploitation for efficient and scalable RLVR


### Links to Papers

- [LongCodeZip: Compress Long Context for Code Language Models](https://arxiv.org/pdf/2510.00446)
- [Self-Forcing++: Towards Minute-Scale High-Quality Video Generation](https://arxiv.org/pdf/2510.02283)
- [EXGRPO: Learning to Reason from Experience](https://arxiv.org/pdf/2510.02245)

1 month ago
4 minutes 17 seconds

Episode 7: Agents of Change: From Interactive Papers to Lifelong AI Learning

In today’s episode of Hugging Face Trending Papers, we explore three cutting-edge ideas reshaping how we interact with AI and research.


First, we dive into Paper2Agent, a framework that transforms static research papers into interactive AI agents, making findings more transparent and usable. Next, we look at Scaling Agents via Continual Pre-training, which pushes the boundaries of agent reliability by teaching them through lifelong learning. Finally, we cover Documenting Machine Learning with AI Partners, a vision of AI tools that act as collaborative lab partners to capture, explain, and even co-author machine learning workflows.


Listen in for benchmarks, methods, and real-world implications — and discover how these papers could change the way we do science.


Papers discussed:


  • Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents

  • Scaling Agents via Continual Pre-training

  • Documenting Machine Learning with AI Partners


1 month ago
3 minutes 49 seconds

Episode 6: Steering the Future: Real-Time Long Video, Training-Time Search, and a Gym for Agentic LLMs

In this episode, we unpack three fresh arXiv papers shaping how AI creates, reasons, and acts. First, arXiv:2509.22622 explores real-time, steerable long-form video generation you can guide on the fly (PDF: https://arxiv.org/pdf/2509.22622).


Next, arXiv:2509.25454 integrates tree search directly into reinforcement-learning training for verifiable reasoning—think math and code with checkable rewards (PDF: https://arxiv.org/pdf/2509.25454).


Finally, arXiv:2510.01051 introduces a unified “gym” for multi-turn, tool-using LLM agents so results are comparable and scalable (PDF: https://arxiv.org/pdf/2510.01051). We break down why each matters, the key technical ideas, and what they could unlock for creators, engineers, and autonomous AI workflows.

1 month ago
8 minutes 20 seconds

Episode 5: Scaling Feedback, Forgetting Smartly, and Video Agents: AI’s Next Frontier

1. RLAIF at Scale: Reinforcement Learning from AI Feedback for Multi-Turn Reasoning

This paper explores using AI-generated feedback instead of expensive human labels to train reasoning models. The authors show that Reinforcement Learning from AI Feedback (RLAIF) can match or even outperform models trained with limited human feedback, especially in multi-turn reasoning tasks.


2. Learning to Forget: Dynamic Memory Compression in Long-Context Transformers

The authors propose a method for making transformers more efficient on long contexts by teaching them to “forget” unimportant details. Their dynamic memory compression reduces memory usage by over 40% while maintaining — and sometimes improving — accuracy on long-sequence benchmarks.
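The "learning to forget" idea above can be pictured as a budgeted memory that evicts its least important entries when full. In this toy sketch the importance scores are supplied by the caller, whereas the paper's method would learn what to keep.

```python
class CompressingMemory:
    """Fixed-capacity memory that forgets its least important entry
    when over budget, a crude analogue of dynamic memory compression."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.entries = []  # (importance, token) pairs

    def add(self, token, importance):
        self.entries.append((importance, token))
        if len(self.entries) > self.capacity:
            # Evict the lowest-importance entry to stay within budget.
            self.entries.remove(min(self.entries))

    def tokens(self):
        """Surviving tokens, most important first."""
        return [t for _, t in sorted(self.entries, reverse=True)]

mem = CompressingMemory(capacity=2)
mem.add("the", 0.1)
mem.add("treaty", 0.9)
mem.add("of", 0.2)  # over budget: "the" (importance 0.1) is forgotten
```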


3. VidAgent: Scalable Video Agents with Spatio-Temporal Reasoning

This work introduces VidAgent, a system that can understand and reason over long videos by grounding events in both space and time. It achieves state-of-the-art performance on video QA benchmarks and opens up possibilities for advanced video search and monitoring applications.

1 month ago
7 minutes 55 seconds

Episode 4: Panoramas, HALA, and the T2I Exam: Three Trends You Shouldn’t Miss

Today we cover three standout arXiv releases shaping vision, language, and evaluation. First, PANORAMA surveys the rise of omnidirectional, 360° perception for embodied AI—why standard pinhole vision isn’t enough, where datasets and models fall short, and how new backbones and adaptation methods are closing the gap. Read: https://arxiv.org/pdf/2509.12989 (arXiv:2509.12989).
Next, the HALA technical report details an Arabic-centric instruction and translation pipeline—from FP8 translator teachers to multi-million sample corpora—powering models from 350M to 9B with strong benchmark gains. Read: https://arxiv.org/pdf/2509.14008 (arXiv:2509.14008).
Finally, GenExam proposes a multidisciplinary “exam” for text-to-image models, revealing how strict, knowledge-heavy prompts expose major gaps in today’s generators. Read: https://arxiv.org/pdf/2509.14232 (arXiv:2509.14232).

1 month ago
10 minutes 21 seconds

Episode 3: Swarms, Tiny Robot Policies & HuMo

In today’s 5–6 minute roundup, we cover:

(1) SAPO’s decentralized RL that shares rollouts across a swarm for cheaper, faster LM post-training (arXiv:2509.08721 PDF),

(2) VLA-Adapter’s “Bridge Attention” that makes small vision-language-action models both fast and state-of-the-art on robotics tasks (arXiv:2509.09372 PDF), and

(3) HuMo’s unified generator coordinating text, reference images, and audio for people-centric video with strong identity + lip-sync (arXiv:2509.08519 PDF). Subscribe for crisp takes on what was done, why it matters, and where it might go next.

1 month ago
8 minutes 40 seconds

Episode 2: Boundaries Checked, Populations Evolved, Images Understood

In this episode, we cover three Hugging Face trending AI papers shaping the future of alignment, training, and creativity.

  • How models can reason over boundaries to stick to instructions (arXiv:2509.14760)

  • How populations of models can evolve without labels through consensus and novelty (arXiv:2509.15194)

  • How autoregressive generators can understand before they generate images (arXiv:2509.15185)

Three different paths, one goal: building smarter, safer, and more creative AI.

1 month ago
7 minutes 53 seconds

Hugging Face Trending Papers (Ep. 1) — ScaleCUA, FlowRL, RynnVLA-001

Smarter computer agents, better reasoning, and robot manipulation breakthroughs.

Today’s Papers

  1. ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

    🔗 arXiv:2509.15221

    ➡️ Large dataset across 6 OSs and 3 task domains; closed-loop pipeline of auto-agents + human curation; big benchmark gains for GUI agents.

  2. FlowRL: Matching Reward Distributions for LLM Reasoning

    🔗 arXiv:2509.15207

    ➡️ Shifts RL objective from reward maximization to reward distribution matching; preserves diverse reasoning paths; strong math & code benchmark results.

  3. RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

    🔗 arXiv:2509.15212

    ➡️ Two-stage pretraining on 12M ego-centric videos + trajectory-aware modeling; adds ActionVAE for action compression; stronger transfer to real robot tasks.
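FlowRL's shift from reward maximization to reward-distribution matching (paper 2) can be sketched numerically: turn path rewards into a target distribution via softmax, then prefer the policy closest to that target in KL divergence, so near-best reasoning paths keep probability mass. All numbers below are illustrative, not from the paper.

```python
import math

def softmax(xs):
    """Convert raw rewards into a normalized target distribution."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    z = sum(exps)
    return [e / z for e in exps]

def kl_divergence(p, q):
    """KL(p || q): how far policy q is from the reward target p."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

rewards = [2.0, 1.8, -1.0]          # two nearly-as-good paths, one bad one
target = softmax(rewards)            # target spreads mass over both good paths
policy_diverse = [0.50, 0.45, 0.05]  # keeps both good reasoning paths
policy_greedy = [0.90, 0.05, 0.05]   # collapses onto the single best path
```

Under distribution matching, the diverse policy scores better (lower KL to the target) than the greedy one, which is the intuition behind preserving diverse reasoning paths.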

1 month ago
3 minutes 3 seconds
