What if not every part of an AI model needed to think at once? In this episode, we unpack Mixture of Experts (MoE), the architecture behind efficient large language models like Mixtral. From conditional computation and sparse activation to routing, load balancing, and the fight against router collapse, we explore how MoE breaks the old link between model size and compute cost. As scaling hits physical and economic limits, could selective intelligence be the next leap toward general intelligence?
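The episode stays conceptual, but for readers who want to see the core idea in code, here is a minimal PyTorch sketch of top-k sparse routing. The dimensions, expert count, and k=2 are arbitrary illustrative choices, not Mixtral's actual configuration:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseMoE(nn.Module):
    """Top-k sparse Mixture-of-Experts layer: each token is routed to only
    k experts, so compute no longer scales with total parameter count."""

    def __init__(self, dim=64, num_experts=8, k=2):
        super().__init__()
        self.k = k
        # The router (gating network) scores every expert for every token.
        self.router = nn.Linear(dim, num_experts)
        self.experts = nn.ModuleList(
            [nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))
             for _ in range(num_experts)]
        )

    def forward(self, x):                            # x: (num_tokens, dim)
        logits = self.router(x)                      # (num_tokens, num_experts)
        weights, idx = logits.topk(self.k, dim=-1)   # sparse activation: keep k experts
        weights = F.softmax(weights, dim=-1)         # renormalize over the chosen k
        out = torch.zeros_like(x)
        for slot in range(self.k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e             # tokens whose slot-th pick is expert e
                if mask.any():                       # conditional computation: only these
                    out[mask] += weights[mask, slot].unsqueeze(-1) * expert(x[mask])
        # Real systems add an auxiliary load-balancing loss here so the
        # router does not collapse onto a few favorite experts.
        return out

x = torch.randn(16, 64)
print(SparseMoE()(x).shape)  # torch.Size([16, 64])
```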
Getting to Know LLMs: Generative Models Fundamentals (Part 1)
The Second Brain AI Podcast ✨🧠
21 minutes
4 months ago
In this episode, we introduce large language models (LLMs): what they are, how they work at a high level, and why prompting is key to using them effectively. You'll learn about different types of prompts, how to structure them, and what makes an LLM respond the way it does. Source: "Foundations of Large Language Models" by Tong Xiao and Jingbo Zhu.
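For a concrete feel of what "structuring a prompt" means, here is a minimal sketch of one common pattern. The four section names (role, instruction, context, format) are illustrative conventions, not the episode's or the book's definitive recipe:

```python
# Assemble a structured prompt from labeled parts; the sections below are
# a common convention, not a fixed standard.
role = "You are a concise technical assistant."
instruction = "Summarize the text below in two sentences."
context = "Large language models are trained to predict the next token ..."
output_format = "Answer in plain English, without bullet points."

prompt = f"{role}\n\nTask: {instruction}\n\nText:\n{context}\n\n{output_format}"
print(prompt)
```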