The Gist Talk
kw
237 episodes
3 days ago
Welcome to The Gist Talk, the podcast where we break down the big ideas from the world’s most fascinating business and non-fiction books. Whether you’re a busy professional, a lifelong learner, or just someone curious about the latest insights shaping the world, this show is for you. Each episode, we’ll explore the key takeaways, actionable lessons, and inspiring stories—giving you the ‘gist’ of every book, one conversation at a time. Join us for engaging discussions that make learning effortless and fun.
Business
All content for The Gist Talk is the property of kw and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
The Ultra-Scale Playbook: Training LLMs on GPU Clusters
The Gist Talk
55 minutes 3 seconds
1 week ago
The excerpts provide an extensive guide to scaling Large Language Model (LLM) training across GPU clusters, detailing five core parallelism strategies: Data Parallelism (DP), Tensor Parallelism (TP), Sequence/Context Parallelism (SP/CP), Pipeline Parallelism (PP), and Expert Parallelism (EP). The text first addresses memory optimization techniques such as activation recomputation and gradient accumulation, then explores how to distribute the model and data using methods such as the ZeRO optimizer and various pipeline schedules that minimize idle GPU time. Finally, the source turns to hardware-level optimizations: GPU architecture, custom kernels (e.g., in Triton and CUDA), techniques such as memory coalescing and tiling, and mixed precision training to maximize throughput and computational efficiency. Throughout, the discussion emphasizes the critical trade-off between memory savings, computation time, and communication overhead when configuring large-scale training.
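As a rough illustration of one technique named above, gradient accumulation, the following sketch shows that averaging gradients over equally sized micro-batches reproduces the full-batch gradient, which is why it saves memory without changing the update. It is not taken from the episode or the playbook: the toy one-parameter model, the data, and the helper names `mse_grad` and `accumulated_grad` are assumptions made purely for illustration, and plain Python stands in for a real training framework.

```python
# Gradient accumulation on a toy model y = w * x with mean squared error loss.
# Hypothetical illustration in plain Python; real training would use a framework.

def mse_grad(w, batch):
    """Gradient of the mean squared error over `batch` with respect to w."""
    return sum(2.0 * (w * x - y) * x for x, y in batch) / len(batch)

def accumulated_grad(w, data, micro_batch_size):
    """Average gradient built up from equally sized micro-batches."""
    micro_batches = [data[i:i + micro_batch_size]
                     for i in range(0, len(data), micro_batch_size)]
    total = 0.0
    for mb in micro_batches:
        # Scale each micro-batch gradient so the running sum ends up
        # equal to the mean over the whole batch.
        total += mse_grad(w, mb) / len(micro_batches)
    return total

# Made-up (x, y) pairs, roughly following y = 2x.
data = [(1.0, 2.0), (2.0, 4.1), (3.0, 5.9), (4.0, 8.2)]
w = 0.5

full = mse_grad(w, data)                               # one pass over all 4 samples
accum = accumulated_grad(w, data, micro_batch_size=2)  # two passes of 2 samples
print(full, accum)
```

Only one micro-batch needs to be held in memory at a time, at the cost of more sequential forward and backward passes per optimizer step. The equivalence shown here relies on the micro-batches being the same size.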