The Inference Show
Automatan
10 episodes
5 days ago
Welcome to The Inference Show, where we explore the dynamic world of Education, Technology, and Artificial Intelligence. In each episode, we bring together diverse thought leaders to discuss the latest trends and their real-world impact. Our conversations dive into the applications, challenges, and opportunities shaping the future. Tune in as we uncover innovations driving advancements today and the strategies that will reshape tomorrow’s landscape.
Technology
All content for The Inference Show is the property of Automatan and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
Meta, Google & NVIDIA Engineer Manish Gupta on the future of GPUs & AI Infrastructure
1 hour 10 minutes 29 seconds
1 month ago

In this episode of The Inference Show, we are joined by Dr. Manish Gupta, a leading expert in AI training, GPU performance, and compiler optimization. Manish brings a wealth of experience from his work at Magic, Meta, Google, NVIDIA, AMD, and Qualcomm, where he has been at the forefront of scaling custom compute systems, optimizing large language models, and pioneering GPU innovations.

Manish takes us on a journey through his career and dives deep into the cutting edge of AI infrastructure, discussing:

  • His early experiences with low-level assembly programming and how it shaped his approach to GPU optimization.
  • Insights from working on NVIDIA’s CUTLASS project, which powers nearly every major AI training pipeline today.
  • The bottlenecks in scaling massive models like LLaMA, including precision trade-offs and checkpointing strategies.
  • How test-time compute and reinforcement learning are redefining the future of inference and model performance.
  • Why programmability and software-hardware co-design are key for emerging AI accelerators.
  • The evolution of GPU architecture from Volta to Blackwell and what it means for developers.
  • His vision for the future of AI-driven code generation and automated kernel development.

—

Manish’s work has directly influenced AI training and inference at scale, and his contributions are now used by every major company developing foundation models. From building core libraries to optimizing for cutting-edge hardware, he offers a rare perspective on where AI infrastructure is heading and the deep technical challenges ahead.

—

About Dr. Manish Arora

Dr. Arora is the co-founder of LearnDesk and Insaito, where he leads marketing and sales. He has grown LearnDesk into a global platform supporting over 25,000 businesses and is now focused on Automatan, an AI platform for automating business workflows. With 80+ patents and decades of industry experience, Dr. Arora brings deep technical and strategic insights to every conversation.

—

The Inference Show

Stay connected with us and explore more about our guests, topics, and future episodes:

🔗 Manish Gupta: LinkedIn

🔗 Automatan: LinkedIn

🔗 Insaito: LinkedIn

🔗 Dr. Manish Arora: LinkedIn

🔗 Vivek Puri: LinkedIn

🔗 LearnDesk: Website

🔗 Insaito: Website

—

Be our next guest by emailing us at vivek@insaito.com

We’d love to hear your insights and have you join the conversation!
