Gradient Descent - Podcast about AI and Data
Wisecube AI
6 episodes
6 days ago
“Gradient Descent” is a podcast that delves into the depths of artificial intelligence and data science. Hosted by Vishnu Vettrivel (Founder of Wisecube AI) and Alex Thomas (Principal Data Scientist), the show explores the latest trends, innovations, and practical applications in AI and data science. Join us to learn more about how these technologies are shaping our future.
Technology
All content for Gradient Descent - Podcast about AI and Data is the property of Wisecube AI and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
AI Scaling Laws, DeepSeek’s Cost Efficiency & The Future of AI Training
Gradient Descent - Podcast about AI and Data
40 minutes 12 seconds
8 months ago

In this first episode of Gradient Descent, hosts Vishnu Vettrivel (CTO of Wisecube AI) and Alex Thomas (Principal Data Scientist) discuss the rapid evolution of AI, the breakthroughs in LLMs, and the role of Natural Language Processing in shaping the future of artificial intelligence. They also share their experiences in AI development and explain why this podcast differs from other AI discussions.


Chapters:

00:00 – Introduction

01:56 – DeepSeek Overview

02:55 – Scaling Laws and Model Performance

04:36 – Peak Data: Are we running out of quality training data?

08:10 – Industry reaction to DeepSeek

09:05 – Jevons' Paradox: Why cheaper AI can drive more demand

11:04 – Supervised Fine-Tuning vs Reinforcement Learning (RLHF)

14:49 – Why Reinforcement Learning helps AI models generalize

20:29 – Distillation and Training Efficiency

25:01 – AI safety concerns: Toxicity, bias, and censorship

30:25 – Future Trends in LLMs: Cheaper, more specialized AI models?

37:30 – Final thoughts and upcoming topics


Mentioned Materials:

- Jevons’ Paradox

- Scaling Laws for Neural Language Models

- Distilling the Knowledge in a Neural Network

- SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

- Reinforcement Learning: An Introduction (Sutton & Barto)


Follow us:

Pythia Website

Wisecube Website

YouTube

LinkedIn

Facebook

X

Reddit

GitHub
