Home
Categories
EXPLORE
True Crime
Comedy
Society & Culture
Business
Sports
Technology
Health & Fitness
About Us
Contact Us
Copyright
© 2024 PodJoint
Podjoint Logo
US
00:00 / 00:00
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts221/v4/4d/7d/ee/4d7dee51-0b9c-cc29-ca9e-850100925df0/mza_959155509902739546.jpg/600x600bb.jpg
Data Hurdles
Michael Burke and Chris Detzel
52 episodes
6 days ago
Data Hurdles is a podcast that brings the stories of data professionals to life, showcasing the challenges, triumphs, and insights from those shaping the future of data. Hosted by Michael Burke and Chris Detzel, this podcast dives into the real-world experiences of data experts as they navigate topics like data quality, security, AI, data literacy, and machine learning. Each episode features guest data professionals who share their journeys, lessons learned, and the impact of data on industries, technology, and society. From overcoming obstacles in data pipelines to implementing groundbreaking AI solutions, Data Hurdles highlights the human side of data and the stories behind the innovations that are transforming the world. Join us to hear firsthand accounts of how data professionals are solving complex problems and driving the future of technology.
Show more...
Technology
Business
RSS
All content for Data Hurdles is the property of Michael Burke and Chris Detzel and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
Data Hurdles is a podcast that brings the stories of data professionals to life, showcasing the challenges, triumphs, and insights from those shaping the future of data. Hosted by Michael Burke and Chris Detzel, this podcast dives into the real-world experiences of data experts as they navigate topics like data quality, security, AI, data literacy, and machine learning. Each episode features guest data professionals who share their journeys, lessons learned, and the impact of data on industries, technology, and society. From overcoming obstacles in data pipelines to implementing groundbreaking AI solutions, Data Hurdles highlights the human side of data and the stories behind the innovations that are transforming the world. Join us to hear firsthand accounts of how data professionals are solving complex problems and driving the future of technology.
Show more...
Technology
Business
https://img.transistor.fm/N735AuYSWoPYBByzSMHzoJgG0wEYPeze5aN-WxJO--o/rs:fill:0:0:1/w:1400/h:1400/q:60/mb:500000/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9kOGUw/ZDA2MDg0ZGE2MWE5/YzIzYjAyM2JmZTdm/OTU1MC5wbmc.jpg
DeepSeek's Cost-Efficient Model Training ($5M vs hundreds of millions for competitors)
Data Hurdles
24 minutes
8 months ago
DeepSeek's Cost-Efficient Model Training ($5M vs hundreds of millions for competitors)

The episode features hosts Chris Detzel and Michael Burke discussing DeepSeek, a Chinese AI company making waves in the large language model (LLM) space. Here are the key discussion points:

Major Breakthrough in Cost Efficiency:
- DeepSeek claimed they trained their latest model for only $5 million, compared to hundreds of millions or billions spent by competitors like OpenAI
- This cost efficiency created market disruption, particularly affecting NVIDIA's stock as it challenged assumptions about necessary GPU resources

Mixture of Experts (MoE) Innovation:
- Instead of using one large model, DeepSeek uses multiple specialized "expert" models
- Each expert model focuses on specific areas/topics
- Uses reinforcement learning to route queries to the appropriate expert model
- This approach reduces both training and inference costs
- DeepSeek notably open-sourced their MoE architecture, unlike other major companies

Technical Infrastructure:
- Discussion of how DeepSeek achieved results without access to NVIDIA's latest GPUs
- Highlighted the dramatic price increase in NVIDIA GPUs (from $3,000 to $30,000-$50,000) due to AI demand
- Explained how inference costs (serving the model) often exceed training costs

Chain of Thought Reasoning:
- DeepSeek open-sourced their chain of thought reasoning system
- This allows models to break down complex questions into steps before answering
- Improves accuracy on complicated queries, especially math problems
- Comparable to Meta's LLAMA in terms of open-source contributions to the field

Broader Industry Impact:
- Discussion of how businesses are integrating AI into their products
- Example of ZoomInfo using AI to aggregate business intelligence and automate sales communications
- Noted how technical barriers to AI implementation are lowering through platforms like Databricks

The hosts also touched on data privacy concerns regarding Chinese tech companies entering the US market, drawing parallels to TikTok discussions. They concluded by discussing how AI tools are making technical development more accessible to non-experts and mentioned the importance of being aware of how much personal information these models collect about users.

Data Hurdles
Data Hurdles is a podcast that brings the stories of data professionals to life, showcasing the challenges, triumphs, and insights from those shaping the future of data. Hosted by Michael Burke and Chris Detzel, this podcast dives into the real-world experiences of data experts as they navigate topics like data quality, security, AI, data literacy, and machine learning. Each episode features guest data professionals who share their journeys, lessons learned, and the impact of data on industries, technology, and society. From overcoming obstacles in data pipelines to implementing groundbreaking AI solutions, Data Hurdles highlights the human side of data and the stories behind the innovations that are transforming the world. Join us to hear firsthand accounts of how data professionals are solving complex problems and driving the future of technology.