
This podcast is based on the paper "Foundations of Large Language Models" by Tong Xiao and Jingbo Zhu.
It offers a comprehensive exploration of Large Language Models (LLMs), beginning with an examination of pre-training methods in Natural Language Processing, covering both supervised and self-supervised approaches such as masked language modeling, as exemplified by models like BERT.
It then transitions to a detailed discussion of LLMs, covering their architecture, training challenges, and the critical concept of alignment with human preferences through techniques like Supervised Fine-tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF).
A significant portion of the podcast focuses on LLM inference, explaining the fundamental prefilling and decoding stages, as well as methods for improving efficiency and scalability, including prompt engineering and advanced search strategies.
The podcast also touches on crucial considerations like bias in training data, privacy concerns, and the emergent abilities and scaling laws that govern LLM performance.