© 2024 PodJoint
What's AI Podcast by Louis-François Bouchard
Louis-François Bouchard
45 episodes
1 week ago
Learn more about AI and how to better leverage it. This podcast aims to share exciting discussions with AI experts to demystify what they do and what they work on. We will cover specific AI-related topics (e.g., ChatGPT, DALLE...) and different roles related to artificial intelligence to share knowledge from the people who worked hard to gather it. I also want to showcase these people's unique paths to get where they are as AI builders, experts, and users. From building to leveraging AI technologies. Owner of the What's AI channel on YouTube, co-founder of Towards AI, and ex-PhD at Mila.
Technology
All content for What's AI Podcast by Louis-François Bouchard is the property of Louis-François Bouchard and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
OpenAI's NEW Fine-Tuning Method Changes EVERYTHING (Reinforcement Fine-Tuning Explained)
What's AI Podcast by Louis-François Bouchard
13 minutes 17 seconds
7 months ago

Have you ever wanted to take a language model and make it answer the way you want without needing a mountain of data?

Well, OpenAI’s got something for us: Reinforcement Fine-Tuning, or RFT, and it changes how we customize AI models. Instead of retraining the model by feeding it examples of what we want and hoping it learns in the classical way, we teach it by rewarding correct answers and penalizing wrong ones, just like training a dog (but, you know, with fewer treats and more math).
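
To make the "reward correct answers, penalize wrong ones" idea concrete, here is a toy sketch in plain Python. It is not OpenAI's RFT implementation: the "model" is just a softmax over a few candidate answers, the update is a basic REINFORCE-style step, and `grade`, `reinforce_toy`, the +1/-1 rewards, and the learning rate are all illustrative assumptions.

```python
import math
import random


def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def grade(answer, reference):
    """Hypothetical grader: +1 for a correct answer, -1 for a wrong one."""
    return 1.0 if answer == reference else -1.0


def reinforce_toy(candidates, reference, steps=200, lr=0.5, seed=0):
    """Toy policy over fixed candidate answers, updated from rewards only."""
    rng = random.Random(seed)
    logits = [0.0] * len(candidates)  # start with no preference
    for _ in range(steps):
        probs = softmax(logits)
        # Sample one answer from the current policy.
        i = rng.choices(range(len(candidates)), weights=probs)[0]
        reward = grade(candidates[i], reference)
        # REINFORCE step: gradient of log p(i) w.r.t. logits is one_hot(i) - probs.
        for j in range(len(logits)):
            grad = (1.0 if j == i else 0.0) - probs[j]
            logits[j] += lr * reward * grad
    return softmax(logits)


if __name__ == "__main__":
    answers = ["4", "5", "22"]
    print(dict(zip(answers, reinforce_toy(answers, reference="4"))))
```

Note that the loop never sees a "correct completion" to imitate. It only samples an answer, gets a score, and shifts probability toward whatever scored well.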

Let’s break down reinforcement fine-tuning compared to supervised fine-tuning!

Each has its use, which we can sum up in one line:

  1. Supervised fine-tuning teaches the model new things it does not know yet, like a new language, which is powerful for small and less “intelligent” models.

  2. Reinforcement fine-tuning steers an existing model toward what we really want it to say. It basically “aligns” the model to our needs, but it requires an already powerful model, which is why reasoning models are a perfect fit.
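
The difference between the two also shows up in what each method needs as a training signal. The sketch below is a simplified illustration, not any library's API: `SFTExample`, `RFTExample`, `sft_loss`, and `rft_reward` are hypothetical names, and the per-token probabilities are made up. Supervised fine-tuning scores the model on how likely it finds a demonstrated target, while reinforcement fine-tuning only needs a way to score whatever the model produced.

```python
import math
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class SFTExample:
    prompt: str
    target_tokens: List[str]  # the exact output we want the model to imitate


@dataclass
class RFTExample:
    prompt: str
    reference: str  # used only by the grader, never shown as a target


def sft_loss(token_probs: List[Dict[str, float]], example: SFTExample) -> float:
    """Mean negative log-likelihood the model assigns to the target tokens.

    token_probs[k] maps candidate tokens to the model's probability at step k.
    """
    nll = [-math.log(step[tok])
           for step, tok in zip(token_probs, example.target_tokens)]
    return sum(nll) / len(nll)


def rft_reward(sampled_answer: str, example: RFTExample) -> float:
    """Scalar score for an answer the model generated on its own."""
    return 1.0 if sampled_answer.strip() == example.reference else -1.0


if __name__ == "__main__":
    sft = SFTExample(prompt="Say hello in French", target_tokens=["Bon", "jour"])
    print(sft_loss([{"Bon": 0.5}, {"jour": 0.25}], sft))

    rft = RFTExample(prompt="What is 2 + 2?", reference="4")
    print(rft_reward(" 4 ", rft), rft_reward("5", rft))
```

In the supervised case the model is pulled toward one specific string. In the reinforcement case it is free to produce any answer, which is why it helps to start from a model that already gets some answers right.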

I’ve already covered fine-tuning on the channel if you are interested in that. Today, let’s get into how RFT actually works!
