Home
Categories
EXPLORE
True Crime
Comedy
Society & Culture
Business
News
Sports
TV & Film
About Us
Contact Us
Copyright
Β© 2024 PodJoint
00:00 / 00:00
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts112/v4/9d/62/69/9d6269d1-ec6c-4686-16c2-20a117ef266c/mza_4869623135449196254.jpg/600x600bb.jpg
Behind the Craft
Peter Yang
83 episodes
21 hours ago
Expert interviews and guides to help you level up as a product leader and creator fast.
Show more...
Technology
RSS
All content for Behind the Craft is the property of Peter Yang and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
Expert interviews and guides to help you level up as a product leader and creator fast.
Show more...
Technology
https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/40755531/40755531-1710425611383-544c3b1963805.jpg
Complete Beginner's Course on AI Evaluations: Step by Step (2025) | Aman Khan
Behind the Craft
51 minutes 47 seconds
2 months ago
Complete Beginner's Course on AI Evaluations: Step by Step (2025) | Aman Khan

Today, I want to share a new episode with Aman Khan.The best way to learn about AI evaluations is to watch 2 PMs build them live from scratch. In our new episode, Aman and I walk through creating evals for an AI customer support agent β€” from labeling a golden dataset to aligning LLM judges. This is the complete beginners AI eval course you've been waiting for.Aman and I talked about:

(00:00) What are AI evals and how to get good at them

(02:52) The 4 types of AI evaluations everyone should know

(06:08) Live demo: Building evals for a customer support agent

(10:29) Using Anthropic's console to generate great prompts

(15:13) Creating the evaluation criteria

(17:40) Adding human labels to the golden dataset

(31:05) Scaling evals with LLM-judge prompts

(38:21) How to align LLM judges with human judgmentGet the takeaways: https://creatoreconomy.so/p/complete-beginner-course-on-ai-evaluations-aman-khanWhere to find Aman:

X: https://www.linkedin.com/in/amanberkeley/

Website: https://arize.com/πŸ“Œ Subscribe to this channel – more interviews coming soon!

Behind the Craft
Expert interviews and guides to help you level up as a product leader and creator fast.