Code, Care & Customer Support: Klarna’s U-Turn, Healthbench Hype, and OpenAI’s Coding Agent

Fresh From the Labs

Pioneer Square Labs

23 episodes

3 days ago

Fresh From the Labs is your front-row seat to the future of AI — straight from the builders shaping it. Hosted by the product team at Pioneer Square Labs, a Seattle-based venture studio, each episode dives into the week's most exciting AI breakthroughs, tools, and trends. No hype, just hands-on insight from the people actually prototyping, experimenting, and pushing boundaries with the latest tech. Whether you're building with AI or just trying to keep up, this podcast is your lab-tested shortcut to what matters most.

Technology

All content for Fresh From the Labs is the property of Pioneer Square Labs and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.

Code, Care & Customer Support: Klarna’s U-Turn, Healthbench Hype, and OpenAI’s Coding Agent

Fresh From the Labs

36 minutes 1 second

5 months ago

This week on Fresh From the Labs, Shilpa, Kevin, and Jared dive into three headline-grabbing stories and what they mean for builders right now:

Klarna’s customer-support U-turn. The team dissects the "replace 700 agents with AI" experiment, why a two-humans-per-bot fallback isn’t a failure, and what sustainable AI adoption in ops should look like.
OpenAI’s new Healthbench. Thousands of curated physician conversations power a fresh benchmark that pushes GPT-4o and rivals toward real clinical usefulness. We unpack where the models still stumble (context seeking!), the Epic integrations everyone is watching, and why a safer WebMD can’t come soon enough.
Codex 1 & cloud coding agents. OpenAI plants a giant flag in the fully agentic dev-tool space, right as rumors swirl about the Windsurf acquisition. Kevin shares war stories from building his own open-source coding agent, and the crew debates whether verticalized startups or open-source stacks will win the long game.

Along the way you’ll hear about the perils of voice agents mispronouncing simple words, Hacker News snark, and why watching fourth-graders play Ultimate Frisbee might be the purest form of agentic chaos.