Breaking Agent Backbones: AI agents are being deployed at scale, but their security is challenged by non-deterministic behavior and novel vulnerabilities. This episode introduces the "threat snapshot" framework and the new b3 benchmark, which systematically isolate and evaluate security risks stemming from the backbone LLM. We reveal crucial findings: enhanced reasoning capabilities generally improve security, yet model size does not correlate with lower vulnerability scores.
In this episode, OpenAI leaders share unprecedented transparency regarding their research goals, aiming for a fully automated AI researcher by March 2028 and discussing the rapid approach of superintelligence. They detail a new structure, featuring a nonprofit foundation that governs a Public Benefit Corporation, essential for attracting the resources needed for their colossal $1.4 trillion infrastructure commitment. The discussion also covers the pivot to an AI cloud platform model, the importance of accelerating scientific discovery, and the establishment of AI resilience efforts to handle societal risks.
Welcome to the new era of coding collaboration: Agent HQ is here, establishing GitHub as the centralized home for developers and a fleet of AI coding agents. We explore how the fully-fledged GitHub Copilot agent, alongside partners like Claude and Codex, now operates with deeper context and the ability to execute and coordinate tasks across the developer workflow. Discover how innovations like Mission Control and Plan Mode provide developers with the confidence and control to orchestrate parallel tasks and integrate AI natively into their existing processes, fundamentally changing the developer tool chain.
We delve into Jensen Huang's vision that Artificial Intelligence marks the New Industrial Revolution, positioning it as essential national infrastructure and America's next Apollo moment. We explore how NVIDIA's extreme co-design and Accelerated Computing enable new "AI Factories," achieving 10X generational performance leaps to drive down the cost of generating intelligence. The episode concludes by examining new strategic platforms, including 6G telecommunications (NVIDIA ARC), hybrid quantum computing, and the exponential rise of physical AI and robotics.
The modern workplace often buries professionals under context switching and scattered technology, hindering the productivity gains promised by AI. This episode explores the three stages of working smarter: Block Distractions, Scale Yourself, and Get Results, focusing on how a unified AI platform removes friction. Discover how to move past busywork, amplify your natural curiosity, and channel your enhanced capabilities toward strategic, measurable outcomes that define your career progression.
Join Lance from LangChain and Pete from Manus as they dive deep into the crucial discipline of Context Engineering for building effective AI agents. This webinar explores the challenge of context explosion—where performance drops as long-running agents accumulate tool call observations—and the core themes used to combat it: offloading, reducing, retrieving, and isolating context. Pete shares fresh lessons from building Manus, detailing the difference between reversible compaction and irreversible summarization, and how their layered action space manages tool confusion.
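As a rough sketch of the compaction-versus-summarization distinction discussed here, consider the following Python; the `Entry` structure and the external store are illustrative assumptions, not Manus internals:

```python
from dataclasses import dataclass

@dataclass
class Entry:
    tool: str
    args: dict
    observation: str        # full tool output as seen by the agent
    ref: str | None = None  # where the full output lives once offloaded

def compact(history: list[Entry], store: dict) -> list[Entry]:
    """Reversible compaction: offload each full observation to an
    external store and keep only a pointer in context. Nothing is
    lost, because the agent can re-read the stored content later."""
    out = []
    for i, e in enumerate(history):
        key = f"obs/{i}.txt"
        store[key] = e.observation
        out.append(Entry(e.tool, e.args, f"[stored at {key}]", ref=key))
    return out

def summarize(history: list[Entry]) -> list[Entry]:
    """Irreversible summarization: collapse old entries into a single
    summary entry. Detail is gone for good, so this should fire only
    after compaction no longer frees enough tokens."""
    digest = "; ".join(f"{e.tool} -> {e.observation[:40]}" for e in history)
    return [Entry("summary", {}, digest)]
```

The ordering is the point: compact first because it is free to undo, and summarize only as a last resort.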
Welcome to an essential discussion on Vibe Coding, the new paradigm where developers shift from writing code line-by-line to orchestrating and validating outputs from autonomous AI agents. We'll formalize Vibe Coding as an engineering discipline, exploring its foundations in Large Language Models, complex agent architectures (like planning and memory mechanisms), and integrated feedback loops. Join us as we break down the five distinct development models—from Unconstrained Automation to Test-Driven approaches—and debate the critical challenges of achieving reliable, secure, and scalable human-AI collaboration in software engineering.
Chip Huyen, author of AI Engineering and AI strategy expert from NVIDIA and Netflix, breaks down the technical basics of building successful AI products, covering pre-training, RAG, RLHF, and effective evaluation design. We tackle the growing AI "idea crisis" and the crucial gap between what builders think improves AI applications (like chasing the latest news) versus what actually works (like focusing on user feedback and data preparation). Chip offers essential, in-depth insights into system thinking, organizational structure shifts, and where real productivity gains are being found in the field of AI engineering.
Amid the hype around ChatGPT Atlas, let's talk about the dark side of browsing AI agents.
Welcome to the show, where we discuss DeepSeek-OCR and its investigation into using optical 2D mapping for context compression, addressing the computational challenges of quadratic scaling faced by Large Language Models. We explore the DeepEncoder, the core engine designed to achieve high compression ratios, delivering near-lossless OCR precision (approximately 97%) even at a 10× token reduction. This groundbreaking work demonstrates strong practical value, achieving state-of-the-art document parsing performance on OmniDocBench while using the fewest vision tokens, offering a promising direction for future memory systems.
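As a back-of-the-envelope illustration of the compression claim (the function is plain arithmetic; the token counts are illustrative):

```python
def compression_ratio(text_tokens: int, vision_tokens: int) -> float:
    """How many text tokens each vision token stands in for when a
    page is rendered to an image and fed to the encoder."""
    return text_tokens / vision_tokens

# A page whose transcript costs ~1,000 text tokens, encoded into
# ~100 vision tokens, is the 10x regime where the paper reports
# roughly 97% OCR precision.
print(compression_ratio(1000, 100))  # 10.0
```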
This episode explores Anthropic's revolutionary 'Skills,' a new way to implement Standard Operating Procedures (SOPs) for LLM agents, ensuring consistent, high-quality output for specialized tasks like Excel analysis and document formatting. We dive into how these portable folders contain instructions and executable code, allowing Claude to efficiently access deep, specialized expertise only when needed. Learn the best practices for authoring these skills—from conciseness and appropriate degrees of freedom to iterative testing—as LLM platforms rapidly evolve into customizable agentic environments.
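A minimal sketch of the "load expertise only when needed" pattern, assuming the published SKILL.md convention with `name:` and `description:` frontmatter lines; the loader itself is hypothetical:

```python
from pathlib import Path

def skill_index(skills_dir: str) -> list[str]:
    """Keep only each skill's one-line description in context, so the
    model can cheaply decide whether a skill applies."""
    entries = []
    for meta in Path(skills_dir).glob("*/SKILL.md"):
        fields = {}
        for line in meta.read_text().splitlines():
            if ":" in line:
                key, value = line.split(":", 1)
                fields[key.strip()] = value.strip()
        entries.append(f"{fields.get('name', meta.parent.name)}: "
                       f"{fields.get('description', '')}")
    return entries

def load_skill(skills_dir: str, name: str) -> str:
    """Pull the full instructions (and any bundled code) into context
    only once the model has chosen to use this skill."""
    return (Path(skills_dir) / name / "SKILL.md").read_text()
```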
SEAL, the Self-Adapting Language Model framework, is revolutionizing how LLMs learn by enabling them to generate their own finetuning data and update directives. We explore how these powerful models create "self-edits"—synthetic training data and optimization parameters—which are continuously refined through a reinforcement learning loop. Discover how this meta-learning approach allows LLMs to efficiently incorporate new factual knowledge and significantly improve few-shot generalization success rates.
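A toy rendering of that loop in Python; `ToyModel` reduces an LLM to two scalars, so this shows the shape of the algorithm, not the paper's implementation:

```python
import random

class ToyModel:
    """Stand-in for an LLM; `skill` is a scalar proxy for its weights."""
    def __init__(self, skill=0.1, edit_policy=0.5):
        self.skill = skill               # task competence after finetuning
        self.edit_policy = edit_policy   # how good its self-edits tend to be

    def generate_self_edit(self):
        # In SEAL a self-edit bundles synthetic training data with
        # update directives; here it is just a sampled quality score.
        return random.gauss(self.edit_policy, 0.2)

    def finetune(self, edit_quality):
        # Inner loop: apply the self-edit as a weight update.
        return ToyModel(self.skill + max(edit_quality, 0.0) * 0.1, self.edit_policy)

def seal_loop(model, steps=20):
    """Outer RL loop: reward the edit-generating policy by how much
    the applied edit improved downstream performance."""
    for _ in range(steps):
        edit = model.generate_self_edit()
        candidate = model.finetune(edit)
        reward = candidate.skill - model.skill
        model.edit_policy += 0.5 * reward       # crude policy-gradient step
        if reward > 0:
            candidate.edit_policy = model.edit_policy
            model = candidate                   # keep edits that helped
    return model

print(f"final skill: {seal_loop(ToyModel()).skill:.2f}")
```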
Are expensive Large Language Model (LLM) fine-tuning methods holding back your specialized agents, demanding massive computational resources and data? We dive into Training-Free Group Relative Policy Optimization (Training-Free GRPO), a novel non-parametric method that enhances LLM agent behavior by distilling semantic advantages from group rollouts into lightweight token priors, eliminating costly parameter updates. Discover how this highly efficient approach achieves significant performance gains in specialized domains like mathematical reasoning and web searching, often surpassing traditional fine-tuning while using only dozens of training samples.
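One schematic step of the idea, with `llm` and `judge` as assumed stand-in callables rather than any real API:

```python
def training_free_grpo_step(llm, judge, query: str, prior: list[str],
                            group_size: int = 4) -> list[str]:
    """Sample a group of rollouts, score them, and distill why the
    best beat the worst into a natural-language lesson appended to a
    lightweight token prior -- the 'semantic advantage' lives in
    tokens, not in weight updates."""
    rollouts = [llm(query, context=prior) for _ in range(group_size)]
    scores = [judge(query, r) for r in rollouts]
    mean = sum(scores) / len(scores)
    advantages = [s - mean for s in scores]        # group-relative, as in GRPO
    best = rollouts[advantages.index(max(advantages))]
    worst = rollouts[advantages.index(min(advantages))]
    lesson = llm(f"Contrast two answers to '{query}'. In one sentence, "
                 f"what did the better one do right?\n"
                 f"Better: {best}\nWorse: {worst}")
    prior.append(lesson)                           # reused on future queries
    return prior

# Stand-in usage:
prior = training_free_grpo_step(lambda q, context=None: "an answer",
                                lambda q, r: len(r), "2+2?", [])
print(prior)
```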
Join us for a deep dive with Greg Brockman on the future of AI, where he reveals the internal struggle ("pain and suffering") of managing compute scarcity and the immense physical infrastructure build required to scale systems like Sora 2. Brockman discusses the shift from viewing AGI as a destination to a continuous process, emphasizing that current scaling curves and algorithmic progress continue unabated. We also explore the inevitable move toward proactive AI agents and a fully generative web, predicting a major change to the social contract and web monetization.
Tune in as we explore Agentic Context Engineering (ACE), a novel framework designed to overcome limitations like "brevity bias" and "context collapse" that plague traditional LLM context adaptation methods. ACE transforms model contexts into continuously evolving, structured "playbooks" by employing a modular process of generation, reflection, and curation. We discuss how this approach enables scalable, self-improving agents, yielding substantial performance gains on complex tasks—such as +10.6% on agent benchmarks—while significantly lowering adaptation latency and cost.
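A compressed sketch of one generation-reflection-curation round; the three callables are illustrative stand-ins, not the paper's components:

```python
def ace_update(playbook: list[str], task: str, generator, reflector, curator) -> list[str]:
    """One adaptation round in the ACE style: generate with the current
    playbook, reflect on the trajectory, and curate the insight in as an
    incremental delta instead of rewriting the whole context -- the
    rewrite-everything habit is what drives brevity bias and context
    collapse."""
    trajectory = generator(task, playbook)   # attempt the task using the playbook
    insight = reflector(task, trajectory)    # diagnose what worked or failed
    return curator(playbook, insight)        # append/merge a structured bullet

# Stand-in usage: the curator appends rather than rewrites.
pb = ace_update([], "book a flight",
                generator=lambda t, p: f"tried '{t}' with {len(p)} tips",
                reflector=lambda t, tr: f"lesson from: {tr}",
                curator=lambda p, i: p + [i])
print(pb)
```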
This episode explores the Tiny Recursive Model (TRM), a novel approach that leverages a single, tiny network (as small as 7M parameters) to tackle hard puzzle tasks like Sudoku, Maze, and ARC-AGI. We investigate how this simplified, recursive reasoning strategy achieves significantly higher generalization and outperforms much larger models, including complex Large Language Models (LLMs) and the Hierarchical Reasoning Model (HRM). Discover why this "less is more" philosophy is driving breakthroughs in parameter-efficient AI reasoning by discarding the complex mathematical theorems and biological justifications that earlier recursive approaches relied on.
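The recursive core is simple enough to sketch in a few lines of PyTorch; dimensions, loop counts, and the shared core network here are illustrative, not the paper's exact configuration:

```python
import torch
import torch.nn as nn

class TinyRecursiveModel(nn.Module):
    """Schematic TRM: one small network applied recursively. It keeps a
    latent scratchpad z and a current answer y, refines z several times
    against the question x, then refines y; the outer loop repeats the
    whole cycle."""
    def __init__(self, d=64):
        super().__init__()
        self.core = nn.Sequential(nn.Linear(3 * d, d), nn.ReLU(), nn.Linear(d, d))

    def forward(self, x, y, z, inner=6, outer=3):
        for _ in range(outer):
            for _ in range(inner):                          # refine latent reasoning state
                z = self.core(torch.cat([x, y, z], dim=-1))
            y = self.core(torch.cat([x, y, z], dim=-1))     # refine the answer itself
        return y

x = y = z = torch.zeros(1, 64)
print(TinyRecursiveModel()(x, y, z).shape)  # torch.Size([1, 64])
```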
We demystify Large Language Model (LLM) evaluation, breaking down the four main methods used to compare models: multiple-choice benchmarks, verifiers, leaderboards, and LLM judges. We offer a clear mental map of these techniques, distinguishing between benchmark-based and judgment-based approaches to help you interpret performance scores and measure progress in your own AI development. Discover the pros and cons of each method—from MMLU accuracy checks to the dynamic Elo ranking system—and learn why combining them is key to holistic model assessment.
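For the leaderboard method, the Elo arithmetic is standard and easy to show:

```python
def elo_update(r_a: float, r_b: float, a_wins: bool, k: float = 32.0):
    """Standard Elo update as used by LLM leaderboards: convert the
    rating gap into an expected win probability, then move both
    ratings toward the observed head-to-head outcome."""
    expected_a = 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400.0))
    score_a = 1.0 if a_wins else 0.0
    r_a += k * (score_a - expected_a)
    r_b += k * ((1.0 - score_a) - (1.0 - expected_a))
    return r_a, r_b

# Two models at 1000: the winner of a single vote gains 16 points.
print(elo_update(1000, 1000, a_wins=True))  # (1016.0, 984.0)
```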
Original blog post: https://magazine.sebastianraschka.com/p/llm-evaluation-4-approaches
OpenAI DevDay 2025 marked the start of the "agentic era" of software development, focusing on making it "easier to build with AI" and transitioning AI from a "chatbot" into a "doer". We break down the revolutionary AgentKit, featuring Agent Builder, a visual, drag-and-drop platform launched to help developers rapidly deploy multi-step AI agents from prototype to production. We also discuss the new Apps SDK for seamlessly integrating third-party services into ChatGPT and the debut of powerful models like GPT-5 Pro and Sora 2, signifying that software development now takes minutes, not months.
Join us as Turing Award recipient Yann LeCun, Chief Scientist at Meta, critiques the state of AI, arguing that current systems, including Large Language Models (LLMs), are nowhere near matching the learning efficiency observed in humans and animals. LeCun proposes a major architectural shift, advocating that AI must abandon generative models for training and instead focus on building internal "World Models" to enable reasoning and planning. Discover how the Joint Embedding Predictive Architecture (JEPA) uses self-supervised learning to train machines to acquire robust, abstract representations of reality, a crucial step toward achieving common sense and human-level intelligence.
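A minimal joint-embedding sketch in PyTorch: the model predicts the target's embedding rather than reconstructing the input, which is the core departure from generative training. Sizes are illustrative; real JEPA variants such as I-JEPA use ViT encoders and an EMA-updated target encoder.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyJEPA(nn.Module):
    """Encode a context view and a target view, then predict the
    target's representation from the context's -- no pixel-space
    reconstruction anywhere in the loss."""
    def __init__(self, d_in=128, d=32):
        super().__init__()
        self.context_enc = nn.Linear(d_in, d)
        self.target_enc = nn.Linear(d_in, d)   # typically an EMA copy in practice
        self.predictor = nn.Sequential(nn.Linear(d, d), nn.ReLU(), nn.Linear(d, d))

    def loss(self, context_view, target_view):
        pred = self.predictor(self.context_enc(context_view))
        with torch.no_grad():                   # stop-gradient on the target branch
            tgt = self.target_enc(target_view)
        return F.mse_loss(pred, tgt)            # distance in representation space

model = ToyJEPA()
x = torch.randn(8, 128)
print(model.loss(x, x + 0.1 * torch.randn_like(x)).item())
```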
Are smart machines making us forget how to think? This episode dives into the quiet phenomenon of AI-induced skill erosion, where relying on intelligent systems creates an "illusion of mastery" while core competence fades. We explore the organizational implications of deskilling and discuss strategies, such as targeted auditing and better system design, needed to preserve expertise when AI handles essential tasks.