
This episode dives deep on the Gemini-Robotics-1-5-Tech-Report report; significant advancement in generalist robots through the introduction of the Gemini Robotics 1.5 model family. This system features two core components: Gemini Robotics 1.5 (GR 1.5), a Vision-Language-Action (VLA) model that translates instructions into robot actions and supports multi-embodiment control, and Gemini Robotics-ER 1.5 (GR-ER 1.5), an enhanced Vision-Language Model (VLM) specialized in complex embodied reasoning and high-level task planning.