All content for Embodied AI 101 is the property of Shaoqing Tan and is served directly from their servers
with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
Stay in the loop on research in AI and physical intelligence.
Episode 2: Under the Hood of OpenVLA – Architecture and Inference
Embodied AI 101
10 minutes 39 seconds
3 months ago
Welcome back! Last time we talked about what OpenVLA is at a high level. Now it’s time to lift the hood and see how this engine runs. How can one AI model look at a camera image, read a command, and then generate robot arm motions to fulfill it? In this episode, we’ll break down OpenVLA’s architecture and discuss how it processes inputs and produces actions. If you’re into AI model design (or just curious how the sausage is made), this one’s for you.
So, let’s start with the big picture. O