Shared Everything is VAST Data’s editorial and thought leadership platform, spotlighting the technical frontlines of AI infrastructure, datacenters, and cloud architecture. Through in-depth interviews, expert-led discussions, and narrative-driven content, we explore how the most advanced organizations are architecting for the Agentic Age—where AI, data, and compute converge. Whether it's the latest in GPU optimization, multitenancy design, or the future of data orchestration, we dive deep into the systems and strategies shaping tomorrow’s digital landscape.
All content for Shared Everything is the property of Nicole Hemsoth Prickett and is served directly from their servers
with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
The Future of Reasoning Models and AI Infrastructure
Shared Everything
32 minutes 29 seconds
1 month ago
In this episode of Shared Everything, reasoning models take center stage. No longer just text predictors, they now loop, branch, and pull in outside data, which stretches context windows and pushes GPUs to their limits. Alon Horev, CTO of VAST Data, unpacks how this shift strains infrastructure, while Kevin Deierling, SVP of Networking at NVIDIA, explains how NVIDIA Dynamo moves KV caches and workloads across GPUs, networks, and storage to keep agentic workflows moving. Data platforms become an extension of memory, enabling longer chains of thought, real-time agents, and secure, observable data paths. The result is a vivid picture of the AI datacenter as the nervous system for reasoning at scale.