Cognition, Contracts, and Compression

https://is1-ssl.mzstatic.com/image/thumb/Podcasts221/v4/23/72/b6/2372b6f2-e946-2b4a-2d13-3f32527305e3/mza_2092205043051898135.jpg/600x600bb.jpg

Today in arXiv AI

Scot Bearss

7 episodes

4 days ago

Today in arXiv AI is your daily deep dive into the cutting edge of artificial intelligence. Every morning, we unpack the latest breakthroughs in LLM architectures, agentic AI, multimodal models, scaling strategies, safety research and more—mixing expert analysis, lively debate, and real‑world use cases. Whether you’re an AI practitioner, tech leader, or just curious about what’s next, we break down complex papers (and what they mean for you) into a fast‑paced, two‑host conversation you’ll actually enjoy. I am an independent creator and not affiliated with arXiv. Sources linked in descriptions

Technology

RSS

All content for Today in arXiv AI is the property of Scot Bearss and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.

Technology

https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/44125644/44125644-1753362261358-97f877a66347b.jpg

Cognition, Contracts, and Compression

Today in arXiv AI

26 minutes 30 seconds

3 months ago

Cognition, Contracts, and Compression

Generated Google NotebookLM.

Episode Description:
In this episode, we explore 10 new papers advancing our understanding of how LLMs think, how agents can be trusted, and how systems can scale more efficiently:

What LLMs really "know" – UCCT proposes a formal theory of cognition in LLMs, arguing intelligence is emergent and context-triggered—not intrinsic.
Rethinking RAG – CoCoA and CoCoA-zero show how multi-agent collaboration improves synergy between internal model memory and retrieved context.
Efficiency, by design – Efficient Agents sheds light on cost/performance trade-offs in agent systems, while Blueprint First separates logic from generation to enable deterministic workflows.
Contrastive learning, upgraded – Context-Adaptive Multi-Prompt Embedding improves vision-language alignment with adaptive token prompts and diversity constraints.
Inference-time teaming – CTTS scales up LLM performance via collective test-time scaling, using reward model ensembles and agent collaboration.
At the edge – A new adaptive agent placement and migration framework uses LLMs and ant colony optimization to meet real-time edge constraints.
Smarter chains of thought – A step entropy metric allows LLMs to prune redundant reasoning during inference, improving cost-efficiency without sacrificing accuracy.
Quantization, vision-style – VLMQ brings post-training quantization to Vision-Language Models, optimizing for both modality balance and efficiency.
Reliable by contract – A Design-by-Contract–inspired layer enables neurosymbolic agents to enforce input-output constraints, offering a formal basis for agent safety.

From the nature of LLM cognition to practical methods for verifiable, scalable deployment, this episode highlights where theory meets engineering—and where structure enhances trust.

Sources: