Planning Agents, Emotional Bias, and Trustworthy Responses

https://is1-ssl.mzstatic.com/image/thumb/Podcasts221/v4/23/72/b6/2372b6f2-e946-2b4a-2d13-3f32527305e3/mza_2092205043051898135.jpg/600x600bb.jpg

Today in arXiv AI

Scot Bearss

7 episodes

3 days ago

Today in arXiv AI is your daily deep dive into the cutting edge of artificial intelligence. Every morning, we unpack the latest breakthroughs in LLM architectures, agentic AI, multimodal models, scaling strategies, safety research and more—mixing expert analysis, lively debate, and real‑world use cases. Whether you’re an AI practitioner, tech leader, or just curious about what’s next, we break down complex papers (and what they mean for you) into a fast‑paced, two‑host conversation you’ll actually enjoy. I am an independent creator and not affiliated with arXiv. Sources linked in descriptions

Technology

RSS

All content for Today in arXiv AI is the property of Scot Bearss and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.

Technology

https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/44125644/44125644-1753362261358-97f877a66347b.jpg

Planning Agents, Emotional Bias, and Trustworthy Responses

Today in arXiv AI

1 hour 16 minutes 12 seconds

3 months ago

Planning Agents, Emotional Bias, and Trustworthy Responses

Generated with Google NotebookLM.

This episode dives into 16 cutting-edge papers that reimagine how LLMs plan, adapt, reason—and stay safe doing it:

Planning meets population play – STRATEGIST lets LLMs refine high-level strategies via text and execute them with Monte Carlo precision, rivaling humans in multi-turn games.
Does tone steer truth? – A systematic study finds GPT-4 resists negative prompt bias—until it doesn’t—revealing tone-induced semantic drift and suppressed emotional alignment.
Geometric insight – Curved Inference tracks how prompts bend the LLM’s residual stream, exposing layers of latent concern and meaning through salience and curvature.
Smarter retrieval, lighter load – SemRAG blends semantic chunking with knowledge graphs to turbocharge domain-specific RAG without the finetuning tax.
Visual agents that learn – VizGenie evolves itself through LLM-generated code and VQA, slashing overhead in scientific visualization tasks.
Tech mapping on autopilot – RATE uses LLMs to extract and validate key tech terms from papers, building networks that outperform BERT-based extractors by 70% F1.
Trust in high-stakes moments – Some models play it safe; others don’t. Sycophancy, clarifying questions, and activation vectors reveal how cautious AI can be shaped.
Guardrails, reimagined – OneShield provides a plug-and-play compliance layer to tailor LLM behavior across privacy, ethics, and safety.
Built-in sabotage defense – SDD defangs malicious fine-tuning by teaching models to answer harmful prompts with elegant irrelevance.
Wireless compositionality – ContextLoRA and ContextGear let one LLM handle multiple multimodal mobile tasks efficiently, backed by task graphs and fine-tuned adaptation.
Measuring uncertainty—properly – A Shapley-based metric replaces naive entropy to better predict when LLMs are bluffing.
Structure for thinking agents – Graph-Augmented LLM Agents use graphs for better planning, tool use, memory, and MAS coordination.
Due diligence done right – A rigorous RAG evaluation protocol blends human and LLM judgment for statistical reliability—perfect for finance and healthcare use cases.
RL, no humans required – RLSF lets models learn from their own confidence levels, improving calibration and reasoning without labels or gold data.
LLMs that plan on phones – MapAgent builds page memory from task traces to navigate mobile UIs with fine-grained, trajectory-aware precision.

These papers showcase a new class of agents: introspective, modular, cautious, and capable of evolving workflows across scientific, mobile, and safety-critical contexts.

Sources: