The explicit goal of OpenAI, DeepMind and others is to create AGI.This is insanely risky.It keeps me up at night.AIs smarter than us might:🚨Resist shutdown.🚨Resist us changing their goals.🚨Ruthlessly pursue goals, even if they know it’s not what we want or intended.Some people think I’m nuts for believing this. But they often come round once they hear the central arguments.At the core of the AI doom argument are two big ideas:💡Instrumental Convergence💡The Orthogonality Thesis❌If you don’t understand these ideas, you won’t truly understand why some AI researchers are so worried about AGI or Superintelligence.Oxford philosopher Rhys Southan joined me to explain the situation.💡Rhys Southan and his co-authors Helena Ward and Jen Semler argue that powerful AIs might NOT resist having their goals changed. Possibly a fatal flaw in the Instrumental Convergence Thesis.This would be a BIG DEAL. It would mean we could modify powerful AIs if they go wrong.While I don’t fully agree with their argument, it radically changed how I understand the Instrumental Convergence Thesis and forced me to rethink what it means for AIs to have goals.Check out the paper "A Timing Problem for Instrumental Convergence" here: https://link.springer.com/article/10.1007/s11098-025-02370-4
Do large language models like ChatGPT actually understand what they're saying? Can AI systems have beliefs, desires, or even consciousness? Philosophers Henry Shevlin and Alex Grzankowski debunk the common arguments against LLM minds and explore whether these systems genuinely think.This episode examines popular objections to AI consciousness - from "they're just next token predictors" to "it's just matrix multiplication" - and explains why these arguments fail. The conversation covers the Moses illusion, competence vs performance, the intentional stance, and whether we're applying unfair double standards to AI that we wouldn't apply to humans or animals.Key topics discussed:
Featured paper: "Deflating Deflationism: A Critical Perspective on Debunking Arguments Against LLM Mentality"Authored by Alex Grzankowski, Geoff Keeling, Henry Shevlin and Winnie Street
Guests:Henry Shevlin - Philosopher and AI ethicist at the Leverhulme Centre for the Future of Intelligence, University of CambridgeAlex Grzankowski - Philosopher at King's College London#AI #Philosophy #Consciousness #LLM #ArtificialIntelligence #ChatGPT #MachineLearning #CognitiveScience
LLMs like ChatGPT are incredibly useful for coding. So naturally they can also be useful for hacking. Tony Anscombe explains how his cybersecurity company ESET discovered the first AI powered ransomware, and its unexpected origins.
Different species solve different problems, so how can we say one is smarter than another? To me, it's intuitively obvious that humans are the most intelligent species on the planet. But Professor Peter Bentley from UCL argues we are intelligent in different ways and cannot be ranked.