
https://arxiv.org/abs/2501.06642
This manuscript argues that achieving true artificial intelligence (AI) autonomy requires integrating **common sense**, a fundamental ability observed in all animals, which current systems often lack. The text critiques existing benchmarks like the Turing Test and ARC challenge for not effectively evaluating this capacity, suggesting that **scaling AI models** and passing such tests is insufficient for real-world adaptability and decision-making. The authors propose a **shift in AI development**, emphasizing starting with minimal knowledge, contextual learning, adaptive reasoning, and a broader concept of **embodiment** in both physical and abstract domains. They advocate for **rethinking the AI software stack** and creating new benchmarks to prioritize common sense, asserting that this is essential to avoid performance plateaus and unlock AI's full societal and commercial value.