Rebooting AI and the Problem of Common Sense
Gary Marcus and Ernest Davis's Rebooting AI is useful because it refuses two lazy stories at once. It does not say artificial intelligence is fake. It says that impressive pattern recognition is not yet the same thing as robust understanding, and that the gap matters most when institutions hand systems real authority.
The Book
Rebooting AI: Building Artificial Intelligence We Can Trust was published by Pantheon in 2019 and later issued as a Vintage paperback. Penguin Random House lists Gary Marcus and Ernest Davis as the authors, gives the paperback edition as 288 pages, and describes the book as an argument for robust AI beyond narrow, closed systems.
Marcus is a cognitive scientist and AI critic whose work has long pushed against pure data-scaling stories. Davis is a computer scientist at NYU known for work on common-sense reasoning. Their collaboration matters because the book is not only a policy warning. It is a cognitive argument about what kind of machinery intelligence requires.
The Gap
The book's central move is to separate performance from understanding. A system can win a game, classify images, translate routine phrases, or produce plausible prose while still failing when the world changes, the task becomes open-ended, or the situation requires background knowledge that was never explicitly stated.
This distinction is easy to blur because AI systems are usually encountered through polished interfaces. The user sees an answer, a route, a recommendation, a generated paragraph, or a confidence score. The interface compresses uncertainty into a finished object. Rebooting AI asks the reader to look behind that surface and ask what the system actually knows, what it merely correlates, and what breaks when conditions shift.
That remains a live problem after the rise of large language models. Scale has made systems more fluent, more useful, and more surprising than the 2019 public conversation expected. But fluency has not abolished the difference between producing a plausible continuation and maintaining a reliable model of cause, context, exception, embodiment, and consequence.
Common Sense as Infrastructure
Marcus and Davis treat common sense as a technical problem, not a folksy virtue. Common sense includes ordinary physical reasoning, social expectations, causal structure, time, object permanence, goals, affordances, and the implicit background facts that humans use without noticing.
That kind of knowledge is not decorative. It is infrastructure for action. A medical system needs to know when a symptom is urgent even if the wording is unusual. A household robot needs to know that a glass near an edge can fall. A legal or welfare assistant needs to know that people describe the same life event in partial, frightened, contradictory ways. A content or safety system needs to know that context can reverse meaning.
The authors' broader point is that intelligence cannot be trusted merely because it works on benchmarks. Benchmarks are legible environments. Human life is not. A model can become excellent at the test-shaped version of the world while remaining brittle in the world people actually inhabit.
Trust Is a Design Burden
The subtitle, "Building Artificial Intelligence We Can Trust," is doing real work. Trust is not a feeling the interface should induce. It is a burden the system must earn through robustness, transparency, correction, causal competence, and bounded deployment.
This is where the book connects to institutional life. Many organizations want AI to absorb uncertainty: summarize the file, rank the applicant, flag the risk, draft the response, drive the vehicle, advise the patient, monitor the worker. The system becomes attractive precisely because it turns messy judgment into an output that can be routed through a workflow.
But if the system lacks common sense, the institution does not escape judgment. It only relocates judgment into training data, vendors, prompts, dashboards, review queues, and liability language. The person affected by the decision may then face a smooth administrative surface with no obvious place to contest the mistake.
The AI-Age Reading
The strongest AI-age reading of Rebooting AI is not "deep learning failed." That would be too simple. Deep learning has transformed language, vision, code, search, translation, recommender systems, and scientific tooling. The sharper reading is that capability gains do not automatically answer the reliability questions that matter when systems act on people.
Large models often behave like extraordinary cultural compressors. They can draw on patterns from vast bodies of human text, code, and media. That makes them powerful at the interface layer: explanation, imitation, drafting, routing, abstraction, and style transfer. It also makes them dangerous when interface success is mistaken for grounded competence.
The book's common-sense argument therefore belongs beside current work on world models, tool-using agents, evaluations, and AI assurance. A future agent may need language fluency, learned representations, explicit knowledge, causal models, memory, planning, and external tools. The question is less which faction wins and more whether the assembled system can remain inspectable, interruptible, and corrigible once it is embedded in real institutions.
Where the Book Needs Friction
Rebooting AI was written before the public explosion of frontier language models. Some readers will find its emphasis on deep-learning limits too confident in places where scale later changed the practical frontier. A fair review has to say that: many tasks that looked remote in 2019 became ordinary by the middle of the 2020s.
But that does not make the book obsolete. It shifts the burden of interpretation. The book is weakest if treated as a forecast of which architecture would dominate the next product cycle. It is strongest as a checklist for trust: Can the system handle novelty? Can it explain its grounds? Does it know cause from correlation? Can it use background knowledge? Does it fail gracefully? Can affected people appeal?
Independent reception also read the book in that spirit. A 2021 SIAM News review described it as both a scientific assessment and a "food for thought" text, especially useful for clarifying the difference between AI broadly and machine learning narrowly.
The Site Reading
For this site, Rebooting AI is a book about the danger of mistaking a responsive surface for a responsible mind.
That danger recurs across AI companions, workplace copilots, automated welfare systems, search answers, robotics, safety filters, and agentic tools. A system can sound helpful while lacking the world model needed to understand the downstream effects of help. It can produce confidence while hiding brittleness. It can make an institution feel more rational while narrowing the channels through which reality can object.
The practical lesson is not to reject AI systems until they become human-like. It is to match authority to demonstrated competence. Use models where their failure modes are tolerable, observable, and correctable. Keep humans responsible where context, care, rights, bodies, money, freedom, and public legitimacy are at stake. Require source trails, appeal paths, narrow permissions, independent audits, and deployment boundaries.
The book's lasting value is its insistence that intelligence is more than output. In an age of fluent machines, that distinction is no longer academic. It is the line between assistance and automated authority.
Sources
- Penguin Random House, Rebooting AI by Gary Marcus and Ernest Davis.
- Oana Marin, SIAM News, "How Intelligent is Artificial Intelligence?", January 25, 2021.
- Gary Marcus, "Deep Learning: A Critical Appraisal", arXiv, 2018.
- Ernest Davis, "Mathematics, word problems, common sense, and artificial intelligence", arXiv, 2023.
- Ernest Davis, NYU, author page for Rebooting AI.
Book links are paid affiliate links. As an Amazon Associate I earn from qualifying purchases.