How Data Happened and the History of Machine-Readable Power
Chris Wiggins and Matthew L. Jones's How Data Happened is useful because it refuses the origin myth of artificial intelligence. The book treats contemporary AI as the latest arrangement in a longer history of data, statistics, state administration, corporate measurement, military logistics, racial classification, search, surveillance, and machine learning.
The Book
How Data Happened: A History from the Age of Reason to the Age of Algorithms was published by W. W. Norton in 2023. Princeton's History Department lists the authors as Chris Wiggins and Matthew L. Jones, gives the ISBN as 978-1324006732, and describes the book as a history of data's technical, political, and ethical impact. SIAM Review's 2025 featured review identifies the paperback as xiv+367 pages.
The authors bring an unusual pairing to the subject. Wiggins is an applied mathematician at Columbia whose work includes machine learning and computational biology. Jones is a historian of science at Princeton who studies information technologies, intelligence, surveillance, mathematical thinking, and the politics of knowledge. UVA Law's entry for Jennifer Chapman's Law Library Journal review emphasizes that the book combines those specialties to give historical context to a data-dominated society.
The result is neither a simple tech history nor a warning pamphlet about algorithms. It is a long institutional history of how facts become countable, how countable facts become administrative objects, and how administrative objects become targets for prediction, sorting, and control.
Data as Power
The book's strongest move is to treat data as something made. Data does not arrive raw from the world. It is selected, formatted, cleaned, categorized, stored, defended, and made available to particular actors for particular purposes. That sounds obvious until a dashboard, model score, or generated answer arrives with the tone of neutral discovery.
Wiggins and Jones show that data history is also a history of institutions asking what they need to know in order to govern, sell, insure, police, manage, persuade, or fight. Census practices, statistics, eugenics, industrial measurement, wartime computation, behavioral advertising, search, and machine learning are not identical systems. But they share a recurring operation: turn messy life into records that can travel through institutions.
That operation is productive and dangerous at the same time. It can support public health, scientific discovery, logistics, and accountability. It can also give a durable technical form to prejudice, extraction, surveillance, and managerial fantasy. The point is not that counting is corrupt. The point is that counting is never outside power.
This matters for AI because many present-day arguments begin too late. They start at the model, the benchmark, the prompt, the chatbot, or the automated decision. How Data Happened pushes the reader backward: who made the categories, who collected the records, who got excluded, who paid for the infrastructure, who had the right to inspect it, and which older social order became the training substrate?
Legibility With Memory
Read beside Seeing Like a State, the book sharpens the idea of legibility. James C. Scott focused on how states simplify local life so it can be administered. Wiggins and Jones extend the problem into a world where legibility is computational, commercial, networked, and cumulative.
Modern legibility does not just make a person visible to an office. It can preserve traces, link databases, infer missing attributes, rank future behavior, personalize prices, automate suspicion, generate summaries, and feed the next model. The person becomes not only readable but reusable.
That is the bridge from data history to recursive reality. A classification changes the world it claims to describe. A credit score changes financial possibility. A search rank changes what becomes authoritative. A policing model changes where police look, which changes what is recorded, which changes the next model. A hiring filter changes the population of workers, then calls that population evidence of fit. Data systems do not merely observe social patterns; they help produce the patterns later treated as data.
The book is especially valuable because it keeps that recursion historical. It does not treat today's algorithmic systems as unprecedented magic. It shows the older administrative, mathematical, and commercial habits that made the current moment plausible.
The AI-Age Reading
Columbia Business School's account of Wiggins's 2023 talk highlights one of the book's central claims: AI is not a sudden arrival but a moving target whose meaning changes across communities and periods. The same article connects the book to facial recognition, automated loan and bail decisions, the 1956 Dartmouth meeting, Herbert Simon, and the continuing tension among state power, corporate power, and people power.
That framing is more useful than the usual binary of AI optimism versus AI doom. It asks what kind of social machinery has to exist before a model can matter. A frontier model needs data centers, chips, labor, benchmarks, web corpora, cloud contracts, APIs, evaluation regimes, procurement offices, compliance paperwork, user interfaces, and institutions ready to act on its outputs. Intelligence becomes power only when it is attached to pipes, records, permissions, budgets, and routines.
The book also helps separate capability from authority. A model may summarize, classify, predict, or generate with impressive fluency. But institutional authority comes from the surrounding arrangement: whether the output enters a medical record, a school discipline file, a welfare eligibility decision, a police report, a credit denial, a battlefield interface, or a workplace dashboard. The crucial question is not simply what the system can infer. It is where the inference lands.
That is why data provenance, appeal rights, audit trails, deletion, contestability, and public memory belong near the center of AI governance. They are not bureaucratic afterthoughts. They are the mechanisms by which people can resist being trapped inside bad records and self-confirming categories.
Where the Book Needs Friction
The breadth that makes How Data Happened valuable also creates limits. A history that runs from earlier statistical reason to contemporary machine learning must compress many episodes. Readers looking for a deep technical treatment of neural networks, transformer architectures, recommender systems, or data-center economics will need companion texts.
There is also a recurring temptation in broad data histories to make "data" carry too much explanatory weight. Some harms come from datafication itself. Others come from property regimes, labor markets, racism, policing, advertising incentives, weak public institutions, geopolitical competition, and procurement habits. The cleanest reading of the book is not that data causes everything. It is that data gives older powers new durability, scale, and operational speed.
The book's constructive argument also needs institutional specificity. Saying that societies can choose better data futures is right, but choice has to be located somewhere: courts, agencies, unions, professional standards, procurement rules, public-interest research, democratic oversight, technical standards, and refusal rights. Without those levers, "ethical data" becomes another soft phrase that powerful systems can absorb.
The Site Reading
How Data Happened belongs in this catalog because it explains the prehistory of the AI interface. Before the chatbot answers, before the agent acts, before the model gives a score, there is a long chain of choices about what can be known, stored, linked, optimized, and acted upon.
The book is a corrective to technological spectacle. It asks the reader to look beneath the polished surface of prediction and see the census form, the category, the database, the laboratory, the military contract, the advertising market, the search index, the compliance office, and the institutional appetite for machine-readable certainty.
That appetite is the real subject. Institutions like systems that make people easier to see, compare, price, rank, discipline, and serve. People often want the convenience those systems provide. The danger begins when the record becomes more actionable than the person, when the model's world becomes administratively easier to believe than lived reality, and when correction arrives too late to matter.
The best reason to read Wiggins and Jones now is that they make AI feel older without making it feel harmless. Today's systems inherit centuries of measurement politics, but they also add scale, speed, personalization, and automation. The lesson is not nostalgia for a pre-data world. It is discipline: every machine-readable reality has authors, funders, categories, blind spots, and beneficiaries. Governance starts by making those arrangements visible before they harden into common sense.
Sources
- Princeton University Department of History, How Data Happened: A History from the Age of Reason to the Age of Algorithms, publication details, ISBN, publisher, subject areas, and publisher description, reviewed May 20, 2026.
- Princeton University Department of History, Matthew L. Jones faculty profile, academic biography, research interests, and publication details, reviewed May 20, 2026.
- Columbia University Department of Systems Biology, Chris Wiggins faculty profile, academic biography, affiliations, and research areas, reviewed May 20, 2026.
- Columbia Business School, "How Data Happened: A History from the Age of Reason to the Age of Algorithms", August 2, 2023, account of Wiggins's talk and book themes, reviewed May 20, 2026.
- Jennifer Chapman, "Book Review: How Data Happened: A History from the Age of Reason to the Age of Algorithms", Law Library Journal 116, no. 3, 2024, via University of Virginia School of Law, reviewed May 20, 2026.
- Rachel Roca, "Featured Review: How Data Happened: A History from the Age of Reason to the Age of Algorithms", SIAM Review 67, no. 2, 2025, doi:10.1137/24M1635521, reviewed May 20, 2026.
Book links are paid affiliate links. As an Amazon Associate I earn from qualifying purchases.