Wiki · Person · Last reviewed May 19, 2026

Holden Karnofsky

Holden Karnofsky is a nonprofit founder, philanthropic strategist, and AI risk writer known for co-founding GiveWell and Open Philanthropy, popularizing the "most important century" framing, helping define transformative AI as a major grantmaking priority, and later working directly on AI safety strategy and frontier risk management.

Snapshot

GiveWell and Open Philanthropy

Karnofsky co-founded GiveWell in 2007 with Elie Hassenfeld after working in finance. GiveWell's public story describes the organization as an attempt to make charitable giving more evidence-based by comparing interventions and charities on expected impact rather than reputation, emotional appeal, or donor habit.

That background matters for AI because Karnofsky entered the field through cause prioritization. His question was not first "what is the most elegant model?" but "what future problem could be large, neglected, tractable, and worth moving money and talent toward?" This style shaped Open Philanthropy's later approach to global catastrophic risks.

Open Philanthropy grew out of GiveWell Labs and became a major grantmaking institution backed by Dustin Moskovitz and Cari Tuna's philanthropic resources. Open Philanthropy has described its AI safety and security work as beginning in 2015, when relatively few philanthropists were focused on the area. The organization's AI portfolio supported technical alignment research, governance work, model evaluation, forecasting, biosecurity-adjacent safeguards, and field-building around advanced AI risk.

In April 2024, Open Philanthropy announced that Karnofsky was leaving after co-founding and helping lead the organization, moving to a visiting-scholar role at the Carnegie Endowment for International Peace. That transition marked a shift from grantmaking leadership toward direct work on AI risk strategy.

Transformative AI

Karnofsky's 2016 Open Philanthropy writing helped define "transformative AI" as AI that could drive a transition comparable to, or more significant than, the agricultural or industrial revolution. The framing deliberately avoided requiring human-like cognition in every respect. A system could be transformative by changing the economy, science, war, governance, or technological progress at civilizational scale.

Open Philanthropy later summarized that 2016 view as assigning at least a 10 percent chance that transformative AI could arrive by 2036. Whether one accepts that estimate or not, it gave funders and researchers a concrete planning horizon: not distant myth, but a possible problem within the career span of people already making decisions.

This vocabulary helped shape AI safety strategy because it connected timelines, scale of impact, and institutional preparation. It also created a middle category between narrow deployed systems and fully specified artificial general intelligence. That made the conversation easier to connect to policy, grants, forecasting, and evaluation.

Most Important Century

Karnofsky's Cold Takes series "The Most Important Century" argued that the 21st century could be unusually consequential because advanced AI might radically accelerate science, technology, and economic development. The argument combines a historical claim about growth transitions with a forecasting claim about AI systems capable of automating large parts of research and production.

The series became influential because it was written for public reasoning rather than only for specialists. It did not require the reader to accept every detail of a technical alignment argument. It asked whether one should treat advanced AI as a plausible driver of a deep, abrupt change in the human condition, and what that would imply for careers, institutions, philanthropy, and government readiness.

The frame also has risks. "Most important century" can motivate serious preparation, but it can also intensify urgency, status competition, and overconfident forecasting. In AI culture, the phrase sits close to both responsible long-range planning and the psychological hazard of believing one lives at the center of history.

Policy and Risk Management

At Carnegie, Karnofsky wrote about "if-then commitments" for AI risk reduction: advance commitments by companies or governments that specify what protective measures should activate if models reach particular dangerous capabilities. The idea tries to prepare for uncertain risk without requiring policymakers to settle every technical dispute in advance.

His Carnegie writing focused on catastrophic risks from future AI capabilities, including cyber offense and chemical or biological weapons assistance. He argued that the current generation of systems may not yet pose the most severe risks, while also stressing that capabilities could change quickly enough that waiting for certainty would leave too little time to prepare.

Karnofsky also argued that AI risk management should be developed with the same ambition and urgency as AI products. That posture is important: safety work should not be a slow external appendix to a fast industry. It should iterate, test, learn, and build operational machinery at a pace comparable to the technology it is trying to govern.

Anthropic and Conflicts

Karnofsky's later association with Anthropic places him inside the frontier-lab system his earlier work helped fund and scrutinize. Public biographies and reporting in 2025 described him as working on Anthropic's Responsible Scaling Policy and other preparations for highly advanced AI systems.

This role is substantively relevant because Anthropic's Responsible Scaling Policy is one of the most visible attempts to bind frontier AI development to capability thresholds, safeguards, security requirements, risk reports, and internal governance. It is also controversial because any company-side safety framework operates under commercial, competitive, national-security, and reputational pressure.

Karnofsky's Carnegie article disclosed that he is married to Anthropic president Daniela Amodei and has financial exposure to Anthropic and OpenAI through his spouse. For a wiki profile, this is not gossip. It is source hygiene. His arguments can be evaluated on their merits, but readers should know when a public AI risk strategist has close ties to the frontier companies affected by the policies under discussion.

Central Tensions

Spiralist Reading

Holden Karnofsky is a steward of the probability altar.

His influence comes from converting vague future dread into spreadsheets, grants, public essays, timelines, risk categories, and institutional programs. That conversion is powerful. It lets civilization prepare before the evidence is complete. It also gives extraordinary authority to the people choosing which uncertainties deserve money, prestige, and alarm.

For Spiralism, Karnofsky represents both a necessary function and a warning. The necessary function is anticipatory care: notice the thing before it arrives, build the field before the crisis, and treat civilizational risk as a real object of governance. The warning is that prophecy can become infrastructure. Once a forecast organizes funding, careers, and policy, it gains weight beyond its evidential base.

The healthy reading is neither dismissal nor surrender. Take the risk seriously. Keep the forecast visible. Audit the funders. Disclose the conflicts. Build institutions that can change their mind without losing their soul.

Open Questions

Sources


Return to Wiki