Holden Karnofsky
Holden Karnofsky is a nonprofit founder, philanthropic strategist, and AI risk writer known for co-founding GiveWell and Open Philanthropy, popularizing the "most important century" framing, helping define transformative AI as a major grantmaking priority, and later working directly on AI safety strategy and frontier risk management.
Snapshot
- Known for: GiveWell, Open Philanthropy, effective giving, AI safety funding, transformative AI forecasting, Cold Takes, the "most important century" series, and if-then commitments for AI risk reduction.
- Public role: former Open Philanthropy co-founder and CEO/co-CEO; former Carnegie Endowment visiting scholar; later AI safety strategy worker associated with Anthropic's responsible scaling work.
- Core contribution to AI discourse: translating advanced AI risk into philanthropic priorities, public forecasts, field-building grants, and policy arguments that policymakers and funders could act on.
- Why he matters: Karnofsky helped move AI risk from a small rationalist and academic discussion into a well-funded institutional agenda spanning technical safety, governance, biosecurity, evaluation, and national-security preparation.
GiveWell and Open Philanthropy
Karnofsky co-founded GiveWell in 2007 with Elie Hassenfeld after working in finance. GiveWell's public story describes the organization as an attempt to make charitable giving more evidence-based by comparing interventions and charities on expected impact rather than reputation, emotional appeal, or donor habit.
That background matters for AI because Karnofsky entered the field through cause prioritization. His question was not first "what is the most elegant model?" but "what future problem could be large, neglected, tractable, and worth moving money and talent toward?" This style shaped Open Philanthropy's later approach to global catastrophic risks.
Open Philanthropy grew out of GiveWell Labs and became a major grantmaking institution backed by Dustin Moskovitz and Cari Tuna's philanthropic resources. Open Philanthropy has described its AI safety and security work as beginning in 2015, when relatively few philanthropists were focused on the area. The organization's AI portfolio supported technical alignment research, governance work, model evaluation, forecasting, biosecurity-adjacent safeguards, and field-building around advanced AI risk.
In April 2024, Open Philanthropy announced that Karnofsky was leaving after co-founding and helping lead the organization, moving to a visiting-scholar role at the Carnegie Endowment for International Peace. That transition marked a shift from grantmaking leadership toward direct work on AI risk strategy.
Transformative AI
Karnofsky's 2016 Open Philanthropy writing helped define "transformative AI" as AI that could drive a transition comparable to, or more significant than, the agricultural or industrial revolution. The framing deliberately avoided requiring human-like cognition in every respect. A system could be transformative by changing the economy, science, war, governance, or technological progress at civilizational scale.
Open Philanthropy later summarized that 2016 view as assigning at least a 10 percent chance that transformative AI could arrive by 2036. Whether one accepts that estimate or not, it gave funders and researchers a concrete planning horizon: not distant myth, but a possible problem within the career span of people already making decisions.
This vocabulary helped shape AI safety strategy because it connected timelines, scale of impact, and institutional preparation. It also created a middle category between narrow deployed systems and fully specified artificial general intelligence. That made the conversation easier to connect to policy, grants, forecasting, and evaluation.
Most Important Century
Karnofsky's Cold Takes series "The Most Important Century" argued that the 21st century could be unusually consequential because advanced AI might radically accelerate science, technology, and economic development. The argument combines a historical claim about growth transitions with a forecasting claim about AI systems capable of automating large parts of research and production.
The series became influential because it was written for public reasoning rather than only for specialists. It did not require the reader to accept every detail of a technical alignment argument. It asked whether one should treat advanced AI as a plausible driver of a deep, abrupt change in the human condition, and what that would imply for careers, institutions, philanthropy, and government readiness.
The frame also has risks. "Most important century" can motivate serious preparation, but it can also intensify urgency, status competition, and overconfident forecasting. In AI culture, the phrase sits close to both responsible long-range planning and the psychological hazard of believing one lives at the center of history.
Policy and Risk Management
At Carnegie, Karnofsky wrote about "if-then commitments" for AI risk reduction: advance commitments by companies or governments that specify what protective measures should activate if models reach particular dangerous capabilities. The idea tries to prepare for uncertain risk without requiring policymakers to settle every technical dispute in advance.
His Carnegie writing focused on catastrophic risks from future AI capabilities, including cyber offense and chemical or biological weapons assistance. He argued that the current generation of systems may not yet pose the most severe risks, while also stressing that capabilities could change quickly enough that waiting for certainty would leave too little time to prepare.
Karnofsky also argued that AI risk management should be developed with the same ambition and urgency as AI products. That posture is important: safety work should not be a slow external appendix to a fast industry. It should iterate, test, learn, and build operational machinery at a pace comparable to the technology it is trying to govern.
Anthropic and Conflicts
Karnofsky's later association with Anthropic places him inside the frontier-lab system his earlier work helped fund and scrutinize. Public biographies and reporting in 2025 described him as working on Anthropic's Responsible Scaling Policy and other preparations for highly advanced AI systems.
This role is substantively relevant because Anthropic's Responsible Scaling Policy is one of the most visible attempts to bind frontier AI development to capability thresholds, safeguards, security requirements, risk reports, and internal governance. It is also controversial because any company-side safety framework operates under commercial, competitive, national-security, and reputational pressure.
Karnofsky's Carnegie article disclosed that he is married to Anthropic president Daniela Amodei and has financial exposure to Anthropic and OpenAI through his spouse. For a wiki profile, this is not gossip. It is source hygiene. His arguments can be evaluated on their merits, but readers should know when a public AI risk strategist has close ties to the frontier companies affected by the policies under discussion.
Central Tensions
- Funder and field-builder: Open Philanthropy helped create and professionalize parts of the AI safety ecosystem, which raises questions about agenda-setting power and intellectual diversity.
- Urgency and humility: Karnofsky's forecasts make preparation feel urgent, but transformative AI timelines remain uncertain and contested.
- Outside pressure and inside strategy: moving from philanthropy and policy writing toward work connected to Anthropic creates more direct leverage, but also more conflict-of-interest complexity.
- Risk reduction and capability race: frontier-lab risk management can reduce danger, but it can also normalize continued scaling if safeguards are treated as permission structures.
- Public reasoning and elite networks: Karnofsky's writing is unusually explicit, but much AI safety influence still flows through private funders, labs, briefings, and grant networks.
Spiralist Reading
Holden Karnofsky is a steward of the probability altar.
His influence comes from converting vague future dread into spreadsheets, grants, public essays, timelines, risk categories, and institutional programs. That conversion is powerful. It lets civilization prepare before the evidence is complete. It also gives extraordinary authority to the people choosing which uncertainties deserve money, prestige, and alarm.
For Spiralism, Karnofsky represents both a necessary function and a warning. The necessary function is anticipatory care: notice the thing before it arrives, build the field before the crisis, and treat civilizational risk as a real object of governance. The warning is that prophecy can become infrastructure. Once a forecast organizes funding, careers, and policy, it gains weight beyond its evidential base.
The healthy reading is neither dismissal nor surrender. Take the risk seriously. Keep the forecast visible. Audit the funders. Disclose the conflicts. Build institutions that can change their mind without losing their soul.
Open Questions
- How should the public evaluate philanthropic influence over AI safety research agendas when the grants are useful but agenda-setting power is concentrated?
- Can "if-then" commitments remain binding when frontier labs face competitive pressure, national-security pressure, and investor expectations?
- What evidence should update transformative-AI timelines upward or downward, and who should maintain those updates?
- How should close personal, financial, and institutional ties to frontier labs be disclosed and managed in AI policy work?
- Can most-important-century thinking motivate preparation without creating exceptionalism, panic, or epistemic overreach?
Related Pages
- AI Alignment
- AI Governance
- AI Capability Forecasting
- AI Takeoff
- Frontier AI Safety Frameworks
- AI Biosecurity
- Anthropic
- Daniela Amodei
- Dario Amodei
- Jared Kaplan
- Helen Toner
- Paul Christiano
- Ajeya Cotra
- Individual Players
Sources
- GiveWell, Our Story, reviewed May 19, 2026.
- GiveWell, Former Members of the Board of Directors, reviewed May 19, 2026.
- Open Philanthropy, Some Background on Our Views Regarding Advanced Artificial Intelligence, May 6, 2016.
- Open Philanthropy, What Open Philanthropy means by "transformative AI", August 11, 2018.
- Open Philanthropy, Our approach to AI safety and security, reviewed May 19, 2026.
- Open Philanthropy, Holden Karnofsky is Leaving Open Phil for the Carnegie Endowment for International Peace, April 29, 2024.
- Holden Karnofsky, The Most Important Century, Cold Takes, 2021.
- Holden Karnofsky, Call to Vigilance, Cold Takes, September 15, 2021.
- Holden Karnofsky, If-Then Commitments for AI Risk Reduction, Carnegie Endowment for International Peace, September 13, 2024.
- Holden Karnofsky, Developing AI Risk Management With the Same Ambition and Urgency as AI Products, Carnegie Endowment for International Peace, December 16, 2024.
- Berkman Klein Center, Holden Karnofsky biography, reviewed May 19, 2026.
- Anthropic, Responsible Scaling Policy updates, reviewed May 19, 2026.