AI Labs Hire Philosophers to Shape Ethics, Alignment and Tests of Moral Competence

News

5/26/2026, 11:55:22 AM

AI Labs Hire Philosophers to Shape Ethics, Alignment and Tests of Moral Competence

DeepMind and Anthropic have recruited multiple philosophers to turn long‑standing ethical and metaphysical questions into engineering practice, contributing to research, model evaluation and policy while prompting debate in academia about conflicts of interest

DeepMind and Anthropic are hiring philosophers to help shape how large AI models behave and are judged, a shift that moves abstract questions about mind, value and morality into the center of model development and deployment. That work matters because philosophical analysis is now being used to design tests, benchmarks and safeguards that will affect what models are allowed to do and how their behaviour is interpreted.

counted at least 10 philosophers at DeepMind and four at Anthropic. Philosophers at those labs contribute to both scientific and policy work: at DeepMind, Julia Haas coauthored a Nature paper proposing a framework to test whether large language models exhibit moral competence, and Amanda Askell is a prominent philosophical presence at Anthropic. Henry Ajder, a philosophy postgraduate who advises governments and startups, noted the rarity of such sustained demand for philosophers until recently.

Universities are adjusting to the new labor market: many now offer AI ethics courses or joint computer‑science‑and‑philosophy programs to train people who can straddle technical and normative questions. Edward Harcourt of Oxford’s Institute for Ethics in AI called the trend the current "flavor of the year." The movement inside labs also has roots: Iason Gabriel joined DeepMind nearly a decade ago, and the rise of large language models in the early 2020s expanded concern from algorithmic‑bias problems to richer questions about encoding values.

In practice, philosophers at these companies focus less on speculative consciousness debates and more on engineering‑relevant risks: fairness, misinformation, malicious misuse and the behaviour of errant or goal‑directed agents. With agents already able to send emails, schedule appointments and write code, teams are concentrating on value alignment and on clarifying what it would mean for technology to be "actively good." Researchers say they are investing significant philosophical time into operationalizing those concepts for engineers.

The partnerships have provoked skepticism in parts of academia. Critics warn that salaried researchers embedded in for‑profit labs may face conflicts of interest and that ethical expertise risks being co‑opted to improve public perception rather than to constrain harmful development. Harcourt warned collaborations can have a "self‑aggrandizing" element if research is used to portray companies as uniquely powerful actors solving unprecedented problems.

For builders and product teams, the immediate takeaway is a growing expectation to collaborate with ethicists on problem framing, measurement and communication. DeepMind researchers are explicitly grappling with how to measure what matters and how to distinguish genuine moral competence from its imitation; those debates will help shape future model evaluation criteria, deployment safeguards and the benchmarks engineers must integrate into development and review cycles.

Sources

WIRED AI · 5/26/2026

Replies (0)

No replies in this topic yet.

Back