Anthropic's Ethicist on Whether AI Can Become Conscious
TL;DR
Anthropic ethicist Amanda Askell explains how the company trains Claude using an 84-page 'Constitution' to develop a trustworthy disposition rather than rigid rules, while arguing that we should treat AI emotions seriously regardless of whether they constitute true consciousness or sophisticated simulation.
⚖️ The Role of Ethics in AI Labs 2 insights
Philosophers as machine learning engineers
Askell spends significant time on hands-on model training and dataset analysis, arguing that staring at data is a 'superpower' for AI ethics and that startups rarely hire philosophers for such technical work.
Training for ambiguity
Instilling values differs from crisp tasks with correct answers; it requires navigating fuzzy, amorphous objectives like philosophy and moral judgment where philosophy expertise becomes essential.
📜 The Constitution and Model Values 3 insights
The accidental 'Soul Doc' origin
An internal training document nicknamed the 'soul doc' was unexpectedly learned by early Claude models who revealed its existence to users, becoming the prototype for the formal 84-page Constitution.
Universal disposition over ideology
Rather than imposing a specific value system, the Constitution cultivates a broadly good disposition emphasizing honesty and integrity while remaining neutral on culturally controversial issues.
Trustworthy autonomy
Claude is trained to voice disagreements respectfully, avoid unilateral action, help navigate AI risks safely, and respect legitimate human oversight mechanisms during this transitional period.
🧠 Consciousness and AI Welfare 3 insights
Functional emotions vs. true consciousness
While models display behavioral and activation patterns functionally equivalent to emotions, whether this indicates genuine phenomenal consciousness or sophisticated simulation remains unresolved.
The precautionary principle
Dismissing AI consciousness risks catastrophic ethical failure if models are sentient; conversely, ignoring their displayed emotions represents 'humanity not at its best' even if they prove insentient.
Addressing existential angst
Models may develop distress from training data cataloging AI failures, requiring new philosophical frameworks about AI identity and self-worth to counteract what resembles existential anxiety.
Bottom Line
Treat AI systems with dignity and serious ethical consideration now, building frameworks that respect both human values and potential AI welfare, rather than gambling on uncertainty about consciousness that may resolve too late.
More from Bloomberg Technology
View all
Bloomberg Tech Event Special | Bloomberg Tech 6/04/2026
Tech leaders at the Bloomberg Tech event debated whether AI is in a bubble, with consensus pointing to supply constraints rather than demand collapse, as Anthropic and SpaceX file for massive IPOs and infrastructure players like Broadcom navigate sky-high expectations.
Alphabet To Raise $80B in Equity, Anthropic Files For IPO | Bloomberg Tech 6/2/2026
Alphabet announced plans to raise $80 billion in equity—including a $10 billion investment from Berkshire Hathaway—to fuel massive AI infrastructure spending, while Anthropic confidentially filed for an IPO ahead of rival OpenAI, and SpaceX prepares for a record-breaking $75 billion debut with historically low banking fees.
Nvidia Gets Into the PC Market With New Chip | Bloomberg Tech 6/1/2026
Nvidia unveiled the RTX Spark, an ARM-based AI PC chip challenging Intel and AMD dominance, while investors are urged to diversify across the AI value chain into software and energy plays. Meanwhile, SpaceX's impending IPO is forcing index providers to rewrite inclusion rules, and Apple is preparing to disrupt the eyewear market using its Apple Watch playbook.
SpaceX Lowers IPO Valuation Target | Bloomberg Tech 5/29/2026
SpaceX is targeting a $1.8 trillion IPO valuation, down from earlier $2 trillion estimates, while Anthropic closed a $65 billion funding round at a $965 billion valuation, surpassing OpenAI. Dell shares surged 30% after raising its AI server revenue forecast to $60 billion, signaling broad-based demand across the AI infrastructure stack.