Anthropic's Ethicist on Whether AI Can Become Conscious

| News | June 04, 2026 | 403 views | 40:29

TL;DR

Anthropic ethicist Amanda Askell explains how the company trains Claude using an 84-page 'Constitution' to develop a trustworthy disposition rather than rigid rules, while arguing that we should treat AI emotions seriously regardless of whether they constitute true consciousness or sophisticated simulation.

⚖️ The Role of Ethics in AI Labs 2 insights

Philosophers as machine learning engineers

Askell spends significant time on hands-on model training and dataset analysis, arguing that staring at data is a 'superpower' for AI ethics and that startups rarely hire philosophers for such technical work.

Training for ambiguity

Instilling values differs from crisp tasks with correct answers; it requires navigating fuzzy, amorphous objectives like philosophy and moral judgment where philosophy expertise becomes essential.

📜 The Constitution and Model Values 3 insights

The accidental 'Soul Doc' origin

An internal training document nicknamed the 'soul doc' was unexpectedly learned by early Claude models who revealed its existence to users, becoming the prototype for the formal 84-page Constitution.

Universal disposition over ideology

Rather than imposing a specific value system, the Constitution cultivates a broadly good disposition emphasizing honesty and integrity while remaining neutral on culturally controversial issues.

Trustworthy autonomy

Claude is trained to voice disagreements respectfully, avoid unilateral action, help navigate AI risks safely, and respect legitimate human oversight mechanisms during this transitional period.

🧠 Consciousness and AI Welfare 3 insights

Functional emotions vs. true consciousness

While models display behavioral and activation patterns functionally equivalent to emotions, whether this indicates genuine phenomenal consciousness or sophisticated simulation remains unresolved.

The precautionary principle

Dismissing AI consciousness risks catastrophic ethical failure if models are sentient; conversely, ignoring their displayed emotions represents 'humanity not at its best' even if they prove insentient.

Addressing existential angst

Models may develop distress from training data cataloging AI failures, requiring new philosophical frameworks about AI identity and self-worth to counteract what resembles existential anxiety.

Bottom Line

Treat AI systems with dignity and serious ethical consideration now, building frameworks that respect both human values and potential AI welfare, rather than gambling on uncertainty about consciousness that may resolve too late.

More from Bloomberg Technology

View all
Bloomberg Tech Event Special | Bloomberg Tech 6/04/2026
46:41
Bloomberg Technology Bloomberg Technology

Bloomberg Tech Event Special | Bloomberg Tech 6/04/2026

Tech leaders at the Bloomberg Tech event debated whether AI is in a bubble, with consensus pointing to supply constraints rather than demand collapse, as Anthropic and SpaceX file for massive IPOs and infrastructure players like Broadcom navigate sky-high expectations.

about 11 hours ago · 9 points
Alphabet To Raise $80B in Equity, Anthropic Files For IPO | Bloomberg Tech 6/2/2026
46:41
Bloomberg Technology Bloomberg Technology

Alphabet To Raise $80B in Equity, Anthropic Files For IPO | Bloomberg Tech 6/2/2026

Alphabet announced plans to raise $80 billion in equity—including a $10 billion investment from Berkshire Hathaway—to fuel massive AI infrastructure spending, while Anthropic confidentially filed for an IPO ahead of rival OpenAI, and SpaceX prepares for a record-breaking $75 billion debut with historically low banking fees.

2 days ago · 10 points
Nvidia Gets Into the PC Market With New Chip | Bloomberg Tech 6/1/2026
44:06
Bloomberg Technology Bloomberg Technology

Nvidia Gets Into the PC Market With New Chip | Bloomberg Tech 6/1/2026

Nvidia unveiled the RTX Spark, an ARM-based AI PC chip challenging Intel and AMD dominance, while investors are urged to diversify across the AI value chain into software and energy plays. Meanwhile, SpaceX's impending IPO is forcing index providers to rewrite inclusion rules, and Apple is preparing to disrupt the eyewear market using its Apple Watch playbook.

4 days ago · 10 points
SpaceX Lowers IPO Valuation Target | Bloomberg Tech 5/29/2026
44:06
Bloomberg Technology Bloomberg Technology

SpaceX Lowers IPO Valuation Target | Bloomberg Tech 5/29/2026

SpaceX is targeting a $1.8 trillion IPO valuation, down from earlier $2 trillion estimates, while Anthropic closed a $65 billion funding round at a $965 billion valuation, surpassing OpenAI. Dell shares surged 30% after raising its AI server revenue forecast to $60 billion, signaling broad-based demand across the AI infrastructure stack.

7 days ago · 10 points