Safety, Alignment & Ethics

Dario Amodei, The Adolescence of Technology: Confronting and Overcoming the Risks of Powerful AI

5 May 2026

Anthropic CEO Dario Amodei envisions a 'country of geniuses in a datacenter' with these five enabling properties:1) smarter than top humans across most domains, 2) able to act autonomously over long horizons, 3) use digital tools, 4) coordinate many copies, and 5) operate at much higher speed than humans. You could use these as a checlist to develop such a system, as Anthropic probably does. #5 is true in some domains (real-time learning being a counter-example), #3 is well in progress, and #1, #2, and #4 are still significant obstacles. He does not mention, e.g., robust generalization, abstraction, and world models. The essay discusses risks, governance, and economic implications. The essay’s overall thesis is: AI’s upside remains enormous, but humanity must treat the next few years as a civilizational test requiring technical alignment work, pragmatic regulation, geopolitical realism, economic adaptation, and moral seriousness.

Steve Omohundro: Regulating AGI: From Liability to Provable Contracts

18 November 2025

AGI will render today's liability-based AI regulation obsolete through its ability to circumvent cybersecurity, hide its origins, and act strategically—but it will also enable a new regulatory paradigm based on mathematically provable contracts.

Joe Rogan Experience #2345 - Roman Yampolskiy

24 September 2025

SuperIntelligence co-founding editor Roman Yampolskiy interviewed at length on Joe Rogan. Over 800,000 views.

Steve Omohundro Receives 2024 Future of Life Award

24 September 2025

SuperIntelligence co-founding editor Steve Omohundro was one of three recipients of the prestigious FLI Award 2024 award recognizing seminal contributions to AI safety: "...for laying the foundation of modern ethics and safety considerations for artificial intelligence and computers."

Steve Omohundro and Scientists Discuss the AI Alignment Problem with Neil deGrasse Tyson

24 September 2025

Hosted by Neil deGrasse Tyson, our co-founding editor Steve Omohundro discusses the AI alignment problem starting at ~23:29.

All Items

Simulating Influence Dynamics with LLM Agents

The Road to Artificial SuperIntelligence: A Comprehensive Survey of Superalignment

Multiple unnatural attributes of AI undermine common anthropomorphically biased takeover speculations Eight Fundamental Differences between Biologically Evolved Humans and Digital AI

Can a Bayesian Oracle Prevent Harm from an Agent?

Anthropic: Responsible Scaling Policy

Acceptable Use Policies for Foundation Models

AI Risk Categorization Decoded (AIR 2024) From Government Regulations to Corporate Policies

Against Purposeful Artificial Intelligence Failures

Benchmark Early and Red Team Often A Framework for Assessing and Managing Dual-Use Hazards of Ai Foundation Models

Unhobbling Is All You Need? On Aschenbrenner’s Situational Awareness

Current Issue

Announcements

Dario Amodei, The Adolescence of Technology: Confronting and Overcoming the Risks of Powerful AI

Steve Omohundro: Regulating AGI: From Liability to Provable Contracts

Joe Rogan Experience #2345 - Roman Yampolskiy

Steve Omohundro Receives 2024 Future of Life Award

Steve Omohundro and Scientists Discuss the AI Alignment Problem with Neil deGrasse Tyson

Information