Risks

49 Items

Detail of the risks of AGI, e.g. pathways to malicious AI not aligned iwth universal human values.

All Items

  • Review: AI Governance through Markets Philip Moreira Tomei, Rupal Jain, Matija Franklin

    Kris Carlson
    DOI: https://doi.org/10.70777/si.v2i2.14601
  • Review: Large language Model-Powered AI Systems Achieve Self-Replication with No Human Intervention Xudong Pan (潘旭东), Jiarun Dai† (戴嘉润), Yihe Fan (范一禾), Minyuan Luo (罗铭源), Changyi Li (李长艺), Min Yang∗ (杨珉)

    Kris Carlson
    DOI: https://doi.org/10.70777/si.v2i2.14607
  • Review: Large Language Models Pass the Turing Test Cameron R. Jones and Benjamin K. Bergen

    Kris Carlson
    DOI: https://doi.org/10.70777/si.v2i2.14697
  • Review: On Regulating Downstream AI Developers Sophie Williams, Jonas Schuett, Markus Anderljung

    Kris Carlson
    DOI: https://doi.org/10.70777/si.v2i2.14587
  • Review: Safety at Scale: Comprehensive Survey of Large Model Safety Xingjun Ma, Yifeng Gao, Yixu Wang, Ruofan Wang, Xin Wang, Ye Sun, Yifan Ding, ... Yu-Gang Jiang

    Kris Carlson
    DOI: https://doi.org/10.70777/si.v2i2.14741
  • Review: Strategic Patience: Long-Horizon AI Dominance and the Erosion of Human Vigilance Roman Yampolskiy

    Kris Carlson
    DOI: https://doi.org/10.70777/si.v2i2.14603
  • Simulating Influence Dynamics with LLM Agents

    Mehwish Nasim, Syed Muslim Gilani, Amin Qasmi, Usman Naseem
    DOI: https://doi.org/10.70777/si.v2i1.13971
  • Standardizing Intelligence: Aligning Generative AI for Regulatory and Operational Compliance

    Joseph Marvin Imperial, Matthew D. Jones, Harish Tayyar Madabushi
    DOI: https://doi.org/10.70777/si.v2i5.16189
  • Strategic Patience: Long-Horizon AI Dominance and the Erosion of Human Vigilance

    Roman Yampolskiy
    DOI: https://doi.org/10.70777/si.v2i2.14435
  • The 2025 Foundation Model Transparency Index

    Alexander Wan, Kevin Klyman, Sayash Kapoor, Nestor Maslej, Shayne Longpre, Betty Xiong, Percy Liang, Rishi Bommasani
    DOI: https://doi.org/10.70777/si.v2i4.17165
  • The AI Productivity Index (APEX)

    Bertie Vidgen, Abby Fennelly, Evan Pinnix, Chirag Mahapatra, Zach Richards, Austin Bridges, Calix Huang, Ben Hunsberger, Fez Zafar, Brendan Foody, Dominic Barton, Cass R. Sunstein, Eric Topol, Osvald Nitski
    DOI: https://doi.org/10.70777/si.v2i4.17205
  • The First International AI Safety Report The International Scientific Report on the Safety of Advanced AI

    Yoshua Bengio
    DOI: https://doi.org/10.70777/si.v2i2.14755
  • The Iceberg Index: Measuring Workforce Exposure in the AI Economy

    Ayush Chopra, Santanu Bhattacharya, DeAndrea Salvador, Ayan Paul, Teddy Wright, Aditi Garg, Feroz Ahmad, Alice C. Schwarze, Ramesh Raskar, Prasanna Balaprakash
    DOI: https://doi.org/10.70777/si.v2i4.17207
  • The Perilous State of AI Governance, June 2025

    Kris Carlson
    DOI: https://doi.org/10.70777/si.v2i2.14801
  • The Singapore Consensus on Global AI Safety Research Priorities Building a Trustworthy, Reliable and Secure AI Ecosystem

    Yoshua Bengio, Max Tegmark, Stuart Russell, Dawn Song, Sören Mindermann, Lan Xue, Stephen Casper, Luke Ong, Vanessa Wilfred, Tegan Maharaj, Wan Sie Lee, Ya-Qin Zhang
    DOI: https://doi.org/10.70777/si.v2i5.15503
  • Timeline to Artificial General Intelligence 2025 – 2030+

    Gil Syswerda
    DOI: https://doi.org/10.70777/si.v2i3.15119
  • Timeline to Artificial General Intelligence 2025 – 2030+ A prediction of how AI will progress, year by year. Updated Oct 30, 2025.

    Gil Syswerda
    DOI: https://doi.org/10.70777/si.v2i6.16375
  • Towards Safety Reasoning in LLMs: AI-agentic Deliberation for Policy-embedded CoT Data Creation

    Tharindu Kumarage, Ninareh Mehrabi, Anil Ramakrishna, Xinyan Zhao, Richard Zemel, Kai-Wei Chang, Aram Galstyan, Rahul Gupta, Charith Peris
    DOI: https://doi.org/10.70777/si.v2i3.15249
  • Trends in Frontier AI Model Count: A Forecast to 2028

    Iyngkarran Kumar, Sam Manning
    DOI: https://doi.org/10.70777/si.v2i3.15155
  • Unconditional Basic Meaning as Digital Public Good

    Soenke Ziesche, Roman V. Yampolskiy
    DOI: https://doi.org/10.70777/si.v2i4.16427
  • What AI evaluations for preventing catastrophic risk can and cannot do

    Peter Barnett, Lisa Thiergart
    DOI: https://doi.org/10.70777/si.v2i4.17167
  • Why AI Alignment Failure Is Structural: Learned Human Interaction Structures and AGI as an Endogenous Evolutionary Shock

    Didier Sornette, Sandro Claudio Lera, Ke Wu
    DOI: https://doi.org/10.70777/si.v2i4.17163
  • Why Today’s Humanoids Won’t Learn Dexterity

    Rodney Brooks
    DOI: https://doi.org/10.70777/si.v3i3.17351
  • Why We Might Need Advanced AI to Save Us from Doomers, Rather than the Other Way Around A Review of If Anyone Builds It, Everyone Dies: Why Superhuman AI Would Kill Us All by Eliezer Yudkowsky and Nate Soares

    Preston Estep
    DOI: https://doi.org/10.70777/si.v2i6.16251
26-50 of 49