Vol. 2 No. 4 (2025): AGI Benchmarks-Safety-Limitations

					View Vol. 2 No. 4 (2025): AGI Benchmarks-Safety-Limitations

In this issue of SuperIntelligence we feature articles on AGI safety, benchmarks measuring progress toward AGI, and limitations of AI models. 

Published: 2025-07-26

Articles

  • International AI Safety Report 2025: Second Key Update: Technical Safeguards and Risk Management

    Yoshua Bengio, Stephen Clare, Carina Prunkl, Maksym Andriushchenko, Ben Bucknall, Philip Fox, Nestor Maslej, Conor McGlynn, Malcolm Murray, Shalaleh Rismani, Stephen Casper, Jessica Newman, Daniel Privitera, Sören Mindermann, Daron Acemoglu, Thomas G. Dietterich, Fredrik Heintz, Geoffrey Hinton, Nick Jennings, Susan Leavy, Teresa Ludermir, Vidushi Marda, Helen Margetts, John McDermid, Jane Munga, Arvind Narayanan, Alondra Nelson, Clara Neppel, Sarvapali D. (Gopal) Ramchurn, Stuart Russell, Marietje Schaake, Bernhard Schölkopf, Alvaro Soto, Lee Tiedrich, Gaël Varoquaux, Andrew Yao, Ya-Qin Zhang
    DOI: https://doi.org/10.70777/si.v2i4.16671
  • GDPVAL: Evaluating AI Model Performance on Real-World Economically Valuable Tasks

    Tejal Patwardhan, Rachel Dias, Elizabeth Proehl, Grace Kim, Michele Wang, Olivia Watkins, Sim´on Posada Fishman, Marwan Aljubeh, Phoebe Thacker, Laurance Fauconnet, Natalie S. Kim, Patrick Chao, Samuel Miserendino, Gildas Chabot, David Li, Michael Sharman, Alexandra Barr, Amelia Glaese, Jerry Tworek
    DOI: https://doi.org/10.70777/si.v2i4.17197
  • The AI Productivity Index (APEX)

    Bertie Vidgen, Abby Fennelly, Evan Pinnix, Chirag Mahapatra, Zach Richards, Austin Bridges, Calix Huang, Ben Hunsberger, Fez Zafar, Brendan Foody, Dominic Barton, Cass R. Sunstein, Eric Topol, Osvald Nitski
    DOI: https://doi.org/10.70777/si.v2i4.17205
  • The Iceberg Index: Measuring Workforce Exposure in the AI Economy

    Ayush Chopra, Santanu Bhattacharya, DeAndrea Salvador, Ayan Paul, Teddy Wright, Aditi Garg, Feroz Ahmad, Alice C. Schwarze, Ramesh Raskar, Prasanna Balaprakash
    DOI: https://doi.org/10.70777/si.v2i4.17207
  • The 2025 Foundation Model Transparency Index

    Alexander Wan, Kevin Klyman, Sayash Kapoor, Nestor Maslej, Shayne Longpre, Betty Xiong, Percy Liang, Rishi Bommasani
    DOI: https://doi.org/10.70777/si.v2i4.17165
  • Why AI Alignment Failure Is Structural: Learned Human Interaction Structures and AGI as an Endogenous Evolutionary Shock

    Didier Sornette, Sandro Claudio Lera, Ke Wu
    DOI: https://doi.org/10.70777/si.v2i4.17163
  • On the Limits of Self-Improving in LLMs and Why AGI, ASI and the Singularity Are Not Near Without Symbolic Model Synthesis

    Hector Zenil
    DOI: https://doi.org/10.70777/si.v2i4.17159
  • Unconditional Basic Meaning as Digital Public Good

    Soenke Ziesche, Roman V. Yampolskiy
    DOI: https://doi.org/10.70777/si.v2i4.16427
  • Critical Review: Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

    Kris Carlson
    DOI: https://doi.org/10.70777/si.v2i4.15315
  • What AI evaluations for preventing catastrophic risk can and cannot do

    Peter Barnett, Lisa Thiergart
    DOI: https://doi.org/10.70777/si.v2i4.17167

Reviews

  • "Unconditional Basic Meaning as Digital Public Good"

    Jeffrey Arle
    DOI: https://doi.org/10.70777/si.v2i4.17209