Governance

45 Items

AGI Governance

All Items

  • A Framework for the Private Governance of Frontier Artificial Intelligence

    Dean Ball
    DOI: https://doi.org/10.70777/si.v2i2.14519
  • Acceptable Use Policies for Foundation Models

    Kevin Klyman
    20
    DOI: https://doi.org/10.70777/si.v1i1.10917
  • AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges

    Ranjan Sapkota, Konstantinos I. Roumeliotis, Manoj Karkee
    DOI: https://doi.org/10.70777/si.v2i3.15161
  • AI Risk Categorization Decoded (AIR 2024) From Government Regulations to Corporate Policies

    Yi Zeng, Kevin Klyman, Andy Zhou, Yu Yang, Minzhou Pan, Ruoxi Jia, Dawn Song, Percy Liang, Bo Li
    DOI: https://doi.org/10.70777/si.v1i1.10603
  • Aligning Artificial Superintelligence via a Multi-Box Protocol

    Avraham Yair Negozio
    DOI: https://doi.org/10.70777/si.v2i5.15579
  • America's AI Action Plan Winning the Race

    Office of Science and Technology Policy (OSTP)
    DOI: https://doi.org/10.70777/si.v2i5.15507
  • Anthropic: Responsible Scaling Policy

    Evan Hubinger
    DOI: https://doi.org/10.70777/si.v2i1.13657
  • Comparing Apples to Oranges: A Taxonomy for Navigating the Global Landscape of AI Regulation

    Sacha Alanoca, Shira Gur-Arieh, Tom Zick, Kevin Klyman
    DOI: https://doi.org/10.70777/si.v2i3.15137
  • Deliberative Alignment: Reasoning Enables Safer Language Models

    Melody Y. Guan, Manas Joglekar, Eric Wallace, Saachi Jain, Boaz Barak, Alec Helyar, Rachel, Andrea Vallone, Hongyu Ren, Jason Wei, Hyung Won Chung, Sam Toyer, Johannes Heidecke, Alex, Amelia Glaese
    DOI: https://doi.org/10.70777/si.v2i3.15159
  • Enabling Frontier Lab Collaboration to Mitigate AI Safety Risks

    Nicholas Felstead
    DOI: https://doi.org/10.70777/si.v2i6.16439
  • Evidence Integrity Before Capability: A Prerequisite for Safe Artificial Intelligence

    Jennifer Flygare Kinne
    DOI: https://doi.org/10.70777/si.v2i6.16393
  • GDPVAL: Evaluating AI Model Performance on Real-World Economically Valuable Tasks

    Tejal Patwardhan, Rachel Dias, Elizabeth Proehl, Grace Kim, Michele Wang, Olivia Watkins, Sim´on Posada Fishman, Marwan Aljubeh, Phoebe Thacker, Laurance Fauconnet, Natalie S. Kim, Patrick Chao, Samuel Miserendino, Gildas Chabot, David Li, Michael Sharman, Alexandra Barr, Amelia Glaese, Jerry Tworek
    DOI: https://doi.org/10.70777/si.v2i4.17197
  • Hardware-Enabled Mechanisms for Verifying Responsible AI Development

    Aidan O’Gara, Gabriel, Will Hodgkins, James Petrie, Vincent Immler, Aydin Aysu, Kanad Basu, Shivam Bhasin, Stjepan Picek, Ankur Srivastava
    DOI: https://doi.org/10.70777/si.v2i3.15157
  • Highlights of the Issue: Singapore Consensus – Safety Technology In Progress

    Kris Carlson
    DOI: https://doi.org/10.70777/si.v2i5.15525
  • HYDRA: A Hybrid Heuristic-Guided Deep Representation Architecture for Predicting Latent Zero-Day Vulnerabilities in Patched Functions

    Mohammad Farhad, Sabbir Rahman, Shuvalaxmi Dass
    DOI: https://doi.org/10.70777/si.v3i2.18033
  • International AI Safety Report 2025: Second Key Update: Technical Safeguards and Risk Management

    Yoshua Bengio, Stephen Clare, Carina Prunkl, Maksym Andriushchenko, Ben Bucknall, Philip Fox, Nestor Maslej, Conor McGlynn, Malcolm Murray, Shalaleh Rismani, Stephen Casper, Jessica Newman, Daniel Privitera, Sören Mindermann, Daron Acemoglu, Thomas G. Dietterich, Fredrik Heintz, Geoffrey Hinton, Nick Jennings, Susan Leavy, Teresa Ludermir, Vidushi Marda, Helen Margetts, John McDermid, Jane Munga, Arvind Narayanan, Alondra Nelson, Clara Neppel, Sarvapali D. (Gopal) Ramchurn, Stuart Russell, Marietje Schaake, Bernhard Schölkopf, Alvaro Soto, Lee Tiedrich, Gaël Varoquaux, Andrew Yao, Ya-Qin Zhang
    DOI: https://doi.org/10.70777/si.v2i4.16671
  • International Al Safety Report: First Key Update Capabilities and Risk Implications

    Yoshua Bengio, Benjamin Bucknall, Stephen Clare, Carina Prunkl, Maksym Andriushchenko, Philip Fox, Tiancheng Hu, Cameron Jones, Sam Manning, Nestor Maslej, Vasilios Mavroudis, Conor McGlynn, Malcolm Murray, Shalaleh Rismani, Charlotte Stix, Lucia Velasco, Nicole Wheeler, Daniel Privitera, Sören Mindermann, Daron Acemoglu, Thomas G. Dietterich, Fredrik Heintz, Geoffrey Hinton, Nick Jennings, Susan Leavy, Teresa Ludermir, Vidushi Marda, Helen Margetts, John McDermid, Jane Munga, Arvind Narayanan, Alondra Nelson, Clara Neppel, Sarvapali D. (Gopal) Ramchurn, Stuart Russell, Marietje Schaake, Bernhard Schölkopf, Alvaro Soto, Lee Tiedrich, Gaël Varoquaux, Andrew Yao, Ya-Qin Zhan
    DOI: https://doi.org/10.70777/si.v2i6.16253
  • Measuring AI Agent Autonomy: Towards a Scalable Approach with Code Inspection

    Peter Cihon, Merlin Stein, Gagan Bansal, Sam Manning, Kevin Xu
    DOI: https://doi.org/10.70777/si.v2i3.15295
  • On DeepSeek and Export Controls

    Dario Amodei
    DOI: https://doi.org/10.70777/si.v2i1.10695
  • Outline: Proposed Zero Draft for a Standard on AI Testing, Evaluation, Verification, and Validation

    NIST
    DOI: https://doi.org/10.70777/si.v2i5.15513
  • Pitfalls of Evidence-Based AI Policy

    Stephen Casper, David Krueger, Dylan Hadfield-Menell
    DOI: https://doi.org/10.70777/si.v2i2.14611
  • Precedents for the Unprecedented: Historical Analogies for Thirteen Artificial Superintelligence Risks

    James D. Miller
    DOI: https://doi.org/10.70777/si.v2i6.16999
  • Responsible Agentic Reasoning and AI Agents: A Critical Survey Proposal for Safe Agentic AI via Responsible Reasoning AI Agents (R2A2)

    Shaina Raza, Ranjan Sapkota, Manoj Karkee, Christos Emmanouilidis
    DOI: https://doi.org/10.70777/si.v2i6.16169
  • Review: Addressing the challenges of harmonizing law and artificial intelligence technology in modern society Lamprini Seremeti, Sofia Anastasiadou, Andreas Masouras, Stylianos Papalexandris

    Kris Carlson
    DOI: https://doi.org/10.70777/si.v2i2.14807
  • Review: AI Governance through Markets Philip Moreira Tomei, Rupal Jain, Matija Franklin

    Kris Carlson
    DOI: https://doi.org/10.70777/si.v2i2.14601
1-25 of 45