Effective Mitigations for Systemic Risks from General-Purpose AI

Risto Uuk; Annemieke Brouwer; Tim Schreier; Noemi Dreksler; Valeria Pulignano; Rishi Bommasani

doi:10.70777/si.v2i1.13975

Authors

Risto Uuk Future of Life Institute
Annemieke Brouwer Future of Life Institute; KU Leuven
Tim Schreier Policy Researcher & Technical Standards Advisor, Future of Life Institute
Noemi Dreksler Centre for the Governance of AI
Valeria Pulignano Leading Investigator, ERLM
Rishi Bommasani Stanford University

DOI:

https://doi.org/10.70777/si.v2i1.13975

Keywords:

agi risk mitigation, agi risks, artificial general intelligence risks

Abstract

The systemic risks posed by general-purpose AI models are a growing concern, yet the effectiveness of mitigations remains underexplored. Previous research has proposed frameworks for risk mitigation, but has left gaps in our understanding of the perceived effectiveness of measures for mitigating systemic risks. Our study addresses this gap by evaluating how experts perceive different mitigations that aim to reduce the systemic risks of general-purpose AI models. We surveyed 76 experts whose expertise spans AI safety; critical infrastructure; democratic processes; chemical, biological, radiological, and nuclear risks (CBRN); and discrimination and bias. Among 27 mitigations identified through a literature review, we find that a broad range of risk mitigation measures are perceived as effective in reducing various systemic risks and technically feasible by domain experts. In particular, three mitigation measures stand out: safety incident reports and security information sharing, third-party pre-deployment model audits, and pre-deployment risk assessments. These measures show both the highest expert agreement ratings (>60%) across all four risk areas and are most frequently selected in experts’ preferred combinations of measures (>40%). The surveyed experts highlighted that external scrutiny, proactive evaluation and transparency are key principles for effective mitigation of systemic risks. We provide policy recommendations for implementing the most promising measures, incorporating the qualitative contributions from

The systemic risks posed by general-purpose AI models are a growing concern, yet the effectiveness of mitigations remains underexplored. Previous research has proposed frameworks for risk mitigation, but has left gaps in our understanding of the perceived effectiveness of measures for mitigating systemic risks. Our study addresses this gap by evaluating how experts perceive different mitigations that aim to reduce the systemic risks of general-purpose AI models. We surveyed 76 experts whose expertise spans AI safety; critical infrastructure; democratic processes; chemical, biological, radiological, and nuclear risks (CBRN); and discrimination and bias. Among 27 mitigations identified through a literature review, we find that a broad range of risk mitigation measures are perceived as effective in reducing various systemic risks and technically feasible by domain experts. In particular, three mitigation measures stand out: safety incident reports and security information sharing, third-party pre-deployment model audits, and pre-deployment risk assessments. These measures show both the highest expert agreement ratings (>60%) across all four risk areas and are most frequently selected in experts’ preferred combinations of measures (>40%). The surveyed experts highlighted that external scrutiny, proactive evaluation and transparency are key principles for effective mitigation of systemic risks. We provide policy recommendations for implementing the most promising measures, incorporating the qualitative contributions from experts. These insights should inform regulatory frameworks and industry practices for mitigating the systemic risks associated with general-purpose AI.

Author Biographies

Risto Uuk, Future of Life Institute

I work as the Head of EU Policy and Research at the Future of Life Institute in Brussels, leading policy and research efforts to maximize the societal benefits of increasingly powerful AI systems. I am currently a Visiting Researcher at Stanford Digital Economy Lab doing research related to the economic and societal impact of advanced AI. I am also a PhD Researcher at KU Leuven, where I research the assessment and mitigation of systemic risks posed by general-purpose AI. I run the biweekly EU AI Act Newsletter with over 40,000 subscribers, and have set up a website that is one of the top hits if you search for the AI Act. In addition, I am an Expert in the Working Group on Future of Work at Global Partnership on AI and contribute to the development of AI standards in CEN and CENELEC working groups. Finally, I am a Ronald Coase Fellow at The Mercatus Center at George Mason University for the 2024-2025 academic year.

Previously, I worked for the World Economic Forum on a project about positive AI economic futures together with Stuart Russell, Daniel Susskind and others, and did research for the European Commission on trustworthy AI. I completed a master’s degree in Philosophy and Public Policy at the London School of Economics and Political Science. Finally, I had the honor of being chosen as a fellow in the International Strategy Forum, an initiative by Schmidt Futures, a nonprofit by former Google CEO Eric Schmidt, in partnership with European Council on Foreign Relations. Additionally, I was awarded the Global Priorities Fellowship by the Forethought Foundation for Global Priorities Research.

Tim Schreier, Policy Researcher & Technical Standards Advisor, Future of Life Institute

Tim is a policy researcher at the Future of Life Institute, where he contributes to multiple policy projects and leads FLI’s engagement on the development of European standards for artificial intelligence. Tim holds bachelor’s degrees in Data Science and Philosophy from the University of Marburg, as well as a master’s degree in Machine Learning from the University of Tübingen. Prior to joining FLI, Tim worked as a consultant for data science & AI at d-fine.

Noemi Dreksler, Centre for the Governance of AI

At the societal, policy, and technical level, I believe we need to expend every effort to make sure AI is developed and regulated in the best interests of the public. We must tackle AI through the prism of our humanity, safeguard against risks, and secure a fairer and more just future where humans across the globe can flourish and live healthy and rewarding lives. My current projects are focused on gaining clarity on people’s attitudes towards AI, how different expert groups think about AI and its progress, and the perceptions of the myriad of ethical and governance challenges that result from this emerging technology. I am currently working on a broad spectrum of projects in this space: such as international surveys of the public and work focusing on what AI researchers, US local policymakers, and economists think about AI and its impacts.

My undergraduate years at the University of Oxford were focused on studying a breadth of topics across psychology and philosophy. My undergraduate dissertation focused on pain as a sensory modality and won a Gibbs prize. Whilst completing my Masters in Industrial, Organisational and Business Psychology at UCL, I examined the relationship between the dark triad and emotional intelligence in a review and meta-analysis.

I completed my DPhil on the associations between colours and shapes at the University of Oxford in 2020. You can download and read my thesis here. It brings together historical and empirical research on colour-shape correspondences through the lenses of multisensory perception, emotions, individual differences, and aesthetics.

Previously I have worked at the business psychology consultancy YSC and as a freelance external research associate for the Swiss business school IMD. I conducted the literature reviews for, co-designed workshops based on, and helped with the development of two books written by Nik Kinley and Schlomo Ben-Hur – Changing Employee Behavior and Leadership OS.

Valeria Pulignano, Leading Investigator, ERLM

Valeria Pulignano is Professor in Sociology at the Centre for Sociological Research (CESO) - KU Leuven. She is titularis of a Francqui Stichting Research Professorship mandate (2023-2026) and member of Leuven.AI - KU Leuven Institute for Artificial Intelligence and Fellow at IRRU University of Warwick and LISER (Luxembourg) and Co-Researcher at CRIMT (Centre for Globalisation and Work) at the University of Montreal and Laval in Canada. She has published extensively on topics related to the sociology of work, comparative European industrial (employment) relations, labour markets and inequality, working conditions, job quality and workers’ voice. She serves as Principal Coordinator of the research network on Work, Employment and Industrial Relations within the European Sociological Association and as member of the Executive Committee of the ILERA (Industrial and Employment Relations Assocation). She is PI of an ERC Advanced Grant ResPecTMe where she studies the forms of unpaid labour in the platform economy, creative industries and care and the way in which they account for– and develop a measurement of - precarious work. She is Editor of Work, Employment and Society and the Journal of Industrial Relations. Previously she has covered the role of Chief-editor of Work, Employment and Organization - Frontiers of Sociology.

Rishi Bommasani, Stanford University

I am the Society Lead at the Stanford Center for Research on Foundation Models (CRFM).
I am finishing my PhD at Stanford Computer Science, advised by Percy Liang and Dan Jurafsky.
Funding: Lieberman Fellowship (active), NSF Graduate Research Fellowship (completed).

Prior to Stanford, I began research at Cornell (BA Math, BA CS, MS CS) under Claire Cardie.
I am honored to have worked with the late Professor Arzoo Katiyar.
Cornell CS holds a special place in my heart: the department wrote this about my journey. [Profile] [Profile 2]

I research the societal impact of AI, especially foundation models, to advance evidence-based AI policy.
My research has been featured in The Atlantic, Axios, Bloomberg, Euractiv, Fast Company, Financial Times, Fortune, The Information, MIT Technology Review, Nature, The New York Times, Politico, Quanta, Rappler, Reuters, Tech Policy Press, VentureBeat, The Verge, Vox, The Wall Street Journal and The Washington Post.

References

Anthropic. (2024a). The Claude 3 Model Family: Opus, Sonnet,

Haiku. Retrieved from https://www-cdn.anthropic.com/

de8ba9b01c9ab7cbabf5c33b80b7bbc618857627/Model_Card_Claude_3

.pdf

Barrett, A. M., Newman, J., Nonnecke, B., Hendrycks, D., Murphy, E. R.,

& Jackson, K. (2023). AI Risk-Management Standards Profile for

General-Purpose AI Systems (GPAIS) and Foundation Models. Retrieved

from https://cltc.berkeley.edu/publication/ai-risk-management

-standards-profile

Bengio, Y., Privitera, D., Besiroglu, T., Bommasani, R., Casper, S., Choi, Y.,

. . . Mindermann, S. (2024). International Scientific Report on the Safety

of Advanced AI: Interim Report. Retrieved from https://hal.science/

hal-04612963

Bommasani, R., Hudson, D. A., Adeli, E., Altman, R., Arora, S., von Arx, S., . . .

Liang, P. (2022). On the Opportunities and Risks of Foundation Models.

arXiv. doi: 10.48550/arXiv.2108.07258

California State Portal. (2024). SB-1047 Safe and Secure Innovation

for Frontier Artificial Intelligence Models Act. Retrieved from

https://leginfo.legislature.ca.gov/faces/billNavClient.xhtml

?bill_id=202320240SB1047

Clarke, S., Whittlestone, J., Maas, M., Belfield, H., Hernández-Orallo,

J., & Ó hÉigeartaigh, S. (2021). Feedback from: University

of Cambridge. Retrieved from https://ec.europa.eu/info/law/

better-regulation/have-your-say/initiatives/12527-Artificial

-intelligence-ethical-and-legal-requirements/F2665626_en

EU AI High-Level Expert Group on Artificial Intelligence. (2019). Ethics

Guidelines for Trustworthy AI. Retrieved from https://www.aepd.es/

sites/default/files/2019-12/ai-ethics-guidelines.pdf

Future of Life Institute. (2021). FLI Position Paper on the EU AI Act.

Retrieved from https://futureoflife.org/document/fli-position

-paper-on-the-eu-ai-act/

Kolt, N. (2023). Algorithmic Black Swans. Washington University Law Review,

. Retrieved from https://papers.ssrn.com/sol3/papers.cfm

?abstract_id=4370566 (Forthcoming)

OpenAI. (2024b). OpenAI o1 System Card. Retrieved from https://cdn

.openai.com/o1-system-card-20240917.pdf

OpenAI. (2024c). Review Completed & Altman, Brockman to Continue to Lead

OpenAI. Retrieved from https://openai.com/index/review-completed

-altman-brockman-to-continue-to-lead-openai/

Partnership on AI. (2023). PAI’s Guidance for Safe Foundation

Model Deployment: A Framework for Collective Action.

Retrieved from https://partnershiponai.org/wp-content/uploads/

/10/PAI-Model-Deployment-Guidance.pdf

Schuett, J., et al. (2023). Towards Best Practices in AGI Safety and Governance.

SuperIntelligence – Robotics – Safety & Alignment 2025 2(1) Large Language Models I

Retrieved from https://cdn.governance.ai/AGI_Safety_Governance

_Practices_GovAIReport.pdf

Smuha, N. A. (2021). Beyond the Individual: Governing AI’s Societal Harm.

Internet Policy Review, 10 (3). doi: https://doi.org/10.14763/2021.3.1574 DOI: https://doi.org/10.14763/2021.3.1574

The White House. (2023). Ensuring Safe, Secure, and Trustworthy

AI. Retrieved from https://www.whitehouse.gov/wp-content/

uploads/2023/07/Ensuring-Safe-Secure-and-Trustworthy-AI.pdf

UK Government. (2023). Emerging Processes for Frontier AI Safety.

Retrieved from https://assets.publishing.service.gov.uk/

media/653aabbd80884d000df71bdc/emerging-processes-frontier

-ai-safety.pdf

UK Government. (2024). Frontier AI Safety Commitments, AI Seoul Summit

Retrieved from https://www.gov.uk/government/publications/

frontier-ai-safety-commitments-ai-seoul-summit-2024/

frontier-ai-safety-commitments-ai-seoul-summit-2024

Weidinger, L., et al. (2021). Ethical and Social Risks of Harm from Language

Models. arXiv. doi: 10.48550/arXiv.2112.04359

Effective Mitigations for Systemic Risks from General-Purpose AI

Authors

DOI:

Keywords:

Abstract

Author Biographies

Risto Uuk, Future of Life Institute

Tim Schreier, Policy Researcher & Technical Standards Advisor, Future of Life Institute

Noemi Dreksler, Centre for the Governance of AI

Valeria Pulignano, Leading Investigator, ERLM

Rishi Bommasani, Stanford University

References

Downloads

Published

How to Cite

Issue

Section

License

Current Issue

Announcements

Dario Amodei, The Adolescence of Technology: Confronting and Overcoming the Risks of Powerful AI

Steve Omohundro: Regulating AGI: From Liability to Provable Contracts

Joe Rogan Experience #2345 - Roman Yampolskiy

Steve Omohundro Receives 2024 Future of Life Award

Steve Omohundro and Scientists Discuss the AI Alignment Problem with Neil deGrasse Tyson

Information