Acceptable Use Policies for Foundation Models

Authors

  • Kevin Klyman Stanford University, Center for Research on Foundation Models; Harvard University, Belfer Center for Science and International Affairs

DOI:

https://doi.org/10.70777/si.v1i1.10917

Keywords:

foundation models, large language models, llms, artificial intelligence governance, ai governance, ai self-regulation

Abstract

Policymakers hoping to regulate foundation models have focused on preventing specific objectionable uses of AI systems, such as the creation of bioweapons, deepfakes, and child sexual abuse material. Effectively blocking these uses can be difficult in the case of foundation models as they are general-purpose technologies that in principle can be used to generate any type of content. Nevertheless, foundation model developers have been proactive in this area, adopting broad acceptable use policies that prohibit many dangerous uses that developers select themselves as part of their terms of service or model licenses. As part of the 2023 Foundation Model Transparency Index, researchers at the Stanford Center for Research on Foundation Models catalogued the acceptable use policies of 10 leading foundation model developers. All 10 companies publicly disclose the permitted, restricted, and prohibited uses of their models, but there is little additional information available about these policies or how they are implemented. Only 3 of 10 leading foundation model developers disclose how they enforce their acceptable use policy, while only 2 of 10 give any justification to users when they enforce the policy. We provide background on acceptable use policies for foundation models, a preliminary analysis of 30 developers’ acceptable use policies, and a discussion of policy considerations related to developers’ attempts to restrict the use of their foundation models.

References

June Ahn, Lauren K. Bivona, and Jeffrey DiScala. Social media access in k-12 schools: Intractable pol-icy controversies in an evolving world. Proceedings of the American Society for Information Science and Technology, 48(1):1–10, 2011. URL: https://asistdl.onlinelibrary.wiley.com/doi/abs/10.1002/meet.2011.14504801044, arXiv:https://asistdl.onlinelibrary.wiley.com/doi/pdf/10.1002/meet.2 011.14504801044, doi:10.1002/meet.2 011.14504801044. DOI: https://doi.org/10.1002/meet.2011.14504801044

Nouf Alfawzan, Markus Christen, Giovanni Spitale, and Nikola Biller-Andorno. Privacy, data sharing, and data security policies of women’s mhealth apps: Scoping review and content analysis. JMIR Mhealth Uhealth, 10(5):e33735, 2022. URL: https://mhealth.jmir.org/2022/5/e33735, doi: 10.2196/33735. DOI: https://doi.org/10.2196/33735

Anthropic. The claude 3 model family: Opus, sonnet, haiku, 2024. URL: https://www-cdn.anthr opic.com/de8ba9b01c9ab7cbabf5c33b8 0b7bbc618857627/Model_Card_Claude3.pdf.

David Atkinson and Jacob Morrison. A legal risk taxonomy for generative artificial intelligence, 2024. URL: https://arxiv.org/abs/2404.094 79, arXiv:2404.09479.

Yuntao Bai, Saurav Kadavath, Sandipan Kundu, Amanda Askell, Jackson Kernion, Andy Jones, Anna Chen, Anna Goldie, Azalia Mirhoseini, Cameron McKinnon, Carol Chen, Catherine Olsson, Christo-pher Olah, Danny Hernandez, Dawn Drain, Deep Ganguli, Dustin Li, Eli Tran-Johnson, Ethan Perez, Jamie Kerr, Jared Mueller, Jeffrey Ladish, Joshua Landau, Kamal Ndousse, Kamile Lukosuite, Liane Lovitt, Michael Sellitto, Nelson Elhage, Nicholas Schiefer, Noemi Mercado, Nova DasSarma, Robert Lasenby, Robin Larson, Sam Ringer, Scott John-ston, Shauna Kravec, Sheer El Showk, Stanislav Fort, Tamera Lanham, Timothy Telleen-Lawton, Tom Con-erly, Tom Henighan, Tristan Hume, Samuel R. Bow-man, Zac Hatfield-Dodds, Ben Mann, Dario Amodei, Nicholas Joseph, Sam McCandlish, Tom Brown, and Jared Kaplan. Constitutional ai: Harmlessness from ai feedback, 2022. URL: https://arxiv.org/abs/2212.08073, arXiv:2212.08073.

Julia Barnett. The ethical implications of generative audio models: A systematic literature review. In Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, AIES ’23. ACM, August 2023. URL: http://dx.doi.org/10.1145/36002 11.3604686, doi:10.1145/3600211.3604 686.

Adrien Basdevant, Camille Francois, Victor Storchan, Kevin Bankston, Ayah Bdeir, Brian Behlendorf, Merouane Debbah, Sayash Kapoor, Yann LeCun, Mark Surman, Helen King-Turvey, Nathan Lambert, Stefano Maffulli, Nik Marda, Govind Shivkumar, and Justine Tunney. Towards a framework for openness in foundation models: Proceedings from the columbia convening on openness in artificial intelligence, 2024. URL: https://arxiv.org/abs/2405.158 02, arXiv:2405.15802.

T. Bernier, A. Shah, L. E. Ross, C. H. Logie, and E. Seto. The use of information and communication technologies by sex workers to manage occupational health and safety: Scoping review. Journal of Medical Internet Research, 23(6):e26085, Jun 2021. doi:10.2196/26085. DOI: https://doi.org/10.2196/26085

Katherine E. Beyer. Busting the ghost guns: A technical, statutory, and practical approach to the 3-d printed weapon problem. Kentucky Law Journal, 103:433–456, 2014.

Sam Biddle. Openai quietly deletes ban on using chatgpt for “military and warfare”. The Intercept, January 2024. URL: https://theintercept.c om/2024/01/12/open-ai-military-ban-chatgpt/.

Abeba Birhane, William Isaac, Vinodkumar Prabhakaran, Mark Diaz, Madeleine Clare Elish, Iason Gabriel, and Shakir Mohamed. Power to the people? opportunities and challenges for participatory ai. In Proceedings of the 2nd ACM Conference on Equity and Access in Algorithms, Mechanisms, and Optimization, EAAMO ’22, New York, NY, USA, 2022. Association for Computing Machinery. doi: 10.1145/3551624.3555290. DOI: https://doi.org/10.1145/3551624.3555290

Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card, Rodrigo Castellon, Niladri Chatterji, Annie Chen, Kathleen Creel, Jared Quincy Davis, Dora Demszky, Chris Donahue, Moussa Doumbouya, Esin Durmus, Stefano Ermon, John Etchemendy, Kawin Ethayarajh, Li Fei-Fei, Chelsea Finn, Trevor Gale, Lauren Gillespie, Karan Goel, Noah Goodman, Shelby Grossman, Neel Guha, Tatsunori Hashimoto, Peter Henderson, John He-witt, Daniel E. Ho, Jenny Hong, Kyle Hsu, Jing Huang, Thomas Icard, Saahil Jain, Dan Juraf-sky, Pratyusha Kalluri, Siddharth Karamcheti, Geoff Keeling, Fereshte Khani, Omar Khattab, Pang Wei, Koh, Mark Krass, Ranjay Krishna, Rohith Kudi-tipudi, Ananya Kumar, Faisal Ladhak, Mina Lee, Tony Lee, Jure Leskovec, Isabelle Levent, Xi-ang Lisa Li, Xuechen Li, Tengyu Ma, Ali Malik, Christopher D. Manning, Suvir Mirchandani, Eric Mitchell, Zanele Munyikwa, Suraj Nair, Avanika Narayan, Deepak Narayanan, Ben Newman, Allen Nie, Juan Carlos Niebles, Hamed Nilforoshan, Julian Nyarko, Giray Ogut, Laurel Orr, Isabel Papadimitriou, Joon Sung Park, Chris Piech, Eva Portelance, Christopher Potts, Aditi Raghunathan, Rob Reich, Hongyu Ren, Frieda Rong, Yusuf Roohani, Camilo Ruiz, Jack Ryan, Christopher Re, Dorsa Sadigh, Shiori Sagawa, Keshav Santhanam, Andy Shih, Krishnan Srinivasan, Alex Tamkin, Rohan Taori, Armin W. Thomas, Florian Tramer, Rose E. Wang, William Wang, Bohan Wu, Jiajun Wu, Yuhuai Wu, Sang Michael Xie, Michihiro Yasunaga, Jiaxuan You, Matei Zaharia, Michael Zhang, Tianyi Zhang, Xikun Zhang, Yuhui Zhang, Lucia Zheng, Kaitlyn Zhou, and Percy Liang. On the opportunities and risks of foundation models, 2022. arXiv:2108.07258.

Michael Brenes and William D. Hartung. Private finance and the quest to remake modern warfare. Research report, Quincy Institute for Responsible State-craft, jun 2024. URL: https://quincyinst.o rg/research/private-finance-and-the-quest-to-remake-modern-warfare/#executive-summary.

Brian Schatz, Ben Ray Luj´an, Peter Welch, Mark R. Warner, and Angus S. King, Jr. Letter to openai, Jul 2024. URL: https://www.schatz.senate. gov/imo/media/doc/letter_to_openai .pdf.

Anthony Brohan, Noah Brown, Justice Carbajal, Yev-gen Chebotar, Xi Chen, Krzysztof Choromanski, Tianli Ding, Danny Driess, Avinava Dubey, Chelsea Finn, Pete Florence, Chuyuan Fu, Montse Gonzalez Arenas, Keerthana Gopalakrishnan, Kehang Han, Karol Hausman, Alexander Herzog, Jasmine Hsu, Brian Ichter, Alex Irpan, Nikhil Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Isabel Leal, Lisa Lee, Tsang-Wei Edward Lee, Sergey Levine, Yao Lu, Henryk Michalewski, Igor Mordatch, Karl Pertsch, Kanishka Rao, Krista Reymann, Michael Ryoo, Grecia Salazar, Pannag Sanketi, Pierre Ser-manet, Jaspiar Singh, Anikait Singh, Radu Sori-cut, Huong Tran, Vincent Vanhoucke, Quan Vuong, Ayzaan Wahid, Stefan Welker, Paul Wohlhart, Jialin Wu, Fei Xia, Ted Xiao, Peng Xu, Sichun Xu, Tianhe Yu, and Brianna Zitkovich. Rt-2: Vision-language-action models transfer web knowledge to robotic con-trol, 2023. URL: https://arxiv.org/abs/23 07.15818, arXiv:2307.15818.

C. Bronstein. Deplatforming sexual speech in the age of fosta/sesta. Porn Studies, 8(4):367–380, 2021. do i:10.1080/23268743.2021.1993972. DOI: https://doi.org/10.1080/23268743.2021.1993972

Sarah Huiyi Cen, Aspen Hopkins, Andrew Ilyas, Aleksander Madry, Isabella Struckman, and Luis Videgaray Caso. Ai supply chains, April 2023. URL: https://ssrn.com/abstract=4789403.

Center for an Informed Public, Digital Forensic Re-search Lab, Graphika, and Stanford Internet Observatory. The long fuse: Misinformation and the 2020 election, 2021. Stanford Digital Repository: Election Integrity Partnership. v1.3.0. URL: https: //purl.stanford.edu/tr171zs0069.

Bilva Chandra, George Awad, Yooyoung Lee, Peter Fontana, Razvan Amironesei, Mark Przybocki, Kamie Roberts, Elham Tabassi, Mat Heyman, and Jesse Dunietz. Reducing risks posed by synthetic content, April 2024. URL: https://airc.nis.gov/docs/NIST.AI.1004.SyntheticContent.ipd.pdf.

Nicole Chi, Emma Lurie, and Deirdre K. Mulligan. Reconfiguring diversity and inclusion for ai ethics. In Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society, AIES ’21, page 447–457, New York, NY, USA, 2021. Association for Computing Machinery. doi:10.1145/3461702.3462 622. DOI: https://doi.org/10.1145/3461702.3462622

Leshem Choshen, Elad Venezian, Noam Slonim, and Yoav Katz. Fusing finetuned models for better pre-training, 2022. URL: https://arxiv.org/ab s/2204.03044, arXiv:2204.03044.

Paul Christiano, Jan Leike, Tom B. Brown, Miljan Martic, Shane Legg, and Dario Amodei. Deep reinforcement learning from human preferences, 2023. URL: https://arxiv.org/abs/1706.037 41, arXiv:1706.03741.

Cohere, OpenAI, and AI21 Labs. Best practices for deploying language models, Jul 2022. URL: https: //cdn.openai.com/papers/joint-recommendation-for-language-model-deplo yment.pdf.

Samantha Cole. Riley reid on ai: ’i don’t want porn to get left behind’, Oct 2023. URL: https://www.404media.co/riley-reid-clona-ai-chatbot-virtual-companion/.

Competition and Markets Authority. Ai foundation models initial report. Report, UK Competition & Markets Authority, Sep 2023. URL: https://as sets.publishing.service.gov.uk/med ia/650449e86771b90014fdab4c/Full_N on-Confidential_Report_PDFA.pdf.

Competition and Markets Authority. Ai foundation models: Technical update report. Technical report, UK Competition & Markets Authority, Apr 2024. URL: https://assets.publishing.serv ice.gov.uk/media/661e5a4c746919818 5bd3d62/AI_Foundation_Models_techn ical_update_report.pdf.

Danish Contractor, Daniel McDuff, Julia Katherine Haines, Jenny Lee, Christopher Hines, Brent Hecht, Nicholas Vincent, and Hanlin Li. Behavioral use licensing for responsible ai. In 2022 ACM Conference on Fairness, Accountability, and Transparency, FAccT ’22. ACM, June 2022. URL: http://dx .doi.org/10.1145/3531146.3533143, doi:10.1145/3531146.3533143. DOI: https://doi.org/10.1145/3531146.3533143

A. Feder Cooper, Emanuel Moss, Benjamin Laufer, and Helen Nissenbaum. Accountability in an algorithmic society: Relationality, responsibility, and robustness in machine learning. In Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency, FAccT ’22, page 864–876, New York, NY, USA, 2022. Association for Computing Machinery. doi:10.1145/3531146.3533150. DOI: https://doi.org/10.1145/3531146.3533150

Ned Cooper and Alexandra Zafiroglu. From fitting participation to forging relationships: The art of participatory ml. In Proceedings of the CHI Conference on Human Factors in Computing Systems, CHI ’24, New York, NY, USA, 2024. Association for Computing Machinery. doi:10.1145/3613904.3642 775. DOI: https://doi.org/10.1145/3613904.3642775

Cyberspace Administration of China. Interim measures for the management of generative artificial intelligence services. https://www.chinalawtr anslate.com/en/generative-ai-interim/, July 2023. Accessed: 2024-05-14.

Matthew Dahl, Varun Magesh, Mirac Suzgun, and Daniel E Ho. Large legal fictions: Profiling legal hallucinations in large language models. Journal of Legal Analysis, 16(1):64–93, January 2024. URL: ht tp://dx.doi.org/10.1093/jla/laae003, doi:10.1093/jla/laae003. DOI: https://doi.org/10.1093/jla/laae003

Fernando Delgado, Stephen Yang, Michael Madaio, and Qian Yang. The participatory turn in ai design: Theoretical foundations and the current state of practice. In Proceedings of the 3rd ACM Conference on Equity and Access in Algorithms, Mechanisms, and Optimization, EAAMO ’23, New York, NY, USA, 2023. Association for Computing Machinery. doi:10.1145/3617694.3623261. DOI: https://doi.org/10.1145/3617694.3623261

Neil Doherty, Leonidas Anastasakis, and Heather Fulford. Reinforcing the security of corporate information resources: A critical review of the role of the acceptable use policy. International Journal of Information Management, 31:201–209, 06 2011. doi: 10.1016/j.ijinfomgt.2010.06.001. DOI: https://doi.org/10.1016/j.ijinfomgt.2010.06.001

J. Donovan, E. Dreyfuss, and B. Friedberg. Meme Wars: The Untold Story of the Online Battles Upending Democracy in America. Bloomsbury Publishing, 2022. URL: https://books.google.com/b ooks?id=04l3EAAAQBAJ.

Evelyn Douek. Content moderation as systems thinking. Harvard Law Review, 2022. URL: https://ssrn.com/abstract=4005326, doi: 10.2139/ssrn.4005326. DOI: https://doi.org/10.2139/ssrn.4005326

Evelyn Douek. The meta oversight board and the empty promise of legitimacy. Harvard Journal of Law & Technology, 37, 2024. doi:10.2139/ss rn.4565180. DOI: https://doi.org/10.2139/ssrn.4565180

Kate Downing. Ai licensing can’t balance “open” with “responsible”, Jul 2023. URL: https://katedowninglaw.com/2023/07/13/ai-licensing-cant-balance-open-with-responsible/.

Kate Downing. Choose your own adventure: The eu ai act and openish ai, February 2024. URL: https://katedowninglaw.com/2023/07/13/ai-licensing-cant-balance-open-with-responsible/.

Francisco Eiras, Aleksandar Petrov, Bertie Vidgen, Christian Schroeder, Fabio Pizzati, Katherine Elkins, Supratik Mukhopadhyay, Adel Bibi, Aaron Purewal, Csaba Botos, Fabro Steibel, Fazel Keshtkar, Fazl Barez, Genevieve Smith, Gianluca Guadagni, Jon Chun, Jordi Cabot, Joseph Imperial, Juan Arturo Nolazco, Lori Landay, Matthew Jackson, Phillip H. S. Torr, Trevor Darrell, Yong Lee, and Jakob Foerster. Risks and opportunities of open-source generative ai, 2024. URL: https://arxiv.org/abs/2405 .08597, arXiv:2405.08597.

Satu Elo and Helvi Kyngas. The qualitative content analysis process. Journal of Advanced Nursing, 62(1):107–115, 2008. URL: https://onlineli brary.wiley.com/doi/abs/10.1111/j. 1365-2648.2007.04569.x, arXiv:https: //onlinelibrary.wiley.com/doi/pdf/ 10.1111/j.1365-2648.2007.04569.x, do i:10.1111/j.1365-2648.2007.04569.x.

Jerel M. Ezell, Babatunde Patrick Ajayi, Tapan Parikh, Kyle Miller, Alex Rains, and David Scales. Drug use and artificial intelligence: Weighing concerns and possibilities for prevention. American Journal of Preventive Medicine, 66(3):568–572, 2024. URL: https://www.sciencedirect.com/science/article/pii/S0749379723004 841, doi:10.1016/j.amepre.2023.11.0 24. DOI: https://doi.org/10.1016/j.amepre.2023.11.024

Steven Feldstein. The global expansion of AI surveillance. Carnegie Endowment for International Peace Washington, DC, 2019.

Grant Fergusson, Caitriona Fitzgerald, Chris Frascella, Megan Iorio, Tom McBrien, Calli Schroeder, Ben Winters, and Enid Zhou. Generating harms: Generative ai’s impact & paths forward. Technical report, Electronic Privacy Information Center, 2023. URL: https://epic.org/documents/generating-harms-generative-ais-impact-paths-forward/.

Thomas Ferretti. An institutionalist approach to ai ethics: Justifying the priority of government regulation over self-regulation. Moral Philosophy and Politics, 9(2):239–265, 2022. doi:10.1515/mopp-2020-0056. DOI: https://doi.org/10.1515/mopp-2020-0056

Jessica Fjeld, Nele Achten, Hannah Hilligoss, Adam Nagy, and Madhulika Srikumar. Principled artificial intelligence: Mapping consensus in ethical and rights-based approaches to principles for ai. Berkman Klein Center Research Publication, January 2020. http://dx.doi.org/10.2139/ssrn.3518482. DOI: https://doi.org/10.2139/ssrn.3518482

Luciano Floridi. Translating principles into practices of digital ethics: Five risks of being unethical. Philosophy & Technology, 32:185–193, 2019. doi: 10.1007/s13347-019-00354-x. DOI: https://doi.org/10.1007/s13347-019-00354-x

Aakash Gautam. Reconfiguring participatory design to resist ai realism. arXiv preprint arXiv:2406.03245, 2024. Presented at Participatory Design Conference 2024. URL: https://doi.org/10.48550/a rXiv.2406.03245, arXiv:2406.03245. DOI: https://doi.org/10.1145/3661455.3669867

Yinuo Geng. Comparing” deepfake” regulatory regimes in the united states, the european union, and china. Geo. L. Tech. Rev., 7:157, 2023.

T. Gillespie. Custodians of the Internet: Platforms, Content Moderation, and the Hidden Decisions that Shape Social Media. Yale University Press, 2018. URL: https://books.google.com/books ?id=-RteDwAAQBAJ. DOI: https://doi.org/10.12987/9780300235029

GitHub. Github acceptable use policies, 2024. Accessed: July 2024. URL: https://docs.github.com/en/site-policy/acceptable-use-policies/github-acceptable-use-policies.

Josh A Goldstein, Jason Chao, Shelby Grossman, Alex Stamos, and Michael Tomz. How persuasive is AI-generated propaganda? PNAS Nexus, 3(2):pgae034, 02 2024. arXiv:https://academic.oup.com/pnasnexus/article-pdf /3/2/pgae034/56712546/pgae034.pdf, doi:10.1093/pnasnexus/pgae034. DOI: https://doi.org/10.1093/pnasnexus/pgae034

Josh A. Goldstein, Girish Sastry, Micah Musser, Renee DiResta, Matthew Gentzel, and Katerina Sedova. Generative language models and automated influence operations: Emerging threats and potential mitigations, 2023. URL: https://arxiv.org/ab s/2301.04246, arXiv:2301.04246.

Google. Policy guidelines for the gemini app, 2024. URL: https://gemini.google/policy-g uidelines/.

Robert Gorwa and Michael Veale. Moderating model marketplaces: Platform governance puzzles for ai intermediaries, 2024. arXiv:2311.12573. DOI: https://doi.org/10.2139/ssrn.4716865

Pauline Gourlet, Donato Ricci, and Maxime Cr´epel. Reclaiming artificial intelligence accounts: A plea for a participatory turn in artificial intelligence inquiries. Big Data & Society, 11(2):20539517241248093, 2024. arXiv:https://doi.org/10.117 7/20539517241248093, doi:10.1177/20 539517241248093. DOI: https://doi.org/10.1177/20539517241248093

Declan Grabb, Max Lamparth, and Nina Vasan. Risks from language models for automated mental healthcare: Ethics and structure for implementation. medRxiv, 2024. URL: https://www.medrxiv.org/content/early/2024/04/08/202 4.04.07.24305462, arXiv:https://www.medrxiv.org/content/early/2024/0 4/08/2024.04.07.24305462.full.pdf, doi:10.1101/2024.04.07.24305462. DOI: https://doi.org/10.1101/2024.04.07.24305462

M.L. Gray and S. Suri. Ghost Work: How to Stop Sil-icon Valley from Building a New Global Underclass. Houghton Mifflin Harcourt, 2019. URL: https: //books.google.com/books?id=u10-uQE ACAAJ.

Nick Gregorio, Janahan Mathanamohan, Qusay H. Mahmoud, and May AlTaei. Hacking in the cloud. Internet Technology Letters, 2(1):e84, 2019. URL:https://onlinelibrary.wiley.com/do i/abs/10.1002/itl2.84, arXiv:https: //onlinelibrary.wiley.com/doi/pdf/ 10.1002/itl2.84, doi:10.1002/itl2.84. DOI: https://doi.org/10.1002/itl2.84

Philipp Hacker. Ai regulation in europe: From the ai act to future regulatory challenges, 2023. URL: https://arxiv.org/abs/2310.04072, ar Xiv:2310.04072.

Philipp Hacker, Johann Cordes, and Janina Rochon. Regulating gatekeeper artificial intelligence and data: Transparency, access and fairness under the digital markets act, the general data protection regulation and beyond. European Journal of Risk Regulation, 15(1):49–86, 2024. DOI: https://doi.org/10.1017/err.2023.81

Oliver L Haimson, Daniel Delmonaco, Peipei Nie, and Andrea Wegner. Disproportionate removals and differing content moderation experiences for conser-vative, transgender, and black social media users: Marginalization and moderation gray areas. Proceed-ings of the ACM on Human-Computer Interaction, 5(CSCW2):1–35, 2021. DOI: https://doi.org/10.1145/3479610

Vaughn Hamilton, Hanna Barakat, and Elissa M. Redmiles. Risk, resilience and reward: Impacts of shifting to digital sex work. Proc. ACM Hum.-Comput. Interact., 6(CSCW2), nov 2022. doi: 10.1145/3555650. DOI: https://doi.org/10.1145/3555650

Peter Henderson, Xuechen Li, Dan Jurafsky, Tat-sunori Hashimoto, Mark A. Lemley, and Percy Liang. Foundation models and fair use, 2023. arXiv: 2303.15715. DOI: https://doi.org/10.2139/ssrn.4404340

Peter Henderson, Xiangyu Qi, Yi Zeng, Tinghao Xie, Pin-Yu Chen, Ruoxi Jia, and Prateek Mittal. Safety risks from customizing foundation models via fine-tuning. Policy brief, Stanford Institute for Human-Centered AI, January 2024. URL: https://hai. stanford.edu/sites/default/files/2

- 01/Policy-Brief-Safety-Risks-Customizing-Foundation-Models-Fine-Tuning.pdf.

Mia Hoffmann and Heather Frase. Adding struc-ture to ai harm: An introduction to cset’s ai harm framework. Technical report, Center for Security and Emerging Technology, July 2023. doi:10.51593 /20230022. DOI: https://doi.org/10.51593/20230022

Krystal Hu, Greg Bensinger, and Jody Godoy. Exclusive: Ftc seeking details on amazon deal with ai startup adept, source says. Reuters, Jul 2024. Accessed . URL: https://www. reuters.com/technology/ftc-seeking-details-amazon-deal-with-ai-start up-adept-source-says-2024-07-16/.

Saffron Huang, Divya Siddarth, Liane Lovitt, Thomas I. Liao, Esin Durmus, Alex Tamkin, and Deep Ganguli. Collective constitutional ai: Aligning a language model with public input. In Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, FAccT ’24, page 1395–1417, New York, NY, USA, 2024. Association for Computing Machinery. doi:10.1145/3630106.3658 979. DOI: https://doi.org/10.1145/3630106.3658979

Hugging Face. Content policy, August 2023. URL: https://huggingface.co/content-guidelines.

Hakan Inan, Kartikeya Upasani, Jianfeng Chi, Rashi Rungta, Krithika Iyer, Yuning Mao, Michael Tontchev, Qing Hu, Brian Fuller, Davide Testuggine, and Madian Khabsa. Llama guard: Llm-based input-output safeguard for human-ai conversations, 2023. URL: https://arxiv.org/abs/2312.066 74, arXiv:2312.06674.

Shagun Jhaver, Sucheta Ghoshal, Amy Bruckman, and Eric Gilbert. Online harassment and content moderation: The case of blocklists. ACM Trans-actions on Computer-Human Interaction (TOCHI), 25(2):1–33, 2018. DOI: https://doi.org/10.1145/3185593

Anna Jobin, Marcello Ienca, and Effy Vayena. The global landscape of ai ethics guidelines. Nature Machine Intelligence, 1(9):389–399, September 2019. doi:10.1038/s42256-019-0088-2. DOI: https://doi.org/10.1038/s42256-019-0088-2

A. Jones. Camming: Money, Power, and Pleasure in the Sex Work Industry. NYU Press, 2020. URL:https://books.google.com/books?id=30 SODwAAQBAJ. DOI: https://doi.org/10.18574/nyu/9781479842964.001.0001

Barbara M. Jones. 3d printing in libraries: A view from within the american library association: Pri-vacy, intellectual freedom and ethical policy framework. Bulletin of the Association for Information Science and Technology, 42(1):36–41, 2015. URL: https://asistdl.onlinelibrary.wi ley.com/doi/abs/10.1002/bul2.201 5.1720420113, arXiv:https://asistd

l. onlinelibrary.wiley.com/doi/pdf/ 10.1002/bul2.2015.1720420113, doi: 10.1002/bul2.2015.1720420113. DOI: https://doi.org/10.1002/bul2.2015.1720420113

Nektaria Kaloudi and Jingyue Li. The ai-based cy-ber threat landscape: A survey. ACM Comput. Surv., 53(1), feb 2020. doi:10.1145/3372823. DOI: https://doi.org/10.1145/3372823

Sayash Kapoor, Rishi Bommasani, Kevin Klyman, Shayne Longpre, Ashwin Ramaswami, Peter Cihon, Aspen Hopkins, Kevin Bankston, Stella Biderman, Miranda Bogen, Rumman Chowdhury, Alex Engler, Peter Henderson, Yacine Jernite, Seth Lazar, Stefano Maffulli, Alondra Nelson, Joelle Pineau, Aviya Skowron, Dawn Song, Victor Storchan, Daniel Zhang, Daniel E. Ho, Percy Liang, and Arvind Narayanan. On the societal impact of open foundation models, 2024. arXiv:2403.07918.

Rishabh Kaushal, Jacob van de Kerkhof, Catalina Goanta, Gerasimos Spanakis, and Adriana Iamnitchi. Automated transparency: A legal and empirical analysis of the digital services act transparency database, 2024. arXiv:2404.02894. DOI: https://doi.org/10.1145/3630106.3658960

Imrul Kayes and Adriana Iamnitchi. Privacy and se-curity in online social networks: A survey. Online Social Networks and Media, 3:1–21, 2017. DOI: https://doi.org/10.1016/j.osnem.2017.09.001

Daphne Keller. Amplification and its discontents: Why regulating the reach of online content is hard. J.Free Speech L., 1:227, 2021.

Paul Keller and Nicolo` Bonato. Growth of responsi-ble AI licensing. Analysis of license use for ML mod-els published on. Open Future, feb 7 2023. URL: https://openfuture.pubpub.org/pub/growth-of-responsible-ai-licensing.

Moo Jin Kim, Karl Pertsch, Siddharth Karamcheti, Ted Xiao, Ashwin Balakrishna, Suraj Nair, Rafael Rafailov, Ethan Foster, Grace Lam, Pannag Sanketi, Quan Vuong, Thomas Kollar, Benjamin Burchfiel, Russ Tedrake, Dorsa Sadigh, Sergey Levine, Percy Liang, and Chelsea Finn. Openvla: An open-source vision-language-action model, 2024. URL: https: //arxiv.org/abs/2406.09246, arXiv: 2406.09246.

Shiyang Lai, Yujin Potter, Junsol Kim, Richard Zhuang, Dawn Song, and James Evans. Position: Evolving AI collectives enhance human diversity and enable self-regulation. In Forty-first International Conference on Machine Learning, 2024. URL: ht tps://openreview.net/forum?id=u6Pe RHEsjL.

Nathan Lambert. Llama 3.1 405b, meta’s ai strategy, and the new, open frontier model ecosystem. Inter-connects, Jul 2024. URL: https://www.interc onnects.ai/p/llama-405b-open-front ier-model.

Nathan Lambert, Thomas Krendl Gilbert, and Tom Zick. The history and risks of reinforcement learning and human feedback, 2023. arXiv:2310.13595. DOI: https://doi.org/10.1145/3600211.3604698

Katherine Lee, A. Feder Cooper, and James Grim-melmann. Talkin’ ’bout ai generation: Copyright and the generative-ai supply chain, 2024. arXiv: 2309.08133.

V. Lehdonvirta. Cloud Empires: How Digital Plat-forms Are Overtaking the State and How We Can Re-gain Control. MIT Press, 2022. URL: https: //books.google.com/books?id=bc9UEA AAQBAJ. DOI: https://doi.org/10.7551/mitpress/14219.001.0001

Mark A. Lemley, Peter Henderson, and Tatsunori Hashimoto. Where’s the liabil-ity in harmful ai speech?, August 2023. http://dx.doi.org/10.2139/ssrn.4531029. DOI: https://doi.org/10.2139/ssrn.4531029

Han Li, Jie Zhang, and Rathindra Sarathy. Under-standing compliance with internet use policy from the perspective of rational choice theory. Decision Support Systems, 48(4):635–645, 2010. URL: https: //www.sciencedirect.com/science/article/pii/S0167923609002619, doi: 10.1016/j.dss.2009.12.005. DOI: https://doi.org/10.1016/j.dss.2009.12.005

Weixin Liang, Nazneen Rajani, Xinyu Yang, Ezin-wanne Ozoani, Eric Wu, Yiqun Chen, Daniel Scott Smith, and James Zou. What’s documented in ai? sys-tematic analysis of 32k ai model cards, 2024. URL: https://arxiv.org/abs/2402.05160, arXiv:2402.05160.

Shayne Longpre, Stella Biderman, Alon Albalak, Hailey Schoelkopf, Daniel McDuff, Sayash Kapoor, Kevin Klyman, Kyle Lo, Gabriel Ilharco, Nay San, Maribeth Rauh, Aviya Skowron, Bertie Vidgen, Laura Weidinger, Arvind Narayanan, Victor Sanh, David Adelani, Percy Liang, Rishi Bommasani, Peter Hen-derson, Sasha Luccioni, Yacine Jernite, and Luca Sol-daini. The responsible foundation model develop-ment cheatsheet: A review of tools & resources, 2024. URL: https://arxiv.org/abs/2406.167 46, arXiv:2406.16746.

Shayne Longpre, Sayash Kapoor, Kevin Klyman, Ashwin Ramaswami, Rishi Bommasani, Borhane Blili-Hamelin, Yangsibo Huang, Aviya Skowron, Zheng-Xin Yong, Suhas Kotha, Yi Zeng, Weiyan Shi, Xianjun Yang, Reid Southen, Alexander Robey, Patrick Chao, Diyi Yang, Ruoxi Jia, Daniel Kang, Sandy Pentland, Arvind Narayanan, Percy Liang, and Peter Henderson. A safe harbor for ai evaluation and red teaming, 2024. arXiv:2403.04893.

Se´an Looney. Content moderation through removal of service: Content delivery networks and extremist websites. Policy & Internet, 15(4):544–558, 2023. URL: https://onlinelibrary.wiley.co m/doi/abs/10.1002/poi3.370, arXiv: https://onlinelibrary.wiley.com/do i/pdf/10.1002/poi3.370, doi:10.1002/poi3.370. DOI: https://doi.org/10.1002/poi3.370

Alexandra Loverock, Tyler Marshall, Dylan Viste, Fahad Safi, Will Rioux, Navid Sedaghat, Megan Kennedy, and S. Monty Ghosh. Electronic harm reduction interventions for drug overdose monitoring and prevention: A scoping review. Drug and Alco-hol Dependence, 250:110878, 2023. URL: https: //www.sciencedirect.com/science/article/pii/S037687162301116X, doi: 10.1016/j.drugalcdep.2023.110878. DOI: https://doi.org/10.1016/j.drugalcdep.2023.110878

Michael J Madison. Reconstructing the software license. Loy. U. Chi. Lj, 35:275, 2003.

Yaaseen Mahomed, Charlie M. Crawford, Sanjana Gautam, Sorelle A. Friedler, and Danae¨ Metaxa. Au-diting gpt’s content moderation guardrails: Can chat-gpt write your favorite tv show? In Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, FAccT ’24, page 660–686, New York, NY, USA, 2024. Association for Computing Machinery. doi:10.1145/3630106.3658932. DOI: https://doi.org/10.1145/3630106.3658932

Nahema Marchal, Rachel Xu, Rasmi Elasmar, Iason Gabriel, Beth Goldberg, and William Isaac. Generative ai misuse: A taxonomy of tactics and insights from real-world data, 2024. URL: https: //arxiv.org/abs/2406.13843, arXiv: 2406.13843.

G Alan Marlatt. Harm reduction: Come as you are. Addictive behaviors, 21(6):779–788, 1996. DOI: https://doi.org/10.1016/0306-4603(96)00042-1

Glenn Ellingson Matt Motyl. The unbearably high cost of cutting trust & safety corners, 2024. URL: https://www.techpolicy.press/the-u nbearably-high-cost-of-cutting-tru st-safety-corners/.

Natalie Maus, Patrick Chao, Eric Wong, and Jacob R Gardner. Black box adversarial prompting for foun-dation models. In The Second Workshop on New Frontiers in Adversarial Machine Learning, 2023.

Philipp Mayring, Angelika Bikner-Ahsbahs, Chris-tine Knipping, and Norma Presmeg. Qualitative Con-tent Analysis: Theoretical Background and Procedures, pages 365–380. Springer Netherlands, Dor-drecht, 2015. doi:10.1007/978-94-017-9181-6_13. DOI: https://doi.org/10.1007/978-94-017-9181-6_13

Daniel McDuff, Tim Korjakow, Scott Cambo, Jesse Josua Benjamin, Jenny Lee, Yacine Jernite, Carlos Munoz Ferrandis, Aaron Gokaslan, Alek Tarkowski, Joseph Lindley, A. Feder Cooper, and Danish Contractor. On the standardization of behavioral use clauses and their adoption for responsible licensing of ai, 2024. arXiv:2402.05979.

D. McMenemy, University of Strathclyde. Depart-ment of Computer, and Information Sciences. Public library digital services: Emergent issues of access and acceptable use, 2019. URL: https://books.go ogle.com/books?id=RDmO0AEACAAJ.

Michelle M. Mello and Neel Guha. Understanding liability risk from using health care artificial intel-ligence tools. New England Journal of Medicine, 390(3):271–278, 2024. URL: https://www. nejm.org/doi/full/10.1056/NEJMhl e2308901, arXiv:https://www.nejm.o rg/doi/pdf/10.1056/NEJMhle2308901, doi:10.1056/NEJMhle2308901. DOI: https://doi.org/10.1056/NEJMhle2308901

Rachel Metz and Brody Ford. Adobe’s ‘ethical’ fire-fly ai was trained on midjourney images. Bloomberg, April 2024. URL: https://www.bloomberg. com/news/articles/2024-04-12/adobes-ai-firefly-used-ai-generated-i mages-from-rivals-for-training.

Mary Minow, Tomas A. Lipinski, Gretchen McCord, et al. The Library’s Legal Answers for Makerspaces. ALA Editions, 2016. eBook.

Margaret Mitchell. The pillars of a rights-based ap-proach to ai development, 2023. URL: https: //www.techpolicy.press/the-pillars-of-a-rightsbased-approach-to-ai-development/.

Margaret Mitchell, Simone Wu, Andrew Zaldivar, Parker Barnes, Lucy Vasserman, Ben Hutchinson, Elena Spitzer, Inioluwa Deborah Raji, and Timnit Ge-bru. Model cards for model reporting. In Proceedings of the Conference on Fairness, Accountability, and Transparency, FAT* ’19, page 220–229, New York, NY, USA, 2019. Association for Computing Machin-ery. doi:10.1145/3287560.3287596. DOI: https://doi.org/10.1145/3287560.3287596

Christopher A. Mouton, Caleb Lucas, and Ella Guest. The Operational Risks of AI in Large-Scale Biolog-ical Attacks: Results of a Red-Team Study. RAND Corporation, Santa Monica, CA, 2024. doi:10.7 249/RRA2977-2.

Nikita Nangia, Clara Vania, Rasika Bhalerao, and Samuel R. Bowman. Crows-pairs: A challenge dataset for measuring social biases in masked lan-guage models, 2020. URL: https://arxiv.or g/abs/2010.00133, arXiv:2010.00133. DOI: https://doi.org/10.18653/v1/2020.emnlp-main.154

Davy Tsz Kit Ng, Jac Ka Lok Leung, Samuel Kai Wah Chu, and Maggie Shen Qiao. Conceptualiz-ing ai literacy: An exploratory review. Computers and Education: Artificial Intelligence, 2:100041, 2021. URL: https://www.sciencedirect.com/science/article/pii/S2666920X21000 357, doi:10.1016/j.caeai.2021.100041. DOI: https://doi.org/10.1016/j.caeai.2021.100041

Jonathan A. Obar and Anne Oeldorf-Hirsch. The biggest lie on the internet: ignoring the privacy policies and terms of service policies of social networking services. Information, Communication & Society, 23(1):128–147, 2020. arXiv:https://doi. org/10.1080/1369118X.2018.1486870, doi:10.1080/1369118X.2018.1486870. DOI: https://doi.org/10.1080/1369118X.2018.1486870

W. Ian O’Byrne. Acceptable Use Policies, pages 1–6. John Wiley & Sons, Ltd, 2019. URL: https://on linelibrary.wiley.com/doi/abs/10.1 002/9781118978238.ieml0001, arXiv:ht tps://onlinelibrary.wiley.com/doi/pdf/10.1002/9781118978238.ieml0001, doi:10.1002/9781118978238.ieml0001. DOI: https://doi.org/10.1002/9781118978238.ieml0001

National Technical Committee 260 on Cyber-security of Standardization Administration of China (SAC/TC260). Basic safety requirements for generative artificial intelligence services, April 2024. Translated by the Center for Security and Emerging Technology. URL: https://cset.georgetown.edu/publication/china-safety-req uirements-for-generative-ai-final/.

Open Source Initiative. The Open Source AI Defini-tion – draft v. 0.0.8, 2024. Accessed July 2024. URL: https://opensource.org/deepdive/dr afts/the-open-source-ai-definition-draft-v-0-0-8.

OpenAI. Model spec, May 2024. URL: https: //cdn.openai.com/spec/model-spec-2024-05-08.html.

OpenAI, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Floren-cia Leoni Aleman, Diogo Almeida, Janko Al-tenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Moham-mad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-Shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko, Madelaine Boyd, Anna-Luisa Brakman, Greg Brockman, Tim Brooks, Miles Brundage, Kevin Button, Trevor Cai, Rosie Campbell, Andrew Cann, Brittany Carey, Chelsea Carlson, Rory Carmichael, Brooke Chan, Che Chang, Fotis Chantzis, Derek Chen, Sully Chen, Ruby Chen, Jason Chen, Mark Chen, Ben Chess, Chester Cho, Casey Chu, Hyung Won Chung, Dave Cummings, Jeremiah Currier, Yunxing Dai, Cory Decareaux, Thomas Degry, Noah Deutsch, Damien Deville, Arka Dhar, David Dohan, Steve Dowling, Sheila Dunning, Adrien Ecoffet, Atty Eleti, Tyna Eloundou, David Farhi, Liam Fedus, Niko Felix, Sim´on Posada Fishman, Juston Forte, Isabella Ful-ford, Leo Gao, Elie Georges, Christian Gibson, Vik Goel, Tarun Gogineni, Gabriel Goh, Rapha Gontijo-Lopes, Jonathan Gordon, Morgan Grafstein, Scott Gray, Ryan Greene, Joshua Gross, Shixiang Shane Gu, Yufei Guo, Chris Hallacy, Jesse Han, Jeff Harris, Yuchen He, Mike Heaton, Johannes Heidecke, Chris Hesse, Alan Hickey, Wade Hickey, Peter Hoeschele, Brandon Houghton, Kenny Hsu, Shengli Hu, Xin Hu, Joost Huizinga, Shantanu Jain, Shawn Jain, Joanne Jang, Angela Jiang, Roger Jiang, Haozhun Jin, Denny Jin, Shino Jomoto, Billie Jonn, Hee-woo Jun, Tomer Kaftan, Łukasz Kaiser, Ali Ka-mali, Ingmar Kanitscheider, Nitish Shirish Keskar, Tabarak Khan, Logan Kilpatrick, Jong Wook Kim, Christina Kim, Yongjik Kim, Jan Hendrik Kirch-ner, Jamie Kiros, Matt Knight, Daniel Kokotajlo, Łukasz Kondraciuk, Andrew Kondrich, Aris Kon-stantinidis, Kyle Kosic, Gretchen Krueger, Vishal Kuo, Michael Lampe, Ikai Lan, Teddy Lee, Jan Leike, Jade Leung, Daniel Levy, Chak Ming Li, Rachel Lim, Molly Lin, Stephanie Lin, Mateusz Litwin, Theresa Lopez, Ryan Lowe, Patricia Lue, Anna Makanju, Kim Malfacini, Sam Manning, Todor Markov, Yaniv Markovski, Bianca Martin, Katie Mayer, Andrew Mayne, Bob McGrew, Scott Mayer McKinney, Christine McLeavey, Paul McMillan, Jake McNeil, David Medina, Aalok Mehta, Jacob Menick, Luke Metz, Andrey Mishchenko, Pamela Mishkin, Vinnie Monaco, Evan Morikawa, Daniel Mossing, Tong Mu, Mira Murati, Oleg Murk, David M´ely, Ashvin Nair, Reiichiro Nakano, Rajeev Nayak, Arvind Neelakantan, Richard Ngo, Hyeonwoo Noh, Long Ouyang, Cullen O’Keefe, Jakub Pachocki, Alex Paino, Joe Palermo, Ashley Pantuliano, Gi-ambattista Parascandolo, Joel Parish, Emy Parparita, Alex Passos, Mikhail Pavlov, Andrew Peng, Adam Perelman, Filipe de Avila Belbute Peres, Michael Petrov, Henrique Ponde de Oliveira Pinto, Michael, Pokorny, Michelle Pokrass, Vitchyr H. Pong, Tolly Powell, Alethea Power, Boris Power, Elizabeth Proehl, Raul Puri, Alec Radford, Jack Rae, Aditya Ramesh, Cameron Raymond, Francis Real, Kendra Rimbach, Carl Ross, Bob Rotsted, Henri Roussez, Nick Ryder, Mario Saltarelli, Ted Sanders, Shibani Santurkar, Girish Sastry, Heather Schmidt, David Schnurr, John Schulman, Daniel Selsam, Kyla Shep-pard, Toki Sherbakov, Jessica Shieh, Sarah Shoker, Pranav Shyam, Szymon Sidor, Eric Sigler, Maddie Simens, Jordan Sitkin, Katarina Slama, Ian Sohl, Benjamin Sokolowsky, Yang Song, Natalie Stau-dacher, Felipe Petroski Such, Natalie Summers, Ilya Sutskever, Jie Tang, Nikolas Tezak, Madeleine B. Thompson, Phil Tillet, Amin Tootoonchian, Elizabeth Tseng, Preston Tuggle, Nick Turley, Jerry Tworek, Juan Felipe Cer´on Uribe, Andrea Vallone, Arun Vijayvergiya, Chelsea Voss, Carroll Wainwright, Justin Jay Wang, Alvin Wang, Ben Wang, Jonathan Ward, Jason Wei, CJ Weinmann, Akila Welihinda, Peter Welinder, Jiayi Weng, Lilian Weng, Matt Wi-ethoff, Dave Willner, Clemens Winter, Samuel Wol-rich, Hannah Wong, Lauren Workman, Sherwin Wu, Jeff Wu, Michael Wu, Kai Xiao, Tao Xu, Sarah Yoo, Kevin Yu, Qiming Yuan, Wojciech Zaremba, Rowan Zellers, Chong Zhang, Marvin Zhang, Shengjia Zhao, Tianhao Zheng, Juntang Zhuang, William Zhuk, and Barret Zoph. Gpt-4 technical report, 2024. arXiv: 2303.08774.

Aviv Ovadya. Reimagining democracy for ai. Journal of Democracy, 34(4):162–170, Oct 2023. URL: ht tps://www.journalofdemocracy.org/a rticles/reimagining-democracy-for-a i/. DOI: https://doi.org/10.1353/jod.2023.a907697

Jessica A. Pater, Moon K. Kim, Elizabeth D. Mynatt, and Casey Fiesler. Characterizations of online harass-ment: Comparing policies across social media plat-forms. In Proceedings of the 2016 ACM International Conference on Supporting Group Work, GROUP ’16, page 369–374, New York, NY, USA, 2016. Association for Computing Machinery. doi:10.1145/29 57276.2957297. DOI: https://doi.org/10.1145/2957276.2957297

Riana Pfefferkorn. Addressing computer-generated child sex abuse imagery: Legal framework and policy implications. Lawfare, Feb 2024. Accessed . URL: https://www.lawfarem edia.org/article/addressing-computer-generated-child-sex-abuse-imagery-legal-framework-and-policy-implications.

Giada Pistilli, Carlos Mu˜noz Ferrandis, Yacine Jer-nite, and Margaret Mitchell. Stronger together: on the articulation of ethical charters, legal tools, and technical documentation in ml. In Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, FAccT ’23, page 343–354, New York, NY, USA, 2023. Association for Computing Machinery. doi:10.1145/3593013.3594002. DOI: https://doi.org/10.1145/3593013.3594002

Priyanshu Priya, Mauajama Firdaus, and Asif Ek-bal. Computational politeness in natural language processing: A survey. ACM Computing Surveys, 56(9):1–42, May 2024. URL: http://dx.doi .org/10.1145/3654660, doi:10.1145/36 54660. DOI: https://doi.org/10.1145/3654660

Xiangyu Qi, Yi Zeng, Tinghao Xie, Pin-Yu Chen, Ruoxi Jia, Prateek Mittal, and Peter Henderson. Fine-tuning aligned language models compromises safety, even when users do not intend to!, 2023. arXiv: 2310.03693.

Michael L Rekart. Sex-work harm reduction. The Lancet, 366(9503):2123–2134, 2005. DOI: https://doi.org/10.1016/S0140-6736(05)67732-X

Anka Reuel, Ben Bucknall, Stephen Casper, Tim Fist, Lisa Soder, Onni Aarne, Lewis Hammond, Lujain Ibrahim, Alan Chan, Peter Wills, Markus Anderljung, Ben Garfinkel, Lennart Heim, Andrew Trask, Gabriel Mukobi, Rylan Schaeffer, Mauricio Baker, Sara Hooker, Irene Solaiman, Alexandra Sasha Luccioni, Nitarshan Rajkumar, Nicolas Mo¨es, Jef-frey Ladish, Neel Guha, Jessica Newman, Yoshua Bengio, Tobin South, Alex Pentland, Sanmi Koyejo, Mykel J. Kochenderfer, and Robert Trager. Open problems in technical ai governance, 2024. URL: https://arxiv.org/abs/2407.14981, arXiv:2407.14981.

S.T. Roberts. Behind the Screen: Content Modera-tion in the Shadows of Social Media. Yale University Press, 2019. URL: https://books.google.c om/books?id=uiCbDwAAQBAJ. DOI: https://doi.org/10.12987/9780300245318

Alexander Robey, Eric Wong, Hamed Hassani, and George J Pappas. Smoothllm: Defending large lan-guage models against jailbreaking attacks. arXiv preprint arXiv:2310.03684, 2023.

Elaine Robinson. The panoptic principle: privacy and surveillance in the public library as evidenced in the acceptable use policy. Thesis, University of Strathclyde, 2019. Accessed: 2021-07-01. URL: http://localhost/files/gq67jr277.

Elaine Robinson and David McMenemy. ‘to be un-derstood as to understand’: A readability analysis of public library acceptable use policies. Journal of Li-brarianship and Information Science, 52(3):713–725, 2020. arXiv:https://doi.org/10.1177/ 0961000619871598, doi:10.1177/096100 0619871598. DOI: https://doi.org/10.1177/0961000619871598

A.B. Ruighaver, S.B. Maynard, and M. Warren. Ethical decision making: Improving the quality of acceptable use policies. Computers & Security, 29(7):731–736, 2010. URL: https://www.sciencedir ect.com/science/article/pii/S01674 04810000386, doi:10.1016/j.cose.201 0.05.004. DOI: https://doi.org/10.1016/j.cose.2010.05.004

Teela Sanders, Jane Scoular, Rosie Campbell, Jane Pitcher, and Stewart Cunningham. Internet sex work: Beyond the gaze. Springer, 2018. DOI: https://doi.org/10.1007/978-3-319-65630-4

Johannes Schneider, Arianna Casanova Flores, and Anne-Catherine Kranz. Exploring human-llm conver-sations: Mental models and the originator of toxicity, 2024. URL: https://arxiv.org/abs/2407 .05977, arXiv:2407.05977.

Elizabeth Seger, Aviv Ovadya, Ben Garfinkel, Divya Siddarth, and Allan Dafoe. Democratising ai: Mul-tiple meanings, goals, and methods, 2023. URL: https://arxiv.org/abs/2303.12642, arXiv:2303.12642. DOI: https://doi.org/10.1145/3600211.3604693

Rusheb Shah, Quentin Feuillade Montixi, Soroush Pour, Arush Tagade, and Javier Rando. Scalable and transferable black-box jailbreaks for language mod-els via persona modulation. In Socially Responsible Language Modelling Research, 2023.

Megan Shahi, Adam Conner, and Nicole Alvarez. Generative ai should be developed and deployed re-sponsibly at every level for everyone, February 1 2024. URL: https://www.americanprogre ss.org/article/generative-ai-should-be-developed-and-deployed-responsibly-at-every-level-for-everyone/.

Gemma Sharp, John Torous, and Madeline L. West. Ethical challenges in ai approaches to eat-ing disorders. Journal of Medical Internet Re-search, 25:e50696, August 2023. ©Gemma Sharp, John Torous, Madeline L. West. Originally pub-lished in the Journal of Medical Internet Research (https://www.jmir.org), 14.08.2023. URL: https: //mhealth.jmir.org/2022/5/e33735, doi:10.2196/50696. DOI: https://doi.org/10.2196/50696

Renee Shelby, Shalaleh Rismani, Kathryn Henne, AJung Moon, Negar Rostamzadeh, Paul Nicholas, N’Mah Yilla, Jess Gallegos, Andrew Smart, Emilio Garcia, and Gurleen Virk. Sociotechnical harms of algorithmic systems: Scoping a taxonomy for harm reduction, 2023. arXiv:2210.05791. DOI: https://doi.org/10.1145/3600211.3604673

Xinyue Shen, Zeyuan Chen, Michael Backes, Yun Shen, and Yang Zhang. ” do anything now”: Characterizing and evaluating in-the-wild jailbreak prompts on large language models. arXiv preprint arXiv:2308.03825, 2023. DOI: https://doi.org/10.1145/3658644.3670388

Eugenia Siapera and Paloma Viejo-Otero. Governing hate: Facebook and digital racism. Television & New Media, 22(2):112–130, 2021. DOI: https://doi.org/10.1177/1527476420982232

Keng Siau, Fiona Fui-Hoon Nah, and Limei Teng. Acceptable internet use policy. Commun. ACM, 45(1):75–79, jan 2002. doi:10.1145/502269 .502302. DOI: https://doi.org/10.1145/502269.502302

Riley Simmons-Edler, Ryan Badman, Shayne Long-pre, and Kanaka Rajan. Ai-powered autonomous weapons risk geopolitical instability and threaten ai research, 2024. URL: https://arxiv.org/ab s/2405.01859, arXiv:2405.01859.

Zachary Small. Black artists say a.i. shows bias, with algorithms erasing their history. The New York Times, July 2023. URL: https://www.nytimes.co m/2023/07/04/arts/design/black-art ists-bias-ai.html.

Irene Solaiman. The gradient of generative ai release: Methods and considerations. In Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, FAccT ’23, page 111–122, New York, NY, USA, 2023. Association for Computing Machinery. doi:10.1145/3593013.3593981. DOI: https://doi.org/10.1145/3593013.3593981

Irene Solaiman, Zeerak Talat, William Agnew, Lama Ahmad, Dylan Baker, Su Lin Blodgett, Canyu Chen, Hal Daume´ III au2, Jesse Dodge, Isabella Duan, El-lie Evans, Felix Friedrich, Avijit Ghosh, Usman Go-har, Sara Hooker, Yacine Jernite, Ria Kalluri, Alberto Lusoli, Alina Leidinger, Michelle Lin, Xiuzhu Lin, Sasha Luccioni, Jennifer Mickel, Margaret Mitchell, Jessica Newman, Anaelia Ovalle, Marie-Therese Png, Shubham Singh, Andrew Strait, Lukas Struppek, and Arjun Subramonian. Evaluating the social impact of generative ai systems in systems and society, 2024. URL: https://arxiv.org/abs/2306.059 49, arXiv:2306.05949.

Madhulika Srikumar, Jiyoo Chang, and Kasia Chmielinski. Risk mitigation strategies for the open foundation model value chain, July 11 2024. URL: https://partnershiponai.org/resour ce/risk-mitigation-strategies-for-t he-open-foundation-model-value-cha in/.

Zahra Stardust. Safe for work: Feminist porn, corpo-rate regulation and community standards. In Cather-ine Dale and Rosemary Overell, editors, Orienting Feminism: Media, Activism and Cultural Representa-tion, pages 155–179. Springer International Publish-ing, Cham, 2018. URL: https://link.sprin ger.com/chapter/10.1007/978-3-319-7 0660-3_9. DOI: https://doi.org/10.1007/978-3-319-70660-3_9

Farley Stewart. Internet acceptable use policies: Nav-igating the management, legal, and technical issues. Information Systems Security, 9(3):1–7, 2000. arXi v:https://doi.org/10.1201/1086/433 10.9.3.20000708/31360.6, doi:10.1201/1086/43310.9.3.20000708/31360.6. DOI: https://doi.org/10.1201/1086/43310.9.3.20000708/31360.6

Angelika Strohmayer, Jenn Clamen, and Mary Laing. Technologies for social justice: Lessons from sex workers on the front lines. In Proceedings of the 2019 CHI conference on human factors in computing systems, pages 1–14, 2019. DOI: https://doi.org/10.1145/3290605.3300882

Substance Abuse and Mental Health Services Admin-istration. Harm reduction framework. Technical report, Center for Substance Abuse Prevention, Substance Abuse and Mental Health Services Administration, 2023. URL: https://www.samhsa.gov/sites/default/files/harm-reducti on-framework.pdf.

Lucy Suchman. Algorithmic warfare and the rein-vention of accuracy. Critical Studies on Security, 8(2):175–187, 2020. arXiv:https://doi. org/10.1080/21624887.2020.1760587, doi:10.1080/21624887.2020.1760587. DOI: https://doi.org/10.1080/21624887.2020.1760587

Supreme Court of the United States. Netchoice, llcv. paxton. United States Supreme Court, Jul 2024. Docket No. 22-555, 598 U.S. (2024). URL: https: //www.supremecourt.gov/opinions/23 pdf/22-555_h3ci.pdf.

Harini Suresh, Emily Tseng, Meg Young, Mary Gray, Emma Pierson, and Karen Levy. Participation in the age of foundation models. In Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, FAccT ’24, page 1609–1621, New York, NY, USA, 2024. Association for Computing Machinery. doi:10.1145/3630106.3658992. DOI: https://doi.org/10.1145/3630106.3658992

Alex Tamkin, Amanda Askell, Liane Lovitt, Esin Durmus, Nicholas Joseph, Shauna Kravec, Karina Nguyen, Jared Kaplan, and Deep Ganguli. Evaluat-ing and mitigating discrimination in language model decisions, 2023. URL: https://arxiv.org/ab s/2312.03689, arXiv:2312.03689.

David Thiel, Melissa Stroebel, and Rebecca Port-noff. Generative ml and csam: Implications and mit-igations. Report, Stanford Internet Observatory, Jun 2023. URL: https://purl.stanford.edu/jv206yg3793.

Thorn. Safety by design for generative ai: Preventing child sexual abuse, 2024. URL: https://info.t horn.org/hubfs/thorn-safety-by-des ign-for-generative-AI.pdf.

Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Niko-lay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cris-tian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini, Rui Hou, Hakan Inan, Marcin Kardas, Viktor Kerkez, Madian Khabsa, Isabel Kloumann, Artem Korenev, Punit Singh Koura, Marie-Anne Lachaux, Thibaut Lavril, Jenya Lee, Diana Liskovich, Yinghai Lu, Yun-ing Mao, Xavier Martinet, Todor Mihaylov, Pushkar Mishra, Igor Molybog, Yixin Nie, Andrew Poulton, Jeremy Reizenstein, Rashi Rungta, Kalyan Saladi, Alan Schelten, Ruan Silva, Eric Michael Smith, Ran-jan Subramanian, Xiaoqing Ellen Tan, Binh Tang, Ross Taylor, Adina Williams, Jian Xiang Kuan, Puxin Xu, Zheng Yan, Iliyan Zarov, Yuchen Zhang, Angela Fan, Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, and Thomas Scialom. Llama 2: Open foundation and fine-tuned chat models, 2023. arXiv:2307.09288.

Jon Truby, Rafael Dean Brown, Imad Antoine Ibrahim, and Oriol Caudevilla Parellada. A sandbox approach to regulating high-risk artificial intelligence applications. European Journal of Risk Regulation, 13(2):270–294, 2022. DOI: https://doi.org/10.1017/err.2021.52

European Union. Proposal for a regulation of the eu-ropean parliament and of the council laying down har-monised rules on artificial intelligence (artificial in-telligence act) and amending certain union legislative acts, 2024. Accessed: 2024-05-14. URL: https://data.consilium.europa.eu/doc/document/ST-5662-2024-INIT/en/pdf.

United States District Court for the District of Columbia. United states et al. v. google llc, Aug 2024. Case No. 20-cv-3010 (APM), Memorandum Opinion. URL: https://storage.courtlistener. com/recap/gov.uscourts.dcd.223205/gov.uscourts.dcd.223205.1033.0_1.p df.

Margrethe Vestager, Sarah Cardell, Jonathan Kanter, and Lina M. Khan. Joint statement on competition in generative ai foundation models and ai products, Jul 2024. URL: https://www.ftc.gov/system /files/ftc.gov/pdf/ai-joint-statement.pdf.

Luis Villa. Evaluating the rail license family, Novem-ber 2022. URL: https://blog.tidelift.co m/evaluating-the-rail-license-famil y.

Sandra Wachter and Brent Mittelstadt. A right to rea-sonable inferences: re-thinking data protection law in the age of big data and ai. Colum. Bus. L. Rev., page 494, 2019. DOI: https://doi.org/10.31228/osf.io/mu2kf

Boxin Wang, Weixin Chen, Hengzhi Pei, Chulin Xie, Mintong Kang, Chenhui Zhang, Chejian Xu, Zidi Xiong, Ritik Dutta, Rylan Schaeffer, Sang T. Truong, Simran Arora, Mantas Mazeika, Dan Hendrycks, Zi-nan Lin, Yu Cheng, Sanmi Koyejo, Dawn Song, and Bo Li. Decodingtrust: A comprehensive assessment of trustworthiness in gpt models, 2024. arXiv: 2306.11698.

Junlin Wang, Jue Wang, Ben Athiwaratkun, Ce Zhang, and James Zou. Mixture-of-agents enhances large language model capabilities, 2024. URL: https://arxiv.org/abs/2406.046 92, arXiv:2406.04692.

Alexander Wei, Nika Haghtalab, and Jacob Stein-hardt. Jailbroken: How does llm safety training fail?arXiv preprint arXiv:2307.02483, 2023.

Laura Weidinger, John Mellor, Bernat Guillen Pegueroles, Nahema Marchal, Ravin Kumar, Kristian Lum, Canfer Akbulut, Mark Diaz, Stevie Bergman, Mikel Rodriguez, Verena Rieser, and William Isaac. Star: Sociotechnical approach to red teaming language models, 2024. URL: https://arxiv.or g/abs/2406.11757, arXiv:2406.11757. DOI: https://doi.org/10.18653/v1/2024.emnlp-main.1200

Laura Weidinger, Maribeth Rauh, Nahema Mar-chal, Arianna Manzini, Lisa Anne Hendricks, Juan Mateos-Garcia, Stevie Bergman, Jackie Kay, Conor Griffin, Ben Bariach, Iason Gabriel, Verena Rieser, and William Isaac. Sociotechnical safety evaluation of generative ai systems, 2023. arXiv:2310.119 86.

Jake Weidman and Jens Grossklags. The acceptable state: An analysis of the current state of acceptable use policies in academic institutions. In Proceed-ings of the 27th European Conference on Informa-tion Systems (ECIS), page Research Papers, Stock-holm & Uppsala, Sweden, 2019. URL: https: //aisel.aisnet.org/ecis2019_rp/99.

Matt White, Ibrahim Haddad, Cailean Osborne, Xiao-Yang Liu Yanglet, Ahmed Abdelmonsef, and Sachin Varghese. The model openness framework: Promot-ing completeness and openness for reproducibility, transparency, and usability in artificial intelligence, 2024. URL: https://arxiv.org/abs/24 03.13784, arXiv:2403.13784.

White House. Voluntary ai commitments, July 2023. Accessed: 2024-05-14. URL: https://www.wh itehouse.gov/wp-content/uploads/202 3/07/Ensuring-Safe-Secure-and-Trust worthy-AI.pdf.

David Gray Widder. Epistemic power in ai ethics labor: Legitimizing located complaints. In Pro-ceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, FAccT ’24, page 1295–1304, New York, NY, USA, 2024. Association for Computing Machinery. doi:10.1145/3630 106.3658973. DOI: https://doi.org/10.1145/3630106.3658973

David Gray Widder, Sarah West, and Meredith Whit-taker. Open (for business): Big tech, concentrated power, and the political economy of open ai. SSRN, 2023. URL: https://ssrn.com/abstract= 4543807, doi:10.2139/ssrn.4543807. DOI: https://doi.org/10.2139/ssrn.4543807

Xianjun Yang, Xiao Wang, Qi Zhang, Linda Pet-zold, William Yang Wang, Xun Zhao, and Dahua Lin. Shadow alignment: The ease of subverting safely-aligned language models. arXiv preprint arXiv:2310.02949, 2023.

Yi Zeng, Yu Yang, Andy Zhou, Jeffrey Ziwei Tan, Yuheng Tu, Yifan Mai, Kevin Klyman, Minzhou Pan, Ruoxi Jia, Dawn Song, Percy Liang, and Bo Li. Air-bench 2024: A safety benchmark based on risk categories from regulations and policies, 2024. URL: https://arxiv.org/abs/2407.17436, arXiv:2407.17436. DOI: https://doi.org/10.70777/agi.v1i1.10863

Yi Zeng, Kevin Klyman, Andy Zhou, Yu Yang, Minzhou Pan, Ruoxi Jia, Dawn Song, Percy Liang, and Bo Li. Ai risk categorization decoded (air 2024): From government regulations to corporate policies, 2024. URL: https://arxiv.org/abs/2406.178 64, arXiv:2406.17864. DOI: https://doi.org/10.70777/agi.v1i1.10603

Qiusi Zhan, Richard Fang, Rohan Bindu, Akul Gupta, Tatsunori Hashimoto, and Daniel Kang. Removing RLHF Protections in GPT-4 via Fine-Tuning. arXiv preprint arXiv:2311.05553, 2023. DOI: https://doi.org/10.18653/v1/2024.naacl-short.59

Angela Huyue Zhang. Angela Huyue Zhang. The promise and perils of china’s regulation of artificial intelligence. University of Hong Kong Faculty of Law Research Paper, 2024(02), 2024. http://dx.doi.org/10.2139/ssrn.4708676. URL: https://ssrn.com/abstract=4708676. DOI: https://doi.org/10.2139/ssrn.4708676

Jieyu Zhao, Tianlu Wang, Mark Yatskar, Vicente Ordonez, and Kai-Wei Chang. Gender bias in coreference resolution: Evaluation and debiasing methods, 2018. URL: https://arxiv.org/abs/1804.06876, arXiv:1804.06876. DOI: https://doi.org/10.18653/v1/N18-2003

Wenting Zhao, Xiang Ren, Jack Hessel, Claire Cardie, Yejin Choi, and Yuntian Deng. Wildchat: 1m chatgpt interaction logs in the wild, 2024.URL:https://arxiv.org/abs/2405.01470,arXiv:2405.01470.

Andy Zou, Zifan Wang, J Zico Kolter, and Matt Fredrikson. Universal and transferable adversarial attacks on aligned language models. arXiv preprint arXiv:2307.15043, 2023.

Partial taxonomy of violative uses from developers' use policies

Downloads

Published

2024-11-26

How to Cite

Klyman, K. (2024). Acceptable Use Policies for Foundation Models. SuperIntelligence - Robotics - Safety & Alignment, 1(1), 20. https://doi.org/10.70777/si.v1i1.10917