Guan, Melody Y., et al. “Deliberative Alignment: Reasoning Enables Safer Language Models.” SuperIntelligence - Robotics - Safety & Alignment, vol. 2, no. 3, July 2025, doi:10.70777/si.v2i3.15159.