GUAN, Melody Y. et al. Deliberative Alignment: Reasoning Enables Safer Language Models. SuperIntelligence - Robotics - Safety & Alignment, [S. l.], v. 2, n. 3, 2025. DOI: 10.70777/si.v2i3.15159. Available at: https://s-rsa.com/index.php/agi/article/view/15159. Accessed: 18 May 2026.