[1] M. Y. Guan, “Deliberative Alignment: Reasoning Enables Safer Language Models,” SI, vol. 2, no. 3, Jul. 2025.