The Evolution of AI Communication: From Chain-of-Thought to Neuralese and the Case for Interpretability Agents
DOI:
https://doi.org/10.70777/si.v2i5.16187Keywords:
neuralese, latent vector communication, ai communication, inter-ai-communication, explainable ai, mechanistic interpretability, safe ai, safe artificial general intelligence, safe superintelligenceAbstract
The field of artificial intelligence is approaching an inflection point that few researchers saw coming: the potential emergence of AI-native communication protocols that bypass human language entirely. A recent scenario analysis, "AI 2027," introduces a fascinating concept called "neuralese recurrence" that deserves serious academic and industry attention.
References
• Anthropic. (2024). "Probes Catch Sleeper Agents." Anthropic Research Blog.
• Bengio, Y., Courville, A., & Vincent, P. (2013). "Representation learning: A review and new perspectives." IEEE transactions on pattern analysis and machine intelligence, 35(8), 1798-1828.
• Conneau, A., et al. (2020). "Unsupervised Cross-lingual Representation Learning at Scale." Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.
• Elhage, N., et al. (2021). "A Mathematical Framework for Transformer Circuits." Anthropic Research.
• Hao, S., et al. (2024). "Recurrent Neural Language Models." Meta AI Research.
• Olah, C., et al. (2020). "Zoom In: An Introduction to Circuits." Distill, 5(3).
• Russell, S. (2019). Human Compatible: Artificial Intelligence and the Problem of Control. Viking.
• Wei, J., et al. (2022). "Chain-of-Thought Prompting Elicits Reasoning in Large Language Models." NeurIPS.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Erhan Arslan

This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License.