Learning Emergent Behaviors and Mitigating Critical Phase Transitions in AI and Cryptoeconomic Systems

Background: Recent advances in artificial intelligence have led to the development of increasingly sophisticated multi-agent systems that constantly adapt to their changing environment. These systems have wide-ranging applications, from managing traffic networks and power grids to enabling the creation of virtual economies on blockchain networks. However, as they evolve over time, these systems encounter uncertain or adversarial environments that compromise their safety and stability in the long-run. These encounters may lead to dramatic changes in the future evolution or equilibrium states of a given system, often only due to small perturbations in its critical parameters that are hard to control or predict.

Such critical phase transitions are frequently observed both in the laboratory, e.g., during the training of neural networks or simulation of dynamical systems, and in the real-world, for instance, in financial markets, genetics, disease outbreaks, opinion swifts in social networks, or mass adoption of novel technologies. However, despite their prevalence, the dynamics behind such transitions are still not well understood. In fact, their complexity still constitutes a critical bottleneck in the ability of scientists and policymakers to increase the adoption of these novel technologies and to make the underlying AI systems more safe, trustworthy, fair, socially efficient, and less brittle against adversarial manipulation.

Objectives: The objective of this PhD project is to develop a dynamic game-theoretic framework to improve the safety and trustworthiness in AI systems. Particular emphasis will be placed on systems with complex decision making processes in which humans repeatedly interact with AI algorithms such as virtual economies and cryptoeconomic mechanisms. Based on this framework, the main goal of the project will be to design mechanisms and protocols that will be robust to uncertain and adversarial environments. The proposed mechanisms will be used to identify potential critical (or phase) transitions that can occur in these systems, and to develop strategies for predicting and mitigating such events. This will involve analyzing the impact of different game-theoretic strategies on stability and robustness, and identifying optimal strategies for achieving safe and trustworthy behavior.

Techniques: The work in this PhD will employ a combination of theoretical work in the above areas and empirical evaluation of the proposed mechanisms or algorithms using simulations and real-world datasets. The approach will integrate formal mathematical methods with game dynamics and machine learning techniques (e.g., offline, or online reinforcement learning) to model and analyze the behavior of the considered AI systems over time.

Impact: The potential impact of this project is significant and manifold. By improving the safety and trust in AI and cryptoeconomic systems, this project has the potential to enable the wider adoption of these technologies, and to unlock their full potential for a wide range of applications. This could have significant benefits for society, including improving the efficiency and reliability of critical infrastructure, and enabling the creation of innovative and reliable services.

1. R. Humphry, P. Avramovic, F. Acevedo, S. Alkhair, A. Gervais. Research Note: Review of Maximal Extractable Value and Blockchain Oracles. (fca.org.uk), 2024.
2. S. Leonardos, G. Piliouras. Exploration-exploitation in multi-agent learning: Catastrophe theory meets game theory, Artificial Intelligence, 2022.
3. S. Leonardos, G. Piliouras, K. Spendlove. Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality, in Advances in Neural Information Processing Systems (NeurIPS), 2021.
4. S. Leonardos, W. Overman, I. Panageas, and G. Piliouras. Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games, in International Conference on Learning Representations (ICLR), 2022.
5. S. Leonardos, J. Sakos, C. Courcoubetis, G. Piliouras. Catastrophe by Design in Population Games: A Mechanism to Destabilize Inefficient Locked-in Technologies. ACM Trans. Econ. Comput, 2023.
6. V. Buterin, D. Reijsbergen, S. Leonardos, G. Piliouras. Incentives in Ethereum’s hybrid Casper protocol. Int J Network Mgmt, 2020.
7. S. Leonardos, B. Monnot, D. Reijsbergen, E. Skoulakis, and G. Piliouras. Dynamical analysis of the EIP-1559 Ethereum fee market. In Proceedings of the 3rd ACM Conference on Advances in Financial Technologies (AFT), 2021.
8. S. Leonardos, D. Reijsbergen, B. Monnot, G. Piliouras. Optimality despite chaos in fee markets. In Financial Cryptography and Data Security, 2024.
9. F. Wu, T. Thiery, S. Leonardos, C. Ventre. Strategic Bidding Wars in On-chain Auctions, 2024.
10. HM Treasury. UK sets out plans to regulate crypto and protect consumers, 2023.

Project ID

STAI-CDT-2024-KCL-24

Supervisor

Dr Stefanos Leonardosstefanos.leonardos@kcl.ac.uk https://stefanosleonardos.wordpress.com/

Learning Emergent Behaviors and Mitigating Critical Phase Transitions in AI and Cryptoeconomic Systems

Project ID

Supervisor

Programme

People

Partnerships