Browse by Journal or other publication
Number of items: 1.
Article
Darm, Paul and Riccardi, Annalisa (2025) Head-specific intervention can induce misaligned AI coordination in large language models. Transactions on Machine Learning Research. ISSN 2835-8856
