Nathalie Kirch is a PhD student in Computer Science Research at Imperial and King’s College London. Her research focuses mainly on mechanistic interpretability and robustness in LLMs. She has a broad interdisciplinary background in cognitive psychology, philosophy, and artificial intelligence, and believes that progress in mechanistic interpretability is crucial to creating trustworthy and safe AI systems. She completed her undergraduate studies in Psychology and Philosophy at Erasmus University Rotterdam and her master’s in Artificial Intelligence at Utrecht University.