Poster / Demo
Xie, Shifeng; Yuan, Rui; Rossi, Simone; Hannagan, Thomas
The initialization determines whether in-context learning is gradient descent
NeurIPS 2025, Workshop, What Can(’t) Transformers Do?, 39th Annual Conference on Neural Information Processing Systems, 2-7 December 2025, San Diego, USA