On the Convergence of Encoder-only Shallow Transformers

Yongtao Wu, Fanghui Liu, Grigorios Chrysos, Volkan Cevher

Author Locations

No location data available for the ELLIS authors of this paper.

ELLIS Edge Newsletter
Join the 6,000+ people who get the monthly newsletter filled with the latest news, jobs, events and insights from the ELLIS Network.