Announcement_2
3 ICML papers accepted, with one Spotlight! 😉 Two of them are about the fundamental understanding of the Transformer architecture and its training dynamics. The other is about private training of SGD with adaptive clipping strategies.