There are links on the tree branches below. Please click on them.
Selected publications (see also my
Google
Scholar page):
Probability Distributions Computed by
Autoregressive Transformers
Andy Yang, Anej Svete, Jiaoda Li, Anthony Widjaja Lin, Jonathan Rawski, Ryan Cotterell, David Chiang
In Proc. ICLR. 2026.
The Transformer Cookbook
Andy Yang, Christopher Watson, Anton Xue, Satwik Bhattamishra, Jose Llarena, William Merrill,
Emile
Dos
Santos Ferreira, Anej Svete, and David Chiang.
Transactions on Machine Learning Research, January 2026.