Andy J Yang
杨加锋
PhD student at the University of Notre Dame, advised by Dr. David Chiang in the NLP lab. Supported by a Notre Dame Deans' Fellowship and an NSF Graduate Research Fellowship. Co-organizer for FLaNN with Anej Svete.
My research interests include:
Formal language theory
Machine learning theory
Computational linguistics
I work on creating mathematical models to explain the limitations and capabilities of machine learning architectures.
Publications
Length generalization bounds for transformers
Andy Yang, Pascal Bergsträßer, Georg Zetzsche, David Chiang, Anthony W. Lin.
ArXiv preprint.
Probability Distributions Computed by Autoregressive Transformers
Andy Yang, Anej Svete, Jiaoda Li, Anthony Widjaja Lin, Jonathan Rawski, Ryan Cotterell, and David Chiang.
In Proc. ICLR. 2026.
The Transformer Cookbook
Andy Yang, Christopher Watson, Anton Xue, Satwik Bhattamishra, Jose Llarena, William Merrill, Emile Dos Santos Ferreira, Anej Svete, and David Chiang.
Transactions on Machine Learning Research, January 2026.
Knee-deep in C-RASP: a transformer depth hierarchy
Andy Yang, Michaël Cadilhac, and David Chiang.
In Proc. NeurIPS 38. 2025.
Simulating hard attention using soft attention
Andy Yang, Lena Strobl, David Chiang, and Dana Angluin.
Transactions of the Association for Computational Linguistics, 2025.
A Formal Framework for Understanding Length Generalization in Transformers
Xinting Huang, Andy Yang, Satwik Bhattamishra, Yash Sarrof, Andreas Krebs, Hattie Zhou, Preetum Nakkiran, Michael Hahn.
In Proc. ICLR. 2025.
Counting like transformers: compiling temporal counting logic into softmax transformers
Andy Yang and David Chiang.
In Proc. CoLM. 2024.
Masked hard-attention transformers recognize exactly the star-free languages
Andy Yang, David Chiang, and Dana Angluin.
In Proc. NeurIPS 37, 10202–10235. 2024.
Origami
Old Friends
On Building Community with Origami
My Origami Flickr
JMM 2024 Origami