Y-Agent Research Blog
Zhuoran Yang Research Group
Archives
All Categories
All Tags
Sparse Autoencoders
1 article
Feature Recovery
In-Context Learning
Interpretability
LLM
Sparse Autoencoders
Transformers
Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders
15 minute read