Y-Agent Research Blog
Y-Agent Research Blog
Zhuoran Yang Research Group
  • Archives
  • All Categories
  • All Tags

Interpretability

1 article

Feature Recovery Feature-Learning Fourier-Features Grokking In-Context Learning Interpretability LLM Mechanistic-Interpretability Modular-Addition Sparse Autoencoders Transformers
Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders

Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders

15 minute read

© 2025 - 2026 Y-Agent Research Blog

© 2025 - 2026 Y-Agent Research Blog