--> Explained: Towards Monosemanticity in LLMs | Mithilesh Vaidya Explained: Towards Monosemanticity in LLMs | Mithilesh Vaidya

Explained: Towards Monosemanticity in LLMs

Nov 4, 2023

I gave a short presentation on Anthropic’s work Towards Monosemanticity: Decomposing Language Models With Dictionary Learning.

Slides can be accessed here.