arxiv:2606.12138
Gleb Gerasimov
gudleifrr
AI & ML interests
NLP, interpretability
Recent Activity
authored a paper 4 days ago
Small Vectors, Big Effects: A Mechanistic Study of RL-Induced Reasoning
via Steering Vectors authored a paper 4 days ago
Train One Sparse Autoencoder Across Multiple Sparsity Budgets to
Preserve Interpretability and Accuracy authored a paper 4 days ago
Teach Old SAEs New Domain Tricks with BoostingOrganizations
None yet