Gleb Gerasimov

gudleifrr

humdinger-g

AI & ML interests

NLP, interpretability

Recent Activity

authored a paper 4 days ago

Small Vectors, Big Effects: A Mechanistic Study of RL-Induced Reasoning via Steering Vectors

authored a paper 4 days ago

Train One Sparse Autoencoder Across Multiple Sparsity Budgets to Preserve Interpretability and Accuracy

authored a paper 4 days ago

Teach Old SAEs New Domain Tricks with Boosting

Organizations

None yet

Papers 5

Spaces 3

Lime Rebuttal

🚀

Visualize project metrics in real-time

Lime

🚀

Test 0

🚀

Visualize project metrics and runs

Models 214

gudleifrr/gpt2_saes

Updated Jan 9

gudleifrr/sae_Qwen_Qwen2.5-Math-7B_diff_blocks.15.hook_resid_post_16384_batchtopk_64_0.001_9715

Updated Aug 25, 2025

gudleifrr/sae_Qwen_Qwen2.5-Math-7B_diff_blocks.10.hook_resid_post_16384_batchtopk_64_0.001_1376

Updated Aug 25, 2025

gudleifrr/sae_Qwen_Qwen2.5-Math-7B_diff_blocks.10.hook_resid_post_16384_batchtopk_64_0.001_3866

Updated Aug 25, 2025

gudleifrr/sae_Qwen_Qwen2.5-Math-7B_diff_blocks.10.hook_resid_post_16384_batchtopk_64_0.001_9689

Updated Aug 21, 2025

gudleifrr/sae_Qwen_Qwen2.5-7B_diff_blocks.10.hook_resid_post_16384_batchtopk_64_0.001_8634

Updated Aug 21, 2025

gudleifrr/sae_Qwen_Qwen2.5-Math-7B_diff_blocks.10.hook_resid_post_16384_batchtopk_64_0.001_r1

Updated Aug 21, 2025

gudleifrr/sae_Qwen_Qwen2.5-7B_diff_blocks.10.hook_resid_post_16384_batchtopk_64_0.001_r1

Updated Aug 21, 2025

gudleifrr/sae_Qwen_Qwen2.5-7B_default_ln_final.hook_normalized_16384_batchtopk_64_0.001

Updated Aug 15, 2025

gudleifrr/sae_Qwen_Qwen2.5-7B_default_blocks.10.hook_resid_post_16384_batchtopk_64_0.001

Updated Aug 15, 2025

Datasets 9
