arxiv:2606.10029
Nikita Balagansky
elephantmipt
AI & ML interests
None yet
Recent Activity
authored a paper 2 days ago
Small Vectors, Big Effects: A Mechanistic Study of RL-Induced Reasoning
via Steering Vectors authored a paper 2 days ago
Steering LLM Reasoning Through Bias-Only Adaptation authored a paper 2 days ago
Train One Sparse Autoencoder Across Multiple Sparsity Budgets to
Preserve Interpretability and Accuracy