Locate, steer, and improve: A practical survey of actionable mechanistic interpretability in large language models
Author:
Hengyuan Zhang,Zhihao Zhang,Mingyang Wang,Zunhai Su,Yiwei Wang,Qianli Wang,Shuzhou Yuan,Ercong Nie,Xufeng Duan,Qibo Xue,Zeping Yu,Chenming Shang,Xiao Liang,Jing Xiong,Hui Shen,Chaofan Tao,Zhengwu Liu,Senjie Jin,Zhiheng Xi,Dongdong Zhang et al.
Publication:
Computer Science Review
© 2026 Elsevier Inc. All rights are reserved, including those for text and data mining, AI training, and similar technologies.