📝 Selected Papers
Language Model Semantics
ArXiv Distilling Language Models via Capacity-Aware Reaction Transfer
Jian Gu, Aldeida Aleti, Chunyang Chen, Hongyu Zhang, Peng Di, Sheng Gao
ACL'26 Cross-Scale Knowledge Transfer in Language Models through Latent Semantic Alignment
Jian Gu, Aldeida Aleti, Chunyang Chen, Hongyu Zhang
ACL'25 Semantic-Aware Layer-Freezing for Computation-Efficient Fine-Tuning of LMs
Jian Gu, Aldeida Aleti, Chunyang Chen, Hongyu Zhang
ArXiv Vocabulary-Defined Semantics: Latent Space Clustering for Beyond-Context Learning
Jian Gu, Aldeida Aleti, Chunyang Chen, Hongyu Zhang
Software Engineering for Machine Learning (SE4AI)
ArXiv SemRF: A Semantic Reference Frame for Residual-Stream Dynamics in LMs
Jian Gu, Aldeida Aleti, Chunyang Chen, Hongyu Zhang
ArXiv Rethinking Weight Tying: Pseudo-Inverse Tying for Stable LM Training and Updates
Jian Gu, Aldeida Aleti, Chunyang Chen, Hongyu Zhang
ICSE'26 Semantic-based Optimization for Repairing LLMs: Case Study on Code Generation
Jian Gu, Aldeida Aleti, Chunyang Chen, Hongyu Zhang
ArXiv Focus-Aware Neurons: Contextual LM Repair leveraging Selective Gating Attention
Jian Gu, Aldeida Aleti, Chunyang Chen, Hongyu Zhang
TOSEM 2026 Neuron Patching: Semantic-based Neuron-level LM Repair for Code Generation
Jian Gu, Aldeida Aleti, Chunyang Chen, Hongyu Zhang
Machine Learning for Software Engineering (AI4SE)
FSE'23 (IVR) Towards Top-Down Automated Development in Limited Scopes: A Neuro-Symbolic Framework from Expressibles to Executables
Jian Gu, Harald C. Gall
SANER'22 Assemble Foundation Models for Automatic Code Summarization
Jian Gu, Pasquale Salza, Harald C. Gall
ICSME'21 Multimodal Representation for Neural Code Search
Jian Gu, Zimin Chen, Martin Monperrus
TSE 2022 On the Effectiveness of Transfer Learning for Code Search
Pasquale Salza, Christoph Schwizer, Jian Gu, Harald C. Gall
TSE 2021 Automated Classification of Overfitting Patches with Statically Extracted Code Features
He Ye, Jian Gu, Matias Martinez, Thomas Durieux, Martin Monperrus