Biography

Here is Guanhua Chen (陈冠华). I am an assistant professor in the Department of Statistics and Data Science in Southern University of Science and Technology (SUSTech). I received Ph.D. from the Department of Computer Science, The University of Hong Kong in 2022, under the supervision of Prof. Jia Pan and Prof. Wenping Wang. I obtained the Bachelor and Master degree from Tsinghua University in 2012 and 2014, respectively. I was a research intern in Microsoft Research Asia and Huawei Noah’s Ark Lab. Currently, I am an area chair of ACL, EMNLP and CCL, also the reviewer for top AI conferences and journal like ICML, NeurIPS, ICLR, ACL, EMNLP, NAACL, and TASLP. My research interests are natural language processing (NLP) and applied machine learning, such as data synthesis with large language model, multimodal LLMs, and complex reasoning with LLMs, etc.

Positions available: I am looking for self-motivated PostDoc/PhD/Master/visiting students to join our lab. If you are interested, please send me an email with your CV and transcripts. For your reference, here are the latest information of the PhD and Master application process of this year. Currently, we have 16 RTX 4090 GPUs (24GB), 16 NVIDIA L40 GPUs (48GB), and 4 A100 GPUs (40GB) available for students.

Selected Publications

（^* denotes corresponding author）

ImPart: Importance-Aware Delta-Sparsification for Improved Model Compression and Merging in LLMs
- Yan Yang, Yixia Li, Hongru Wang, Xuetao Wei, James Jianqiao Yu, Yun Chen, Guanhua Chen^*.
- In Proceedings of ACL 2025 (CCF A, long paper in main conference)
PlanGPT: Enhancing Urban Planning with Tailored Language Model and Efficient Retrieval
- He Zhu, Guanhua Chen^*，Wenjia Zhang*.
- In Proceedings of ACL 2025 (industry track) (CCF A, oral, long paper)
FANNO: Augmenting High-Quality Instruction Data with Open-Sourced LLMs Only
- He Zhu, Yifan Ding, Yicheng Tao, Zhiwen Ruan, Yixia Li, Wenjia Zhang, Yun Chen, Guanhua Chen^*.
- In Findings of ACL 2025 (long paper) [arxiv]
Tag-Instruct: Controlled Instruction Complexity Enhancement through Structure-based Augmentation
- He Zhu, Zhiwen Ruan, Junyou Su, Xingwei He, Yun Chen, Wenjia Zhang*, Guanhua Chen^*.
- In Findings of ACL 2025 (long paper) [arxiv]
MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning
- Hanqing Wang, Yixia Li, Shuo Wang, Guanhua Chen^*, Yun Chen^*.
- In Proceedings of NAACL 2025 (CCF B, long paper in main conference) [pdf] [code]
SeqAR: Jailbreak LLMs with Sequential Auto-Generated Characters
- Yan Yang, Zeguan Xiao, Xin Lu, Hongru Wang, Xuetao Wei, Hailiang Huang, Guanhua Chen^*, Yun Chen^*.
- In Proceedings of NAACL 2025 (CCF B, long paper in main conference) [pdf] [code]
Self-DC: When to Reason and When to Act? Self Divide-and-Conquer for Compositional Unknown Questions
- Hongru Wang, Boyang Xue, Baohang Zhou, Tianhua Zhang, Cunxiang Wang, Huimin Wang, Guanhua Chen^*, Kam-Fai Wong^*
- In Proceedings of NAACL 2025 (CCF B, long paper in main conference) [pdf]
LayAlign: Enhancing Multilingual Reasoning in LLMs via Layer-Wise Adaptive Fusion and Alignment Strategy
- Zhiwen Ruan, Yixia Li, He Zhu, Longyue Wang, Weihua Luo, Kaifu Zhang, Yun Chen, Guanhua Chen^*.
- In Findings of NAACL 2025 (long paper) [pdf] [code]
SeTAR: Out-of-Distribution Detection with Selective Low-Rank Approximation
- Yixia Li, Boya Xiong, Guanhua Chen^*, Yun Chen^*
- In Proceedings of NeurIPS 2024 (CCF A, long paper) [pdf] [code]
Distract Large Language Models for Automatic Jailbreak Attack
- Zeguan Xiao, Yan Yang, Guanhua Chen^*, Yun Chen^*
- In Proceedings of EMNLP 2024 (CCF B, long paper in main conference) [pdf]
PACIT: Unlocking the Power of Examples for Better In-Context Instruction Tuning
- Tianci Xue, Ziqi Wang, Yixia Li, Yun Chen, Guanhua Chen^*
- In Findings of ACL 2024 (long paper) [pdf] [code]
mCLIP: Multilingual CLIP via Cross-lingual Transfer
- Guanhua Chen, Lu Hou, Yun Chen, Wenliang Dai, Lifeng Shang, Xin Jiang, Qun Liu, Jia Pan, Wenping Wang
- In Proceedings of ACL 2023 (CCF A, long paper in main conference, oral) [pdf] [code]
XLM-D: Decorate Cross-lingual Pre-training Model as Non-Autoregressive Neural Machine Translation
- Yong Wang, Shilin He, Guanhua Chen^*, Yun Chen, Daxin Jiang^*.
- In Proceedings of EMNLP 2022 (CCF B, long paper in main conference) [pdf]
Multilingual Sentence Transformer as A Multilingual Word Aligner
- Weikang Wang^†, Guanhua Chen^†, Hanqing Wang, Yue Han, Yun Chen.
- In Findings of EMNLP 2022 (CCF B, short paper) [pdf] [code]
Towards Making the Most of Cross-Lingual Transfer for Zero-Shot Neural Machine Translation
- Guanhua Chen, Shuming Ma, Yun Chen, Dongdong Zhang, Jia Pan, Wenping Wang, Furu Wei
- In Proceedings of ACL 2022 (CCF A, long paper in main conference) [pdf] [code]
Zero-shot Cross-lingual Transfer of Neural Machine Translation with Multilingual Pretrained Encoders
- Guanhua Chen, Shuming Ma, Yun Chen, Li Dong, Dongdong Zhang, Jia Pan, Wenping Wang, Furu Wei
- In Proceedings of EMNLP 2021 (CCF B, long paper in main conference, oral) [pdf] [code]
Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance
- Guanhua Chen, Yun Chen, Victor O.K. Li
- In Proceedings of AAAI 2021 (CCF A, long paper) [pdf] [code]
Lexical-Constraint-Aware Neural Machine Translation via Data Augmentation
- Guanhua Chen, Yun Chen, Yong Wang, Victor O.K. Li
- In Proceedings of IJCAI 2020 (CCF A, long paper) [pdf] [code]

Working Papers

Pi-SQL: Enhancing Text-to-SQL with Fine-Grained Guidance from Pivot Programming Languages
- Yongdong Chi, Hanqing Wang, Yun Chen, Yan Yang, Jian Yang, Zonghan Yang, Xiao Yan, Guanhua Chen^*.
ADAMIX: Adaptive Mixed-Precision Delta-Compression with Quantization Error Optimization for LLMs
- Boya Xiong, Shuo Wang, Weifeng Ge, Guanhua Chen^*, Yun Chen.
Automatic Robustness Stress Testing of LLMs as Mathematical Problem Solvers
- Yutao Hou, Zeguan Xiao, Fei Yu, Yihan Jiang, Xuetao Wei, Hailiang Huang, Yun Chen*, Guanhua Chen^*.
Towards Bridging the Reward-Generation Gap in Direct Alignment Algorithms.
- Zeguan Xiao, Yun Chen, Guanhua Chen^*.
CompoundQA: A Benchmark for Evaluating LLMs on Compound Questions
- Yutao Hou, Yajing Luo, Zhiwen Ruan, Hongru Wang, Weifeng Ge, Yun Chen, Guanhua Chen^*. [arxiv]

Teaching

Undergraduate course STA323: Big Data Analysis Software and Application (Hadoop or Spark)
Graduate course STA5007: Advanced Natural Language Processing

Group Members

PhD candidates: Zhiwen Ruan(2024), Yixia Li(2023)
Master students: Qi Wang(2025), Youxin Zhu(2025), Peng Lai(2024), Xiaodong Lao(2024), Jianjie Zheng(2024), Yongjie Wang(2023)

(Last updated on May 22, 2025)

Guanhua CHEN

Biography

Selected Publications

Working Papers

Teaching

Group Members