Biography
I am Guanhua Chen (陈冠华), a tenure-track assistant professor in the Department of Statistics and Data Science at Southern University of Science and Technology (SUSTech). I received my Ph.D. from the Department of Computer Science, The University of Hong Kong, in 2022, under the supervision of Prof. Jia Pan and Prof. Wenping Wang. I obtained my Bachelor's and Master's degrees from Tsinghua University in 2012 and 2014, respectively. I was a research intern at Microsoft Research Asia and Huawei Noah’s Ark Lab. Currently, I serve as an area chair for ACL, EMNLP, and CCL, and as a reviewer for top AI conferences and journals such as ICML, NeurIPS, ICLR, ACL, EMNLP, NAACL, and TASLP. I was named a Microsoft Research Asia StarTrack Scholar in 2025. My research interests are natural language processing (NLP) and large language models (LLMs), especially reasoning LLMs, multimodal LLMs, agentic RL, and LLMs for low-resource applications such as health and engineering.
Positions available: I am looking for self-motivated postdocs and PhD, Master's, and visiting students to join our lab. If you are interested, please send me an email with your CV. For your reference, here is the latest information on this year's PhD and Master's application process. Currently, we have 16 RTX 4090 GPUs (24GB), 16 NVIDIA L40 GPUs (48GB), and 4 A100 GPUs (40GB) available for students. Students also have ample API access to open-source and proprietary LLMs.
Selected Publications
(* denotes corresponding author)
InfoScan: Information-Efficient Visual Scanning via Resource-Adaptive Walks
- Yifeng Wu, S. Zhou, H. Huang, Y. Huang, H. Zheng, Y. Chen, Xian Wu, Ruize Han*, Guanhua Chen*
- In Proceedings of ICLR 2026 [openreview]
Anchored Supervised Fine-Tuning
- He Zhu, Junyou Su, Peng Lai, Ren Ma, Wenjia Zhang, Linyi Yang, Guanhua Chen*
- In Proceedings of ICLR 2026 [openreview]
BiasScope: Towards Automated Detection of Bias in LLM-as-a-Judge Evaluation
- Peng Lai, Zhihao Ou, Yong Wang, Longyue Wang, Jian Yang, Yun Chen, Guanhua Chen*
- In Proceedings of ICLR 2026 [openreview]
Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
- Yutao Hou, Yajing Luo, Zhiwen Ruan, Hongru Wang, Weifeng Ge, Yun Chen, Guanhua Chen*
- In Proceedings of ICASSP 2026 (CCF B) [arxiv]
Enhancing Uncertainty Estimation in LLMs with Expectation of Aggregated Internal Belief
- Zeguan Xiao, Diyang Dou, Boya Xiong, Yun Chen*, Guanhua Chen*
- In Proceedings of AAAI 2026 (CCF A, poster) [arxiv]
Beyond the Surface: Enhancing LLM-as-a-Judge Alignment with Human via Internal Representations
G2: Guided Generation for Enhanced Output Diversity in LLMs
Pi-SQL: Enhancing Text-to-SQL with Fine-Grained Guidance from Pivot Programming Languages
ImPart: Importance-Aware Delta-Sparsification for Improved Model Compression and Merging in LLMs
PlanGPT: Enhancing Urban Planning with Tailored Language Model and Efficient Retrieval
- He Zhu, Guanhua Chen*, Wenjia Zhang*
- In Proceedings of ACL 2025 (industry track) (CCF A, oral, long paper) [pdf]
Fanno: Augmenting High-Quality Instruction Data with Open-Sourced LLMs Only
Tag-Instruct: Controlled Instruction Complexity Enhancement through Structure-based Augmentation
MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning
SeqAR: Jailbreak LLMs with Sequential Auto-Generated Characters
Self-DC: When to Reason and When to Act? Self Divide-and-Conquer for Compositional Unknown Questions
- Hongru Wang, Boyang Xue, Baohang Zhou, Tianhua Zhang, Cunxiang Wang, Huimin Wang, Guanhua Chen*, Kam-Fai Wong*
- In Proceedings of NAACL 2025 (CCF B, long paper in main conference) [pdf]
LayAlign: Enhancing Multilingual Reasoning in LLMs via Layer-Wise Adaptive Fusion and Alignment Strategy
SeTAR: Out-of-Distribution Detection with Selective Low-Rank Approximation
Distract Large Language Models for Automatic Jailbreak Attack
- Zeguan Xiao, Yan Yang, Guanhua Chen*, Yun Chen*
- In Proceedings of EMNLP 2024 (CCF B, long paper in main conference) [pdf]
PACIT: Unlocking the Power of Examples for Better In-Context Instruction Tuning
mCLIP: Multilingual CLIP via Cross-lingual Transfer
XLM-D: Decorate Cross-lingual Pre-training Model as Non-Autoregressive Neural Machine Translation
- Yong Wang, Shilin He, Guanhua Chen*, Yun Chen, Daxin Jiang*
- In Proceedings of EMNLP 2022 (CCF B, long paper in main conference) [pdf]
Multilingual Sentence Transformer as A Multilingual Word Aligner
Towards Making the Most of Cross-Lingual Transfer for Zero-Shot Neural Machine Translation
Zero-shot Cross-lingual Transfer of Neural Machine Translation with Multilingual Pretrained Encoders
Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance
Lexical-Constraint-Aware Neural Machine Translation via Data Augmentation
Co-Authored Publications
Tree Search for LLM Agent Reinforcement Learning [ICLR’26]
VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models [ICLR’26]
Fair Decision Utility in Human-AI Collaboration: Interpretable Confidence Adjustment for Humans with Cognitive Disparities [ICLR’26]
From Abstract to Contextual: What LLMs Still Cannot Do in Mathematics [ICLR’26]
Synthesizing Multimodal Verifiable Game Data to Boost VLMs’ General Reasoning [ICLR’26]
ConInstruct: Evaluating Large Language Models on Conflict Detection and Resolution in Instructions [AAAI’26]
Alleviating Hallucinations in Large Language Models through Multi-Model Contrastive Decoding and Dynamic Hallucination Detection [NeurIPS’25]
PlanGPT-VL: Enhancing Urban Planning with Domain-Specific Vision-Language Models [EMNLP’25]
A Joint Learning of Force Feedback of Robotic Manipulation and Textual Cues for Granular Materials Classification [RAL’25]
LLMs Trust Humans More, That’s a Problem! Unveiling and Mitigating the Authority Bias in Retrieval-Augmented Generation [ACL’25]
The Elephant in the Room: Exploring the Role of Neutral Words in Language Model Group-Agnostic Debiasing [ACL’25 Findings]
Understanding Particles from Video: Property Estimation of Granular Materials via Visuo-Haptic Learning [RAL’25]
StyleBART: Decorate Pretrained Model with Style Adapters for Unsupervised Stylistic Headline Generation [EMNLP’23 Findings]
Evaluating Explanation Methods for Vision-and-Language Navigation [ECAI’23]
Accurate Word Alignment Induction from Neural Machine Translation [EMNLP’20]
Working Papers
(I am a corresponding author on all of these papers)
From Word to World: Can Large Language Models be Implicit Text-based World Models?
SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks
InstructDiff: Domain-Adaptive Data Selection via Differential Entropy for Efficient LLM Fine-Tuning
StatABench: Dataset and Framework for Evaluating Statistical Analysis Capabilities of LLMs
Beyond Static Rules: Automated Discovery of Latent Vulnerabilities in Text-to-SQL
BAM-MT: Batch-Adaptive Multi-Objective Reinforcement Learning for Medical Machine Translation
Representation-Guided Parameter-Efficient LLM Unlearning
VFA: Empowering Multilingual MLLMs via Vision-Free Adaptation
FinSafetyBench: Evaluating LLM Safety in Real-World Financial Scenarios
GIFT: Guided Fine-Tuning and Transfer for Enhancing Instruction-Tuned Language Models
AlignDiff: Exploiting Model-Intrinsic Information for Better Preference Data Selection
Enhancing Large Language Model Reasoning via Selective Critical Token Fine-Tuning
Evaluating Memory Capability in Continuous Lifelog Scenario
Unveiling Over-Memorization in Finetuning LLMs for Reasoning Tasks
Enhancing Delta Compression in LLMs via SVD-based Quantization Error Minimization
Automatic Robustness Stress Testing of LLMs as Mathematical Problem Solvers
Towards Bridging the Reward-Generation Gap in Direct Alignment Algorithms
Teaching
- Undergraduate course STA323: Big Data Analysis Software and Application (Hadoop or Spark)
- Graduate course STA5007: Advanced Natural Language Processing
Group Members
- PhD candidates: Peng Lai (2026), Yunpeng Sun (SLAI, 2025), Zhiwen Ruan (2024), Yixia Li (2023)
- Master students: Xuanzhe Xu (2026), Qianyu Yang (2026), Qi Wang (2025), Youxin Zhu (2025), Xiaodong Lao (2024), Jianjie Zheng (2024)
(Last updated on Jan. 27, 2026)