Hello! I’m a PhD student in the NLP group at King’s College London, working with Prof. Yulan He and Dr. Lin Gui. You can reach me at xyz1998seu@gmail.com.
Professional Summary
My current research centers on agentic coding in machine learning and artificial intelligence. I explore methods for reliable code reproduction from academic papers, effective memory management, and tool utilization, as well as the creation of synthetic trajectories for supervised fine-tuning and reinforcement learning, with the aim of improving large language model performance on complex coding tasks.
Experience
- Meta — Contractor (Nov. 2025 – Dec. 2025). Hired by Dr. Yoram Bachrach to develop AI Scientist agents: LLM systems that automate research tasks and machine learning engineering workflows.
- AstraZeneca — Internship (Jul. 2025 – Oct. 2025). Collaborated with Dr. Saseendran and Dr. Jin on generating multiple tokens per decoding step in Diffusion Language Models to accelerate decoding.
Education
- King's College London, London, United Kingdom, Sep. 2023 – Jun. 2027 (expected). Ph.D. in Computer Science. Supervised by Prof. Yulan He with Dr. Lin Gui as co-advisor. Focus areas: large language models, in-context learning, interpretability.
- Southeast University, Nanjing, China, Sep. 2020 – Jun. 2023. M.S. in Software Engineering. Average score: 86.42/100. Supervised by Prof. Deyu Zhou with research on question answering and code generation.
- Hefei University of Technology, Hefei, China, Sep. 2016 – Jun. 2020. B.E. in Computer Science and Technology. Average score: 90.10/100. Coursework included linear algebra, advanced mathematics, probability theory, data structures, Java programming, software engineering, and internet protocols.
Awards and Honors
- NMES International Studentship (2023 – 2027)
- Outstanding Graduate Student, Hefei University of Technology (2020)
- Merit Student, Hefei University of Technology (2018)
Invited Talks
- Meta, LLaMA Community Meet-up (Apr. 6, 2025): “Towards Automatic Code Reproduction for Scientific Papers: Benchmarks and Methodologies.” [event post]
Competitions
- National First Prize (Top 0.65%), China Undergraduate Mathematical Contest in Modelling (2018). Team-based modeling competition solving open-ended applied problems.
- 1st Place, Spider Leaderboard (2022). Our model G3R achieved the top rank on the “exact set match without values” metric and is currently 5th overall.
Publications
-   SciReplicate-Bench: Benchmarking LLMs in Agent-driven Algorithmic Reproduction from Research Papers  [arXiv] 
 Yanzheng Xiang, Hanqi Yan, Shuyin Ouyang, Lin Gui, Yulan He
 In COLM 2025.
-   G3R: A Graph-Guided Generate-and-Rerank Framework for Cross-domain Text-to-SQL Generation  [paper] 
 Yanzheng Xiang, Qian-Wen Zhang, Xu Zhang, Zejie Liu, Yunbo Cao, Deyu Zhou
 In Findings of ACL 2023.
-   Addressing Order Sensitivity of In-Context Demonstration Examples in Causal Language Models  [arXiv] 
 Yanzheng Xiang, Hanqi Yan, Lin Gui, Yulan He
 In Findings of ACL 2024.
-   The Mystery of In-Context Learning: A Comprehensive Survey on Interpretation and Analysis  [paper] 
 Yuxiang Zhou, Jiazheng Li, Yanzheng Xiang, Hanqi Yan, Lin Gui, Yulan He
 In EMNLP 2024.
-   A Divide-And-Conquer Approach for Multi-label Multi-hop Relation Detection in Knowledge Base QA  [paper] 
 Deyu Zhou, Yanzheng Xiang, Linhai Zhang, Chenchen Ye, Qian-Wen Zhang, Yunbo Cao
 In Findings of EMNLP 2021.
-   PECAN: LLM-Guided Dynamic Progress Control with Attention-Guided Hierarchical Weighted Graph for Long-Document QA  [paper] 
 Xinyu Wang, Yanzheng Xiang, Lin Gui, Yulan He
 In Findings of ACL 2025.
-   Encourage or Inhibit Monosemanticity? Revisit Monosemanticity from a Feature Decorrelation Perspective  [paper] 
 Hanqi Yan, Yanzheng Xiang, Guangyi Chen, Yifei Wang, Lin Gui, Yulan He
 In EMNLP 2024.
