CV

Professional Summary

My current research focuses on AI for Scientific Discovery and Agentic Coding within machine learning. I investigate approaches for reliable code reproduction from academic papers, efficient memory management, and tool use, as well as the generation of synthetic trajectories to enable supervised fine-tuning and reinforcement learning aimed at enhancing LLMs’ capabilities in complex research-oriented coding tasks.

Education

King's College London, London, United Kingdom Sep. 2023 – Jun. 2027 (expected)
Ph.D. in Computer Science.
Supervised by Prof. Yulan He with Dr. Lin Gui as co-advisor. Research topics: large language models, in-context learning, interpretability.
Southeast University, Nanjing, China Sep. 2020 – Jun. 2023
M.S. in Software Engineering.
Average score: 86.42/100. Supervised by Prof. Deyu Zhou with research on question answering and code generation.
Hefei University of Technology, Hefei, China Sep. 2016 – Jun. 2020
B.E. in Computer Science and Technology.
Average score: 90.10/100. Coursework highlights: linear algebra, advanced mathematics, probability theory, data structures, Java programming, software engineering, internet protocols.

Awards and Honors

NMES International Studentship (2023 – 2027)
Outstanding Graduate Student, Hefei University of Technology (2020)
Merit Student, Hefei University of Technology (2018)

Work Experience

Meta — Contractor (Nov. 2025 – Dec. 2025)
Worked with Dr. Yoram Bachrach on AI Scientist agents, LLM-based systems that automate research workflows and machine learning engineering tasks.
AstraZeneca — Internship (Jul. 2025 – Oct. 2025)
Supervised by Dr. Saseendran and Dr. Jin. Investigated generating multiple tokens per decoding step in Diffusion Language Models to accelerate decoding.

Invited Talks

Meta, LLaMA Community Meet-up (Apr. 6, 2025): “Towards Automatic Code Reproduction for Scientific Papers: Benchmarks and Methodologies.”

Competitions

National First Prize (Top 0.65%), China Undergraduate Mathematical Contest in Modelling (2018).
1st Place, Spider Leaderboard (2022). Model G3R ranked first on the “exact set match without values” metric and is currently fifth overall.

Publications

SciReplicate-Bench: Benchmarking LLMs in Agent-driven Algorithmic Reproduction from Research Papers [arXiv]
Yanzheng Xiang, Hanqi Yan, Shuyin Ouyang, Lin Gui, Yulan He
In COLM 2025.
G3R: A Graph-Guided Generate-and-Rerank Framework for Cross-domain Text-to-SQL Generation [paper]
Yanzheng Xiang, Qian-Wen Zhang, Xu Zhang, Zejie Liu, Yunbo Cao, Deyu Zhou
In Findings of ACL 2023.
Addressing Order Sensitivity of In-Context Demonstration Examples in Causal Language Models [arXiv]
Yanzheng Xiang, Hanqi Yan, Lin Gui, Yulan He
In Findings of ACL 2024.
The Mystery of In-Context Learning: A Comprehensive Survey on Interpretation and Analysis [paper]
Yuxiang Zhou, Jiazheng Li, Yanzheng Xiang, Hanqi Yan, Lin Gui, Yulan He
In EMNLP 2024.
A Divide-And-Conquer Approach for Multi-label Multi-hop Relation Detection in Knowledge Base QA [paper]
Deyu Zhou, Yanzheng Xiang, Linhai Zhang, Chenchen Ye, Qian-Wen Zhang, Yunbo Cao
In Findings of EMNLP 2021.
PECAN: LLM-Guided Dynamic Progress Control with Attention-Guided Hierarchical Weighted Graph for Long-Document QA [paper]
Xinyu Wang, Yanzheng Xiang, Lin Gui, Yulan He
In Findings of ACL 2025.
Encourage or Inhibit Monosemanticity? Revisit Monosemanticity from a Feature Decorrelation Perspective [paper]
Hanqi Yan, Yanzheng Xiang, Guangyi Chen, Yifei Wang, Lin Gui, Yulan He
In EMNLP 2024.

Talks

Towards Automatic Code Reproduction for Scientific Papers: Benchmarks and Methodologies

Invited Talk, Meta, LLaMA Community Meet-up, London, United Kingdom, April 06, 2025

I presented our latest work on SciReplicate-Bench and shared methodologies for building agentic LLM systems that can reliably reproduce code from scientific publications. The talk covered benchmarking strategies, memory management, and tooling considerations for research automation.

Teaching

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 1

Undergraduate course, University 1, Department, 2014