Talks and presentations

Towards Automatic Code Reproduction for Scientific Papers: Benchmarks and Methodologies

April 06, 2025

Invited Talk, Meta, LLaMA Community Meet-up, London, United Kingdom

I presented our latest work on SciReplicate-Bench and shared methodologies for building agentic LLM systems that can reliably reproduce code from scientific publications. The talk covered benchmarking strategies, memory management, and tooling considerations for research automation.

Talk 1 on Relevant Topic in Your Field

March 01, 2012

Talk, UC San Francisco, Department of Testing, San Francisco, California

This is a description of your talk, which is a markdown files that can be all markdown-ified like any other post. Yay markdown!