CV

Education

Sorbonne Logo MSc in Mathematics - Sorbonne University, 2023
Master’s degree in machine learning with a thesis on automated theorem proving.
Grade: First Class Honours.

Sorbonne Logo BSc in Mathematics - Sorbonne University, 2020
Bachelor’s degree in mathematics.

Experience

Cambridge Logo Research internship - University of Cambridge, 2023
Research internship on automated theorem proving with the Isabelle proof assistant, using Monte Carlo tree search combined with the large language models LLaMA and GPT-2.
Supervisor: Dr. Wenda Li, Research Associate at the University of Cambridge.

OpenAI Logo Research internship - OpenAI, 2022
Research internship on automated theorem proving and auto-formalisation with the Lean proof assistant, using GPT-3.5 trained with expert iteration on synthetic data.
Supervisor: Stanislas Polu, Research Engineer at OpenAI.

Publications

Llemma: An Open Language Model For Mathematics
Zhangir Azerbayev, Hailey Schoelkopf, Keiran Paster, Marco Dos Santos, Stephen McAleer, Albert Q. Jiang, Jia Deng, Stella Biderman, Sean Welleck
ICLR 2024 (poster) and MATH-AI Workshop at NeurIPS 2023 (poster)
📄 Paper | 🤗 Model | 🤗 Dataset | Code

OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text
Keiran Paster, Marco Dos Santos, Zhangir Azerbayev, Jimmy Ba
ICLR 2024 (poster) and MATH-AI Workshop at NeurIPS 2023 (oral & poster)
📄 Paper | 🤗 Dataset | Code