Winter 2026 Math AI Lab Projects
Project teams meet Mondays and Wednesdays from 4–5:30 pm in Denny Hall 303.
Lean Together meets Fridays 3:30–6 pm in CMU B-006.
The goal of projects is a publication or a significant open-source contribution, e.g. to
mathlib4.
Students are expected to commit at least 5 hours per week to their project outside of meeting times.
Autoformalization Projects
The goal of autoformalization projects is to use Lean-specific AI assistants such as Harmonic’s
Aristotle
to formalize large chunks of mathematics, and contribute to
mathlib4.
Geometric Measure Theory
- Project Leader: Ignacio Tejeda
- Formalization target: Theorem 4.2 in Falconer’s Geometry of Fractal Sets.
- Code: GitHub repo
-
Mathlib PRs:
1,
2
- Members: Theo Meek, Nathan Pao, Annie Cao, Josh
- Prerequisites: Lean
Commutative Algebra
- Project Leaders: Haoming Ning and Leo Mayer
-
Formalization target:
The theorem that a regular local ring is a UFD, referred by some as the Auslander-Buchsbaum theorem.
See Stacks Project 0AG0.
- Members: Nailin Guan, Dora Kassabova
- Prerequisites: Lean
Algebraic Geometry
- Project Leaders: Bianca Viray and Bryan Boehnke
-
Formalization target:
Monogenic extensions of regular local rings following
arXiv:2503.07846, lemmas 3.1 and 3.2.
- Members: George Peykanu, Grant Yang
- Prerequisites: Lean
Category Theory
- Project Leader: Nelson Niu
- Formalization target: Nelson Niu & David Spivak’s Polynomial Functors textbook.
- Members: Sukhman Singh
- Prerequisites: Lean
Formalization: zero-knowledge proofs
- Project Leaders: Eric Klavins and Alexandra Aiello
-
Description:
We would outline a framework for verifying mathematical theorems while keeping the respective proofs secret by
leveraging dependent combinatory logic as a host language for Zero-Knowledge (ZK) proof circuits. Specifically,
we would interpret the axioms of the dependent SK combinator calculus as a universal ZK circuit capable of
checking mathematical proofs encodeable in the calculus. We would target ZK-STARKs as our ideal ZK scheme,
enabling quantum resistance of the proofs without trusted setup [Ben+18]. This would be a first-of-its kind
result, with wide applications. Goal: publication
- Prerequisites: Lean, type theory
Metaprogramming: Provable Computation in Lean
- Project Leader: Dhruv Bhatia
-
Description:
While Lean has seen extensive use as a theorem-proving assistant, its capabilities as a computational programming
language have been underutilized. The goal of this project is to begin filling that gap. Along the way, we will
learn the basics of functional programming, monads, and Lean’s metaprogramming framework to implement algorithms
that can both be run efficiently and be reasoned about. Our main goal is to implement basic algorithms with
applications to linear algebra while also proving (in Lean) correctness of said algorithms.
- Members: Joseph Qian, Junye Ji, Veer Shukla, Alan Chang
- Code: GitHub repo
AI Projects
The goal of AI projects is to use AI and ML to advance mathematical research. For example, by helping
mathematicians find relevant theorems, or by training models to learn mathematical functions or by creating
math-specific datasets or models. AI projects are expected to result in a publication in a submission to a major
ML conference such as ICML, ICLR, EMNLP or Neurips.
Lean error correction with LLMs
- Project Leader: Vasily Ilin
-
Description:
Fine-tune LLMs to correct Lean errors, using compiler feedback. Create a diverse dataset of errors and train on
it. Submit to ICML 2026 in January.
- Members: Evan Wang, Simon Chess, Daniel Lee, Siyuan Ge, Ajit Mallavarapu
- Code: GitHub repo
Mathematician's copilot: Semantic Theorem Search
- Project Leaders: Giovanni Inchiostro and Vasily Ilin
-
Description:
We help research mathematicians find relevant theorems quickly. We collected the dataset of all theorems on
ArXiv, Stacks Project and other sources, and built vectorized search over it. Goal: submit to ICML 2026 in
January.
- Demo: HF link
- Members: Eric Leonen, Sophie Szeto, Artemii Remizov, Luke Alexander
- Code: GitHub repo
Mathematician's copilot: Math2Vec
- Project Leader: Henry Kvinge (PNNL)
- Members: Saharsh Bhargava, Cecilia, Jiahe Lu, Michael, Kedar Chintalapati, Rachit Jaiswal, Samarth Rao, Jared Darlington, Leo Carlin
-
Description:
Train a text embedder that understands math, LaTeX and Lean. This will improve search over Lean, and natural
language theorem search. It can be used for RAG as well. Use arXiv, MathOverflow, mathlib, Lean Reservoir, and
possibly other sources. Create an evaluation benchmark as well. Goal: submit to EMNLP 2026 in May.
CayleyPy: search on massive combinatorial graphs
- Project Leaders: TBD
-
Description:
Optimize CayleyPy, make a CLI, and use it on various combinatorial problems, such as estimating diameters of symmetric groups. Goal: submission to ICLR 2027 in September. See project proposal document.
- Members: Danny Zhang, Gaurang Pendharkar, Merav Frank, Sambhu Ganesan, Xiaoxing Zhang
- Prerequisites: solid Python, some group theory or combinatorics.
Reinforcement Learning for Polynomials
- Project Leader: Michael Zeng
- Description: Use RL to find efficient arithmetic circuits for polynomials.
- Members: Kyle Zhang, Rohan Pandey, Naomi Morato
- Code: GitHub repo
AI for Quantum Code Compilation
- Project Leader: Andres Paz
-
Description:
Quantum error correction (QEC) codes are traditionally described using stabilizers, which define the subspace
preserved by the code. However, implementing these codes requires translating stabilizers into fault-tolerant
quantum circuits—an inherently nontrivial task that depends on the constraints and capabilities of the
underlying hardware architecture.
This project aims to develop an AI agent capable of synthesizing such circuits in a way that:
-
In the ideal (noiseless) setting, the resulting circuits implement the intended stabilizer structure of the
code on the target architecture.
-
In the noisy setting, the agent searches over circuit variations to optimize fidelity, taking into account
realistic noise models and architectural constraints.
The outcome would be a toolchain bridging the gap between abstract QEC code design and concrete, high-performance
circuit implementations, enabling better exploration of architecture-specific tradeoffs in fault-tolerant quantum
computing.
- Members: Christian Tarta, Sylvie Lausier, Mayee Sun, Sarju Patel
How good are LLMs at Lean?
- Project Leader: Tyson Klingner
-
Description:
Design an evaluation procedure for various Lean tasks such as next step generation or entire proof generation.
Run our algorithms on frontier models to see which LLMs perform the best, providing guidance to the Lean
community on which LLMs to use. Use a scalable architecture so that our algorithms can be rerun when new models
are released.
- Members: Escher Crawford, Drew Bladek
Deep learning for number theory
- Project Leader: Junaid Hasan
-
Description:
The goal is to explore to what extent modern machine learning algorithms (e.g., feedforward neural networks,
transformers, LLMs) can learn number-theoretic functions such as modular arithmetic, the Möbius function, or gcd.
We can first attempt to replicate results from the literature
(arXiv:2502.10335,
arXiv:2308.15594)
and then explore our own functions and algorithms. We aim to submit to one of the major ML conferences such as
ICML or ICLR.
- Prerequisites: Python (must have), ML experience (desired)
- Members: Hemkesh Bandi, Akhil Srinivasan, Andrew Chen, Nina Tharamal, Claire Xu
- Code: GitHub repo