1 of 17

AlphaGeometry:

A Step Toward Automated Math Reasoning

Hoang Huy Nguyen

ISyE Georgia Tech & former Student Researcher at Google DeepMind

2 of 17

Overview of my work

Stochastic Control

Markov chain mixing

[TAC, under review]

Optimal Transport

[JMLR 2024], [AAAI 2024]

AI reasoning

AI for Sciences

AlphaGeometry 2

[JMLR 2025]

[Nature Geoscience 2025]

Foundations

Applied Probability

AI & Deep Learning

Transfer Learning

Stochastic Networks

[SIGMETRICS 2025]

3 of 17

Mathematics and AI

First analog computer (300 BC)

Computation

Four color Theorem (1976)

Search/Enumeration

Kepler Conjecture (1998)

Verification

What would it take to achieve end-to-end automated reasoning?

4 of 17

International Mathematical Olympiad (1959 – Present)

Topics

  • Algebra
  • Combinatorics
  • Number Theory
  • Geometry

  • One of the most prestigious pre-collegiate math competitions
  • Elegant, yet difficult problems, require only high school knowledge
  • AI-IMO challenge: Build the first AI to get gold at IMO

What would it take to build an AI to solve IMO?

Topics

  • Algebra
  • Combinatorics
  • Number Theory
  • Geometry

5 of 17

Challenges

Hallucinations

Data scarcity

?

?

?

1+1=3

Answer hallucination

Citation hallucination

 

 

Lacks high quality, structured data!

Next token predictor

Reward outcome

Mechanism

6 of 17

Challenges

Hallucinations

Data scarcity

?

?

?

1+1=3

Answer hallucination

Citation hallucination

 

 

Lacks high quality, structured data!

Next token predictor

Reward outcome

Mechanism

A consistent solver is needed!

7 of 17

Idea #1: AlphaGeometry is a neuro-symbolic solver

7

8 of 17

Symbolic Engine

 

Not use:

  • Advanced Theorems
  • Coordinates
  • Transformations

9 of 17

Idea #1: AlphaGeometry is a neuro-symbolic solver

10 of 17

Idea #1: AlphaGeometry is a neuro-symbolic solver

Magic Construction

11 of 17

Idea #1: AlphaGeometry is a neuro-symbolic solver

How to train the language model to make constructions?

12 of 17

Idea #2: Synthetic Data Generation at Scale

Knowledge-building process similar to humans.

13 of 17

Overview of the data generation process

Solves data scarcity w/o human intervention!

(to find the minimal problem)

Masking

14 of 17

The Results

14

15 of 17

All-time IMO Geometry results (2000-2024)

Faster symbolic engine

Knowledge sharing

Better search algorithm

Autoformalization

AG 1

AG 2

16 of 17

Takeaway

Hallucinations

Data scarcity

?

?

?

1+1=3

Symbolic engines

Language models

Synthetic data generation

LLM for Math Research

17 of 17

Thank you for listening!

Stochastic Control

Markov chain mixing

[TAC, under review]

Optimal Transport

[JMLR 2024], [AAAI 2024]

AI reasoning

AI for Sciences

AlphaGeometry 2

[JMLR 2025]

[Nature Geoscience 2025]

Foundations

Applied Probability

AI & Deep Learning

Transfer Learning

Stochastic Networks

[SIGMETRICS 2025]