The International Mathematical Olympiad (IMO) is perhaps the most celebrated mental competition in the world and as such is among the ultimate grand challenges for Artificial Intelligence (AI). We have constructed a consistent formal plane geometry system called FormalGeo. This will serve as a crucial bridge between IMO-level plane geometry challenges and readable AI automated reasoning.

With FormalGeo in place, we have been able to seamlessly integrate modern AI models with our formal system. Within this formal framework, AI is now capable of providing deductive reasoning solutions to IMO-level plane geometry problems, just like handling other natural languages, and these proofs are readable, traceable, and verifiable.

Regarding the research achievements in geometric formalization, we have divided them into five parts: geometry formalization theory, geometry formal system, formal geometric problem solver, formal geometric problem dataset, and AI-assisted geometric problem solving.

Regarding the geometry formalization theory, you can read the original paper:

You can learn more about solvers through these projects:

Read the chapter Datasets to review the published formal systems and datasets. New formal system and datasets will also be published in this project:

AI-assisted formal geometric problem solving:

FormalGeo adopts open source as its growth strategy. We welcome anyone to contribute to FormalGeo, even if you are new to open source. FormalGeo is currently maintained by the FormalGeo Development Team and supported by Geometric Cognitive Reasoning Group .


News



Publications


FGeo-Parser: Autoformalization and Solution of Plane Geometric Problems
Na Zhu, Xiaokai Zhang, Qike Huang, Fangzhen Zhu, Zhenbing Zeng, Tuo Leng
In this paper, we propose an enhanced geometric problem parsing method called FGeo-Parser, which converts problem diagrams and text into the formal language of the FormalGeo. Specifically, diagram parser leverages the BLIP to generate the construction CDL and image CDL, while text parser employs the T5 to produce the text CDL and goal CDL where these neural networks are both based on a symmetric encoder–decoder architecture. It also supports reverse formalization to generate human-like solutions, reflecting the symmetry between parsing and generating.
Symmetry 2025  [Paper]  [Project]  [BibTex]

Formal Representation and Solution of Plane Geometric Problems
Xiaokai Zhang, Na Zhu, Cheng Qin, Yang Li, Zhenbing Zeng, Tuo Leng
In this paper, we present formalgeo7k, a geometric problem dataset based on rigorous geometry formalization theory and consistent geometry formal system, serving as a benchmark for various tasks such as geometric diagram parsing and geometric problem solving.
NeurIPS 2024 Workshop on MATH-AI  [Paper]  [Project]  [BibTex]

FGeo-HyperGNet: Geometry Problem Solving Integrating Formal Symbolic System and Hypergraph Neural Network
Xiaokai Zhang Na Zhu, Cheng Qin, Yang Li, Zhenbing Zeng, Tuo Leng
We built a neural-symbolic system to automatically perform human-like geometric deductive reasoning. The symbolic part is a formal system built on FormalGeo and the neural part is a hypergraph neural network based on the attention mechanism.
arxiv: 2402.11461  [Paper]  [Project]  [BibTex]

FGeo-DRL:Deductive Reasoning for Geometric Problems through Deep Reinforcement Learning
Jia Zou, Xiaokai Zhang, Yiming He, Na Zhu, and Tuo Leng
We built a neural-symbolic system, called FGeoDRL, to automatically perform human-like geometric deductive reasoning. The neural part is an AI agent based on reinforcement learning and the symbolic part is a reinforcement learning environment based on geometry formalization theory and FormalGeo.
Symmetry 2024  [Paper]  [Project]  [BibTex]

FGeo-TP: A Language Model-Enhanced Solver for Geometry Problems
Yiming He, Jia Zou, Xiaokai Zhang, Na Zhu, Tuo Leng
we introduced FGeo-TP (Theorem Predictor), which utilizes the language model to predict theorem sequences for solving geometry problems. Our results demonstrate a significant increase in the problem-solving success rate of the language model-enhanced FGeo-TP on the FormalGeo7k dataset, rising from 39.7% to 80.86%.
Symmetry 2024  [Paper]  [Project]  [BibTex]

FGeo-SSS: A Search-Based Symbolic Solver for Human-Like Automated Geometric Reasoning
Xiaokai Zhang, Na Zhu, Yiming He, Jia Zou, Cheng Qin, Yang Li, Tuo Leng
Based on the geometry formalization theory and the FormalGeo geometric formal system, we have developed the Formal Geometric Problem Solver (FGPS) in Python, which can serve as an interactive assistant for verifying problem-solving processes and as an automated problem solver that utilizes a variable search-based method and strategy.
Symmetry 2024  [Paper]  [Project]  [BibTex]

FormalGeo: An Extensible Formalized Framework for Olympiad Geometric Problem Solving
Xiaokai Zhang, Na Zhu, Yiming He, Jia Zou, Qike Huang, Xiaoxiao Jin, Yanjun Guo, Chenyang Mao, Zhe Zhu, Dengfeng Yue, Fangzhen Zhu, Yang Li, Yifan Wang, Yiwen Huang, Runan Wang, Cheng Qin, Zhenbing Zeng, Shaorong Xie, Xiangfeng Luo, Tuo Leng
We have constructed a consistent formal plane geometry system. This will serve as a crucial bridge between IMO-level plane geometry challenges and readable AI automated reasoning. With this formal system in place, we have been able to seamlessly integrate modern AI models with our formal system.
arxiv:2310.18021  [Paper]  [Project]  [BibTex]


Datasets


formalgeo7k
FormalGeo7K
Xiaokai Zhang, Na Zhu, Yiming He, Jia Zou, Qike Huang, Xiaoxiao Jin, Yanjun Guo, Chenyang Mao, Zhe Zhu, Dengfeng Yue, Fangzhen Zhu, Yang Li, Yifan Wang, Yifan Wang, Runan Wang and Cheng Qin
We’ve annotated the FormalGeo7K dataset, a geometric problem dataset containing 7,000 SAT-level geometric problems, serving as a benchmark for various tasks such as geometric diagram parsing and geometric problem solving. All problems are annotated with problem text, problem diagram, formal descriptions, and solution.

FormalGeo-IMO
Xiaokai Zhang, Yiming He, Na Zhu and Jia Zou
We have attempted to annotate 18 (expand to 2,627) IMO-level geometric problems, sourced from carefully selected International Olympiads, Chinese Olympiads, and others. During the annotation process, we discovered that IMO-level problems require more efficient construction algorithms and a more comprehensive formalization system. A larger and more comprehensive dataset will be published after these problems are solved.


Members

Tuo Leng

Tuo Leng1,2,3

Xiaokai Zhang

Xiaokai Zhang1,3

Na Zhu

Na Zhu1,2,3

Cheng Qin

Cheng Qin1,2,3


Yang Li

Yang Li1,3

Qike Huang

Qike Huang1,2,3

Zhengyu Hu

Zhengyu Hu1,3

YuChang Yang

YuChang Yang1,3

1 School of Computer Engineering and Science, Shanghai University, Shanghai, China
2 Institute of Artificial Intelligence, Shanghai University, Shanghai, China
3 Geometric Cognitive Reasoning Group, Shanghai University, Shanghai, China


Former Members


Jia Zou

Jia Zou

Yiming He

Yiming He


Contact


Questions about FormalGeo, or want to get in touch? Email formalgeo@gmail.com, open up an issue on GitHub, or contact team leader Tuo Leng.