The International Mathematical Olympiad (IMO) is perhaps the most celebrated mental competition in the world and as such is among the ultimate grand challenges for Artificial Intelligence (AI). We have constructed a consistent formal plane geometry system called FormalGeo. This will serve as a crucial bridge between IMO-level plane geometry challenges and readable AI automated reasoning.

With FormalGeo in place, we have been able to seamlessly integrate modern AI models with our formal system. Within this formal framework, AI is now capable of providing deductive reasoning solutions to IMO-level plane geometry problems, just like handling other natural languages, and these proofs are readable, traceable, and verifiable.

Regarding the research achievements in geometric formalization, we have divided them into five parts: geometry formalization theory, geometry formal system, formal geometric problem solver, formal geometric problem dataset, and AI-assisted geometric problem solving.

Regarding the geometry formalization theory, you can read the original paper:

You can learn more about solvers through these projects:

Read the chapter Datasets to review the published formal systems and datasets. New formal system and datasets will be published in this project:

AI-assisted formal geometric problem solving:

FormalGeo adopts open source as its growth strategy. We welcome anyone to contribute to FormalGeo, even if you are new to open source. FormalGeo is currently maintained by the FormalGeo Development Team and supported by Geometric Cognitive Reasoning Group .


News



Publications


FGeo-HyperGNet: Geometry Problem Solving Integrating Formal Symbolic System and Hypergraph Neural Network
Xiaokai Zhang, Na Zhu, Cheng Qin, ..., and Tuo Leng
We built a neural-symbolic system to automatically perform human-like geometric deductive reasoning. The symbolic part is a formal system built on FormalGeo and the neural part is a hypergraph neural network based on the attention mechanism.
arxiv: 2402.11461  [Paper]  [Project]  [BibTex]

FGeo-DRL:Deductive Reasoning for Geometric Problems through Deep Reinforcement Learning
Jia Zou, Xiaokai Zhang, Yiming He, ..., and Tuo Leng
We built a neural-symbolic system, called FGeoDRL, to automatically perform human-like geometric deductive reasoning. The neural part is an AI agent based on reinforcement learning and the symbolic part is a reinforcement learning environment based on geometry formalization theory and FormalGeo.
Symmetry 2024  [Paper]  [Project]  [BibTex]

FGeo-TP: A Language Model-Enhanced Solver for Geometry Problems
Yiming He, Jia Zou, Xiaokai Zhang, ..., and Tuo Leng
we introduced FGeo-TP (Theorem Predictor), which utilizes the language model to predict theorem sequences for solving geometry problems. Our results demonstrate a significant increase in the problem-solving success rate of the language model-enhanced FGeo-TP on the FormalGeo7k dataset, rising from 39.7% to 80.86%.
Symmetry 2024  [Paper]  [Project]  [BibTex]

FGeo-SSS: A Search-Based Symbolic Solver for Human-Like Automated Geometric Reasoning
Xiaokai Zhang, Na Zhu, Yiming He, ..., and Tuo Leng
Based on the geometry formalization theory and the FormalGeo geometric formal system, we have developed the Formal Geometric Problem Solver (FGPS) in Python, which can serve as an interactive assistant for verifying problem-solving processes and as an automated problem solver that utilizes a variable search-based method and strategy.
Symmetry 2024  [Paper]  [Project]  [BibTex]

FormalGeo: An Extensible Formalized Framework for Olympiad Geometric Problem Solving
Xiaokai Zhang, Na Zhu, Yiming He, ..., and Tuo Leng
We have constructed a consistent formal plane geometry system. This will serve as a crucial bridge between IMO-level plane geometry challenges and readable AI automated reasoning. With this formal system in place, we have been able to seamlessly integrate modern AI models with our formal system.
arxiv:2310.18021  [Paper]  [Project]  [BibTex]


Datasets


formalgeo7k
formalgeo7k
Xiaokai Zhang, Na Zhu, Yiming He, Jia Zou, Qike Huang, Xiaoxiao Jin, Yanjun Guo, Chenyang Mao, Zhe Zhu, Dengfeng Yue, Fangzhen Zhu, Yang Li, Yifan Wang, Yifan Wang, Runan Wang and Cheng Qin
We’ve annotated the formalgeo7k dataset, containing 6,981 (expand to 133,818 through data augmentation) SAT-level geometry problems. Each problem comprises a complete natural language description, geometric shapes, formal language annotations, and theorem sequences annotations.

formalgeo-imo
Xiaokai Zhang, Yiming He, Na Zhu and Jia Zou
We have attempted to annotate 18 (expand to 2,627) IMO-level geometric problems, sourced from carefully selected International Olympiads, Chinese Olympiads, and others. During the annotation process, we discovered that IMO-level problems require more efficient construction algorithms and a more comprehensive formalization system. A larger and more comprehensive dataset will be published after these problems are solved.


Members

Tuo Leng

Tuo Leng1,2,3

Xiaokai Zhang

Xiaokai Zhang1,3

Na Zhu

Na Zhu1,2,3

Yiming He

Yiming He1,2,3


Jia Zou

Jia Zou1,2,3

Cheng Qin

Cheng Qin1,2,3

Yang Li

Yang Li1,3

Qike Huang

Qike Huang1,2,3

Zhengyu Hu

Zhengyu Hu1,3

1 School of Computer Engineering and Science, Shanghai University, Shanghai, China
2 Institute of Artificial Intelligence, Shanghai University, Shanghai, China
3 Geometric Cognitive Reasoning Group, Shanghai University, Shanghai, China


Former Members



Contact


Questions about FormalGeo, or want to get in touch? Email formalgeo@gmail.com, open up an issue on GitHub, or contact team leader Xiaokai Zhang.