internlm/Lean-Workbook
Viewer • Updated • 25.2k • 652 • 56
This repository contains the policy model used in the paper "Rethinking Supervision Granularity: Segment-Level Learning for LLM-Based Theorem Proving".
The model is fully fine-tuned from Qwen2.5-Math-7B using segment-level supervision constructed from Lean 4 proof trajectories.
The model is trained using proof trajectories derived from STP, LeanWorkbook, and NuminaMath-LEAN.
Please refer to the GitHub repository for preprocessing, inference, and evaluation details:
https://github.com/NJUDeepEngine/SEG-ATP
@article{xu2026rethinking,
title={Rethinking Supervision Granularity: Segment-Level Learning for LLM-Based Theorem Proving},
author={Xu, Shuo and Zhang, Jiakun and Lai, Junyu and Cao, Chun and Xu, Jingwei},
journal={arXiv preprint},
year={2026}
}