DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Authors: Huajian Xin, Z.Z. Ren, Junxiao Song, Zhihong Shao, Wanjia Zhao, Haocheng Wang, Bo Liu, Liyue Zhang, Xuan Lu, Qiushi Du, Wenjun Gao, Qihao Zhu, Dejian Yang, Zhibin Gou, Z.F. Wu, Fuli Luo, Chong Ruan

Year: 2025

Venue: ICLR

Type: inproceedings

URL: https://arxiv.org/abs/2408.08152

arXiv: 2408.08152

Cite as: [@xin2024deepseekproverv15]

Raw Files

No raw files yet. Run node scripts/fetch-bibliography-raw.mjs --only xin2024deepseekproverv15 to populate, or drop files into raw/bibliography/xin2024deepseekproverv15/.

BibTeX

@inproceedings{xin2024deepseekproverv15,
  title = {DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search},
  author = {Huajian Xin and Z.Z. Ren and Junxiao Song and Zhihong Shao and Wanjia Zhao and Haocheng Wang and Bo Liu and Liyue Zhang and Xuan Lu and Qiushi Du and Wenjun Gao and Qihao Zhu and Dejian Yang and Zhibin Gou and Z.F. Wu and Fuli Luo and Chong Ruan},
  year = {2025},
  booktitle = {ICLR},
  url = {https://arxiv.org/abs/2408.08152}
}

Notes

No notes yet. Create notes/xin2024deepseekproverv15.md to add notes.