Authors: Huajian Xin, Z.Z. Ren, Junxiao Song, Zhihong Shao, Wanjia Zhao, Haocheng Wang, Bo Liu, Liyue Zhang, Xuan Lu, Qiushi Du, Wenjun Gao, Qihao Zhu, Dejian Yang, Zhibin Gou, Z.F. Wu, Fuli Luo, Chong Ruan
Year: 2025
Venue: ICLR
Type: inproceedings
URL: https://arxiv.org/abs/2408.08152
arXiv: 2408.08152
Cite as: [@xin2024deepseekproverv15]
No raw files yet. Run node scripts/fetch-bibliography-raw.mjs --only xin2024deepseekproverv15 to populate, or drop files into raw/bibliography/xin2024deepseekproverv15/.
@inproceedings{xin2024deepseekproverv15,
title = {DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search},
author = {Huajian Xin and Z.Z. Ren and Junxiao Song and Zhihong Shao and Wanjia Zhao and Haocheng Wang and Bo Liu and Liyue Zhang and Xuan Lu and Qiushi Du and Wenjun Gao and Qihao Zhu and Dejian Yang and Zhibin Gou and Z.F. Wu and Fuli Luo and Chong Ruan},
year = {2025},
booktitle = {ICLR},
url = {https://arxiv.org/abs/2408.08152}
}
No notes yet. Create notes/xin2024deepseekproverv15.md to add notes.