Authors: Shangmin Guo, Biao Zhang, Tianlin Liu, Tianqi Liu, Misha Khalman, Felipe Llinares, Alexandre Rame, Thomas Mesnard, Yao Zhao, Bilal Piot, Johan Ferret, Mathieu Blondel
Year: 2024
Venue: arXiv preprint arXiv:2402.04792
Type: article
URL: https://arxiv.org/abs/2402.04792
arXiv: 2402.04792
Cite as: [@guo2024direct]
No raw files yet. Run node scripts/fetch-bibliography-raw.mjs --only guo2024direct to populate, or drop files into raw/bibliography/guo2024direct/.
@article{guo2024direct,
title = {Direct Language Model Alignment from Online AI Feedback},
author = {Shangmin Guo and Biao Zhang and Tianlin Liu and Tianqi Liu and Misha Khalman and Felipe Llinares and Alexandre Rame and Thomas Mesnard and Yao Zhao and Bilal Piot and Johan Ferret and Mathieu Blondel},
year = {2024},
journal = {arXiv preprint arXiv:2402.04792},
url = {https://arxiv.org/abs/2402.04792}
}No notes yet. Create notes/guo2024direct.md to add notes.