Direct Language Model Alignment from Online AI Feedback

Authors: Shangmin Guo, Biao Zhang, Tianlin Liu, Tianqi Liu, Misha Khalman, Felipe Llinares, Alexandre Rame, Thomas Mesnard, Yao Zhao, Bilal Piot, Johan Ferret, Mathieu Blondel

Year: 2024

Venue: arXiv preprint arXiv:2402.04792

Type: article

URL: https://arxiv.org/abs/2402.04792

arXiv: 2402.04792

Cite as: [@guo2024direct]

Raw Files

No raw files yet. Run node scripts/fetch-bibliography-raw.mjs --only guo2024direct to populate, or drop files into raw/bibliography/guo2024direct/.

BibTeX

@article{guo2024direct,
  title = {Direct Language Model Alignment from Online AI Feedback},
  author = {Shangmin Guo and Biao Zhang and Tianlin Liu and Tianqi Liu and Misha Khalman and Felipe Llinares and Alexandre Rame and Thomas Mesnard and Yao Zhao and Bilal Piot and Johan Ferret and Mathieu Blondel},
  year = {2024},
  journal = {arXiv preprint arXiv:2402.04792},
  url = {https://arxiv.org/abs/2402.04792}
}

Notes

No notes yet. Create notes/guo2024direct.md to add notes.