Skip to main content
← Back to Bibliography

GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints

Authors: Joshua Ainslie, James Lee-Thorp, Michiel de Jong

Year: 2023

Venue: EMNLP

Type: article

URL: https://arxiv.org/abs/2305.13245

arXiv: 2305.13245

Cite as: [@ainslie2023gqa]

Raw Files

No raw files yet. Run node scripts/fetch-bibliography-raw.mjs --only ainslie2023gqa to populate, or drop files into raw/bibliography/ainslie2023gqa/.

BibTeX

@inproceedings{ainslie2023gqa,
  title = {GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints},
  author = {Joshua Ainslie and James Lee-Thorp and Michiel de Jong},
  year = {2023},
  booktitle = {EMNLP},
  url = {https://arxiv.org/abs/2305.13245}
}

Notes

No notes yet. Create notes/ainslie2023gqa.md to add notes.