H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

Authors: Zhenyu Zhang, Ying Sheng, Tianyi Zhou

Year: 2023

Venue: NeurIPS

Type: inproceedings

URL: https://arxiv.org/abs/2306.14048

arXiv: 2306.14048

Cite as: [@zhang2024h2o]

Raw Files

No raw files yet. Run node scripts/fetch-bibliography-raw.mjs --only zhang2024h2o to populate, or drop files into raw/bibliography/zhang2024h2o/.

BibTeX

@inproceedings{zhang2024h2o,
  title = {H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models},
  author = {Zhenyu Zhang and Ying Sheng and Tianyi Zhou},
  year = {2023},
  booktitle = {NeurIPS},
  url = {https://arxiv.org/abs/2306.14048}
}

Notes

No notes yet. Create notes/zhang2024h2o.md to add notes.