Authors: Zhenyu Zhang, Ying Sheng, Tianyi Zhou
Year: 2023
Venue: NeurIPS
Type: article
URL: https://arxiv.org/abs/2306.14048
arXiv: 2306.14048
Cite as: [@zhang2024h2o]
No raw files yet. Run node scripts/fetch-bibliography-raw.mjs --only zhang2024h2o to populate, or drop files into raw/bibliography/zhang2024h2o/.
@inproceedings{zhang2024h2o,
title = {H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models},
author = {Zhenyu Zhang and Ying Sheng and Tianyi Zhou},
year = {2023},
booktitle = {NeurIPS},
url = {https://arxiv.org/abs/2306.14048}
}
No notes yet. Create notes/zhang2024h2o.md to add notes.