Authors: Pratyush Patel, Esha Choukse, Chaojie Zhang, Aashaka Shah, Iñigo Goiri, Saeed Maleki, Ricardo Bianchini
Year: 2024
Venue: ISCA
Type: article
URL: https://arxiv.org/abs/2311.18677
arXiv: 2311.18677
Cite as: [@patel2024splitwise]
No raw files yet. Run node scripts/fetch-bibliography-raw.mjs --only patel2024splitwise to populate, or drop files into raw/bibliography/patel2024splitwise/.
@article{patel2024splitwise,
title = {Splitwise: Efficient Generative LLM Inference Using Phase Splitting},
author = {Pratyush Patel and Esha Choukse and Chaojie Zhang and Aashaka Shah and Iñigo Goiri and Saeed Maleki and Ricardo Bianchini},
year = {2024},
journal = {ISCA},
url = {https://arxiv.org/abs/2311.18677}
}No notes yet. Create notes/patel2024splitwise.md to add notes.