Skip to main content
← Back to Bibliography

Splitwise: Efficient Generative LLM Inference Using Phase Splitting

Authors: Pratyush Patel, Esha Choukse, Chaojie Zhang, Aashaka Shah, Iñigo Goiri, Saeed Maleki, Ricardo Bianchini

Year: 2024

Venue: ISCA

Type: article

URL: https://arxiv.org/abs/2311.18677

arXiv: 2311.18677

Cite as: [@patel2024splitwise]

Raw Files

No raw files yet. Run node scripts/fetch-bibliography-raw.mjs --only patel2024splitwise to populate, or drop files into raw/bibliography/patel2024splitwise/.

BibTeX

@article{patel2024splitwise,
  title = {Splitwise: Efficient Generative LLM Inference Using Phase Splitting},
  author = {Pratyush Patel and Esha Choukse and Chaojie Zhang and Aashaka Shah and Iñigo Goiri and Saeed Maleki and Ricardo Bianchini},
  year = {2024},
  journal = {ISCA},
  url = {https://arxiv.org/abs/2311.18677}
}

Notes

No notes yet. Create notes/patel2024splitwise.md to add notes.