Skip to main content
← Back to Bibliography

Fast Inference from Transformers via Speculative Decoding

Authors: Yaniv Leviathan, Matan Kalman, Yossi Matias

Year: 2023

Venue: ICML

Type: article

URL: https://arxiv.org/abs/2211.17192

arXiv: 2211.17192

Cite as: [@leviathan2023fast]

Raw Files

No raw files yet. Run node scripts/fetch-bibliography-raw.mjs --only leviathan2023fast to populate, or drop files into raw/bibliography/leviathan2023fast/.

BibTeX

@inproceedings{leviathan2023fast,
  title = {Fast Inference from Transformers via Speculative Decoding},
  author = {Yaniv Leviathan and Matan Kalman and Yossi Matias},
  year = {2023},
  booktitle = {ICML},
  url = {https://arxiv.org/abs/2211.17192}
}

Notes

No notes yet. Create notes/leviathan2023fast.md to add notes.