FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

Authors: Tri Dao, Daniel Y. Fu, Stefano Ermon, Atri Rudra, Christopher Ré

Year: 2022

Venue: NeurIPS

Type: inproceedings

URL: https://arxiv.org/abs/2205.14135

arXiv: 2205.14135

Cite as: [@dao2022flashattention]

Raw Files

No raw files yet. Run node scripts/fetch-bibliography-raw.mjs --only dao2022flashattention to populate, or drop files into raw/bibliography/dao2022flashattention/.

BibTeX

@inproceedings{dao2022flashattention,
  title = {FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness},
  author = {Tri Dao and Daniel Y. Fu and Stefano Ermon and Atri Rudra and Christopher R{\'e}},
  year = {2022},
  booktitle = {NeurIPS},
  url = {https://arxiv.org/abs/2205.14135}
}

Notes

No notes yet. Create notes/dao2022flashattention.md to add notes.