Authors: Tri Dao, Daniel Y. Fu, Stefano Ermon, Atri Rudra, Christopher Re
Year: 2022
Venue: NeurIPS
Type: article
URL: https://arxiv.org/abs/2205.14135
arXiv: 2205.14135
Cite as: [@dao2022flashattention]
No raw files yet. Run node scripts/fetch-bibliography-raw.mjs --only dao2022flashattention to populate, or drop files into raw/bibliography/dao2022flashattention/.
@inproceedings{dao2022flashattention,
title = {FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness},
author = {Tri Dao and Daniel Y. Fu and Stefano Ermon and Atri Rudra and Christopher Re},
year = {2022},
booktitle = {NeurIPS},
url = {https://arxiv.org/abs/2205.14135}
}No notes yet. Create notes/dao2022flashattention.md to add notes.