Authors: Jay Shah, Ganesh Bikshandi, Ying Zhang, Vijay Thakkar, Pradeep Ramani, Tri Dao
Year: 2024
Venue: arXiv
Type: article
URL: https://arxiv.org/abs/2407.08691
arXiv: 2407.08691
Cite as: [@shah2024flashattention3]
No raw files yet. Run node scripts/fetch-bibliography-raw.mjs --only shah2024flashattention3 to populate, or drop files into raw/bibliography/shah2024flashattention3/.
@article{shah2024flashattention3,
title = {FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision},
author = {Jay Shah and Ganesh Bikshandi and Ying Zhang and Vijay Thakkar and Pradeep Ramani and Tri Dao},
year = {2024},
journal = {arXiv},
url = {https://arxiv.org/abs/2407.08691}
}No notes yet. Create notes/shah2024flashattention3.md to add notes.