Skip to main content
← Back to Bibliography

FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision

Authors: Jay Shah, Ganesh Bikshandi, Ying Zhang, Vijay Thakkar, Pradeep Ramani, Tri Dao

Year: 2024

Venue: arXiv

Type: article

URL: https://arxiv.org/abs/2407.08691

arXiv: 2407.08691

Cite as: [@shah2024flashattention3]

Raw Files

No raw files yet. Run node scripts/fetch-bibliography-raw.mjs --only shah2024flashattention3 to populate, or drop files into raw/bibliography/shah2024flashattention3/.

BibTeX

@article{shah2024flashattention3,
  title = {FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision},
  author = {Jay Shah and Ganesh Bikshandi and Ying Zhang and Vijay Thakkar and Pradeep Ramani and Tri Dao},
  year = {2024},
  journal = {arXiv},
  url = {https://arxiv.org/abs/2407.08691}
}

Notes

No notes yet. Create notes/shah2024flashattention3.md to add notes.