Skip to main content
← Back to Bibliography

FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning

Authors: Tri Dao

Year: 2024

Venue: ICLR

Type: article

URL: https://arxiv.org/abs/2307.08691

arXiv: 2307.08691

Cite as: [@dao2024flashattention2]

Raw Files

No raw files yet. Run node scripts/fetch-bibliography-raw.mjs --only dao2024flashattention2 to populate, or drop files into raw/bibliography/dao2024flashattention2/.

BibTeX

@inproceedings{dao2024flashattention2,
  title = {FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning},
  author = {Tri Dao},
  year = {2024},
  booktitle = {ICLR},
  url = {https://arxiv.org/abs/2307.08691}
}

Notes

No notes yet. Create notes/dao2024flashattention2.md to add notes.