Skip to main content
← Back to Bibliography

Mixture-of-Depths: Dynamically Allocating Compute in Transformer-Based Language Models

Authors: David Raposo, Sam Ritter, Blake Richards

Year: 2024

Venue: arXiv

Type: article

URL: https://arxiv.org/abs/2404.02258

arXiv: 2404.02258

Cite as: [@raposo2024mixture]

Raw Files

No raw files yet. Run node scripts/fetch-bibliography-raw.mjs --only raposo2024mixture to populate, or drop files into raw/bibliography/raposo2024mixture/.

BibTeX

@article{raposo2024mixture,
  title = {Mixture-of-Depths: Dynamically Allocating Compute in Transformer-Based Language Models},
  author = {David Raposo and Sam Ritter and Blake Richards},
  year = {2024},
  journal = {arXiv},
  url = {https://arxiv.org/abs/2404.02258}
}

Notes

No notes yet. Create notes/raposo2024mixture.md to add notes.