Skip to main content
← Back to Bibliography

Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity

Authors: William Fedus, Barret Zoph, Noam Shazeer

Year: 2022

Venue: JMLR

Type: article

URL: https://arxiv.org/abs/2101.03961

arXiv: 2101.03961

Cite as: [@fedus2022switch]

Raw Files

No raw files yet. Run node scripts/fetch-bibliography-raw.mjs --only fedus2022switch to populate, or drop files into raw/bibliography/fedus2022switch/.

BibTeX

@article{fedus2022switch,
  title = {Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity},
  author = {William Fedus and Barret Zoph and Noam Shazeer},
  year = {2022},
  journal = {JMLR},
  url = {https://arxiv.org/abs/2101.03961}
}

Notes

No notes yet. Create notes/fedus2022switch.md to add notes.