Bibliography
All references cited across Mochi Lab. Use [@key] in any markdown file to cite a reference.
Showing 597 of 597 references
@singh2025agenticarticleAditi Singh (2025) — arXiv
@yuan2025native_sparse_attentionarticleJingyang Yuan, Huazuo Gao, Damai Dai, et al. (2025) — arXiv
@liu2025waveletarticlePeng Liu (2025) — Neurocomputing
@specache2025speculativearticleSpeCache (2025) — ICML
@scialom2025continualarticleThomas Scialom (2025) — EMNLP Tutorials
@zhang2025dfloat11articleTianyi Zhang, Mohsen Hariri, Shaochen Zhong, et al. (2025) — NeurIPS
@zhang2025storm_wmarticleWeipu Zhang, Gang Wang, Jian Sun, et al. (2025) — NeurIPS
@ibrahim2024investigatingarticleAdam Ibrahim (2024) — OpenReview
@bardes2024vjepaarticleAdrien Bardes, Quentin Garrido, Jean Ponce, et al. (2024) — arXiv
@asai2024selfragarticleAkari Asai, Zeqiu Wu, Yizhong Wang, et al. (2024) — ICLR
@gu2024mambaarticleAlbert Gu, Tri Dao (2024) — arXiv
@jiang2024mixtralarticleAlbert Q. Jiang, Alexandre Sablayrolles, Antoine Roux (2024) — arXiv
@drouin2024browsergymarticleAlexandre Drouin (2024) — arXiv
@zhou2024latsarticleAndy Zhou, Kai Yan, Michal Shlapentokh-Rothman (2024) — ICML
@ahmadian2024rlooarticleArash Ahmadian, Chris Cremer, Matthias Gallee, et al. (2024) — ACL
@jimenez2024swebencharticleCarlos E. Jimenez, John Yang, Alexander Wettig (2024) — ICLR
@snell2024scaling_testtimearticleCharlie Snell, Jaehoon Lee, Kelvin Xu, et al. (2024) — arXiv
@huang2024loramoearticleCheng Huang (2024) — ACL
@hsieh2024rulerarticleCheng-Ping Hsieh, Simeng Sun, Samuel Kriman, et al. (2024) — arXiv
@dai2024deepseekmoearticleDamai Dai, Chengqi Deng, Chenggang Zhao (2024) — arXiv
@kondratyuk2024videopoetarticleDan Kondratyuk, Lijun Yu, Xiuye Gu, et al. (2024) — ICML
@valevski2024diffusionarticleDani Valevski, Yaniv Leviathan, Moab Arar, et al. (2024) — arXiv
@edge2024graphragarticleDarren Edge, Ha Trinh, Newman Cheng, et al. (2024) — arXiv
@raposo2024mixturearticleDavid Raposo, Sam Ritter, Blake Richards (2024) — arXiv
@deepseek2024deepseekv2articleDeepSeek-AI (2024) — arXiv
@alonso2024diffusionarticleEloi Alonso, Adam Jelley, Anssi Kanervisto, et al. (2024) — NeurIPS
@zelikman2024quietstararticleEric Zelikman, Georges Harik, Yijia Shao, et al. (2024) — arXiv
@deepmind2024alphaproofarticleGoogle DeepMind (2024) — Google DeepMind Blog
@wang2024legoproverarticleHaiming Wang (2024) — ICLR
@wu2024continual_llm_surveyarticleHaizhou Shi, Zihao Xu, Hengyi Wang, et al. (2024) — arXiv
@liu2024ringattentionarticleHao Liu, Matei Zaharia, Pieter Abbeel (2024) — ICLR
@he2024webvoyagerarticleHongliang He (2024) — ACL
@xin2024deepseekproverarticleHuajian Xin (2024) — arXiv
@bruce2024geniearticleJake Bruce, Michael Dennis, Ashley Edwards (2024) — ICML
@shah2024flashattention3articleJay Shah, Ganesh Bikshandi, Ying Zhang, et al. (2024) — arXiv
@lin2024learningarticleJessy Lin, Yilun Du, Olivia Watkins (2024) — ICLR
@lin2024awqarticleJi Lin, Jiaming Tang, Haotian Tang (2024) — MLSys
@wu2024ivideogptarticleJialong Wu (2024) — NeurIPS
@wu2024prearticleJialong Wu, Haoyu Ma, Chaoyi Deng, et al. (2024) — NeurIPS
@su2024roformerarticleJianlin Su, Yu Lu, Shengfeng Pan, et al. (2024) — Neurocomputing
@su2024ropearticleJianlin Su, Murtadha Ahmed, Yu Lu, et al. (2024) — Neurocomputing
@xiang2024languagearticleJiannan Xiang, Tianhua Tao, Yi Gu, et al. (2024) — NeurIPS
@zhao2024galorearticleJiawei Zhao, Zhenyu Zhang, Beidi Chen, et al. (2024) — ICML
@yu2024boosting_moe_clarticleJiazuo Yu, Yunzhi Zhuge, Lu Zhang, et al. (2024) — CVPR
@koh2024visualwebarenaarticleJing Yu Koh, Robert Lo, Lawrence Jang, et al. (2024) — ACL
@lee2024geckoarticleJinhyuk Lee, Zhuyun Dai, Xiaoqi Ren, et al. (2024) — arXiv
@yang2024sweagentarticleJohn Yang, Carlos E. Jimenez, Alexander Wettig, et al. (2024) — arXiv
@kim2024sddgrarticleJunsu Kim (2024) — CVPR
@tirumala2024d4articleKushal Tirumala, Daniel Simig, Armen Aghajanyan, et al. (2024) — NeurIPS
@yu2024language_darearticleLe Yu, Bowen Yu, Haiyang Yu, et al. (2024) — ICML
@wang2024surveyarticleLei Wang, Chen Ma, Xueyang Feng (2024) — Frontiers of Computer Science
@wang2024e5articleLiang Wang, Nan Yang, Xiaolong Huang, et al. (2024) — ACL
@zheng2024sglangarticleLianmin Zheng, Liangsheng Yin, Zhiqiang Xie, et al. (2024) — arXiv
@zheng2024sglangradixtreearticleLianmin Zheng, Liangsheng Yin, Zhiqiang Xie, et al. (2024) — NeurIPS
@wang2024comprehensivearticleLiyuan Wang, Xingxing Zhang, Hang Su, et al. (2024) — IEEE TPAMI
@wang2024hierarchical_hidearticleLiyuan Wang, Jingyi Xie, Xingxing Zhang, et al. (2024) — NeurIPS
@okada2024dreamerv3articleMasashi Okada, Tadahiro Taniguchi (2024) — arXiv
@beck2024xlstmarticleMaximilian Beck, Korbinian Poppel, Markus Spanring (2024) — NeurIPS
@sun2024wandaarticleMingjie Sun, Zhuang Liu, Anna Bair, et al. (2024) — ICLR
@hansen2024tdmpc2articleNicklas Hansen, Hao Su, Xiaolong Wang (2024) — ICLR
@nvidia2024cosmosarticleNVIDIA (2024) — NVIDIA Technical Report
@khattab2024dspyarticleOmar Khattab, Arnav Singhvi, Paridhi Maheshwari, et al. (2024) — ICLR
@openai2024soraarticleOpenAI (2024) — OpenAI Technical Report
@lieber2024jambaarticleOpher Lieber, Barak Lenz, Hofit Bata (2024) — arXiv
@glorioso2024zambaarticlePaolo Glorioso, Quentin Anthony, Yury Tokpanov, et al. (2024) — arXiv
@yadav2024tiesarticlePrateek Yadav, Derek Tam, Leshem Choshen, et al. (2024) — NeurIPS
@patel2024splitwisearticlePratyush Patel, Esha Choukse, Chaojie Zhang, et al. (2024) — ISCA
@es2024ragasarticleShahul Es, Jithin James, Luis Espinosa-Anke, et al. (2024) — EACL
@yan2024cragarticleShi-Qi Yan, Jia-Chen Gu, Yun Zhu, et al. (2024) — arXiv
@xu2024searcharticleShicheng Xu, Liang Pang, Huawei Shen, et al. (2024) — WWW
@liu2024doraarticleShih-Yang Liu, Chien-Yi Wang, Hongxu Yin, et al. (2024) — ICML
@xiao2024bgearticleShitao Xiao, Zheng Liu, Peitian Zhang, et al. (2024) — SIGIR
@ma2024bitnet158articleShuming Ma, Hongyu Wang, Lingxiao Ma, et al. (2024) — arXiv
@zhou2024webarenaarticleShuyan Zhou, Frank F. Xu, Hao Zhu (2024) — ICLR
@arora2024simplearticleSimran Arora, Sabri Eyuboglu, Michael Zhang (2024) — ICML
@arora2024basedarticleSimran Arora, Sabri Eyuboglu, Michael Zhang, et al. (2024) — ICML
@de2024griffinarticleSoham De, Samuel L. Smith, Anushan Fernando (2024) — arXiv
@jeong2024adaptivearticleSoyeong Jeong, Jinheon Baek, Sukmin Cho, et al. (2024) — NAACL
@xie2024osworldarticleTianbao Xie, Danyang Zhang, Jixuan Chen, et al. (2024) — NeurIPS
@zhang2024raftarticleTianjun Zhang, Shishir G. Patil, Naman Jain, et al. (2024) — arXiv
@cai2024medusaarticleTianle Cai, Yuhong Li, Zhengyang Geng, et al. (2024) — ICML
@wu2024mitigatingarticleTianqi Wu (2024) — ACL
@gao2024alcearticleTianyu Gao, Howard Yen, Jiatong Yu, et al. (2024) — EMNLP
@ye2024diff_transformerarticleTianzhu Ye, Li Dong, Yuqing Xia, et al. (2024) — arXiv
@ye2024differentialarticleTianzhu Ye, Li Dong, Yuqing Xia, et al. (2024) — arXiv
@dao2024flashattention2articleTri Dao (2024) — ICLR
@dao2024transformersarticleTri Dao, Albert Gu (2024) — ICML
@dao2024mamba2articleTri Dao, Albert Gu (2024) — ICML
@wang2024driving_world_surveyarticleTuo Wang, Guangming Wang, Yanfeng Wang, et al. (2024) — arXiv
@shi2024trustingarticleWeijia Shi, Sewon W. Han, Mike Lewis (2024) — NAACL Findings
@sun2024rankgptarticleWeiwei Sun, Lingyong Yan, Xinyu Ma, et al. (2024) — EMNLP
@zheng2024occworldarticleWenzhao Zheng, Weiliang Chen, Yuanhui Huang, et al. (2024) — ECCV
@gurnee2024languagearticleWes Gurnee, Max Tegmark (2024) — ICLR
@deng2024mind2webarticleXiang Deng, Yu Gu, Boyuan Zheng (2024) — NeurIPS
@liu2024agentbencharticleXiao Liu, Hao Yu, Hanchen Zhang, et al. (2024) — ICLR
@yang2024cragarticleXiao Yang, Kai Sun, Hao Xin, et al. (2024) — arXiv
@wang2024openhandsarticleXingyao Wang, Boxuan Li, Yufan Song, et al. (2024) — arXiv
@chen2024cot_decodingarticleXuezhi Wang, Denny Zhou (2024) — arXiv
@shao2024stormarticleYijia Shao, Yucheng Jiang, Theodore A. Kanell (2024) — NAACL
@du2024videoarticleYilun Du, Mengjiao Yang, Pete Florence, et al. (2024) — ICLR
@sheng2024sloraarticleYing Sheng, Shiyi Cao, Dacheng Li, et al. (2024) — MLSys
@li2024snapkvarticleYuhong Li, Yingbing Huang, Bowen Yang, et al. (2024) — arXiv
@gao2024modular_ragarticleYunfan Gao, Yun Xiong, Xinyu Gao, et al. (2024) — arXiv
@feng2024retrievalarticleZhangyin Feng, Xiaocheng Feng, Dongyan Zhao, et al. (2024) — EMNLP Findings
@zhou2024robodreamerarticleZhenan Zhou (2024) — arXiv
@zhang2024h2oarticleZhenyu Zhang, Ying Sheng, Tianyi Zhou (2024) — NeurIPS
@shao2024deepseekmatharticleZhihong Shao, Peiyi Wang, Qihao Zhu, et al. (2024) — arXiv
@zhu2024deepseekmathrlarticleZhihong Shao, Peiyi Wang, Qihao Zhu, et al. (2024) — arXiv
@liu2024scissorhandsarticleZichang Liu, Aditya Desai, Fangshuo Liao, et al. (2024) — NeurIPS
@jiang2024longragarticleZiyan Jiang, Xueguang Ma, Wenhu Chen (2024) — arXiv
@galashov2023continuallyarticleAlexandre Galashov (2023) — CoLLAs
@madaan2023selfrefinearticleAman Madaan, Niket Tandon, Prakhar Gupta, et al. (2023) — NeurIPS
@abbas2023semdeduparticleAmro Abbas, Kushal Tirumala, Daniel Simig, et al. (2023) — arXiv
@blattmann2023stablearticleAndreas Blattmann, Tim Dockhorn, Sumith Kulal, et al. (2023) — arXiv
@brohan2023rtarticleAnthony Brohan, Noah Brown, Justice Carbajal, et al. (2023) — arXiv
@hu2023gaia1articleAnthony Hu, Lloyd Russell, Hudson Yeo, et al. (2023) — arXiv
@jiang2023vadarticleBo Jiang, Shaoyu Chen, Qing Xu, et al. (2023) — ICCV
@peng2023rwkvarticleBo Peng, Eric Alcaide, Quentin Anthony (2023) — EMNLP Findings
@chen2023acceleratingarticleCharlie Chen, Sebastian Borgeaud, Geoffrey Irving (2023) — arXiv
@hafner2023masteringarticleDanijar Hafner, Jurgis Pasukonis, Jimmy Ba, et al. (2023) — arXiv
@hafner2023dreamerv3articleDanijar Hafner, Jurgis Pasukonis, Jimmy Ba, et al. (2023) — arXiv
@zhou2023leasttomostarticleDenny Zhou, Nathanael Scharli, Le Hou, et al. (2023) — ICLR
@frantar2023gptqarticleElias Frantar, Saleh Ashkboos, Torsten Hoefler, et al. (2023) — ICLR
@frantar2023sparsegptarticleElias Frantar, Dan Alistarh (2023) — ICML
@keles2023computationalarticleFeyza Duman Keles, Pruthuvi Mahesakya Wijewardena, Cengiz Candan, et al. (2023) — ALT
@ilharco2023editingarticleGabriel Ilharco, Marco Tulio Ribeiro, Mitchell Wortsman, et al. (2023) — ICLR
@kim2023treearticleTree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Models
Gangwoo Kim, Sungdong Kim, Byeongguk Jeon, et al. (2023) — EMNLP
@izacard2023atlasarticleGautier Izacard, Patrick Lewis, Maria Lomeli (2023) — JMLR
@voyager2023articleGuanzhi Wang, Yuqi Xie, Yunfan Jiang, et al. (2023) — arXiv
@rashkin2023measuringarticleHannah Rashkin, Vitaly Nikolaev, Matthew Lamm (2023) — CL
@trivedi2023ircotarticleHarsh Trivedi, Niranjan Balasubramanian, Tushar Khot, et al. (2023) — ACL
@su2023embedderarticleHongjin Su, Weijia Shi, Jungo Kasai, et al. (2023) — ACL Findings
@wang2023bitnetarticleHongyu Wang, Shuming Ma, Li Dong, et al. (2023) — arXiv
@touvron2023llamaarticleHugo Touvron, Thibaut Lavril, Gautier Izacard, et al. (2023) — arXiv
@lightman2023prmarticleHunter Lightman, Vineet Kosaraju, Yura Burda, et al. (2023) — ICLR
@smith2023codapromptarticleJames Seale Smith, Leonid Karlinsky, Vyshnavi Gutta, et al. (2023) — CVPR
@robine2023transformerarticleJan Robine, Marc Hoftmann, Tobias Uelwer, et al. (2023) — ICLR
@hoelscher2023detectingarticleJason Hoelscher-Obermaier, Julia Perber, Fazl Barez, et al. (2023) — ACL Findings
@ainslie2023gqaarticleJoshua Ainslie, James Lee-Thorp, Michiel de Jong (2023) — EMNLP
@greshake2023morearticleKai Greshake, Sahar Abdelnabi, Shailesh Mishra, et al. (2023) — AISec
@yang2023leandojoarticleKaiyu Yang (2023) — NeurIPS
@li2023emergent_othelloarticleKenneth Li, Aspen K. Hopkins, David Bau, et al. (2023) — ICLR
@meng2023massarticleKevin Meng, Arnab Sen Sharma, Alex Andonian, et al. (2023) — ICLR
@ahn2023sayplanarticleKrishan Rana, Jesse Haviland, Sourav Garg, et al. (2023) — CoRL
@gupta2023continualarticleKshitij Gupta, Benjamin Acting, Yi Tay (2023) — ICML Workshop
@zheng2023judgingarticleLianmin Zheng, Wei-Lin Chiang, Ying Sheng (2023) — NeurIPS
@guan2023leveragingarticleLin Guan, Karthik Valmeekam, Sarath Sreedharan, et al. (2023) — NeurIPS
@wong2023wordarticleLionel Wong, Gabriel Grand, Alexander Lew (2023) — arXiv
@gao2023hydearticleLuyu Gao, Xueguang Ma, Jimmy Lin, et al. (2023) — ACL
@masana2023classarticleMarc Masana, Xialei Liu, Bartlomiej Twardowski (2023) — IEEE TPAMI
@mitchell2023debatearticleMelanie Mitchell, David C. Krakauer (2023) — PNAS
@yang2023learningarticleMengjiao Yang, Yilun Du, Kamyar Ghasemipour (2023) — arXiv
@poli2023hyenaarticleMichael Poli, Stefano Massaroli, Eric Nguyen (2023) — ICML
@nanda2023emergentarticleNeel Nanda, Andrew Lee, Martin Wattenberg (2023) — NeurIPS ATTRIB Workshop
@liu2023evaluatingarticleNelson F. Liu, Tianyi Zhang, Percy Liang (2023) — EMNLP Findings
@muennighoff2023mtebarticleNiklas Muennighoff, Nouamane Tazi, Loic Magne, et al. (2023) — EACL
@shinn2023reflexionarticleNoah Shinn, Federico Cassano, Ashwin Gopinath, et al. (2023) — NeurIPS
@press2023measuringarticleOfir Press, Muru Zhang, Sewon Min, et al. (2023) — EMNLP Findings
@khattab2023dsparticleOmar Khattab, Keshav Santhanam, Xiang Lisa Li, et al. (2023) — arXiv
@ram2023incontextarticleOri Ram, Yoav Levine, Itay Dalmedigos, et al. (2023) — TACL
@liang2023holisticarticlePercy Liang, Rishi Bommasani, Tony Lee (2023) — TMLR
@wu2023daydreamerarticlePhilipp Wu, Alejandro Escontrela, Danijar Hafner, et al. (2023) — CoRL
@wu2023autogenarticleQingyun Wu, Gagan Bansal, Jieyu Zhang, et al. (2023) — arXiv
@rafailov2023dpoarticleRafael Rafailov, Archit Sharma, Eric Mitchell, et al. (2023) — NeurIPS
@rafailov2023directarticleRafael Rafailov, Archit Sharma, Eric Mitchell, et al. (2023) — NeurIPS
@lam2023graphcastarticleRemi Lam, Alvaro Sanchez-Gonzalez, Matthew Willson, et al. (2023) — Science
@gao2023ddgrarticleRui Gao (2023) — ICML
@kim2023achievingarticleSanghwan Kim, Lorenzo Noci, Antonio Orvieto, et al. (2023) — CVPR
@mehta2023empiricalarticleSanket Vaibhav Mehta, Darshan Patil, Sarath Chandar, et al. (2023) — JMLR
@min2023factscorearticleSewon Min, Kalpesh Krishna, Xinxi Lyu, et al. (2023) — EMNLP
@lin2023trainarticleSheng-Chieh Lin, Akari Asai, Minghan Li, et al. (2023) — EMNLP Findings
@hao2023reasoningarticleShibo Hao, Yi Gu, Haodi Ma (2023) — EMNLP
@patil2023gorillaarticleShishir G. Patil, Tianjun Zhang, Xin Wang, et al. (2023) — arXiv
@li2023languagearticleShuang Li, Xavier Puig, Chris Paxton, et al. (2023) — NeurIPS
@yao2023reactarticleShunyu Yao, Jeffrey Zhao, Dian Yu (2023) — ICLR
@yao2023treearticleShunyu Yao, Dian Yu, Jeffrey Zhao (2023) — NeurIPS
@significant2023autogptarticleSignificant Gravitas (2023) — GitHub
@moerland2023modelarticleThomas M. Moerland, Joost Broekens, Aske Plaat, et al. (2023) — Foundations and Trends in Machine Learning
@gao2023enablingarticleTianyu Gao, Howard Yen, Jiatong Yu, et al. (2023) — EMNLP
@dettmers2023qloraarticleTim Dettmers, Artidoro Pagnoni, Ari Holtzman, et al. (2023) — NeurIPS
@schick2023toolformerarticleTimo Schick, Jane Dwivedi-Yu, Roberto Dessi (2023) — NeurIPS
@dao2023flashattention2articleTri Dao (2023) — ICLR
@micheli2023transformersarticleVincent Micheli, Eloi Alonso, Francois Fleuret (2023) — ICLR
@shi2023replugarticleWeijia Shi, Sewon Min, Michihiro Yasunaga, et al. (2023) — arXiv
@huang2023innerarticleWenlong Huang, Fei Xia, Ted Xiao, et al. (2023) — CoRL
@kwon2023efficientarticleWoosuk Kwon, Zhuohan Li, Siyuan Zhuang (2023) — SOSP
@ma2023queryarticleXinbei Ma (2023) — EMNLP
@wang2023selfconsistencyarticleXuezhi Wang, Jason Wei, Dale Schuurmans, et al. (2023) — ICLR
@leviathan2023fastarticleYaniv Leviathan, Matan Kalman, Yossi Matias (2023) — ICML
@zhao2023pytorcharticleYanli Zhao, Andrew Gu, Rohan Varma (2023) — VLDB
@hu2023planningarticleYihan Hu, Jiazhi Yang, Li Chen, et al. (2023) — CVPR
@seo2023maskedarticleYounggyo Seo, Danijar Hafner, Hao Liu, et al. (2023) — CoRL
@li2023teacherlmarticleYuanzhi Li, Sebastien Bubeck, Ronen Eldan, et al. (2023) — arXiv
@qin2023toolarticleYujia Qin, Shengding Hu, Yankai Lin (2023) — arXiv
@luo2023empiricalarticleYun Luo (2023) — arXiv
@yao2023editingarticleYunzhi Yao, Peng Wang, Bozhong Tian, et al. (2023) — EMNLP
@sun2023retentivearticleYutao Sun, Li Dong, Shaohan Huang, et al. (2023) — arXiv
@li2023gtearticleZehan Li, Xin Zhang, Yanzhao Zhang, et al. (2023) — arXiv
@jiang2023flarearticleZhengbao Jiang, Frank F. Xu, Luyu Gao (2023) — EMNLP
@shao2023enhancingarticleZhihong Shao, Yeyun Gong, Yelong Shen (2023) — EMNLP Findings
@wang2023orthogonalarticleZhilin Wang (2023) — EMNLP Findings
@wu2023slotformerarticleZiyi Wu, Nikita Dvornik, Klaus Greff, et al. (2023) — ICLR
@parisi2022talmarticleAaron Parisi, Yao Zhao, Noah Fishi (2022) — arXiv
@gu2022efficientlyarticleAlbert Gu, Karan Goel, Christopher Re (2022) — ICLR
@hu2022milearticleAnthony Hu, Gianluca Corrado, Nicolas Griffiths, et al. (2022) — NeurIPS
@zoph2022stmoearticleBarret Zoph, Irwan Bello, Sameer Kumar (2022) — arXiv
@chen2022gslidearticleBeidi Chen (2022) — IEEE TPDS
@chen2022transdreamerarticleChang Chen, Yi-Fu Wu, Jaesik Yoon, et al. (2022) — arXiv
@li2022neuralarticleChunyun Li (2022) — ACM Computing Surveys
@zhou2022memoarticleDa-Wei Zhou, Qi-Wei Wang, Han-Jia Ye, et al. (2022) — ICLR
@hafner2022directorarticleDanijar Hafner, Kuang-Huei Lee, Ian Fischer, et al. (2022) — NeurIPS
@hu2022loraarticleEdward J. Hu, Yelong Shen, Phillip Wallis, et al. (2022) — ICLR
@arani2022learningarticleElahe Arani, Fahad Sarfraz, Bahram Zonooz (2022) — ICLR
@mitchell2022fastarticleEric Mitchell, Charles Lin, Antoine Bosselut, et al. (2022) — ICLR
@zelikman2022stararticleEric Zelikman, Yuhuai Wu, Jesse Mu, et al. (2022) — NeurIPS
@normandin2022sequoiaarticleFabrice Normandin, Florian Golemo, Oleksiy Ostapenko, et al. (2022) — arXiv
@liu2022randomarticleFanghui Liu, Xiaolin Huang, Yudong Chen, et al. (2022) — IEEE TPAMI
@deng2022dreamerproarticleDreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations
Fei Deng, Ingook Jang, Sungjin Ahn (2022) — ICML
@wang2022fosterarticleFu-Yun Wang, Da-Wei Zhou, Han-Jia Ye, et al. (2022) — ECCV
@izacard2022contrieverarticleGautier Izacard, Mathilde Caron, Lucas Hosseini, et al. (2022) — TMLR
@vandeven2022three_typesarticleGido M. van de Ven, Hava T. Siegelmann, Andreas S. Tolias (2022) — Nature Machine Intelligence
@yu2022orcaarticleGyeong-In Yu, Joo Seong Jeong, Geon-Woo Kim, et al. (2022) — OSDI
@trivedi2022musiquearticleHarsh Trivedi, Niranjan Balasubramanian, Tushar Khot, et al. (2022) — TACL
@evron2022catastrophicarticleItay Evron, Edward Moroshko, Rachel Ward, et al. (2022) — COLT
@pathak2022fourcastnetarticleJaideep Pathak, Shashank Subramanian, Peter Harrington, et al. (2022) — arXiv
@sevilla2022computearticleJaime Sevilla, Lennart Heim, Anson Ho, et al. (2022) — arXiv
@leethorp2022fnetarticleJames Lee-Thorp, Joshua Ainslie, Ilya Eckstein, et al. (2022) — NAACL
@wei2022chainarticleJason Wei, Xuezhi Wang, Dale Schuurmans, et al. (2022) — NeurIPS
@jang2022towardsarticleJoel Jang, Seonghyeon Ye, Sungdong Yang (2022) — ICLR
@guibas2022adaptivearticleJohn Guibas, Morteza Mardani, Zongyi Li (2022) — ICLR
@guibas2022afnoarticleJohn Guibas, Morteza Mardani, Zongyi Li, et al. (2022) — ICLR
@ho2022videoarticleJonathan Ho, Tim Salimans, Alexey Gritsenko (2022) — NeurIPS
@ho2022cfgarticleJonathan Ho, Tim Salimans (2022) — NeurIPS Workshop
@ho2022imagenarticleJonathan Ho, William Chan, Chitwan Saharia, et al. (2022) — arXiv
@hoffmann2022trainingarticleJordan Hoffmann, Sebastian Borgeaud, Arthur Mensch (2022) — NeurIPS
@mendez2022modular_clarticleJorge A. Mendez, Eric Eaton (2022) — ICML
@lee2022deduplicatingarticleKatherine Lee (2022) — ACL
@santhanam2022colbertv2articleKeshav Santhanam, Omar Khattab, Jon Saad-Falcon, et al. (2022) — NAACL
@meng2022locatingarticleKevin Meng, David Bau, Alex Andonian, et al. (2022) — NeurIPS
@ouyang2022trainingarticleLong Ouyang, Jeff Wu, Xu Jiang, et al. (2022) — NeurIPS
@caccia2022newarticleLucas Caccia, Rahaf Aljundi, Nader Asadi, et al. (2022) — ICLR
@boschini2022classarticleMatteo Boschini, Lorenzo Bonicelli, Pietro Buzzega, et al. (2022) — IEEE TPAMI
@wortsman2022model_soupsarticleMitchell Wortsman, Gabriel Ilharco, Samir Yitzhak Gadre, et al. (2022) — ICML
@hansen2022temporalarticleNicklas Hansen, Xiaolong Su, Xiaolong Wang (2022) — ICML
@micikevicius2022fp8articlePaulius Micikevicius, Dusan Stosic, Neil Burgess, et al. (2022) — arXiv
@jeevan2022wavemixarticlePranav Jeevan, Amit Sethi (2022) — arXiv
@rombach2022higharticleRobin Rombach, Andreas Blattmann, Dominik Lorenz, et al. (2022) — CVPR
@srivastava2022behaviorarticleSanjana Srivastava, Chengshu Li, Michael Lingelbach, et al. (2022) — CoRL
@borgeaud2022retroarticleSebastian Borgeaud, Arthur Mensch, Jordan Hoffmann (2022) — ICML
@yao2022webshoparticleShunyu Yao, Howard Chen, John Yang, et al. (2022) — NeurIPS
@kojima2022largearticleTakeshi Kojima, Shixiang Shane Gu, Machel Reid, et al. (2022) — NeurIPS
@schuster2022confidentarticleTal Schuster, Adam Fisch, Jai Gupta (2022) — NeurIPS
@dettmers2022gpt3int8articleTim Dettmers, Mike Lewis, Younes Belkada, et al. (2022) — NeurIPS
@dao2022flashattentionarticleTri Dao, Daniel Y. Fu, Stefano Ermon, et al. (2022) — NeurIPS
@singer2022makearticleUriel Singer, Adam Polyak, Thomas Hayes, et al. (2022) — arXiv
@voleti2022mcvdarticleVikram Voleti, Alexia Jolicoeur-Martineau, Christopher Pal (2022) — NeurIPS
@fedus2022switcharticleWilliam Fedus, Barret Zoph, Noam Shazeer (2022) — JMLR
@wang2022spromptsarticleYabin Wang, Zhiwu Huang, Xiaopeng Hong (2022) — NeurIPS
@lecun2022patharticleYann Lecun (2022) — openreview.net
@lecun2024sora_critiquearticleYann LeCun (2022) — OpenReview
@zhou2022expert_choicearticleYanqi Zhou, Tao Lei, Hanxiao Liu, et al. (2022) — NeurIPS
@li2022alphacodearticleYujia Li, David Choi, Junyoung Chung (2022) — Science
@li2022bevformerarticleZhiqi Li, Wenhai Wang, Hongyang Li, et al. (2022) — ECCV
@wang2022learning_l2particleZifeng Wang, Zizhao Zhang, Chen-Yu Lee, et al. (2022) — CVPR
@wang2022dualpromptarticleZifeng Wang, Zizhao Zhang, Sayna Ebrahimi, et al. (2022) — ECCV
@dosovitskiy2021imagearticleAlexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, et al. (2021) — ICLR
@madotto2021continualarticleAndrea Madotto, Zhaojiang Lin, Zhenpeng Zhou, et al. (2021) — EMNLP
@lester2021prompt_tuningarticleBrian Lester, Rami Al-Rfou, Noah Constant (2021) — EMNLP
@hendrycks2021mmluarticleDan Hendrycks, Collin Burns, Steven Basart, et al. (2021) — ICLR
@hendrycks2021appsarticleDan Hendrycks, Steven Basart, Saurav Kadavath, et al. (2021) — NeurIPS
@hafner2021masteringarticleDanijar Hafner, Timothy Lillicrap, Mohammad Norouzi, et al. (2021) — ICLR
@narayanan2021efficientarticleDeepak Narayanan, Mohammad Shoeybi, Jared Casper (2021) — SC
@lepikhin2021gshardarticleDmitry Lepikhin, HyoukJoong Lee, Yuanzhong Xu (2021) — ICLR
@metzler2021rethinkingarticleDonald Metzler, Yi Tay, Dara Bahri, et al. (2021) — SIGIR Forum
@petroni2021kiltarticleFabio Petroni, Aleksandra Piktus, Angela Fan (2021) — NAACL
@izacard2021fidarticleGautier Izacard, Edouard Grave (2021) — EACL
@saha2021gradientarticleGobinda Saha, Isha Garg, Kaushik Roy (2021) — ICLR
@benmeziane2021comprehensivearticleHadjer Benmeziane, Kaoutar El Maghraoui, Hamza Ouarnoughi (2021) — arXiv
@peng2021rfaarticleHao Peng, Nikolaos Pappas, Dani Yogatama, et al. (2021) — ICLR
@peng2021random_feature_attentionarticleHao Peng, Nikolaos Pappas, Dani Yogatama, et al. (2021) — ICLR
@ahn2021ssilarticleHongjoon Ahn, Jihwan Kwak, Subin Lim, et al. (2021) — ICCV
@cha2021co2larticleHyuntak Cha, Jaeho Lee, Jinwoo Shin (2021) — ICCV
@yoon2021federatedarticleJaehong Yoon, Wonyong Jeong, Giwoong Lee, et al. (2021) — ICML
@johnson2021faissarticleJeff Johnson, Matthijs Douze, Herve Jegou (2021) — IEEE TBD
@choromanski2021rethinkingarticleKrzysztof Choromanski, Valerii Likhosherstov, David Dohan (2021) — ICLR
@chen2021humanevalarticleMark Chen, Jerry Tworek, Heewoo Jun, et al. (2021) — arXiv
@delange2021continualarticleMatthias De Lange, Rahaf Aljundi, Marc Masana (2021) — IEEE TPAMI
@lewis2021basearticleMike Lewis (2021) — ICML
@chen2022autoformer_nasarticleMinghao Chen, Houwen Peng, Jianlong Fu, et al. (2021) — ICCV
@babaeizadeh2021fitvidarticleMohammad Babaeizadeh, Mohammad Taghi Saffar, Suraj Nair (2021) — arXiv
@geva2023strategyqaarticleMor Geva, Daniel Khashabi, Elad Segal, et al. (2021) — TACL
@thakur2021beirarticleNandan Thakur, Nils Reimers, Andreas Ruckle, et al. (2021) — NeurIPS
@nakkiran2021deeparticlePreetum Nakkiran, Gal Kaplun, Yamini Bansal, et al. (2021) — JMLR
@nakano2021webgptarticleReiichiro Nakano (2021) — arXiv
@lee2021continualarticleSebastian Lee, Sebastian Goldt, Andrew Saxe (2021) — ICML
@wang2021orthogonal_adamnarticleShipeng Wang, Xiaorong Li, Jian Sun, et al. (2021) — CVPR
@yan2021der_clarticleShipeng Yan, Jiangwei Xie, Xuming He (2021) — CVPR
@hospedales2021metalearningarticleTimothy Hospedales, Antreas Antoniou, Paul Micaelli, et al. (2021) — IEEE TPAMI
@veniat2021efficientarticleTom Veniat, Ludovic Denoyer, Marc'Aurelio Ranzato (2021) — ICLR
@lomonaco2021avalanchearticleVincenzo Lomonaco, Lorenzo Pellegrini, Andrea Cossu (2021) — CLVision Workshop at CVPR
@ye2021masteringarticleWeirui Ye, Shaohuai Liu, Thanard Kurutach, et al. (2021) — NeurIPS
@yan2021videogptarticleWilson Yan, Yunzhi Zhang, Pieter Abbeel, et al. (2021) — arXiv
@li2021prefixarticleXiang Lisa Li, Percy Liang (2021) — ACL
@tay2021longarticleYi Tay, Mostafa Dehghani, Samira Abnar (2021) — ICLR
@dong2021attentionarticleYihe Dong, Jean-Baptiste Cordonnier, Andreas Loukas (2021) — ICML
@xiong2021nystromformerarticleYunyang Xiong, Zhanpeng Zeng, Rudrasis Chakraborty (2021) — AAAI
@liu2021swinarticleZe Liu, Yutong Lin, Yue Cao, et al. (2021) — ICCV
@li2021fourierarticleZongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, et al. (2021) — ICLR
@gu2020hippoarticleAlbert Gu, Tri Dao, Stefano Ermon, et al. (2020) — NeurIPS
@lee2020stochasticarticleAlex X. Lee, Anusha Nagabandi, Pieter Abbeel, et al. (2020) — NeurIPS
@sanchez2020learningarticleAlvaro Sanchez-Gonzalez, Jonathan Godwin, Tobias Pfaff, et al. (2020) — ICML
@katharopoulos2020transformersarticleAngelos Katharopoulos, Apoorv Vyas, Nikolaos Pappas, et al. (2020) — ICML
@zela2020understandingarticleArber Zela, Thomas Elsken, Tonmoy Saikia, et al. (2020) — ICLR
@chrysakis2020onlinearticleAristotelis Chrysakis, Marie-Francine Moens (2020) — ICML
@douillard2020podnetarticleArthur Douillard, Matthieu Cord, Charles Ollion, et al. (2020) — ECCV
@chen2020slidearticleBeidi Chen, Tri Dao, Eric Winsor, et al. (2020) — MLSys
@mildenhall2020nerfarticleBen Mildenhall, Pratul P. Srinivasan, Matthew Tancik, et al. (2020) — ECCV
@zhao2020maintaining_waarticleBowen Zhao, Xi Xiao, Guojun Gan, et al. (2020) — CVPR
@hafner2020dreamarticleDanijar Hafner, Timothy Lillicrap, Jimmy Ba, et al. (2020) — ICLR
@cubuk2020randaugmentarticleEkin D. Cubuk, Barret Zoph, Jonathon Shlens, et al. (2020) — CVPR Workshops
@locatello2020objectarticleFrancesco Locatello, Dirk Weissenborn, Thomas Unterthiner, et al. (2020) — NeurIPS
@gupta2020lamamlarticleGunshi Gupta, Karmesh Yadav, Liam Paull (2020) — NeurIPS
@cai2020onceforallarticleHan Cai, Chuang Gan, Tianzhe Wang, et al. (2020) — ICLR
@cai2020oncearticleHan Cai, Chuang Gan, Tianzhe Wang, et al. (2020) — ICLR
@beltagy2020longformerarticleIz Beltagy, Matthew E. Peters, Arman Cohan (2020) — arXiv
@kaplan2020scalingarticleJared Kaplan, Sam McCandlish, Tom Henighan (2020) — arXiv
@cordonnier2020relationshiparticleJean-Baptiste Cordonnier, Andreas Loukas, Martin Jaggi (2020) — ICLR
@ho2020ddpmarticleJonathan Ho, Ajay Jain, Pieter Abbeel (2020) — NeurIPS
@schrittwieser2020masteringarticleJulian Schrittwieser, Ioannis Antonoglou, Thomas Hubert (2020) — Nature
@guu2020realmarticleKelvin Guu, Kenton Lee, Zora Tung, et al. (2020) — ICML
@kaiser2020modelarticleLukasz Kaiser, Mohammad Babaeizadeh, Piotr Milos (2020) — ICLR
@zaheer2020bigbirdarticleManzil Zaheer, Guru Guruganesh, Kumar Avinava Dubey (2020) — NeurIPS
@zaheer2020bigarticleManzil Zaheer, Guru Guruganesh, Kumar Avinava Dubey, et al. (2020) — NeurIPS
@chen2020generativearticleMark Chen, Alec Radford, Rewon Child, et al. (2020) — ICML
@tancik2020fourierarticleMatthew Tancik, Pratul P. Srinivasan, Ben Mildenhall (2020) — NeurIPS
@wortsman2020supermasksarticleMitchell Wortsman, Vivek Ramanujan, Rosanne Liu (2020) — NeurIPS
@shoeybi2020megatronlmarticleMohammad Shoeybi, Mostofa Patwary, Raul Puri, et al. (2020) — arXiv
@lambert2020objectivearticleNathan Lambert, Brandon Amos, Omry Yadan, et al. (2020) — L4DC
@thompson2020computationalarticleNeil C. Thompson, Kristjan Greenewald, Keeheon Lee, et al. (2020) — arXiv
@kitaev2020reformerarticleNikita Kitaev, Lukasz Kaiser, Anselm Levskaya (2020) — ICLR
@khattab2020colbertarticleOmar Khattab, Matei Zaharia (2020) — SIGIR
@ahmed2020causalworldarticleOssama Ahmed, Frederik Träuble, Anirudh Goyal, et al. (2020) — ICLR
@ferragina2020pgmarticlePaolo Ferragina, Giorgio Vinciguerra (2020) — VLDB
@lewis2020ragarticlePatrick Lewis, Ethan Perez, Aleksandra Piktus (2020) — NeurIPS
@martinsson2020randomizedarticlePer-Gunnar Martinsson, Joel A. Tropp (2020) — Acta Numerica
@buzzega2020darkarticlePietro Buzzega, Matteo Boschini, Angelo Porrello, et al. (2020) — NeurIPS
@khosla2020supervisedarticlePrannay Khosla, Piotr Teterwak, Chen Wang, et al. (2020) — NeurIPS
@sekar2020planningarticleRamanan Sekar, Oleh Rybkin, Kostas Daniilidis, et al. (2020) — ICML
@rajbhandari2020zeroarticleSamyam Rajbhandari, Jeff Rasley, Olatunji Ruwase, et al. (2020) — SC
@beaulieu2020learningarticleShawn Beaulieu, Lapo Frati, Thomas Miconi (2020) — ECAI
@wang2020linformerarticleSinong Wang, Belinda Z. Li, Madian Khabsa, et al. (2020) — arXiv
@lee2020neural_clarticleSoochan Lee, Junsoo Ha, Dongsu Zhang, et al. (2020) — ICLR
@bhojanapalli2020lowarticleSrinadh Bhojanapalli, Chulhee Yun, Ankit Singh Rawat, et al. (2020) — ICML
@james2020rlbencharticleStephen James, Zicong Ma, David Rovick Arrojo, et al. (2020) — IEEE Robotics and Automation Letters
@gururangan2020dontarticleSuchin Gururangan, Ana Marasovic, Swabha Swayamdipta, et al. (2020) — ACL
@kipf2020contrastivearticleThomas Kipf, Elise van der Pol, Max Welling (2020) — ICLR
@yu2020metaworldarticleTianhe Yu, Deirdre Quillen, Zhanpeng He, et al. (2020) — CoRL
@brown2020gpt3articleTom Brown, Benjamin Mann, Nick Ryder, et al. (2020) — NeurIPS
@hayes2020remindarticleTyler L. Hayes, Kushal Kafle, Robik Shrestha, et al. (2020) — ECCV
@karpukhin2020dprarticleVladimir Karpukhin, Barlas Oguz, Sewon Min, et al. (2020) — EMNLP
@zhao2020simarticleWenshuai Zhao, Jorge Pena Queralta, Tomi Westerlund (2020) — IEEE Symposium Series on Computational Intelligence
@malkov2020hnswarticleYu A. Malkov, Dmitry A. Yashunin (2020) — IEEE TPAMI
@radford2019languagearticleAlec Radford, Jeffrey Wu, Rewon Child, et al. (2019) — OpenAI Blog
@razavi2019generatingarticleAli Razavi, Aaron van den Oord, Oriol Vinyals (2019) — NeurIPS
@howard2019searchingarticleAndrew Howard, Mark Sandler, Grace Chu, et al. (2019) — ICCV
@chaudhry2019tinyarticleArslan Chaudhry, Marcus Rohrbach, Mohamed Elhoseiny, et al. (2019) — arXiv
@chaudhry2019efficientarticleArslan Chaudhry, Marc'Aurelio Ranzato, Marcus Rohrbach, et al. (2019) — ICLR
@burgess2019monetarticleChristopher P. Burgess, Loic Matthey, Nicholas Watters, et al. (2019) — arXiv
@hafner2019learningarticleDanijar Hafner, Timothy Lillicrap, Ian Fischer (2019) — ICML
@narayanan2019pipedreamarticleDeepak Narayanan, Aaron Harlap, Amar Phanishayee, et al. (2019) — SOSP
@voita2019analyzingarticleElena Voita, David Talbot, Fedor Moiseev, et al. (2019) — ACL
@strubell2019energyarticleEmma Strubell, Ananya Ganesh, Andrew McCallum (2019) — ACL
@parisi2019continualarticleGerman I. Parisi, Ronald Kemker, Jose L. Part, et al. (2019) — Neural Networks
@vandeven2019threearticleGido M. van de Ven, Andreas S. Tolias (2019) — NeurIPS Continual Learning Workshop
@husain2019codesearchnetarticleHamel Husain, Ho-Hsiang Wu, Tiferet Gazit, et al. (2019) — arXiv
@liu2019dartsarticleHanxiao Liu, Karen Simonyan, Yiming Yang (2019) — ICLR
@loshchilov2019adamwarticleIlya Loshchilov, Frank Hutter (2019) — ICLR
@devlin2019bertarticleJacob Devlin, Ming-Wei Chang, Kenton Lee, et al. (2019) — NAACL
@frankle2019lotteryarticleJonathan Frankle, Michael Carlin (2019) — ICLR
@javed2019metaarticleKhurram Javed, Martha White (2019) — NeurIPS
@greff2019multiarticleKlaus Greff, Raphael Lopez Kaufman, Rishabh Kabra, et al. (2019) — ICML
@savva2019habitatarticleManolis Savva, Abhishek Kadian, Oleksandr Maksymets (2019) — ICCV
@riemer2019learningarticleMatthew Riemer, Ignacio Cases, Robert Ajemian (2019) — ICLR
@janner2019whenarticleMichael Janner, Justin Fu, Marvin Zhang, et al. (2019) — NeurIPS
@janner2019trustarticleMichael Janner, Justin Fu, Marvin Zhang, et al. (2019) — NeurIPS
@tan2019efficientnetarticleMingxing Tan, Quoc V. Le (2019) — ICML
@tan2019mnasnetarticleMingxing Tan, Bo Chen, Ruoming Pang, et al. (2019) — CVPR
@ke2019modelingarticleNan Rosemary Ke, Amanpreet Singh, Ahmed Touati (2019) — ICLR
@rahaman2019spectralarticleNasim Rahaman, Aristide Baratin, Devansh Arpit (2019) — ICML
@houlsby2019parameterarticleNeil Houlsby, Andrei Giurgiu, Stanislaw Jastrzebski, et al. (2019) — ICML
@ivkin2019communicationarticleNikita Ivkin, Daniel Rothchild, Enayat Ullah (2019) — NeurIPS
@shazeer2019fastarticleNoam Shazeer (2019) — arXiv
@aljundi2019taskfreearticleRahaf Aljundi, Klaas Kelchtermans, Tinne Tuytelaars (2019) — CVPR
@aljundi2019onlinearticleRahaf Aljundi, Eugene Belilovsky, Tinne Tuytelaars, et al. (2019) — NeurIPS
@aljundi2019gradientarticleRahaf Aljundi, Min Lin, Baptiste Goujaud, et al. (2019) — NeurIPS
@child2019generatingarticleRewon Child, Scott Gray, Alec Radford, et al. (2019) — arXiv
@spring2019compressingarticleRyan Spring, Anshumali Shrivastava (2019) — ICML
@hou2019learningarticleSaihui Hou, Xinyu Pan, Chen Change Loy, et al. (2019) — CVPR
@yun2019cutmixarticleSangdoo Yun, Dongyoon Han, Seong Joon Oh, et al. (2019) — ICCV
@farquhar2019towardsarticleSebastian Farquhar, Yarin Gal (2019) — Privacy in Machine Learning Workshop at NeurIPS
@elsken2019neuralarticleThomas Elsken, Jan Hendrik Metzen, Frank Hutter (2019) — JMLR
@kwiatkowski2019naturalarticleTom Kwiatkowski, Jennimaria Palomaki, Olivia Redfield (2019) — TACL
@sanh2019distilbertarticleVictor Sanh, Lysandre Debut, Julien Chaumond, et al. (2019) — EMC2 Workshop at NeurIPS
@chen2019progressive_dartsarticleXin Chen, Lingxi Xie, Jun Wu, et al. (2019) — IJCV
@song2019scorearticleYang Song, Stefano Ermon (2019) — NeurIPS
@huang2019gpipearticleYanping Huang, Youlong Cheng, Ankur Bapna (2019) — NeurIPS
@wu2019large_bicarticleYue Wu, Yinpeng Chen, Lijuan Wang, et al. (2019) — CVPR
@tsai2019transformerarticleYun-Hsuan Hsiao Tsai, Shaojie Bai, Barnabas Poczos, et al. (2019) — EMNLP
@allen2019convergencearticleZeyuan Allen-Zhu, Yuanzhi Li, Zhao Song (2019) — ICML
@oord2018infoncearticleAaron van den Oord, Yazhe Li, Oriol Vinyals (2018) — arXiv
@nichol2018firstarticleAlex Nichol, Joshua Achiam, John Schulman (2018) — arXiv
@nagabandi2018neuralarticleAnusha Nagabandi, Gregory Kahn, Ronald S. Feisal, et al. (2018) — ICRA
@chaudhry2018riemannianarticleArslan Chaudhry, Puneet K. Dokania, Thalaiyasingam Ajanthan, et al. (2018) — ECCV
@jacot2018ntkarticleArthur Jacot, Franck Gabriel, Clement Hongler (2018) — NeurIPS
@jacot2018neuralarticleArthur Jacot, Franck Gabriel, Clément Hongler (2018) — NeurIPS
@mallya2018packnetarticleArun Mallya, Svetlana Lazebnik (2018) — CVPR
@louart2018randomarticleCosme Louart, Zhenyu Liao, Romain Couillet (2018) — Annals of Applied Probability
@ha2018recurrentarticleDavid Ha, Jurgen Schmidhuber (2018) — NeurIPS
@silver2018generalarticleDavid Silver, Thomas Hubert, Julian Schrittwieser, et al. (2018) — Science
@denton2018stochasticarticleEmily Denton, Rob Fergus (2018) — ICML
@ghiasi2018dropblockarticleGolnaz Ghiasi, Tsung-Yi Lin, Quoc V. Le (2018) — NeurIPS
@zhang2018mixuparticleHongyi Zhang, Moustapha Cisse, Yann N. Dauphin, et al. (2018) — ICLR
@clavera2018modelarticleIgnasi Clavera, Jonas Rothfuss, John Schulman, et al. (2018) — CoRL
@yoon2018lifelongarticleJaehong Yoon, Eunho Yang, Jeongtae Lee, et al. (2018) — ICLR
@thorne2018feverarticleJames Thorne, Andreas Vlachos, Christos Christodoulopoulos, et al. (2018) — NAACL
@wang2018surveyarticleJingdong Wang, Ting Zhang, Jingkuan Song, et al. (2018) — IEEE TPAMI
@serra2018overcomingarticleJoan Serra, Didac Suris, Marius Miron, et al. (2018) — ICML
@schwarz2018progressarticleJonathan Schwarz, Wojciech Czarnecki, Jelena Luketina (2018) — ICML
@chua2018deeparticleKurtland Chua, Roberto Calandra, Rowan McAllister, et al. (2018) — NeurIPS
@kriss2018adaptivearticleMichael Kriss, Michael Mitzenmacher, Sergei Vassilvitskii (2018) — ALENEX
@diaz2018dontarticleNatalia Diaz-Rodriguez, Vincenzo Lomonaco, David Filliat, et al. (2018) — NeurIPS Workshop
@micikevicius2018mixedarticlePaulius Micikevicius, Sharan Narang, Jonah Alben (2018) — ICLR
@velickovic2018gatarticlePetar Velickovic, Guillem Cucurull, Arantxa Casanova, et al. (2018) — ICLR
@battaglia2018relationalarticlePeter W. Battaglia, Jessica B. Hamrick, Victor Bapst, et al. (2018) — arXiv
@aljundi2018memoryarticleRahaf Aljundi, Francesca Babiloni, Mohamed Elhoseiny, et al. (2018) — ECCV
@sutton2018reinforcementbookRichard S. Sutton, Andrew G. Barto (2018) — MIT Press
@vershynin2018highbookRoman Vershynin (2018) — Cambridge University Press
@kraska2018casearticleTim Kraska, Alex Beutel, Ed H. Chi, et al. (2018) — SIGMOD
@hsu2018reevaluatingarticleYen-Chang Hsu, Yen-Cheng Liu, Anita Ramasamy, et al. (2018) — NeurIPS CL Workshop
@tassa2018deepmindarticleYuval Tassa, Yotam Doron, Alistair Muldal (2018) — arXiv
@yang2018hotpotqaarticleZhilin Yang, Peng Qi, Saizheng Zhang (2018) — EMNLP
@van2017neuralarticleAaron van den Oord, Oriol Vinyals, Koray Kavukcuoglu (2017) — NeurIPS
@gomez2017reversiblearticleAidan N. Gomez, Mengye Ren, Raquel Urtasun, et al. (2017) — NeurIPS
@dosovitskiy2017carlaarticleAlexey Dosovitskiy, German Ros, Felipe Codevilla, et al. (2017) — CoRL
@vaswani2017attentionarticleAshish Vaswani, Noam Shazeer, Niki Parmar, et al. (2017) — NeurIPS
@zoph2017neuralarticleBarret Zoph, Quoc V. Le (2017) — ICLR
@finn2017mamlarticleChelsea Finn, Pieter Abbeel, Sergey Levine (2017) — ICML
@finn2017modelarticleChelsea Finn, Pieter Abbeel, Sergey Levine (2017) — ICML
@lopezpaz2017gradientarticleDavid Lopez-Paz, Marc'Aurelio Ranzato (2017) — NeurIPS
@silver2017masteringarticleDavid Silver, Julian Schrittwieser, Karen Simonyan, et al. (2017) — Nature
@pathak2017curiosityarticleDeepak Pathak, Pulkit Agrawal, Alexei A. Efros, et al. (2017) — ICML
@oyallon2017scalingarticleEdouard Oyallon, Eugene Belilovsky, Sergey Zagoruyko (2017) — ICCV
@bach2017equivalencearticleFrancis Bach (2017) — JMLR
@zenke2017continualarticleFriedemann Zenke, Ben Poole, Surya Ganguli (2017) — ICML
@shin2017continualarticleHanul Shin, Jung Kwon Lee, Jaehong Kim, et al. (2017) — NeurIPS
@kirkpatrick2017overcomingarticleJames Kirkpatrick, Razvan Pascanu, Neil Rabinowitz (2017) — PNAS
@pennington2017nonlineararticleJeffrey Pennington, Pratik Worah (2017) — NeurIPS
@schulman2017ppoarticleJohn Schulman, Filip Wolski, Prafulla Dhariwal, et al. (2017) — arXiv
@larsen2017optimalityarticleKasper Green Larsen, Jelani Nelson (2017) — FOCS
@clarkson2017lowarticleKenneth L. Clarkson, David P. Woodruff (2017) — JACM
@joshi2017triviaqaarticleMandar Joshi, Eunsol Choi, Daniel S. Weld, et al. (2017) — ACL
@arjovsky2017wganarticleMartin Arjovsky, Soumith Chintala, Leon Bottou (2017) — ICML
@mitzenmacher2017probabilitybookMichael Mitzenmacher, Eli Upfal (2017) — Cambridge University Press
@watters2017visualarticleNicholas Watters, Daniel Zoran, Theophane Weber, et al. (2017) — NeurIPS
@shazeer2017outrageouslyarticleNoam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz (2017) — ICLR
@aljundi2017expertarticleRahaf Aljundi, Punarjay Chakravarty, Tinne Tuytelaars (2017) — CVPR
@chiappa2017recurrentarticleSilvia Chiappa, Sebastien Racaniere, Daan Wierstra, et al. (2017) — ICLR
@rebuffi2017icarlarticleSylvestre-Alvise Rebuffi, Alexander Kolesnikov, Georg Sperl, et al. (2017) — CVPR
@kipf2017gcnarticleThomas N. Kipf, Max Welling (2017) — ICLR
@lin2017fpnarticleTsung-Yi Lin, Piotr Dollar, Ross Girshick, et al. (2017) — CVPR
@lomonaco2017core50articleVincenzo Lomonaco, Davide Maltoni (2017) — CoRL
@lotter2017deeparticleWilliam Lotter, Gabriel Kreiman, David Cox (2017) — ICLR
@cao2017hashnetarticleZhangjie Cao, Mingsheng Long, Jianmin Wang, et al. (2017) — ICCV
@gittens2016revisitingarticleAlex Gittens, Michael W. Mahoney (2016) — JMLR
@rusu2016progressivearticleAndrei A. Rusu, Neil C. Rabinowitz, Guillaume Desjardins (2016) — arXiv
@poole2016exponentialarticleBen Poole, Subhaneil Lahiri, Maithra Raghu, et al. (2016) — NeurIPS
@finn2016unsupervisedarticleChelsea Finn, Ian Goodfellow, Sergey Levine (2016) — NeurIPS
@silver2016masteringarticleDavid Silver, Aja Huang, Chris J. Maddison, et al. (2016) — Nature
@kumaran2016whatarticleDharshan Kumaran, Demis Hassabis, James L. McClelland (2016) — Trends in Cognitive Sciences
@yu2016orthogonalarticleFelix X. Yu, Ananda Theertha Suresh, Krzysztof Choromanski, et al. (2016) — NeurIPS
@huang2016stochastic_deptharticleGao Huang, Yu Sun, Zhuang Liu, et al. (2016) — ECCV
@huang2016deeparticleGao Huang, Yu Sun, Zhuang Liu, et al. (2016) — ECCV
@liu2016deeparticleHaomiao Liu, Ruiping Wang, Shiguang Shan, et al. (2016) — CVPR
@jung2016lessarticleHeechul Jung, Jeongwoo Ju, Minju Jung, et al. (2016) — arXiv
@he2016deepinproceedingsKaiming He, Xiangyu Zhang, Shaoqing Ren, et al. (2016) — CVPR
@he2016resnetarticleKaiming He, Xiangyu Zhang, Shaoqing Ren, et al. (2016) — CVPR
@mathieu2016deeparticleMichael Mathieu, Camille Couprie, Yann LeCun (2016) — ICLR
@hardt2016trainarticleMoritz Hardt, Ben Recht, Yoram Singer (2016) — ICML
@battaglia2016interactionarticlePeter W. Battaglia, Razvan Pascanu, Matthew Lai, et al. (2016) — NeurIPS
@mallat2016understandingarticleStephane Mallat (2016) — Phil. Trans. R. Soc. A
@chen2016trainingarticleTianqi Chen, Bing Xu, Chiyuan Zhang, et al. (2016) — arXiv
@gal2016dropoutarticleYarin Gal, Zoubin Ghahramani (2016) — ICML
@andoni2015practicalarticleAlexandr Andoni, Piotr Indyk, Thijs Laarhoven, et al. (2015) — NeurIPS
@andoni2015optimalarticleAlexandr Andoni, Ilya Razenshteyn (2015) — STOC
@kingma2015adamarticleDiederik P. Kingma, Jimmy Ba (2015) — ICLR
@hinton2015distillingarticleGeoffrey Hinton, Oriol Vinyals, Jeff Dean (2015) — NeurIPS Workshop
@tropp2015introductionarticleJoel A. Tropp (2015) — Foundations and Trends in Machine Learning
@oh2015actionarticleJunhyuk Oh, Xiaoxiao Guo, Honglak Lee, et al. (2015) — NeurIPS
@chung2021rethinkingarticleJunyoung Chung, Sungjin Ahn, Yoshua Bengio (2015) — arXiv
@schmidhuber2015learningarticleJurgen Schmidhuber (2015) — arXiv
@he2015initarticleKaiming He, Xiangyu Zhang, Shaoqing Ren, et al. (2015) — ICCV
@he2015delvingarticleKaiming He, Xiangyu Zhang, Shaoqing Ren, et al. (2015) — ICCV
@srivastava2015unsupervisedarticleNitish Srivastava, Elman Mansimov, Ruslan Salakhutdinov (2015) — ICML
@ronneberger2015unetarticleOlaf Ronneberger, Philipp Fischer, Thomas Brox (2015) — MICCAI
@ioffe2015batchnormarticleSergey Ioffe, Christian Szegedy (2015) — ICML
@han2015learningarticleSong Han, Jeff Pool, John Tran, et al. (2015) — NeurIPS
@saxe2014exactarticleAndrew M. Saxe, James L. McClelland, Surya Ganguli (2014) — ICLR
@fan2014cuckooarticleBin Fan, Dave G. Andersen, Michael Kaminsky, et al. (2014) — CoNEXT
@woodruff2014sketchingarticleDavid P. Woodruff (2014) — Foundations and Trends in Theoretical Computer Science
@kingma2014adamarticleDiederik P. Kingma, Jimmy Ba (2014) — ICLR
@goodfellow2014generativearticleIan Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, et al. (2014) — NeurIPS
@anden2014deeparticleJoakim Anden, Stephane Mallat (2014) — IEEE Transactions on Signal Processing
@cho2014learningarticleKyunghyun Cho, Bart van Merrienboer, Caglar Gulcehre, et al. (2014) — EMNLP
@srivastava2014dropoutarticleNitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, et al. (2014) — JMLR
@bruna2013invariantarticleJoan Bruna, Stephane Mallat (2013) — IEEE TPAMI
@wan2013dropconnectarticleLi Wan, Matthew Zeiler, Sixin Zhang, et al. (2013) — ICML
@mermillod2013stabilityarticleMartial Mermillod, Aurélia Bugaiska, Patrick Bonin (2013) — Frontiers in Psychology
@le2013fastfoodarticleQuoc Le, Tamas Sarlos, Alexander Smola (2013) — ICML
@wager2013dropoutarticleStefan Wager, Sida Wang, Percy Liang (2013) — NeurIPS
@todorov2012mujocoarticleEmanuel Todorov, Tom Erez, Yuval Tassa (2012) — IROS
@mallat2012grouparticleStephane Mallat (2012) — Communications on Pure and Applied Mathematics
@deisenroth2011pilcoarticleMarc Peter Deisenroth, Carl Edward Rasmussen (2011) — ICML
@halko2011findingarticleNathan Halko, Per-Gunnar Martinsson, Joel A. Tropp (2011) — SIAM Review
@glorot2010initarticleXavier Glorot, Yoshua Bengio (2010) — AISTATS
@glorot2010understandingarticleXavier Glorot, Yoshua Bengio (2010) — AISTATS
@rahimi2009weightedarticleAli Rahimi, Benjamin Recht (2009) — NeurIPS
@candes2009exactarticleEmmanuel J. Candes, Benjamin Recht (2009) — Foundations of Computational Mathematics
@pearl2009causalitybookJudea Pearl (2009) — Cambridge University Press
@weinberger2009featurearticleKilian Weinberger, Anirban Dasgupta, John Langford, et al. (2009) — ICML
@mahoney2009curarticleMichael W. Mahoney, Petros Drineas (2009) — PNAS
@robertson2009probabilisticarticleStephen Robertson, Hugo Zaragoza (2009) — Foundations and Trends in Information Retrieval
@bengio2009curriculumarticleYoshua Bengio, Jerome Louradour, Ronan Collobert, et al. (2009) — ICML
@matousek2008variantsarticleJiří Matoušek (2008) — Random Structures & Algorithms
@bottou2008tradeoffsarticleLeon Bottou, Olivier Bousquet (2008) — NeurIPS
@abraham2008metaplasticityarticleWickliffe C. Abraham (2008) — Nature Reviews Neuroscience
@rahimi2007randomarticleAli Rahimi, Benjamin Recht (2007) — NeurIPS
@ji2007coordinatedarticleDaoyun Ji, Matthew A. Wilson (2007) — Nature Neuroscience
@flajolet2007hyperloglogarticlePhilippe Flajolet, Eric Fusy, Olivier Gandouet, et al. (2007) — DMTCS Proceedings
@andoni2006neararticleAlexandr Andoni, Piotr Indyk (2006) — FOCS
@donoho2006compressedarticleDavid L. Donoho (2006) — IEEE Transactions on Information Theory
@candes2006robustarticleEmmanuel J. Candes, Justin Romberg, Terence Tao (2006) — IEEE Transactions on Information Theory
@ailon2006fastarticleNir Ailon, Bernard Chazelle (2006) — STOC
@efraimidis2006weightedarticlePavlos S. Efraimidis, Paul G. Spirakis (2006) — Information Processing Letters
@ferguson2006optimalbookThomas S. Ferguson (2006) — Mathematics Department, UCLA
@cormode2005improvedarticleGraham Cormode, S. Muthukrishnan (2005) — Journal of Algorithms
@charikar2004findingarticleMoses Charikar, Kevin Chen, Martin Farach-Colton (2004) — Theoretical Computer Science
@achlioptas2003databasearticleDimitris Achlioptas (2003) — Journal of Computer and System Sciences
@dasgupta2003elementaryarticleSanjoy Dasgupta, Anupam Gupta (2003) — Random Structures & Algorithms
@charikar2002similarityarticleMoses S. Charikar (2002) — STOC
@williams2001nystromarticleChristopher K.I. Williams, Matthias Seeger (2001) — NeurIPS
@french1999catastrophicarticleRobert M. French (1999) — Trends in Cognitive Sciences
@indyk1998approximatearticlePiotr Indyk, Rajeev Motwani (1998) — STOC
@broder1997resemblancearticleAndrei Z. Broder (1997) — SEQUENCES
@karger1997consistentarticleDavid Karger, Eric Lehman, Tom Leighton (1997) — STOC
@mcclelland1995therearticleJames L. McClelland, Bruce L. McNaughton, Randall C. O'Reilly (1995) — Psychological Review
@motwani1995randomizedbookRajeev Motwani, Prabhakar Raghavan (1995) — Cambridge University Press
@robertson1995okapiarticleStephen E. Robertson, Steve Walker, Susan Jones, et al. (1995) — TREC
@jordan1994hierarchicalarticleMichael I. Jordan, Robert A. Jacobs (1994) — Neural Computation
@hornik1991approximationarticleKurt Hornik (1991) — Neural Networks
@jacobs1991adaptivearticleRobert A. Jacobs, Michael I. Jordan, Steven J. Nowlan, et al. (1991) — Neural Computation
@sutton1990integratedarticleRichard S. Sutton (1990) — ICML
@ratcliff1990connectionistarticleConnectionist Models of Recognition Memory: Constraints Imposed by Learning and Forgetting Functions
Roger Ratcliff (1990) — Psychological Review
@cybenko1989approximationarticleGeorge Cybenko (1989) — Mathematics of Control, Signals and Systems
@mccloskey1989catastrophicarticleMichael McCloskey, Neal J. Cohen (1989) — Psychology of Learning and Motivation
@mallat1989theoryarticleStephane Mallat (1989) — IEEE Transactions on Pattern Analysis and Machine Intelligence
@vitter1985randomarticleJeffrey S. Vitter (1985) — ACM Transactions on Mathematical Software
@johnson1984extensionsarticleWilliam B. Johnson, Joram Lindenstrauss (1984) — Contemporary Mathematics
@johnsonlaird1983mentalbookPhilip N. Johnson-Laird (1983) — Harvard University Press
@oja1982simplifiedarticleErkki Oja (1982) — Journal of Mathematical Biology
@grossberg1980howarticleStephen Grossberg (1980) — Psychological Review
@vapnik1971uniformarticleVladimir N. Vapnik, Alexey Ya. Chervonenkis (1971) — Theory of Probability and Its Applications
@bloom1970spacearticleBurton H. Bloom (1970) — Communications of the ACM
@robbins1951stochasticarticleHerbert Robbins, Sutton Monro (1951) — Annals of Mathematical Statistics
@kullback1951informationarticleSolomon Kullback, Richard A. Leibler (1951) — Annals of Mathematical Statistics
@craik1943naturebookKenneth J. W. Craik (1943) — Cambridge University Press
@turing1936computablearticleAlan M. Turing (1936) — Proceedings of the London Mathematical Society
@kolmogorov1933foundationsbookAndrey N. Kolmogorov (1933) — Julius Springer