Publication Text Embeddings by Weakly-Supervised Contrastive Pre-training Liang Wang, Nan Yang, Xiaolong Huang, Binxing Jiao, Linjun Yang, Daxin Jiang (姜大昕), Rangan Majumder, Furu Wei December 2022
Publication SLATE: A Sequence Labeling Approach for Task Extraction from Free-form Inked Content Apurva Gandhi, Ryan Serrao, Biyi Fang, Gilbert Antonius, Jenna Hong, My Nguyen, Sheng Yi, Ehi Nosakhare, Irene Shaffer, Soundar Srinivasan, Vivek Gupta 2022 Empirical Methods in Natural Language Processing | December 2022
Publication Random-LTD: Random and Layerwise Token Dropping Brings Efficient Training for Large-scale Transformers Zhewei Yao, Xiaoxia Wu, Conglong Li, Connor Holmes, Minjia Zhang, Cheng Li, Yuxiong He November 2022
Publication Disparate Impacts on Online Information Access during the COVID-19 Pandemic Jina Suh, Eric Horvitz, Ryen W. White, Tim Althoff Nature Communications | November 2022, Vol 13 Project
Publication OOD-DiskANN: Efficient and Scalable Graph ANNS for Out-of-Distribution Queries Harsha Simhadri November 2022
Publication Using Interventions to Improve Out-of-Distribution Generalization of Text-Matching Recommendation Systems Parikshit Bansal, Yashoteja Prabhu, Emre Kiciman, Amit Sharma NeurIPS 2022 Workshop on Distribution Shifts (DistShift) | November 2022
Publication COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning Yue Yu, Chenyan Xiong, Si Sun, Chao Zhang, Arnold Overwijk EMNLP 2022 | October 2022
Publication SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval Kun Zhou, Yeyun Gong, Xiao Liu, Wayne Xin Zhao, Yelong Shen, Anlei Dong, Jingwen Lu, Rangan Majumder, Ji-Rong Wen, Nan Duan, Weizhu Chen EMNLP 2022 | October 2022
Publication Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder Zhenghao Liu, Han Zhang, Chenyan Xiong, Zhiyuan Liu, Yu Gu, Xiaohua Li EMNLP 2022 | October 2022
Publication Information decay and enzymatic information recovery for DNA data storage Linda C. Meiser, Andreas L. Gimpel, Tejas Deshpande, Gabriela Libort, Weida D. Chen, Reinhard Heckel, Bichlien Nguyen, Karin Strauss, Wendelin J. Stark, Robert N. Grass Communications Biology | October 2022 Project