Publication MathScale: Scaling Instruction Tuning for Mathematical Reasoning Zhengyang Tang, Xingxing Zhang, Benyou Wang, Furu Wei ICML 2024 | March 2024
Publication Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning Yiming Huang, Xiao Liu, Yeyun Gong, Zhibin Gou, Yelong Shen, Nan Duan, Weizhu Chen March 2024
Publication ResLoRA: Identity Residual Mapping in Low-Rank Adaption Shuhua Shi, Shaohan Huang, Minghui Song, Zhoujun Li, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang February 2024
Publication The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Shuming Ma, Hongyu Wang, Lingxiao Ma, Lei Wang, Wenhui Wang, Shaohan Huang, Lifeng Dong, Ruiping Wang, Jilong Xue, Furu Wei February 2024 Work in progress
Publication Towards Optimal Learning of Language Models Yuxian Gu, Li Dong, Yaru Hao, Qingxiu Dong, Minlie Huang, Furu Wei February 2024
Publication C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory Tianjiao Luo, Tim Pearce, Huayu Chen, Jianfei Chen, Jun Zhu NeurIPS 2024 | February 2024
Publication Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models Tianyi Tang, Wenyang Luo, Haoyang Huang, Dongdong Zhang, Xiaolei Wang, Xin Zhao, Furu Wei, Ji-Rong Wen ACL 2024 | February 2024
Publication LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Yiran Ding, Li Lyna Zhang, Chengruidong Zhang, Yuanyuan Xu, Ning Shang, Jiahang Xu, Fan Yang, Mao Yang ICML 2024 | February 2024 Github
Publication DyVal 2: Dynamic Evaluation of Large Language Models by Meta Probing Agents Kaijie Zhu, Jindong Wang, Qinlin Zhao, Ruochen Xu, Xing Xie ICML 2024 | February 2024
Publication Slot-VLM: SlowFast Slots for Video-Language Modeling Jiaqi Xu, Cuiling Lan, Wenxuan Xie, Xuejin Chen, Yan Lu NeurIPS 2024 | February 2024