Publication Training-free Spatially Grounded Geometric Shape Encoding (Technical Report) Yuhan He April 2026
Publication FlowInOne:Unifying Multimodal Generation as Image-in, Image-out Flow Matching Junchao Yi, Rui Zhao, Jiahao Tang, Weixian Lei, Linjie Li, Qi Su, Zhengyuan Yang, Lijuan Wang, Xiaofeng Zhu, Alex Jinpeng Wang April 2026
Publication FlowInOne:Unifying Multimodal Generation as Image-in, Image-out Flow Matching Junchao Yi, Rui Zhao, Jiahao Tang, Weixian Lei, Linjie Li, Qi Su, Zhengyuan Yang, Lijuan Wang, Xiaofeng Zhu, Alex Jinpeng Wang April 2026
Publication FunRec: Reconstructing Functional 3D Scenes from Egocentric Interaction Videos Alexandros Delitzas, Chenyangguang Zhang, Alexey Gavryushin, T. Mario, Boyang Sun, Rishabh Dabral, Leonidas J. Guibas, C. Theobalt, Marc Pollefeys, Francis Engelmann, Dániel Baráth April 2026
Publication TORA: Topological Representation Alignment for 3D Shape Assembly Nahyuk Lee, Zhiang Chen, Marc Pollefeys, Sung‐Jin Hong April 2026
Publication FunFact: Building Probabilistic Functional 3D Scene Graphs via Factor-Graph Reasoning Zhengyu Fu, Ren'e Zurbrugg, Kaixian Qu, Marc Pollefeys, Marco Hutter, Hermann Blum, Z. Bauer April 2026
Publication Learning Additively Compositional Latent Actions for Embodied AI Hangxing Wei, Xiaoyu Chen, Chuheng Zhang, Tim Pearce, Jianyu Chen, Alex Lamb, Li Zhao, Jiang Bian April 2026
Publication DynaVid: Learning to Generate Highly Dynamic Videos using Synthetic Motion Data Wonjoon Jin, J. Won, Janghyeok Han, Qi Dai, Chong Luo, Seung-Hwan Baek, Sunghyun Cho April 2026
Publication GeoAI Agency Primitives Akram Zaytar, Rohan Sawahn, Caleb Robinson, Gilles Quentin Hacheme, Girmaw Abebe Tadesse, I. Becker-Reshef, Rahul Dodhia, Juan M. Lavista Ferres April 2026
Publication STRIVE: Structured Spatiotemporal Exploration for Reinforcement Learning in Video Question Answering E. Bahrami, Olga Zatsarynna, Parth Pathak, Sunando Sengupta, Juergen Gall, Mohsen Fayyaz April 2026