About
I am a principal researcher at Microsoft Research, Redmond, working on computer use agents and efficient pre- and post-training methods for large vision-language models. Check out our recent work on the computer use agent OmniParser (ranked the #1 trending repo on GitHub and the HuggingFace model hub, 24k+ stars) and on scaling synthetic trajectory data for web agents.