Sep 22, 2025 | Glad to share that WebDreamer has been accepted to TMLR! |
Sep 18, 2025 | Mind2Web 2 has been accepted to NeurIPS 2025! See you in San Diego! |
Jun 26, 2025 | Mind2Web 2 is released! A rigorous agentic search benchmark with long-horizon tasks and Agent-as-a-Judge evaluation. |
Apr 10, 2025 | Excited to share the first project of my PhD: WebDreamer, which uses your LLMs as a web world model for model-based planning. |
Oct 07, 2024 | Check out ScienceAgentBench, our new benchmark to rigorously assess language agents for data-driven scientific discovery. |
Aug 20, 2024 | Starting my PhD at OSU! 🥳 |
Jun 10, 2024 | Check out EduNLP: a unified and modularized library for educational resources. Don't forget to star our project! |
May 24, 2024 | Check out our world model Pandora! |
Nov 14, 2023 | Excited to share our new work In Search of the Long-Tail: Systematic Generation of Long-Tail Knowledge via Logical Rule Guided Search! |
Jul 25, 2023 | I am joining INK Lab at USC as a visiting graduate student, working with Prof. Xiang Ren! |
Jun 20, 2023 | Our work Efficiently Measuring the Cognitive Ability of LLMs: An Adaptive Testing Perspective is now available on arXiv. |
Feb 09, 2023 | Our report paper A Novel Approach for Auto-Formulation of Optimization Problems for the NL4Opt NeurIPS 2022 competition is now available on arXiv. |
Dec 08, 2022 | Our team, Long, won 4th and 3rd place, respectively, in the two subtasks of the NL4Opt NeurIPS 2022 competition. I will present our solutions at the workshop! Check out our poster and slides. |
Nov 19, 2022 | Our paper Towards a Holistic Understanding of Mathematical Questions with Contrastive Pre-training has been accepted to AAAI 2023. |