News

Sep 22, 2025 Glad to share πŸ’­WebDreamer is accepted to TMLR!
Sep 18, 2025 πŸ”Mind2Web 2 is accepted to NeurIPS 2025! See you in San Diego 🌊!
Jun 26, 2025 πŸ”Mind2Web 2 is released! A rigorous agentic search benchmark with long-horizon tasks and Agent-as-a-Judge evaluation.
Apr 10, 2025 Excited to share my first project during my PhD - πŸ’­WebDreamer, using your LLMs as a web world model for model-based planning.
Oct 07, 2024 Check out our πŸ”¬ScienceAgentBench, a new benchmark to rigorously assess language agents for data-driven scientific discovery.
Aug 20, 2024 Start my PhD at OSU! πŸ₯³
Jun 10, 2024 Check out πŸ“‘EduNLP: a unified and modularized library for educational resources. Don’t forget to star our project!
May 24, 2024 Check out our world model πŸͺ„Pandora!
Nov 14, 2023 Excited to share our new work In Search of the Long-Tail: Systematic Generation of Long-Tail Knowledge via Logical Rule Guided Search!
Jul 25, 2023 I am joining INK Lab at USC as a visiting graduate student, working with Prof. Xiang Ren!
Jun 20, 2023 Our work Efficiently Measuring the Cognitive Ability of LLMs: An Adaptive Testing Perspective is now available at arXiv.
Feb 09, 2023 Our report paper A Novel Approach for Auto-Formulation of Optimization Problems for NL4Opt NeurIPS 2022 competition is now available at arXiv.
Dec 08, 2022 Our team, Long, won the 4th and 3rd place for two subtasks respectively in NL4Opt NeurIPS 2022 competition. I will present our solutions in the workshop! Check out our poster and slides.
Nov 19, 2022 Our paper Towards a Holistic Understanding of Mathematical Questions with Contrastive Pre-training is accepted by AAAI 2023.
β–Ά nnnyt.github.io's clustrmaps 🌎