News | Yuting Ning

Feb 10, 2026	Excited to share our latest work on agent safety about Misaligned Actions and Unintended Behaviors in computer-use agents!
Sep 22, 2025	Glad to share 💭WebDreamer is accepted to TMLR!
Sep 18, 2025	🔍Mind2Web 2 is accepted to NeurIPS 2025! See you in San Diego 🌊!
Jun 26, 2025	🔍Mind2Web 2 is released! A rigorous agentic search benchmark with long-horizon tasks and Agent-as-a-Judge evaluation.
Apr 10, 2025	Excited to share my first project during my PhD - 💭WebDreamer, using your LLMs as a web world model for model-based planning.
Oct 07, 2024	Check out our 🔬ScienceAgentBench, a new benchmark to rigorously assess language agents for data-driven scientific discovery.
Aug 20, 2024	Start my PhD at OSU! 🥳
Jun 10, 2024	Check out 📑EduNLP: a unified and modularized library for educational resources. Don’t forget to star our project!
May 24, 2024	Check out our world model 🪄Pandora!
Nov 14, 2023	Excited to share our new work In Search of the Long-Tail: Systematic Generation of Long-Tail Knowledge via Logical Rule Guided Search!
Jul 25, 2023	I am joining INK Lab at USC as a visiting graduate student, working with Prof. Xiang Ren!
Jun 20, 2023	Our work Efficiently Measuring the Cognitive Ability of LLMs: An Adaptive Testing Perspective is now available at arXiv.
Feb 09, 2023	Our report paper A Novel Approach for Auto-Formulation of Optimization Problems for NL4Opt NeurIPS 2022 competition is now available at arXiv.
Dec 08, 2022	Our team, Long, won the 4th and 3rd place for two subtasks respectively in NL4Opt NeurIPS 2022 competition. I will present our solutions in the workshop! Check out our poster and slides.
Nov 19, 2022	Our paper Towards a Holistic Understanding of Mathematical Questions with Contrastive Pre-training is accepted by AAAI 2023.