Yuting Ning

photo.jpg

I am a second-year PhD student at The Ohio State University (OSU), advised by Prof. Huan Sun and working closely with Prof. Yu Su. Previously, I obtained my Master’s and Bachelor’s degrees from University of Science and Technology of China (USTC), advised by Prof. Enhong Chen.

I am broadly interested in NLP and related fields. My current research focuses on language agents, with the goal of building and trustworthy AI agents that can reliably assist humans by automating complex or tedious tasks. Recently, my work has explored world model–based planning for web agents (WebDreamer), evaluation of agents on long-horizon web search tasks (Mind2Web 2) and scientific discovery tasks (ScienceAgentBench), and agent safety.

Feel free to reach out if you are interested in my research or potential collaborations.

📢 I am looking for a research internship for Summer 2026. Feel free to reach out if you have any opportunities!

News

Sep 22, 2025 Glad to share 💭WebDreamer is accepted to TMLR!
Sep 18, 2025 🔍Mind2Web 2 is accepted to NeurIPS 2025! See you in San Diego 🌊!
Jun 26, 2025 🔍Mind2Web 2 is released! A rigorous agentic search benchmark with long-horizon tasks and Agent-as-a-Judge evaluation.
Apr 10, 2025 Excited to share my first project during my PhD - 💭WebDreamer, using your LLMs as a web world model for model-based planning.
Oct 07, 2024 Check out our 🔬ScienceAgentBench, a new benchmark to rigorously assess language agents for data-driven scientific discovery.

Selected publications

  1. NeurIPS
    Mind2Web2.png
    Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
    Boyu Gou*, Zanming Huang*, Yuting Ning*, Yu Gu, Michael Lin, Botao Yu, Andrei Kopanev, Weijian Qi, Yiheng Shu, Jiaman Wu, Chan Hee Song, Bernal Jimenez Gutierrez, Yifei Li, Zeyi Liao, Hanane Nour Moussa, Tianshu Zhang, Jian Xie, Tianci Xue, Shijie Chen, Boyuan Zheng, Kai Zhang, Zhaowei Cai, Viktor Rozgic, Morteza Ziyadi, Huan Sun, and Yu Su
    In The Thirty-ninth Annual Conference on Neural Information Processing Systems Datasets and Benchmarks Track, 2025
  2. TMLR
    WebDreamer.png
    Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents
    Yu Gu*, Kai Zhang*, Yuting Ning*, Boyuan Zheng*, Boyu Gou, Tianci Xue, Cheng Chang, Sanjari Srivastava, Yanan Xie, Peng Qi, and others
    Transactions on Machine Learning Research, 2025
  3. Preprint
    early_experience.png
    Agent Learning via Early Experience
    Kai Zhang, Xiangchao Chen, Bo Liu, Tianci Xue, Zeyi Liao, Zhihan Liu, Xiyao Wang, Yuting Ning, Zhaorun Chen, Xiaohan Fu, Jian Xie, Yuxuan Sun, Boyu Gou, Qi Qi, Zihang Meng, Jianwei Yang, Ning Zhang, Xian Li, Ashish Shah, Dat Huynh, Hengduo Li, Zi Yang, Sara Cao, Lawrence Jang, Shuyan Zhou, Jiacheng Zhu, Huan Sun, Jason Weston, Yu Su, and Yifan Wu
    arXiv preprint arXiv:2510.08558, 2025
  4. ICLR
    ScienceAgentBench.png
    ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
    Ziru Chen, Shijie Chen, Yuting Ning, Qianheng Zhang, Boshi Wang, Botao Yu, Yifei Li, Zeyi Liao, Chen Wei, Zitong Lu, and others
    The Thirteenth International Conference on Learning Representations, 2025
  5. Preprint
    Pandora.png
    Pandora: Towards General World Model with Natural Language Actions and Video States
    Jiannan Xiang*, Guangyi Liu*, Yi Gu*, Qiyue Gao, Yuting Ning, Yuheng Zha, Zeyu Feng, Tianhua Tao, Shibo Hao, Yemin Shi, Zhengzhong Liu, Eric P. Xing, and Zhiting Hu
    2024
  6. EMNLP
    LINK.png
    In Search of the Long-Tail: Systematic Generation of Long-Tail Knowledge via Logical Rule Guided Search
    Huihan Li, Yuting Ning, Zeyi Liao, Siyuan Wang, Xiang Lorraine Li, Ximing Lu, Faeze Brahman, Wenting Zhao, Yejin Choi, and Xiang Ren
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
▶ nnnyt.github.io's clustrmaps 🌎