JING-JING LI

BUILDING SAFE INTELLIGENT SYSTEMS

PhD Student at UC Berkeley

Cognitive Scientist • AI Safety Researcher

prof_pic.jpg

JING-JING LI

RESEARCH

My research interests center on understanding intelligence—both natural and artificial—and building more capable and safe AI models.

I’m a rising 5th-year PhD student at UC Berkeley advised by Professor Anne Collins. My thesis investigates how intelligent agents efficiently learn, reason, plan, and make decisions in complex, dynamic environments—insights I derive from modeling human behavior. My goal is to reverse-engineer these effective learning algorithms to inform the development of next-generation AI.

I’m also committed to ensuring that the AI systems we build are safe and aligned with human values. Currently, I’m researching AI agent safety as an Applied Scientist Intern at Amazon Web Services Agentic AI. Previously, as a Research Intern at the Allen Institute for AI, I led a project on improving the interpretability and transparency of the safety moderation of AI behavior.

NEWS

Jul 13, 2025
Presenting my work on AI safety at ICML in Vancouver!
May 16, 2025
Started my internship at AWS Agentic AI in Seattle!
Apr 04, 2025
New work on exploration under uncertainty accepted as a spotlight at RLDM and a talk at CogSci!
Oct 04, 2024
Presenting my newly accepted Cognition paper at SfN24 on the TPDA award!
Aug 06, 2024
Two projects featured at the CCN conference in Boston.

SELECTED PUBLICATIONS

  1. safetyanalyst.jpg
    SafetyAnalyst: Interpretable, transparent, and steerable safety moderation for AI behavior
    Jing-Jing Li, Valentina Pyatkin, Max Kleiman-Weiner, Liwei Jiang, Nouha Dziri, Anne G. E. Collins, Jana Schaich Borg, Maarten Sap, Yejin Choi, and Sydney Levine
    ICML, 2025 Conference AI Safety LLM
  2. learning_hierarchy.jpg
    An algorithmic account for how humans efficiently learn, transfer, and compose hierarchically structured decision policies
    Jing-Jing Li, and Anne Collins
    Cognition, 2025 Journal Human Intelligence Reinforcement Learning Computational Modeling
  3. dynamic_noise_estimation.jpg
    Dynamic noise estimation: A generalized method for modeling noise fluctuations in decision-making
    Jing-Jing Li, Chengchun Shi, Lexin Li, and Anne GE Collins
    Journal of Mathematical Psychology, 2024 Journal Computational Modeling Method Development