Jing-Jing Li

prof_pic.jpg

I’m a 4th-year PhD student at UC Berkeley advised by Professor Anne Collins working at the intersection of cognitive science, neuroscience, and AI. My thesis focuses on interpreting the computational principles of human intelligence through the lens of learning and decision-making. I use computational cognitive modeling to reverse-engineer the algorithms implemented by the human brain to navigate complex, dynamic learning environments. Particularly, I’m interested in how humans learn complex decision structures and flexibly transfer them to solve new problems.

I’m also passionate about AI safety. As a Research Intern at the Allen Institute for AI in Summer 2024, I led a project on improving the interpretability of LLM safety moderation using an approach inspired by world models and cost-benefit analysis.

news

Oct 04, 2024 Presenting my newly accepted Cognition paper at SfN24 on the TPDA award!
Aug 06, 2024 Two projects featured at the CCN conference in Boston.
May 13, 2024 Started my research internship at AI2!

selected publications

  1. SafetyAnalyst: Interpretable, transparent, and steerable safety moderation for AI behavior
    Jing-Jing Li, Valentina Pyatkin, Max Kleiman-Weiner, Liwei Jiang, Nouha Dziri, Anne G. E. Collins, Jana Schaich Borg, Maarten Sap, Yejin Choi, and Sydney Levine
    2024
  2. An algorithmic account for how humans efficiently learn, transfer, and compose hierarchically structured decision policies
    Jing-Jing Li, and Anne Collins
    Cognition, 2025
  3. Dynamic noise estimation: A generalized method for modeling noise fluctuations in decision-making
    Jing-Jing Li, Chengchun Shi, Lexin Li, and Anne GE Collins
    Journal of Mathematical Psychology, 2024