Appl Math: Melanie Mitchell on "Six Principles For Evaluating Cognitive Capabilities in AI Models" - Reiss Lecture

Thursday, May 14, 2026 | 11:15 AM - 12:15 PM CT
Technological Institute, M416, 2145 Sheridan Road, Evanston, IL 60208
Webcast Link (Hybrid)

Title: Six Principles For Evaluating Cognitive Capabilities in AI Models - Reiss Lecture

Speaker: Melanie Mitchell, Santa Fe Institute

Abstract: Modern AI systems have exceeded human performance on many benchmarks meant to evaluate general cognitive capacities. However, it is often the case that benchmark performance does a poor job of predicting general capacities in real-world settings. In this article I describe several issues related to evaluation that can cause this mismatch, and propose six principles, inspired by developmental and comparative psychology, that need to be adopted to enable rigorous evaluation for AI systems. These principles are illustrated by case studies from the psychology and AI literature.

Zoom: https://northwestern.zoom.us/j/96257055494

-----

To subscribe to the Applied Mathematics Colloquia List, send a message to LISTSERV@LISTSERV.IT.NORTHWESTERN.EDU with the command:

SUBSCRIBE esam-seminar FirstName LastName

Cost: Free

Audience

  • Faculty/Staff
  • Student
  • Public
  • Post Docs/Docs
  • Graduate Students

Contact

Noor Kaur

Interest

  • Academic (general)
