Northwestern Events Calendar

Apr
15
2024

CS Seminar: Simpler Machine Learning Models for a Complicated World (Cynthia Rudin)

When: Monday, April 15, 2024
12:15 PM - 1:15 PM CT

Where: Kellogg Global Hub, KGH 1120, 2211 Campus Drive, Evanston, IL 60208 map it

Audience: Faculty/Staff - Student - Post Docs/Docs - Graduate Students

Cost: free

Contact: Wynante R Charles   (847) 467-8174

Group: Department of Computer Science (CS)

Category: Academic

Description:

In collaboration with the Kellogg Operations Department

Monday / CS Seminar
April 15th / 12:15 PM
In Person / Kellogg Global Hub 1120

Speaker
Cynthia Rudin, Duke University

Talk Title
Simpler Machine Learning Models for a Complicated World

Abstract
While the trend in machine learning has tended towards building more complicated (black box) models, such models have not shown any performance advantages for many real-world datasets, and they are more difficult to troubleshoot and use. For these datasets, simpler models (sometimes small enough to fit on an index card) can be just as accurate. However, the design of interpretable models is quite challenging due to the "interaction bottleneck" where domain experts must interact with machine learning algorithms.

I will present a new paradigm for interpretable machine learning that solves the interaction bottleneck. In this paradigm, machine learning algorithms are not focused on finding a single optimal model, but instead capture the full collection of good (i.e., low-loss) models, which we call "the Rashomon set." Finding Rashomon sets is extremely computationally difficult, but the benefits are massive. I will present the first algorithm for finding Rashomon sets for a nontrivial function class (sparse decision trees) called TreeFARMS. TreeFARMS, along with its user interface TimberTrek, mitigate the interaction bottleneck for users. TreeFARMS also allows users to incorporate constraints (such as fairness constraints) easily.

I will also present a "path," that is, a mathematical explanation, for the existence of simpler-yet-accurate models and the circumstances under which they arise. In particular, problems where the outcome is uncertain tend to admit large Rashomon sets and simpler models. Hence, the Rashomon set can shed light on the existence of simpler models for many real-world high-stakes decisions. This conclusion has significant policy implications, as it undermines the main reason for using black box models for decisions that deeply affect people's lives.

This is joint work with my colleagues Margo Seltzer and Ron Parr, as well as our exceptional students Chudi Zhong, Lesia Semenova, Jiachang Liu, Rui Xin, Zhi Chen, and Harry Chen. It builds upon the work of many past students and collaborators over the last decade.

Here are papers I will discuss in the talk:

Rui Xin, Chudi Zhong, Zhi Chen, Takuya Takagi, Margo Seltzer, Cynthia Rudin
Exploring the Whole Rashomon Set of Sparse Decision Trees, NeurIPS (oral), 2022.
https://arxiv.org/abs/2209.08040

Zijie J. Wang, Chudi Zhong, Rui Xin, Takuya Takagi, Zhi Chen, Duen Horng Chau, Cynthia Rudin, Margo Seltzer
TimberTrek: Exploring and Curating Sparse Decision Trees with Interactive Visualization, IEEE VIS, 2022.
https://poloclub.github.io/timbertrek/

Lesia Semenova, Cynthia Rudin, and Ron Parr
On the Existence of Simpler Machine Learning Models. ACM Conference on Fairness, Accountability, and Transparency (ACM FAccT), 2022.
https://arxiv.org/abs/1908.01755

Lesia Semenova, Harry Chen, Ronald Parr, Cynthia Rudin
A Path to Simpler Models Starts With Noise, NeurIPS, 2023.
https://arxiv.org/abs/2310.19726

Biography
TBA

Add to Calendar

Add Event To My Group:

Please sign-in