Statistics and Data Science Seminar: "Do Large Language Models (Really) Need Statistical Foundations?"

Name: Statistics and Data Science Seminar: "Do Large Language Models (Really) Need Statistical Foundations?"
Start: 2025-10-17T11:00:00-05:00
End: 2025-10-17T12:00:00-05:00
Location: Chambers Hall, Ruan Conference Room – lower level

Friday, October 17, 2025 | 11:00 AM - 12:00 PM CT

Chambers Hall, Ruan Conference Room – lower level, 600 Foster St, Evanston, IL 60208 map it

Do Large Language Models (Really) Need Statistical Foundations?

Weijie Su, Associate Professor, Wharton Statistics and Data Science Department, University of Pennsylvania

Abstract: In this talk, we advocate for developing statistical foundations for large language models (LLMs). We begin by examining two key characteristics that necessitate statistical perspectives for LLMs: (1) the probabilistic, autoregressive nature of next-token prediction, and (2) the inherent complexity and black box nature of Transformer architectures. To demonstrate how statistical insights can advance LLM development and applications, we present two examples. First, we demonstrate statistical inconsistencies and biases arising from the current approach to aligning LLMs with human preference. We propose a regularization term for aligning LLMs that is both necessary and sufficient to ensure consistent alignment. Second, we introduce a novel statistical framework for analyzing the efficacy of watermarking schemes, with a focus on a watermarking scheme developed by OpenAI for which we derive optimal detection rules that outperform existing ones. Time permitting, we will explore how statistical principles can inform rigorous evaluation for LLMs. Collectively, these findings demonstrate how statistical insights can effectively address several pressing challenges emerging from LLMs. This talk is based on arXiv:2404.01245, 2405.16455, 2503.10990, 2505.19145, and 2506.12350.

Cost: free

Audience

Faculty/Staff
Student
Post Docs/Docs
Graduate Students

Contact

Kisa Kowal
(847) 491-3974
Email

Group

Department of Statistics and Data Science

Interest

Academic (general)

Statistics and Data Science Seminar: "Do Large Language Models (Really) Need Statistical Foundations?"

Audience

Contact

Group

Interest

Add Event To My Group