Fall 2025 Seminar Series
Department of Statistics and Data Science 2025-2026 Seminar Series - Fall 2025
The 2025-2026 Seminar Series will primarily be in person, but some talks will be offered virtually using Zoom. Talks that are virtual will be clearly designated and registration for the Zoom talks will be required to receive the zoom link for the event. Please email Kisa Kowal at k-kowal@northwestern.edu if you have questions.
Seminar Series talks are free and open to faculty, graduate students, and advanced undergraduate students
Do Large Language Models (Really) Need Statistical Foundations?
Friday, October 17, 2025
Time: 11:00 a.m. to 12:00 p.m. central time
Location: Ruan Conference Room – lower level (Chambers Hall 600 Foster Street)
Speaker: Weijie Su, Associate Professor, Wharton Statistics and Data Science Department, University of Pennsylvania
Abstract: In this talk, we advocate for developing statistical foundations for large language models (LLMs). We begin by examining two key characteristics that necessitate statistical perspectives for LLMs: (1) the probabilistic, autoregressive nature of next-token prediction, and (2) the inherent complexity and black box nature of Transformer architectures. To demonstrate how statistical insights can advance LLM development and applications, we present two examples. First, we demonstrate statistical inconsistencies and biases arising from the current approach to aligning LLMs with human preference. We propose a regularization term for aligning LLMs that is both necessary and sufficient to ensure consistent alignment. Second, we introduce a novel statistical framework for analyzing the efficacy of watermarking schemes, with a focus on a watermarking scheme developed by OpenAI for which we derive optimal detection rules that outperform existing ones. Time permitting, we will explore how statistical principles can inform rigorous evaluation for LLMs. Collectively, these findings demonstrate how statistical insights can effectively address several pressing challenges emerging from LLMs. This talk is based on arXiv:2404.01245, 2405.16455, 2503.10990, 2505.19145, and 2506.12350.
This talk will be given in person on Northwestern's Evanston campus.
planitpurple.northwestern.edu/event/632777
TBA
Friday, October 24, 2025
Time: 11:00 a.m. to 12:00 p.m. central time
Location: Ruan Conference Room – lower level (Chambers Hall 600 Foster Street)
Speaker: Youngtak Sohn, Assistant Professor, Applied Mathematics, Brown University
Abstract: TBA
This talk will be given in person on Northwestern's Evanston campus.
Trees against gerrymandering
Friday, October 31, 2025
Time: 11:00 a.m. to 12:00 p.m. central time
Location: Ruan Conference Room – lower level (Chambers Hall 600 Foster Street)
Speaker: Moon Duchin, Professor of Computer Science and Data Science, University of Chicago
Abstract: Motivated by the study of political redistricting, many mathematicians have gotten interested in sampling algorithms for graph partitions. (In this case the graph is a contact network of geographic units in a state.) There has been quite a lot of recent progress developing spanning-tree methods to do the sampling, and I'll survey some of what is and is not known. Bonus: I'll show you how this is being used in the current redistricting court cases in Texas!
This talk will be given in person on Northwestern's Evanston campus at the location listed above.
planitpurple.northwestern.edu/event/632793
TBA
Friday, November 7, 2024
Time: 11:00 a.m. to 12:00 p.m. central time
Location: Ruan Conference Room – lower level (Chambers Hall 600 Foster Street)
Speaker: TBA
TBA
Friday, November 14, 2025
Time: 11:00 a.m. to 12:00 p.m. central time
Location: Ruan Conference Room – lower level (Chambers Hall 600 Foster Street)
Speaker: Adityanarayanan Radhakrishnan, Assistant Professor of Mathematics, Massachusetts Institute of Technology
Abstract: TBA
This talk will be given in person on Northwestern's Evanston campus.
TBA
Friday, November 21, 2025
Time: 11:00 a.m. to 12:00 p.m. central time
Location: Ruan Conference Room – lower level (Chambers Hall 600 Foster Street)
Speaker: TBA