CANCELLED
When:
Wednesday, January 30, 2019
11:00 AM - 12:00 PM CT
Where: 2006 Sheridan Road, B02, 2006 Sheridan Road , Evanston, IL 60208 map it
Audience: Faculty/Staff - Post Docs/Docs - Graduate Students
Contact:
Kisa Kowal
(847) 491-3974
Group: Department of Statistics and Data Science
Category: Academic
Due to the extremely cold temperatures predicted for Wednesday, January 30 we are cancelling this talk. It will be rescheduled for later date. Thank you.
Department of Statistics Winter 2019 Seminar Series
Talk Title: Score-Matching Representative Approach for Big Data Analysis with Generalized Linear Models
Speaker: Keren Li, Postdoctoral Fellow, NSF-Simons Center for Quantitative Biology, Northwestern University
Time: 11:00am
Abstract: We propose a fast and efficient strategy, called the representative approach, for big data analysis with linear models and generalized linear models. With a given partition of big dataset, this approach constructs a representative data point for each data block and fits the target model using the representative dataset. In terms of time complexity, it is as fast as the subsampling approaches in the literature. As for efficiency, its accuracy in estimating parameters is better than the divide-and-conquer method.
With comprehensive simulation studies and theoretical justifications, we recommend two representative approaches. For linear models or generalized linear models with a flat inverse link function and moderate coefficients of continuous variables, we recommend mean representatives (MR). For other cases, we recommend score-matching representatives (SMR).
As an illustrative application to the Airline on-time performance data, MR and SMR are as good as the full data estimate when available. Furthermore, the proposed representative strategy is ideal for analyzing massive data dispersed over a network of interconnected computers.
Location: Department of Statistics, Room B02, 2006 Sheridan Rd, Evanston 60208