Communications and Signal Processing Seminar
Understanding Distribution Learning of Diffusion Models via Low-dimensional Modeling
This event is free and open to the publicAdd to Google Calendar
Abstract: Recent empirical studies have demonstrated that diffusion models can effectively learn the image distribution and generate new samples. Remarkably, these models can achieve this even with a small number of training samples despite a large image dimension, circumventing the curse of dimensionality. In this work, we provide theoretical insights into this phenomenon by leveraging key empirical observations: (i) the low intrinsic dimensionality of image data, (ii) a union of manifold structure of image data, and (iii) the low-rank property of the denoising autoencoder in trained diffusion models. These observations motivate us to assume the underlying data distribution of image data as a mixture of low-rank Gaussians and to parameterize the denoising autoencoder as a low-rank model according to the score function of the assumed distribution. With these setups, we rigorously show that optimizing the training loss of diffusion models is equivalent to solving the canonical subspace clustering problem over the training samples. Based on this equivalence, we further show that the minimal number of samples required to learn the underlying distribution scales linearly with the intrinsic dimensions under the above data and model assumptions. This insight sheds light on why diffusion models can break the curse of dimensionality and exhibit the phase transition in learning distributions. Moreover, we empirically establish a correspondence between the subspaces and the semantic representations of image data, facilitating image editing. We validate these results with corroborated experimental results on both simulated distributions and image datasets.
Bio sketch: Peng Wang is a postdoctoral research fellow at the Department of EECS at the University of Michigan, supervised by Professors Laura Balzano and Qing Qu. Before that, he earned his Ph.D. in Systems Engineering and Engineering Management from The Chinese University of Hong Kong. His research interest lies at the intersections of optimization, machine learning, and data science. Currently, he mainly focuses on studying the foundations of diffusion models and large language models through the lens of low-dimensional modeling.
*** The event will take place in a hybrid format. The location for in-person attendance will be room 3427 EECS. Attendance will also be available via Zoom.
Join Zoom Meeting: https://umich.zoom.us/j/93679028340
Meeting ID: 936 7902 8340
Passcode: XXX (Will be sent via email to attendees)
Zoom Passcode information is available upon request to Kristi Rieger ([email protected]).