Communications and Signal Processing Seminar

Learning from Noisy Labels without Knowing Noise Rates

Yang LiuAssistant Professor, Computer Science and EngineeringUC Santa Cruz

Abstract:  Learning with noisy labels is a prevalent challenge in machine learning: in supervised learning, the training labels are often solicited from human annotators, which encode human-level mistakes; in semi-supervised learning, the artificially supervised pseudo labels are immediately imperfect. The list goes on. Existing approaches with theoretical guarantees often require practitioners to specify a set of parameters controlling the severity of label noises in the problem. The specifications are either assumed to be given or estimated using additional approaches.

In this talk, I introduce peer loss functions, which enable learning from noisy labels and do not require a priori specification of the noise rates. Peer loss functions associate each training sample with a specific form of “peer” sample, which helps evaluate a classifier’s predictions jointly. We show that, under mild conditions, performing empirical risk minimization (ERM) with peer loss functions on the noisy dataset leads to the optimal or a near-optimal classifier as if performing ERM over the clean training data. Peer loss provides a way to simplify model development when facing potentially noisy training labels. I will also discuss extensions of peer loss and some emerging challenges concerning biased data.

Bio:  Yang Liu is currently an Assistant Professor of Computer Science and Engineering at UC Santa Cruz (2019 – present). He was previously a postdoctoral fellow at Harvard University (2016 – 2018). He obtained his Ph.D. degree from the University of Michigan, Ann Arbor in 2015, advised by Professor Mingyan Liu. He is interested in weakly supervised learning and algorithmic fairness. He is a recipient of the NSF CAREER Award and the NSF Fairness in AI award. He has been selected to participate in several high-profile projects, including DARPA SCORE and IARPA HFC. His recent works have won four best paper awards at relevant workshops.

***Event will take place in hybrid format. The location for in-person attendance will be room 1008 EECS.   Attendance will also be available via Zoom.

Join Zoom Meeting https:

Meeting ID: 914 1429 7851

Passcode: XXXXXX (Will be sent via e-mail to attendees)

Zoom Passcode information is also available upon request to Michele Feldkamp ([email protected]).

This seminar will be recorded and posted to the CSP Seminar website.

See full seminar by Professor Yang

Faculty Host

Qing QuAssistant Professor, Electrical Engineering and Computer ScienceUniversity of Michigan