Cost-efficient learning via weak annotators

Hao Qiu (Ph.D. Student)

In this project, we plan to design and analyze new algorithms for sequentially classifying a stream of data points based on a set of costly, noisy, and potentially malicious annotators. Assuming the algorithm obtains noisy labels by adaptively querying selected annotators, we are interested in studying trade-offs between the classification accuracy and the money spent to obtain the labels. We consider scenarios where annotators have fixed costs as opposed to accepting variable payments, where the annotators' accuracy may depend on the features of the data points, and where some annotators may act strategically in order to maximize their profit while minimizing their annotating effort. Our goal is to prove bounds on the learning algorithm's classification error that depend on the (unknown) functions relating data point features and received payments to annotation accuracy.

Primary Host: Nicolò Cesa-Bianchi (Università degli Studi di Milano)
Exchange Host: Wouter M. Koolen (Centrum Wiskunde & Informatica)
PhD Duration: 01 October 2022 - 30 September 2025
Exchange Duration: 01 September 2023 - 30 November 2023 01 September 2024 - 30 November 2024