Postdocs on diversity and inclusion in machine learning competitions at IT University of Copenhagen
The PURRlab (Pattern Recognition Revisited lab) at the IT University of Copenhagen invites motivated individuals to apply for postdoctoral positions starting in June 2026 or soon thereafter.
The project is funded by the Novo Nordisk Foundation Data Science Ascending Investigator grant titled "CHEETAH: CHallenges of Evaluating Teams and Algorithms" and is led by Full Professor Veronika Cheplygina.
Project description
Machine learning (ML) competitions are often touted as drivers of algorithm development in healthcare but face limitations in real-world applications. An example competition is detecting lung cancer in chest images, where the team correctly identifying the most images with cancer wins the competition. Such competitions attract many international teams with monetary or prestigious incentives. While competitions are said to spur innovation, they often result in too similar algorithms that only excel on a specific accuracy metric, but are not robust and fail to generalize to diverse, real-world data.
I posit that a single performance metric such as accuracy is insufficient to capture algorithm robustness, for example how the algorithm performs on rare patient cases. Having a single performance metric also leads to too similar algorithms which do not bring added value despite their high training costs and carbon footprint. Furthermore, as research on women and other underrepresented groups in computer science shows, competition may deter them from entering or staying in the field.
I therefore propose to design competitions with multiple metrics, both in what the metric measures (e.g. accuracy or sensitivity) and which subgroups of patients this is measured on. We will first develop novel methods to evaluate and increase the diversity of the evaluation data (RQ1). We will then design how to evaluate similarity of algorithms, and develop methods to combine them, such that robustness can be increased without the disproportionate carbon footprint (RQ2). Finally, we will study competitions in education and at conferences, to investigate how the novel design affects underrepresented groups in data science (RQ3).
The advertised postdoc positions will be focusing on RQ3.
More information about the project and application process: https://candidate.hr-manager.net/ApplicationInit.aspx?cid=119&ProjectId=181837&DepartmentId=3439&MediaId=5