Brief Introduction

Hello, I'm a third-year undergraduate at Tsinghua University, pursuing a degree in Computer Science. Currently, I am actively engaged in research with Prof. Peng Cui and Prof. Huan Zhang.

My primary research interest centers around Trust-Worthy Machine Learning. Specifically, I'm familiar with Out-of-Distribution (OOD) generalization, stable learning, domain generalization, subpopulation shift and data-centric AI.

To know me better, please check out my CV and research statement.

Publications (“*” indicates equal contribution)

1. Domain-wise Data Acquisition to Improve Performance under Distribution Shift.
  Yue He*, Dongbai Li*, Pengfei Tian, Han Yu, Jiashuo Liu, Hao Zou, Peng Cui. ICML 2024

2. Sample Weight Averaging for Stable Prediction.
  Han Yu, Yue He, Renzhe Xu, Dongbai Li, Jiayin Zhang, Wenchao Zou, Peng Cui. Under review.

3. SATE: A Two-Stage Approach for Performance Prediction in Subpopulation Shift Scenarios.
Dongbai Li, Huan Zhang. Under review.

4. The Emperor’s New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination.
  Yifan Sun*, Han Wang*, Dongbai Li*, Gang Wang, Huan Zhang. Under review.

Skills

Programming Languages: C++, Python, Verilog

Machine Learning / Deep Learning: scikit-learn, PyTorch, Hugging Face

Selected Courses

Fundamentals of Programming A

Programing and Training A

Introduction to Computer Systems A+

Probability and Statistics A

Software Engineering A