Tong Xie
Los Angeles, CA
tongxie@ucla.edu
hi there! 👋
I am a first-year PhD student in Computer Science at UCLA, fortunate to be advised by Prof. Cho-Jui Hsieh.
I am broadly interested in post-training of large language models. Currently, I am working on LLM supervised fine-tuning (SFT), reinforcement learning (RL), and reward modeling to improve reasoning capabilities and encourage stronger generalization.
Feel free to connect with me and explore opportunities, collaborations, and exciting ventures together!
news
| Date | News |
|---|---|
| Dec 06, 2025 | Our new work When Distance Distracts: Representation Distance Bias in BT-Loss for Reward Models is now on arXiv! |
| Jun 03, 2024 | I am excited to intern with QSG, RBC’s buy-side quant group, for summer 2024. |
| Jun 01, 2024 | Our work Data Attribution for Diffusion Models: Timestep-induced Bias in Influence Estimation has been accepted at TMLR 2024! |
| Jun 26, 2023 | I am excited to be part of the Summer Undergraduate Research Program (SURP) for summer 2023. Check out our poster. |