Wei Fu

fuwth17 AT gmail DOT com

I’m currently a third-year PhD student at IIIS, Tsinghua University, studying computer science. I am very fortunate to be advised by Professor Yi Wu. Prior to this, I received my BEng degree from the Department of Electric Engineering in 2021.

My research interests lie at the intersection of reinforcement learning (RL) and distributed systems. Currently, I focus on developing efficient distributed systems for large-scale DRL/MARL/RLHF applications.

I’d describe myself more as a programmer than a researcher.

I enjoy coding, cooking, and music (Jay Chou, funk, jazz, and J-pop).

News

Jun 23, 2024 Introducing ReaLHF, a highly efficient system for RLHF training of LLMs. It can achieve up to a 10x training speedup over existing open-source systems! Check out our open-source code and the documentation to get started with ReaL quickly! 🚀
Jun 02, 2024 Our ICML 2024 paper has been selected as an oral presentation! Congrats to Shusheng and see you in Vienna again!
May 18, 2024 Our bipedal motion paper has been short-listed as a finalist for the Best Demo Award at ICRA Expo 2024! Congrats to Yunfei!
Apr 29, 2024 I will be attending ICLR and ICRA 2024. See you in Vienna and Yokohama! 🫵

Selected Publications

  1. ReaLHF
    ReaLHF: Optimized RLHF Training for Large Language Models through Parameter Reallocation
    Zhiyu Mei*, Wei Fu*, Kaiwei Li, Guangju Wang, Huanchen Zhang, and Yi Wu
    Preprint (*: Equal Contribution), Jul 2024
  2. Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
    Shusheng Xu, Wei Fu, Jiaxuan Gao, Wenjie Ye, Weilin Liu, Zhiyu Mei, Guangju Wang, Chao Yu, and Yi Wu
    ICML (Oral), Jul 2024
  3. SRL
    SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
    Zhiyu Mei*, Wei Fu*, Guangju Wang, Huanchen Zhang, and Yi Wu
    ICLR (*: Equal Contribution), May 2024
  4. Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
    Wei Fu, Chao Yu, Zelai Xu, Jiaqi Yang, and Yi Wu
    ICML, Jul 2022