Wei Fu

fuwth17 AT gmail DOT com

mypic.jpg

I’m currently a third-year PhD student at IIIS, Tsinghua University, studying computer science. I am very fortunate to be advised by Professor Yi Wu. Prior to this, I received my BEng degree from the Department of Electric Engineering in 2021.

My research interests lie in the intersection of reinforcement learning (RL) and distributed systems. Currently, I focus on developing efficient distributed systems for large-scale DRL/MARL/RLHF applications.

I’d describe myself more as a programmer than a researcher.

I enjoy coding, cooking, and music (Jay Chou, funk, and J-POP), but I’m not skilled at any of them.

News

Apr 29, 2024 I will be attending ICLR and ICRA 2024. See you in Vienna and Yokohama! 🫵

Selected Publications

  1. dpo.png
    Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
    Shusheng Xu , Wei Fu, Jiaxuan Gao , Wenjie Ye , Weilin Liu , Zhiyu Mei , Guangju Wang , Chao Yu , and Yi Wu
    Preprint, Apr 2024
  2. SRL
    srl.png
    SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
    Zhiyu Mei* , Wei Fu*, Guangju Wang , Huanchen Zhang , and Yi Wu
    ICLR. (*: Equal Contribution) , May 2024
  3. ar.png
    Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
    Wei Fu, Chao Yu , Zelai Xu , Jiaqi Yang , and Yi Wu
    ICML, Jul 2022
  4. RSPO
    smac.gif
    Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization
    Zihan Zhou* , Wei Fu*, Bingliang Zhang , and Yi Wu
    ICLR. (*: Equal Contribution) , Apr 2022