PAIR Lab: PKU Alignment and Interaction Research Lab
PAIR Lab: PKU Alignment and Interaction Research Lab
Open-Source Projects
People
News
Publications
Resources
Contact
1
MANSA: Learning Fast and Slow in Multi-Agent Systems
In multi-agent reinforcement learning (MARL), independent learning (IL) often shows remarkable performance and easily scales with the …
David Mguni
,
Haojun Chen
,
Taher Jafferjee
,
Jianhong Wang
,
Long Fei
,
Xidong Feng
,
Stephen McAleer
,
Feifei Tong
,
Jun Wang
,
Yaodong Yang
PDF
Cite
Learning to Shape Rewards using a Game of Two Partners
Reward shaping (RS) is a powerful method in reinforcement learning (RL) for overcoming the problem of sparse or uninformative rewards. …
David Mguni
,
Taher Jafferjee
,
Jianhong Wang
,
Nicolas Perez Nieves
,
Tianpei Yang
,
Matthew Taylor
,
Wenbin Song
,
Feifei Tong
,
Hui Chen
,
Jiangcheng Zhu
,
Jun Wang
,
Yaodong Yang
PDF
Cite
Quality-Similar Diversity via Population Based Reinforcement Learning
Diversity is a growing research topic in Reinforcement Learning (RL). Previous research on diversity has mainly focused on promoting …
Shuang Wu
,
Jian Yao
,
Haobo Fu
,
Ye Tian
,
Chao Qian
,
Yaodong Yang
,
QIANG FU
,
Yang Wei
PDF
Cite
A Game-Theoretic Approach to Multi-agent Trust Region Optimization
Trust region methods are widely applied in single-agent reinforcement learning problems due to their monotonic performance-improvement …
Ying Wen
,
Hui Chen
,
Yaodong Yang
,
Minne Li
,
Zheng Tian
,
Xu Chen
,
Jun Wang
PDF
Cite
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency
Multi-agent reinforcement learning (MARL) suffers from the non-stationarity problem, which is the ever-changing targets at every …
Chuming Li
,
Jie Liu
,
Yinmin Zhang
,
Yuhong Wei
,
Yazhe Niu
,
Yaodong Yang
,
Yu Liu
,
Wanli Ouyang
PDF
Cite
Contextual Transformer for Offline Meta Reinforcement Learning
The pretrain-finetuning paradigm in large-scale sequence models has made significant progress in natural language processing and …
Runji Lin
,
Ye Li
,
Xidong Feng
,
Zhaowei Zhang
,
Xian Hong Wu Fung
,
Haifeng Zhang
,
Jun Wang
,
Yali Du
,
Yaodong Yang
PDF
Cite
TorchOpt: An Efficient Library for Differentiable Optimization
Recent years have witnessed the booming of various differentiable optimization algorithms. These algorithms exhibit different execution …
Jie Ren
,
Xidong Feng
,
Bo Liu
,
Xuehai Pan
,
Yao Fu
,
Luo Mai
,
Yaodong Yang
PDF
Cite
Scalable Model-based Policy Optimization for Decentralized Networked Systems
Reinforcement learning algorithms require a large amount of samples; this often limits their real-world applications on even simple …
Yali Du
,
Chengdong Ma
,
Yuchen Liu
,
Runji Lin
,
Hao Dong
,
Jun Wang
,
Yaodong Yang
PDF
Cite
GenDexGrasp: Generalizable Dexterous Grasping
Generating dexterous grasping has been a long-standing and challenging robotic task. Despite recent progress, existing methods …
Puhao Li
,
Tengyu Liu
,
Yuyang Li
,
Yiran Geng
,
Yixin Zhu
,
Yaodong Yang
,
Siyuan Huang
PDF
Cite
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
N/A
Bo Liu
,
Xidong Feng
,
Jie Ren
,
Luo Mai
,
Rui Zhu
,
Haifeng Zhang
,
Jun Wang
,
Yaodong Yang
PDF
Cite
«
»
Cite
×