PAIR Lab: PKU Alignment and Interaction Research Lab
PAIR Lab: PKU Alignment and Interaction Research Lab
Open-Source Projects
People
News
Publications
Resources
Contact
1
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning
Efficient exploration is important for reinforcement learners to achieve high rewards. In multi-agent systems, coordinated exploration …
David Henry Mguni
,
Taher Jafferjee
,
Jianhong Wang
,
Oliver Slumbers
,
Nicolas Perez Nieves
,
Feifei Tong
,
Li Yang
,
Jiangcheng Zhu
,
Yaodong Yang
,
Jun Wang
PDF
Cite
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
Trust region methods rigorously enabled reinforcement learning (RL) agents to learn monotonically improving policies, leading to …
Jakub Grudzien Kuba
,
Ruiqing Chen
,
Muning Wen
,
Ying Wen
,
Fanglei Sun
,
Jun Wang
,
Yaodong Yang
PDF
Cite
Settling the Variance of Multi-Agent Policy Gradients
Policy gradient (PG) methods are popular reinforcement learning (RL) methods where a baseline is often applied to reduce the variance …
Jakub Grudzien Kuba
,
Muning Wen
,
Linghui Meng
,
Shangding Gu
,
Haifeng Zhang
,
David Henry Mguni
,
Jun Wang
,
Yaodong Yang
PDF
Cite
Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games
Measuring and promoting policy diversity is critical for solving games with strong non-transitive dynamics where strategic cycles …
Xiangyu Liu
,
Hangtian Jia
,
Ying Wen
,
Yujing Hu
,
Yingfeng Chen
,
Changjie Fan
,
Zhipeng Hu
,
Yaodong Yang
PDF
Cite
«
Cite
×