PAIR Lab: PKU Alignment and Interaction Research Lab
PAIR Lab: PKU Alignment and Interaction Research Lab
Open-Source Projects
People
News
Publications
Resources
Contact
Representation Learning
TIMAR: Transition-Informed Representation for Sample-Efficient Multi-agent Reinforcement Learning
In MARL (Multi-Agent Reinforcement Learning), the trial-and-error learning paradigm based on multiple agents requires massive …
Mingxiao Feng
,
Yaodong Yang
,
Wengang Zhou
,
Houqiang Li
Cite
Adaptive Pessimism via Target Q-Value for Offline Reinforcement Learning
Offline reinforcement learning (RL) methods learn from datasets without further environment interaction, facing errors due to …
Jie Liu
,
Yinmin Zhang
,
Chuming Li
,
Yaodong Yang
,
Yu Liu
,
Wanli Ouyang
Cite
Cite
×