PAIR Lab: PKU Alignment and Interaction Research Lab
PAIR Lab: PKU Alignment and Interaction Research Lab
Open-Source Projects
People
News
Publications
Resources
Contact
Machine Learning
Adaptive Pessimism via Target Q-Value for Offline Reinforcement Learning
Offline reinforcement learning (RL) methods learn from datasets without further environment interaction, facing errors due to …
Jie Liu
,
Yinmin Zhang
,
Chuming Li
,
Yaodong Yang
,
Yu Liu
,
Wanli Ouyang
Cite
Cite
×