PAIR Lab: PKU Alignment and Interaction Research Lab
Gang Pan
Latest
Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization
Constrained Update Projection Approach to Safe Policy Optimization