Search

PAIR Lab: PKU Alignment and Interaction Research Lab

PAIR Lab: PKU Alignment and Interaction Research Lab

Open-Source Projects
People
News
Publications
Resources
Contact

Gang Pan

Latest

Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization
Constrained Update Projection Approach to Safe Policy Optimization

© 2023-2024 Copyright by the PAIR Lab

Published with Hugo Blox Builder — the free, open source website builder that empowers creators.

Cite