PAIR Lab: PKU Alignment and Interaction Research Lab
Jiayi Zhou
Latest
Sequence to Sequence Reward Modeling: Improving RLHF by Language Feedback
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research
Language Models Resist Alignment: Evidence from Data Compression
Safety Gymnasium: A Unified Safe Reinforcement Learning Benchmark