A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.
deep-learning
ensembles
best-of-n
large-language-models
reinforcement-learning-from-human-feedback
reward-models
-
Updated
Mar 9, 2024 - Python